Anshuman Aggarwal
2017-02-07 03:39:06 UTC
Hi,
the email notifications generated by smartmon tools do not work in a
very smart manner, hence limitiing their usefulness. Here is an
example:
I have a drive that is supposedly failing but will likely last me 6-8
months more (as part of a raid 6 cluster so I don't feel compelled to
replace right away, it also allowed a full reconstruction of the 2 TB
cluster without any issues) It gives the following error notifications
everyday for the last 3 months via emails.
Device: /dev/sdg [SAT], 11 Currently unreadable (pending) sectors
Device: /dev/sdg [SAT], 11 Offline uncorrectable sectors
However the number of reallocated and unreadable sectors is NOT going
up. It stays the same at 11 and 5 and has been for the last 3-4
months.
What I *would like* to see is a notification only when the number
*changes* from the last time the error failed so that I can react to
that change that the drive is 'worsening'.
Right now I get the same information repeated every day, which I have
to ignore on my own and will probably end up ignoring it when the
drive actually fails.
Anybody else feel this issue? Have I missed something in the settings etc?
Cheers,
Anshuman
the email notifications generated by smartmon tools do not work in a
very smart manner, hence limitiing their usefulness. Here is an
example:
I have a drive that is supposedly failing but will likely last me 6-8
months more (as part of a raid 6 cluster so I don't feel compelled to
replace right away, it also allowed a full reconstruction of the 2 TB
cluster without any issues) It gives the following error notifications
everyday for the last 3 months via emails.
Device: /dev/sdg [SAT], 11 Currently unreadable (pending) sectors
Device: /dev/sdg [SAT], 11 Offline uncorrectable sectors
However the number of reallocated and unreadable sectors is NOT going
up. It stays the same at 11 and 5 and has been for the last 3-4
months.
What I *would like* to see is a notification only when the number
*changes* from the last time the error failed so that I can react to
that change that the drive is 'worsening'.
Right now I get the same information repeated every day, which I have
to ignore on my own and will probably end up ignoring it when the
drive actually fails.
Anybody else feel this issue? Have I missed something in the settings etc?
Cheers,
Anshuman