Discussion:
[smartmontools-support] Captive Mode Tests Interrupted (Host Reset) on ST3500630A
M P
2008-03-24 18:20:31 UTC
Permalink
My disk is a Seagate ST3500630A. I noticed that it
was very slow to mount on Saturday (about two
minutes). Though it could well have been in power
saving mode (the machine was on for several days
without mounting that disk), I decided to do a smart
scan anyway. As such, I ran:

smartctl -d ata -C -t short /dev/sdc

Notably, I ran it in captive mode. The terminal then
froze (I was able to switch to other tabs and other
windows, though - it wouldn't respond to Ctrl-C). For
about three minutes, it seemed to be trying to send
commands to the drive without success. In fact,
several times the drive seemed to be STRUGGLING -
making an odd buzzing noise for a second or two,
though this MIGHT just be it running the test. I got
several errors, stating that the drive had had a short
test - which was interrupted every time with a "host
reset" at 30% remaining. I ran it OUT of captive mode
and the test returned fine. I had similar results
with the long test - it failed with 90% remaining with
captive mode on, and without captive mode it returned
fine.

Long story short - just what's going on here? Is my
drive in trouble? Is this a fluke? What should I do
at this point?

Thank you for any help you can give me or anywhere
else you can point me to.




INFORMATION:

Controller driver: pata_pdc2027x (unsure of the
model), and if this is causing a problem it would be
the first time - I ran similar tests on the other
drive connected to it, with no problems.

Firmware version is 3.AAF.

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result:
PASSED

General SMART Values:
Offline data collection status: (0x82) Offline data
collection activity
was completed
without error.
Auto Offline
Data Collection: Enabled.
Self-test execution status: ( 0) The previous
self-test routine completed
without error
or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 430) seconds.
Offline data collection
capabilities: (0x5b) SMART execute
Offline immediate.
Auto Offline
data collection on/off support.
Suspend
Offline collection upon new
command.
Offline
surface scan supported.
Self-test
supported.
No Conveyance
Self-test supported.
Selective
Self-test supported.
SMART capabilities: (0x0003) Saves SMART
data before entering
power-saving
mode.
Supports SMART
auto save timer.
Error logging capability: (0x01) Error logging
supported.
General
Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 163) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST
THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 109 099 006
Pre-fail Always - 160693144
3 Spin_Up_Time 0x0003 092 092 000
Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020
Old_age Always - 14
5 Reallocated_Sector_Ct 0x0033 100 100 036
Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 076 060 030
Pre-fail Always - 48042050
9 Power_On_Hours 0x0032 100 100 000
Old_age Always - 729
10 Spin_Retry_Count 0x0013 100 100 097
Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020
Old_age Always - 14
187 Reported_Uncorrect 0x0032 100 100 000
Old_age Always - 0
189 High_Fly_Writes 0x003a 100 100 000
Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 061 048 045
Old_age Always - 39 (Lifetime Min/Max
24/41)
194 Temperature_Celsius 0x0022 039 052 000
Old_age Always - 39 (0 24 0 0)
195 Hardware_ECC_Recovered 0x001a 063 057 000
Old_age Always - 140584844
197 Current_Pending_Sector 0x0012 100 100 000
Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000
Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000
Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0000 100 253 000
Old_age Offline - 0
202 TA_Increase_Count 0x0032 100 253 000
Old_age Always - 0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num Test_Description Status
Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error
00% 729 -
# 2 Short offline Completed without error
00% 726 -
# 3 Extended captive Interrupted (host reset)
90% 726 -
# 4 Extended captive Interrupted (host reset)
90% 726 -
# 5 Extended captive Interrupted (host reset)
90% 726 -
# 6 Extended captive Interrupted (host reset)
90% 726 -
# 7 Extended captive Interrupted (host reset)
90% 726 -
# 8 Extended captive Interrupted (host reset)
90% 726 -
# 9 Short captive Interrupted (host reset)
30% 726 -
#10 Short captive Interrupted (host reset)
30% 726 -
#11 Short captive Interrupted (host reset)
30% 726 -
#12 Short captive Interrupted (host reset)
30% 726 -
#13 Short captive Interrupted (host reset)
30% 726 -
#14 Short captive Interrupted (host reset)
30% 726 -
#15 Short captive Interrupted (host reset)
30% 726 -
#16 Short captive Interrupted (host reset)
30% 726 -
#17 Short captive Interrupted (host reset)
30% 726 -
#18 Short captive Interrupted (host reset)
30% 726 -
#19 Short captive Interrupted (host reset)
30% 726 -
#20 Short captive Interrupted (host reset)
30% 726 -
#21 Short offline Completed without error
00% 726 -

SMART Selective self-test log data structure revision
number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan
remainder of disk.
If Selective self-test is pending on power-up, resume
after 0 minute delay.




____________________________________________________________________________________
Be a better friend, newshound, and
know-it-all with Yahoo! Mobile. Try it now. http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ
Bruce Allen
2008-03-25 03:06:45 UTC
Permalink
I've never run tests in captive mode except on devices with no mounted
file systems. In your situation, I would only run self-tests in
non-captive mode. After the non-captive short test, if the disk self-test
log shows no errors, I would then run a non-captive long test.

Cheers,
Bruce
Post by M P
My disk is a Seagate ST3500630A. I noticed that it
was very slow to mount on Saturday (about two
minutes). Though it could well have been in power
saving mode (the machine was on for several days
without mounting that disk), I decided to do a smart
smartctl -d ata -C -t short /dev/sdc
Notably, I ran it in captive mode. The terminal then
froze (I was able to switch to other tabs and other
windows, though - it wouldn't respond to Ctrl-C). For
about three minutes, it seemed to be trying to send
commands to the drive without success. In fact,
several times the drive seemed to be STRUGGLING -
making an odd buzzing noise for a second or two,
though this MIGHT just be it running the test. I got
several errors, stating that the drive had had a short
test - which was interrupted every time with a "host
reset" at 30% remaining. I ran it OUT of captive mode
and the test returned fine. I had similar results
with the long test - it failed with 90% remaining with
captive mode on, and without captive mode it returned
fine.
Long story short - just what's going on here? Is my
drive in trouble? Is this a fluke? What should I do
at this point?
Thank you for any help you can give me or anywhere
else you can point me to.
Controller driver: pata_pdc2027x (unsure of the
model), and if this is causing a problem it would be
the first time - I ran similar tests on the other
drive connected to it, with no problems.
Firmware version is 3.AAF.
=== START OF READ SMART DATA SECTION ===
PASSED
Offline data collection status: (0x82) Offline data
collection activity
was completed
without error.
Auto Offline
Data Collection: Enabled.
Self-test execution status: ( 0) The previous
self-test routine completed
without error
or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 430) seconds.
Offline data collection
capabilities: (0x5b) SMART execute
Offline immediate.
Auto Offline
data collection on/off support.
Suspend
Offline collection upon new
command.
Offline
surface scan supported.
Self-test
supported.
No Conveyance
Self-test supported.
Selective
Self-test supported.
SMART capabilities: (0x0003) Saves SMART
data before entering
power-saving
mode.
Supports SMART
auto save timer.
Error logging capability: (0x01) Error logging
supported.
General
Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 163) minutes.
SMART Attributes Data Structure revision number: 10
ID# ATTRIBUTE_NAME FLAG VALUE WORST
THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 109 099 006
Pre-fail Always - 160693144
3 Spin_Up_Time 0x0003 092 092 000
Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020
Old_age Always - 14
5 Reallocated_Sector_Ct 0x0033 100 100 036
Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 076 060 030
Pre-fail Always - 48042050
9 Power_On_Hours 0x0032 100 100 000
Old_age Always - 729
10 Spin_Retry_Count 0x0013 100 100 097
Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020
Old_age Always - 14
187 Reported_Uncorrect 0x0032 100 100 000
Old_age Always - 0
189 High_Fly_Writes 0x003a 100 100 000
Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 061 048 045
Old_age Always - 39 (Lifetime Min/Max
24/41)
194 Temperature_Celsius 0x0022 039 052 000
Old_age Always - 39 (0 24 0 0)
195 Hardware_ECC_Recovered 0x001a 063 057 000
Old_age Always - 140584844
197 Current_Pending_Sector 0x0012 100 100 000
Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000
Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000
Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0000 100 253 000
Old_age Offline - 0
202 TA_Increase_Count 0x0032 100 253 000
Old_age Always - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status
Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error
00% 729 -
# 2 Short offline Completed without error
00% 726 -
# 3 Extended captive Interrupted (host reset)
90% 726 -
# 4 Extended captive Interrupted (host reset)
90% 726 -
# 5 Extended captive Interrupted (host reset)
90% 726 -
# 6 Extended captive Interrupted (host reset)
90% 726 -
# 7 Extended captive Interrupted (host reset)
90% 726 -
# 8 Extended captive Interrupted (host reset)
90% 726 -
# 9 Short captive Interrupted (host reset)
30% 726 -
#10 Short captive Interrupted (host reset)
30% 726 -
#11 Short captive Interrupted (host reset)
30% 726 -
#12 Short captive Interrupted (host reset)
30% 726 -
#13 Short captive Interrupted (host reset)
30% 726 -
#14 Short captive Interrupted (host reset)
30% 726 -
#15 Short captive Interrupted (host reset)
30% 726 -
#16 Short captive Interrupted (host reset)
30% 726 -
#17 Short captive Interrupted (host reset)
30% 726 -
#18 Short captive Interrupted (host reset)
30% 726 -
#19 Short captive Interrupted (host reset)
30% 726 -
#20 Short captive Interrupted (host reset)
30% 726 -
#21 Short offline Completed without error
00% 726 -
SMART Selective self-test log data structure revision
number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
After scanning selected spans, do NOT read-scan
remainder of disk.
If Selective self-test is pending on power-up, resume
after 0 minute delay.
____________________________________________________________________________________
Be a better friend, newshound, and
know-it-all with Yahoo! Mobile. Try it now. http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ
-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
Smartmontools-support mailing list
https://lists.sourceforge.net/lists/listinfo/smartmontools-support
M P
2008-03-28 19:32:50 UTC
Permalink
Bruce -

First, I'd like to thank you for your prompt reply.

Second, I had the drive entirely unmounted. I should
have clarified this. As such, the captive test was
failing with repeated host resets with the drive
completely unmounted. To verify this I repeated the
test with the drive in captive mode and all partitions
unmounted and the results were the same - several
attempts, all failing at 30% remaining for the short
test and 10% for the long test.

I repeated the tests while not captive and received a
clean bill of health.

Does this fact make any difference in your advice?

Thanks for any help you can give.

-MP
Post by Bruce Allen
I've never run tests in captive mode except on
devices with no mounted
file systems. In your situation, I would only run
self-tests in
non-captive mode. After the non-captive short test,
if the disk self-test
log shows no errors, I would then run a non-captive
long test.
Cheers,
Bruce
Post by M P
My disk is a Seagate ST3500630A. I noticed that
it
Post by M P
was very slow to mount on Saturday (about two
minutes). Though it could well have been in power
saving mode (the machine was on for several days
without mounting that disk), I decided to do a
smart
Post by M P
smartctl -d ata -C -t short /dev/sdc
Notably, I ran it in captive mode. The terminal
then
Post by M P
froze (I was able to switch to other tabs and
other
Post by M P
windows, though - it wouldn't respond to Ctrl-C).
For
Post by M P
about three minutes, it seemed to be trying to
send
Post by M P
commands to the drive without success. In fact,
several times the drive seemed to be STRUGGLING -
making an odd buzzing noise for a second or two,
though this MIGHT just be it running the test. I
got
Post by M P
several errors, stating that the drive had had a
short
Post by M P
test - which was interrupted every time with a
"host
Post by M P
reset" at 30% remaining. I ran it OUT of captive
mode
Post by M P
and the test returned fine. I had similar results
with the long test - it failed with 90% remaining
with
Post by M P
captive mode on, and without captive mode it
returned
Post by M P
fine.
Long story short - just what's going on here? Is
my
Post by M P
drive in trouble? Is this a fluke? What should I
do
Post by M P
at this point?
Thank you for any help you can give me or anywhere
else you can point me to.
Controller driver: pata_pdc2027x (unsure of the
model), and if this is causing a problem it would
be
Post by M P
the first time - I ran similar tests on the other
drive connected to it, with no problems.
Firmware version is 3.AAF.
=== START OF READ SMART DATA SECTION ===
PASSED
Offline data collection status: (0x82) Offline
data
Post by M P
collection activity
was
completed
Post by M P
without error.
Auto
Offline
Post by M P
Data Collection: Enabled.
Self-test execution status: ( 0) The
previous
Post by M P
self-test routine completed
without
error
Post by M P
or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 430) seconds.
Offline data collection
capabilities: (0x5b) SMART
execute
Post by M P
Offline immediate.
Auto
Offline
Post by M P
data collection on/off support.
Suspend
Offline collection upon new
command.
Offline
surface scan supported.
Self-test
supported.
No
Conveyance
Post by M P
Self-test supported.
Selective
Self-test supported.
SMART capabilities: (0x0003) Saves
SMART
Post by M P
data before entering
power-saving
Post by M P
mode.
Supports
SMART
Post by M P
auto save timer.
Error logging capability: (0x01) Error
logging
Post by M P
supported.
General
Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 163) minutes.
10
Post by M P
ID# ATTRIBUTE_NAME FLAG VALUE WORST
THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 109 099
006
Post by M P
Pre-fail Always - 160693144
3 Spin_Up_Time 0x0003 092 092
000
Post by M P
Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100
020
Post by M P
Old_age Always - 14
5 Reallocated_Sector_Ct 0x0033 100 100
036
Post by M P
Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 076 060
030
Post by M P
Pre-fail Always - 48042050
9 Power_On_Hours 0x0032 100 100
000
Post by M P
Old_age Always - 729
10 Spin_Retry_Count 0x0013 100 100
097
Post by M P
Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100
020
Post by M P
Old_age Always - 14
187 Reported_Uncorrect 0x0032 100 100
000
Post by M P
Old_age Always - 0
189 High_Fly_Writes 0x003a 100 100
000
Post by M P
Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 061 048
045
Post by M P
Old_age Always - 39 (Lifetime
Min/Max
Post by M P
24/41)
194 Temperature_Celsius 0x0022 039 052
000
Post by M P
Old_age Always - 39 (0 24 0 0)
195 Hardware_ECC_Recovered 0x001a 063 057
000
Post by M P
Old_age Always - 140584844
197 Current_Pending_Sector 0x0012 100 100
000
Post by M P
Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100
000
Post by M P
Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200
000
Post by M P
Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0000 100 253
000
Post by M P
Old_age Offline - 0
202 TA_Increase_Count 0x0032 100 253
000
Post by M P
Old_age Always - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status
=== message truncated ===




____________________________________________________________________________________
Looking for last minute shopping deals?
Find them fast with Yahoo! Search. http://tools.search.yahoo.com/newsearch/category.php?category=shopping
Bruce Allen
2008-04-08 10:47:16 UTC
Permalink
I will do some experiments with self-tests run in Captive mode.

Smartmontools Mailing list: does anyone out there run self-tests on
unmounted file systems in captive mode on Linux systems with libata?
Does it work?

Cheers,
Bruce
Post by M P
Bruce -
First, I'd like to thank you for your prompt reply.
Second, I had the drive entirely unmounted. I should
have clarified this. As such, the captive test was
failing with repeated host resets with the drive
completely unmounted. To verify this I repeated the
test with the drive in captive mode and all partitions
unmounted and the results were the same - several
attempts, all failing at 30% remaining for the short
test and 10% for the long test.
I repeated the tests while not captive and received a
clean bill of health.
Does this fact make any difference in your advice?
Thanks for any help you can give.
-MP
Post by Bruce Allen
I've never run tests in captive mode except on
devices with no mounted
file systems. In your situation, I would only run
self-tests in
non-captive mode. After the non-captive short test,
if the disk self-test
log shows no errors, I would then run a non-captive
long test.
Cheers,
Bruce
Post by M P
My disk is a Seagate ST3500630A. I noticed that
it
Post by M P
was very slow to mount on Saturday (about two
minutes). Though it could well have been in power
saving mode (the machine was on for several days
without mounting that disk), I decided to do a
smart
Post by M P
smartctl -d ata -C -t short /dev/sdc
Notably, I ran it in captive mode. The terminal
then
Post by M P
froze (I was able to switch to other tabs and
other
Post by M P
windows, though - it wouldn't respond to Ctrl-C).
For
Post by M P
about three minutes, it seemed to be trying to
send
Post by M P
commands to the drive without success. In fact,
several times the drive seemed to be STRUGGLING -
making an odd buzzing noise for a second or two,
though this MIGHT just be it running the test. I
got
Post by M P
several errors, stating that the drive had had a
short
Post by M P
test - which was interrupted every time with a
"host
Post by M P
reset" at 30% remaining. I ran it OUT of captive
mode
Post by M P
and the test returned fine. I had similar results
with the long test - it failed with 90% remaining
with
Post by M P
captive mode on, and without captive mode it
returned
Post by M P
fine.
Long story short - just what's going on here? Is
my
Post by M P
drive in trouble? Is this a fluke? What should I
do
Post by M P
at this point?
Thank you for any help you can give me or anywhere
else you can point me to.
Controller driver: pata_pdc2027x (unsure of the
model), and if this is causing a problem it would
be
Post by M P
the first time - I ran similar tests on the other
drive connected to it, with no problems.
Firmware version is 3.AAF.
=== START OF READ SMART DATA SECTION ===
PASSED
Offline data collection status: (0x82) Offline
data
Post by M P
collection activity
was
completed
Post by M P
without error.
Auto
Offline
Post by M P
Data Collection: Enabled.
Self-test execution status: ( 0) The
previous
Post by M P
self-test routine completed
without
error
Post by M P
or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 430) seconds.
Offline data collection
capabilities: (0x5b) SMART
execute
Post by M P
Offline immediate.
Auto
Offline
Post by M P
data collection on/off support.
Suspend
Offline collection upon new
command.
Offline
surface scan supported.
Self-test
supported.
No
Conveyance
Post by M P
Self-test supported.
Selective
Self-test supported.
SMART capabilities: (0x0003) Saves
SMART
Post by M P
data before entering
power-saving
Post by M P
mode.
Supports
SMART
Post by M P
auto save timer.
Error logging capability: (0x01) Error
logging
Post by M P
supported.
General
Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 163) minutes.
10
Post by M P
ID# ATTRIBUTE_NAME FLAG VALUE WORST
THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 109 099
006
Post by M P
Pre-fail Always - 160693144
3 Spin_Up_Time 0x0003 092 092
000
Post by M P
Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100
020
Post by M P
Old_age Always - 14
5 Reallocated_Sector_Ct 0x0033 100 100
036
Post by M P
Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 076 060
030
Post by M P
Pre-fail Always - 48042050
9 Power_On_Hours 0x0032 100 100
000
Post by M P
Old_age Always - 729
10 Spin_Retry_Count 0x0013 100 100
097
Post by M P
Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100
020
Post by M P
Old_age Always - 14
187 Reported_Uncorrect 0x0032 100 100
000
Post by M P
Old_age Always - 0
189 High_Fly_Writes 0x003a 100 100
000
Post by M P
Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 061 048
045
Post by M P
Old_age Always - 39 (Lifetime
Min/Max
Post by M P
24/41)
194 Temperature_Celsius 0x0022 039 052
000
Post by M P
Old_age Always - 39 (0 24 0 0)
195 Hardware_ECC_Recovered 0x001a 063 057
000
Post by M P
Old_age Always - 140584844
197 Current_Pending_Sector 0x0012 100 100
000
Post by M P
Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100
000
Post by M P
Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200
000
Post by M P
Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0000 100 253
000
Post by M P
Old_age Offline - 0
202 TA_Increase_Count 0x0032 100 253
000
Post by M P
Old_age Always - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status
=== message truncated ===
____________________________________________________________________________________
Looking for last minute shopping deals?
Find them fast with Yahoo! Search. http://tools.search.yahoo.com/newsearch/category.php?category=shopping
-------------------------------------------------------------------------
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
_______________________________________________
Smartmontools-support mailing list
https://lists.sourceforge.net/lists/listinfo/smartmontools-support
Mario 'BitKoenig' Holbe
2008-04-08 13:30:33 UTC
Permalink
Post by Bruce Allen
Smartmontools Mailing list: does anyone out there run self-tests on
unmounted file systems in captive mode on Linux systems with libata?
Not with libata, but a while ago with 2.4 I did.
Post by Bruce Allen
Does it work?
Well, I have no idea if it works with libata. However, the most common
mistake I can imagine (just because it was mine as well :)) is to forget
that there is a bunch of tools that access drives more or less
periodically and thus interrupt captive tests even though it has no
mounted partitions, like smartd, hddtemp or - on newer machines -
probably even hald.
I have no idea how many more they will become once you run one of those
sophisticated graphical interfaces ;)

So I'd primarily suggest to run captive tests in single-user or
maintenance mode.


regards
Mario
--
I've never been certain whether the moral of the Icarus story should
only be, as is generally accepted, "Don't try to fly too high," or
whether it might also be thought of as, "Forget the wax and feathers
and do a better job on the wings." -- Stanley Kubrick
Tomáš Smetana
2008-04-09 06:17:56 UTC
Permalink
On Tue, 8 Apr 2008 05:47:16 -0500 (CDT)
Post by Bruce Allen
I will do some experiments with self-tests run in Captive mode.
Smartmontools Mailing list: does anyone out there run self-tests on
unmounted file systems in captive mode on Linux systems with libata?
Does it work?
I tried the short test (with piix_sata over libata; Samsung HD160JJ/P on
Intel 82801G (ICH7)) and it looked to be working fine:


[***@localhost ~]# smartctl -d ata -C -t short /dev/sda
smartctl version 5.38 [i386-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Short self-test routine immediately in captive
mode". Drive command "Execute SMART Short self-test routine immediately in
captive mode" successful. Testing has begun.
Please wait 1 minutes for test to complete.
Test will complete after Tue Apr 8 13:03:22 2008


[***@localhost ~]#


Regards.
--
Tomáš Smetana
Base OS Software Engineer, Red Hat
RH IRC: #brno #devel #base-os; Freenode IRC: #fedora-devel
Bruce Allen
2008-04-09 06:28:14 UTC
Permalink
Ok -- no complaints from the kernel, apparently. Just for fun you might
retry this with -d sat to use the SAT pass-through.
Post by Tomáš Smetana
On Tue, 8 Apr 2008 05:47:16 -0500 (CDT)
Post by Bruce Allen
I will do some experiments with self-tests run in Captive mode.
Smartmontools Mailing list: does anyone out there run self-tests on
unmounted file systems in captive mode on Linux systems with libata?
Does it work?
I tried the short test (with piix_sata over libata; Samsung HD160JJ/P on
smartctl version 5.38 [i386-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Short self-test routine immediately in captive
mode". Drive command "Execute SMART Short self-test routine immediately in
captive mode" successful. Testing has begun.
Please wait 1 minutes for test to complete.
Test will complete after Tue Apr 8 13:03:22 2008
Regards.
Loading...