List: smartmontools-support
Subject: [smartmontools-support] Problems with two broken disks
From: "Richard Hartmann" <richih.mailinglist () gmail ! com>
Date: 2008-12-03 14:56:56
Message-ID: 2d460de70812030656o1240a6b5w5bfd9d5fadd702e5 () mail ! gmail ! com
Hi all,
I have two disks that used to be in a RAID and managed to die at the
same time. I will paste the relevant dd & smartctl output below.
Does anyone have any ideas how I could get more data off those disks? Is
professional help our only chance? A huge thanks in advance!
If you need any other information, please do not hesitate to contact me.
Richard
First disk, 3LJ33MQ7 :
root@grml ~ # dd if=/dev/sda of=/mnt/sdc1/3LJ33MQ7.img bs=64k
dd: reading `/dev/sda': Input/output error
49164+1 records in
49164+1 records out
3222016000 bytes (3.2 GB) copied, 210.116 s, 15.3 MB/s
dd if=/dev/sda of=/mnt/sdc1/3LJ33MQ7.img bs=64k 0.06s user 9.42s system 4% cpu 3:30.12 total
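The dd figures above already say where on the disk the copy stopped. The following is a minimal sketch of that arithmetic, not part of the original session; the 512-byte sector size is an assumption and the function name is only illustrative. With these numbers the last record is a partial one of 4096 bytes and the first unread sector is 6293000, which is 7 sectors before the LBA 6293007 that the SMART error log below reports.

```python
# Figures copied from the dd output above; SECTOR_SIZE is an assumption.
BLOCK_SIZE = 64 * 1024      # bs=64k
FULL_RECORDS = 49164        # the "49164" in "49164+1 records in"
BYTES_COPIED = 3222016000   # total byte count reported by dd
SECTOR_SIZE = 512           # assumed logical sector size


def dd_stop_position(bytes_copied, full_records, block_size, sector_size):
    """Return (partial_bytes, first_unread_sector) for an interrupted dd run."""
    # Bytes that belong to the final, incomplete record (the "+1").
    partial_bytes = bytes_copied - full_records * block_size
    first_unread_sector, remainder = divmod(bytes_copied, sector_size)
    if remainder:
        raise ValueError("byte count is not a whole number of sectors")
    return partial_bytes, first_unread_sector


partial, sector = dd_stop_position(
    BYTES_COPIED, FULL_RECORDS, BLOCK_SIZE, SECTOR_SIZE
)
print(f"partial record: {partial} bytes, first unread sector: {sector}")
```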
1 root@grml ~ # smartctl -a /dev/sda
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
Device Model: ST3200822AS
Serial Number: 3LJ33MQ7
Firmware Version: 3.01
User Capacity: 200,049,647,616 bytes
Device is: In smartctl database [for details use: -P show]
ATA Version is: 6
ATA Standard is: ATA/ATAPI-6 T13 1410D revision 2
Local Time is: Wed Dec 3 15:19:28 2008 UTC
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 430) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        No General Purpose Logging support.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 111) minutes.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   052   049   006    Pre-fail  Always       -       168888113
  3 Spin_Up_Time            0x0003   096   096   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       46
  5 Reallocated_Sector_Ct   0x0033   099   099   036    Pre-fail  Always       -       40
  7 Seek_Error_Rate         0x000f   080   060   030    Pre-fail  Always       -       22034494182
  9 Power_On_Hours          0x0032   064   064   000    Old_age   Always       -       31926
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       56
194 Temperature_Celsius     0x0022   040   053   000    Old_age   Always       -       40
195 Hardware_ECC_Recovered  0x001a   052   049   000    Old_age   Always       -       168888113
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       7
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       7
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0000   100   253   000    Old_age   Offline      -       0
202 TA_Increase_Count       0x0032   100   253   000    Old_age   Always       -       0
SMART Error Log Version: 1
ATA Error Count: 13 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
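The 49.710-day wrap quoted in the legend above is what one would expect from a millisecond counter that is 32 bits wide. The counter width is an assumption here, not something the smartctl output states; the sketch only shows that the arithmetic matches.

```python
# Assumption: Powered_Up_Time is kept in a 32-bit millisecond counter.
COUNTER_BITS = 32
MS_PER_DAY = 24 * 60 * 60 * 1000

# Number of days until such a counter overflows and starts again at zero.
wrap_days = 2 ** COUNTER_BITS / MS_PER_DAY
print(f"{wrap_days:.3f} days")
```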
Error 13 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 00 00 05 60 e0 00 00:18:03.931 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:18:03.926 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:59.952 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:17:59.947 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:56.048 READ DMA EXT
Error 12 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 00 00 05 60 e0 00 00:18:03.931 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:18:03.926 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:59.952 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:17:59.947 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:56.048 READ DMA EXT
Error 11 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 00 00 05 60 e0 00 00:17:47.917 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:17:47.911 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:59.952 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:17:59.947 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:56.048 READ DMA EXT
Error 10 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 00 00 05 60 e0 00 00:17:47.917 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:17:47.911 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:47.906 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:17:47.899 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:56.048 READ DMA EXT
Error 9 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 00 00 05 60 e0 00 00:17:47.917 READ DMA EXT
ec 00 0f 0f 06 60 a0 00 00:17:47.911 IDENTIFY DEVICE
25 00 00 00 05 60 e0 00 00:17:47.906 READ DMA EXT
25 00 00 00 03 60 e0 00 00:17:47.899 READ DMA EXT
25 00 00 00 01 60 e0 00 00:17:45.030 READ DMA EXT
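The LBA that smartctl prints next to each error can be recomputed from the register dump itself. The sketch below assumes the usual 28-bit LBA layout (low nibble of DH, then CH, CL, SN, from most to least significant); the function name is illustrative and not taken from smartmontools. For the registers logged above it gives 0x0060060f = 6293007, and the SC value 0x0f is the 15 sectors mentioned in the message.

```python
def lba28_from_registers(sn, cl, ch, dh):
    """Combine ATA task-file registers into a 28-bit LBA.

    Assumed layout: bits 27..24 come from the low nibble of DH,
    bits 23..16 from CH, bits 15..8 from CL and bits 7..0 from SN.
    """
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


# Register values copied from the error entries above: SC SN CL CH DH
sector_count = 0x0F
lba = lba28_from_registers(sn=0x0F, cl=0x06, ch=0x60, dh=0xE0)
print(f"UNC {sector_count} sectors at LBA = 0x{lba:08x} = {lba}")
```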
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
64 root@grml ~ #
====================
====================
====================
Second disk, 3LJ2Y6CG :
root@grml ~ # dd if=/dev/sdb of=/mnt/sdd1/3LJ2Y6CG.img bs=64k
dd: reading `/dev/sdb': Input/output error
208847+1 records in
208847+1 records out
13687058432 bytes (14 GB) copied, 487.497 s, 28.1 MB/s
dd if=/dev/sdb of=/mnt/sdd1/3LJ2Y6CG.img bs=64k 0.27s user 38.35s system 7% cpu 8:07.52 total
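For this disk the dd byte count can be set against two other figures from this message: the User Capacity that smartctl reports and the LBA in the SMART error log. The sketch below is only an illustration; the 512-byte sector size is an assumption. It shows that dd stopped exactly at the logged error LBA and how small a share of the disk was imaged before the first unreadable sector.

```python
# Figures copied from this message; SECTOR_SIZE is an assumption.
BYTES_COPIED = 13687058432    # dd total for /dev/sdb
USER_CAPACITY = 200049647616  # "User Capacity" reported by smartctl
ERROR_LBA = 26732536          # LBA from the SMART error log
SECTOR_SIZE = 512             # assumed logical sector size

# Sector index of the first byte that dd could not read.
stop_sector = BYTES_COPIED // SECTOR_SIZE
# Share of the whole disk that made it into the image file.
percent_imaged = 100 * BYTES_COPIED / USER_CAPACITY

print(f"dd stopped at sector {stop_sector}")
print(f"matches the logged error LBA: {stop_sector == ERROR_LBA}")
print(f"imaged {percent_imaged:.2f}% of the disk")
```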
1 root@grml ~ # smartctl -a /dev/sdb
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
Device Model: ST3200822AS
Serial Number: 3LJ2Y6CG
Firmware Version: 3.01
User Capacity: 200,049,647,616 bytes
Device is: In smartctl database [for details use: -P show]
ATA Version is: 6
ATA Standard is: ATA/ATAPI-6 T13 1410D revision 2
Local Time is: Wed Dec 3 15:19:49 2008 UTC
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 430) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        No General Purpose Logging support.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 111) minutes.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   051   048   006    Pre-fail  Always       -       32469081
  3 Spin_Up_Time            0x0003   096   096   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       49
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       11
  7 Seek_Error_Rate         0x000f   087   060   030    Pre-fail  Always       -       531666625
  9 Power_On_Hours          0x0032   064   064   000    Old_age   Always       -       31923
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       57
194 Temperature_Celsius     0x0022   044   053   000    Old_age   Always       -       44
195 Hardware_ECC_Recovered  0x001a   051   048   000    Old_age   Always       -       32469081
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       1
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       1
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0000   100   253   000    Old_age   Offline      -       0
202 TA_Increase_Count       0x0032   100   253   000    Old_age   Always       -       0
SMART Error Log Version: 1
ATA Error Count: 6 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 6 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:09.327 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.325 READ DMA
Error 5 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:09.327 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.325 READ DMA
Error 4 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:09.327 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.325 READ DMA
Error 3 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:09.327 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.325 READ DMA
Error 2 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
c8 00 70 00 e7 97 e1 00 01:32:09.327 READ DMA
c8 00 98 68 e6 97 e1 00 01:32:09.325 READ DMA
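The ER and ST bytes in the entries above (40 and 51) are bit fields. The sketch below decodes them with a partial table of bit names as they are commonly given for ATA devices; the table is an assumption for illustration, is not exhaustive, and some bits were renamed or retired across revisions of the standard. With it, ER = 0x40 reads as UNC (uncorrectable data), which agrees with the "Error: UNC" text smartctl prints, and ST = 0x51 reads as device ready, seek complete, error.

```python
# Partial bit-name tables; assumed from common ATA usage, not exhaustive.
ERROR_BITS = {0x80: "ICRC", 0x40: "UNC", 0x10: "IDNF", 0x04: "ABRT"}
STATUS_BITS = {
    0x80: "BSY", 0x40: "DRDY", 0x20: "DF",
    0x10: "DSC", 0x08: "DRQ", 0x01: "ERR",
}


def decode(value, names):
    """Return the names of all bits set in value, highest bit first."""
    return [name for bit, name in sorted(names.items(), reverse=True)
            if value & bit]


# ER and ST values copied from the error entries above.
print("ER:", decode(0x40, ERROR_BITS))
print("ST:", decode(0x51, STATUS_BITS))
```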
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
64 root@grml ~ #