List:       smartmontools-support
Subject:    [smartmontools-support] Problems with two broken disks
From:       "Richard Hartmann" <richih.mailinglist () gmail ! com>
Date:       2008-12-03 14:56:56
Message-ID: 2d460de70812030656o1240a6b5w5bfd9d5fadd702e5 () mail ! gmail ! com

Hi all,

I have two disks that used to be in the same RAID and managed to die at the
same time. The relevant dd and smartctl output is pasted below.
Does anyone have an idea how I could get more data off these disks, or is
professional help our only chance? A huge thanks in advance!

If you need any other information, please do not hesitate to ask.


Richard


Disk 3LJ33MQ7 :

root@grml ~ # dd if=/dev/sda of=/mnt/sdc1/3LJ33MQ7.img bs=64k
dd: reading `/dev/sda': Input/output error
49164+1 records in
49164+1 records out
3222016000 bytes (3.2 GB) copied, 210.116 s, 15.3 MB/s
dd if=/dev/sda of=/mnt/sdc1/3LJ33MQ7.img bs=64k  0.06s user 9.42s system 4% cpu 3:30.12 total
1 root@grml ~ # smartctl -a /dev/sda
 smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
 Home page is http://smartmontools.sourceforge.net/



 === START OF INFORMATION SECTION ===


 Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
 Device Model: ST3200822AS
 Serial Number: 3LJ33MQ7
 Firmware Version: 3.01
 User Capacity: 200,049,647,616 bytes
 Device is: In smartctl database [for details use: -P show]
 ATA Version is: 6
 ATA Standard is: ATA/ATAPI-6 T13 1410D revision 2
 Local Time is: Wed Dec 3 15:19:28 2008 UTC
 SMART support is: Available - device has SMART capability.
 SMART support is: Enabled



 === START OF READ SMART DATA SECTION ===


 SMART overall-health self-assessment test result: PASSED

 General SMART Values:
 Offline data collection status:  (0x82) Offline data collection activity
                                         was completed without error.
                                         Auto Offline Data Collection: Enabled.
 Self-test execution status:      (   0) The previous self-test routine completed
                                         without error or no self-test has ever
                                         been run.
 Total time to complete Offline
 data collection:                 ( 430) seconds.
 Offline data collection
 capabilities:                    (0x5b) SMART execute Offline immediate.
                                         Auto Offline data collection on/off support.
                                         Suspend Offline collection upon new
                                         command.
                                         Offline surface scan supported.
                                         Self-test supported.
                                         No Conveyance Self-test supported.
                                         Selective Self-test supported.
 SMART capabilities:            (0x0003) Saves SMART data before entering
                                         power-saving mode.
                                         Supports SMART auto save timer.
 Error logging capability:        (0x01) Error logging supported.
                                         No General Purpose Logging support.
 Short self-test routine
 recommended polling time:        (   1) minutes.
 Extended self-test routine
 recommended polling time:        ( 111) minutes.

 SMART Attributes Data Structure revision number: 10
 Vendor Specific SMART Attributes with Thresholds:
 ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
   1 Raw_Read_Error_Rate     0x000f   052   049   006    Pre-fail  Always       -       168888113
   3 Spin_Up_Time            0x0003   096   096   000    Pre-fail  Always       -       0
   4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       46
   5 Reallocated_Sector_Ct   0x0033   099   099   036    Pre-fail  Always       -       40
   7 Seek_Error_Rate         0x000f   080   060   030    Pre-fail  Always       -       22034494182
   9 Power_On_Hours          0x0032   064   064   000    Old_age   Always       -       31926
  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
  12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       56
 194 Temperature_Celsius     0x0022   040   053   000    Old_age   Always       -       40
 195 Hardware_ECC_Recovered  0x001a   052   049   000    Old_age   Always       -       168888113
 197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       7
 198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       7
 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
 200 Multi_Zone_Error_Rate   0x0000   100   253   000    Old_age   Offline      -       0
 202 TA_Increase_Count       0x0032   100   253   000    Old_age   Always       -       0

 SMART Error Log Version: 1
 ATA Error Count: 13 (device log contains only the most recent five errors)
 CR = Command Register [HEX]
 FR = Features Register [HEX]
 SC = Sector Count Register [HEX]
 SN = Sector Number Register [HEX]
 CL = Cylinder Low Register [HEX]
 CH = Cylinder High Register [HEX]
 DH = Device/Head Register [HEX]
 DC = Device Command Register [HEX]
 ER = Error register [HEX]
 ST = Status register [HEX]
 Powered_Up_Time is measured from power on, and printed as
 DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
 SS=sec, and sss=millisec. It "wraps" after 49.710 days.

 Error 13 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 25 00 00 00 05 60 e0 00 00:18:03.931 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:18:03.926 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:59.952 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:17:59.947 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:56.048 READ DMA EXT

 Error 12 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 25 00 00 00 05 60 e0 00 00:18:03.931 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:18:03.926 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:59.952 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:17:59.947 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:56.048 READ DMA EXT

 Error 11 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 25 00 00 00 05 60 e0 00 00:17:47.917 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:17:47.911 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:59.952 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:17:59.947 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:56.048 READ DMA EXT

 Error 10 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 25 00 00 00 05 60 e0 00 00:17:47.917 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:17:47.911 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:47.906 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:17:47.899 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:56.048 READ DMA EXT

 Error 9 occurred at disk power-on lifetime: 31924 hours (1330 days + 4 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 0f 0f 06 60 e0 Error: UNC 15 sectors at LBA = 0x0060060f = 6293007

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 25 00 00 00 05 60 e0 00 00:17:47.917 READ DMA EXT
 ec 00 0f 0f 06 60 a0 00 00:17:47.911 IDENTIFY DEVICE
 25 00 00 00 05 60 e0 00 00:17:47.906 READ DMA EXT
 25 00 00 00 03 60 e0 00 00:17:47.899 READ DMA EXT
 25 00 00 00 01 60 e0 00 00:17:45.030 READ DMA EXT

 SMART Self-test log structure revision number 1
 No self-tests have been logged. [To run self-tests, use: smartctl -t]


 SMART Selective self-test log data structure revision number 1
 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
 1 0 0 Not_testing
 2 0 0 Not_testing
 3 0 0 Not_testing
 4 0 0 Not_testing
 5 0 0 Not_testing
 Selective self-test flags (0x0):
 After scanning selected spans, do NOT read-scan remainder of disk.
 If Selective self-test is pending on power-up, resume after 0 minute delay.

 64 root@grml ~ #
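
A rough cross-check of the output above, not part of the original session: dd
gave up after 3222016000 bytes, and the SMART error log reports an
uncorrectable read at LBA 6293007. The sketch below relates the two figures.
It assumes 512-byte logical sectors, which the smartctl output does not state
explicitly, and the function name is made up for illustration.

```python
# Sketch: relate the byte count reported by dd to the LBA in the SMART error log.
# Assumption: 512-byte logical sectors (not stated in the smartctl output above).
SECTOR_SIZE = 512


def dd_stop_sector(bytes_copied, sector_size=SECTOR_SIZE):
    """Return (first sector not copied, leftover bytes) for an aborted dd run."""
    return divmod(bytes_copied, sector_size)


# Figures copied from the dd and smartctl output for disk 3LJ33MQ7.
bytes_copied = 3222016000
smart_error_lba = 6293007

sector, leftover = dd_stop_sector(bytes_copied)
print("dd stopped at sector", sector, "with", leftover, "leftover bytes")
print("SMART error LBA is", smart_error_lba - sector, "sectors past that point")
```

Under that assumption dd stopped at sector 6293000, seven sectors short of the
LBA in the error log, which would be consistent with dd aborting at the first
unreadable area rather than the rest of the disk being unreadable.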



====================
====================
====================


Second disk, 3LJ2Y6CG :

root@grml ~ # dd if=/dev/sdb of=/mnt/sdd1/3LJ2Y6CG.img bs=64k
dd: reading `/dev/sdb': Input/output error
208847+1 records in
208847+1 records out
13687058432 bytes (14 GB) copied, 487.497 s, 28.1 MB/s
dd if=/dev/sdb of=/mnt/sdd1/3LJ2Y6CG.img bs=64k  0.27s user 38.35s system 7% cpu 8:07.52 total
1 root@grml ~ # smartctl -a /dev/sdb
 smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
 Home page is http://smartmontools.sourceforge.net/



 === START OF INFORMATION SECTION ===


 Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
 Device Model: ST3200822AS
 Serial Number: 3LJ2Y6CG
 Firmware Version: 3.01
 User Capacity: 200,049,647,616 bytes
 Device is: In smartctl database [for details use: -P show]
 ATA Version is: 6
 ATA Standard is: ATA/ATAPI-6 T13 1410D revision 2
 Local Time is: Wed Dec 3 15:19:49 2008 UTC
 SMART support is: Available - device has SMART capability.
 SMART support is: Enabled



 === START OF READ SMART DATA SECTION ===


 SMART overall-health self-assessment test result: PASSED

 General SMART Values:
 Offline data collection status:  (0x82) Offline data collection activity
                                         was completed without error.
                                         Auto Offline Data Collection: Enabled.
 Self-test execution status:      (   0) The previous self-test routine completed
                                         without error or no self-test has ever
                                         been run.
 Total time to complete Offline
 data collection:                 ( 430) seconds.
 Offline data collection
 capabilities:                    (0x5b) SMART execute Offline immediate.
                                         Auto Offline data collection on/off support.
                                         Suspend Offline collection upon new
                                         command.
                                         Offline surface scan supported.
                                         Self-test supported.
                                         No Conveyance Self-test supported.
                                         Selective Self-test supported.
 SMART capabilities:            (0x0003) Saves SMART data before entering
                                         power-saving mode.
                                         Supports SMART auto save timer.
 Error logging capability:        (0x01) Error logging supported.
                                         No General Purpose Logging support.
 Short self-test routine
 recommended polling time:        (   1) minutes.
 Extended self-test routine
 recommended polling time:        ( 111) minutes.

 SMART Attributes Data Structure revision number: 10
 Vendor Specific SMART Attributes with Thresholds:
 ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
   1 Raw_Read_Error_Rate     0x000f   051   048   006    Pre-fail  Always       -       32469081
   3 Spin_Up_Time            0x0003   096   096   000    Pre-fail  Always       -       0
   4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       49
   5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       11
   7 Seek_Error_Rate         0x000f   087   060   030    Pre-fail  Always       -       531666625
   9 Power_On_Hours          0x0032   064   064   000    Old_age   Always       -       31923
  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
  12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       57
 194 Temperature_Celsius     0x0022   044   053   000    Old_age   Always       -       44
 195 Hardware_ECC_Recovered  0x001a   051   048   000    Old_age   Always       -       32469081
 197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       1
 198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       1
 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
 200 Multi_Zone_Error_Rate   0x0000   100   253   000    Old_age   Offline      -       0
 202 TA_Increase_Count       0x0032   100   253   000    Old_age   Always       -       0

 SMART Error Log Version: 1
 ATA Error Count: 6 (device log contains only the most recent five errors)
 CR = Command Register [HEX]
 FR = Features Register [HEX]
 SC = Sector Count Register [HEX]
 SN = Sector Number Register [HEX]
 CL = Cylinder Low Register [HEX]
 CH = Cylinder High Register [HEX]
 DH = Device/Head Register [HEX]
 DC = Device Command Register [HEX]
 ER = Error register [HEX]
 ST = Status register [HEX]
 Powered_Up_Time is measured from power on, and printed as
 DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
 SS=sec, and sss=millisec. It "wraps" after 49.710 days.

 Error 6 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:09.327 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.325 READ DMA

 Error 5 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:09.327 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.325 READ DMA

 Error 4 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:09.327 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.325 READ DMA

 Error 3 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:09.327 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.325 READ DMA

 Error 2 occurred at disk power-on lifetime: 31920 hours (1330 days + 0 hours)
 When the command that caused the error occurred, the device was active or idle.

 After command completion occurred, registers were:
 ER ST SC SN CL CH DH
 -- -- -- -- -- -- --
 40 51 88 f8 e7 97 e1 Error: UNC 136 sectors at LBA = 0x0197e7f8 = 26732536

 Commands leading to the command that caused the error were:
 CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
 -- -- -- -- -- -- -- -- ---------------- --------------------
 c8 00 90 70 e7 97 e1 00 01:32:13.111 READ DMA
 ec 00 88 f8 e7 97 a0 00 01:32:13.106 IDENTIFY DEVICE
 c8 00 90 70 e7 97 e1 00 01:32:09.328 READ DMA
 c8 00 70 00 e7 97 e1 00 01:32:09.327 READ DMA
 c8 00 98 68 e6 97 e1 00 01:32:09.325 READ DMA

 SMART Self-test log structure revision number 1
 No self-tests have been logged. [To run self-tests, use: smartctl -t]


 SMART Selective self-test log data structure revision number 1
 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
 1 0 0 Not_testing
 2 0 0 Not_testing
 3 0 0 Not_testing
 4 0 0 Not_testing
 5 0 0 Not_testing
 Selective self-test flags (0x0):
 After scanning selected spans, do NOT read-scan remainder of disk.
 If Selective self-test is pending on power-up, resume after 0 minute delay.

 64 root@grml ~ #
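
For readers decoding the register rows by hand, here is a minimal sketch (not
part of the original session) of how the LBA printed by smartctl can be
rebuilt from the SN, CL, CH and DH columns, treating them as a 28-bit LBA in
which only the low nibble of DH carries address bits. The helper name is
hypothetical, and the final line again assumes 512-byte sectors.

```python
# Sketch: rebuild the 28-bit LBA that smartctl prints from the raw ATA
# registers in an error-log row (SN, CL, CH, DH columns, all hex).
def lba28_from_registers(sn, cl, ch, dh):
    """Combine the registers into a 28-bit LBA; only the low nibble of DH is address."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


# Register rows copied from the two error logs, in the order ER ST SC SN CL CH DH.
rows = {
    "3LJ33MQ7": "40 51 0f 0f 06 60 e0",
    "3LJ2Y6CG": "40 51 88 f8 e7 97 e1",
}

for serial, row in rows.items():
    er, st, sc, sn, cl, ch, dh = (int(field, 16) for field in row.split())
    lba = lba28_from_registers(sn, cl, ch, dh)
    print(serial, hex(lba), lba, "sector count:", sc)

# Assumption: 512-byte sectors. dd copied 13687058432 bytes from 3LJ2Y6CG.
print("3LJ2Y6CG dd stop sector:", 13687058432 // 512)
```

The decoded values match the LBAs smartctl printed (6293007 and 26732536), and
for the second disk the sector at which dd stopped comes out as 26732536 as
well, the same LBA as in its error log.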
