Ensemble RAID1 logiciel défectueux

Bonjour,

depuis hier je suis confronté à un soucis avec un ensemble RAID 1 logiciel sur un serveur de développement. Ne sachant pas trop par où commencer, voici les infos sur la distribution et le matériel et les logs :

Edit : j’ai oublié de préciser qu’il s’agit d’une debian squeeze

$ uname -a
Linux orca 3.0.0-1-amd64 #1 SMP Sat Aug 27 16:21:11 UTC 2011 x86_64 GNU/Linux
$ lspci
00:00.0 Host bridge: Intel Corporation Core Processor DRAM Controller (rev 02)
00:02.0 VGA compatible controller: Intel Corporation Core Processor Integrated Graphics Controller (rev 02)
00:16.0 Communication controller: Intel Corporation 5 Series/3400 Series Chipset HECI Controller (rev 06)
00:16.2 IDE interface: Intel Corporation 5 Series/3400 Series Chipset PT IDER Controller (rev 06)
00:16.3 Serial controller: Intel Corporation 5 Series/3400 Series Chipset KT Controller (rev 06)
00:19.0 Ethernet controller: Intel Corporation 82578DC Gigabit Network Connection (rev 06)
00:1a.0 USB controller: Intel Corporation 5 Series/3400 Series Chipset USB2 Enhanced Host Controller (rev 06)
00:1b.0 Audio device: Intel Corporation 5 Series/3400 Series Chipset High Definition Audio (rev 06)
00:1c.0 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 1 (rev 06)
00:1c.4 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 5 (rev 06)
00:1d.0 USB controller: Intel Corporation 5 Series/3400 Series Chipset USB2 Enhanced Host Controller (rev 06)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev a6)
00:1f.0 ISA bridge: Intel Corporation 5 Series Chipset LPC Interface Controller (rev 06)
00:1f.2 IDE interface: Intel Corporation 5 Series/3400 Series Chipset 4 port SATA IDE Controller (rev 06)
00:1f.3 SMBus: Intel Corporation 5 Series/3400 Series Chipset SMBus Controller (rev 06)
00:1f.5 IDE interface: Intel Corporation 5 Series/3400 Series Chipset 2 port SATA IDE Controller (rev 06)
3f:00.0 Host bridge: Intel Corporation Core Processor QuickPath Architecture Generic Non-core Registers (rev 02)
3f:00.1 Host bridge: Intel Corporation Core Processor QuickPath Architecture System Address Decoder (rev 02)
3f:02.0 Host bridge: Intel Corporation Core Processor QPI Link 0 (rev 02)
3f:02.1 Host bridge: Intel Corporation Core Processor QPI Physical 0 (rev 02)
3f:02.2 Host bridge: Intel Corporation Core Processor Reserved (rev 02)
3f:02.3 Host bridge: Intel Corporation Core Processor Reserved (rev 02)
$ lsusb
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Bus 002 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Bus 001 Device 002: ID 8087:0020 Intel Corp. Integrated Rate Matching Hub
Bus 002 Device 002: ID 8087:0020 Intel Corp. Integrated Rate Matching Hub
Bus 001 Device 003: ID 046a:002b Cherry GmbH
# dmidecode
# dmidecode 2.11
SMBIOS 2.6 present.
83 structures occupying 3278 bytes.
Table at 0x000E9230.

Handle 0x0000, DMI type 0, 24 bytes
BIOS Information
	Vendor: Intel Corp.
	Version: TCIBX10H.86A.0035.2010.0429.1516
	Release Date: 04/29/2010
	Address: 0xF0000
	Runtime Size: 64 kB
	ROM Size: 1024 kB
	Characteristics:
		PCI is supported
		BIOS is upgradeable
		BIOS shadowing is allowed
		Boot from CD is supported
		Selectable boot is supported
		BIOS ROM is socketed
		EDD is supported
		5.25"/1.2 MB floppy services are supported (int 13h)
		3.5"/720 kB floppy services are supported (int 13h)
		3.5"/2.88 MB floppy services are supported (int 13h)
		Print screen service is supported (int 5h)
		8042 keyboard services are supported (int 9h)
		Serial services are supported (int 14h)
		Printer services are supported (int 17h)
		ACPI is supported
		USB legacy is supported
		BIOS boot specification is supported
		Targeted content distribution is supported

Handle 0x0004, DMI type 4, 42 bytes
Processor Information
	Socket Designation: XU1
	Type: Central Processor
	Family: Core i3
	Manufacturer: Intel            
	ID: 55 06 02 00 FF FB EB BF
	Signature: Type 0, Family 6, Model 37, Stepping 5
	Flags:
		FPU (Floating-point unit on-chip)
		VME (Virtual mode extension)
		DE (Debugging extension)
		PSE (Page size extension)
		TSC (Time stamp counter)
		MSR (Model specific registers)
		PAE (Physical address extension)
		MCE (Machine check exception)
		CX8 (CMPXCHG8 instruction supported)
		APIC (On-chip APIC hardware supported)
		SEP (Fast system call)
		MTRR (Memory type range registers)
		PGE (Page global enable)
		MCA (Machine check architecture)
		CMOV (Conditional move instruction supported)
		PAT (Page attribute table)
		PSE-36 (36-bit page size extension)
		CLFSH (CLFLUSH instruction supported)
		DS (Debug store)
		ACPI (ACPI supported)
		MMX (MMX technology supported)
		FXSR (FXSAVE and FXSTOR instructions supported)
		SSE (Streaming SIMD extensions)
		SSE2 (Streaming SIMD extensions 2)
		SS (Self-snoop)
		HTT (Multi-threading)
		TM (Thermal monitor supported)
		PBE (Pending break enabled)
	Version: Intel(R) Core(TM) i3 CPU         550  @ 3.20GHz
	Voltage: 0.0 V
	External Clock: 533 MHz
	Max Speed: 3192 MHz
	Current Speed: 3192 MHz
	Status: Populated, Enabled
	Upgrade: Other
	L1 Cache Handle: 0x0005
	L2 Cache Handle: 0x0006
	L3 Cache Handle: 0x0007
	Serial Number: To Be Filled By O.E.M.
	Asset Tag: To Be Filled By O.E.M.
	Part Number: To Be Filled By O.E.M.
	Core Count: 2
	Core Enabled: 1
	Thread Count: 2
	Characteristics:
		64-bit capable

Handle 0x0005, DMI type 7, 19 bytes
Cache Information
	Socket Designation: L1-Cache
	Configuration: Enabled, Not Socketed, Level 1
	Operational Mode: Write Back
	Location: Internal
	Installed Size: 64 kB
	Maximum Size: 64 kB
	Supported SRAM Types:
		Other
	Installed SRAM Type: Other
	Speed: Unknown
	Error Correction Type: None
	System Type: Unified
	Associativity: 4-way Set-associative

Handle 0x0006, DMI type 7, 19 bytes
Cache Information
	Socket Designation: L2-Cache
	Configuration: Enabled, Not Socketed, Level 2
	Operational Mode: Varies With Memory Address
	Location: Internal
	Installed Size: 512 kB
	Maximum Size: 512 kB
	Supported SRAM Types:
		Other
	Installed SRAM Type: Other
	Speed: Unknown
	Error Correction Type: None
	System Type: Instruction
	Associativity: 8-way Set-associative

Handle 0x0007, DMI type 7, 19 bytes
Cache Information
	Socket Designation: L3-Cache
	Configuration: Enabled, Not Socketed, Level 3
	Operational Mode: Unknown
	Location: Internal
	Installed Size: 4096 kB
	Maximum Size: 4096 kB
	Supported SRAM Types:
		Other
	Installed SRAM Type: Other
	Speed: Unknown
	Error Correction Type: None
	System Type: Instruction
	Associativity: 16-way Set-associative

Handle 0x0008, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J1A1
	Internal Connector Type: None
	External Reference Designator: PS2Mouse
	External Connector Type: PS/2
	Port Type: Mouse Port

Handle 0x0009, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J1A1
	Internal Connector Type: None
	External Reference Designator: Keyboard
	External Connector Type: PS/2
	Port Type: Keyboard Port

Handle 0x000A, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J2A1
	Internal Connector Type: None
	External Reference Designator: TV Out
	External Connector Type: Mini Centronics Type-14
	Port Type: Other

Handle 0x000B, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J2A2A
	Internal Connector Type: None
	External Reference Designator: COM A
	External Connector Type: DB-9 male
	Port Type: Serial Port 16550A Compatible

Handle 0x000C, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J2A2B
	Internal Connector Type: None
	External Reference Designator: Video
	External Connector Type: DB-15 female
	Port Type: Video Port

Handle 0x000D, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J3A1
	Internal Connector Type: None
	External Reference Designator: USB1
	External Connector Type: Access Bus (USB)
	Port Type: USB

Handle 0x000E, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J3A1
	Internal Connector Type: None
	External Reference Designator: USB2
	External Connector Type: Access Bus (USB)
	Port Type: USB

Handle 0x000F, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J3A1
	Internal Connector Type: None
	External Reference Designator: USB3
	External Connector Type: Access Bus (USB)
	Port Type: USB

Handle 0x0010, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J9A1 - TPM HDR
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0011, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J9C1 - PCIE DOCKING CONN
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0012, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J2B3 - CPU FAN
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0013, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J6C2 - EXT HDMI
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0014, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J3C1 - GMCH FAN
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0015, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J1D1 - ITP
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0016, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J9E2 - MDC INTPSR
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0017, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J9E4 - MDC INTPSR
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0018, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J9E3 - LPC HOT DOCKING
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0019, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J9E1 - SCAN MATRIX
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x001A, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J9G1 - LPC SIDE BAND
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x001B, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J8F1 - UNIFIED
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x001C, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J6F1 - LVDS
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x001D, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J2F1 - LAI FAN
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x001E, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J2G1 - GFX VID
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x001F, DMI type 8, 9 bytes
Port Connector Information
	Internal Reference Designator: J1G6 - AC JACK
	Internal Connector Type: Other
	External Reference Designator: Not Specified
	External Connector Type: None
	Port Type: Other

Handle 0x0020, DMI type 9, 17 bytes
System Slot Information
	Designation: IJ31
	Type: x1 PCI Express x1
	Current Usage: Available
	Length: Short
	ID: 1
	Characteristics:
		3.3 V is provided
		Opening is shared
		PME signal is supported
	Bus Address: 0000:00:1c.0

Handle 0x0021, DMI type 9, 17 bytes
System Slot Information
	Designation: IJ32
	Type: x1 PCI Express x1
	Current Usage: Available
	Length: Short
	ID: 2
	Characteristics:
		3.3 V is provided
		Opening is shared
		PME signal is supported
	Bus Address: 0000:00:1c.4

Handle 0x0022, DMI type 9, 17 bytes
System Slot Information
	Designation: IJ311
	Type: 32-bit PCI
	Current Usage: Available
	Length: Short
	ID: 3
	Characteristics:
		3.3 V is provided
		Opening is shared
		PME signal is supported
	Bus Address: 0000:00:1e.0

Handle 0x0023, DMI type 10, 6 bytes
On Board Device Information
	Type: Sound
	Status: Enabled
	Description:  Realtek High Definition Audio

Handle 0x0024, DMI type 10, 6 bytes
On Board Device Information
	Type: Ethernet
	Status: Enabled
	Description:  Intel(R) 82578DC Ethernet

Handle 0x0025, DMI type 11, 5 bytes
OEM Strings
	String 1: To Be Filled By O.E.M.

Handle 0x0026, DMI type 12, 5 bytes
System Configuration Options
	Option 1: To Be Filled By O.E.M.

Handle 0x0027, DMI type 13, 22 bytes
BIOS Language Information
	Language Description Format: Long
	Installable Languages: 1
		en|US|iso8859-1
	Currently Installed Language: en|US|iso8859-1

Handle 0x0028, DMI type 16, 15 bytes
Physical Memory Array
	Location: System Board Or Motherboard
	Use: System Memory
	Error Correction Type: None
	Maximum Capacity: 16 GB
	Error Information Handle: 0x0029
	Number Of Devices: 4

Handle 0x0029, DMI type 18, 23 bytes
32-bit Memory Error Information
	Type: Unknown
	Granularity: Unknown
	Operation: Unknown
	Vendor Syndrome: Unknown
	Memory Array Address: Unknown
	Device Address: Unknown
	Resolution: Unknown

Handle 0x002A, DMI type 19, 15 bytes
Memory Array Mapped Address
	Starting Address: 0x00000000000
	Ending Address: 0x000FFFFFFFF
	Range Size: 4 GB
	Physical Array Handle: 0x0028
	Partition Width: 1

Handle 0x002B, DMI type 17, 28 bytes
Memory Device
	Array Handle: 0x0028
	Error Information Handle: 0x002C
	Total Width: 64 bits
	Data Width: 64 bits
	Size: 2048 MB
	Form Factor: DIMM
	Set: None
	Locator: CHANNEL A
	Bank Locator: CHANNEL A-DIMM 0
	Type: DDR3
	Type Detail: Synchronous
	Speed: 1333 MHz
	Manufacturer: Kingston        
	Serial Number: 0E0FB248  
	Asset Tag: A1_AssetTagNum0
	Part Number: 99U5471-001.A00LF 
	Rank: 2

Handle 0x002C, DMI type 18, 23 bytes
32-bit Memory Error Information
	Type: Unknown
	Granularity: Unknown
	Operation: Unknown
	Vendor Syndrome: Unknown
	Memory Array Address: Unknown
	Device Address: Unknown
	Resolution: Unknown

Handle 0x002D, DMI type 20, 19 bytes
Memory Device Mapped Address
	Starting Address: 0x00000000000
	Ending Address: 0x0007FFFFFFF
	Range Size: 2 GB
	Physical Device Handle: 0x002B
	Memory Array Mapped Address Handle: 0x002A
	Partition Row Position: 1

Handle 0x002E, DMI type 17, 28 bytes
Memory Device
	Array Handle: 0x0028
	Error Information Handle: 0x002F
	Total Width: 64 bits
	Data Width: 64 bits
	Size: No Module Installed
	Form Factor: DIMM
	Set: None
	Locator: CHANNEL A
	Bank Locator: CHANNEL A-DIMM 1
	Type: Unknown
	Type Detail: Synchronous
	Speed: Unknown
	Manufacturer: A1_Manufacturer1
	Serial Number: A1_SerNum1
	Asset Tag: A1_AssetTagNum1
	Part Number: Array1_PartNumber1
	Rank: Unknown

Handle 0x002F, DMI type 18, 23 bytes
32-bit Memory Error Information
	Type: Unknown
	Granularity: Unknown
	Operation: Unknown
	Vendor Syndrome: Unknown
	Memory Array Address: Unknown
	Device Address: Unknown
	Resolution: Unknown

Handle 0x0030, DMI type 126, 19 bytes
Inactive

Handle 0x0031, DMI type 17, 28 bytes
Memory Device
	Array Handle: 0x0028
	Error Information Handle: 0x0032
	Total Width: 64 bits
	Data Width: 64 bits
	Size: 2048 MB
	Form Factor: DIMM
	Set: None
	Locator: CHANNEL B
	Bank Locator: CHANNEL B-DIMM 0
	Type: DDR3
	Type Detail: Synchronous
	Speed: 1333 MHz
	Manufacturer: Kingston        
	Serial Number: 5410197B  
	Asset Tag: A1_AssetTagNum2
	Part Number: 99U5471-001.A00LF 
	Rank: 2

Handle 0x0032, DMI type 18, 23 bytes
32-bit Memory Error Information
	Type: Unknown
	Granularity: Unknown
	Operation: Unknown
	Vendor Syndrome: Unknown
	Memory Array Address: Unknown
	Device Address: Unknown
	Resolution: Unknown

Handle 0x0033, DMI type 20, 19 bytes
Memory Device Mapped Address
	Starting Address: 0x00080000000
	Ending Address: 0x000FFFFFFFF
	Range Size: 2 GB
	Physical Device Handle: 0x0031
	Memory Array Mapped Address Handle: 0x002A
	Partition Row Position: 1

Handle 0x0034, DMI type 17, 28 bytes
Memory Device
	Array Handle: 0x0028
	Error Information Handle: 0x0035
	Total Width: 64 bits
	Data Width: 64 bits
	Size: No Module Installed
	Form Factor: DIMM
	Set: None
	Locator: CHANNEL B
	Bank Locator: CHANNEL B-DIMM 1
	Type: Unknown
	Type Detail: Synchronous
	Speed: Unknown
	Manufacturer: A1_Manufacturer3
	Serial Number: A1_SerNum3
	Asset Tag: A1_AssetTagNum3
	Part Number: Array1_PartNumber3
	Rank: Unknown

Handle 0x0035, DMI type 18, 23 bytes
32-bit Memory Error Information
	Type: Unknown
	Granularity: Unknown
	Operation: Unknown
	Vendor Syndrome: Unknown
	Memory Array Address: Unknown
	Device Address: Unknown
	Resolution: Unknown

Handle 0x0036, DMI type 126, 19 bytes
Inactive

Handle 0x0037, DMI type 32, 20 bytes
System Boot Information
	Status: No errors detected

Handle 0x0038, DMI type 34, 11 bytes
Management Device
	Description: LM78-1
	Type: LM78
	Address: 0x00000000
	Address Type: I/O Port

Handle 0x0039, DMI type 26, 22 bytes
Voltage Probe
	Description: LM78A
	Location: <OUT OF SPEC>
	Status: <OUT OF SPEC>
	Maximum Value: Unknown
	Minimum Value: Unknown
	Resolution: Unknown
	Tolerance: Unknown
	Accuracy: Unknown
	OEM-specific Information: 0x00000000
	Nominal Value: Unknown

Handle 0x003A, DMI type 36, 16 bytes
Management Device Threshold Data
	Lower Non-critical Threshold: 1
	Upper Non-critical Threshold: 2
	Lower Critical Threshold: 3
	Upper Critical Threshold: 4
	Lower Non-recoverable Threshold: 5
	Upper Non-recoverable Threshold: 6

Handle 0x003B, DMI type 35, 11 bytes
Management Device Component
	Description: To Be Filled By O.E.M.
	Management Device Handle: 0x0038
	Component Handle: 0x0038
	Threshold Handle: 0x0039

Handle 0x003C, DMI type 28, 22 bytes
Temperature Probe
	Description: LM78A
	Location: <OUT OF SPEC>
	Status: <OUT OF SPEC>
	Maximum Value: Unknown
	Minimum Value: Unknown
	Resolution: Unknown
	Tolerance: Unknown
	Accuracy: Unknown
	OEM-specific Information: 0x00000000
	Nominal Value: Unknown

Handle 0x003D, DMI type 36, 16 bytes
Management Device Threshold Data
	Lower Non-critical Threshold: 1
	Upper Non-critical Threshold: 2
	Lower Critical Threshold: 3
	Upper Critical Threshold: 4
	Lower Non-recoverable Threshold: 5
	Upper Non-recoverable Threshold: 6

Handle 0x003E, DMI type 35, 11 bytes
Management Device Component
	Description: To Be Filled By O.E.M.
	Management Device Handle: 0x0038
	Component Handle: 0x003B
	Threshold Handle: 0x003C

Handle 0x003F, DMI type 27, 14 bytes
Cooling Device
	Temperature Probe Handle: 0x003C
	Type: <OUT OF SPEC>
	Status: <OUT OF SPEC>
	Cooling Unit Group: 1
	OEM-specific Information: 0x00000000
	Nominal Speed: Unknown Or Non-rotating

Handle 0x0040, DMI type 36, 16 bytes
Management Device Threshold Data
	Lower Non-critical Threshold: 1
	Upper Non-critical Threshold: 2
	Lower Critical Threshold: 3
	Upper Critical Threshold: 4
	Lower Non-recoverable Threshold: 5
	Upper Non-recoverable Threshold: 6

Handle 0x0041, DMI type 35, 11 bytes
Management Device Component
	Description: To Be Filled By O.E.M.
	Management Device Handle: 0x0038
	Component Handle: 0x003E
	Threshold Handle: 0x003F

Handle 0x0042, DMI type 27, 14 bytes
Cooling Device
	Temperature Probe Handle: 0x003C
	Type: <OUT OF SPEC>
	Status: <OUT OF SPEC>
	Cooling Unit Group: 1
	OEM-specific Information: 0x00000000
	Nominal Speed: Unknown Or Non-rotating

Handle 0x0043, DMI type 36, 16 bytes
Management Device Threshold Data
	Lower Non-critical Threshold: 1
	Upper Non-critical Threshold: 2
	Lower Critical Threshold: 3
	Upper Critical Threshold: 4
	Lower Non-recoverable Threshold: 5
	Upper Non-recoverable Threshold: 6

Handle 0x0044, DMI type 35, 11 bytes
Management Device Component
	Description: To Be Filled By O.E.M.
	Management Device Handle: 0x0038
	Component Handle: 0x0041
	Threshold Handle: 0x0042

Handle 0x0045, DMI type 29, 22 bytes
Electrical Current Probe
	Description: ABC
	Location: <OUT OF SPEC>
	Status: <OUT OF SPEC>
	Maximum Value: Unknown
	Minimum Value: Unknown
	Resolution: Unknown
	Tolerance: Unknown
	Accuracy: Unknown
	OEM-specific Information: 0x00000000
	Nominal Value: Unknown

Handle 0x0046, DMI type 36, 16 bytes
Management Device Threshold Data

Handle 0x0047, DMI type 35, 11 bytes
Management Device Component
	Description: To Be Filled By O.E.M.
	Management Device Handle: 0x0038
	Component Handle: 0x0044
	Threshold Handle: 0x0042

Handle 0x0048, DMI type 39, 22 bytes
System Power Supply
	Power Unit Group: 1
	Location: To Be Filled By O.E.M.
	Name: To Be Filled By O.E.M.
	Manufacturer: To Be Filled By O.E.M.
	Serial Number: To Be Filled By O.E.M.
	Asset Tag: To Be Filled By O.E.M.
	Model Part Number: To Be Filled By O.E.M.
	Revision: To Be Filled By O.E.M.
	Max Power Capacity: Unknown
	Status: Not Present
	Type: <OUT OF SPEC>
	Input Voltage Range Switching: <OUT OF SPEC>
	Plugged: Yes
	Hot Replaceable: No
	Input Voltage Probe Handle: 0x0039
	Cooling Device Handle: 0x003F
	Input Current Probe Handle: 0x0045

Handle 0x0049, DMI type 41, 11 bytes
Onboard Device
	Reference Designation:  Realtek High Definition Audio
	Type: Sound
	Status: Enabled
	Type Instance: 1
	Bus Address: 0000:00:1b.0

Handle 0x004A, DMI type 41, 11 bytes
Onboard Device
	Reference Designation:  Intel(R) 82578DC Ethernet
	Type: Ethernet
	Status: Enabled
	Type Instance: 1
	Bus Address: 0000:00:19.0

Handle 0x0002, DMI type 2, 15 bytes
Base Board Information
	Manufacturer: Intel Corporation
	Product Name: DH55HC
	Version: AAE70933-504
	Serial Number: BTHC030001TK
	Asset Tag: To be filled by O.E.M.
	Features:
		Board is a hosting board
		Board is replaceable
	Location In Chassis: To be filled by O.E.M.
	Chassis Handle: 0x0003
	Type: Motherboard
	Contained Object Handles: 0

Handle 0x0003, DMI type 3, 21 bytes
Chassis Information
	Manufacturer:                                  
	Type: Desktop
	Lock: Not Present
	Version:                                  
	Serial Number:                                  
	Asset Tag:                                  
	Boot-up State: Safe
	Power Supply State: Safe
	Thermal State: Safe
	Security Status: None
	OEM Information: 0x00000000
	Height: Unspecified
	Number Of Power Cords: 1
	Contained Elements: 0

Handle 0x0001, DMI type 1, 27 bytes
System Information
	Manufacturer:                                  
	Product Name:                                  
	Version:                                  
	Serial Number:                                  
	UUID: 0005875A-A394-DF11-8A64-7071BC6C48C0
	Wake-up Type: Power Switch
	SKU Number: Not Specified
	Family: Not Specified

Handle 0x004B, DMI type 10, 6 bytes
On Board Device Information
	Type: Video
	Status: Enabled
	Description:  Intel(R) GMA HD Device

Handle 0x004C, DMI type 41, 11 bytes
Onboard Device
	Reference Designation:  Intel(R) GMA HD Device
	Type: Video
	Status: Enabled
	Type Instance: 1
	Bus Address: 0000:00:02.0

Handle 0x004D, DMI type 9, 17 bytes
System Slot Information
	Designation: IJ41
	Type: x16 PCI Express x16
	Current Usage: Available
	Length: Long
	ID: 0
	Characteristics:
		3.3 V is provided
		Opening is shared
		PME signal is supported
	Bus Address: 0000:00:01.0

Handle 0x004E, DMI type 9, 17 bytes
System Slot Information
	Designation: IJ312
	Type: 32-bit PCI
	Current Usage: Available
	Length: Short
	ID: 4
	Characteristics:
		3.3 V is provided
		Opening is shared
		PME signal is supported
	Bus Address: 0000:00:1e.0

Handle 0x004F, DMI type 9, 17 bytes
System Slot Information
	Designation: IJ313
	Type: 32-bit PCI
	Current Usage: Available
	Length: Short
	ID: 5
	Characteristics:
		3.3 V is provided
		Opening is shared
		PME signal is supported
	Bus Address: 0000:00:1e.0

Handle 0x0050, DMI type 129, 8 bytes
OEM-specific Type
	Header and Data:
		81 08 50 00 01 01 02 01
	Strings:
		Intel_ASF
		Intel_ASF_001

Handle 0x0051, DMI type 131, 64 bytes
OEM-specific Type
	Header and Data:
		83 40 51 00 31 00 00 00 00 00 00 00 00 00 00 00
		F8 00 06 3B FF FF FF FF 0B C0 00 00 01 00 06 00
		12 04 00 00 00 00 00 00 C8 00 F0 10 00 00 00 00
		00 00 00 00 F2 00 00 00 76 50 72 6F 00 00 00 00

Handle 0x0052, DMI type 127, 4 bytes
End Of Table

Ensuite voici les informations relative aux disques et au RAID :

# fdisk -l                                                                                                                                                                              10:39

Disk /dev/sda: 500.1 GB, 500107862016 bytes
255 heads, 63 sectors/track, 60801 cylinders, total 976773168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x000c2f99

   Device Boot      Start         End      Blocks   Id  System
/dev/sda1            2048   968945663   484471808   83  Linux
/dev/sda2       968945664   976771071     3912704   82  Linux swap / Solaris

Disk /dev/sdb: 500.1 GB, 500107862016 bytes
255 heads, 63 sectors/track, 60801 cylinders, total 976773168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x00000000

   Device Boot      Start         End      Blocks   Id  System
/dev/sdb1   *          63   976768064   488384001   83  Linux

Disk /dev/sdc: 1500.3 GB, 1500301910016 bytes
255 heads, 63 sectors/track, 182401 cylinders, total 2930277168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x0001c8f7

   Device Boot      Start         End      Blocks   Id  System
/dev/sdc1            2048  2930276351  1465137152   fd  Linux raid autodetect

Disk /dev/sde: 500.1 GB, 500107862016 bytes
255 heads, 63 sectors/track, 60801 cylinders, total 976773168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x00000000

   Device Boot      Start         End      Blocks   Id  System
/dev/sde1              63   976768064   488384001   fd  Linux raid autodetect

Disk /dev/sdd: 1500.3 GB, 1500301910016 bytes
255 heads, 63 sectors/track, 182401 cylinders, total 2930277168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x0001e5fd

   Device Boot      Start         End      Blocks   Id  System
/dev/sdd1            2048  2930276351  1465137152   fd  Linux raid autodetect

Disk /dev/md0: 500.1 GB, 500105150464 bytes
2 heads, 4 sectors/track, 122095984 cylinders, total 976767872 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x00000000

Disk /dev/md0 doesn't contain a valid partition table

Les disques constituant l’ensemble RAID défectueux sont les 2 de 1,5To

[code]# mdadm --examine /dev/sdc1 10:43
/dev/sdc1:
Magic : a92b4efc
Version : 1.2
Feature Map : 0x0
Array UUID : a5053715:cbb40291:c1018b43:7a8bc48e
Name : orca:1 (local to host orca)
Creation Time : Tue Jan 25 23:25:43 2011
Raid Level : raid1
Raid Devices : 2

Avail Dev Size : 2930272256 (1397.26 GiB 1500.30 GB)
Array Size : 2930271984 (1397.26 GiB 1500.30 GB)
Used Dev Size : 2930271984 (1397.26 GiB 1500.30 GB)
Data Offset : 2048 sectors
Super Offset : 8 sectors
State : clean
Device UUID : 73e1d19d:dea345e4:d05be982:25b721ab

Update Time : Tue Dec 13 08:14:14 2011
   Checksum : 2bf3f45b - correct
     Events : 212

Device Role : Active device 0
Array State : AA (‘A’ == active, ‘.’ == missing)

mdadm --examine /dev/sdd1 10:45

/dev/sdd1:
Magic : a92b4efc
Version : 1.2
Feature Map : 0x0
Array UUID : a5053715:cbb40291:c1018b43:7a8bc48e
Name : orca:1 (local to host orca)
Creation Time : Tue Jan 25 23:25:43 2011
Raid Level : raid1
Raid Devices : 2

Avail Dev Size : 2930272256 (1397.26 GiB 1500.30 GB)
Array Size : 2930271984 (1397.26 GiB 1500.30 GB)
Used Dev Size : 2930271984 (1397.26 GiB 1500.30 GB)
Data Offset : 2048 sectors
Super Offset : 8 sectors
State : clean
Device UUID : c11e19dd:65a28ee8:033fbadd:1caece81

Update Time : Tue Dec 13 08:14:14 2011
   Checksum : a1020a5b - correct
     Events : 212

Device Role : Active device 1
Array State : AA (‘A’ == active, ‘.’ == missing)

[/code]

[code]$ cat /etc/mdadm/mdadm.conf 10:47

mdadm.conf

Please refer to mdadm.conf(5) for information about this file.

by default, scan all partitions (/proc/partitions) for MD superblocks.

alternatively, specify devices to scan, using wildcards if desired.

DEVICE partitions

auto-create devices with Debian standard permissions

CREATE owner=root group=disk mode=0660 auto=yes

automatically tag new arrays as belonging to the local system

HOMEHOST

instruct the monitoring daemon where to send mail alerts

MAILADDR root

definitions of existing MD arrays

ARRAY /dev/md0 UUID=1d049a72:b75332a6:a186962f:f812ee19
#ARRAY /dev/md/1 metadata=1.2 UUID=a5053715:cbb40291:c1018b43:7a8bc48e name=orca:1
ARRAY /dev/md1 level=raid1 num-devices=2 UUID=a5053715:cbb40291:c1018b43:7a8bc48e

This file was auto-generated on Tue, 25 Jan 2011 23:41:37 +0100

by mkconf 3.1.4-1+8efb9d1

[/code]

#mdadm --assemble /dev/md1 /dev/sdc1 /dev/sdd1 mdadm: failed to add /dev/sdd1 to /dev/md1: Invalid argument mdadm: failed to add /dev/sdc1 to /dev/md1: Invalid argument mdadm: /dev/md1 assembled from 0 drives - not enough to start the array.

Et enfin des choses intéressantes dans dmesg qui me font penser que les disques durs sont HS mais je ne comprend pas trop pourquoi ca arriverait tout d’un coup. A moins que ca ait à voir avec la coupure de courant d’hier qui aurait provoquer une surtension au moment de réenclencher le disjoncteur ?

# dmesg ... [ 97.547675] ata4.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 [ 97.549223] ata4.00: BMDMA stat 0x64 [ 97.550724] ata4.00: failed command: READ DMA [ 97.552216] ata4.00: cmd c8/00:08:08:08:00/00:00:00:00:00/e0 tag 0 dma 4096 in [ 97.552217] res 51/40:00:0a:08:00/00:00:00:00:00/e0 Emask 0x9 (media error) [ 97.555233] ata4.00: status: { DRDY ERR } [ 97.556744] ata4.00: error: { UNC } [ 97.573393] ata4.00: configured for UDMA/133 [ 97.581478] ata4.01: configured for UDMA/133 [ 97.581492] ata4: EH complete [ 97.603576] ata4.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 [ 97.605140] ata4.00: BMDMA stat 0x64 [ 97.606637] ata4.00: failed command: READ DMA [ 97.608140] ata4.00: cmd c8/00:08:08:08:00/00:00:00:00:00/e0 tag 0 dma 4096 in [ 97.608140] res 51/40:00:0a:08:00/00:00:00:00:00/e0 Emask 0x9 (media error) [ 97.611142] ata4.00: status: { DRDY ERR } [ 97.612626] ata4.00: error: { UNC } [ 97.629333] ata4.00: configured for UDMA/133 [ 97.637375] ata4.01: configured for UDMA/133 [ 97.637389] sd 3:0:0:0: [sdc] Unhandled sense code [ 97.637392] sd 3:0:0:0: [sdc] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE [ 97.637396] sd 3:0:0:0: [sdc] Sense Key : Medium Error [current] [descriptor] [ 97.637401] Descriptor sense data with sense descriptors (in hex): [ 97.637403] 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00 [ 97.637413] 00 00 08 0a [ 97.637417] sd 3:0:0:0: [sdc] Add. Sense: Unrecovered read error - auto reallocate failed [ 97.637423] sd 3:0:0:0: [sdc] CDB: Read(10): 28 00 00 00 08 08 00 00 08 00 [ 97.637432] end_request: I/O error, dev sdc, sector 2058 [ 97.638931] Buffer I/O error on device sdc1, logical block 1 [ 97.640431] ata4: EH complete ... [ 104.671717] res 51/40:00:0a:08:00/00:00:00:00:00/f0 Emask 0x9 (media error) [ 104.674802] ata4.01: status: { DRDY ERR } [ 104.676330] ata4.01: error: { UNC } [ 104.691213] ata4.00: configured for UDMA/133 [ 104.699294] ata4.01: configured for UDMA/133 [ 104.699311] ata4: EH complete [ 104.716534] ata4.01: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 [ 104.718113] ata4.01: failed command: READ DMA [ 104.719657] ata4.01: cmd c8/00:08:08:08:00/00:00:00:00:00/f0 tag 0 dma 4096 in [ 104.719658] res 51/40:00:0a:08:00/00:00:00:00:00/f0 Emask 0x9 (media error) [ 104.722775] ata4.01: status: { DRDY ERR } [ 104.724322] ata4.01: error: { UNC } [ 104.739163] ata4.00: configured for UDMA/133 [ 104.747216] ata4.01: configured for UDMA/133 [ 104.747231] ata4: EH complete [ 104.764457] ata4.01: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 [ 104.766051] ata4.01: failed command: READ DMA [ 104.767593] ata4.01: cmd c8/00:08:08:08:00/00:00:00:00:00/f0 tag 0 dma 4096 in [ 104.767594] res 51/40:00:0a:08:00/00:00:00:00:00/f0 Emask 0x9 (media error) [ 104.770702] ata4.01: status: { DRDY ERR } [ 104.772249] ata4.01: error: { UNC } [ 104.787106] ata4.00: configured for UDMA/133 [ 104.795141] ata4.01: configured for UDMA/133 [ 104.795158] sd 3:0:1:0: [sdd] Unhandled sense code [ 104.795161] sd 3:0:1:0: [sdd] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE [ 104.795166] sd 3:0:1:0: [sdd] Sense Key : Medium Error [current] [descriptor] [ 104.795172] Descriptor sense data with sense descriptors (in hex): [ 104.795175] 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00 [ 104.795187] 00 00 08 0a [ 104.795192] sd 3:0:1:0: [sdd] Add. Sense: Unrecovered read error - auto reallocate failed [ 104.795199] sd 3:0:1:0: [sdd] CDB: Read(10): 28 00 00 00 08 08 00 00 08 00 [ 104.795210] end_request: I/O error, dev sdd, sector 2058 [ 104.796779] ata4: EH complete ...

Ce sont des extraits de dmesg qui se répètent plusieurs fois (énormément).

Après des tas de recherche sur internet, des tentatives infructueuses de récupération…, je ne sais plus trop quoi faire. Toute aide sera la bienvenue. N’hésitez pas à demander plus d’infos si besoin.

Merci.

Installes le paquet smartmontools et lance les commandes suivantes sur chaque disque :

  • smartctl -i /dev/sda
  • smartctl -a /dev/sda

Cela permettra d’en savoir plus sur l’état de tes disques.

Pour le premier disque de l’ensemble RAID :

# smartctl -a /dev/sdc
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.0.0-1-amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar Green (Adv. Format)
Device Model:     WDC WD15EARS-32MVWB0
Serial Number:    WD-WCAZA2305722
LU WWN Device Id: 5 0014ee 2afe13824
Firmware Version: 51.0AB51
User Capacity:    1 500 301 910 016 bytes [1,50 TB]
Sector Size:      512 bytes logical/physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Wed Dec 14 12:59:49 2011 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82)	Offline data collection activity
					was completed without error.
					Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(36600) seconds.
Offline data collection
capabilities: 			 (0x7b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 ( 255) minutes.
Conveyance self-test routine
recommended polling time: 	 (   5) minutes.
SCT capabilities: 	       (0x3035)	SCT Status supported.
					SCT Feature Control supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   173   173   051    Pre-fail  Always       -       14046
  3 Spin_Up_Time            0x0027   253   253   021    Pre-fail  Always       -       991
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       14
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   090   090   000    Old_age   Always       -       7735
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       12
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       10
193 Load_Cycle_Count        0x0032   194   194   000    Old_age   Always       -       19343
194 Temperature_Celsius     0x0022   123   114   000    Old_age   Always       -       27
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
ATA Error Count: 14046 (device log contains only the most recent five errors)
	CR = Command Register [HEX]
	FR = Features Register [HEX]
	SC = Sector Count Register [HEX]
	SN = Sector Number Register [HEX]
	CL = Cylinder Low Register [HEX]
	CH = Cylinder High Register [HEX]
	DH = Device/Head Register [HEX]
	DC = Device Command Register [HEX]
	ER = Error register [HEX]
	ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 14046 occurred at disk power-on lifetime: 7733 hours (322 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 08      00:52:23.310  READ DMA
  ec 00 00 00 00 00 a0 08      00:52:23.294  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08      00:52:23.294  SET FEATURES [Set transfer mode]

Error 14045 occurred at disk power-on lifetime: 7733 hours (322 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 08      00:52:23.254  READ DMA
  ec 00 00 00 00 00 a0 08      00:52:23.238  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08      00:52:23.238  SET FEATURES [Set transfer mode]

Error 14044 occurred at disk power-on lifetime: 7733 hours (322 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 08      00:52:23.198  READ DMA
  ec 00 00 00 00 00 a0 08      00:52:23.182  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08      00:52:23.182  SET FEATURES [Set transfer mode]

Error 14043 occurred at disk power-on lifetime: 7733 hours (322 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 08      00:52:23.142  READ DMA
  ec 00 00 00 00 00 a0 08      00:52:23.126  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08      00:52:23.126  SET FEATURES [Set transfer mode]

Error 14042 occurred at disk power-on lifetime: 7733 hours (322 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 08      00:52:23.086  READ DMA
  ec 00 00 00 00 00 a0 08      00:52:23.070  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08      00:52:23.070  SET FEATURES [Set transfer mode]

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Pour le second disque :

# smartctl -a /dev/sdd
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.0.0-1-amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar Green (Adv. Format)
Device Model:     WDC WD15EARS-32MVWB0
Serial Number:    WD-WCAZA2316941
LU WWN Device Id: 5 0014ee 25a8b84d1
Firmware Version: 51.0AB51
User Capacity:    1 500 301 910 016 bytes [1,50 TB]
Sector Size:      512 bytes logical/physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Wed Dec 14 13:01:23 2011 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84)	Offline data collection activity
					was suspended by an interrupting command from host.
					Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(37080) seconds.
Offline data collection
capabilities: 			 (0x7b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 ( 255) minutes.
Conveyance self-test routine
recommended polling time: 	 (   5) minutes.
SCT capabilities: 	       (0x3035)	SCT Status supported.
					SCT Feature Control supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   171   171   051    Pre-fail  Always       -       13896
  3 Spin_Up_Time            0x0027   253   253   021    Pre-fail  Always       -       1141
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       14
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   090   090   000    Old_age   Always       -       7737
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       12
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       10
193 Load_Cycle_Count        0x0032   194   194   000    Old_age   Always       -       18970
194 Temperature_Celsius     0x0022   119   112   000    Old_age   Always       -       31
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       1

SMART Error Log Version: 1
ATA Error Count: 13896 (device log contains only the most recent five errors)
	CR = Command Register [HEX]
	FR = Features Register [HEX]
	SC = Sector Count Register [HEX]
	SN = Sector Number Register [HEX]
	CL = Cylinder Low Register [HEX]
	CH = Cylinder High Register [HEX]
	DH = Device/Head Register [HEX]
	DC = Device Command Register [HEX]
	ER = Error register [HEX]
	ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 13896 occurred at disk power-on lifetime: 7734 hours (322 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 00      00:52:23.922  READ DMA
  ec 00 00 00 00 00 a0 00      00:52:23.913  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:52:23.913  SET FEATURES [Set transfer mode]

Error 13895 occurred at disk power-on lifetime: 7734 hours (322 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 00      00:52:23.874  READ DMA
  ec 00 00 00 00 00 a0 00      00:52:23.865  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:52:23.865  SET FEATURES [Set transfer mode]

Error 13894 occurred at disk power-on lifetime: 7734 hours (322 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 00      00:52:23.826  READ DMA
  ec 00 00 00 00 00 a0 00      00:52:23.818  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:52:23.818  SET FEATURES [Set transfer mode]

Error 13893 occurred at disk power-on lifetime: 7734 hours (322 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 00      00:52:23.778  READ DMA
  ec 00 00 00 00 00 a0 00      00:52:23.770  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:52:23.770  SET FEATURES [Set transfer mode]

Error 13892 occurred at disk power-on lifetime: 7734 hours (322 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 0a 08 00 e0  Error: UNC 8 sectors at LBA = 0x0000080a = 2058

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 08 08 00 e0 00      00:52:23.730  READ DMA
  ec 00 00 00 00 00 a0 00      00:52:23.722  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:52:23.722  SET FEATURES [Set transfer mode]

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

[1]    22955 exit 64    sudo smartctl -a /dev/sdd

A première vue tes disques ne sont pas très beaux, tu as beaucoup d’erreur …
En principe cela doit te donner : No Errors Logged

Afin d’éviter que ce soucis ne se produise à nouveau, je te conseille de faire l’acquisition d’une multiprise para-surtension voir mieux un onduleur.

Dois-je en déduire que mais disques sont bons à mettre au rebut ?
Y a-t-il une chance de récupérer quelque chose ou pas du tout ?

Je ne suis pas assez expert dans smart pour juger si ces disques sont à mettre à la poubelle ou pas.
Par contre je peux te dire que chez OVH, à la moindre erreur smart les disques sont changés.

Pour récupérer tes données il faudrait je pense déjà pouvoir monter le disque, ce qui n’est pas gagner d’avance.
Tu n’as pas de sauvegarde ?

Effectivement ce n’est vraiment pas gagné. C’est ce que j’ai essayé de faire mais sans succès.

C’était justement des sauvegardes qui se trouvaient sur ces disques. J’ai donc toujours mais données d’origine et un autre backup sur une autre machine à un autre endroit. Donc si les disques sont irrécupérables ce n’est pas très grave. La seule chose qui m’ennuie c’est que ce n’est pas la bonne période pour acheter des disques. Les prix ont flambés.

C’est quand même bizarre (mais heureusement) qu’il n’y ait que ces 2 disques qui aient étaient impactés sr les 5 de la machine.

C’est vrai que j’aurai dû commencer par là. Surtout qu’en début d’année la machine a été changée parce que la carte mère avait grillé suite à une coupure de courant. Je pense que la prise doit avoir une faiblesse. Donc je vais investir dans un premier temps dans une multiprise avec para-surtension et mettre la machine sur une autre prise, et par la suite en fonction du niveau des finances dans un onduleur.

En attendant, si je pouvais récupérer les données ça m’arrangerais. Si quelqu’un a une idée…