Recently my root server froze for some reason that I could not trace.
After a reboot (in fact a hard reset) and an fsck run at boot, Munin now shows a change in the SMART data.
Specifically, Munin now reports the parameter smartctl_exit_status as 6 instead of the usual 0. A warning from 1 upward is configured by default, but what does this parameter mean? I read somewhere that everything is fine as long as it is 0, and that you should be careful otherwise.
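For reference, the smartctl man page describes its exit status as a bit mask, so a value like 6 can be split into individual bits. The following is only a rough sketch: it assumes Munin passes through the raw smartctl exit code, and the bit descriptions are paraphrased from the man page of current smartmontools releases (version 5.32 may differ in detail). The names `SMARTCTL_EXIT_BITS` and `decode_smartctl_exit_status` are made up for this example.

```python
# Bit meanings paraphrased from the "RETURN VALUES" section of the smartctl
# man page (assumption: the old 5.32 release uses the same layout).
SMARTCTL_EXIT_BITS = {
    0: "command line did not parse",
    1: "device open failed, no IDENTIFY DEVICE data, or device in low-power mode",
    2: "a SMART/ATA command failed, or checksum error in a SMART data structure",
    3: "SMART status check returned DISK FAILING",
    4: "prefail attributes at or below threshold",
    5: "attributes were at or below threshold at some time in the past",
    6: "device error log contains records of errors",
    7: "device self-test log contains records of errors",
}


def decode_smartctl_exit_status(status):
    """Return (bit, description) pairs for every bit set in the exit status."""
    return [
        (bit, text)
        for bit, text in SMARTCTL_EXIT_BITS.items()
        if status & (1 << bit)
    ]


if __name__ == "__main__":
    # 6 = 2 + 4, i.e. bits 1 and 2 are set.
    for bit, text in decode_smartctl_exit_status(6):
        print(f"bit {bit}: {text}")
```

Under these assumptions, 6 would mean bits 1 and 2 are set, not a single error number 6.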
I also ran a long test with smartctl, and it shows a few errors that I can't really make sense of. Unfortunately I don't know much about SMART.
Here is some SMART output:
# smartctl -l error /dev/hda
smartctl version 5.32 Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
Warning: ATA error count 14 inconsistent with error log pointer 5
ATA Error Count: 14 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 14 occurred at disk power-on lifetime: 27307 hours (1137 days + 19 hours)
When the command that caused the error occurred, the device was in an unknown state.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
01 51 08 48 8a 55 e0 Error: AMNF 8 sectors at LBA = 0x00558a48 = 5605960
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 48 8a 55 e0 08 3d+21:17:24.656 READ DMA
ca 00 08 5f 34 30 ee 08 3d+21:17:23.200 WRITE DMA
ca 00 10 4f 34 30 ee 08 3d+21:17:23.200 WRITE DMA
ca 00 08 47 34 30 ee 08 3d+21:17:21.216 WRITE DMA
ca 00 10 37 34 30 ee 08 3d+21:17:21.216 WRITE DMA
Error 13 occurred at disk power-on lifetime: 27307 hours (1137 days + 19 hours)
When the command that caused the error occurred, the device was in an unknown state.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 0f 98 30 57 e0 Error: UNC 15 sectors at LBA = 0x00573098 = 5714072
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 10 98 30 57 e0 08 3d+21:17:17.568 READ DMA
ca 00 08 2f 34 30 ee 08 3d+21:17:15.536 WRITE DMA
ca 00 10 1f 34 30 ee 08 3d+21:17:15.536 WRITE DMA
ca 00 08 17 34 30 ee 08 3d+21:17:11.712 WRITE DMA
ca 00 10 07 34 30 ee 08 3d+21:17:11.712 WRITE DMA
Error 12 occurred at disk power-on lifetime: 27306 hours (1137 days + 18 hours)
When the command that caused the error occurred, the device was in an unknown state.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 08 18 8a 4a e0 Error: UNC 8 sectors at LBA = 0x004a8a18 = 4885016
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 18 8a 4a e0 08 3d+20:31:07.120 READ DMA
c8 00 08 b8 64 5a e0 08 3d+20:31:07.120 READ DMA
ca 00 00 d0 eb 08 e0 08 3d+20:31:07.040 WRITE DMA
ca 00 00 d0 ea 08 e0 08 3d+20:31:07.040 WRITE DMA
ca 00 00 d0 e9 08 e0 08 3d+20:31:07.040 WRITE DMA
Error 11 occurred at disk power-on lifetime: 27306 hours (1137 days + 18 hours)
When the command that caused the error occurred, the device was in an unknown state.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
01 51 29 f0 81 55 e0 Error: AMNF 41 sectors at LBA = 0x005581f0 = 5603824
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 40 f0 81 55 e0 08 3d+20:16:47.184 READ DMA
ca 00 08 50 c3 04 e0 08 3d+20:16:47.152 WRITE DMA
ca 00 08 a8 74 06 e0 08 3d+20:16:47.152 WRITE DMA
ca 00 48 38 be 05 e0 08 3d+20:16:47.152 WRITE DMA
ca 00 00 38 bd 05 e0 08 3d+20:16:47.152 WRITE DMA
Error 10 occurred at disk power-on lifetime: 27306 hours (1137 days + 18 hours)
When the command that caused the error occurred, the device was in an unknown state.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
01 51 06 08 fc 46 e0 Error: AMNF 6 sectors at LBA = 0x0046fc08 = 4652040
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 08 fc 46 e0 08 3d+20:15:14.256 READ DMA
c8 00 18 e8 90 ef e4 08 3d+20:15:14.144 READ DMA
ca 00 08 ef 73 30 ee 08 3d+20:15:13.744 WRITE DMA
ca 00 10 df 73 30 ee 08 3d+20:15:13.744 WRITE DMA
ca 00 08 d7 73 30 ee 08 3d+20:15:09.904 WRITE DMA
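To make the register dump above a bit easier to read, here is a small sketch of how the "Error: ... at LBA = ..." text can be derived from the raw register bytes. It assumes 28-bit LBA addressing, where the low four bits of DH hold LBA bits 24 to 27, and it only knows the two error-register bits that appear in this log (0x01 = AMNF, address mark not found; 0x40 = UNC, uncorrectable data). The function names are invented for the example.

```python
def lba28_from_registers(sn, cl, ch, dh):
    """Combine SN, CL, CH and the low nibble of DH into a 28-bit LBA."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


# Only the two error-register bits seen in the log above; the ATA error
# register defines more bits than these.
ERROR_BITS = {0x01: "AMNF", 0x40: "UNC"}


def error_flags(er):
    """Return the names of the known bits set in the error register."""
    return [name for mask, name in ERROR_BITS.items() if er & mask]


if __name__ == "__main__":
    # Register rows copied from Error 14 and Error 13: ER ST SC SN CL CH DH
    rows = ["01 51 08 48 8a 55 e0", "40 51 0f 98 30 57 e0"]
    for row in rows:
        er, st, sc, sn, cl, ch, dh = (int(x, 16) for x in row.split())
        lba = lba28_from_registers(sn, cl, ch, dh)
        print(error_flags(er), f"{sc} sectors at LBA = 0x{lba:08x} = {lba}")
```

For the two rows used here this reproduces the values smartctl printed (LBA 5605960 for Error 14 and 5714072 for Error 13). All five logged errors are failed READ DMA commands at nearby LBAs.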