Crashes

hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Crashes

Post by hannes »

We run a perfectly ordinary root server whose configuration has been adapted somewhat to our needs. However, every two weeks we have a problem:
As soon as we start sending our newsletter, the whole server crashes and has to be rebooted via the 1und1 tool. The newsletter has about 3200 recipients in total.

The logs give no clue as to the cause of the crash. All you can see is that the server starts sending the newsletter. After that, logging simply stops and the server is no longer reachable.

Is it common for a server to simply crash under heavy load? Can any of you tell me how, for example, large providers prevent such situations?
captaincrunch
Userprojekt
Posts: 7066
Joined: 2002-10-09 14:30
Location: Dorsten

Re: Crashes

Post by captaincrunch »

Is it one of the newer rooties?
The via-rhine driver on those is somewhat buggy at high throughput, but this could be improved with a module parameter ... there was already something on this topic in the "Datentransfer und Backup" forum.
DebianHowTo
echo "[q]sa[ln0=aln256%Pln256/snlbx]sb729901041524823122snlbxq"|dc
scythe42
Posts: 154
Joined: 2002-10-14 18:30
Location: Internet

Re: Crashes

Post by scythe42 »

Which MTA? Which mailing list program? How do you send the newsletter? And so on...

We do need a few details in order to help you.

In any case, this is not normal. Big providers have their MTA properly configured ;-)
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

@Crunch:
The rootie is from 8/2002 ... and therefore not one of the newer ones. I have also heard about the bug in the via-rhine driver ... I just couldn't imagine that it is really the cause.

@scythe:
We had our server re-initialized once and now use SuSE 8.1 and therefore Postfix as the MTA. As the mailing list program we use Aconon. (Aconon is based on CGI scripts ...)
I do think our MTA is not completely misconfigured, since the configuration comes mainly from 1und1.
Anonymous
 

Re: Crashes

Post by Anonymous »

hannes wrote: @scythe:

I do think our MTA is not completely misconfigured, since the configuration comes mainly from 1und1.
Well, if 1&1 counts as a mark of quality ...

debug=2 for the via-rhine and all is well
Or use the one from SuSE 8.2, that one is OK
Thanks to Hubert *g

Karlo
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

I don't know whether it helps, but as a further clue I can report a very high load average ...
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

I am also fairly sure that we can rule out the via-rhine problem:

Excerpt from the dmesg output:

Code:

8139too Fast Ethernet driver 0.9.26
eth0: RealTek RTL8139 Fast Ethernet at 0xd0000000, 00:20:ed:39:bd:5f, IRQ 15
eth0:  Identified 8139 chip type 'RTL-8139C'
Does anyone have another idea?
Anonymous
 

Re: Crashes

Post by Anonymous »

The log says absolutely nothing as long as debug is set to zero.
Why don't you simply try it?
Always just saying "we can rule that out" doesn't exactly motivate people to help.

Karlo
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

But the log does say that the machine does not use a Via-Rhine chipset as its network card, doesn't it? Of course I can set the log level for the Realtek to debug=2.

@Karlo: Is that what you mean?
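
The question of which driver is actually in use can be checked rather than argued about: the driver prints a banner into the boot log. A minimal sketch, assuming a POSIX shell with grep; the helper name `nic_drivers_in_log` is made up for illustration, and it only looks for the two drivers mentioned in this thread.

```shell
# nic_drivers_in_log: hypothetical helper, not part of any distribution.
# Reads a kernel log on stdin and prints which of the two drivers discussed
# in this thread (8139too, via-rhine) are named in it, one per line.
nic_drivers_in_log() {
    grep -E -i -o '8139too|via-rhine' | tr '[:upper:]' '[:lower:]' | sort -u
}

# Usage on the server itself:
#   dmesg | nic_drivers_in_log
```

Fed with the dmesg excerpt posted above, this prints only 8139too, which matches the reading that the box has a Realtek card and no Via-Rhine.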
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

Sorry to bother you again ... Over the last few days I have dug through all the log files, but unfortunately found nothing :-(

I assume that the via-rhine bug described above does not apply to us, mainly because we run a backup of about 1.5 GB every evening and it has never caused a crash.

Hence my desperate plea: does anyone among the Rootforum experts have any other idea?

I think it is somehow related to high server load ....
captaincrunch
Userprojekt
Posts: 7066
Joined: 2002-10-09 14:30
Location: Dorsten

Re: Crashes

Post by captaincrunch »

hannes wrote: I think it is somehow related to high server load ....
If it were "only" high server load, your server would become extremely slow, but it normally would not crash.
I would therefore almost sooner suspect a hardware fault ...
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

I find a hardware fault less likely, since we have also sent the newsletter from a second server ... with the same result.

By the way, during such a crash the server can still be reached via ping ... but no httpd, smtp, pop3 or ssh service is reachable.
outofbound
Posts: 470
Joined: 2002-05-14 13:02
Location: Karlsruhe City
 

Re: Crashes

Post by outofbound »

Then it hasn't "completely" crashed yet ...

What does "dmesg" say?

Could it be that you have built in a "leak" somewhere
and the machine is swapping the hard disk to death?

Can you send the newsletter to, say, 10 people
with the same result, or does that work? How about 2?

Regards,

Out
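
The "10 recipients, then 2" test can be turned into a general habit of sending in small batches with a pause in between. The following is only a sketch under assumptions: `run_in_batches` is a hypothetical helper, the recipient list is assumed to be a plain file with one address per line without spaces, and the command that actually sends mail is whatever the newsletter tool offers (nothing here is Aconon's or Postfix's own interface).

```shell
# run_in_batches SIZE PAUSE COMMAND [ARGS...]
# Hypothetical helper: reads one address per line from stdin and runs
# COMMAND once per batch of SIZE addresses, sleeping PAUSE seconds after
# each full batch so the mail queue and the load can settle.
run_in_batches() {
    size=$1
    pause=$2
    shift 2
    batch=""
    count=0
    while IFS= read -r addr; do
        [ -n "$addr" ] || continue          # skip empty lines
        batch="$batch $addr"
        count=$((count + 1))
        if [ "$count" -ge "$size" ]; then
            # </dev/null keeps COMMAND from eating the remaining addresses
            "$@" $batch </dev/null
            batch=""
            count=0
            sleep "$pause"
        fi
    done
    # send whatever is left over in the last, incomplete batch
    [ "$count" -eq 0 ] || "$@" $batch </dev/null
}

# Dry run that only prints the batches:
#   run_in_batches 10 30 echo < recipients.txt
```

Starting with a dry run and a batch size of 2 or 10 shows quickly whether the crash depends on the number of recipients at all.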
SCD
Posts: 8
Joined: 2002-07-07 02:52
 

Re: Crashes

Post by SCD »

hannes wrote: By the way, during such a crash the server can still be reached via ping ... but no httpd, smtp, pop3 or ssh service is reachable.
That sounds familiar.
My root server has also crashed three times already (exactly the same symptoms). In my case, however, it always happens when the CPU is under heavy load.
dmesg (18 hours after the crash, in case it still helps) wrote:
Linux version 2.4.20 (gcc version 2.95.3 20010315 (SuSE)) #2 SMP Sun Jan 5 17:09:40 CET 2003
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 000000000f7f0000 (usable)
BIOS-e820: 000000000f7f0000 - 000000000f7f3000 (ACPI NVS)
BIOS-e820: 000000000f7f3000 - 000000000f800000 (ACPI data)
BIOS-e820: 000000000f800000 - 0000000010000000 (reserved)
BIOS-e820: 00000000fec00000 - 0000000100000000 (reserved)
0MB HIGHMEM available.
247MB LOWMEM available.
found SMP MP-table at 000f5870
hm, page 000f5000 reserved twice.
hm, page 000f6000 reserved twice.
hm, page 000f1000 reserved twice.
hm, page 000f2000 reserved twice.
On node 0 totalpages: 63472
zone(0): 4096 pages.
zone(1): 59376 pages.
zone(2): 0 pages.
Intel MultiProcessor Specification v1.4
Virtual Wire compatibility mode.
OEM ID: OEM00000 Product ID: PROD00000000 APIC at: 0xFEE00000
Processor #0 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Processors: 1
Kernel command line: auto BOOT_IMAGE=linux ro root=303 BOOT_FILE=/boot/vmlinuz
Initializing CPU#0
Detected 1202.776 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 2398.61 BogoMIPS
Memory: 248088k/253888k available (1316k kernel code, 5416k reserved, 406k data, 260k init, 0k highmem)
Dentry cache hash table entries: 32768 (order: 6, 262144 bytes)
Inode cache hash table entries: 16384 (order: 5, 131072 bytes)
Mount-cache hash table entries: 4096 (order: 3, 32768 bytes)
Buffer-cache hash table entries: 16384 (order: 4, 65536 bytes)
Page-cache hash table entries: 65536 (order: 6, 262144 bytes)
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: After generic, caps: 0383fbff 00000000 00000000 00000000
CPU: Common caps: 0383fbff 00000000 00000000 00000000
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch (rgooch@atnf.csiro.au)
mtrr: detected mtrr type: Intel
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check reporting enabled on CPU#0.
CPU: After generic, caps: 0383fbff 00000000 00000000 00000000
CPU: Common caps: 0383fbff 00000000 00000000 00000000
CPU0: Intel(R) Celeron(TM) CPU 1200MHz stepping 01
per-CPU timeslice cutoff: 731.53 usecs.
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Error: only one processor found.
ENABLING IO-APIC IRQs
Setting 2 in the phys_id_present_map
...changing IO-APIC physical APIC ID to 2 ... ok.
init IO_APIC IRQs
IO-APIC (apicid-pin) 2-0, 2-16, 2-17, 2-18, 2-19, 2-20, 2-21, 2-22, 2-23 not connected.
..TIMER: vector=0x31 pin1=2 pin2=0
number of MP IRQ sources: 18.
number of IO-APIC #2 registers: 24.
testing the IO APIC.......................

IO APIC #2......
.... register #00: 02000000
....... : physical APIC id: 02
.... register #01: 00178011
....... : max redirection entries: 0017
....... : PRQ implemented: 1
....... : IO APIC version: 0011
.... register #02: 00000000
....... : arbitration: 00
.... IRQ redirection table:
NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:
00 000 00 1 0 0 0 0 0 0 00
01 001 01 0 0 0 0 0 1 1 39
02 001 01 0 0 0 0 0 1 1 31
03 001 01 0 0 0 0 0 1 1 41
04 001 01 0 0 0 0 0 1 1 49
05 001 01 0 0 0 0 0 1 1 51
06 001 01 0 0 0 0 0 1 1 59
07 001 01 0 0 0 0 0 1 1 61
08 001 01 0 0 0 0 0 1 1 69
09 001 01 0 0 0 0 0 1 1 71
0a 001 01 1 1 0 1 0 1 1 79
0b 001 01 1 1 0 1 0 1 1 81
0c 001 01 0 0 0 0 0 1 1 89
0d 001 01 0 0 0 0 0 1 1 91
0e 001 01 0 0 0 0 0 1 1 99
0f 001 01 1 1 0 1 0 1 1 A1
10 000 00 1 0 0 0 0 0 0 00
11 000 00 1 0 0 0 0 0 0 00
12 000 00 1 0 0 0 0 0 0 00
13 000 00 1 0 0 0 0 0 0 00
14 000 00 1 0 0 0 0 0 0 00
15 000 00 1 0 0 0 0 0 0 00
16 000 00 1 0 0 0 0 0 0 00
17 000 00 1 0 0 0 0 0 0 00
IRQ to pin mappings:
IRQ0 -> 0:2
IRQ1 -> 0:1
IRQ3 -> 0:3
IRQ4 -> 0:4
IRQ5 -> 0:5
IRQ6 -> 0:6
IRQ7 -> 0:7
IRQ8 -> 0:8
IRQ9 -> 0:9
IRQ10 -> 0:10
IRQ11 -> 0:11
IRQ12 -> 0:12
IRQ13 -> 0:13
IRQ14 -> 0:14
IRQ15 -> 0:15
.................................... done.
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 1202.6887 MHz.
..... host bus clock speed is 100.2240 MHz.
cpu: 0, clocks: 1002240, slice: 501120
CPU0<T0:1002240,T1:501120,D:0,S:501120,C:1002240>
Waiting on wait_init_idle (map = 0x0)
All processors have done init_idle
PCI: PCI BIOS revision 2.10 entry at 0xfb370, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router VIA [1106/0686] at 00:07.0
PCI->APIC IRQ transform: (B0,I9,P0) -> 15
PCI->APIC IRQ transform: (B1,I0,P0) -> 11
PCI: Enabling Via external APIC routing
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
Starting kswapd
VFS: Diskquotas version dquot_6.4.0 initialized
Journalled Block Device driver loaded
pty: 256 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI enabled
Real Time Clock Driver v1.10e
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: IDE controller on PCI bus 00 dev 39
VP_IDE: chipset revision 6
VP_IDE: not 100% native mode: will probe irqs later
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: VIA vt82c686b (rev 40) IDE UDMA100 controller on pci00:07.1
ide0: BM-DMA at 0xe000-0xe007, BIOS settings: hda:DMA, hdb:pio
keyboard: Timeout - AT keyboard not present?(ed)
keyboard: Timeout - AT keyboard not present?(f4)
hda: IC35L040AVVA07-0, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
blk: queue c0342644, I/O limit 4095Mb (mask 0xffffffff)
hda: 80418240 sectors (41174 MB) w/1863KiB Cache, CHS=5005/255/63, UDMA(100)
Partition check:
hda: hda1 hda2 hda3
8139too Fast Ethernet driver 0.9.26
eth0: RealTek RTL8139 Fast Ethernet at 0xd0000000, 00:e0:4c:39:0c:e3, IRQ 15
eth0: Identified 8139 chip type 'RTL-8139C'
SCSI subsystem driver Revision: 1.00
3ware Storage Controller device driver for Linux v1.02.00.031.
3w-xxxx: No cards with valid units found.
kmod: failed to exec /sbin/modprobe -s -k scsi_hostadapter, errno = 2
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 2048 buckets, 16Kbytes
TCP: Hash tables configured (established 16384 bind 16384)
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
IPv6 v0.8 for NET4.0
IPv6 over IPv4 tunneling driver
EXT3-fs: INFO: recovery required on readonly filesystem.
EXT3-fs: write access will be enabled during recovery.
kjournald starting. Commit interval 5 seconds
EXT3-fs: ide0(3,3): orphan cleanup on readonly fs
ext3_orphan_cleanup: deleting unreferenced inode 98313
ext3_orphan_cleanup: deleting unreferenced inode 1540101
ext3_orphan_cleanup: deleting unreferenced inode 3457237
ext3_orphan_cleanup: deleting unreferenced inode 901432
ext3_orphan_cleanup: deleting unreferenced inode 1884223
ext3_orphan_cleanup: deleting unreferenced inode 1884221
ext3_orphan_cleanup: deleting unreferenced inode 1884220
ext3_orphan_cleanup: deleting unreferenced inode 1884219
ext3_orphan_cleanup: deleting unreferenced inode 1884191
ext3_orphan_cleanup: deleting unreferenced inode 1884187
ext3_orphan_cleanup: deleting unreferenced inode 1884188
ext3_orphan_cleanup: deleting unreferenced inode 1884186
ext3_orphan_cleanup: deleting unreferenced inode 1884185
ext3_orphan_cleanup: deleting unreferenced inode 1884184
ext3_orphan_cleanup: deleting unreferenced inode 1884181
ext3_orphan_cleanup: deleting unreferenced inode 1884178
ext3_orphan_cleanup: deleting unreferenced inode 1884177
ext3_orphan_cleanup: deleting unreferenced inode 1884172
ext3_orphan_cleanup: deleting unreferenced inode 3358893
ext3_orphan_cleanup: deleting unreferenced inode 3359165
ext3_orphan_cleanup: deleting unreferenced inode 49282
ext3_orphan_cleanup: deleting unreferenced inode 573443
ext3_orphan_cleanup: deleting unreferenced inode 4620430
ext3_orphan_cleanup: deleting unreferenced inode 49297
ext3_orphan_cleanup: deleting unreferenced inode 4211076
ext3_orphan_cleanup: deleting unreferenced inode 4211075
EXT3-fs: ide0(3,3): 26 orphan inodes deleted
EXT3-fs: recovery complete.
EXT3-fs: mounted filesystem with ordered data mode.
VFS: Mounted root (ext3 filesystem) readonly.
Freeing unused kernel memory: 260k freed
Adding Swap: 787176k swap-space (priority -1)
EXT3 FS 2.4-0.9.19, 19 August 2002 on ide0(3,3), internal journal
kjournald starting. Commit interval 5 seconds
EXT3 FS 2.4-0.9.19, 19 August 2002 on ide0(3,1), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
eth0: Setting 100mbps full-duplex based on auto-negotiated partner ability 41e1.
eth0: no IPv6 routers present
Last edited by SCD on 2003-05-30 19:04, edited 1 time in total.
captaincrunch
Userprojekt
Posts: 7066
Joined: 2002-10-09 14:30
Location: Dorsten

Re: Crashes

Post by captaincrunch »

Sorry, but I still can't imagine that this alone would bring down the whole box. It is probably swapping like mad at that point ... for whatever reason; but only you can judge that, because nobody but you knows what is running on these boxes, and you don't give any precise details about it either ...
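
Whether the box really swaps itself to death can be recorded instead of guessed, by writing load and swap usage to disk every few seconds so that the last entries survive the reboot. A minimal sketch, assuming /proc/meminfo contains `SwapTotal:` and `SwapFree:` lines in kB; the function name and the log path are made up for illustration.

```shell
# swap_used_kb: hypothetical helper.
# Reads /proc/meminfo-style text on stdin and prints used swap in kB,
# computed as SwapTotal minus SwapFree.
swap_used_kb() {
    awk '/^SwapTotal:/ { total = $2 }
         /^SwapFree:/  { free = $2 }
         END           { print total - free }'
}

# Example loop (log path is an assumption), started before the newsletter run:
#   while sleep 10; do
#       echo "$(date) load=$(cut -d' ' -f1-3 /proc/loadavg) swap_used_kb=$(swap_used_kb </proc/meminfo)"
#   done >> /var/log/load-watch.log
```

If the last lines before the hang show swap usage climbing towards the total, the swapping theory gains weight; if not, it can be dropped.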
SCD
Posts: 8
Joined: 2002-07-07 02:52
 

Re: Crashes

Post by SCD »

Yesterday a (local) backup was running at the time of the crash.

Code:

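# Date stamp for today's archive names, e.g. 20030530
export datum="$(date +%Y%m%d)"

# Full backup: the MySQL dump of the same day plus home directories and configuration
tar -c -z -f "/backup/full-$datum.tar.gz" "/backup/mysql-$datum.sql.gz" /root /home /etc/*.conf /etc/httpd/*.conf /var/named /etc/mail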
During the other two crashes, a program that uses 99% of the CPU was running ...
captaincrunch
Userprojekt
Posts: 7066
Joined: 2002-10-09 14:30
Location: Dorsten

Re: Crashes

Post by captaincrunch »

A shot in the dark:

try passing noapic to your kernel as a boot option, and then observe how it behaves.
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

By the way, here is my dmesg:

Code:

Linux version 2.4.20 (root@install-srv) (gcc version 2.95.4 20011002 (Debian prerelease)) #1 SMP Sun Dec 1 21:25:43 CET 2002
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
 BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
 BIOS-e820: 0000000000100000 - 000000000f7f0000 (usable)
 BIOS-e820: 000000000f7f0000 - 000000000f7f3000 (ACPI NVS)
 BIOS-e820: 000000000f7f3000 - 000000000f800000 (ACPI data)
 BIOS-e820: 000000000f800000 - 0000000010000000 (reserved)
 BIOS-e820: 00000000fec00000 - 0000000100000000 (reserved)
0MB HIGHMEM available.
247MB LOWMEM available.
found SMP MP-table at 000f5870
hm, page 000f5000 reserved twice.
hm, page 000f6000 reserved twice.
hm, page 000f1000 reserved twice.
hm, page 000f2000 reserved twice.
On node 0 totalpages: 63472
zone(0): 4096 pages.
zone(1): 59376 pages.
zone(2): 0 pages.
Intel MultiProcessor Specification v1.4
    Virtual Wire compatibility mode.
OEM ID: OEM00000 Product ID: PROD00000000 APIC at: 0xFEE00000
Processor #0 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Processors: 1
Kernel command line: root=/dev/hda3
Initializing CPU#0
Detected 1202.746 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 2398.61 BogoMIPS
Memory: 248248k/253888k available (1196k kernel code, 5256k reserved, 385k data, 260k init, 0k highmem)
Dentry cache hash table entries: 32768 (order: 6, 262144 bytes)
Inode cache hash table entries: 16384 (order: 5, 131072 bytes)
Mount-cache hash table entries: 4096 (order: 3, 32768 bytes)
Buffer-cache hash table entries: 16384 (order: 4, 65536 bytes)
Page-cache hash table entries: 65536 (order: 6, 262144 bytes)
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU:     After generic, caps: 0383fbff 00000000 00000000 00000000
CPU:             Common caps: 0383fbff 00000000 00000000 00000000
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch (rgooch@atnf.csiro.au)
mtrr: detected mtrr type: Intel
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check reporting enabled on CPU#0.
CPU:     After generic, caps: 0383fbff 00000000 00000000 00000000
CPU:             Common caps: 0383fbff 00000000 00000000 00000000
CPU0: Intel(R) Celeron(TM) CPU                1200MHz stepping 01
per-CPU timeslice cutoff: 731.53 usecs.
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Error: only one processor found.
ENABLING IO-APIC IRQs
Setting 2 in the phys_id_present_map
...changing IO-APIC physical APIC ID to 2 ... ok.
init IO_APIC IRQs
 IO-APIC (apicid-pin) 2-0, 2-16, 2-17, 2-18, 2-19, 2-20, 2-21, 2-22, 2-23 not connected.
..TIMER: vector=0x31 pin1=2 pin2=0
number of MP IRQ sources: 18.
number of IO-APIC #2 registers: 24.
testing the IO APIC.......................

IO APIC #2......
.... register #00: 02000000
.......    : physical APIC id: 02
.... register #01: 00178011
.......     : max redirection entries: 0017
.......     : PRQ implemented: 1
.......     : IO APIC version: 0011
.... register #02: 00000000
.......     : arbitration: 00
.... IRQ redirection table:
 NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:   
 00 000 00  1    0    0   0   0    0    0    00
 01 001 01  0    0    0   0   0    1    1    39
 02 001 01  0    0    0   0   0    1    1    31
 03 001 01  0    0    0   0   0    1    1    41
 04 001 01  0    0    0   0   0    1    1    49
 05 001 01  0    0    0   0   0    1    1    51
 06 001 01  0    0    0   0   0    1    1    59
 07 001 01  0    0    0   0   0    1    1    61
 08 001 01  0    0    0   0   0    1    1    69
 09 001 01  0    0    0   0   0    1    1    71
 0a 001 01  1    1    0   1   0    1    1    79
 0b 001 01  1    1    0   1   0    1    1    81
 0c 001 01  0    0    0   0   0    1    1    89
 0d 001 01  0    0    0   0   0    1    1    91
 0e 001 01  0    0    0   0   0    1    1    99
 0f 001 01  1    1    0   1   0    1    1    A1
 10 000 00  1    0    0   0   0    0    0    00
 11 000 00  1    0    0   0   0    0    0    00
 12 000 00  1    0    0   0   0    0    0    00
 13 000 00  1    0    0   0   0    0    0    00
 14 000 00  1    0    0   0   0    0    0    00
 15 000 00  1    0    0   0   0    0    0    00
 16 000 00  1    0    0   0   0    0    0    00
 17 000 00  1    0    0   0   0    0    0    00
IRQ to pin mappings:
IRQ0 -> 0:2
IRQ1 -> 0:1
IRQ3 -> 0:3
IRQ4 -> 0:4
IRQ5 -> 0:5
IRQ6 -> 0:6
IRQ7 -> 0:7
IRQ8 -> 0:8
IRQ9 -> 0:9
IRQ10 -> 0:10
IRQ11 -> 0:11
IRQ12 -> 0:12
IRQ13 -> 0:13
IRQ14 -> 0:14
IRQ15 -> 0:15
.................................... done.
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 1202.6929 MHz.
..... host bus clock speed is 100.2243 MHz.
cpu: 0, clocks: 1002243, slice: 501121
CPU0<T0:1002240,T1:501104,D:15,S:501121,C:1002243>
Waiting on wait_init_idle (map = 0x0)
All processors have done init_idle
PCI: PCI BIOS revision 2.10 entry at 0xfb370, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router VIA [1106/0686] at 00:07.0
PCI->APIC IRQ transform: (B0,I13,P0) -> 15
PCI->APIC IRQ transform: (B1,I0,P0) -> 11
PCI: Enabling Via external APIC routing
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
Starting kswapd
VFS: Diskquotas version dquot_6.4.0 initialized
Journalled Block Device driver loaded
pty: 256 Unix98 ptys configured
keyboard: Timeout - AT keyboard not present?(ed)
keyboard: Timeout - AT keyboard not present?(f4)
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
Real Time Clock Driver v1.10e
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: IDE controller on PCI bus 00 dev 39
VP_IDE: chipset revision 6
VP_IDE: not 100% native mode: will probe irqs later
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: VIA vt82c686b (rev 40) IDE UDMA100 controller on pci00:07.1
    ide0: BM-DMA at 0xe000-0xe007, BIOS settings: hda:DMA, hdb:pio
hda: IC35L040AVVN07-0, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
blk: queue c031e5c4, I/O limit 4095Mb (mask 0xffffffff)
hda: 80418240 sectors (41174 MB) w/1863KiB Cache, CHS=5005/255/63, UDMA(33)
Partition check:
 hda: hda1 hda2 hda3
8139too Fast Ethernet driver 0.9.26
eth0: RealTek RTL8139 Fast Ethernet at 0xd0000000, 00:20:ed:39:bd:5f, IRQ 15
eth0:  Identified 8139 chip type 'RTL-8139C'
SCSI subsystem driver Revision: 1.00
3ware Storage Controller device driver for Linux v1.02.00.031.
3w-xxxx: No cards with valid units found.
kmod: failed to exec /sbin/modprobe -s -k scsi_hostadapter, errno = 2
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 2048 buckets, 16Kbytes
TCP: Hash tables configured (established 16384 bind 16384)
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
EXT3-fs: INFO: recovery required on readonly filesystem.
EXT3-fs: write access will be enabled during recovery.
kjournald starting.  Commit interval 5 seconds
EXT3-fs: ide0(3,3): orphan cleanup on readonly fs
ext3_orphan_cleanup: deleting unreferenced inode 671889
ext3_orphan_cleanup: deleting unreferenced inode 770287
EXT3-fs: ide0(3,3): 2 orphan inodes deleted
EXT3-fs: recovery complete.
EXT3-fs: mounted filesystem with ordered data mode.
VFS: Mounted root (ext3 filesystem) readonly.
Freeing unused kernel memory: 260k freed
Adding Swap: 265064k swap-space (priority 42)
EXT3 FS 2.4-0.9.19, 19 August 2002 on ide0(3,3), internal journal
kjournald starting.  Commit interval 5 seconds
EXT3 FS 2.4-0.9.19, 19 August 2002 on ide0(3,1), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
eth0: Setting 100mbps full-duplex based on auto-negotiated partner ability 41e1.
ip_tables: (C) 2000-2002 Netfilter core team
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self
sending pkt_too_big (len[1500] pmtu[1488]) to self 213.148.128.42 sent an invalid ICMP error to a broadcast. 213.148.128.42 sent an invalid ICMP error to a broadcast. sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self 213.148.128.42 sent an invalid ICMP error to a broadcast. sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self 213.148.128.42 sent an invalid ICMP error to a broadcast. 213.148.128.42 sent an invalid ICMP error to a broadcast. 213.148.128.42 sent an invalid ICMP error to a broadcast. 213.148.128.42 sent an invalid ICMP error to a broadcast. sending pkt_too_big (len[1500] pmtu[1488]) to self sending pkt_too_big (len[1500] pmtu[1488]) to self
Maybe someone can spot something ... I do understand the last lines, but I don't know what conclusion to draw from them.
captaincrunch
Userprojekt
Posts: 7066
Joined: 2002-10-09 14:30
Location: Dorsten

Re: Crashes

Post by captaincrunch »

As I said: try it with noapic
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

Unfortunately I only managed today to add "noapic" to the GRUB configuration.

Although the kernel option was passed successfully according to dmesg, the APIC still seems to be in use. Am I seeing that correctly?
Oh, and what does this mean: hm, page 000f2000 reserved twice.

Here is my new dmesg:

Code:

Linux version 2.4.20 (root@install-srv) (gcc version 2.95.4 20011002 (Debian prerelease)) #1 SMP Sun Dec 1 21:25:43 CET 2002
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
 BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
 BIOS-e820: 0000000000100000 - 000000000f7f0000 (usable)
 BIOS-e820: 000000000f7f0000 - 000000000f7f3000 (ACPI NVS)
 BIOS-e820: 000000000f7f3000 - 000000000f800000 (ACPI data)
 BIOS-e820: 000000000f800000 - 0000000010000000 (reserved)
 BIOS-e820: 00000000fec00000 - 0000000100000000 (reserved)
0MB HIGHMEM available.
247MB LOWMEM available.
found SMP MP-table at 000f5870
hm, page 000f5000 reserved twice.
hm, page 000f6000 reserved twice.
hm, page 000f1000 reserved twice.
hm, page 000f2000 reserved twice.
On node 0 totalpages: 63472
zone(0): 4096 pages.
zone(1): 59376 pages.
zone(2): 0 pages.
Intel MultiProcessor Specification v1.4
    Virtual Wire compatibility mode.
OEM ID: OEM00000 Product ID: PROD00000000 APIC at: 0xFEE00000
Processor #0 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Processors: 1
Kernel command line: root=/dev/hda3 noapic
Initializing CPU#0
Detected 1202.737 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 2398.61 BogoMIPS
Memory: 248248k/253888k available (1196k kernel code, 5256k reserved, 385k data, 260k init, 0k highmem)
Dentry cache hash table entries: 32768 (order: 6, 262144 bytes)
Inode cache hash table entries: 16384 (order: 5, 131072 bytes)
Mount-cache hash table entries: 4096 (order: 3, 32768 bytes)
Buffer-cache hash table entries: 16384 (order: 4, 65536 bytes)
Page-cache hash table entries: 65536 (order: 6, 262144 bytes)
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU:     After generic, caps: 0383fbff 00000000 00000000 00000000
CPU:             Common caps: 0383fbff 00000000 00000000 00000000
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch (rgooch@atnf.csiro.au)
mtrr: detected mtrr type: Intel
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check reporting enabled on CPU#0.
CPU:     After generic, caps: 0383fbff 00000000 00000000 00000000
CPU:             Common caps: 0383fbff 00000000 00000000 00000000
CPU0: Intel(R) Celeron(TM) CPU                1200MHz stepping 01
per-CPU timeslice cutoff: 731.53 usecs.
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Error: only one processor found.
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 1202.7274 MHz.
..... host bus clock speed is 100.2272 MHz.
cpu: 0, clocks: 1002272, slice: 501136
CPU0<T0:1002272,T1:501136,D:0,S:501136,C:1002272>
Waiting on wait_init_idle (map = 0x0)
All processors have done init_idle
PCI: PCI BIOS revision 2.10 entry at 0xfb370, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router VIA [1106/0686] at 00:07.0
PCI: Enabling Via external APIC routing
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
Starting kswapd
VFS: Diskquotas version dquot_6.4.0 initialized
Journalled Block Device driver loaded
pty: 256 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
Real Time Clock Driver v1.10e
keyboard: Timeout - AT keyboard not present?(ed)
keyboard: Timeout - AT keyboard not present?(f4)
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: IDE controller on PCI bus 00 dev 39
VP_IDE: chipset revision 6
VP_IDE: not 100% native mode: will probe irqs later
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: VIA vt82c686b (rev 40) IDE UDMA100 controller on pci00:07.1
    ide0: BM-DMA at 0xe000-0xe007, BIOS settings: hda:DMA, hdb:pio
hda: IC35L040AVVA07-0, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
blk: queue c031e5c4, I/O limit 4095Mb (mask 0xffffffff)
hda: 80418240 sectors (41174 MB) w/1863KiB Cache, CHS=5005/255/63, UDMA(100)
Partition check:
 hda: hda1 hda2 hda3
8139too Fast Ethernet driver 0.9.26
PCI: Found IRQ 15 for device 00:0d.0
eth0: RealTek RTL8139 Fast Ethernet at 0xd0000000, 00:20:ed:2a:af:2c, IRQ 15
eth0:  Identified 8139 chip type 'RTL-8139B'
SCSI subsystem driver Revision: 1.00
3ware Storage Controller device driver for Linux v1.02.00.031.
3w-xxxx: No cards with valid units found.
kmod: failed to exec /sbin/modprobe -s -k scsi_hostadapter, errno = 2
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 2048 buckets, 16Kbytes
TCP: Hash tables configured (established 16384 bind 16384)
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
EXT3-fs: INFO: recovery required on readonly filesystem.
EXT3-fs: write access will be enabled during recovery.
kjournald starting.  Commit interval 5 seconds
EXT3-fs: recovery complete.
EXT3-fs: mounted filesystem with ordered data mode.
VFS: Mounted root (ext3 filesystem) readonly.
Freeing unused kernel memory: 260k freed
Adding Swap: 265064k swap-space (priority 42)
EXT3 FS 2.4-0.9.19, 19 August 2002 on ide0(3,3), internal journal
kjournald starting.  Commit interval 5 seconds
EXT3 FS 2.4-0.9.19, 19 August 2002 on ide0(3,1), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
eth0: Setting 100mbps full-duplex based on auto-negotiated partner ability 41e1.
ip_tables: (C) 2000-2002 Netfilter core team
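
Comparing this log with the earlier one suggests that the option did change something: the "ENABLING IO-APIC IRQs" line and the IRQ redirection table are gone, while "Using local APIC timer interrupts" is still there. As far as I know, noapic only concerns the I/O APIC and not the CPU-local APIC, but treat that as an assumption to check against the kernel documentation. A small sketch for making the comparison; `apic_status` is a hypothetical helper that merely searches for the two messages quoted from the logs in this thread.

```shell
# apic_status: hypothetical helper.
# Reads a kernel log on stdin and reports, for each of the two boot
# messages seen in this thread, whether it is present in the log.
apic_status() {
    log=$(cat)
    for marker in 'ENABLING IO-APIC IRQs' 'Using local APIC timer interrupts'; do
        if printf '%s\n' "$log" | grep -q -F "$marker"; then
            echo "present: $marker"
        else
            echo "absent: $marker"
        fi
    done
}

# Usage on the server itself:
#   dmesg | apic_status
```

Run once without and once with the boot option, the output shows at a glance which of the two messages the option removes.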
hannes
Posts: 38
Joined: 2002-05-23 18:14
 

Re: Crashes

Post by hannes »

I am now almost certain that ACPI is still running anyway ... :-(

Does anyone know a way out of this crash misery?
momo
Posts: 33
Joined: 2003-08-03 16:35
 

Re: Crashes

Post by momo »

I had the same problem for the first time last night, and again at noon today. Apparently the server was overloaded. The cause was a tip on GigaTV for a simple JavaScript game. Since then the traffic has increased tenfold and everyone is hitting the game like crazy. So I can count the hours until the server goes down again. :cry:

What helps in this case?
flolein
Posts: 113
Joined: 2003-12-11 14:47
 

Re: Crashes

Post by flolein »

Take the game offline? :)
momo
Posts: 33
Joined: 2003-08-03 16:35
 

Re: Crashes

Post by momo »

That is the obvious thing to do, but not really a solution. Could it be due to too little RAM?