Server sometimes (1 or 2 hrs) down :/

Discussion in 'Server Operation' started by edge, Apr 22, 2006.

  1. edge

    edge HowtoForge Supporter

    Hi all,

    For some reason (I suspect an eth0 network problem) my server sometimes becomes inaccessible for a few hours! (It sometimes recovers on its own.)

    At first I thought it was some program run by a cron job, but now (with the help of Munin) I think I have found the problem.

    While the server is inaccessible, Munin logs eth0 errors (see image):

    [Image: Munin eth0 graph showing errors during an outage]

    This is how eth0 normally looks:

    [Image: Munin eth0 graph during normal operation]

    The server is running Fedora Core 4 with ISPConfig, and the firewall that comes with ISPConfig.

    Does anyone here know what could cause this eth0 error, and how I can fix it?
     
    Last edited: Apr 22, 2006
  2. falko

    falko Super Moderator

    Any errors in the logs?
     
  3. edge

    edge HowtoForge Supporter

    Hi falko,

    I've had a look at the logs, but did not see anything special! (Lots of info, so I might have missed it.)

    Also, I'm not sure which log to look at! Is there a log that records all the eth0 data?

    I really need to get this fixed a.s.a.p., as sometimes the server works again after some time, and sometimes it does not.
    As I'm off to the US in 1 week (for 3 weeks), I will not be able to reset the server by hand :/
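
    On the question of where eth0 data is recorded: on Linux the kernel keeps per-interface counters in /proc/net/dev, including receive and transmit error counts. The sketch below pulls those two counters out for one interface. The function name iface_errors and the optional file argument are illustrative assumptions, not part of any existing tool.

```shell
#!/bin/bash
# Print the receive and transmit error counters for one interface.
# $1 = interface name, $2 = file in /proc/net/dev format (default: /proc/net/dev).
iface_errors() {
    local iface="$1" file="${2:-/proc/net/dev}"
    # Each interface line looks like "  eth0: <8 rx fields> <8 tx fields>".
    # Replacing the colon with a space makes the fields split reliably even
    # when the first counter is glued to the name ("eth0:12345").
    awk -v want="$iface" '
        { sub(/:/, " ") }
        $1 == want { print "rx_errs=" $4, "tx_errs=" $12 }
    ' "$file"
}
```

    Calling `iface_errors eth0` before and after an outage would show whether the error counters moved while the server was unreachable.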
     
  4. falko

    falko Super Moderator

    Does your kernel use the correct driver for your network card? What kind of network card is it (model)?
    Do you see errors related to eth0 when you run
    Code:
    dmesg
    ?
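
    Since dmesg prints the whole kernel log, it can help to filter it down to lines that mention the interface or its driver. The sketch below is one way to do that. The function name nic_messages is hypothetical, and the driver name is a parameter because it depends on the network card (the default e1000 is only an assumption for Intel gigabit cards).

```shell
#!/bin/bash
# Keep only kernel log lines (read from stdin) that mention the interface
# or the driver as a whole word.
# $1 = interface name (default eth0), $2 = driver name (default e1000, assumed).
nic_messages() {
    local iface="${1:-eth0}" driver="${2:-e1000}"
    # -w matches whole words only, so hex addresses that happen to contain
    # the driver name as a substring are not reported.
    grep -i -w -E "${iface}|${driver}"
}
```

    Typical use would be `dmesg | nic_messages eth0 e1000`.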
     
  5. edge

    edge HowtoForge Supporter

    The server is a Dell PowerEdge SC1425.

    the NIC is/are
    (2 x Intel Corporation 82541GI/PI Gigabit Ethernet Controller (rev 05))
    note: I'm only using one NIC

    re: dmesg

    Lots of info from dmesg :/

    Part 1 of dmesg

    Code:
    Linux version 2.6.15-1.1831_FC4smp (bhcompile@tweety.build.redhat.com) (gcc version 4.0.2 20051125 (Red Hat 4.0.2-8)) #1 SMP Tue Feb 7 13:48:31 EST 2006
    BIOS-provided physical RAM map:
     BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
     BIOS-e820: 0000000000100000 - 00000000dffc0000 (usable)
     BIOS-e820: 00000000dffc0000 - 00000000dffcfc00 (ACPI data)
     BIOS-e820: 00000000dffcfc00 - 00000000dffff000 (reserved)
     BIOS-e820: 00000000e0000000 - 00000000f0000000 (reserved)
     BIOS-e820: 00000000fec00000 - 00000000fec90000 (reserved)
     BIOS-e820: 00000000fed00000 - 00000000fed00400 (reserved)
     BIOS-e820: 00000000fee00000 - 00000000fee10000 (reserved)
     BIOS-e820: 00000000ffb00000 - 0000000100000000 (reserved)
     BIOS-e820: 0000000100000000 - 0000000120000000 (usable)
    3712MB HIGHMEM available.
    896MB LOWMEM available.
    found SMP MP-table at 000fe710
    NX (Execute Disable) protection: active
    On node 0 totalpages: 1179648
      DMA zone: 4096 pages, LIFO batch:0
      DMA32 zone: 0 pages, LIFO batch:0
      Normal zone: 225280 pages, LIFO batch:31
      HighMem zone: 950272 pages, LIFO batch:31
    DMI 2.3 present.
    Using APIC driver default
    ACPI: RSDP (v000 DELL                                  ) @ 0x000fd650
    ACPI: RSDT (v001 DELL   PESC1425 0x00000001 MSFT 0x0100000a) @ 0x000fd664
    ACPI: FADT (v001 DELL   PESC1425 0x00000001 MSFT 0x0100000a) @ 0x000fd6b0
    ACPI: MADT (v001 DELL   PESC1425 0x00000001 MSFT 0x0100000a) @ 0x000fd724
    ACPI: SPCR (v001 DELL   PESC1425 0x00000001 MSFT 0x0100000a) @ 0x000fd7c0
    ACPI: HPET (v001 DELL   PESC1425 0x00000001 MSFT 0x0100000a) @ 0x000fd810
    ACPI: MCFG (v001 DELL   PESC1425 0x00000001 MSFT 0x0100000a) @ 0x000fd848
    ACPI: DSDT (v001 DELL   PESC1425 0x00000001 MSFT 0x0100000e) @ 0x00000000
    ACPI: PM-Timer IO Port: 0x808
    ACPI: Local APIC enabled (0).
    ACPI: Local APIC address 0xfee00000
    ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
    Processor #0 15:4 APIC version 20
    ACPI: LAPIC (acpi_id[0x02] lapic_id[0x06] enabled)
    Processor #6 15:4 APIC version 20
    ACPI: LAPIC (acpi_id[0x03] lapic_id[0x01] enabled)
    Processor #1 15:4 APIC version 20
    ACPI: LAPIC (acpi_id[0x04] lapic_id[0x07] enabled)
    Processor #7 15:4 APIC version 20
    ACPI: LAPIC_NMI (acpi_id[0x01] high edge lint[0x1])
    ACPI: LAPIC_NMI (acpi_id[0x02] high edge lint[0x1])
    ACPI: LAPIC_NMI (acpi_id[0x03] high edge lint[0x1])
    ACPI: LAPIC_NMI (acpi_id[0x04] high edge lint[0x1])
    ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0])
    IOAPIC[0]: apic_id 8, version 32, address 0xfec00000, GSI 0-23
    ACPI: IOAPIC (id[0x09] address[0xfec80000] gsi_base[32])
    IOAPIC[1]: apic_id 9, version 32, address 0xfec80000, GSI 32-55
    ACPI: IOAPIC (id[0x0a] address[0xfec80800] gsi_base[64])
    IOAPIC[2]: apic_id 10, version 32, address 0xfec80800, GSI 64-87
    ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
    ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
    ACPI: IRQ0 used by override.
    ACPI: IRQ2 used by override.
    ACPI: IRQ9 used by override.
    Enabling APIC mode:  Flat.  Using 3 I/O APICs
    ACPI: HPET id: 0xffffffff base: 0xfed00000
    LAPIC enabled (0), calling get_smp_config
    Using ACPI (MADT) for SMP configuration information
    Allocating PCI resources starting at f1000000 (gap: f0000000:0ec00000)
    Built 1 zonelists
    Kernel command line: ro root=LABEL=/
    mapped APIC to ffffd000 (fee00000)
    mapped IOAPIC to ffffc000 (fec00000)
    mapped IOAPIC to ffffb000 (fec80000)
    mapped IOAPIC to ffffa000 (fec80800)
    Initializing CPU#0
    CPU 0 irqstacks, hard=c045f000 soft=c043f000
    PID hash table entries: 4096 (order: 12, 65536 bytes)
    Console: colour VGA+ 80x25
    Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
    Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)
    Memory: 4134948k/4718592k available (2233k kernel code, 57816k reserved, 837k data, 228k init, 3276544k highmem)
    Checking if this processor honours the WP bit even in supervisor mode... Ok.
    hpet0: at MMIO 0xfed00000 (virtual 0xf8800000), IRQs 2, 8, 0
    hpet0: 3 64-bit timers, 14318180 Hz
    Using HPET for base-timer
    Using HPET for gettimeofday
    Detected 2800.182 MHz processor.
    Using hpet for high-res timesource
    Calibrating delay using timer specific routine.. 5604.64 BogoMIPS (lpj=11209282)
    Security Framework v1.0.0 initialized
    SELinux:  Initializing.
    SELinux:  Starting in permissive mode
    selinux_register_security:  Registering secondary module capability
    Capability LSM initialized as secondary
    Mount-cache hash table entries: 512
    CPU: After generic identify, caps: bfebfbff 20100000 00000000 00000000 0000641d 00000000 00000000
    CPU: After vendor identify, caps: bfebfbff 20100000 00000000 00000000 0000641d 00000000 00000000
    monitor/mwait feature present.
    using mwait in idle threads.
    CPU: Trace cache: 12K uops, L1 D cache: 16K
    CPU: L2 cache: 2048K
    CPU: Physical Processor ID: 0
    CPU: After all inits, caps: bfebfbff 20100000 00000000 00000080 0000641d 00000000 00000000
    Intel machine check architecture supported.
    Intel machine check reporting enabled on CPU#0.
    CPU0: Intel P4/Xeon Extended MCE MSRs (24) available
    CPU0: Thermal monitoring enabled
    mtrr: v2.0 (20020519)
    Enabling fast FPU save and restore... done.
    Enabling unmasked SIMD FPU exception support... done.
    Checking 'hlt' instruction... OK.
    CPU0: Intel(R) Xeon(TM) CPU 2.80GHz stepping 03
    Booting processor 1/1 eip 3000
    CPU 1 irqstacks, hard=c0460000 soft=c0440000
    Initializing CPU#1
    Calibrating delay using timer specific routine.. 5600.53 BogoMIPS (lpj=11201077)
    CPU: After generic identify, caps: bfebfbff 20100000 00000000 00000000 0000641d 00000000 00000000
    CPU: After vendor identify, caps: bfebfbff 20100000 00000000 00000000 0000641d 00000000 00000000
    monitor/mwait feature present.
    CPU: Trace cache: 12K uops, L1 D cache: 16K
    CPU: L2 cache: 2048K
    CPU: Physical Processor ID: 0
    CPU: After all inits, caps: bfebfbff 20100000 00000000 00000080 0000641d 00000000 00000000
    Intel machine check architecture supported.
    Intel machine check reporting enabled on CPU#1.
    CPU1: Intel P4/Xeon Extended MCE MSRs (24) available
    CPU1: Thermal monitoring enabled
    CPU1: Intel(R) Xeon(TM) CPU 2.80GHz stepping 03
    Booting processor 2/6 eip 3000
    CPU 2 irqstacks, hard=c0461000 soft=c0441000
    Initializing CPU#2
    Calibrating delay using timer specific routine.. 5600.40 BogoMIPS (lpj=11200804)
    CPU: After generic identify, caps: bfebfbff 20100000 00000000 00000000 0000641d 00000000 00000000
    CPU: After vendor identify, caps: bfebfbff 20100000 00000000 00000000 0000641d 00000000 00000000
    monitor/mwait feature present.
    CPU: Trace cache: 12K uops, L1 D cache: 16K
    CPU: L2 cache: 2048K
    CPU: Physical Processor ID: 3
    CPU: After all inits, caps: bfebfbff 20100000 00000000 00000080 0000641d 00000000 00000000
    Intel machine check architecture supported.
    Intel machine check reporting enabled on CPU#2.
    CPU2: Intel P4/Xeon Extended MCE MSRs (24) available
    CPU2: Thermal monitoring enabled
    CPU2: Intel(R) Xeon(TM) CPU 2.80GHz stepping 03
    Booting processor 3/7 eip 3000
    CPU 3 irqstacks, hard=c0462000 soft=c0442000
    Initializing CPU#3
    Calibrating delay using timer specific routine.. 5600.42 BogoMIPS (lpj=11200857)
    CPU: After generic identify, caps: bfebfbff 20100000 00000000 00000000 0000641d 00000000 00000000
    CPU: After vendor identify, caps: bfebfbff 20100000 00000000 00000000 0000641d 00000000 00000000
    monitor/mwait feature present.
    CPU: Trace cache: 12K uops, L1 D cache: 16K
    CPU: L2 cache: 2048K
    CPU: Physical Processor ID: 3
    CPU: After all inits, caps: bfebfbff 20100000 00000000 00000080 0000641d 00000000 00000000
    Intel machine check architecture supported.
    Intel machine check reporting enabled on CPU#3.
    CPU3: Intel P4/Xeon Extended MCE MSRs (24) available
    CPU3: Thermal monitoring enabled
    CPU3: Intel(R) Xeon(TM) CPU 2.80GHz stepping 03
    Total of 4 processors activated (22406.01 BogoMIPS).
    ENABLING IO-APIC IRQs
    ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
    checking TSC synchronization across 4 CPUs: passed.
    Brought up 4 CPUs
    checking if image is initramfs... it is
    Freeing initrd memory: 1150k freed
    NET: Registered protocol family 16
    ACPI: bus type pci registered
    PCI: PCI BIOS revision 2.10 entry at 0xfc3de, last bus=4
    PCI: Using MMCONFIG
    ACPI: Subsystem revision 20050902
    ACPI: Interpreter enabled
    ACPI: Using IOAPIC for interrupt routing
    ACPI: PCI Root Bridge [PCI0] (0000:00)
    PCI: Probing PCI hardware (bus 00)
    PCI quirk: region 0800-087f claimed by ICH4 ACPI/GPIO/TCO
    PCI quirk: region 0880-08bf claimed by ICH4 GPIO
    PCI: Ignoring BAR0-3 of IDE controller 0000:00:1f.1
    PCI: PXH quirk detected, disabling MSI for SHPC device
    PCI: PXH quirk detected, disabling MSI for SHPC device
    Boot video device is 0000:04:0d.0
    PCI: Transparent bridge - 0000:00:1e.0
    ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
    ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.PALO._PRT]
    ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.PALO.PXHB._PRT]
    ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.PALO.PXHA._PRT]
    ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.PICH._PRT]
    ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 6 7 10 *11 12)
    ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 5 6 7 10 *11 12)
    ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 *5 6 7 10 11 12)
    ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 6 7 *10 11 12)
    ACPI: PCI Interrupt Link [LNKE] (IRQs *3 4 5 6 7 10 11 12)
    ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 6 7 10 11 12) *0, disabled.
    ACPI: PCI Interrupt Link [LNKG] (IRQs 3 4 5 6 7 10 11 12) *0, disabled.
    ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 *6 7 10 11 12)
    
     
    Last edited: Apr 22, 2006
  6. edge

    edge HowtoForge Supporter

    Part 2 of dmesg

    Code:
    Linux Plug and Play Support v0.97 (c) Adam Belay
    pnp: PnP ACPI init
    pnp: PnP ACPI: found 8 devices
    usbcore: registered new driver usbfs
    usbcore: registered new driver hub
    PCI: Using ACPI for IRQ routing
    PCI: If a device doesn't work, try "pci=routeirq".  If it helps, post a report
    pnp: 00:05: ioport range 0x800-0x87f could not be reserved
    pnp: 00:05: ioport range 0x880-0x8bf has been reserved
    pnp: 00:05: ioport range 0x8c0-0x8df has been reserved
    pnp: 00:05: ioport range 0x8e0-0x8e3 has been reserved
    pnp: 00:05: ioport range 0xc00-0xc0f has been reserved
    pnp: 00:05: ioport range 0xc10-0xc1f has been reserved
    pnp: 00:05: ioport range 0xca0-0xcaf has been reserved
    pnp: 00:05: ioport range 0xc20-0xc3f has been reserved
    PCI: Bridge: 0000:01:00.0
      IO window: e000-efff
      MEM window: fe600000-fe7fffff
      PREFETCH window: disabled.
    PCI: Bridge: 0000:01:00.2
      IO window: d000-dfff
      MEM window: fd000000-fdffffff
      PREFETCH window: disabled.
    PCI: Bridge: 0000:00:02.0
      IO window: d000-efff
      MEM window: fd000000-fe7fffff
      PREFETCH window: disabled.
    PCI: Bridge: 0000:00:1e.0
      IO window: c000-cfff
      MEM window: fe900000-feafffff
      PREFETCH window: f0000000-f7ffffff
    ACPI: PCI Interrupt 0000:00:02.0[A] -> GSI 16 (level, low) -> IRQ 169
    PCI: Setting latency timer of device 0000:00:02.0 to 64
    PCI: Setting latency timer of device 0000:01:00.0 to 64
    PCI: Setting latency timer of device 0000:01:00.2 to 64
    PCI: Setting latency timer of device 0000:00:1e.0 to 64
    apm: BIOS not found.
    audit: initializing netlink socket (disabled)
    audit(1145721008.892:1): initialized
    highmem bounce pool size: 64 pages
    Total HugeTLB memory allocated, 0
    VFS: Disk quotas dquot_6.5.1
    Dquot-cache hash table entries: 1024 (order 0, 4096 bytes)
    SELinux:  Registering netfilter hooks
    Initializing Cryptographic API
    ksign: Installing public key data
    Loading keyring
    - Added public key BBAAC218B8690C98
    - User ID: Red Hat, Inc. (Kernel Module GPG key)
    io scheduler noop registered
    io scheduler anticipatory registered
    io scheduler deadline registered
    io scheduler cfq registered
    Intel E7520/7320/7525 detected.<6>ACPI: PCI Interrupt 0000:00:02.0[A] -> GSI 16 (level, low) -> IRQ 169
    PCI: Setting latency timer of device 0000:00:02.0 to 64
    Allocate Port Service[pcie00]
    Allocate Port Service[pcie01]
    pci_hotplug: PCI Hot Plug PCI Core version: 0.5
    isapnp: Scanning for PnP cards...
    isapnp: No Plug & Play device found
    Real Time Clock Driver v1.12
    hpet_resources: 0xfed00000 is busy
    Linux agpgart interface v0.101 (c) Dave Jones
    PNP: No PS/2 controller found. Probing ports directly.
    serio: i8042 AUX port at 0x60,0x64 irq 12
    serio: i8042 KBD port at 0x60,0x64 irq 1
    Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled
    serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
    00:04: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
    RAMDISK driver initialized: 16 RAM disks of 16384K size 1024 blocksize
    Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
    ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
    ICH5: IDE controller at PCI slot 0000:00:1f.1
    PCI: Enabling device 0000:00:1f.1 (0005 -> 0007)
    ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 18 (level, low) -> IRQ 177
    ICH5: chipset revision 2
    ICH5: not 100% native mode: will probe irqs later
        ide0: BM-DMA at 0xfc00-0xfc07, BIOS settings: hda:DMA, hdb:pio
    Probing IDE interface ide0...
    hda: TEAC CD-ROM CD-224E-N, ATAPI CD/DVD-ROM drive
    ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
    Probing IDE interface ide1...
    hda: ATAPI 24X CD-ROM drive, 256kB Cache, UDMA(33)
    Uniform CD-ROM driver Revision: 3.20
    ide-floppy driver 0.99.newide
    usbcore: registered new driver libusual
    usbcore: registered new driver hiddev
    usbcore: registered new driver usbhid
    drivers/usb/input/hid-core.c: v2.6:USB HID core driver
    mice: PS/2 mouse device common for all mice
    md: md driver 0.90.3 MAX_MD_DEVS=256, MD_SB_DISKS=27
    md: bitmap version 4.39
    NET: Registered protocol family 2
    IP route cache hash table entries: 262144 (order: 8, 1048576 bytes)
    TCP established hash table entries: 131072 (order: 9, 2621440 bytes)
    TCP bind hash table entries: 65536 (order: 8, 1310720 bytes)
    TCP: Hash tables configured (established 131072 bind 65536)
    TCP reno registered
    TCP bic registered
    Initializing IPsec netlink socket
    NET: Registered protocol family 1
    NET: Registered protocol family 17
    Using IPI No-Shortcut mode
    ACPI wakeup devices: 
    PCI0 PALO  PXH PXHB PXHA PICH 
    ACPI: (supports S0 S4 S5)
    Freeing unused kernel memory: 228k freed
    Write protecting the kernel read-only data: 343k
    SCSI subsystem initialized
    3ware Storage Controller device driver for Linux v1.26.02.001.
    ACPI: PCI Interrupt 0000:03:07.0[A] -> GSI 66 (level, low) -> IRQ 185
    scsi0 : 3ware Storage Controller
    3w-xxxx: scsi0: Found a 3ware Storage Controller at 0xdcf0, IRQ: 185.
      Vendor: 3ware     Model: Logical Disk 0    Rev: 1.2 
      Type:   Direct-Access                      ANSI SCSI revision: 00
    SCSI device sda: 234439600 512-byte hdwr sectors (120033 MB)
    SCSI device sda: drive cache: write back
    SCSI device sda: 234439600 512-byte hdwr sectors (120033 MB)
    SCSI device sda: drive cache: write back
     sda: sda1 sda2 sda3
    sd 0:0:0:0: Attached scsi disk sda
    kjournald starting.  Commit interval 5 seconds
    EXT3-fs: mounted filesystem with ordered data mode.
    SELinux:  Disabled at runtime.
    SELinux:  Unregistering netfilter hooks
    floppy0: no floppy controllers found
    Intel(R) PRO/1000 Network Driver - version 6.1.16-k2-NAPI
    Copyright (c) 1999-2005 Intel Corporation.
    ACPI: PCI Interrupt 0000:02:04.0[A] -> GSI 32 (level, low) -> IRQ 193
    e1000: eth0: e1000_probe: Intel(R) PRO/1000 Network Connection
    ACPI: PCI Interrupt 0000:04:03.0[A] -> GSI 20 (level, low) -> IRQ 201
    e1000: eth1: e1000_probe: Intel(R) PRO/1000 Network Connection
    hw_random hardware driver 1.0.0 loaded
    ACPI: PCI Interrupt 0000:00:1d.7[D] -> GSI 23 (level, low) -> IRQ 209
    PCI: Setting latency timer of device 0000:00:1d.7 to 64
    ehci_hcd 0000:00:1d.7: EHCI Host Controller
    ehci_hcd 0000:00:1d.7: debug port 1
    PCI: cache line size of 128 is not supported by device 0000:00:1d.7
    ehci_hcd 0000:00:1d.7: new USB bus registered, assigned bus number 1
    ehci_hcd 0000:00:1d.7: irq 209, io mem 0xfeb00000
    ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
    hub 1-0:1.0: USB hub found
    hub 1-0:1.0: 4 ports detected
    USB Universal Host Controller Interface driver v2.3
    ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 16 (level, low) -> IRQ 169
    PCI: Setting latency timer of device 0000:00:1d.0 to 64
    uhci_hcd 0000:00:1d.0: UHCI Host Controller
    uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 2
    uhci_hcd 0000:00:1d.0: irq 169, io base 0x0000bce0
    hub 2-0:1.0: USB hub found
    hub 2-0:1.0: 2 ports detected
    ACPI: PCI Interrupt 0000:00:1d.1[B] -> GSI 19 (level, low) -> IRQ 217
    PCI: Setting latency timer of device 0000:00:1d.1 to 64
    uhci_hcd 0000:00:1d.1: UHCI Host Controller
    uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 3
    uhci_hcd 0000:00:1d.1: irq 217, io base 0x0000bcc0
    hub 3-0:1.0: USB hub found
    hub 3-0:1.0: 2 ports detected
    ACPI: Power Button (FF) [PWRF]
    ibm_acpi: ec object not found
    ACPI: Video Device [EVGA] (multi-head: no  rom: yes  post: no)
    md: Autodetecting RAID arrays.
    md: autorun ...
    md: ... autorun DONE.
    device-mapper: 4.4.0-ioctl (2005-01-12) initialised: dm-devel@redhat.com
    cdrom: open failed.
    cdrom: open failed.
    EXT3 FS on sda2, internal journal
    kjournald starting.  Commit interval 5 seconds
    EXT3 FS on sda1, internal journal
    EXT3-fs: mounted filesystem with ordered data mode.
    Adding 787176k swap on /dev/sda3.  Priority:-1 extents:1 across:787176k
    e1000: eth0: e1000_watchdog_task: NIC Link is Up 100 Mbps Full Duplex
    NET: Registered protocol family 10
    lo: Disabled Privacy Extensions
    IPv6 over IPv4 tunneling driver
    i2c /dev entries driver
    lp: driver loaded but no devices found
    eth0: no IPv6 routers present
    ip_tables: (C) 2000-2002 Netfilter core team
    Netfilter messages via NETLINK v0.30.
    ip_conntrack version 2.4 (8192 buckets, 65536 max) - 232 bytes per conntrack
    
    and this is the output for /sbin/lspci

    Code:
    00:00.0 Host bridge: Intel Corporation E7520 Memory Controller Hub (rev 09)
    00:02.0 PCI bridge: Intel Corporation E7525/E7520/E7320 PCI Express Port A (rev 09)
    00:1d.0 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #1 (rev 02)
    00:1d.1 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #2 (rev 02)
    00:1d.7 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB2 EHCI Controller (rev 02)
    00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev c2)
    00:1f.0 ISA bridge: Intel Corporation 82801EB/ER (ICH5/ICH5R) LPC Interface Bridge (rev 02)
    00:1f.1 IDE interface: Intel Corporation 82801EB/ER (ICH5/ICH5R) IDE Controller (rev 02)
    01:00.0 PCI bridge: Intel Corporation 6700PXH PCI Express-to-PCI Bridge A (rev 09)
    01:00.2 PCI bridge: Intel Corporation 6700PXH PCI Express-to-PCI Bridge B (rev 09)
    02:04.0 Ethernet controller: Intel Corporation 82541GI/PI Gigabit Ethernet Controller (rev 05)
    03:07.0 RAID bus controller: 3ware Inc 7xxx/8xxx-series PATA/SATA-RAID (rev 01)
    04:03.0 Ethernet controller: Intel Corporation 82541GI/PI Gigabit Ethernet Controller (rev 05)
    04:0d.0 VGA compatible controller: ATI Technologies Inc Radeon RV100 QY [Radeon 7000/VE]
    
    I hope it's of some use to you.. It's NOT for me :/
     
    Last edited: Apr 22, 2006
  7. falko

    falko Super Moderator

    Looks ok. I saw some output of SELinux. Is SELinux disabled?
     
  8. edge

    edge HowtoForge Supporter

    Hmm it should be disabled, but I do also see
    I guess that this is saying that SELinux is running. I have NO CLUE how it got installed!

    The install I did was the "perfect_setup_fedora_core_4", and I'm 100% sure that I disabled the SELinux option!

    Is there an easy way to 'kill' it again?

    Anyway... I've now made a small bash script that I run every hour to ping the gateway. If there is no ping reply, it restarts the network :/

    Code:
    #!/bin/bash
    
    pingAddress="10.0.0.138"	# IP of gateway
    emailAddress="root@host"	# address to send restart message
    timeNow=$(date)			# set the date and time
    
    # ping exits non-zero when no reply arrives, including cases such as
    # "Network is unreachable" where the output never contains "100%"
    if ! ping -c 1 "$pingAddress" > /dev/null 2>&1
    then
        # no ping reply.. Lets email root and restart the network :-(
        mail -s "No ping. Server network restart" "$emailAddress" <<END
    This is to inform you that the server network was restarted at $timeNow
    END
        /etc/init.d/network restart
    fi
    
     
    Last edited: Apr 24, 2006
  9. thesquib

    thesquib New Member

    Similar Problem

    <double post edited out - sorry>
     
    Last edited: Apr 24, 2006
  10. thesquib

    thesquib New Member

    Hey I'm having a similar problem - I followed the perfect Debian sarge install.
    Basically the server is fine... most of the time, but it goes down for periods of time where nothing will work. When trying to connect to websites it just times out. It won't reply to pings, and email services are down.

    This is my dmesg output related to the ethernet.

    ~# dmesg | grep e100
    BIOS-e820: 00000000fee00000 - 00000000fee10000 (reserved)
    e100: selftest OK.
    e100: eth0: Intel(R) PRO/100 Network Connection
    e100: selftest OK.
    e100: eth1: Intel(R) PRO/100 Network Connection
    e100: eth1 NIC Link is Up 100 Mbps Full duplex

    I'm not really sure why this is happening. Have you found anything yet?
     
  11. edge

    edge HowtoForge Supporter

    My server has exactly the same symptoms. I think that (in my case) it is something with the network (eth0) or the firewall, as when I look at the log file of any other running service (when the server is 'online again') I do not see any problems.

    I remember that when I was installing the server some time ago I 'kind of' had the same problem.

    The terminal (even when it was sitting at the login screen asking for a login) sometimes showed
    When it did show this, the network went down, and the only way to access the server was by direct access on the system.. A pain in the neck now as the server is about 1hr away from me :/

    Again thesquib, I'm here talking about my server. I'm not sure if this is also the problem with your server!
     
  12. falko

    falko Super Moderator

    You can disable SELinux in /etc/selinux/config. You must reboot your server afterwards.
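
    A quick way to confirm what that file currently says is to extract the uncommented SELINUX= value. This is a minimal sketch; the function name selinux_config_mode and the optional path argument are illustrative assumptions.

```shell
#!/bin/bash
# Print the SELINUX= mode set in a config file.
# $1 = path to the file (default: /etc/selinux/config).
selinux_config_mode() {
    local file="${1:-/etc/selinux/config}"
    # Comment lines start with "#" and are skipped by the anchored pattern;
    # SELINUXTYPE= does not match because "=" must follow SELINUX directly.
    # If the setting appears more than once, the last one is reported.
    sed -n 's/^[[:space:]]*SELINUX=[[:space:]]*\([[:alnum:]]*\).*/\1/p' "$file" | tail -n 1
}
```

    The file only takes effect at boot, so where the SELinux userland tools are installed, the running state can be compared against it with `getenforce`.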

    That's the normal message when iptables goes up. Maybe it's some kind of firewall rule that blocks networking.
     
  13. edge

    edge HowtoForge Supporter

    This is what it is set to (I did not change it)
    Code:
      GNU nano 1.3.5-cvs             File: config
    
    # This file controls the state of SELinux on the system.
    # SELINUX= can take one of these three values:
    #       enforcing - SELinux security policy is enforced.
    #       permissive - SELinux prints warnings instead of enforcing.
    #       disabled - SELinux is fully disabled.
    SELINUX=disabled
    # SELINUXTYPE= type of policy in use. Possible values are:
    #       targeted - Only targeted network daemons are protected.
    #       strict - Full SELinux protection.
    SELINUXTYPE=targeted
    
    It looks like it's set to: disabled already!

    Okay, but is it also normal that it's showing it over and over and over again?
    Even when I'm not logged in (terminal is sitting at the login: prompt)
     
    Last edited: Apr 24, 2006
  14. edge

    edge HowtoForge Supporter

    More strange info about SELinux.

    uname -a

    So NO SELinux kernel....

    When I run 'cat /proc/1/environ' I get this:

     
  15. thesquib

    thesquib New Member

    Mine must be some sort of firewall problem... I just managed to figure out that even though services are not available, I can actually use everything fine on another server on the same IP range. Yet both the internal network and the external network have intermittent access.

    I haven't configured a firewall, but I did enable it once or twice within the ISPConfig system.
     
  16. falko

    falko Super Moderator

    That's good.


    Yes, that's normal.
     
  17. falko

    falko Super Moderator

    Are you sure no other firewall was running when you started the ISPConfig firewall? If another one was running, both firewalls will interfere with each other and produce unforeseeable results.
    Please disable the ISPConfig firewall, then reboot the system and then post the output of
    Code:
    iptables -L
     
  18. edge

    edge HowtoForge Supporter

    falko, I really appreciate you helping me with this problem..

    This is what I get after disabling the ISPConfig firewall and rebooting. And no, I'm not using any other firewall alongside the ISPConfig one.

    Code:
    [root@host ~]# iptables -L
    Chain FORWARD (policy ACCEPT)
    target     prot opt source               destination
    
    Chain INPUT (policy ACCEPT)
    target     prot opt source               destination
    
    Chain OUTPUT (policy ACCEPT)
    target     prot opt source               destination
    [root@host ~]#
    
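
    The listing above shows only the three built-in chains, each with policy ACCEPT and no rules, which means nothing is being filtered. A rough sketch for checking that mechanically is below. The function name firewall_is_open is hypothetical, and it expects only the output of `iptables -L` on stdin (no shell prompt lines).

```shell
#!/bin/bash
# Succeeds (exit 0) when `iptables -L` output read from stdin shows only
# chains with policy ACCEPT and no rules under them; fails otherwise.
firewall_is_open() {
    awk '
        BEGIN { bad = 0 }
        /^Chain / { if ($0 !~ /\(policy ACCEPT\)/) bad = 1; next }
        /^target[[:space:]]/ { next }   # column header line
        NF > 0 { bad = 1 }              # any other non-empty line is a rule
        END { exit bad }
    '
}
```

    For example, `iptables -L | firewall_is_open && echo "no filtering active"`. A user-defined chain or any rule line makes the check fail, which is the cautious direction for this purpose.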
     
    Last edited: Apr 24, 2006
  19. edge

    edge HowtoForge Supporter

    I've been reading a lot about 'sk98lin', Gigabit LAN, and network problems. I'm not sure if I'm using this driver on my Fedora Core 4 server, but when I do a 'locate sk98lin' I do get a lot of files back!

    Could this be causing the problems?
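
    Files found by locate only show that a driver is installed, not that it is in use. The dmesg output posted earlier in this thread shows the e1000 driver claiming eth0 and eth1. One way to confirm which driver is bound to an interface is to resolve its sysfs driver link, as sketched below. The function name iface_driver is hypothetical, and the sysfs layout used (class/net/IFACE/device/driver) is an assumption that should hold on 2.6 kernels with PCI network cards.

```shell
#!/bin/bash
# Print the kernel driver bound to a network interface by resolving the
# sysfs "driver" symlink of its device.
# $1 = interface name, $2 = sysfs root (default /sys; overridable for testing).
iface_driver() {
    local iface="$1" sysroot="${2:-/sys}"
    local link="$sysroot/class/net/$iface/device/driver"
    # No such link means the interface does not exist or has no bound driver.
    [ -L "$link" ] || return 1
    basename "$(readlink "$link")"
}
```

    Running `iface_driver eth0` prints the driver name; `ethtool -i eth0` reports the same information where ethtool is installed.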
     
  20. falko

    falko Super Moderator

    At least it's possible.
    Do you get the same problems if you leave the firewall off? That would mean it isn't caused by the firewall.
     
