LKML Archive on lore.kernel.org
* The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
@ 2008-03-30 19:09 Chr
2008-03-30 19:25 ` Rafael J. Wysocki
2008-03-31 7:10 ` Andi Kleen
0 siblings, 2 replies; 25+ messages in thread
From: Chr @ 2008-03-30 19:09 UTC
To: linux-kernel
[-- Attachment #1: Type: text/plain, Size: 2183 bytes --]
Hi,
Ever since I went to the new 2.6.25-rcX, I encountered lots of random
system freezes after about 2 or 3 hours of uptime... but until now,
I couldn't _catch_ them, since the system
(AMD Athlon 4200+ X2 (Manchester) / nforce 4 SLI / x86_64) went
straight into oblivion (nothing on the serial console or anywhere else,
heck, I couldn't even hit reset; it didn't POST anymore).
So, take a look at stuck_task... that's the best backtrace I could get.
(I have other logs too (task list / lockdep / ...), but it's too much,
and I hope the _logs_ are already enough.)
BTW: I noticed that the clock seems to jump forward and backwards.
Is this because the CPU cores aren't synchronized? (And why are they
out of sync?)
(And yes, 2.6.24.4 is fine... even hpet works there without any
rtc: lost 14 interrupts)
cat /proc/interrupts
           CPU0       CPU1
  0:         51          0   XT-PIC-XT        timer
  1:         43      22170   IO-APIC-edge     i8042
  4:          0          1   IO-APIC-edge
  6:          0          5   IO-APIC-edge     floppy
  7:          1          0   IO-APIC-edge     parport0
  8:          0        199   IO-APIC-edge     rtc
  9:          0          0   IO-APIC-fasteoi  acpi
 14:          0          0   IO-APIC-edge     pata_amd
 15:          0          0   IO-APIC-edge     pata_amd
 16:         82      51311   IO-APIC-fasteoi  firewire_ohci, EMU10K1
 17:          0         50   IO-APIC-fasteoi  firewire_ohci, eth1
 18:         12       4862   IO-APIC-fasteoi  nvidia
 20:        421     304857   IO-APIC-fasteoi  eth0
 21:         57      33025   IO-APIC-fasteoi  ohci_hcd:usb1
 22:        185      12275   IO-APIC-fasteoi  sata_nv
 23:        276      83270   IO-APIC-fasteoi  sata_nv, ehci_hcd:usb2
NMI:          0          0   Non-maskable interrupts
LOC:     339847     307599   Local timer interrupts
RES:     126709      38837   Rescheduling interrupts
CAL:        605       1054   function call interrupts
TLB:       1513       3813   TLB shootdowns
TRM:          0          0   Thermal event interrupts
THR:          0          0   Threshold APIC interrupts
SPU:          0          0   Spurious interrupts
ERR:          1
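To compare how the counts in the listing are split between the two cores, the table can be parsed mechanically. The following is a minimal Python sketch, not part of the original report; `parse_interrupts`, `cpu_share`, and `SAMPLE` are hypothetical names, and `SAMPLE` copies only three rows from the listing.

```python
SAMPLE = """\
CPU0 CPU1
0: 51 0 XT-PIC-XT timer
20: 421 304857 IO-APIC-fasteoi eth0
ERR: 1
"""


def parse_interrupts(text):
    """Parse /proc/interrupts-style text.

    Returns (cpus, table) where table maps each row label to
    (per-CPU counts, remaining description).
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    cpus = lines[0].split()  # header row: CPU0 CPU1 ...
    table = {}
    for ln in lines[1:]:
        # Split on the first colon only; device names may contain colons.
        label, _, rest = ln.partition(":")
        fields = rest.split()
        counts = []
        # Leading numeric fields are per-CPU counts (at most one per CPU).
        while fields and len(counts) < len(cpus) and fields[0].isdigit():
            counts.append(int(fields.pop(0)))
        table[label.strip()] = (counts, " ".join(fields))
    return cpus, table


def cpu_share(counts, cpu_index):
    """Fraction of a row's interrupts that were handled by one CPU."""
    total = sum(counts)
    return counts[cpu_index] / total if total else 0.0


cpus, table = parse_interrupts(SAMPLE)
for label, (counts, desc) in table.items():
    print(label, counts, desc)
```

On a live system the same function could be fed the contents of /proc/interrupts instead of `SAMPLE`.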
[-- Attachment #2: dmesg-bootlog --]
[-- Type: text/plain, Size: 24364 bytes --]
[ 0.605408] ACPI: PCI Interrupt Link [APC4] (IRQs 19) *0, disabled.
[ 0.610325] ACPI: PCI Interrupt Link [APC5] (IRQs *16), disabled.
[ 0.616700] ACPI: PCI Interrupt Link [APCF] (IRQs 20 21 22 23) *0
[ 0.624190] ACPI: PCI Interrupt Link [APCG] (IRQs 20 21 22 23) *0, disabled.
[ 0.631312] ACPI: PCI Interrupt Link [APCH] (IRQs 20 21 22 23) *0
[ 0.634324] ACPI: PCI Interrupt Link [APCJ] (IRQs 20 21 22 23) *0, disabled.
[ 0.642706] ACPI: PCI Interrupt Link [APCK] (IRQs 20 21 22 23) *0, disabled.
[ 0.650411] ACPI: PCI Interrupt Link [APCS] (IRQs 20 21 22 23) *0
[ 0.657417] ACPI: PCI Interrupt Link [APCL] (IRQs 20 21 22 23) *0
[ 0.664255] ACPI: PCI Interrupt Link [APCZ] (IRQs 20 21 22 23) *0, disabled.
[ 0.672093] ACPI: PCI Interrupt Link [APSI] (IRQs 20 21 22 23) *0
[ 0.678312] ACPI: PCI Interrupt Link [APSJ] (IRQs 20 21 22 23) *0
[ 0.685716] ACPI: PCI Interrupt Link [APCP] (IRQs 20 21 22 23) *0, disabled.
[ 0.692663] Linux Plug and Play Support v0.97 (c) Adam Belay
[ 0.696078] pnp: PnP ACPI init
[ 0.700068] ACPI: bus type pnp registered
[ 0.713296] pnp: PnP ACPI: found 13 devices
[ 0.716049] ACPI: ACPI bus type pnp unregistered
[ 0.720229] SCSI subsystem initialized
[ 0.724161] libata version 3.00 loaded.
[ 0.724307] PCI: Using ACPI for IRQ routing
[ 0.728050] PCI: If a device doesn't work, try "pci=routeirq". If it helps, post a report
[ 0.752050] hpet clockevent registered
[ 0.752050] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 31
[ 0.753401] hpet0: 3 32-bit timers, 25000000 Hz
[ 0.757174] ACPI: RTC can wake from S4
[ 0.760058] Switched to high resolution mode on CPU 0
[ 0.636155] Switched to high resolution mode on CPU 1
[ 0.780054] system 00:01: ioport range 0x4000-0x407f has been reserved
[ 0.780057] system 00:01: ioport range 0x4080-0x40ff has been reserved
[ 0.784057] system 00:01: ioport range 0x4400-0x447f has been reserved
[ 0.788058] system 00:01: ioport range 0x4480-0x44ff has been reserved
[ 0.792058] system 00:01: ioport range 0x4800-0x487f has been reserved
[ 0.796058] system 00:01: ioport range 0x4880-0x48ff has been reserved
[ 0.800060] system 00:01: iomem range 0x0-0x0 could not be reserved
[ 0.804067] system 00:02: ioport range 0x4d0-0x4d1 has been reserved
[ 0.808059] system 00:02: ioport range 0x800-0x87f has been reserved
[ 0.812059] system 00:02: ioport range 0x290-0x297 has been reserved
[ 0.816075] system 00:0b: iomem range 0xe0000000-0xefffffff could not be reserved
[ 0.820070] system 00:0c: iomem range 0xf0000-0xf3fff could not be reserved
[ 0.824060] system 00:0c: iomem range 0xf4000-0xf7fff could not be reserved
[ 0.828060] system 00:0c: iomem range 0xf8000-0xfbfff could not be reserved
[ 0.832061] system 00:0c: iomem range 0xfc000-0xfffff could not be reserved
[ 0.836061] system 00:0c: iomem range 0x7fff0000-0x7fffffff could not be reserved
[ 0.840061] system 00:0c: iomem range 0xffff0000-0xffffffff has been reserved
[ 0.844063] system 00:0c: iomem range 0x0-0x9ffff could not be reserved
[ 0.848062] system 00:0c: iomem range 0x100000-0x7ffeffff could not be reserved
[ 0.852062] system 00:0c: iomem range 0xfec00000-0xfec00fff has been reserved
[ 0.856062] system 00:0c: iomem range 0xfee00000-0xfeefffff could not be reserved
[ 0.860062] system 00:0c: iomem range 0xfefff000-0xfeffffff has been reserved
[ 0.864062] system 00:0c: iomem range 0xfff80000-0xfff80fff has been reserved
[ 0.868063] system 00:0c: iomem range 0xfff90000-0xfffbffff has been reserved
[ 0.872063] system 00:0c: iomem range 0xfffed000-0xfffeffff has been reserved
[ 0.877102] PCI: Bridge: 0000:00:09.0
[ 0.880064] IO window: a000-afff
[ 0.884066] MEM window: 0xd4000000-0xd5ffffff
[ 0.888065] PREFETCH window: disabled.
[ 0.892066] PCI: Bridge: 0000:00:0b.0
[ 0.896064] IO window: disabled.
[ 0.900066] MEM window: disabled.
[ 0.904066] PREFETCH window: disabled.
[ 0.908067] PCI: Bridge: 0000:00:0c.0
[ 0.912065] IO window: disabled.
[ 0.916067] MEM window: disabled.
[ 0.920067] PREFETCH window: disabled.
[ 0.924069] PCI: Bridge: 0000:00:0d.0
[ 0.928066] IO window: disabled.
[ 0.932068] MEM window: disabled.
[ 0.936068] PREFETCH window: disabled.
[ 0.940070] PCI: Bridge: 0000:00:0e.0
[ 0.944068] IO window: 9000-9fff
[ 0.948069] MEM window: 0xd0000000-0xd3ffffff
[ 0.952069] PREFETCH window: 0x00000000c0000000-0x00000000cfffffff
[ 0.956076] PCI: Setting latency timer of device 0000:00:09.0 to 64
[ 0.956089] PCI: Setting latency timer of device 0000:00:0b.0 to 64
[ 0.956100] PCI: Setting latency timer of device 0000:00:0c.0 to 64
[ 0.956111] PCI: Setting latency timer of device 0000:00:0d.0 to 64
[ 0.956122] PCI: Setting latency timer of device 0000:00:0e.0 to 64
[ 0.956138] NET: Registered protocol family 2
[ 1.004070] IP route cache hash table entries: 65536 (order: 7, 524288 bytes)
[ 1.008427] TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
[ 1.014415] TCP bind hash table entries: 65536 (order: 9, 3670016 bytes)
[ 1.025177] TCP: Hash tables configured (established 262144 bind 65536)
[ 1.028103] TCP reno registered
[ 1.044077] Unpacking initramfs... done
[ 1.189439] Freeing initrd memory: 4656k freed
[ 1.188762] VFS: Disk quotas dquot_6.5.1
[ 1.189798] Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
[ 1.194550] fuse init (API version 7.9)
[ 1.198114] JFS: nTxBlock = 8192, nTxLock = 65536
[ 1.208192] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 253)
[ 1.209654] io scheduler noop registered
[ 1.213709] io scheduler anticipatory registered (default)
[ 1.217699] pci 0000:00:00.0: Enabling HT MSI Mapping
[ 1.241641] pci 0000:00:0b.0: Enabling HT MSI Mapping
[ 1.241657] pci 0000:00:0b.0: Found enabled HT MSI Mapping
[ 1.245646] pci 0000:00:0b.0: Linking AER extended capability
[ 1.249656] pci 0000:00:0c.0: Enabling HT MSI Mapping
[ 1.253654] pci 0000:00:0c.0: Found enabled HT MSI Mapping
[ 1.257646] pci 0000:00:0c.0: Linking AER extended capability
[ 1.261657] pci 0000:00:0d.0: Enabling HT MSI Mapping
[ 1.265655] pci 0000:00:0d.0: Found enabled HT MSI Mapping
[ 1.269646] pci 0000:00:0d.0: Linking AER extended capability
[ 1.273658] pci 0000:00:0e.0: Enabling HT MSI Mapping
[ 1.277655] pci 0000:00:0e.0: Found enabled HT MSI Mapping
[ 1.281647] pci 0000:00:0e.0: Linking AER extended capability
[ 1.285675] pci 0000:01:00.0: Boot video device
[ 1.285840] PCI: Setting latency timer of device 0000:00:0b.0 to 64
[ 1.285875] assign_interrupt_mode Found MSI capability
[ 1.289676] Allocate Port Service[0000:00:0b.0:pcie00]
[ 1.289753] Allocate Port Service[0000:00:0b.0:pcie03]
[ 1.289871] PCI: Setting latency timer of device 0000:00:0c.0 to 64
[ 1.289905] assign_interrupt_mode Found MSI capability
[ 1.293672] Allocate Port Service[0000:00:0c.0:pcie00]
[ 1.293746] Allocate Port Service[0000:00:0c.0:pcie03]
[ 1.297733] PCI: Setting latency timer of device 0000:00:0d.0 to 64
[ 1.297767] assign_interrupt_mode Found MSI capability
[ 1.301671] Allocate Port Service[0000:00:0d.0:pcie00]
[ 1.301754] Allocate Port Service[0000:00:0d.0:pcie03]
[ 1.301879] PCI: Setting latency timer of device 0000:00:0e.0 to 64
[ 1.301913] assign_interrupt_mode Found MSI capability
[ 1.305672] Allocate Port Service[0000:00:0e.0:pcie00]
[ 1.305750] Allocate Port Service[0000:00:0e.0:pcie03]
[ 1.305995] pci_hotplug: PCI Hot Plug PCI Core version: 0.5
[ 1.309711] pciehp: PCI Express Hot Plug Controller Driver version: 0.4
[ 1.313804] input: Power Button (FF) as /class/input/input0
[ 1.317678] ACPI: Power Button (FF) [PWRF]
[ 1.321779] input: Power Button (CM) as /class/input/input1
[ 1.325651] ACPI: Power Button (CM) [PWRB]
[ 1.330048] ACPI: ACPI0007:00 is registered as cooling_device0
[ 1.333751] ACPI: ACPI0007:01 is registered as cooling_device1
[ 1.381650] Real Time Clock Driver v1.12ac
[ 1.381845] Linux agpgart interface v0.103
[ 1.385654] Serial: 8250/16550 driver $Revision: 1.90 $ 2 ports, IRQ sharing disabled
[ 1.389874] serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
[ 1.394432] 00:08: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
[ 1.397990] isa bounce pool size: 16 pages
[ 1.401687] Floppy drive(s): fd0 is 1.44M
[ 1.431547] FDC 0 is a post-1991 82077
[ 1.435778] loop: module loaded
[ 1.439648] Driver 'sd' needs updating - please use bus_type methods
[ 1.443615] Driver 'sr' needs updating - please use bus_type methods
[ 1.447796] sata_nv 0000:00:07.0: version 3.5
[ 1.448272] ACPI: PCI Interrupt Link [APSI] enabled at IRQ 23
[ 1.451560] ACPI: PCI Interrupt 0000:00:07.0[A] -> Link [APSI] -> GSI 23 (level, low) -> IRQ 23
[ 1.459552] sata_nv 0000:00:07.0: Using ADMA mode
[ 1.463631] PCI: Setting latency timer of device 0000:00:07.0 to 64
[ 1.463840] scsi0 : sata_nv
[ 1.467713] scsi1 : sata_nv
[ 1.471776] ata1: SATA max UDMA/133 cmd 0x9f0 ctl 0xbf0 bmdma 0xd800 irq 23
[ 1.475554] ata2: SATA max UDMA/133 cmd 0x970 ctl 0xb70 bmdma 0xd808 irq 23
[ 2.090406] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 2.255723] ata1.00: ATAPI: TSSTcorpCD/DVDW SH-S183L, SB00, max UDMA/33
[ 2.255727] ata1.00: applying bridge limits
[ 2.431991] ata1.00: configured for UDMA/33
[ 2.903284] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 2.806091] ata2.00: ATA-8: SAMSUNG HD501LJ, CR100-12, max UDMA7
[ 2.806096] ata2.00: 976773168 sectors, multi 1: LBA48 NCQ (depth 31/32)
[ 2.820431] ata2.00: configured for UDMA/133
[ 2.825918] scsi 0:0:0:0: CD-ROM TSSTcorp CD/DVDW SH-S183L SB00 PQ: 0 ANSI: 5
[ 2.828452] ata1: DMA mask 0xFFFFFFFF, segment boundary 0xFFFF, hw segs 127
[ 2.844743] sr0: scsi3-mmc drive: 48x/48x writer dvd-ram cd/rw xa/form2 cdda tray
[ 2.844748] Uniform CD-ROM driver Revision: 3.20
[ 2.848979] sr 0:0:0:0: Attached scsi CD-ROM sr0
[ 2.849216] sr 0:0:0:0: Attached scsi generic sg0 type 5
[ 2.853148] scsi 1:0:0:0: Direct-Access ATA SAMSUNG HD501LJ CR10 PQ: 0 ANSI: 5
[ 2.856752] ata2: DMA mask 0xFFFFFFFF, segment boundary 0xFFFFFFFF, hw segs 61
[ 2.861081] sd 1:0:0:0: [sda] 976773168 512-byte hardware sectors (500108 MB)
[ 2.864770] sd 1:0:0:0: [sda] Write Protect is off
[ 2.868748] sd 1:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 2.868789] sd 1:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 2.873094] sd 1:0:0:0: [sda] 976773168 512-byte hardware sectors (500108 MB)
[ 2.876769] sd 1:0:0:0: [sda] Write Protect is off
[ 2.880749] sd 1:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 2.880790] sd 1:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 2.884766] sda: sda1 sda2 sda3 sda4 < sda5 >
[ 2.933280] sd 1:0:0:0: [sda] Attached SCSI disk
[ 2.936108] sd 1:0:0:0: Attached scsi generic sg1 type 0
[ 2.940528] ACPI: PCI Interrupt Link [APSJ] enabled at IRQ 22
[ 2.943963] ACPI: PCI Interrupt 0000:00:08.0[A] -> Link [APSJ] -> GSI 22 (level, low) -> IRQ 22
[ 2.955956] sata_nv 0000:00:08.0: Using ADMA mode
[ 2.960005] PCI: Setting latency timer of device 0000:00:08.0 to 64
[ 2.960131] scsi2 : sata_nv
[ 2.964098] scsi3 : sata_nv
[ 2.967375] ata3: SATA max UDMA/133 cmd 0x9e0 ctl 0xbe0 bmdma 0xc400 irq 22
[ 2.967960] ata4: SATA max UDMA/133 cmd 0x960 ctl 0xb60 bmdma 0xc408 irq 22
[ 3.639970] ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 3.454286] ata3.00: HPA detected: current 488395055, native 488397168
[ 3.454292] ata3.00: ATA-7: WDC WD2500KS-00MJB0, 02.01C03, max UDMA/133
[ 3.458289] ata3.00: 488395055 sectors, multi 1: LBA48
[ 3.470751] ata3.00: configured for UDMA/133
[ 4.147817] ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 4.157219] ata4.00: ATA-7: WDC WD2500YD-01NVB1, 10.02E01, max UDMA/133
[ 4.157224] ata4.00: 490234752 sectors, multi 1: LBA48 NCQ (depth 1)
[ 4.171356] ata4.00: configured for UDMA/133
[ 3.976657] scsi 2:0:0:0: Direct-Access ATA WDC WD2500KS-00M 02.0 PQ: 0 ANSI: 5
[ 3.976664] ata3: DMA mask 0xFFFFFFFFFFFFFFFF, segment boundary 0xFFFFFFFF, hw segs 61
[ 3.980893] sd 2:0:0:0: [sdb] 488395055 512-byte hardware sectors (250058 MB)
[ 3.984683] sd 2:0:0:0: [sdb] Write Protect is off
[ 3.988661] sd 2:0:0:0: [sdb] Mode Sense: 00 3a 00 00
[ 3.988702] sd 2:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 3.992752] sd 2:0:0:0: [sdb] 488395055 512-byte hardware sectors (250058 MB)
[ 3.996683] sd 2:0:0:0: [sdb] Write Protect is off
[ 4.000661] sd 2:0:0:0: [sdb] Mode Sense: 00 3a 00 00
[ 4.000703] sd 2:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 4.004663] sdb: sdb1 sdb2
[ 4.014513] sd 2:0:0:0: [sdb] Attached SCSI disk
[ 4.017484] sd 2:0:0:0: Attached scsi generic sg2 type 0
[ 4.021523] scsi 3:0:0:0: Direct-Access ATA WDC WD2500YD-01N 10.0 PQ: 0 ANSI: 5
[ 4.025338] ata4: DMA mask 0xFFFFFFFFFFFFFFFF, segment boundary 0xFFFFFFFF, hw segs 61
[ 4.029551] sd 3:0:0:0: [sdc] 490234752 512-byte hardware sectors (251000 MB)
[ 4.033356] sd 3:0:0:0: [sdc] Write Protect is off
[ 4.037336] sd 3:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[ 4.037376] sd 3:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 4.041417] sd 3:0:0:0: [sdc] 490234752 512-byte hardware sectors (251000 MB)
[ 4.045356] sd 3:0:0:0: [sdc] Write Protect is off
[ 4.049337] sd 3:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[ 4.049377] sd 3:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 4.053338] sdc: sdc1
[ 4.062864] sd 3:0:0:0: [sdc] Attached SCSI disk
[ 4.063014] sd 3:0:0:0: Attached scsi generic sg3 type 0
[ 4.067063] pata_amd 0000:00:06.0: version 0.3.10
[ 4.067125] PCI: Setting latency timer of device 0000:00:06.0 to 64
[ 4.067195] scsi4 : pata_amd
[ 4.070997] scsi5 : pata_amd
[ 4.075227] ata5: PATA max UDMA/133 cmd 0x1f0 ctl 0x3f6 bmdma 0xf000 irq 14
[ 4.078868] ata6: PATA max UDMA/133 cmd 0x170 ctl 0x376 bmdma 0xf008 irq 15
[ 4.416726] PNP: PS/2 Controller [PNP0303:PS2K] at 0x60,0x64 irq 1
[ 4.416736] PNP: PS/2 appears to have AUX port disabled, if this is incorrect please boot with i8042.nopnp
[ 4.421261] serio: i8042 KBD port at 0x60,0x64 irq 1
[ 4.424989] mice: PS/2 mouse device common for all mice
[ 4.457886] input: AT Translated Set 2 keyboard as /class/input/input2
[ 4.477912] input: PC Speaker as /class/input/input3
[ 4.497937] rtc_cmos: probe of 00:04 failed with error -16
[ 4.498073] Software Watchdog Timer: 0.07 initialized. soft_noboot=0 soft_margin=60 sec (nowayout= 0)
[ 4.501953] md: linear personality registered for level -1
[ 4.505940] md: raid0 personality registered for level 0
[ 4.509941] md: raid1 personality registered for level 1
[ 4.514076] device-mapper: uevent: version 1.0.3
[ 4.518077] device-mapper: ioctl: 4.13.0-ioctl (2007-10-18) initialised: dm-devel@redhat.com
[ 4.522049] EDAC MC: Ver: 2.1.0 Mar 29 2008
[ 4.526185] cpuidle: using governor ladder
[ 4.529943] cpuidle: using governor menu
[ 4.534120] oprofile: using NMI interrupt.
[ 4.538789] TCP cubic registered
[ 4.542053] NET: Registered protocol family 1
[ 4.545970] NET: Registered protocol family 17
[ 4.549963] NET: Registered protocol family 15
[ 4.554393] drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
[ 4.557957] Freeing unused kernel memory: 264k freed
[ 5.453810] usbcore: registered new interface driver usbfs
[ 5.457053] usbcore: registered new interface driver hub
[ 5.461061] usbcore: registered new device driver usb
[ 5.125737] forcedeth: Reverse Engineered nForce ethernet driver. Version 0.61.
[ 5.129289] ACPI: PCI Interrupt Link [APCH] enabled at IRQ 21
[ 5.132808] ACPI: PCI Interrupt 0000:00:0a.0[A] -> Link [APCH] -> GSI 21 (level, low) -> IRQ 21
[ 5.140803] PCI: Setting latency timer of device 0000:00:0a.0 to 64
[ 5.492607] ohci_hcd: 2006 August 04 USB 1.1 'Open' Host Controller (OHCI) Driver
[ 5.665814] forcedeth 0000:00:0a.0: ifname eth0, PHY OUI 0x5043 @ 9, addr 00:15:f2:50:c1:ef
[ 5.665821] forcedeth 0000:00:0a.0: highdma csum timirq gbit lnktim desc-v3
[ 6.031627] ACPI: PCI Interrupt Link [APCL] enabled at IRQ 20
[ 6.031639] ACPI: PCI Interrupt 0000:00:02.1[B] -> Link [APCL] -> GSI 20 (level, low) -> IRQ 20
[ 6.039643] PCI: Setting latency timer of device 0000:00:02.1 to 64
[ 6.039647] ehci_hcd 0000:00:02.1: EHCI Host Controller
[ 6.045098] ehci_hcd 0000:00:02.1: new USB bus registered, assigned bus number 1
[ 6.047689] ehci_hcd 0000:00:02.1: debug port 1
[ 6.051634] PCI: cache line size of 64 is not supported by device 0000:00:02.1
[ 6.051652] ehci_hcd 0000:00:02.1: irq 20, io mem 0xfeb00000
[ 6.075634] ehci_hcd 0000:00:02.1: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
[ 6.080202] usb usb1: configuration #1 chosen from 1 choice
[ 6.083805] hub 1-0:1.0: USB hub found
[ 6.087729] hub 1-0:1.0: 10 ports detected
[ 5.867634] ACPI: PCI Interrupt Link [APC2] enabled at IRQ 17
[ 5.871410] ACPI: PCI Interrupt 0000:05:06.2[B] -> ACPI: PCI Interrupt Link [APCF] enabled at IRQ 23
[ 6.205591] ACPI: PCI Interrupt 0000:00:02.0[A] -> Link [APCF] -> GSI 23 (level, low) -> IRQ 23
[ 6.205591] PCI: Setting latency timer of device 0000:00:02.0 to 64
[ 6.205591] ohci_hcd 0000:00:02.0: OHCI Host Controller
[ 6.205591] ohci_hcd 0000:00:02.0: new USB bus registered, assigned bus number 2
[ 6.205591] ohci_hcd 0000:00:02.0: irq 23, io mem 0xd6003000
[ 5.875402] Link [APC2] -> GSI 17 (level, low) -> IRQ 17
[ 6.261605] usb usb2: configuration #1 chosen from 1 choice
[ 6.261655] hub 2-0:1.0: USB hub found
[ 6.265620] hub 2-0:1.0: 10 ports detected
[ 5.954491] firewire_ohci: Added fw-ohci device 0000:05:06.2, OHCI version 1.10
[ 5.954893] ACPI: PCI Interrupt Link [APC1] enabled at IRQ 16
[ 5.958501] ACPI: PCI Interrupt 0000:05:0b.0[A] -> Link [APC1] -> GSI 16 (level, low) -> IRQ 16
[ 6.041492] firewire_ohci: Added fw-ohci device 0000:05:0b.0, OHCI version 1.10
[ 6.422109] ACPI: PCI Interrupt 0000:05:0c.0[A] -> Link [APC2] -> GSI 17 (level, low) -> IRQ 17
[ 6.429041] skge 1.13 addr 0xd5014000 irq 17 chip Yukon-Lite rev 9
[ 6.433669] skge eth1: addr 00:15:f2:50:d7:70
[ 6.538574] md: md0 stopped.
[ 6.177148] md: bind<sdc1>
[ 6.179951] md: bind<sda3>
[ 6.187657] raid1: raid set md0 active with 2 out of 2 mirrors
[ 6.651951] usb 1-3: new high speed USB device using ehci_hcd and address 3
[ 6.803647] usb 1-3: configuration #1 chosen from 1 choice
[ 6.458356] firewire_core: created device fw0: GUID 00023c015112bb16, S400
[ 6.550553] firewire_core: created device fw1: GUID 0011d800007ab4d6, S400
[ 6.706950] usb 2-2: new low speed USB device using ohci_hcd and address 2
[ 6.881906] usb 2-2: configuration #1 chosen from 1 choice
[ 7.277468] Initializing USB Mass Storage driver...
[ 7.279564] scsi6 : SCSI emulation for USB Mass Storage devices
[ 7.287549] usbcore: registered new interface driver usb-storage
[ 7.289715] USB Mass Storage support registered.
[ 6.928773] usb-storage: device found at 3
[ 6.928773] usb-storage: waiting for device to settle before scanning
[ 6.932887] input: Western Digital External HDD as /class/input/input4
[ 6.934938] input,hidraw0: USB HID v1.11 Device [Western Digital External HDD] on usb-0000:00:02.1-3
[ 6.951377] input: Logitech USB Receiver as /class/input/input5
[ 6.976045] input,hidraw1: USB HID v1.11 Mouse [Logitech USB Receiver] on usb-0000:00:02.0-2
[ 6.996455] input: Logitech USB Receiver as /class/input/input6
[ 7.021296] input,hidraw2: USB HID v1.11 Device [Logitech USB Receiver] on usb-0000:00:02.0-2
[ 7.021332] usbcore: registered new interface driver usbhid
[ 7.025301] drivers/hid/usbhid/hid-core.c: v2.6:USB HID core driver
[ 11.921331] scsi 6:0:0:0: Direct-Access WD 2500JS External 106a PQ: 0 ANSI: 4
[ 11.939160] sd 6:0:0:0: [sdd] 488397168 512-byte hardware sectors (250059 MB)
[ 11.942473] sd 6:0:0:0: [sdd] Write Protect is off
[ 11.943579] sd 6:0:0:0: [sdd] Mode Sense: 11 00 00 00
[ 11.943582] sd 6:0:0:0: [sdd] Assuming drive cache: write through
[ 11.955161] sd 6:0:0:0: [sdd] 488397168 512-byte hardware sectors (250059 MB)
[ 11.958536] sd 6:0:0:0: [sdd] Write Protect is off
[ 11.959581] sd 6:0:0:0: [sdd] Mode Sense: 11 00 00 00
[ 11.959583] sd 6:0:0:0: [sdd] Assuming drive cache: write through
[ 11.963581] sdd: sdd1
[ 11.968564] sd 6:0:0:0: [sdd] Attached SCSI disk
[ 11.968638] sd 6:0:0:0: Attached scsi generic sg4 type 0
[ 11.979982] usb-storage: device scan complete
[ 67.432221] ReiserFS: dm-5: found reiserfs format "3.6" with standard journal
[ 67.432378] ReiserFS: dm-5: using ordered data mode
[ 67.452106] ReiserFS: dm-5: journal params: device dm-5, size 8192, journal first block 18, max trans len 1024, max batch 900, max commit age 30, max trans age 30
[ 67.456714] ReiserFS: dm-5: checking transaction log (dm-5)
[ 88.747260] ReiserFS: dm-5: replayed 723 transactions in 14 seconds
[ 88.779399] ReiserFS: dm-5: Using r5 hash to sort names
[ 92.506218] i2c-adapter i2c-0: nForce2 SMBus adapter at 0x4c00
[ 92.509664] i2c-adapter i2c-1: nForce2 SMBus adapter at 0x4c40
[ 92.693297] parport_pc 00:09: reported by Plug and Play ACPI
[ 92.693822] parport0: PC-style at 0x378 (0x778), irq 7, dma 3 [PCSPP,TRISTATE,COMPAT,ECP,DMA]
[ 93.067467] ACPI: PCI Interrupt 0000:05:06.0[A] -> Link [APC1] -> GSI 16 (level, low) -> IRQ 16
[ 93.079989] Installing spdif_bug patch: Audigy 2 ZS [SB0350]
[ 102.633998] it87: Found IT8712F chip at 0x290, revision 7
[ 102.634011] it87: in3 is VCC (+5V)
[ 102.638001] it87: in7 is VCCH (+5V Stand-By)
[ 102.642097] it87 it87.656: Detected broken BIOS defaults, disabling PWM interface
[ 102.678211] powernow-k8: Found 1 AMD Athlon(tm) 64 X2 Dual Core Processor 4200+ processors (2 cpu cores) (version 2.20.00)
[ 96.210136] powernow-k8: 0 : fid 0xe (2200 MHz), vid 0x8
[ 96.210140] powernow-k8: 1 : fid 0xc (2000 MHz), vid 0xa
[ 96.214140] powernow-k8: 2 : fid 0xa (1800 MHz), vid 0xc
[ 96.218140] powernow-k8: 3 : fid 0x2 (1000 MHz), vid 0x12
[ 141.740697] ReiserFS: dm-13: found reiserfs format "3.6" with standard journal
[ 141.740721] ReiserFS: dm-13: using ordered data mode
[ 141.773961] ReiserFS: dm-13: journal params: device dm-13, size 8192, journal first block 18, max trans len 1024, max batch 900, max commit age 30, max trans age 30
[ 141.775508] ReiserFS: dm-13: checking transaction log (dm-13)
[ 141.917594] ReiserFS: dm-13: Using r5 hash to sort names
[ 133.669410] Adding 2000084k swap on /dev/mapper/sda2_crypt. Priority:-1 extents:1 across:2000084k
[ 137.733298] NET: Registered protocol family 10
[ 149.699559] lp0: using parport0 (interrupt-driven).
[ 149.701951] lp0: console ready
[ 149.734970] ppdev: user-space parallel port driver
[ 153.591107] Clocksource tsc unstable (delta = -90906073 ns)
[ 172.006108] warning: `avahi-daemon' uses 32-bit capabilities (legacy support in use)
[ 175.233381] Bluetooth: Core ver 2.11
[ 175.242749] NET: Registered protocol family 31
[ 175.244699] Bluetooth: HCI device and connection manager initialized
[ 175.248851] Bluetooth: HCI socket layer initialized
[ 175.289903] Bluetooth: L2CAP ver 2.9
[ 175.292135] Bluetooth: L2CAP socket layer initialized
[ 175.315643] Bluetooth: RFCOMM socket layer initialized
[ 175.318242] Bluetooth: RFCOMM TTY layer initialized
[ 175.322232] Bluetooth: RFCOMM ver 1.8
[ 169.236205] ip_tables: (C) 2000-2006 Netfilter Core Team
[ 301.292141] ACPI: PCI Interrupt Link [APC3] enabled at IRQ 18
[ 301.295601] ACPI: PCI Interrupt 0000:05:08.0[A] -> Link [APC3] -> GSI 18 (level, low) -> IRQ 18
[ 318.463318] p54: LM86 firmware
[ 318.463324] p54: FW rev 2.7.0.0 - Softmac protocol 4.1
[ 319.449440] phy0: Selected rate control algorithm 'pid'
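The boot log above contains timestamps that step backwards, for example 0.760058 on CPU 0 followed by 0.636155 on CPU 1, which may be the kind of clock jump the report mentions. A minimal Python sketch for listing such steps in a saved dmesg file follows; `backward_jumps` and `SAMPLE_LOG` are hypothetical names, and the sketch only detects the steps, it does not explain their cause.

```python
import re

# Matches the "[ 12.345678]" prefix that printk timestamps add to each line.
TS_RE = re.compile(r"^\[\s*(\d+\.\d+)\]")

SAMPLE_LOG = [
    "[ 0.760058] Switched to high resolution mode on CPU 0",
    "[ 0.636155] Switched to high resolution mode on CPU 1",
    "[ 0.780054] system 00:01: ioport range 0x4000-0x407f has been reserved",
]


def backward_jumps(lines):
    """Return (previous, current) timestamp pairs where log time decreases."""
    jumps = []
    prev = None
    for line in lines:
        match = TS_RE.match(line)
        if not match:
            continue  # skip lines without a timestamp prefix
        ts = float(match.group(1))
        if prev is not None and ts < prev:
            jumps.append((prev, ts))
        prev = ts
    return jumps


for before, after in backward_jumps(SAMPLE_LOG):
    print(f"time went backwards: {before:.6f} -> {after:.6f}")
```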
[-- Attachment #3: stuck_task --]
[-- Type: text/plain, Size: 3116 bytes --]
[ 5214.923278] SysRq : Show Regs
[ 5214.926537] CPU 0:
[ 5214.928843] Modules linked in: p54pci p54common mac80211 arc4 ecb cfg80211 tcp_diag inet_diag xt_multiport iptable_filter ip_tables x_tables rfcomm l2cap bluetooth ppdev lp cpufreq_powersave cpufreq_conservative ipv6 cpufreq_ondemand cpufreq_userspace powernow_k8 it87 hwmon_vid firewire_sbp2 snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_emu10k1 snd_seq_dummy snd_seq_oss snd_seq_midi snd_seq_midi_event snd_seq snd_rawmidi snd_ac97_codec ac97_bus snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep parport_pc parport k8temp snd hwmon soundcore i2c_nforce2 i2c_core evdev usbhid usb_storage dm_mirror dm_snapshot skge firewire_ohci firewire_core ohci_hcd ehci_hcd forcedeth usbcore [last unloaded: mac80211]
[ 5215.002056] Pid: 4766, comm: hddtemp Not tainted 2.6.25-rc7-wl #7
[ 5215.008417] RIP: 0010:[<ffffffff80217d9e>] [<ffffffff80217d9e>] __smp_call_function_mask+0x9f/0xc5
[ 5215.018010] RSP: 0018:ffff81007830bbd8 EFLAGS: 00000297
[ 5215.023591] RAX: 00000000000008fd RBX: 0000000000000001 RCX: 0000000000000000
[ 5215.030999] RDX: 0101010101010101 RSI: 0000000000000000 RDI: 0000000000000002
[ 5215.038398] RBP: 0000000000000000 R08: cf3cf3cf3cf3cf3d R09: 0000000000000000
[ 5215.045798] R10: 0000000000000001 R11: 0000000000000001 R12: 0101010101010101
[ 5215.053200] R13: ffff810080971000 R14: ffff81007830a000 R15: 0000000000000001
[ 5215.060609] FS: 00007f7b8afbf6e0(0000) GS:ffffffff80636000(0000) knlGS:0000000000000000
[ 5215.069197] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 5215.075210] CR2: 00007f90f817bf30 CR3: 0000000000201000 CR4: 00000000000006e0
[ 5215.082619] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 5215.090018] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 5215.097419]
[ 5215.097420] Call Trace:
[ 5215.101875] [<ffffffff80217d8f>] __smp_call_function_mask+0x90/0xc5
[ 5215.108493] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
[ 5215.114422] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
[ 5215.120348] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
[ 5215.126276] [<ffffffff80217e0d>] smp_call_function_mask+0x49/0x61
[ 5215.132723] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
[ 5215.138649] [<ffffffff802339a3>] on_each_cpu+0x10/0x2c
[ 5215.144135] [<ffffffff8029fb4b>] kill_bdev+0x1e/0x33
[ 5215.149456] [<ffffffff802a01da>] __blkdev_put+0x4f/0x128
[ 5215.155124] [<ffffffff8027dd93>] __fput+0xa1/0x164
[ 5215.160272] [<ffffffff8027b67d>] filp_close+0x5d/0x65
[ 5215.165679] [<ffffffff80231072>] put_files_struct+0x66/0xc4
[ 5215.171607] [<ffffffff802323ea>] do_exit+0x245/0x64c
[ 5215.176925] [<ffffffff80238b94>] __dequeue_signal+0xff/0x159
[ 5215.182941] [<ffffffff80232867>] do_group_exit+0x76/0x93
[ 5215.188606] [<ffffffff8023a0fd>] get_signal_to_deliver+0x2b9/0x2e5
[ 5215.195141] [<ffffffff8020a466>] do_notify_resume+0xaf/0x7e0
[ 5215.201155] [<ffffffff80291414>] mntput_no_expire+0x1f/0x86
[ 5215.201155] [<ffffffff80291414>] mntput_no_expire+0x1f/0x86
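To make the backtrace above easier to compare against other captures, the frames can be reduced to symbol, offset, and size. This is a rough Python sketch, assuming the `[<address>] symbol+0xoff/0xsize` frame format shown in the trace; `parse_frames` and `TRACE` are hypothetical names, and collapsing consecutive repeats is an illustrative choice, not something the kernel does.

```python
import re

# One frame: "[<ffffffff80217d8f>] __smp_call_function_mask+0x90/0xc5"
FRAME_RE = re.compile(
    r"\[<([0-9a-f]+)>\]\s+([A-Za-z_]\w*)\+0x([0-9a-f]+)/0x([0-9a-f]+)"
)

TRACE = [
    "[ 5215.101875] [<ffffffff80217d8f>] __smp_call_function_mask+0x90/0xc5",
    "[ 5215.108493] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42",
    "[ 5215.114422] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42",
    "[ 5215.126276] [<ffffffff80217e0d>] smp_call_function_mask+0x49/0x61",
    "[ 5215.138649] [<ffffffff802339a3>] on_each_cpu+0x10/0x2c",
]


def parse_frames(lines):
    """Extract (symbol, offset, size) tuples, dropping consecutive repeats."""
    frames = []
    for line in lines:
        match = FRAME_RE.search(line)
        if not match:
            continue  # register dumps and headers carry no frame
        frame = (match.group(2), int(match.group(3), 16), int(match.group(4), 16))
        if not frames or frames[-1] != frame:
            frames.append(frame)
    return frames


for symbol, offset, size in parse_frames(TRACE):
    print(f"{symbol}+{offset:#x}/{size:#x}")
```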
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 19:09 The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7 Chr
@ 2008-03-30 19:25 ` Rafael J. Wysocki
2008-03-30 19:59 ` Thomas Gleixner
2008-03-31 7:10 ` Andi Kleen
1 sibling, 1 reply; 25+ messages in thread
From: Rafael J. Wysocki @ 2008-03-30 19:25 UTC
To: Chr; +Cc: linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar, Thomas Gleixner
[Added some CCs]
On Sunday, 30 of March 2008, Chr wrote:
> Hi,
>
> Ever since I went to the new 2.6.25-rcX, I encountered lots of random
> system freezes after about 2 or 3 hours of uptime... but until now,
> I couldn't _catch_ them, since the system
> (AMD Athlon 4200+ X2 (Manchester) / nforce 4 SLI / x86_64) went
> straight into oblivion (nothing on the serial console or anywhere else,
> heck, I couldn't even hit reset; it didn't POST anymore).
>
> So, take a look at stuck_task... that's the best backtrace I could get.
> (I have other logs too (task list / lockdep / ...), but it's too much,
> and I hope the _logs_ are already enough.)
>
> BTW: I noticed that the clock seems to jump forward and backwards.
> Is this because the CPU cores aren't synchronized? (And why are they
> out of sync?)
>
> (And yes, 2.6.24.4 is fine... even hpet works there without any
> rtc: lost 14 interrupts)
>
> cat /proc/interrupts
>            CPU0       CPU1
>   0:         51          0   XT-PIC-XT        timer
>   1:         43      22170   IO-APIC-edge     i8042
>   4:          0          1   IO-APIC-edge
>   6:          0          5   IO-APIC-edge     floppy
>   7:          1          0   IO-APIC-edge     parport0
>   8:          0        199   IO-APIC-edge     rtc
>   9:          0          0   IO-APIC-fasteoi  acpi
>  14:          0          0   IO-APIC-edge     pata_amd
>  15:          0          0   IO-APIC-edge     pata_amd
>  16:         82      51311   IO-APIC-fasteoi  firewire_ohci, EMU10K1
>  17:          0         50   IO-APIC-fasteoi  firewire_ohci, eth1
>  18:         12       4862   IO-APIC-fasteoi  nvidia
>  20:        421     304857   IO-APIC-fasteoi  eth0
>  21:         57      33025   IO-APIC-fasteoi  ohci_hcd:usb1
>  22:        185      12275   IO-APIC-fasteoi  sata_nv
>  23:        276      83270   IO-APIC-fasteoi  sata_nv, ehci_hcd:usb2
> NMI:          0          0   Non-maskable interrupts
> LOC:     339847     307599   Local timer interrupts
> RES:     126709      38837   Rescheduling interrupts
> CAL:        605       1054   function call interrupts
> TLB:       1513       3813   TLB shootdowns
> TRM:          0          0   Thermal event interrupts
> THR:          0          0   Threshold APIC interrupts
> SPU:          0          0   Spurious interrupts
> ERR:          1
>
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 19:25 ` Rafael J. Wysocki
@ 2008-03-30 19:59 ` Thomas Gleixner
2008-03-30 20:18 ` Chr
0 siblings, 1 reply; 25+ messages in thread
From: Thomas Gleixner @ 2008-03-30 19:59 UTC
To: Rafael J. Wysocki
Cc: Chr, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Sun, 30 Mar 2008, Rafael J. Wysocki wrote:
> > So, take a look at stuck_task... that's the best backtrace I could get.
> > (I have other logs too (task list / lockdep / ...), but it's too much,
> > and I hope the _logs_ are already enough.)
> >
> > BTW: I noticed that the clock seems to jump forward and backwards.
> > Is this because the CPU cores aren't synchronized? (And why are they
> > out of sync?)
The cores of those AMD beasts are never in sync. That's a chip feature :)
Can you please provide the output of:
/sys/devices/system/clocksource/clocksource0/current_clocksource
for both 2.6.24 and 2.6.25-rc7
Thanks,
tglx
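The requested value can also be collected with a short script, which may help when comparing the two kernels. This is a minimal Python sketch, assuming the sysfs directory named above and the neighbouring `available_clocksource` file; `read_clocksource` and `CLOCKSOURCE_DIR` are hypothetical names.

```python
from pathlib import Path

# Directory taken from the path quoted in the message above.
CLOCKSOURCE_DIR = Path("/sys/devices/system/clocksource/clocksource0")


def read_clocksource(base=CLOCKSOURCE_DIR):
    """Return (current clocksource, list of available clocksources)."""
    base = Path(base)
    current = (base / "current_clocksource").read_text().strip()
    avail_file = base / "available_clocksource"
    # The list of alternatives is optional for this sketch.
    available = avail_file.read_text().split() if avail_file.exists() else []
    return current, available


# Example use on a running Linux system (not executed here):
#   current, available = read_clocksource()
#   print(current, available)
```

Running it once under each kernel and comparing the two results gives the information asked for.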
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 19:59 ` Thomas Gleixner
@ 2008-03-30 20:18 ` Chr
2008-03-30 20:37 ` Thomas Gleixner
0 siblings, 1 reply; 25+ messages in thread
From: Chr @ 2008-03-30 20:18 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Sunday 30 March 2008 21:59:51 Thomas Gleixner wrote:
> On Sun, 30 Mar 2008, Rafael J. Wysocki wrote:
> > > So. take a look at stuck_task... that's best backtrace I could get...
> > > (I have other logs too. (Task-list / Lockdep / ... ) but it's too much
> > > and I hope the _logs_ are already enough.
> > >
> > > BTW: I noticed that the clock seems to jump forward and backwards.
> > > Is this because the CPU-Cores aren't synchronized? (And why are they
> > > out of sync?)
>
> The cores of those AMD beasts are never in sync. That's a chip feature :)
>
> Can you please provide the output of:
> /sys/devices/system/clocksource/clocksource0/current_clocksource
>
> for both 2.6.24 and 2.6.25-rc7
it's "hpet" (in both cases...)
Regards,
Chr
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 20:18 ` Chr
@ 2008-03-30 20:37 ` Thomas Gleixner
2008-03-30 21:09 ` Chr
0 siblings, 1 reply; 25+ messages in thread
From: Thomas Gleixner @ 2008-03-30 20:37 UTC (permalink / raw)
To: Chr
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Sun, 30 Mar 2008, Chr wrote:
> On Sunday 30 March 2008 21:59:51 Thomas Gleixner wrote:
> > On Sun, 30 Mar 2008, Rafael J. Wysocki wrote:
> > > > So. take a look at stuck_task... that's best backtrace I could get...
> > > > (I have other logs too. (Task-list / Lockdep / ... ) but it's too much
> > > > and I hope the _logs_ are already enough.
> > > >
> > > > BTW: I noticed that the clock seems to jump forward and backwards.
> > > > Is this because the CPU-Cores aren't synchronized? (And why are they
> > > > out of sync?)
> >
> > The cores of those AMD beasts are never in sync. That's a chip feature :)
> >
> > Can you please provide the output of:
> > /sys/devices/system/clocksource/clocksource0/current_clocksource
> >
> > for both 2.6.24 and 2.6.25-rc7
> it's "hpet" (in both cases...)
Hmm. That should be stable. Which clock is jumping back and forth ?
The one in the logs or the system clock ?
Thanks,
tglx
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 20:37 ` Thomas Gleixner
@ 2008-03-30 21:09 ` Chr
2008-03-30 21:36 ` Thomas Gleixner
0 siblings, 1 reply; 25+ messages in thread
From: Chr @ 2008-03-30 21:09 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Sunday 30 March 2008 22:37:29 Thomas Gleixner wrote:
> On Sun, 30 Mar 2008, Chr wrote:
> > On Sunday 30 March 2008 21:59:51 Thomas Gleixner wrote:
> > > On Sun, 30 Mar 2008, Rafael J. Wysocki wrote:
> > > > > So. take a look at stuck_task... that's best backtrace I could
> > > > > get... (I have other logs too. (Task-list / Lockdep / ... ) but
> > > > > it's too much and I hope the _logs_ are already enough.
> > >
> > > The cores of those AMD beasts are never in sync. That's a chip feature
> > > :)
> > >
> > > Can you please provide the output of:
> > > /sys/devices/system/clocksource/clocksource0/current_clocksource
> > >
> > > for both 2.6.24 and 2.6.25-rc7
> >
> > it's "hpet" (in both cases...)
>
> Hmm. That should be stable. Which clock is jumping back and forth ?
> The one in the logs or the system clock ?
The one in the logs... (but it's because of the TSC, right?)
Regards,
Chr
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 21:09 ` Chr
@ 2008-03-30 21:36 ` Thomas Gleixner
2008-03-30 22:19 ` Chr
` (2 more replies)
0 siblings, 3 replies; 25+ messages in thread
From: Thomas Gleixner @ 2008-03-30 21:36 UTC (permalink / raw)
To: Chr
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Sun, 30 Mar 2008, Chr wrote:
> > > it's "hpet" (in both cases...)
> >
> > Hmm. That should be stable. Which clock is jumping back and forth ?
> > The one in the logs or the system clock ?
> The one in the logs... (but it's because of the TSC, right?)
Yes. I thought this was already fixed in Linus' tree, but I might be
wrong as usual.
Anyway, that's not the real problem; I read your original post in the
meantime :)
> Ever since I went to the new 2.6.25-rcX ...
Did this problem start with 25-rc1 ? What is the latest version you
tried ?
> [ 5215.101875] [<ffffffff80217d8f>] __smp_call_function_mask+0x90/0xc5
> [ 5215.108493] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
> [ 5215.114422] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
> [ 5215.120348] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
> [ 5215.126276] [<ffffffff80217e0d>] smp_call_function_mask+0x49/0x61
> [ 5215.132723] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
> [ 5215.138649] [<ffffffff802339a3>] on_each_cpu+0x10/0x2c
> [ 5215.144135] [<ffffffff8029fb4b>] kill_bdev+0x1e/0x33
> [ 5215.149456] [<ffffffff802a01da>] __blkdev_put+0x4f/0x128
> [ 5215.155124] [<ffffffff8027dd93>] __fput+0xa1/0x164
> [ 5215.160272] [<ffffffff8027b67d>] filp_close+0x5d/0x65
> [ 5215.165679] [<ffffffff80231072>] put_files_struct+0x66/0xc4
> [ 5215.171607] [<ffffffff802323ea>] do_exit+0x245/0x64c
> [ 5215.176925] [<ffffffff80238b94>] __dequeue_signal+0xff/0x159
> [ 5215.182941] [<ffffffff80232867>] do_group_exit+0x76/0x93
> [ 5215.188606] [<ffffffff8023a0fd>] get_signal_to_deliver+0x2b9/0x2e5
> [ 5215.195141] [<ffffffff8020a466>] do_notify_resume+0xaf/0x7e0
> [ 5215.201155] [<ffffffff80291414>] mntput_no_expire+0x1f/0x86
> [ 5215.201155] [<ffffffff80291414>] mntput_no_expire+0x1f/0x86
You mentioned that you have tons of other logs. Can you please
upload those somewhere? If you don't have a place to put them, please
contact me privately and I'll provide one.
Thanks,
tglx
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 21:36 ` Thomas Gleixner
@ 2008-03-30 22:19 ` Chr
2008-03-31 12:56 ` Adrian Bunk
2008-03-31 22:01 ` Chr
2 siblings, 0 replies; 25+ messages in thread
From: Chr @ 2008-03-30 22:19 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Sunday 30 March 2008 23:36:40 Thomas Gleixner wrote:
> On Sun, 30 Mar 2008, Chr wrote:
> > > > it's "hpet" (in both cases...)
> > >
> > > Hmm. That should be stable. Which clock is jumping back and forth ?
> > > The one in the logs or the system clock ?
> >
> > The one in the logs... (but it's because of the TSC, right?)
>
> Yes. I thought this was already fixed in Linus' tree, but I might be
> wrong as usual.
>
> Anyway, that's not the real problem; I read your original post in the
> meantime :)
>
> > Ever since I went to the new 2.6.25-rcX ...
>
> Did this problem start with 25-rc1 ? What is the latest version you
> tried ?
I had other problems with 25-rc1 as well.. but this one is now gone with
the dm-crypto update. I'll try rc1 tomorrow...
>
> > [ 5215.101875] [<ffffffff80217d8f>] __smp_call_function_mask+0x90/0xc5
> > [ 5215.108493] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
> > [ 5215.114422] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
> > [ 5215.120348] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
> > [ 5215.126276] [<ffffffff80217e0d>] smp_call_function_mask+0x49/0x61
> > [ 5215.132723] [<ffffffff8029b433>] invalidate_bh_lru+0x0/0x42
> > [ 5215.138649] [<ffffffff802339a3>] on_each_cpu+0x10/0x2c
> > [ 5215.144135] [<ffffffff8029fb4b>] kill_bdev+0x1e/0x33
> > [ 5215.149456] [<ffffffff802a01da>] __blkdev_put+0x4f/0x128
> > [ 5215.155124] [<ffffffff8027dd93>] __fput+0xa1/0x164
> > [ 5215.160272] [<ffffffff8027b67d>] filp_close+0x5d/0x65
> > [ 5215.165679] [<ffffffff80231072>] put_files_struct+0x66/0xc4
> > [ 5215.171607] [<ffffffff802323ea>] do_exit+0x245/0x64c
> > [ 5215.176925] [<ffffffff80238b94>] __dequeue_signal+0xff/0x159
> > [ 5215.182941] [<ffffffff80232867>] do_group_exit+0x76/0x93
> > [ 5215.188606] [<ffffffff8023a0fd>] get_signal_to_deliver+0x2b9/0x2e5
> > [ 5215.195141] [<ffffffff8020a466>] do_notify_resume+0xaf/0x7e0
> > [ 5215.201155] [<ffffffff80291414>] mntput_no_expire+0x1f/0x86
> > [ 5215.201155] [<ffffffff80291414>] mntput_no_expire+0x1f/0x86
>
> You mentioned that you have tons of other logs. Can you please
> upload those somewhere? If you don't have a place to put them, please
> contact me privately and I'll provide one.
>
http://www.file-upload.net/download-758265/logs.tar.bz2.html
and hit Download. ;-)
(BTW, you can ignore some logs like freeze7, ... I tried
nolapic, but that doesn't play well with SATA?!)
Regards,
Chr
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 19:09 The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7 Chr
2008-03-30 19:25 ` Rafael J. Wysocki
@ 2008-03-31 7:10 ` Andi Kleen
2008-03-31 12:51 ` Chr
1 sibling, 1 reply; 25+ messages in thread
From: Andi Kleen @ 2008-03-31 7:10 UTC (permalink / raw)
To: Chr; +Cc: linux-kernel
Chr <chunkeey@web.de> writes:
> Ever since I went to the new 2.6.25-rcX, I encountered lots of random
> system freezes after about 2 or 3 hours of uptime... but until now,
> I couldn't _catch_ them, since the system
> (AMD Athlon 4200+ X2 (Manchester) / nforce 4 SLI / x86_64) went
> straight into oblivion (nothing on the serial console or anywhere else,
> heck I couldn't even hit reset, It didn't POST anymore)
Not POSTing would point to some kind of hardware problem. Normally
the kernel shouldn't be able to cause that.
I would recommend you double check if the problem is really not
there with an older kernel by going back there and testing.
-Andi
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-31 7:10 ` Andi Kleen
@ 2008-03-31 12:51 ` Chr
0 siblings, 0 replies; 25+ messages in thread
From: Chr @ 2008-03-31 12:51 UTC (permalink / raw)
To: Andi Kleen; +Cc: linux-kernel, Thomas Gleixner
On Monday 31 March 2008 09:10:34 Andi Kleen wrote:
> Chr <chunkeey@web.de> writes:
> > Ever since I went to the new 2.6.25-rcX, I encountered lots of random
> > system freezes after about 2 or 3 hours of uptime... but until now,
> > I couldn't _catch_ them, since the system
> > (AMD Athlon 4200+ X2 (Manchester) / nforce 4 SLI / x86_64) went
> > straight into oblivion (nothing on the serial console or anywhere else,
> > heck I couldn't even hit reset, It didn't POST anymore)
>
> Not POSTing would point to some kind of hardware problem. Normally
> the kernel shouldn't be able to cause that.
Normally yes... but this is the first time that this has ever happened!
(And my system has been stable so far (for more than a year now, in fact more like
2 years)... even with nvidia & madwifi drivers!).
>
> I would recommend you double check if the problem is really not
> there with an older kernel by going back there and testing.
Well... older kernels don't freeze that way.
I can hit reset whenever I want and it still POSTs.
But I'll try... maybe I'll get another chance.
(BTW, if it happens again, is there anything special that
could help to find the real bugs?)
>
> -Andi
Regards,
Chr.
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 21:36 ` Thomas Gleixner
2008-03-30 22:19 ` Chr
@ 2008-03-31 12:56 ` Adrian Bunk
2008-03-31 13:23 ` Chr
2008-03-31 22:01 ` Chr
2 siblings, 1 reply; 25+ messages in thread
From: Adrian Bunk @ 2008-03-31 12:56 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Chr, Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton
On Sun, Mar 30, 2008 at 11:36:40PM +0200, Thomas Gleixner wrote:
> On Sun, 30 Mar 2008, Chr wrote:
>...
> You mentioned that you have tons of other logs. Can you please
> upload those somewhere? If you don't have a place to put them, please
> contact me privately and I'll provide one.
The logical place would be
http://bugzilla.kernel.org/show_bug.cgi?id=10369
And linux-kernel has a size limit of 400 kB, so everything that is
smaller (at least when compressed) could simply be sent by email.
> Thanks,
>
> tglx
cu
Adrian
--
"Is there not promise of rain?" Ling Tan asked suddenly out
of the darkness. There had been need of rain for many days.
"Only a promise," Lao Er said.
Pearl S. Buck - Dragon Seed
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-31 12:56 ` Adrian Bunk
@ 2008-03-31 13:23 ` Chr
0 siblings, 0 replies; 25+ messages in thread
From: Chr @ 2008-03-31 13:23 UTC (permalink / raw)
To: Adrian Bunk
Cc: Thomas Gleixner, Rafael J. Wysocki, linux-kernel, Andi Kleen,
Andrew Morton
On Monday 31 March 2008 14:56:53 Adrian Bunk wrote:
> On Sun, Mar 30, 2008 at 11:36:40PM +0200, Thomas Gleixner wrote:
> > On Sun, 30 Mar 2008, Chr wrote:
> >...
> > You mentioned that you have tons of other logs. Can you please
> > upload those somewhere? If you don't have a place to put them, please
> > contact me privately and I'll provide one.
>
> The logical place would be
> http://bugzilla.kernel.org/show_bug.cgi?id=10369
Done! The log is about 200 KB (~2 MB once unpacked from the tar.bz2)
Regards
Chr
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-30 21:36 ` Thomas Gleixner
2008-03-30 22:19 ` Chr
2008-03-31 12:56 ` Adrian Bunk
@ 2008-03-31 22:01 ` Chr
2008-03-31 23:14 ` Rafael J. Wysocki
2008-04-01 16:27 ` Thomas Gleixner
2 siblings, 2 replies; 25+ messages in thread
From: Chr @ 2008-03-31 22:01 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Sunday 30 March 2008 23:36:40 Thomas Gleixner wrote:
> On Sun, 30 Mar 2008, Chr wrote:
>
> You mentioned that you have tons of other logs. Can you please
> upload those somewhere? If you don't have a place to put them, please
> contact me privately and I'll provide one.
>
Ahhm, it happened again (well, actually it's the fifth time today, but this time
I _hopefully_ found something)
I made a new log: (check out bugzilla)
http://bugzilla.kernel.org/show_bug.cgi?id=10369
(direct link)
http://bugzilla.kernel.org/attachment.cgi?id=15542
so, this log was made from a _working_ machine over the serial
console... I put some _real_ "date" marks here and there...
to explain a bit, how the time STALLS... (well, it seems like
it goes a bit backward and forward and backwards again... it loops?!
whatever..!??!?!!)
just take a look at the jiffies (grep for them... it takes "minutes"
until jiffies+1 comes)
Maybe there's a "signed" problem somewhere in the timekeeping
code? Or does 2.6.25-rcX have a general problem with NTP-daemons
like chrony?
Regards,
Chr.
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-31 22:01 ` Chr
@ 2008-03-31 23:14 ` Rafael J. Wysocki
2008-03-31 23:30 ` Chr
2008-04-01 16:27 ` Thomas Gleixner
1 sibling, 1 reply; 25+ messages in thread
From: Rafael J. Wysocki @ 2008-03-31 23:14 UTC (permalink / raw)
To: Chr; +Cc: Thomas Gleixner, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Tuesday, 1 of April 2008, Chr wrote:
> On Sunday 30 March 2008 23:36:40 Thomas Gleixner wrote:
> > On Sun, 30 Mar 2008, Chr wrote:
> >
> > You mentioned that you have tons of other logs. Can you please
> > upload those somewhere? If you don't have a place to put them, please
> > contact me privately and I'll provide one.
> >
> Ahhm, it happened again (well, actually it's the fifth time today, but this time
> I _hopefully_ found something)
>
> I made a new log: (check out bugzilla)
> http://bugzilla.kernel.org/show_bug.cgi?id=10369
>
> (direct link)
> http://bugzilla.kernel.org/attachment.cgi?id=15542
>
> so, this log was made from a _working_ machine over the serial
> console... I put some _real_ "date" marks here and there...
> to explain a bit, how the time STALLS... (well, it seems like
> it goes a bit backward and forward and backwards again... it loops?!
> whatever..!??!?!!)
>
> just take a look at the jiffies (grep for them... it takes "minutes"
> until jiffies+1 comes)
>
> Maybe there's a "signed" problem somewhere in the timekeeping
> code? Or does 2.6.25-rcX have a general problem with NTP-daemons
> like chrony?
Have you posted the .config already?
Rafael
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-31 23:14 ` Rafael J. Wysocki
@ 2008-03-31 23:30 ` Chr
2008-04-01 21:16 ` Rafael J. Wysocki
0 siblings, 1 reply; 25+ messages in thread
From: Chr @ 2008-03-31 23:30 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Thomas Gleixner, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Tuesday 01 April 2008 01:14:24 Rafael J. Wysocki wrote:
> On Tuesday, 1 of April 2008, Chr wrote:
>
> Have you posted the .config already?
Done!
BTW:
[ 5074.308547] .jiffies : 4296102117
[ 5074.308547] .jiffies : 4296102117
[ 5105.659185] .idle_jiffies : 4296102113
[ 5105.659185] .last_jiffies : 4296102117
[ 5105.659185] .next_jiffies : 4296102118
[ 5105.659185] jiffies: 4296102117
[ 5105.659185] .idle_jiffies : 4296102114
[ 5105.659185] .last_jiffies : 4296102113
[ 5105.659185] .next_jiffies : 4296102116
[ 5105.659185] jiffies: 4296102117
[ 5044.543918] .jiffies : 4296102117
[ 5044.640998] .jiffies : 4296102117
[ 4979.215327] .idle_jiffies : 4296102113
[ 4979.229471] .last_jiffies : 4296102117
[ 4979.231334] .next_jiffies : 4296102118
[ 4979.235415] jiffies: 4296102117
[ 4979.298216] .idle_jiffies : 4296102114
[ 4979.312360] .last_jiffies : 4296102113
[ 4979.314223] .next_jiffies : 4296102116
[ 4979.318303] jiffies: 4296102117
[ 5040.749212] .idle_jiffies : 4296102113
[ 5040.763356] .last_jiffies : 4296102117
[ 5040.765219] .next_jiffies : 4296102118
[ 5040.769299] jiffies: 4296102117
[ 5040.832119] .idle_jiffies : 4296102114
[ 5040.846264] .last_jiffies : 4296102113
[ 5040.848127] .next_jiffies : 4296102116
[ 5040.852206] jiffies: 4296102117
-OUCH - going backwards?
[ 4956.514210] .idle_jiffies : 4296102113
[ 4956.528354] .last_jiffies : 4296102117
[ 4956.530217] .next_jiffies : 4296102118
[ 4956.534298] jiffies: 4296102117
[ 4956.597024] .idle_jiffies : 4296102114
[ 4956.611168] .last_jiffies : 4296102113
[ 4956.613031] .next_jiffies : 4296102116
[ 4956.617111] jiffies: 4296102117
[ 5063.325509] .idle_jiffies : 4296102113
[ 5063.339653] .last_jiffies : 4296102117
[ 5063.341517] .next_jiffies : 4296102118
[ 5063.345597] jiffies: 4296102117
[ 5063.408519] .idle_jiffies : 4296102114
[ 5063.422664] .last_jiffies : 4296102113
[ 5063.424526] .next_jiffies : 4296102116
[ 5063.428607] jiffies: 4296102117
[ 5116.186763] .idle_jiffies : 4296102113
[ 5116.186764] .last_jiffies : 4296102117
[ 5116.186764] .next_jiffies : 4296102118
[ 5116.186764] jiffies: 4296102117
[ 5116.186764] .idle_jiffies : 4296102114
[ 5116.186764] .last_jiffies : 4296102113
[ 5116.186764] .next_jiffies : 4296102116
[ 5116.186764] jiffies: 4296102117
[ 5012.429020] .idle_jiffies : 4296102113
[ 5012.443164] .last_jiffies : 4296102117
[ 5012.445027] .next_jiffies : 4296102118
[ 5012.449107] jiffies: 4296102117
[ 5012.511908] .idle_jiffies : 4296102114
[ 5012.526052] .last_jiffies : 4296102113
[ 5012.527915] .next_jiffies : 4296102116
[ 5012.531995] jiffies: 4296102117
[ 5048.890792] .idle_jiffies : 4296102113
[ 5048.904936] .last_jiffies : 4296102117
[ 5048.906800] .next_jiffies : 4296102118
[ 5048.910880] jiffies: 4296102117
[ 5048.973842] .idle_jiffies : 4296102114
[ 5048.987987] .last_jiffies : 4296102113
[ 5048.989850] .next_jiffies : 4296102116
[ 5048.993930] jiffies: 4296102117
[ 5069.025049] .idle_jiffies : 4296102113
[ 5069.039194] .last_jiffies : 4296102117
[ 5069.041056] .next_jiffies : 4296102118
[ 5069.045136] jiffies: 4296102117
[ 5069.107874] .idle_jiffies : 4296102114
[ 5069.122019] .last_jiffies : 4296102113
[ 5069.123881] .next_jiffies : 4296102116
[ 5069.127962] jiffies: 4296102117
[ 5095.978692] .idle_jiffies : 4296102113
[ 5095.992836] .last_jiffies : 4296102117
[ 5095.994699] .next_jiffies : 4296102118
[ 5095.998780] jiffies: 4296102117
[ 5096.061623] .idle_jiffies : 4296102114
[ 5096.075768] .last_jiffies : 4296102113
[ 5096.077631] .next_jiffies : 4296102116
[ 5096.081711] jiffies: 4296102117
[ 5108.211902] .idle_jiffies : 4296102113
[ 5108.211902] .last_jiffies : 4296102117
[ 5108.211902] .next_jiffies : 4296102118
[ 5108.211902] jiffies: 4296102117
[ 5108.211903] .idle_jiffies : 4296102114
[ 5108.211903] .last_jiffies : 4296102113
[ 5108.211903] .next_jiffies : 4296102116
[ 5108.211903] jiffies: 4296102117
[ 5108.211903] .idle_jiffies : 4296102113
[ 5108.211903] .last_jiffies : 4296102117
[ 5108.211903] .next_jiffies : 4296102118
[ 5108.211903] jiffies: 4296102117
[ 5108.211903] .idle_jiffies : 4296102114
[ 5108.211903] .last_jiffies : 4296102113
[ 5108.211903] .next_jiffies : 4296102116
[ 5108.211903] jiffies: 4296102117
[ 5108.211903] .idle_jiffies : 4296102113
[ 5108.211903] .last_jiffies : 4296102117
[ 5108.211903] .next_jiffies : 4296102118
[ 5108.211903] jiffies: 4296102117
[ 5108.211903] .idle_jiffies : 4296102114
[ 5108.211903] .last_jiffies : 4296102113
[ 5108.211903] .next_jiffies : 4296102116
[ 5108.211903] jiffies: 4296102117
[ 5108.211904] .idle_jiffies : 4296102113
[ 5108.211904] .last_jiffies : 4296102117
[ 5108.211904] .next_jiffies : 4296102118
[ 5108.211904] jiffies: 4296102117
[ 5108.211904] .idle_jiffies : 4296102114
[ 5108.211904] .last_jiffies : 4296102113
[ 5108.211904] .next_jiffies : 4296102116
[ 5108.211904] jiffies: 4296102117
[ 5108.211904] .idle_jiffies : 4296102113
[ 5108.211904] .last_jiffies : 4296102117
[ 5108.211904] .next_jiffies : 4296102118
[ 5108.211904] jiffies: 4296102117
[ 5108.211904] .idle_jiffies : 4296102114
[ 5108.211904] .last_jiffies : 4296102113
[ 5108.211904] .next_jiffies : 4296102116
[ 5108.211904] jiffies: 4296102117
[ 5108.211905] .idle_jiffies : 4296102113
[ 5108.211905] .last_jiffies : 4296102117
[ 5108.211905] .next_jiffies : 4296102118
[ 5108.211905] jiffies: 4296102117
[ 5108.211905] .idle_jiffies : 4296102114
[ 5108.211905] .last_jiffies : 4296102113
[ 5108.211905] .next_jiffies : 4296102116
[ 5108.211905] jiffies: 4296102117
[ 5108.211905] .idle_jiffies : 4296102113
[ 5108.211905] .last_jiffies : 4296102117
[ 5108.211905] .next_jiffies : 4296102118
[ 5108.211905] jiffies: 4296102117
[ 5108.211905] .idle_jiffies : 4296102114
[ 5108.211905] .last_jiffies : 4296102113
[ 5108.211905] .next_jiffies : 4296102116
[ 5108.211905] jiffies: 4296102117
[ 5108.211905] .idle_jiffies : 4296102113
[ 5108.211905] .last_jiffies : 4296102117
[ 5108.211905] .next_jiffies : 4296102118
[ 5108.211905] jiffies: 4296102117
[ 5108.211906] .idle_jiffies : 4296102114
[ 5108.211906] .last_jiffies : 4296102113
[ 5108.211906] .next_jiffies : 4296102116
[ 5108.211906] jiffies: 4296102117
[ 5108.211906] .idle_jiffies : 4296102113
[ 5108.211906] .last_jiffies : 4296102117
[ 5108.211906] .next_jiffies : 4296102118
[ 5108.211906] jiffies: 4296102117
[ 5108.211906] .idle_jiffies : 4296102114
[ 5108.211906] .last_jiffies : 4296102113
[ 5108.211906] .next_jiffies : 4296102116
[ 5108.211906] jiffies: 4296102117
[ 5108.211906] .idle_jiffies : 4296102113
[ 5108.211906] .last_jiffies : 4296102117
[ 5108.211906] .next_jiffies : 4296102118
[ 5108.211906] jiffies: 4296102117
[ 5108.211906] .idle_jiffies : 4296102114
[ 5108.211906] .last_jiffies : 4296102113
[ 5108.211906] .next_jiffies : 4296102116
[ 5108.211906] jiffies: 4296102117
[ 5108.211924] .jiffies : 4296102117
[ 5108.211925] .jiffies : 4296102117
Regards,
Chr
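
The backward steps in the bracketed log timestamps above can be found mechanically. The following is a minimal Python sketch, assuming the log is plain text with one "[ seconds.microseconds]" prefix per line as in the excerpt; the function name backward_jumps and the four sample lines (copied from the excerpt) are only for illustration.

```python
import re

# Matches the "[ 5074.308547]" prefix in front of each log line.
TIMESTAMP = re.compile(r"^\[\s*(\d+\.\d+)\]")


def backward_jumps(lines):
    """Yield (previous, current) timestamp pairs where the log time decreases."""
    previous = None
    for line in lines:
        match = TIMESTAMP.match(line.strip())
        if not match:
            continue  # skip lines without a timestamp prefix
        current = float(match.group(1))
        if previous is not None and current < previous:
            yield previous, current
        previous = current


# Four consecutive lines taken from the excerpt above.
sample = """\
[ 5105.659185] jiffies: 4296102117
[ 5044.543918] .jiffies : 4296102117
[ 5044.640998] .jiffies : 4296102117
[ 4979.215327] .idle_jiffies : 4296102113
""".splitlines()

for prev, cur in backward_jumps(sample):
    print(f"{prev:.6f} -> {cur:.6f} ({cur - prev:+.6f} s)")
```

On these four lines it reports two backward steps, each of roughly a minute, while the jiffies value stays at 4296102117. It only flags decreasing timestamps; it does not tell which clock is at fault.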
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-31 22:01 ` Chr
2008-03-31 23:14 ` Rafael J. Wysocki
@ 2008-04-01 16:27 ` Thomas Gleixner
2008-04-01 18:57 ` Chr
2008-04-01 22:29 ` Chr
1 sibling, 2 replies; 25+ messages in thread
From: Thomas Gleixner @ 2008-04-01 16:27 UTC (permalink / raw)
To: Chr
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Tue, 1 Apr 2008, Chr wrote:
> On Sunday 30 March 2008 23:36:40 Thomas Gleixner wrote:
> > On Sun, 30 Mar 2008, Chr wrote:
> >
> > You mentioned that you have tons of other logs. Can you please
> > upload those somewhere? If you don't have a place to put them, please
> > contact me privately and I'll provide one.
> >
> Ahhm, it happened again (well, actually it's the fifth time today, but this time
> I _hopefully_ found something)
>
> I made a new log: (check out bugzilla)
> http://bugzilla.kernel.org/show_bug.cgi?id=10369
>
> (direct link)
> http://bugzilla.kernel.org/attachment.cgi?id=15542
# expires at 4839289000000 nsecs [in -168780949682 nsecs]
That's strange. Local APIC timer interrupt seems to be stuck.
Can you please upload a boot log and the output of
/proc/timer_list of 2.6.24.x to the bugzilla ?
Does adding "noapictimer" to the command line help ?
Thanks,
tglx
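
To put the quoted line in more familiar units, here is a small Python sketch that parses it and converts the negative remaining time to seconds. The regular expression is written against the one line quoted above; treating every "expires at" line in /proc/timer_list as having this shape is an assumption, and parse_expiry is a hypothetical helper name.

```python
import re

# Shape of the line quoted from the attached log in the message above.
EXPIRES = re.compile(r"expires at (\d+) nsecs \[in (-?\d+) nsecs\]")


def parse_expiry(line):
    """Return (expires_ns, remaining_ns) from an 'expires at' line."""
    match = EXPIRES.search(line)
    if match is None:
        raise ValueError(f"not an 'expires at' line: {line!r}")
    return int(match.group(1)), int(match.group(2))


expires_ns, remaining_ns = parse_expiry(
    "# expires at 4839289000000 nsecs [in -168780949682 nsecs]"
)
# A negative remaining time means the expiry point is already in the past.
overdue_s = -remaining_ns / 1e9
print(f"expiry at {expires_ns / 1e9:.3f} s, overdue by {overdue_s:.3f} s")
```

For the quoted line the timer is overdue by roughly 169 seconds, which fits a timer interrupt that has stopped arriving.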
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-04-01 16:27 ` Thomas Gleixner
@ 2008-04-01 18:57 ` Chr
2008-04-01 22:29 ` Chr
1 sibling, 0 replies; 25+ messages in thread
From: Chr @ 2008-04-01 18:57 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Tuesday 01 April 2008 18:27:34 Thomas Gleixner wrote:
> On Tue, 1 Apr 2008, Chr wrote:
> > (direct link)
> > http://bugzilla.kernel.org/attachment.cgi?id=15542
>
> # expires at 4839289000000 nsecs [in -168780949682 nsecs]
>
> That's strange. Local APIC timer interrupt seems to be stuck.
>
> Can you please upload a boot log and the output of
> /proc/timer_list of 2.6.24.x to the bugzilla ?
>
done!
> Does adding "noapictimer" to the command line help ?
will try it now...
(BTW: disabling chrony (ntp-daemon) seems to help too...)
Regards,
Chr
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-03-31 23:30 ` Chr
@ 2008-04-01 21:16 ` Rafael J. Wysocki
2008-04-01 22:27 ` Chr
2008-04-02 12:45 ` Chr
0 siblings, 2 replies; 25+ messages in thread
From: Rafael J. Wysocki @ 2008-04-01 21:16 UTC (permalink / raw)
To: Chr; +Cc: Thomas Gleixner, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Tuesday, 1 of April 2008, Chr wrote:
> On Tuesday 01 April 2008 01:14:24 Rafael J. Wysocki wrote:
> > On Tuesday, 1 of April 2008, Chr wrote:
> >
> > Have you posted the .config already?
> Done!
Hmm. Can you please choose one RTC infrastructure and maybe disable
CONFIG_HPET_EMULATE_RTC?
Rafael
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-04-01 21:16 ` Rafael J. Wysocki
@ 2008-04-01 22:27 ` Chr
2008-04-02 12:45 ` Chr
1 sibling, 0 replies; 25+ messages in thread
From: Chr @ 2008-04-01 22:27 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Thomas Gleixner, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Tuesday 01 April 2008 23:16:15 Rafael J. Wysocki wrote:
> On Tuesday, 1 of April 2008, Chr wrote:
> > On Tuesday 01 April 2008 01:14:24 Rafael J. Wysocki wrote:
> > > On Tuesday, 1 of April 2008, Chr wrote:
> > >
> > > Have you posted the .config already?
> >
> > Done!
>
> Hmm. Can you please choose one RTC infrastructure and maybe disable
> CONFIG_HPET_EMULATE_RTC?
>
Yep... compiling -rc8 now.
Regards,
Chr.
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-04-01 16:27 ` Thomas Gleixner
2008-04-01 18:57 ` Chr
@ 2008-04-01 22:29 ` Chr
2008-04-02 7:08 ` Thomas Gleixner
1 sibling, 1 reply; 25+ messages in thread
From: Chr @ 2008-04-01 22:29 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Tuesday 01 April 2008 18:27:34 Thomas Gleixner wrote:
> On Tue, 1 Apr 2008, Chr wrote:
> > Ahhm, it happened again (well, actually it's the fifth time today, but this
> > time I _hopefully_ found something)
> >
> > (direct link)
> > http://bugzilla.kernel.org/attachment.cgi?id=15542
>
> # expires at 4839289000000 nsecs [in -168780949682 nsecs]
>
> That's strange. Local APIC timer interrupt seems to be stuck.
>
> Can you please upload a boot log and the output of
> /proc/timer_list of 2.6.24.x to the bugzilla ?
>
> Does adding "noapictimer" to the command line help ?
noapictimer seems to help too... (chrony is on)
Regards,
Chr.
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-04-01 22:29 ` Chr
@ 2008-04-02 7:08 ` Thomas Gleixner
2008-04-02 9:34 ` Chr
0 siblings, 1 reply; 25+ messages in thread
From: Thomas Gleixner @ 2008-04-02 7:08 UTC (permalink / raw)
To: Chr
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Wed, 2 Apr 2008, Chr wrote:
> On Tuesday 01 April 2008 18:27:34 Thomas Gleixner wrote:
> > On Tue, 1 Apr 2008, Chr wrote:
> > > Ahhm, it happened again (well, actually it's the fifth time today, but this
> > > time I _hopefully_ found something)
> > >
> > > (direct link)
> > > http://bugzilla.kernel.org/attachment.cgi?id=15542
> >
> > # expires at 4839289000000 nsecs [in -168780949682 nsecs]
> >
> > That's strange. Local APIC timer interrupt seems to be stuck.
> >
> > Can you please upload a boot log and the output of
> > /proc/timer_list of 2.6.24.x to the bugzilla ?
> >
> > Does adding "noapictimer" to the command line help ?
> noapictimer seems to help too... (chrony is on)
Ok. Can you please provide the output of /proc/cpuinfo ?
Thanks,
tglx
^ permalink raw reply [flat|nested] 25+ messages in thread
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-04-02 7:08 ` Thomas Gleixner
@ 2008-04-02 9:34 ` Chr
0 siblings, 0 replies; 25+ messages in thread
From: Chr @ 2008-04-02 9:34 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Rafael J. Wysocki, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Wednesday 02 April 2008 09:08:29 Thomas Gleixner wrote:
> On Wed, 2 Apr 2008, Chr wrote:
> > On Tuesday 01 April 2008 18:27:34 Thomas Gleixner wrote:
> > > On Tue, 1 Apr 2008, Chr wrote:
> > > > Ahhm, it happened again (well, actually it's the fifth time today, but
> > > > this time I _hopefully_ found something)
> > > >
> > > > (direct link)
> > > > http://bugzilla.kernel.org/attachment.cgi?id=15542
> > >
> > > # expires at 4839289000000 nsecs [in -168780949682 nsecs]
> > >
> > > That's strange. Local APIC timer interrupt seems to be stuck.
> > >
> > > Can you please upload a boot log and the output of
> > > /proc/timer_list of 2.6.24.x to the bugzilla ?
> > >
> > > Does adding "noapictimer" to the command line help ?
> >
> > noapictimer seems to help too... (chrony is on)
>
> Ok. Can you please provide the output of /proc/cpuinfo ?
>
> Thanks,
>
> tglx
ok...
/proc/cpuinfo:
processor : 0
vendor_id : AuthenticAMD
cpu family : 15
model : 43
model name : AMD Athlon(tm) 64 X2 Dual Core Processor 4200+
stepping : 1
cpu MHz : 1000.000
cache size : 512 KB
physical id : 0
siblings : 2
core id : 0
cpu cores : 2
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat
pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt lm 3dnowext
3dnow rep_good pni lahf_lm cmp_legacy
bogomips : 2012.13
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management: ts fid vid ttp
processor : 1
vendor_id : AuthenticAMD
cpu family : 15
model : 43
model name : AMD Athlon(tm) 64 X2 Dual Core Processor 4200+
stepping : 1
cpu MHz : 1000.000
cache size : 512 KB
physical id : 0
siblings : 2
core id : 1
cpu cores : 2
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat
pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt lm 3dnowext
3dnow rep_good pni lahf_lm cmp_legacy
bogomips : 2012.13
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management: ts fid vid ttp
Regards,
Chr.
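
Note on the /proc/timer_list line quoted above: the bracketed "in ... nsecs" value appears to be the expiry time minus the time of the snapshot, so a negative value means the timer should already have fired. A minimal Python sketch of that arithmetic, assuming this reading of the two fields (the helper name parse_expiry is hypothetical, and only the one line quoted in this thread is used):

```python
import re

# The line exactly as quoted in the thread from /proc/timer_list.
TIMER_LINE = "# expires at 4839289000000 nsecs [in -168780949682 nsecs]"


def parse_expiry(line):
    """Return (absolute_expiry_ns, relative_ns) from an 'expires at' line."""
    m = re.search(r"expires at (\d+) nsecs \[in (-?\d+) nsecs\]", line)
    if m is None:
        raise ValueError("not an 'expires at' line: %r" % line)
    return int(m.group(1)), int(m.group(2))


expires_ns, delta_ns = parse_expiry(TIMER_LINE)

# Assumption: delta = expiry - now, so the snapshot time is expiry - delta.
now_ns = expires_ns - delta_ns

# A negative delta means the expiry lies in the past at snapshot time.
overdue_s = -delta_ns / 1e9 if delta_ns < 0 else 0.0

print("snapshot taken at %d ns" % now_ns)
print("timer overdue by %.2f s" % overdue_s)
```

Under that assumption the timer was almost 169 seconds overdue when the snapshot was taken, which fits the remark that the local APIC timer interrupt seems to be stuck.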
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-04-01 21:16 ` Rafael J. Wysocki
2008-04-01 22:27 ` Chr
@ 2008-04-02 12:45 ` Chr
1 sibling, 0 replies; 25+ messages in thread
From: Chr @ 2008-04-02 12:45 UTC
To: Rafael J. Wysocki
Cc: Thomas Gleixner, linux-kernel, Andi Kleen, Andrew Morton, Ingo Molnar
On Tuesday 01 April 2008 23:16:15 Rafael J. Wysocki wrote:
> On Tuesday, 1 of April 2008, Chr wrote:
> > On Tuesday 01 April 2008 01:14:24 Rafael J. Wysocki wrote:
> > > On Tuesday, 1 of April 2008, Chr wrote:
> > >
> > > Have you posted the .config already?
> >
> > Done!
>
> Hmm. Can you please choose one RTC infrastructure and maybe disable
> CONFIG_HPET_EMULATE_RTC?
Well... another hang... but I noticed something while filing a new bug report.

2.6.25-rc8 arch/x86/Kconfig:

config HPET_TIMER
    def_bool X86_64
    prompt "HPET Timer Support" if X86_32
[...]
config HPET_EMULATE_RTC
    def_bool y
[...]

Geee... there's no way to disable it on x86_64 without messing with the
Kconfig. So I'll try disabling the other RTC framework instead...
(Since there's no sign of any of those RTC modules in the clocksource or
timer_list output, I guess it will be pointless.)
Regards,
Chr.
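
Note on the Kconfig excerpt above: a Kconfig symbol can only be toggled in the configuration menus if it has a prompt, and a prompt followed by "if COND" is shown only when COND holds. A minimal Python sketch that checks this for the two entries as excerpted (it is not a full Kconfig parser; the function names are hypothetical, and the parts elided with [...] in the message are not reconstructed):

```python
import re

# The two entries exactly as excerpted in the message; elided parts are omitted.
KCONFIG_EXCERPT = """\
config HPET_TIMER
    def_bool X86_64
    prompt "HPET Timer Support" if X86_32

config HPET_EMULATE_RTC
    def_bool y
"""


def parse_symbols(text):
    """Map each 'config NAME' entry to the list of its attribute lines."""
    symbols = {}
    current = None
    for line in text.splitlines():
        stripped = line.strip()
        m = re.match(r"config\s+(\w+)$", stripped)
        if m:
            current = m.group(1)
            symbols[current] = []
        elif stripped and current is not None:
            symbols[current].append(stripped)
    return symbols


def prompt_condition(attrs):
    """Return None if there is no prompt, '' if the prompt is
    unconditional, otherwise the text of its 'if' condition."""
    for attr in attrs:
        m = re.match(r'(?:prompt|bool|tristate)\s+"[^"]*"(?:\s+if\s+(.+))?$', attr)
        if m:
            return m.group(1) or ""
    return None


for name, attrs in parse_symbols(KCONFIG_EXCERPT).items():
    cond = prompt_condition(attrs)
    if cond is None:
        print("%s: no prompt, not user-selectable" % name)
    elif cond:
        print("%s: prompt shown only if %s" % (name, cond))
    else:
        print("%s: prompt always shown" % name)
```

On the excerpt as posted, HPET_TIMER has a prompt only under X86_32 and HPET_EMULATE_RTC has none, which matches the observation that neither can be switched off from the menus on x86_64. Any dependencies hidden behind the [...] could still change the resulting value.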
* Re: The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-04-03 23:22 ` The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7 Rafael J. Wysocki
@ 2008-04-04 1:49 ` Chr
0 siblings, 0 replies; 25+ messages in thread
From: Chr @ 2008-04-04 1:49 UTC
To: Rafael J. Wysocki; +Cc: Linux Kernel Mailing List
On Friday 04 April 2008 01:22:44 Rafael J. Wysocki wrote:
> The following report is on the current list of known regressions
> from 2.6.24. Please verify if the issue is still present in the
> mainline.
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=10369
> Subject : The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
> Submitter : Chr <chunkeey@web.de>
> Date : 2008-03-30 21:09 (5 days old)
> References : http://lkml.org/lkml/2008/3/30/87
Yep, it's still present... but I don't have any time to debug it. :(
The proposed workaround, "noapictimer" plus "hpet=force", works so far...
Maybe it's buggy/bad hardware after all and 2.6.24.4 just doesn't trigger it?!
Regards,
Chr.
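
Note on the workaround above: both options are kernel boot parameters, so whether a running system actually booted with them can be checked against /proc/cmdline. A minimal Python sketch, assuming a Linux system that exposes /proc/cmdline (the function name missing_workaround_params is hypothetical):

```python
def missing_workaround_params(cmdline, wanted=("noapictimer", "hpet=force")):
    """Return the wanted parameters that are absent from a kernel command line."""
    present = set(cmdline.split())
    return [param for param in wanted if param not in present]


if __name__ == "__main__":
    try:
        # Command line the running kernel was booted with.
        with open("/proc/cmdline") as f:
            cmdline = f.read()
    except OSError:
        # Not on Linux, or /proc is not mounted: treat as an empty command line.
        cmdline = ""

    missing = missing_workaround_params(cmdline)
    if missing:
        print("workaround not fully active, missing: " + " ".join(missing))
    else:
        print("both workaround parameters are on the command line")
```

The check compares whole whitespace-separated words, so a parameter such as "hpet=disable" does not count as "hpet=force".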
* The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
2008-04-03 22:49 2.6.25-rc8-git2: Reported regressions from 2.6.24 Rafael J. Wysocki
@ 2008-04-03 23:22 ` Rafael J. Wysocki
2008-04-04 1:49 ` Chr
0 siblings, 1 reply; 25+ messages in thread
From: Rafael J. Wysocki @ 2008-04-03 23:22 UTC
To: Linux Kernel Mailing List; +Cc: Chr
The following report is on the current list of known regressions
from 2.6.24. Please verify if the issue is still present in the
mainline.
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=10369
Subject : The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7
Submitter : Chr <chunkeey@web.de>
Date : 2008-03-30 21:09 (5 days old)
References : http://lkml.org/lkml/2008/3/30/87
Thread overview: 25+ messages
2008-03-30 19:09 The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7 Chr
2008-03-30 19:25 ` Rafael J. Wysocki
2008-03-30 19:59 ` Thomas Gleixner
2008-03-30 20:18 ` Chr
2008-03-30 20:37 ` Thomas Gleixner
2008-03-30 21:09 ` Chr
2008-03-30 21:36 ` Thomas Gleixner
2008-03-30 22:19 ` Chr
2008-03-31 12:56 ` Adrian Bunk
2008-03-31 13:23 ` Chr
2008-03-31 22:01 ` Chr
2008-03-31 23:14 ` Rafael J. Wysocki
2008-03-31 23:30 ` Chr
2008-04-01 21:16 ` Rafael J. Wysocki
2008-04-01 22:27 ` Chr
2008-04-02 12:45 ` Chr
2008-04-01 16:27 ` Thomas Gleixner
2008-04-01 18:57 ` Chr
2008-04-01 22:29 ` Chr
2008-04-02 7:08 ` Thomas Gleixner
2008-04-02 9:34 ` Chr
2008-03-31 7:10 ` Andi Kleen
2008-03-31 12:51 ` Chr
2008-04-03 22:49 2.6.25-rc8-git2: Reported regressions from 2.6.24 Rafael J. Wysocki
2008-04-03 23:22 ` The never ending BEEEEP/__smp_call_function_mask with 2.6.25-rc7 Rafael J. Wysocki
2008-04-04 1:49 ` Chr