* Bug: 2.6.24-smp: Eeek! page_mapcount(page) went negative! (-1)
@ 2008-02-06 18:13 kerndev
  2008-02-06 19:55 ` Hugh Dickins
  0 siblings, 1 reply; 5+ messages in thread
From: kerndev @ 2008-02-06 18:13 UTC (permalink / raw)
  To: linux-kernel; +Cc: John Heil

[-- Attachment #1: Type: TEXT/PLAIN, Size: 7166 bytes --]


Hi,

I have recently had a couple of seemingly hard hangs requiring reboot by 
reset button. I could not ping the box.

Until now, the console log (log level 7) was empty/normal...

This morning I finally seem to have found out what the failures are...
an rmap bug for real mem > 4G...


First, some info on my system...

Kernel 2.6.24 SMP on AMD Athlon 64 X2 4000

top - 09:44:55 up 24 min, 42 users,  load average: 0.05, 0.11, 0.16
Tasks: 206 total,   3 running, 203 sleeping,   0 stopped,   0 zombie
Cpu(s):  0.3%us,  1.3%sy,  0.0%ni, 96.2%id,  0.0%wa,  0.0%hi,  2.2%si,  0.0%st
Mem:   8248340k total,  3769556k used,  4478784k free,    82732k buffers
Swap:  7828572k total,        0k used,  7828572k free,  3254360k cached

***  top's "used" figure was about 6.2G, with 0k swap usage.

I was working on local X (there were also 2 remote xdm KDE sessions)
when my openoffice edit acted strangely. A couple of minutes
later, X completely locked up.

A remote ssh session allowed me to do a normal shutdown.

On reboot, I found the bug report in the console log, included below.


Before digging into this myself (don't really have enough time),
I thought I'd submit it here...

my .config is tar gz'd as an attachment

Any input would be appreciated...


Thnx much,

johnh


.
===========================================================
John Heil
South Coast Software
Custom firmware, device drivers and board bring up services
Ph: 1-714-774-6952
Fx: 1-714-774-7053
www.sc-software.com
email: johnhscs@sc-software.com
===========================================================


Feb  6 09:09:47 sc-software kernel: Eeek! page_mapcount(page) went negative! (-1)
Feb  6 09:09:47 sc-software kernel:   page pfn = 12b1b1
Feb  6 09:09:47 sc-software kernel:   page->flags = 80080014
Feb  6 09:09:47 sc-software kernel:   page->count = 0
Feb  6 09:09:47 sc-software kernel:   page->mapping = 00000000
Feb  6 09:09:47 sc-software kernel:   vma->vm_ops = 0x0
Feb  6 09:09:47 sc-software kernel: ------------[ cut here ]------------
Feb  6 09:09:47 sc-software kernel: kernel BUG at mm/rmap.c:631!
Feb  6 09:09:47 sc-software kernel: invalid opcode: 0000 [#2] PREEMPT SMP
Feb  6 09:09:47 sc-software kernel: Modules linked in: shpchp pci_hotplug ohci1394 ieee1394
Feb  6 09:09:47 sc-software kernel:
Feb  6 09:09:47 sc-software kernel: Pid: 3710, comm: X Tainted: G      D (2.6.24 #1)
Feb  6 09:09:47 sc-software kernel: EIP: 0060:[<c016fc36>] EFLAGS: 00213246 CPU: 0
Feb  6 09:09:47 sc-software kernel: EIP is at page_remove_rmap+0xe6/0x110
Feb  6 09:09:47 sc-software kernel: EAX: 00000000 EBX: c516deb8 ECX: c01280a8 EDX: 00000001
Feb  6 09:09:47 sc-software kernel: ESI: f7120ee4 EDI: 11605000 EBP: f1b31d70 ESP: f1b31d60
Feb  6 09:09:47 sc-software kernel:  DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
Feb  6 09:09:47 sc-software kernel: Process X (pid: 3710, ti=f1b30000 task=f0586aa0 task.ti=f1b30000)
Feb  6 09:09:47 sc-software kernel: Stack: c074dda2 00000000 c516deb8 f0662028 f1b31db4 c01683ce 2b1b1065 00000001
Feb  6 09:09:47 sc-software kernel:        00000000 00000000 2b1b1065 00000001 c1a9657c fffffffb ffffffff f71cb300
Feb  6 09:09:47 sc-software kernel:        f7120ee4 c8e36180 1a004000 11800000 f07fe460 f1b31df8 c016858d 11600000
Feb  6 09:09:47 sc-software kernel: Call Trace:
Feb  6 09:09:47 sc-software kernel:  [<c0103e5a>] show_trace_log_lvl+0x1a/0x30
Feb  6 09:09:47 sc-software kernel:  [<c0103f2a>] show_stack_log_lvl+0x9a/0xc0
Feb  6 09:09:47 sc-software kernel:  [<c01040d7>] show_registers+0xc7/0x250
Feb  6 09:09:47 sc-software kernel:  [<c010441f>] die+0x11f/0x220
Feb  6 09:09:47 sc-software kernel:  [<c01045b1>] do_trap+0x91/0xd0
Feb  6 09:09:47 sc-software kernel:  [<c0104859>] do_invalid_op+0x89/0xa0
Feb  6 09:09:47 sc-software kernel:  [<c0657c22>] error_code+0x72/0x78
Feb  6 09:09:47 sc-software kernel:  [<c01683ce>] zap_pte_range+0x22e/0x320
Feb  6 09:09:47 sc-software kernel:  [<c016858d>] unmap_page_range+0xcd/0x180
Feb  6 09:09:47 sc-software kernel:  [<c016870d>] unmap_vmas+0xcd/0x210
Feb  6 09:09:47 sc-software kernel:  [<c016dd36>] exit_mmap+0x76/0x120
Feb  6 09:09:47 sc-software kernel:  [<c0124c39>] mmput+0x39/0xa0
Feb  6 09:09:47 sc-software kernel:  [<c0129b3e>] exit_mm+0x8e/0xf0
Feb  6 09:09:47 sc-software kernel:  [<c012a393>] do_exit+0x153/0x350
Feb  6 09:09:47 sc-software kernel:  [<c012a5e9>] do_group_exit+0x29/0x70
Feb  6 09:09:47 sc-software kernel:  [<c0134452>] get_signal_to_deliver+0x242/0x2e0
Feb  6 09:09:47 sc-software kernel:  [<c0102e4c>] do_signal+0x5c/0x170
Feb  6 09:09:47 sc-software kernel:  [<c0102f9c>] do_notify_resume+0x3c/0x40
Feb  6 09:09:47 sc-software kernel:  [<c01031b5>] work_notifysig+0x13/0x1a
Feb  6 09:09:47 sc-software kernel:  =======================
Feb  6 09:09:47 sc-software kernel: Code: bb dd 74 c0 e8 dc 2f fe ff 8b 46 44 85 c0 75 1f 8b 46 4c 85 c0 74 14 8b 40 10 85 c0 74 0d 8b 50 2c b8 14 ec 74 c0 e8 ba 2f fe ff <0f> 0b eb fe 8b 50 0c b8 38 ec 74 c0 e8 a9 2f fe ff 8b 46 44 8b
Feb  6 09:09:47 sc-software kernel: EIP: [<c016fc36>] page_remove_rmap+0xe6/0x110 SS:ESP 0068:f1b31d60
Feb  6 09:09:47 sc-software kernel: ---[ end trace 791c1b0a2eb87d50 ]---
Feb  6 09:09:47 sc-software kernel: Fixing recursive fault but reboot is needed!
Feb  6 09:09:47 sc-software kernel: BUG: scheduling while atomic: X/3710/0x00000003
Feb  6 09:09:47 sc-software kernel: INFO: lockdep is turned off.
Feb  6 09:09:47 sc-software kernel: Pid: 3710, comm: X Tainted: G      D 2.6.24 #1
Feb  6 09:09:47 sc-software kernel:  [<c0103e5a>] show_trace_log_lvl+0x1a/0x30
Feb  6 09:09:47 sc-software kernel:  [<c0103e82>] show_trace+0x12/0x20
Feb  6 09:09:47 sc-software kernel:  [<c0103ffd>] dump_stack+0x6d/0x80
Feb  6 09:09:47 sc-software kernel:  [<c0120f56>] __schedule_bug+0x66/0x70
Feb  6 09:09:47 sc-software kernel:  [<c0655655>] __sched_text_start+0x295/0x490
Feb  6 09:09:47 sc-software kernel:  [<c012a4ab>] do_exit+0x26b/0x350
Feb  6 09:09:47 sc-software kernel:  [<c0104513>] die+0x213/0x220
Feb  6 09:09:47 sc-software kernel:  [<c01045b1>] do_trap+0x91/0xd0
Feb  6 09:09:47 sc-software kernel:  [<c0104859>] do_invalid_op+0x89/0xa0
Feb  6 09:09:47 sc-software kernel:  [<c0657c22>] error_code+0x72/0x78
Feb  6 09:09:47 sc-software kernel:  [<c01683ce>] zap_pte_range+0x22e/0x320
Feb  6 09:09:47 sc-software kernel:  [<c016858d>] unmap_page_range+0xcd/0x180
Feb  6 09:09:47 sc-software kernel:  [<c016870d>] unmap_vmas+0xcd/0x210
Feb  6 09:09:47 sc-software kernel:  [<c016dd36>] exit_mmap+0x76/0x120
Feb  6 09:09:47 sc-software kernel:  [<c0124c39>] mmput+0x39/0xa0
Feb  6 09:09:47 sc-software kernel:  [<c0129b3e>] exit_mm+0x8e/0xf0
Feb  6 09:09:47 sc-software kernel:  [<c012a393>] do_exit+0x153/0x350
Feb  6 09:09:47 sc-software kernel:  [<c012a5e9>] do_group_exit+0x29/0x70
Feb  6 09:09:47 sc-software kernel:  [<c0134452>] get_signal_to_deliver+0x242/0x2e0
Feb  6 09:09:47 sc-software kernel:  [<c0102e4c>] do_signal+0x5c/0x170
Feb  6 09:09:47 sc-software kernel:  [<c0102f9c>] do_notify_resume+0x3c/0x40
Feb  6 09:09:47 sc-software kernel:  [<c01031b5>] work_notifysig+0x13/0x1a
Feb  6 09:09:47 sc-software kernel:  =======================

[-- Attachment #2: .config - rmap bug --]
[-- Type: APPLICATION/octet-stream, Size: 14658 bytes --]


* Re: Bug: 2.6.24-smp: Eeek! page_mapcount(page) went negative! (-1)
  2008-02-06 18:13 Bug: 2.6.24-smp: Eeek! page_mapcount(page) went negative! (-1) kerndev
@ 2008-02-06 19:55 ` Hugh Dickins
  2008-02-06 20:21   ` kerndev
  0 siblings, 1 reply; 5+ messages in thread
From: Hugh Dickins @ 2008-02-06 19:55 UTC (permalink / raw)
  To: John Heil; +Cc: linux-kernel

On Wed, 6 Feb 2008, kerndev@sc-software.com wrote:
> 
> Before digging into this myself (don't really have enough time),
> I thought I'd submit it here...

Thanks for the report.

> Feb  6 09:09:47 sc-software kernel: Eeek! page_mapcount(page) went negative!

These very often turn out to be due to bad RAM or some such:
worth running memtest86+ overnight when you can, though its
success wouldn't be conclusive.

> (-1)
> Feb  6 09:09:47 sc-software kernel:   page pfn = 12b1b1
> Feb  6 09:09:47 sc-software kernel:   page->flags = 80080014
> Feb  6 09:09:47 sc-software kernel:   page->count = 0

That's a free page which shouldn't have appeared in the pagetable.
It could be that there's a bit error somewhere in that pfn of 12b1b1.
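
A rough sketch of how that pfn can be checked by hand. It assumes 4 KiB pages and a PAE kernel (more than 4G of RAM on a 32-bit kernel suggests PAE, but the thread does not state it), and it assumes that the stack words `2b1b1065 00000001` in the oops are the two halves of the pte being zapped. The helper names are made up for illustration.

```python
PAGE_SHIFT = 12  # assumption: 4 KiB pages, as on i386


def pfn_to_phys(pfn: int) -> int:
    """Physical address of the first byte of page frame `pfn`."""
    return pfn << PAGE_SHIFT


def decode_pae_pte(low: int, high: int) -> tuple[int, int]:
    """Join the two 32-bit halves of a PAE pte; return (pfn, low 12 flag bits)."""
    pte = (high << 32) | low
    pfn = (pte >> PAGE_SHIFT) & ((1 << 40) - 1)  # keep address bits 12..51 only
    return pfn, pte & 0xFFF


def single_bit_neighbours(pfn: int, width: int = 24) -> list[int]:
    """Every pfn that differs from `pfn` in exactly one of the low `width` bits."""
    return [pfn ^ (1 << bit) for bit in range(width)]


pfn = 0x12B1B1                      # "page pfn = 12b1b1" from the log
phys = pfn_to_phys(pfn)
print(hex(phys), "above 4 GiB:", phys >= 4 << 30)

# Assumed pte halves taken from the "Stack:" line of the oops.
pte_pfn, pte_flags = decode_pae_pte(0x2B1B1065, 0x00000001)
print(hex(pte_pfn), hex(pte_flags))

# Candidate "intended" pfns if exactly one bit of the pfn was flipped.
print([hex(n) for n in single_bit_neighbours(pfn)][:4])
```

Under those assumptions the pte on the stack decodes to the same pfn that the log reports, and the page lies above the 4 GiB boundary. The neighbour list only enumerates candidates; it cannot say which one, if any, was the intended page.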

> Feb  6 09:09:47 sc-software kernel:   page->mapping = 00000000
> Feb  6 09:09:47 sc-software kernel:   vma->vm_ops = 0x0
> Feb  6 09:09:47 sc-software kernel: ------------[ cut here ]------------
> Feb  6 09:09:47 sc-software kernel: kernel BUG at mm/rmap.c:631!
> Feb  6 09:09:47 sc-software kernel: invalid opcode: 0000 [#2] PREEMPT SMP
> Feb  6 09:09:47 sc-software kernel: Modules linked in: shpchp pci_hotplug
> ohci1394 ieee1394
> Feb  6 09:09:47 sc-software kernel:
> Feb  6 09:09:47 sc-software kernel: Pid: 3710, comm: X Tainted: G      D

But that [#2] and that D taint both say there was already a problem
before this: anything showing in the log for that?
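
A minimal sketch of pulling those two markers out of an oops header, assuming the header lines look like the ones quoted in this thread. `parse_oops_header` is a hypothetical helper, not a kernel or distro tool. In the kernel's taint flags, `D` means the kernel has already died (oops or BUG) once since boot, and `G` means no proprietary modules are loaded.

```python
import re


def parse_oops_header(lines):
    """Return (die counter, set of taint letters) found in oops header lines."""
    die_count = None
    taint = set()
    for line in lines:
        m = re.search(r"\[#(\d+)\]", line)          # e.g. "[#2]"
        if m:
            die_count = int(m.group(1))
        m = re.search(r"Tainted: ([A-Z ]+)", line)  # e.g. "Tainted: G      D"
        if m:
            taint |= set(m.group(1).replace(" ", ""))
    return die_count, taint


# Header lines copied from the two oopses in this thread.
later_oops = [
    "invalid opcode: 0000 [#2] PREEMPT SMP",
    "Pid: 3710, comm: X Tainted: G      D (2.6.24 #1)",
]
earlier_oops = [
    "invalid opcode: 0000 [#1] PREEMPT SMP",
    "Pid: 6040, comm: modprobe Not tainted (2.6.24 #1)",
]

for name, lines in (("later", later_oops), ("earlier", earlier_oops)):
    count, flags = parse_oops_header(lines)
    print(name, count, sorted(flags))
```

A counter above 1, or a `D` among the taint letters, is the cue to look further back in the log for the first failure, since later oopses may only be fallout from it.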

Hugh


* Re: Bug: 2.6.24-smp: Eeek! page_mapcount(page) went negative! (-1)
  2008-02-06 19:55 ` Hugh Dickins
@ 2008-02-06 20:21   ` kerndev
  2008-02-06 20:33     ` Pekka Enberg
  0 siblings, 1 reply; 5+ messages in thread
From: kerndev @ 2008-02-06 20:21 UTC (permalink / raw)
  To: Hugh Dickins; +Cc: linux-kernel, John Heil

On Wed, 6 Feb 2008, Hugh Dickins wrote:

> But that [#2] and that D taint both say there was already a problem
> before this: anything showing in the log for that?
>
> Hugh


From reboot after last hang on 02/03/08, I found this...


Feb  5 23:26:26 sc-software kernel: ------------[ cut here ]------------
Feb  5 23:26:26 sc-software kernel: kernel BUG at mm/slab.c:591!
Feb  5 23:26:26 sc-software kernel: invalid opcode: 0000 [#1] PREEMPT SMP
Feb  5 23:26:26 sc-software kernel: Modules linked in: shpchp pci_hotplug ohci1394 ieee1394
Feb  5 23:26:26 sc-software kernel:
Feb  5 23:26:26 sc-software kernel: Pid: 6040, comm: modprobe Not tainted (2.6.24 #1)
Feb  5 23:26:26 sc-software kernel: EIP: 0060:[<c017960f>] EFLAGS: 00010046 CPU: 1
Feb  5 23:26:26 sc-software kernel: EIP is at kfree+0x8f/0xa0
Feb  5 23:26:26 sc-software kernel: EAX: 40000000 EBX: f5915380 ECX: c1bb4498 EDX: c1bb4498
Feb  5 23:26:26 sc-software kernel: ESI: eff354a0 EDI: f5815280 EBP: f0067e78 ESP: f0067e68
Feb  5 23:26:26 sc-software kernel:  DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
Feb  5 23:26:26 sc-software kernel: Process modprobe (pid: 6040, ti=f0066000 task=f00b5550 task.ti=f0066000)
Feb  5 23:26:26 sc-software kernel: Stack: 00000282 f5915380 eff354a0 b7fb87b0 f0067f1c c01ac9ac 00000003 00001812
Feb  5 23:26:26 sc-software kernel:        f0067e98 0805ba64 00000418 0805ba64 f0067eb4 0805ba64 00001812 00000000
Feb  5 23:26:26 sc-software kernel:        f5815280 00000001 00000000 0805be7c 0805ba64 0805aa64 08048000 00000000
Feb  5 23:26:26 sc-software kernel: Call Trace:
Feb  5 23:26:26 sc-software kernel:  [<c0103e5a>] show_trace_log_lvl+0x1a/0x30
Feb  5 23:26:26 sc-software kernel:  [<c0103f2a>] show_stack_log_lvl+0x9a/0xc0
Feb  5 23:26:26 sc-software kernel:  [<c01040d7>] show_registers+0xc7/0x250
Feb  5 23:26:26 sc-software kernel:  [<c010441f>] die+0x11f/0x220
Feb  5 23:26:26 sc-software kernel:  [<c01045b1>] do_trap+0x91/0xd0
Feb  5 23:26:26 sc-software kernel:  [<c0104859>] do_invalid_op+0x89/0xa0
Feb  5 23:26:26 sc-software kernel:  [<c0657c22>] error_code+0x72/0x78
Feb  5 23:26:26 sc-software kernel:  [<c01ac9ac>] load_elf_binary+0x8cc/0xcf0
Feb  5 23:26:26 sc-software kernel:  [<c0181864>] search_binary_handler+0xc4/0x250
Feb  5 23:26:26 sc-software kernel:  [<c0181b35>] do_execve+0x145/0x190
Feb  5 23:26:26 sc-software kernel:  [<c0101c82>] sys_execve+0x32/0xa0
Feb  5 23:26:26 sc-software kernel:  [<c0103102>] syscall_call+0x7/0xb
Feb  5 23:26:26 sc-software kernel:  =======================
Feb  5 23:26:26 sc-software kernel: Code: 47 1d 00 8b 1c 86 8b 03 3b 43 04 73 11 89 7c 83 30 40 89 03 ff 75 f0 9d 58 5b 5e 5f 5d c3 89 f0 89 da e8 65 fc ff ff 8b 03 eb e2 <0f> 0b eb fe 8b 51 0c eb b7 8b 4a 0c eb a2 8d 76 00 55 89 e5 5d
Feb  5 23:26:26 sc-software kernel: EIP: [<c017960f>] kfree+0x8f/0xa0 SS:ESP 0068:f0067e68
Feb  5 23:26:26 sc-software kernel: ---[ end trace 791c1b0a2eb87d50 ]---






* Re: Bug: 2.6.24-smp: Eeek! page_mapcount(page) went negative! (-1)
  2008-02-06 20:21   ` kerndev
@ 2008-02-06 20:33     ` Pekka Enberg
  2008-02-06 20:38       ` kerndev
  0 siblings, 1 reply; 5+ messages in thread
From: Pekka Enberg @ 2008-02-06 20:33 UTC (permalink / raw)
  To: kerndev; +Cc: Hugh Dickins, linux-kernel

Hi,

On Feb 6, 2008 10:21 PM,  <kerndev@sc-software.com> wrote:
> From reboot after last hang on 02/03/08, I found this...
>
> Feb  5 23:26:26 sc-software kernel: ------------[ cut here ]------------
> Feb  5 23:26:26 sc-software kernel: kernel BUG at mm/slab.c:591!
> Feb  5 23:26:26 sc-software kernel: invalid opcode: 0000 [#1] PREEMPT SMP
> Feb  5 23:26:26 sc-software kernel: Modules linked in: shpchp pci_hotplug ohci1394 ieee1394
> Feb  5 23:26:26 sc-software kernel:
> Feb  5 23:26:26 sc-software kernel: Pid: 6040, comm: modprobe Not tainted (2.6.24 #1)
> Feb  5 23:26:26 sc-software kernel: EIP: 0060:[<c017960f>] EFLAGS: 00010046 CPU: 1
> Feb  5 23:26:26 sc-software kernel: EIP is at kfree+0x8f/0xa0
> Feb  5 23:26:26 sc-software kernel: EAX: 40000000 EBX: f5915380 ECX: c1bb4498 EDX: c1bb4498
> Feb  5 23:26:26 sc-software kernel: ESI: eff354a0 EDI: f5815280 EBP: f0067e78 ESP: f0067e68
> Feb  5 23:26:26 sc-software kernel:  DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
> Feb  5 23:26:26 sc-software kernel: Process modprobe (pid: 6040, ti=f0066000 task=f00b5550 task.ti=f0066000)
> Feb  5 23:26:26 sc-software kernel: Stack: 00000282 f5915380 eff354a0 b7fb87b0 f0067f1c c01ac9ac 00000003 00001812
> Feb  5 23:26:26 sc-software kernel:        f0067e98 0805ba64 00000418 0805ba64 f0067eb4 0805ba64 00001812 00000000
> Feb  5 23:26:26 sc-software kernel:        f5815280 00000001 00000000 0805be7c 0805ba64 0805aa64 08048000 00000000
> Feb  5 23:26:26 sc-software kernel: Call Trace:
> Feb  5 23:26:26 sc-software kernel:  [<c0103e5a>] show_trace_log_lvl+0x1a/0x30
> Feb  5 23:26:26 sc-software kernel:  [<c0103f2a>] show_stack_log_lvl+0x9a/0xc0
> Feb  5 23:26:26 sc-software kernel:  [<c01040d7>] show_registers+0xc7/0x250
> Feb  5 23:26:26 sc-software kernel:  [<c010441f>] die+0x11f/0x220
> Feb  5 23:26:26 sc-software kernel:  [<c01045b1>] do_trap+0x91/0xd0
> Feb  5 23:26:26 sc-software kernel:  [<c0104859>] do_invalid_op+0x89/0xa0
> Feb  5 23:26:26 sc-software kernel:  [<c0657c22>] error_code+0x72/0x78
> Feb  5 23:26:26 sc-software kernel:  [<c01ac9ac>] load_elf_binary+0x8cc/0xcf0
> Feb  5 23:26:26 sc-software kernel:  [<c0181864>] search_binary_handler+0xc4/0x250
> Feb  5 23:26:26 sc-software kernel:  [<c0181b35>] do_execve+0x145/0x190
> Feb  5 23:26:26 sc-software kernel:  [<c0101c82>] sys_execve+0x32/0xa0
> Feb  5 23:26:26 sc-software kernel:  [<c0103102>] syscall_call+0x7/0xb

This means we pass a non-slab pointer from load_elf_binary() to
kfree(), which, from reading the code, doesn't seem likely to be a
software bug. As Hugh suggested, please run memtest86+.
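
To illustrate why a corrupted pointer would trip that check, here is a toy model only. `ToySlab`, `flip_bit` and the address are all made up, and the real allocator in mm/slab.c is far more involved. It sketches the idea that kfree() refuses a pointer whose page is not marked as a slab page, and that a single flipped bit in an otherwise valid pointer can move it onto such a page.

```python
PAGE_SHIFT = 12  # assumption: 4 KiB pages


class ToySlab:
    """Hypothetical stand-in for the allocator: remembers which pages hold slab objects."""

    def __init__(self):
        self.slab_pages = set()

    def kmalloc(self, addr: int) -> int:
        # Pretend an object was allocated at `addr`; mark its page as a slab page.
        self.slab_pages.add(addr >> PAGE_SHIFT)
        return addr

    def kfree(self, addr: int) -> None:
        # Models the sanity check: the pointer must lie in a slab page.
        if (addr >> PAGE_SHIFT) not in self.slab_pages:
            raise RuntimeError(f"BUG: {addr:#x} is not in a slab page")


def flip_bit(value: int, bit: int) -> int:
    """Return `value` with one bit inverted, as a single-bit memory error would."""
    return value ^ (1 << bit)


slab = ToySlab()
ptr = slab.kmalloc(0xC1000040)   # made-up kernel address
slab.kfree(ptr)                  # intact pointer: accepted

bad = flip_bit(ptr, 20)          # one bit flipped above the page offset
try:
    slab.kfree(bad)
except RuntimeError as err:
    print(err)
```

In this toy model a flip below bit 12 stays within the same page and goes unnoticed, which is one reason a clean run of such checks proves little either way.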


* Re: Bug: 2.6.24-smp: Eeek! page_mapcount(page) went negative! (-1)
  2008-02-06 20:33     ` Pekka Enberg
@ 2008-02-06 20:38       ` kerndev
  0 siblings, 0 replies; 5+ messages in thread
From: kerndev @ 2008-02-06 20:38 UTC (permalink / raw)
  To: Pekka Enberg; +Cc: Hugh Dickins, linux-kernel, John Heil

On Wed, 6 Feb 2008, Pekka Enberg wrote:

> This means we pass a non-slab pointer from load_elf_binary() to
> kfree() which doesn't seem likely to be a software bug reading the
> code. As Hugh suggested, please run memtest86+.
>

Will do!... Thnx

j

