LKML Archive on lore.kernel.org
help / color / mirror / Atom feed
* MCE Erros
@ 2007-03-22 19:50 Thomas Glanzmann
2007-03-22 19:54 ` Thomas Glanzmann
2007-03-22 19:54 ` Mws
0 siblings, 2 replies; 3+ messages in thread
From: Thomas Glanzmann @ 2007-03-22 19:50 UTC (permalink / raw)
To: LKML
Hello,
I have two Dual Opteron Machines where I get two MCE errors on. The
first one is:
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 0 4 northbridge TSC edc587de6e99
ADDR 1001a0000
Northbridge GART error
bit61 = error uncorrected
TLB error 'generic transaction, level generic'
STATUS a40000000005001b MCGSTATUS 0
I see this error exactly 8 times. What does 'GART' mean?
And here is another one another box:
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 1 4 northbridge TSC f23151075b21d
ADDR b8898250
Northbridge Chipkill ECC error
Chipkill ECC syndrome = f858
bit32 = err cpu0
bit46 = corrected ecc error
bit62 = error overflow (multiple errors)
bus error 'local node origin, request didn't time out
generic read mem transaction
memory access, level generic'
STATUS d42c4001f8080813 MCGSTATUS 0
How do I identify the broken Memory Modules?
Thomas
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: MCE Erros
2007-03-22 19:50 MCE Erros Thomas Glanzmann
@ 2007-03-22 19:54 ` Thomas Glanzmann
2007-03-22 19:54 ` Mws
1 sibling, 0 replies; 3+ messages in thread
From: Thomas Glanzmann @ 2007-03-22 19:54 UTC (permalink / raw)
To: LKML
Hello,
> MCE 0
> HARDWARE ERROR. This is *NOT* a software problem!
> Please contact your hardware vendor
> CPU 0 4 northbridge TSC edc587de6e99
> ADDR 1001a0000
> Northbridge GART error
> bit61 = error uncorrected
> TLB error 'generic transaction, level generic'
> STATUS a40000000005001b MCGSTATUS 0
> I see this error exactly 8 times. What does 'GART' mean?
dict says GART means 'Graphics Address Remapping Table (AGP)' but I
don't see how that fits in the picture?
(tomcat-01) [~] lspci
00:06.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8111 PCI (rev 07)
00:07.0 ISA bridge: Advanced Micro Devices [AMD] AMD-8111 LPC (rev 05)
00:07.1 IDE interface: Advanced Micro Devices [AMD] AMD-8111 IDE (rev 03)
00:07.2 SMBus: Advanced Micro Devices [AMD] AMD-8111 SMBus 2.0 (rev 02)
00:07.3 Bridge: Advanced Micro Devices [AMD] AMD-8111 ACPI (rev 05)
00:0a.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8131 PCI-X Bridge (rev 12)
00:0a.1 PIC: Advanced Micro Devices [AMD] AMD-8131 PCI-X IOAPIC (rev 01)
00:0b.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8131 PCI-X Bridge (rev 12)
00:0b.1 PIC: Advanced Micro Devices [AMD] AMD-8131 PCI-X IOAPIC (rev 01)
00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
00:19.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
00:19.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
00:19.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
00:19.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
01:03.0 RAID bus controller: 3ware Inc 7xxx/8xxx-series PATA/SATA-RAID (rev 01)
02:09.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5704 Gigabit Ethernet (rev 03)
02:09.1 Ethernet controller: Broadcom Corporation NetXtreme BCM5704 Gigabit Ethernet (rev 03)
03:00.0 USB Controller: Advanced Micro Devices [AMD] AMD-8111 USB (rev 0b)
03:00.1 USB Controller: Advanced Micro Devices [AMD] AMD-8111 USB (rev 0b)
03:06.0 VGA compatible controller: ATI Technologies Inc Rage XL (rev 27)
03:08.0 Ethernet controller: Intel Corporation 82557/8/9 [Ethernet Pro 100] (rev 10)
(tomcat-01) [~] dmesg | grep -i agp
No AGP bridge found
PCI-DMA: Disabling AGP.
PCI-DMA: Reserving 64MB of IOMMU area in the AGP aperture
Linux agpgart interface v0.101 (c) Dave Jones
Greetings,
Thomas
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: MCE Erros
2007-03-22 19:50 MCE Erros Thomas Glanzmann
2007-03-22 19:54 ` Thomas Glanzmann
@ 2007-03-22 19:54 ` Mws
1 sibling, 0 replies; 3+ messages in thread
From: Mws @ 2007-03-22 19:54 UTC (permalink / raw)
To: linux-kernel; +Cc: Thomas Glanzmann
On Thursday 22 March 2007, Thomas Glanzmann wrote:
> Hello,
> I have two Dual Opteron Machines where I get two MCE errors on. The
> first one is:
>
> MCE 0
> HARDWARE ERROR. This is *NOT* a software problem!
> Please contact your hardware vendor
> CPU 0 4 northbridge TSC edc587de6e99
> ADDR 1001a0000
> Northbridge GART error
> bit61 = error uncorrected
> TLB error 'generic transaction, level generic'
> STATUS a40000000005001b MCGSTATUS 0
>
> I see this error exactly 8 times. What does 'GART' mean?
Graphics Address Remapping Table
used with agp
regards
marcel
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2007-03-22 19:55 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-03-22 19:50 MCE Erros Thomas Glanzmann
2007-03-22 19:54 ` Thomas Glanzmann
2007-03-22 19:54 ` Mws
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).