LKML Archive on lore.kernel.org
* MCE Erros
@ 2007-03-22 19:50 Thomas Glanzmann
  2007-03-22 19:54 ` Thomas Glanzmann
  2007-03-22 19:54 ` Mws
  0 siblings, 2 replies; 3+ messages in thread
From: Thomas Glanzmann @ 2007-03-22 19:50 UTC (permalink / raw)
  To: LKML

Hello,
I have two dual-Opteron machines, and each of them reports an MCE error.
The first one is:

        MCE 0
        HARDWARE ERROR. This is *NOT* a software problem!
        Please contact your hardware vendor
        CPU 0 4 northbridge TSC edc587de6e99
        ADDR 1001a0000
          Northbridge GART error
               bit61 = error uncorrected
          TLB error 'generic transaction, level generic'
        STATUS a40000000005001b MCGSTATUS 0

I see this error exactly 8 times. What does 'GART' mean?
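
For reference, the "bit61 = error uncorrected" line and the "GART error" label can both be read straight off the STATUS value. The following is only a minimal sketch: it assumes the architectural x86 MCi_STATUS flag layout (bits 63..57) and that the K8 northbridge keeps its extended error code in bits 19..16. The names decode_status, FLAG_BITS and EXT_ERROR are made up for illustration and are not part of mcelog or the kernel.

```python
# Hypothetical helper for illustration; not part of mcelog or the kernel.
# Assumed layout: architectural MCi_STATUS flags in bits 63..57 and the
# K8 northbridge extended error code in bits 19..16.
FLAG_BITS = {
    63: "valid",
    62: "error overflow (multiple errors)",
    61: "error uncorrected",
    60: "error reporting enabled",
    59: "MISC register valid",
    58: "ADDR register valid",
    57: "processor context corrupt",
}

# Only the two extended codes that appear in this thread; others are omitted.
EXT_ERROR = {0x5: "GART error", 0x8: "Chipkill ECC error"}


def decode_status(status):
    """Return (list of set flag names, extended error name) for a STATUS value."""
    flags = [
        name
        for bit, name in sorted(FLAG_BITS.items(), reverse=True)
        if (status >> bit) & 1
    ]
    ext = (status >> 16) & 0xF
    return flags, EXT_ERROR.get(ext, "unknown (0x%x)" % ext)


if __name__ == "__main__":
    flags, ext = decode_status(0xA40000000005001B)
    print("Northbridge", ext)
    for name in flags:
        print("   ", name)
```

With the STATUS value from the report above, bits 63, 61 and 58 are set and the extended code is 5, which agrees with the decoded text.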

And here is another one, from the other box:

        MCE 0
        HARDWARE ERROR. This is *NOT* a software problem!
        Please contact your hardware vendor
        CPU 1 4 northbridge TSC f23151075b21d
        ADDR b8898250
          Northbridge Chipkill ECC error
          Chipkill ECC syndrome = f858
               bit32 = err cpu0
               bit46 = corrected ecc error
               bit62 = error overflow (multiple errors)
          bus error 'local node origin, request didn't time out
              generic read mem transaction
              memory access, level generic'
        STATUS d42c4001f8080813 MCGSTATUS 0
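
The quoted descriptions ("TLB error ..." and "bus error ...") and the Chipkill syndrome also come out of the STATUS word. The sketch below assumes the K8 error-code encoding in the low 16 bits (0000 0000 0001 TTLL for TLB errors, 0000 1PPT RRRR IILL for bus errors) and assumes the syndrome is split across bits 31..24 (high byte) and bits 54..47 (low byte), which reproduces the f858 printed above. All function and table names are hypothetical, and the wording of the entries that do not appear in this thread is my own.

```python
# Hypothetical helpers for illustration; the field layouts are assumptions
# based on the K8 machine-check error-code encoding.
LEVEL = ["level 0", "level 1", "level 2", "level generic"]            # LL
TRANSACTION = ["instruction", "data", "generic", "reserved"]           # TT
PARTICIPATION = ["local node origin", "local node response",
                 "observed as third party", "generic"]                 # PP
MEM_IO = ["memory access", "reserved", "I/O access", "generic"]         # II
REQUEST = ["generic error", "generic read", "generic write", "data read",
           "data write", "instruction fetch", "prefetch", "evict",
           "snoop"]                                                    # RRRR


def decode_error_code(status):
    """Decode the low 16 bits of STATUS into (error kind, list of details)."""
    code = status & 0xFFFF
    level = LEVEL[code & 0x3]
    if code & 0xFFF0 == 0x0010:           # 0000 0000 0001 TTLL
        return "TLB error", [TRANSACTION[(code >> 2) & 0x3] + " transaction",
                             level]
    if code & 0xF800 == 0x0800:           # 0000 1PPT RRRR IILL
        rrrr = (code >> 4) & 0xF
        return "bus error", [
            PARTICIPATION[(code >> 9) & 0x3],
            "request timed out" if code & 0x100 else "request didn't time out",
            REQUEST[rrrr] if rrrr < len(REQUEST) else "reserved request",
            MEM_IO[(code >> 2) & 0x3],
            level,
        ]
    return "other error", []


def chipkill_syndrome(status):
    """Assumed split: bits 31..24 are the high byte, bits 54..47 the low byte."""
    return (((status >> 24) & 0xFF) << 8) | ((status >> 47) & 0xFF)


if __name__ == "__main__":
    for status in (0xA40000000005001B, 0xD42C4001F8080813):
        kind, details = decode_error_code(status)
        print("%016x: %s '%s'" % (status, kind, ", ".join(details)))
    print("syndrome = %x" % chipkill_syndrome(0xD42C4001F8080813))
```

Only the second STATUS value carries a meaningful syndrome, since the first report is not an ECC error.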

How do I identify the broken memory modules?

        Thomas


* Re: MCE Erros
  2007-03-22 19:50 MCE Erros Thomas Glanzmann
@ 2007-03-22 19:54 ` Thomas Glanzmann
  2007-03-22 19:54 ` Mws
  1 sibling, 0 replies; 3+ messages in thread
From: Thomas Glanzmann @ 2007-03-22 19:54 UTC (permalink / raw)
  To: LKML

Hello,

>         MCE 0
>         HARDWARE ERROR. This is *NOT* a software problem!
>         Please contact your hardware vendor
>         CPU 0 4 northbridge TSC edc587de6e99
>         ADDR 1001a0000
>           Northbridge GART error
>                bit61 = error uncorrected
>           TLB error 'generic transaction, level generic'
>         STATUS a40000000005001b MCGSTATUS 0

> I see this error exactly 8 times. What does 'GART' mean?

dict says GART means 'Graphics Address Remapping Table (AGP)', but I
don't see how that fits into the picture, since this box has no AGP:

        (tomcat-01) [~] lspci
        00:06.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8111 PCI (rev 07)
        00:07.0 ISA bridge: Advanced Micro Devices [AMD] AMD-8111 LPC (rev 05)
        00:07.1 IDE interface: Advanced Micro Devices [AMD] AMD-8111 IDE (rev 03)
        00:07.2 SMBus: Advanced Micro Devices [AMD] AMD-8111 SMBus 2.0 (rev 02)
        00:07.3 Bridge: Advanced Micro Devices [AMD] AMD-8111 ACPI (rev 05)
        00:0a.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8131 PCI-X Bridge (rev 12)
        00:0a.1 PIC: Advanced Micro Devices [AMD] AMD-8131 PCI-X IOAPIC (rev 01)
        00:0b.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8131 PCI-X Bridge (rev 12)
        00:0b.1 PIC: Advanced Micro Devices [AMD] AMD-8131 PCI-X IOAPIC (rev 01)
        00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
        00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
        00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
        00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
        00:19.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
        00:19.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
        00:19.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
        00:19.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
        01:03.0 RAID bus controller: 3ware Inc 7xxx/8xxx-series PATA/SATA-RAID (rev 01)
        02:09.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5704 Gigabit Ethernet (rev 03)
        02:09.1 Ethernet controller: Broadcom Corporation NetXtreme BCM5704 Gigabit Ethernet (rev 03)
        03:00.0 USB Controller: Advanced Micro Devices [AMD] AMD-8111 USB (rev 0b)
        03:00.1 USB Controller: Advanced Micro Devices [AMD] AMD-8111 USB (rev 0b)
        03:06.0 VGA compatible controller: ATI Technologies Inc Rage XL (rev 27)
        03:08.0 Ethernet controller: Intel Corporation 82557/8/9 [Ethernet Pro 100] (rev 10)

        (tomcat-01) [~] dmesg | grep -i agp
        No AGP bridge found
        PCI-DMA: Disabling AGP.
        PCI-DMA: Reserving 64MB of IOMMU area in the AGP aperture
        Linux agpgart interface v0.101 (c) Dave Jones

Greetings,
        Thomas


* Re: MCE Erros
  2007-03-22 19:50 MCE Erros Thomas Glanzmann
  2007-03-22 19:54 ` Thomas Glanzmann
@ 2007-03-22 19:54 ` Mws
  1 sibling, 0 replies; 3+ messages in thread
From: Mws @ 2007-03-22 19:54 UTC (permalink / raw)
  To: linux-kernel; +Cc: Thomas Glanzmann

On Thursday 22 March 2007, Thomas Glanzmann wrote:
> Hello,
> I have two Dual Opteron Machines where I get two MCE errors on. The
> first one is:
> 
>         MCE 0
>         HARDWARE ERROR. This is *NOT* a software problem!
>         Please contact your hardware vendor
>         CPU 0 4 northbridge TSC edc587de6e99
>         ADDR 1001a0000
>           Northbridge GART error
>                bit61 = error uncorrected
>           TLB error 'generic transaction, level generic'
>         STATUS a40000000005001b MCGSTATUS 0
> 
> I see this error exactly 8 times. What does 'GART' mean?

GART stands for Graphics Address Remapping Table. It is used with AGP.

regards
marcel


