środa, 8 lipca 2009

MCE - Machine Check Exceprion

$ parsemce -b 3 -s f62000020002010a -e 5 -a 0000000032c93500

Status: (5) Machine Check in progress.
Restart IP valid.
parsebank(3): f62000020002010a @ 32c93500
External tag parity error
CPU state corrupt. Restart not possible
Address in addr register valid
Error enabled in control register
Error not corrected.
Error overflow
Memory hierarchy error
Request: Generic error
Transaction type : Generic
Memory/IO : I/O

$ parsemce -b 5 -s f20000300c000e0f -e 4 -a 0
Status: (4) Machine Check in progress.
Restart IP invalid.
parsebank(5): f20000300c000e0f @ 0
External tag parity error
CPU state corrupt. Restart not possible
Error enabled in control register
Error not corrected.
Error overflow
Bus and interconnect error
Participation: Generic
Timeout: Request did not timeout
Request: Generic error
Transaction type : Invalid
Memory/IO : Other

My favorite test is cerberus(ctcs). Quite a few OEMs out there
use it to burn in their systems. For me it can typically find a problem
within a few hours. Whereas memtest I've let it run for a week and have
it not find anything useful.

Though the results of cerberus sometimes won't help you pinpoint the
problem(often the result is just a machine crash). But at least you
know there is an issue and can start swapping hardware until it's
fixed(or just replace the whole system).

http://sourceforge.net/projects/va-ctcs/


Have you checked to verify that the fans are spinning?

Since it is a new system, I think you should take it back to your HW
distributor and have them run cerberus(ctcs) on it, as Richard Karhuse
wrote.

If it takes a few days for it to get the Kernel Panic, I doubt that is
related to the OS.

Let your HW distributor do the work of troubleshooting and replacing
whatever component(s) are faulty. They can get a CentOS Live CD and
run that on it.

mcelog te

mcelog --ascii < crashlog.txt

Brak komentarzy:

Prześlij komentarz

Archiwum bloga