[SunHELP] Can u tell me whether the hardware is faulty ??

Lund, Dennis sunhelp at sunhelp.org
Tue Jan 30 15:40:33 CST 2001


This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C08B05.46042CF0
Content-Type: text/plain

400Mhz 8M cache CPU's are the ones that have a problem.

-----Original Message-----
From: Steve Pribyl [mailto:spribyl at enteract.com]
Sent: Tuesday, January 30, 2001 11:00 AM
To: 'sunhelp at sunhelp.org'
Subject: RE: [SunHELP] Can u tell me whether the hardware is faulty ??


I have heard that sun ultra cpu memory can't catch double bit parity
errors. This can cause undetected data corruption and will cause the box
to crash at some point.

However I don't know what ultra cpu's this applies to.  You should ask sun
about this.  They do have a fix.

The cause for these cases would really be sun spots and solar storms.

Steve Pribyl
spribyl at enteract.com
http://www.enteract.com/~spribyl

On Tue, 30 Jan 2001, Nenno, Tim wrote:

> We've had the same problem with a 5149-08 440Mhz IIi on an Ultra 10. That
> same panic message, plus lots of other CPU-related
freezes/crashes/reboots.
> The machine would stay up anywhere from a few minutes to a week before
> failing.
>  
> The CPU's been replaced four times since November. One of them was DOA.
The
> third was a week ago, the fourth yesterday. Yesterday's engineer replaced
> the system board, too, on the off-chance that a glitch there might be
> triggering the CPU problem.
>  
> In the course of all that, he tested the power supply, says he found a
> problem there that might be causing the CPU problem, and so replaced the
> power supply.
>  
> So, if the machine holds up, we won't know what the source of the problem
> actually was. (Not that I'll want to know it was the power supply all
> along....)
> 
> I have 220R with 2*450Mhz processors and 1Gb of Memeory..OS is Solaris-7
> Yesterday the system crashed with a core dump and i got the following
> messages
> logged in /var/adm/messages:::
>  
> unix: panic[cpu0]/thread=300002216a0:
> 
> CPU0 Ecache Writeback Data Parity Error: AFSR 0x00000000.00800004 AFAR
> 0x000001fe.01800800
>  
> Savecore: reboot after panic: CPU0 Ecache Writeback Data Parity Error:
AFSR
> 0x00000000.00800004 AFAR 0x000001fe.01800800
>  
> Could you possibly tell me what went wrong ???
> 
> 

_______________________________________________
SunHELP maillist  -  SunHELP at sunhelp.org
http://www.sunhelp.org/mailman/listinfo/sunhelp


     - - - - - - -  Appended by Scientific-Atlanta, Inc.  - - - - - - -  
This e-mail and any attachments may contain information which is
confidential, proprietary, privileged or otherwise protected by law. The
information is solely intended for the named addressee (or a person
responsible for delivering it to the addressee). If you are not the intended
recipient of this message, you are not authorized to read, print, retain,
copy or disseminate this message or any part of it. If you have received
this e-mail in error, please notify the sender immediately by return e-mail
and delete it from your computer. 



------_=_NextPart_001_01C08B05.46042CF0
Content-Type: text/html

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">
<META NAME="Generator" CONTENT="MS Exchange Server version 5.5.2650.12">
<TITLE>RE: [SunHELP] Can u tell me whether the hardware is faulty ??</TITLE>
</HEAD>
<BODY>

<P><FONT SIZE=2>400Mhz 8M cache CPU's are the ones that have a problem.</FONT>
</P>

<P><FONT SIZE=2>-----Original Message-----</FONT>
<BR><FONT SIZE=2>From: Steve Pribyl [<A HREF="mailto:spribyl at enteract.com">mailto:spribyl at enteract.com</A>]</FONT>
<BR><FONT SIZE=2>Sent: Tuesday, January 30, 2001 11:00 AM</FONT>
<BR><FONT SIZE=2>To: 'sunhelp at sunhelp.org'</FONT>
<BR><FONT SIZE=2>Subject: RE: [SunHELP] Can u tell me whether the hardware is faulty ??</FONT>
</P>
<BR>

<P><FONT SIZE=2>I have heard that sun ultra cpu memory can't catch double bit parity</FONT>
<BR><FONT SIZE=2>errors. This can cause undetected data corruption and will cause the box</FONT>
<BR><FONT SIZE=2>to crash at some point.</FONT>
</P>

<P><FONT SIZE=2>However I don't know what ultra cpu's this applies to.  You should ask sun</FONT>
<BR><FONT SIZE=2>about this.  They do have a fix.</FONT>
</P>

<P><FONT SIZE=2>The cause for these cases would really be sun spots and solar storms.</FONT>
</P>

<P><FONT SIZE=2>Steve Pribyl</FONT>
<BR><FONT SIZE=2>spribyl at enteract.com</FONT>
<BR><FONT SIZE=2><A HREF="http://www.enteract.com/~spribyl" TARGET="_blank">http://www.enteract.com/~spribyl</A></FONT>
</P>

<P><FONT SIZE=2>On Tue, 30 Jan 2001, Nenno, Tim wrote:</FONT>
</P>

<P><FONT SIZE=2>> We've had the same problem with a 5149-08 440Mhz IIi on an Ultra 10. That</FONT>
<BR><FONT SIZE=2>> same panic message, plus lots of other CPU-related freezes/crashes/reboots.</FONT>
<BR><FONT SIZE=2>> The machine would stay up anywhere from a few minutes to a week before</FONT>
<BR><FONT SIZE=2>> failing.</FONT>
<BR><FONT SIZE=2>>  </FONT>
<BR><FONT SIZE=2>> The CPU's been replaced four times since November. One of them was DOA. The</FONT>
<BR><FONT SIZE=2>> third was a week ago, the fourth yesterday. Yesterday's engineer replaced</FONT>
<BR><FONT SIZE=2>> the system board, too, on the off-chance that a glitch there might be</FONT>
<BR><FONT SIZE=2>> triggering the CPU problem.</FONT>
<BR><FONT SIZE=2>>  </FONT>
<BR><FONT SIZE=2>> In the course of all that, he tested the power supply, says he found a</FONT>
<BR><FONT SIZE=2>> problem there that might be causing the CPU problem, and so replaced the</FONT>
<BR><FONT SIZE=2>> power supply.</FONT>
<BR><FONT SIZE=2>>  </FONT>
<BR><FONT SIZE=2>> So, if the machine holds up, we won't know what the source of the problem</FONT>
<BR><FONT SIZE=2>> actually was. (Not that I'll want to know it was the power supply all</FONT>
<BR><FONT SIZE=2>> along....)</FONT>
<BR><FONT SIZE=2>> </FONT>
<BR><FONT SIZE=2>> I have 220R with 2*450Mhz processors and 1Gb of Memeory..OS is Solaris-7</FONT>
<BR><FONT SIZE=2>> Yesterday the system crashed with a core dump and i got the following</FONT>
<BR><FONT SIZE=2>> messages</FONT>
<BR><FONT SIZE=2>> logged in /var/adm/messages:::</FONT>
<BR><FONT SIZE=2>>  </FONT>
<BR><FONT SIZE=2>> unix: panic[cpu0]/thread=300002216a0:</FONT>
<BR><FONT SIZE=2>> </FONT>
<BR><FONT SIZE=2>> CPU0 Ecache Writeback Data Parity Error: AFSR 0x00000000.00800004 AFAR</FONT>
<BR><FONT SIZE=2>> 0x000001fe.01800800</FONT>
<BR><FONT SIZE=2>>  </FONT>
<BR><FONT SIZE=2>> Savecore: reboot after panic: CPU0 Ecache Writeback Data Parity Error: AFSR</FONT>
<BR><FONT SIZE=2>> 0x00000000.00800004 AFAR 0x000001fe.01800800</FONT>
<BR><FONT SIZE=2>>  </FONT>
<BR><FONT SIZE=2>> Could you possibly tell me what went wrong ???</FONT>
<BR><FONT SIZE=2>> </FONT>
<BR><FONT SIZE=2>> </FONT>
</P>

<P><FONT SIZE=2>_______________________________________________</FONT>
<BR><FONT SIZE=2>SunHELP maillist  -  SunHELP at sunhelp.org</FONT>
<BR><FONT SIZE=2><A HREF="http://www.sunhelp.org/mailman/listinfo/sunhelp" TARGET="_blank">http://www.sunhelp.org/mailman/listinfo/sunhelp</A></FONT>
</P>
<BR>

<P>     - - - - - - -  Appended by Scientific-Atlanta, Inc.  - - - - - - -  
<BR>This e-mail and any attachments may contain information which is confidential, proprietary, privileged or otherwise protected by law. The information is solely intended for the named addressee (or a person responsible for delivering it to the addressee). If you are not the intended recipient of this message, you are not authorized to read, print, retain, copy or disseminate this message or any part of it. If you have received this e-mail in error, please notify the sender immediately by return e-mail and delete it from your computer. </P>
<BR>

</BODY>
</HTML>
------_=_NextPart_001_01C08B05.46042CF0--



More information about the SunHELP mailing list