[rescue] Now what, SGI Challenge L Department

Sheldon T. Hall shel at cmhcsys.com
Fri Jan 16 14:34:57 CST 2004


OK, so I'm not getting enough grief from the Indigo2 ...

I'm getting memory errors on my Challenge L.  So far, there's not much of a
pattern, and none have been uncorrectible.  I'm getting one every few days.
A while back,I saw one in the log and started filtering for them; since
then, it seems to have one every 3 to 5 days:

12-17-03: Single Bit Error on physical addr. 0x582d6180, in slot 2 leaf 1
bank 7 was a read transient
12-19-03: Single Bit Error on physical addr. 0x256e9200, in slot 3 leaf 0
bank 1 CORRECTED by scrubbing
[machine was off for much of ther time until ...]
01-02-04: Single Bit Error on physical addr. 0x2c3c9c80, in slot 3 leaf 1
bank 5 CORRECTED by scrubbing
01-05-04: Single Bit Error on physical addr. 0x50611e80, in slot 4 leaf 1
bank 6 was a read transient
01-08-04: Single Bit Error on physical addr. 0x52692680, in slot 4 leaf 1
bank 6 was a read transient
01-11-04: Single Bit Error on physical addr. 0x1dfac200, in slot 2 leaf 0
bank 2 CORRECTED by scrubbing
01-16-04: Single Bit Error on physical addr. 0x58a63280, in slot 4 leaf 1
bank 6 was a read transient

Now, the machine's got ECC, this doesn't seem to be hurting anything, and,
except for 4,1,6 there seems to be no particular pattern. Should I be
worried?  I'm planning to re-seat all the SIMMs and boards when I get a
chance; is there anything else I should do?

-Shel



More information about the rescue mailing list