[rescue] weird Opteron 865 / Tyan 4882 problem

Phil Stracchino alaric at metrocast.net
Tue Jul 14 21:32:49 CDT 2009


Patrick Giagnocavo wrote:
> Jonathan Patschke wrote:
>> On Tue, 14 Jul 2009, Patrick Giagnocavo wrote:
>>
>>> My thinking is that either:
>>>
>>> 1.  CPU1 is partially busted (specifically its memory controller) and
>>> should be replaced
>> Swap CPU0 and CPU1 to see if the problem follows the CPU.
> 
> Each CPU is connected with a heat pipe to another one, e.g.
> 
> CPU0 <--> CPU2
> CPU1 <--> CPU3

Can you swap the PAIRS of CPUs together and not have to de-bond the heat
pipes?

> Looks like I may have to hold off on deployment until I can fully debug
> this sucker.
> 
>>> 2.  There is a problem with the DIMM slots itself, like maybe a resistor
>>> or some other electrical channel problem.
>> Swap the memory between CPU0 and CPU1 to see if the problem follows the
>> memory.
> 
> I have swapped the memory, it works in the other slot.

Do you mean CPU1's RAM works on CPU0, or that it'll boot with either
pair of RAM slots for CPU1 filled with either pair of modules, but not
with all four filled regardless of which modules are in them??


It sounds pretty clear you have a fault either on the board or on CPU1,
but so far there isn't enough information to nail down which of those it is.


-- 
  Phil Stracchino, CDK#2     DoD#299792458     ICBM: 43.5607, -71.355
  alaric at caerllewys.net   alaric at metrocast.net   phil at co.ordinate.org
         Renaissance Man, Unix ronin, Perl hacker, Free Stater
                 It's not the years, it's the mileage.



More information about the rescue mailing list