[SunHELP] WARNING: [AFT1] WP event on CPU1
DAUBIGNE Sebastien - BOR ( SDaubigne@bordeaux-bersol.sema.slb.com )
SDaubigne at bordeaux-bersol.sema.slb.com
Wed Oct 16 03:28:53 CDT 2002
"Syndrome 0x3 indicates that this may not be a memory module problem"
It seems you encountered a Ecache error (due to a poor UltraSparcII CPU
design).
We got many similar errors on our E6500 (10 in one year).
Finally Sun replaced each UltraSparcII CPU with "Sombra" model with
"mirrored E-cache".
Evything goes right now.
---
Sebastien DAUBIGNE
sdaubigne at bordeaux-bersol.sema.slb.com <mailto:sebastien.daubigne at sema.fr>
- (+33)5.57.26.56.36
SchlumbergerSema - SGS/DWH/Pessac
-----Message d'origine-----
De: Chris Hall [SMTP:chall at verio.net]
Date: mardi 15 octobre 2002 06:59
@: sunhelp at sunhelp.org
Objet: [SunHELP] WARNING: [AFT1] WP event on CPU1
Hello,
One of our systems crahsed/rebooted today and I just wanted to
make sure i am heading
in the right direction. This is the first time this has happened and
i saw nothing on sunsolve
about these errors. I reseated the CPU's and memory modules as
suggested on many other simalar
posts. I suppose i'll need to contact sun, but i just wanted to post
the info here to see if
anyone had any addidtional information on what i can do to track or
perhaps fix this problem.
Is this CPU or a Memory Problem ?? Mabee both ?
Thanks in advance
Chris H.
uname -a
SunOS hostname 5.8 Generic_108528-12 sun4u sparc SUNW,Ultra-80
--------------
adb -k unix.0 vmcore.0
physmem 3e123
$c
panicsys(10423630,2a10011d790,2a10011d548,78002000,104381e8,f) + 44
vpanic(2a10011d548,2a10011d790,3c,104381b8,0,2a10011d573) + cc
vcmn_err(3,2a10011d548,2a10011d790,3,81010100,ff00) + 18
cpu_aflt_log(2a10011d54e,1,10146ad8,2a10011d6d8,2a10011d59b,10146b00) + 4e0
cpu_async_error(104597f0,2a10011d7a0,80200000,0,650180080200000,2a10011d960)
+ 868
prom_rtt(31002a1c640,1,20,0,0,0)
fsflush(3651,3000e02d598,31002a1c640,30001758008,10439490,1041ad30)
+ 3e4
thread_start(0,0,0,0,0,0) + 4
$q
--------------
prtdiag:
System Configuration: Sun Microsystems sun4u Sun Enterprise 420R
(4 X UltraSPARC-II 450MHz)
System clock frequency: 113 MHz
Memory size: 2048 Megabytes
========================= CPUs =========================
Run Ecache CPU CPU
Brd CPU Module MHz MB Impl. Mask
--- --- ------- ----- ------ ------ ----
0 0 0 450 4.0 US-II 10.0
0 1 1 450 4.0 US-II 10.0
0 2 2 450 4.0 US-II 10.0
0 3 3 450 4.0 US-II 10.0
========================= IO Cards =========================
Bus Freq
Brd Type MHz Slot Name Model
--- ---- ---- ---- --------------------------------
----------------------
0 PCI 33 1 network-SUNW,hme
0 PCI 33 3 scsi-glm/disk (block)
Symbios,53C875
0 PCI 33 3 scsi-glm/disk (block)
Symbios,53C875
No failures found in System
===========================
----------------
/var/adm/messages:
[...]
Oct 14 22:03:13 hostname AFSR 0x00000000.00800001<WP> AFAR
0x00000003.41eab780
Oct 14 22:03:13 hostname AFSR.PSYND 0x0001(Score 95) AFSR.ETS
0x00 Fault_PC 0x1009421c
Oct 14 22:03:13 hostname UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000
UDBL.ESYND 0x00
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 700935
kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU2 Data7
+access at TL=0, errID 0x000b1064.1f8dc245
Oct 14 22:03:16 hostname AFSR 0x00000000.80200000<PRIV,UE> AFAR
0x00000000.7ba1c648
Oct 14 22:03:16 hostname AFSR.PSYND 0x0000(Score 05) AFSR.ETS
0x00 Fault_PC 0x10023c1c
Oct 14 22:03:16 hostname UDBH 0x0000 UDBH.ESYND 0x00 UDBL
0x0203<UE> UDBL.ESYND 0x03
Oct 14 22:03:16 hostname UDBL Syndrome 0x3 Memory Module U1402
U0402 U1401 U0401
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 625512
kern.warning] WARNING: [AFT1] errID 0x000b1064.1f8dc245 Syndrome 0x3
+indicates that this may not be a memory module problem
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 353974 kern.info]
[AFT2] errID 0x000b1064.1f8dc245 PA=0x00000000.7ba1c648
Oct 14 22:03:16 hostname E$tag 0x00000000.1ec00f74 E$State:
Exclusive E$parity 0x0f
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x00): 0x00000300.02f39e38
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 989652 kern.info]
[AFT2] E$Data (0x08): 0x00000310.01c68b61 *Bad* PSYND=0x00ff
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x10): 0x00000310.01c003a0
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x18): 0x00000310.023dc260
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x20): 0x00000310.02a1c640
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x28): 0x00000310.02a1c640
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x30): 0x00000000.1c2c4000
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x38): 0x00000000.00000000
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 189748 kern.info]
[AFT3] errID 0x000b1064.1f8dc245: cannot schedule clearing of+error on page
0x00000000.7ba1c000; page not in VM system
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 647932 kern.info]
[AFT3] errID 0x000b1064.1f8dc245 Above Error detected by
+protected Kernel code
Oct 14 22:03:16 hostname that will try to clear error from
system
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 618470
kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU2 Data
+access at TL=0, errID 0x000b1064.20c25222
Oct 14 22:03:16 hostname AFSR 0x00000000.80200000<PRIV,UE> AFAR
0x00000000.7ba1c648
Oct 14 22:03:16 hostname AFSR.PSYND 0x0000(Score 05) AFSR.ETS
0x00 Fault_PC 0x10023c1c
Oct 14 22:03:16 hostname UDBH 0x0000 UDBH.ESYND 0x00 UDBL
0x0203<UE> UDBL.ESYND 0x03
Oct 14 22:03:16 hostname UDBL Syndrome 0x3 Memory Module U1402
U0402 U1401 U0401
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 659561
kern.warning] WARNING: [AFT1] errID 0x000b1064.20c25222 Syndrome 0x3
+indicates that this may not be a memory module problem
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 927616 kern.info]
[AFT2] errID 0x000b1064.20c25222 PA=0x00000000.7ba1c648
Oct 14 22:03:16 hostname E$tag 0x00000000.1ec00f74 E$State:
Exclusive E$parity 0x0f
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x00): 0x00000300.02f39e38
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 989652 kern.info]
[AFT2] E$Data (0x08): 0x00000310.01c68b61 *Bad* PSYND=0x00ff
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x10): 0x00000310.01c003a0
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x18): 0x00000310.023dc260
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x20): 0x00000310.02a1c640
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x28): 0x00000310.02a1c640
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x30): 0x00000000.1c2c4000
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x38): 0x00000000.00000000
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 667191 kern.info]
[AFT3] errID 0x000b1064.20c25222: cannot schedule clearing of+error on page
0x00000000.7ba1c000; page not in VM system
Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 565467 kern.info]
[AFT3] errID 0x000b1064.20c25222 Above Error detected by
+protected Kernel code
Oct 14 22:03:16 hostname that will try to clear error from
system
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 733957
kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU1 Data
+access at TL=0, errID 0x000b1069.1450b97d
Oct 14 22:03:38 hostname AFSR 0x00000000.80200000<PRIV,UE> AFAR
0x00000000.7ba1c648
Oct 14 22:03:38 hostname AFSR.PSYND 0x0000(Score 05) AFSR.ETS
0x00 Fault_PC 0x1009420c
Oct 14 22:03:38 hostname UDBH 0x0000 UDBH.ESYND 0x00 UDBL
0x0203<UE> UDBL.ESYND 0x03
Oct 14 22:03:38 hostname UDBL Syndrome 0x3 Memory Module U1402
U0402 U1401 U0401
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 181457
kern.warning] WARNING: [AFT1] errID 0x000b1069.1450b97d Syndrome 0x3
+indicates that this may not be a memory module problem
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 309063 kern.info]
[AFT2] errID 0x000b1069.1450b97d PA=0x00000000.7ba1c648
Oct 14 22:03:38 hostname E$tag 0x00000000.1ec00f74 E$State:
Exclusive E$parity 0x0f
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x00): 0x00000300.02f39e38
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 989652 kern.info]
[AFT2] E$Data (0x08): 0x00000310.01c68b61 *Bad* PSYND=0x00ff
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x10): 0x00000310.01c003a0
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x18): 0x00000310.023dc260
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x20): 0x00000310.02a1c640
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x28): 0x00000310.02a1c640
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x30): 0x00000000.1c2c4000
Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info]
[AFT2] E$Data (0x38): 0x00000000.00000000
Oct 14 22:03:38 hostname unix: [ID 836849 kern.notice]
Oct 14 22:03:38 hostname ^Mpanic[cpu1]/thread=300017537c0:
Oct 14 22:03:38 hostname unix: [ID 276526 kern.notice] [AFT1] errID
0x000b1069.1450b97d UE Error(s)
Oct 14 22:03:38 hostname See previous message(s) for details
[...]
_______________________________________________
SunHELP maillist - SunHELP at sunhelp.org
http://www.sunhelp.org/mailman/listinfo/sunhelp
More information about the SunHELP
mailing list