[SunHELP] WARNING: [AFT1] WP event on CPU1

DAUBIGNE Sebastien - BOR SDaubigne at bordeaux-bersol.sema.slb.com
Wed Oct 16 03:28:53 CDT 2002


 "Syndrome 0x3 indicates that this may not be a memory module problem"

It seems you encountered an E-cache (external cache) error, caused by a
weakness in the UltraSPARC-II CPU design.
We got many similar errors on our E6500 (10 in one year).
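
For reference, the flag names the kernel prints in angle brackets after the
AFSR and UDBL values (<WP>, <PRIV,UE>, <UE>) can be recovered from the raw hex
words. The Python sketch below is not from Sun. The bit positions are an
assumption based on the UltraSPARC-II error register layout as I understand
it, and only PRIV, UE, WP and the syndrome fields are cross-checked against
the log in this thread. The names AFSR_FLAGS, parse_reg, decode_afsr and
decode_udb are made up for illustration.

```python
# Assumed UltraSPARC-II AFSR flag bits (bit number, name). Only PRIV, UE and WP
# are confirmed by the log lines quoted in this thread.
AFSR_FLAGS = [
    (32, "ME"), (31, "PRIV"), (30, "ISAP"), (29, "ETP"), (28, "IVUE"),
    (27, "TO"), (26, "BERR"), (25, "LDP"), (24, "CP"), (23, "WP"),
    (22, "EDP"), (21, "UE"), (20, "CE"),
]


def parse_reg(text):
    """Convert a 'high.low' hex pair such as '0x00000000.80200000' to an int."""
    return int(text.replace(".", ""), 16)


def decode_afsr(text):
    """Split an AFSR value into flag names, ETS and PSYND fields."""
    value = parse_reg(text)
    flags = [name for bit, name in AFSR_FLAGS if (value >> bit) & 1]
    return {
        "flags": flags,
        "ets": (value >> 16) & 0xF,   # E-cache tag parity syndrome
        "psynd": value & 0xFFFF,      # data parity syndrome
    }


def decode_udb(value):
    """Split a UDBH/UDBL value into UE/CE flags and the ECC syndrome."""
    return {
        "ue": bool(value & 0x200),    # assumed: bit 9 = uncorrectable error
        "ce": bool(value & 0x100),    # assumed: bit 8 = correctable error
        "esynd": value & 0xFF,
    }


if __name__ == "__main__":
    print(decode_afsr("0x00000000.00800001"))
    print(decode_afsr("0x00000000.80200000"))
    print(decode_udb(0x0203))
```

Applied to the values in the quoted log, decode_afsr("0x00000000.00800001")
reports WP with PSYND 0x0001, and decode_udb(0x0203) reports UE with ESYND
0x03, which matches what the kernel printed.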

Eventually Sun replaced each UltraSPARC-II CPU with the "Sombra" model, which
has a "mirrored E-cache".
Everything has been running fine since.
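
To keep track of how often these events recur and on which CPU, something like
the following Python sketch can tally the [AFT1] warnings in a saved copy of
/var/adm/messages. It is a minimal example that assumes the message wording
shown in this thread ("[AFT1] <event> on CPU<n>"). The names AFT1_EVENT and
tally_aft1 and the sample text are hypothetical.

```python
import re
from collections import Counter

# Matches e.g. "[AFT1] Uncorrectable Memory Error on CPU2" or
# "[AFT1] WP event on CPU1". Follow-up lines such as
# "[AFT1] errID ... Syndrome 0x3" do not match and are ignored.
AFT1_EVENT = re.compile(
    r"\[AFT1\]\s+(?P<kind>[A-Za-z ]+?)\s+on\s+(?P<cpu>CPU\d+)"
)


def tally_aft1(text):
    """Count [AFT1] events per (CPU, event kind) in a block of log text."""
    return Counter((m["cpu"], m["kind"]) for m in AFT1_EVENT.finditer(text))


# Sample lines modelled on the log quoted in this thread (illustrative only).
SAMPLE = """\
kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU2 Data
kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU2 Data
kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU1 Data
kern.warning] WARNING: [AFT1] errID 0x000b1069.1450b97d Syndrome 0x3
"""

if __name__ == "__main__":
    for (cpu, kind), count in sorted(tally_aft1(SAMPLE).items()):
        print(cpu, kind, count)
```

In practice the text would come from reading the messages file instead of
SAMPLE. A count that keeps growing on one CPU module is the kind of evidence
worth attaching to a service call.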


---
Sebastien DAUBIGNE
sdaubigne at bordeaux-bersol.sema.slb.com <mailto:sebastien.daubigne at sema.fr>
- (+33)5.57.26.56.36
SchlumbergerSema - SGS/DWH/Pessac


	-----Original Message-----
	From:	Chris Hall [SMTP:chall at verio.net]
	Date:	Tuesday, October 15, 2002 06:59
	To:	sunhelp at sunhelp.org
	Subject:	[SunHELP] WARNING: [AFT1] WP event on CPU1

	Hello,

	     One of our systems crashed/rebooted today and I just wanted to
	make sure I am heading in the right direction. This is the first time
	this has happened and I saw nothing on SunSolve about these errors. I
	reseated the CPUs and memory modules as suggested in many other similar
	posts. I suppose I'll need to contact Sun, but I just wanted to post
	the info here to see if anyone has any additional information on what
	I can do to track or perhaps fix this problem.
	Is this a CPU or a memory problem? Maybe both?

	Thanks in advance
	Chris H.


	uname -a
	SunOS hostname 5.8 Generic_108528-12 sun4u sparc SUNW,Ultra-80

	--------------

	  adb -k unix.0 vmcore.0
	physmem 3e123
	$c
	panicsys(10423630,2a10011d790,2a10011d548,78002000,104381e8,f) + 44
	vpanic(2a10011d548,2a10011d790,3c,104381b8,0,2a10011d573) + cc
	vcmn_err(3,2a10011d548,2a10011d790,3,81010100,ff00) + 18
	cpu_aflt_log(2a10011d54e,1,10146ad8,2a10011d6d8,2a10011d59b,10146b00) + 4e0
	cpu_async_error(104597f0,2a10011d7a0,80200000,0,650180080200000,2a10011d960) + 868
	prom_rtt(31002a1c640,1,20,0,0,0)
	fsflush(3651,3000e02d598,31002a1c640,30001758008,10439490,1041ad30) + 3e4
	thread_start(0,0,0,0,0,0) + 4
	$q

	--------------

	prtdiag:

	System Configuration:  Sun Microsystems  sun4u Sun Enterprise 420R (4 X UltraSPARC-II 450MHz)
	System clock frequency: 113 MHz
	Memory size: 2048 Megabytes

	========================= CPUs =========================

	                     Run   Ecache   CPU    CPU
	Brd  CPU   Module   MHz     MB    Impl.   Mask
	---  ---  -------  -----  ------  ------  ----
	  0     0     0      450     4.0   US-II    10.0
	  0     1     1      450     4.0   US-II    10.0
	  0     2     2      450     4.0   US-II    10.0
	  0     3     3      450     4.0   US-II    10.0


	========================= IO Cards =========================

	      Bus   Freq
	Brd  Type  MHz   Slot  Name                              Model
	---  ----  ----  ----  --------------------------------  ----------------------
	  0   PCI    33     1   network-SUNW,hme
	  0   PCI    33     3   scsi-glm/disk (block)            Symbios,53C875
	  0   PCI    33     3   scsi-glm/disk (block)            Symbios,53C875

	No failures found in System
	===========================

	----------------

	/var/adm/messages:

	[...]

	Oct 14 22:03:13 hostname     AFSR 0x00000000.00800001<WP> AFAR 0x00000003.41eab780
	Oct 14 22:03:13 hostname     AFSR.PSYND 0x0001(Score 95) AFSR.ETS 0x00 Fault_PC 0x1009421c
	Oct 14 22:03:13 hostname     UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000 UDBL.ESYND 0x00
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 700935 kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU2 Data7
	+access at TL=0, errID 0x000b1064.1f8dc245
	Oct 14 22:03:16 hostname     AFSR 0x00000000.80200000<PRIV,UE> AFAR 0x00000000.7ba1c648
	Oct 14 22:03:16 hostname     AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x10023c1c
	Oct 14 22:03:16 hostname     UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0203<UE> UDBL.ESYND 0x03
	Oct 14 22:03:16 hostname     UDBL Syndrome 0x3 Memory Module U1402 U0402 U1401 U0401
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 625512 kern.warning] WARNING: [AFT1] errID 0x000b1064.1f8dc245 Syndrome 0x3
	+indicates that this may not be a memory module problem
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 353974 kern.info] [AFT2] errID 0x000b1064.1f8dc245 PA=0x00000000.7ba1c648
	Oct 14 22:03:16 hostname     E$tag 0x00000000.1ec00f74 E$State: Exclusive E$parity 0x0f
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x00): 0x00000300.02f39e38
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2] E$Data (0x08): 0x00000310.01c68b61 *Bad* PSYND=0x00ff
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x10): 0x00000310.01c003a0
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x18): 0x00000310.023dc260
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x20): 0x00000310.02a1c640
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x28): 0x00000310.02a1c640
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x30): 0x00000000.1c2c4000
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x38): 0x00000000.00000000
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 189748 kern.info] [AFT3] errID 0x000b1064.1f8dc245: cannot schedule clearing of+error on page 0x00000000.7ba1c000; page not in VM system
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 647932 kern.info] [AFT3] errID 0x000b1064.1f8dc245 Above Error detected by
	+protected Kernel code
	Oct 14 22:03:16 hostname     that will try to clear error from system
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 618470 kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU2 Data
	+access at TL=0, errID 0x000b1064.20c25222
	Oct 14 22:03:16 hostname     AFSR 0x00000000.80200000<PRIV,UE> AFAR 0x00000000.7ba1c648
	Oct 14 22:03:16 hostname     AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x10023c1c
	Oct 14 22:03:16 hostname     UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0203<UE> UDBL.ESYND 0x03
	Oct 14 22:03:16 hostname     UDBL Syndrome 0x3 Memory Module U1402 U0402 U1401 U0401
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 659561 kern.warning] WARNING: [AFT1] errID 0x000b1064.20c25222 Syndrome 0x3
	+indicates that this may not be a memory module problem
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 927616 kern.info] [AFT2] errID 0x000b1064.20c25222 PA=0x00000000.7ba1c648
	Oct 14 22:03:16 hostname     E$tag 0x00000000.1ec00f74 E$State: Exclusive E$parity 0x0f
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x00): 0x00000300.02f39e38
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2] E$Data (0x08): 0x00000310.01c68b61 *Bad* PSYND=0x00ff
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x10): 0x00000310.01c003a0
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x18): 0x00000310.023dc260
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x20): 0x00000310.02a1c640
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x28): 0x00000310.02a1c640
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x30): 0x00000000.1c2c4000
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x38): 0x00000000.00000000
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 667191 kern.info] [AFT3] errID 0x000b1064.20c25222: cannot schedule clearing of+error on page 0x00000000.7ba1c000; page not in VM system
	Oct 14 22:03:16 hostname SUNW,UltraSPARC-II: [ID 565467 kern.info] [AFT3] errID 0x000b1064.20c25222 Above Error detected by
	+protected Kernel code
	Oct 14 22:03:16 hostname     that will try to clear error from system
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 733957 kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU1 Data
	+access at TL=0, errID 0x000b1069.1450b97d
	Oct 14 22:03:38 hostname     AFSR 0x00000000.80200000<PRIV,UE> AFAR 0x00000000.7ba1c648
	Oct 14 22:03:38 hostname     AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x1009420c
	Oct 14 22:03:38 hostname     UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0203<UE> UDBL.ESYND 0x03
	Oct 14 22:03:38 hostname     UDBL Syndrome 0x3 Memory Module U1402 U0402 U1401 U0401
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 181457 kern.warning] WARNING: [AFT1] errID 0x000b1069.1450b97d Syndrome 0x3
	+indicates that this may not be a memory module problem
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 309063 kern.info] [AFT2] errID 0x000b1069.1450b97d PA=0x00000000.7ba1c648
	Oct 14 22:03:38 hostname     E$tag 0x00000000.1ec00f74 E$State: Exclusive E$parity 0x0f
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x00): 0x00000300.02f39e38
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2] E$Data (0x08): 0x00000310.01c68b61 *Bad* PSYND=0x00ff
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x10): 0x00000310.01c003a0
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x18): 0x00000310.023dc260
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x20): 0x00000310.02a1c640
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x28): 0x00000310.02a1c640
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x30): 0x00000000.1c2c4000
	Oct 14 22:03:38 hostname SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x38): 0x00000000.00000000
	Oct 14 22:03:38 hostname unix: [ID 836849 kern.notice]
	Oct 14 22:03:38 hostname ^Mpanic[cpu1]/thread=300017537c0:
	Oct 14 22:03:38 hostname unix: [ID 276526 kern.notice] [AFT1] errID 0x000b1069.1450b97d UE Error(s)
	Oct 14 22:03:38 hostname     See previous message(s) for details

	[...]
	_______________________________________________
	SunHELP maillist  -  SunHELP at sunhelp.org
	http://www.sunhelp.org/mailman/listinfo/sunhelp


