[SunHELP] System Hang


Mon Apr 29 13:22:26 CDT 2002


I am having a problem on a E450 where occassionally (once a week at least)
the system hangs.  The only way to get the system back up is to do STOP-A,
or in some cases you have to turn the key off, then back on.

I believe I am having a problem with the system board.  I see the following
messages in /var/adm/messages when this happens:

Solaris 2.5.1

Apr 26 12:11:48 dncs unix: NOTICE: UNI failover: unit 0 failed
Apr 26 12:11:57 dncs ilmid: ILMI received a coldStart Trap on port 0
Apr 26 12:11:57 dncs ilmid: ILMI deregistered prefix
0x47.0005.80.ffe100.0000.f21c.5a74 on port 0
Apr 26 12:11:57 dncs ilmid: ILMI deregistered ATM address
0x47.0005.80.ffe100.0000.f21c.5a74.00204840dab4
.00 on port 0
Apr 26 12:11:57 dncs ilmid: ILMI deregistered ATM address
0x47.0005.80.ffe100.0000.f21c.5a74.02204840dab4
.00 on port 0
Apr 26 12:11:57 dncs ilmid: ILMI attempting to register ATM address
0x47.0005.80.ffe100.0000.f21c.5a74.02
204840dab4.00 on port 0
Apr 26 12:11:57 dncs ilmid: ILMI registered ATM address
0x47.0005.80.ffe100.0000.f21c.5a74.02204840dab4.0
0 on port 0
Apr 26 12:11:57 dncs ilmid: ILMI registered prefix
0x47.0005.80.ffe100.0000.f21c.5a74 on port 0
Apr 26 12:11:57 dncs ilmid: ILMI attempting to register ATM address
0x47.0005.80.ffe100.0000.f21c.5a74.00
204840dab4.00 on port 0
Apr 26 12:12:02 dncs unix: NOTICE: ELAN dncsdata (el0): LES connection
dropped
Apr 26 12:12:33 dncs unix: NOTICE: ELAN dncsdata (el0): failed to connect to
LES
Apr 26 12:28:13 dncs unix: glm2:        Cmd (0x618ea840) dump for Target 3
Lun 0:
Apr 26 12:29:11 dncs unix: glm2:                cdb=[ 0x28 0x0 0x0 0xd5 0x42
0xc 0x0 0x0 0xc 0x0 ]
Apr 26 12:29:11 dncs unix: glm2:        pkt_flags=0x4000 pkt_statistics=0x61
pkt_state=0x7
Apr 26 12:29:11 dncs unix: glm2:        pkt_scbp=0x0 cmd_flags=0x8e1
Apr 26 12:29:11 dncs unix: WARNING: /pci at 6,4000/scsi at 2 (glm2):
Apr 26 12:29:11 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 3.0
Apr 26 12:29:11 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 12:29:11 dncs unix: WARNING: /pci at 6,4000/scsi at 2/sd at 3,0 (sd33):
Apr 26 12:29:12 dncs unix:      SCSI transport failed: reason 'timeout':
retrying command
Apr 26 12:29:12 dncs unix: 
Apr 26 12:29:12 dncs unix: glm3:        Cmd (0x618f16e0) dump for Target 0
Lun 0:
Apr 26 12:29:12 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 12:29:12 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 12:29:12 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 12:29:12 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 12:29:12 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 12:29:12 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 12:29:12 dncs unix: glm3:        Cmd (0x618f16e0) dump for Target 0
Lun 0:
Apr 26 12:29:12 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 12:29:12 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 12:29:12 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 12:29:12 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 12:29:12 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 12:29:12 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 12:29:12 dncs unix: glm3:        Cmd (0x618f16e0) dump for Target 0
Lun 0:
Apr 26 12:29:12 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 12:29:12 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 12:29:12 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 12:29:12 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 12:29:12 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 12:29:12 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 12:29:12 dncs unix: glm3:        Cmd (0x618f16e0) dump for Target 0
Lun 0:
Apr 26 12:29:12 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 12:29:12 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 12:29:12 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 12:29:12 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 12:29:12 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 12:29:12 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 12:29:12 dncs unix: glm3:        Cmd (0x618f16e0) dump for Target 0
Lun 0:
Apr 26 12:29:12 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 12:29:12 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 12:29:12 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 12:29:12 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 12:29:12 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 12:29:12 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 12:29:12 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1/sd at 0,0 (sd45):
Apr 26 12:29:12 dncs unix:      SCSI transport failed: reason 'timeout':
retrying command
Apr 26 12:29:12 dncs unix: 
Apr 26 12:29:12 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1/sd at 0,0 (sd45):
Apr 26 12:29:12 dncs unix:      SCSI transport failed: reason 'reset':
retrying command
Apr 26 12:29:12 dncs unix: 
Apr 26 13:11:13 dncs unix: glm2:        Cmd (0x618eab80) dump for Target 0
Lun 0:
Apr 26 13:11:55 dncs unix: glm2:                cdb=[ 0x2a 0x0 0x0 0x6d 0x81
0x62 0x0 0x0 0x1 0x0 ]
Apr 26 13:11:55 dncs unix: glm2:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 13:11:55 dncs unix: glm2:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 13:11:55 dncs unix: WARNING: /pci at 6,4000/scsi at 2 (glm2):
Apr 26 13:11:55 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 13:11:55 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 13:11:55 dncs unix: glm3:        Cmd (0x618f0ab0) dump for Target 0
Lun 0:
Apr 26 13:11:55 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 13:11:55 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 13:11:55 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 13:11:55 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 13:11:55 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 13:11:55 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 13:11:55 dncs unix: glm2:        Cmd (0x618eab80) dump for Target 0
Lun 0:
Apr 26 13:11:55 dncs unix: glm2:                cdb=[ 0x2a 0x0 0x0 0x6d 0x81
0x62 0x0 0x0 0x1 0x0 ]
Apr 26 13:11:55 dncs unix: glm2:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 13:11:55 dncs unix: glm2:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 13:11:55 dncs unix: WARNING: /pci at 6,4000/scsi at 2 (glm2):
Apr 26 13:11:55 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 13:11:55 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 13:11:55 dncs unix: glm3:        Cmd (0x618f0ab0) dump for Target 0
Lun 0:
Apr 26 13:11:55 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 13:11:55 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 13:11:55 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 13:11:55 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 13:11:55 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 13:11:55 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 13:11:55 dncs unix: glm2:        Cmd (0x618eab80) dump for Target 0
Lun 0:
Apr 26 13:11:55 dncs unix: glm2:                cdb=[ 0x2a 0x0 0x0 0x6d 0x81
0x62 0x0 0x0 0x1 0x0 ]
Apr 26 13:11:55 dncs unix: glm2:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 13:11:55 dncs unix: glm2:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 13:11:55 dncs unix: WARNING: /pci at 6,4000/scsi at 2 (glm2):
Apr 26 13:11:55 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 13:11:55 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 13:11:56 dncs unix: glm3:        Cmd (0x618f0ab0) dump for Target 0
Lun 0:
Apr 26 13:11:56 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 13:11:56 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 13:11:56 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 13:11:56 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 13:11:56 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 13:11:56 dncs unix: glm2:        Cmd (0x618eab80) dump for Target 0
Lun 0:
Apr 26 13:11:56 dncs unix: glm2:                cdb=[ 0x2a 0x0 0x0 0x6d 0x81
0x62 0x0 0x0 0x1 0x0 ]
Apr 26 13:11:56 dncs unix: glm2:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 13:11:56 dncs unix: glm2:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2 (glm2):
Apr 26 13:11:56 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 13:11:56 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 13:11:56 dncs unix: glm3:        Cmd (0x618f0ab0) dump for Target 0
Lun 0:
Apr 26 13:11:56 dncs unix: glm3:                cdb=[ 0xa 0xf 0xac 0xdb 0x1
0x0 ]
Apr 26 13:11:56 dncs unix: glm3:        pkt_flags=0x4000 pkt_statistics=0x60
pkt_state=0x7
Apr 26 13:11:56 dncs unix: glm3:        pkt_scbp=0x0 cmd_flags=0x1860
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1 (glm3):
Apr 26 13:11:56 dncs unix:      Disconnected tagged cmd(s) (1) timeout for
Target 0.0
Apr 26 13:11:56 dncs unix: WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2/sd at 0,0 (sd30):
Apr 26 13:11:56 dncs unix:      SCSI transport failed: reason 'reset':
retrying command
Apr 26 13:11:56 dncs unix: 
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2/sd at 0,0 (sd30):
Apr 26 13:11:56 dncs unix:      SCSI transport failed: reason 'timeout':
retrying command
Apr 26 13:11:56 dncs unix: 
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2/sd at 1,0 (sd31):
Apr 26 13:11:56 dncs unix:      SCSI transport failed: reason 'reset':
retrying command
Apr 26 13:11:56 dncs unix: 
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2/sd at 3,0 (sd33):
Apr 26 13:11:56 dncs unix:      SCSI transport failed: reason 'reset':
retrying command
Apr 26 13:11:56 dncs unix: 
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1/sd at 0,0 (sd45):
Apr 26 13:11:56 dncs unix:      SCSI transport failed: reason 'timeout':
retrying command
Apr 26 13:11:56 dncs unix: 
Apr 26 13:11:56 dncs unix: WARNING: /pci at 6,4000/scsi at 2,1/sd at 0,0 (sd45):
Apr 26 13:11:56 dncs unix:      SCSI transport failed: reason 'reset':
retrying command
Apr 26 13:11:56 dncs unix:

At the OK> prompt probe-scsi-all shows all drives.
The disks are mirrrored using disksuite.  metastat shows all metadevices are
"Okay".

With the ATM and SCSI failures, am I looking at a system board problem?

Dennis Lund


<html>
<body>
<font size="3" face="Times New Roman"><span style="mso-fareast-font-family: Times New Roman; mso-ansi-language: EN-US; mso-fareast-language: EN-US; mso-bidi-language: AR-SA">
- - - - - - - Appended by Scientific-Atlanta, Inc. - - - - - - -
<span style="font-size:10.0pt;font-family:Times New Roman;
mso-fareast-font-family:"Times New Roman";mso-ansi-language:EN-US;mso-fareast-language:
EN-US;mso-bidi-language:AR-SA"></span><font face="Times New Roman" size="3"><span style="mso-fareast-font-family:Times New Roman; mso-ansi-language: EN-US; mso-fareast-language: EN-US; mso-bidi-language: AR-SA">This e-mail and any attachments may contain information which is confidential, proprietary, privileged or otherwise protected by law. The information is solely intended for the named addressee (or a person responsible for delivering it to the addressee). If you are not the intended recipient of this message, you are not authorized to read, print, retain, copy or disseminate this message or any part of it. If you have received this e-mail in error, please notify the sender immediately by return e-mail and delete it from your computer.</span></font></p>
</body>
</html>



More information about the SunHELP mailing list