[SunHELP] possible drive failure?

Joe Stump jstump at aa.acinc.com
Mon Jan 6 11:55:25 CST 2003


Recently at work we had an incident that cut power to the server room.
Needless to say, every machine was hard powered down, including our
E450. Since then we can't keep the E450 up for more than a few hours
before it dies.

In the dmesg output I get messages like this:
Jan  5 13:54:23 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3 (glm0):
Jan  5 13:54:23 zues    Disconnected tagged cmd(s) (1) timeout for Target 0.0
Jan  5 13:54:23 zues genunix: [ID 408822 kern.info] NOTICE: glm0: fault detected in device; service still available
Jan  5 13:54:23 zues genunix: [ID 611667 kern.info] NOTICE: glm0: Disconnected tagged cmd(s) (1) timeout for Target 0.0
Jan  5 13:54:23 zues glm: [ID 401478 kern.warning] WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Jan  5 13:54:23 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3 (glm0):
Jan  5 13:54:23 zues    got SCSI bus reset
Jan  5 13:54:23 zues genunix: [ID 408822 kern.info] NOTICE: glm0: fault detected in device; service still available
Jan  5 13:54:23 zues genunix: [ID 611667 kern.info] NOTICE: glm0: got SCSI bus reset
Jan  5 13:54:23 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3/sd@0,0 (sd0):
Jan  5 13:54:23 zues    SCSI transport failed: reason 'timeout': retrying command
Jan  5 13:54:27 zues scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3 (glm0):
Jan  5 13:54:27 zues    Cmd (0x16be018) dump for Target 0 Lun 0:
Jan  5 13:54:27 zues scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3 (glm0):
Jan  5 13:54:27 zues            cdb=[ 0x2a 0x0 0x0 0x40 0xe 0x7 0x0 0x0 0xa 0x0 ]
Jan  5 13:54:27 zues scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3 (glm0):
Jan  5 13:54:27 zues    pkt_flags=0xc000 pkt_statistics=0x60 pkt_state=0x7
Jan  5 13:54:27 zues scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3 (glm0):
Jan  5 13:54:27 zues    pkt_scbp=0x0 cmd_flags=0x1860
Jan  5 13:54:27 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3 (glm0):
Jan  5 13:54:27 zues    Disconnected tagged cmd(s) (1) timeout for Target 0.0
Jan  5 13:54:27 zues genunix: [ID 408822 kern.info] NOTICE: glm0: fault detected in device; service still available
Jan  5 13:54:27 zues genunix: [ID 611667 kern.info] NOTICE: glm0: Disconnected tagged cmd(s) (1) timeout for Target 0.0
Jan  5 13:54:27 zues glm: [ID 401478 kern.warning] WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Jan  5 13:54:27 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3 (glm0):
Jan  5 13:54:27 zues    got SCSI bus reset
Jan  5 13:54:27 zues genunix: [ID 408822 kern.info] NOTICE: glm0: fault detected in device; service still available
Jan  5 13:54:27 zues genunix: [ID 611667 kern.info] NOTICE: glm0: got SCSI bus reset
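
As an aside, the cdb bytes in the command dump can be decoded to see which
request timed out. The following is only a minimal sketch in Python, assuming
the standard SCSI 10-byte CDB layout (opcode, flags, 4-byte big-endian LBA,
group, 2-byte transfer length, control). The function name decode_cdb10 and
the small opcode table are illustrative and not part of any Solaris tool.

```python
import struct

# Small illustrative opcode table (not exhaustive).
OPCODES = {0x28: "READ(10)", 0x2A: "WRITE(10)"}


def decode_cdb10(cdb_text):
    """Decode a 10-byte SCSI CDB given as space-separated hex bytes."""
    raw = bytes(int(tok, 16) for tok in cdb_text.split())
    if len(raw) != 10:
        raise ValueError("expected 10 CDB bytes, got %d" % len(raw))
    # Big-endian: opcode, flags, 32-bit LBA, group, 16-bit length, control.
    opcode, flags, lba, group, blocks, control = struct.unpack(">BBIBHB", raw)
    return {
        "opcode": opcode,
        "name": OPCODES.get(opcode, "unknown"),
        "lba": lba,
        "blocks": blocks,
    }


if __name__ == "__main__":
    # Bytes copied from the glm0 command dump in the log.
    print(decode_cdb10("0x2a 0x0 0x0 0x40 0xe 0x7 0x0 0x0 0xa 0x0"))
```

Under that assumption the dump decodes to a WRITE(10) of 10 blocks starting at
LBA 4197895, so the command that timed out was a write to target 0.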

Is there a way to reboot and force a check of the drives? Or is there some
other possible cause that I'm overlooking?

Any help would be great!

Thanks!

--Joe

--
Joe Stump <joe at joestump.net>
http://www.joestump.net

