[SunHELP] possible drive failure?
Joe Stump
jstump at aa.acinc.com
Mon Jan 6 11:55:25 CST 2003
Recently at work we had an incident that cut power to the server room.
Needless to say, every machine was hard powered down, including our
E450. Since then we can't keep the E450 up for more than a few hours
before it dies.
In the dmesg output I get messages like this:
Jan 5 13:54:23 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3 (glm0):
Jan 5 13:54:23 zues Disconnected tagged cmd(s) (1) timeout for Target 0.0
Jan 5 13:54:23 zues genunix: [ID 408822 kern.info] NOTICE: glm0: fault detected in device; service still available
Jan 5 13:54:23 zues genunix: [ID 611667 kern.info] NOTICE: glm0: Disconnected tagged cmd(s) (1) timeout for Target 0.0
Jan 5 13:54:23 zues glm: [ID 401478 kern.warning] WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Jan 5 13:54:23 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3 (glm0):
Jan 5 13:54:23 zues got SCSI bus reset
Jan 5 13:54:23 zues genunix: [ID 408822 kern.info] NOTICE: glm0: fault detected in device; service still available
Jan 5 13:54:23 zues genunix: [ID 611667 kern.info] NOTICE: glm0: got SCSI bus reset
Jan 5 13:54:23 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3/sd@0,0 (sd0):
Jan 5 13:54:23 zues SCSI transport failed: reason 'timeout': retrying command
Jan 5 13:54:27 zues scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3 (glm0):
Jan 5 13:54:27 zues Cmd (0x16be018) dump for Target 0 Lun 0:
Jan 5 13:54:27 zues scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3 (glm0):
Jan 5 13:54:27 zues cdb=[ 0x2a 0x0 0x0 0x40 0xe 0x7 0x0 0x0 0xa 0x0 ]
Jan 5 13:54:27 zues scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3 (glm0):
Jan 5 13:54:27 zues pkt_flags=0xc000 pkt_statistics=0x60 pkt_state=0x7
Jan 5 13:54:27 zues scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3 (glm0):
Jan 5 13:54:27 zues pkt_scbp=0x0 cmd_flags=0x1860
Jan 5 13:54:27 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3 (glm0):
Jan 5 13:54:27 zues Disconnected tagged cmd(s) (1) timeout for Target 0.0
Jan 5 13:54:27 zues genunix: [ID 408822 kern.info] NOTICE: glm0: fault detected in device; service still available
Jan 5 13:54:27 zues genunix: [ID 611667 kern.info] NOTICE: glm0: Disconnected tagged cmd(s) (1) timeout for Target 0.0
Jan 5 13:54:27 zues glm: [ID 401478 kern.warning] WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
Jan 5 13:54:27 zues scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3 (glm0):
Jan 5 13:54:27 zues got SCSI bus reset
Jan 5 13:54:27 zues genunix: [ID 408822 kern.info] NOTICE: glm0: fault detected in device; service still available
Jan 5 13:54:27 zues genunix: [ID 611667 kern.info] NOTICE: glm0: got SCSI bus reset
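
For reference, the cdb=[ ... ] line in the dump can be decoded by hand to see which command timed out. The following is only a minimal sketch, assuming the standard 10-byte SCSI CDB layout (opcode, flags, 4-byte big-endian logical block address, group, 2-byte transfer length, control). The function name decode_cdb10 and the small opcode table are illustrative and not part of any Solaris tool.

```python
import struct

# Bytes copied from the "cdb=[ ... ]" line in the log.
CDB = bytes([0x2a, 0x0, 0x0, 0x40, 0xe, 0x7, 0x0, 0x0, 0xa, 0x0])

# Deliberately incomplete table of 10-byte opcodes (illustrative only).
OPCODES = {0x28: "READ(10)", 0x2a: "WRITE(10)"}


def decode_cdb10(cdb):
    """Split a 10-byte SCSI CDB into its fields (hypothetical helper)."""
    if len(cdb) != 10:
        raise ValueError("expected a 10-byte CDB")
    # Big-endian: opcode, flags, 32-bit LBA, group, 16-bit length, control.
    opcode, flags, lba, group, length, control = struct.unpack(">BBIBHB", cdb)
    return {
        "command": OPCODES.get(opcode, "unknown (0x%02x)" % opcode),
        "flags": flags,
        "lba": lba,
        "group": group,
        "blocks": length,
        "control": control,
    }


if __name__ == "__main__":
    for field, value in decode_cdb10(CDB).items():
        print(field, "=", value)
```

Under that assumed layout, the command that timed out on Target 0 Lun 0 is a WRITE(10) of 10 blocks starting at logical block 4197895 (0x400e07). That only identifies the request that was outstanding; it does not by itself say whether the disk, the cabling, or the controller is at fault.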
Is there a way to reboot and force a check of the drives? Is there some
other possible cause that I'm not aware of?
Any help would be great!
Thanks!
--Joe
--
Joe Stump <joe at joestump.net>
http://www.joestump.net