[SunHELP] Help : Lost one of our SVM volumes
Scott Walker
crimson at unspeakable.org
Mon Dec 19 13:18:22 CST 2005
If it takes 6 hours for it too cool down, I highly suggest you not run
servers in pizza ovens.
Seriously tho. Taking 6 hours to cool down is ridculous.
Richard Skelton wrote:
> Hi Managers
>
> I think that something in the FCAL loop over temperatured and shut down
> for over six hours (while it cooled down)
> I don't think I have a disk problem so I have run metareplace -e d80
> c4t7d0s2 so at around 18:00 I will see if the volume is OK
>
> Richard Skelton wrote:
>
>
>>Hi Managers,
>>Over the weekend we lost one of our SVM volumes d80
>>Looking at the messages file it looks like the fcal controlled went
>>offline for a while and when it came back SVM replaced a disk in
>>maintenance with the hot spare.
>>I have no more spare drives and SVM still wants me to replace more drives.
>>The system may have been over temperature during Sunday morning but I
>>have no temperature monitoring on this system.
>>
>>How can I recover from this situation?
>>
>>messages:-
>>
>>Dec 18 06:49:17 ccs001 socal: [ID 403145 kern.info]
>>ID[SUNWssa.socal.link.5010]
>>socal1: port 1: Fibre Channel is OFFLINE
>>Dec 18 06:50:03 ccs001 scsi: [ID 243001 kern.warning] WARNING:
>>/sbus at a,0/SUNW,so
>>cal at d,10000/sf at 1,0 (sf3):
>>Dec 18 06:50:03 ccs001 Offline Timeout
>>Dec 18 06:50:03 ccs001 scsi: [ID 243001 kern.info]
>>/sbus at a,0/SUNW,socal at d,10000/
>>sf at 1,0 (sf3):
>>Dec 18 06:50:03 ccs001 target 0x7 al_pa 0xda lun 0 offlined
>>Dec 18 06:50:03 ccs001 scsi: [ID 243001 kern.info]
>>/sbus at a,0/SUNW,socal at d,10000/
>>sf at 1,0 (sf3):
>>Dec 18 06:50:03 ccs001 target 0x4 al_pa 0xe1 lun 0 offlined
>>Dec 18 06:50:03 ccs001 scsi: [ID 243001 kern.info]
>>/sbus at a,0/SUNW,socal at d,10000/
>>sf at 1,0 (sf3):
>>Dec 18 06:50:03 ccs001 target 0x6 al_pa 0xdc lun 0 offlined
>>Dec 18 06:50:03 ccs001 scsi: [ID 243001 kern.info]
>>/sbus at a,0/SUNW,socal at d,10000/
>>sf at 1,0 (sf3):
>>Dec 18 06:50:03 ccs001 target 0x5 al_pa 0xe0 lun 0 offlined
>>Dec 18 06:50:03 ccs001 scsi: [ID 107833 kern.warning] WARNING:
>>/sbus at a,0/SUNW,so
>>cal at d,10000/sf at 1,0/ssd at w21000020376c8948,0 (ssd13):
>>Dec 18 06:50:03 ccs001 ssdrestart transport failed (fffffffe)
>>Dec 18 06:50:03 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d82: wri
>>te error on /dev/dsk/c4t6d0s2
>>Dec 18 06:50:03 ccs001 scsi: [ID 107833 kern.warning] WARNING:
>>/sbus at a,0/SUNW,so
>>cal at d,10000/sf at 1,0/ssd at w21000020379862b2,0 (ssd12):
>>Dec 18 06:50:03 ccs001 ssdrestart transport failed (fffffffe)
>>Dec 18 06:50:03 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d82: rea
>>d error on /dev/dsk/c4t7d0s2
>>Dec 18 06:50:03 ccs001 last message repeated 1 time
>>Dec 18 06:50:03 ccs001 md_mirror: [ID 842313 kern.info] NOTICE: md: d82:
>>B_FAILF
>>AST I/O retry
>>Dec 18 06:50:03 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d82: wri
>>te error on /dev/dsk/c4t6d0s2
>>Dec 18 06:50:03 ccs001 md: [ID 680156 kern.info] NOTICE: md: d82:
>>B_FAILFAST I/O
>>retry, 2 buf(s) dequeued
>>Dec 18 06:50:07 ccs001 md_mirror: [ID 104909 kern.warning] WARNING: md:
>>d82: /de
>>v/dsk/c4t6d0s2 needs maintenance
>>Dec 18 06:50:11 ccs001 md_mirror: [ID 104909 kern.warning] WARNING: md:
>>d82: /de
>>v/dsk/c4t7d0s2 needs maintenance
>>Dec 18 06:50:11 ccs001 scsi: [ID 107833 kern.warning] WARNING:
>>/sbus at a,0/SUNW,so
>>cal at d,10000/sf at 1,0/ssd at w210000203747e732,0 (ssd14):
>>Dec 18 06:50:11 ccs001 transport rejected (-2)
>>Dec 18 06:50:11 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: rea
>>d error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:11 ccs001 md_mirror: [ID 842313 kern.info] NOTICE: md: d81:
>>B_FAILF
>>AST I/O retry
>>Dec 18 06:50:11 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: rea
>>d error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:11 ccs001 md_mirror: [ID 104909 kern.warning] WARNING: md:
>>d81: /de
>>v/dsk/c4t5d0s2 needs maintenance
>>Dec 18 06:50:11 ccs001 md_mirror: [ID 990438 kern.warning] WARNING: md:
>>d81: /de
>>v/dsk/c4t5d0s2 last erred
>>Dec 18 06:50:11 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: rea
>>d error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:11 ccs001 md_mirror: [ID 842313 kern.info] NOTICE: md: d81:
>>B_FAILF
>>AST I/O retry
>>Dec 18 06:50:11 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: rea
>>d error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:11 ccs001 last message repeated 1 time
>>Dec 18 06:50:11 ccs001 md_mirror: [ID 842313 kern.info] NOTICE: md: d81:
>>B_FAILF
>>AST I/O retry
>>Dec 18 06:50:11 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: rea
>>d error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:11 ccs001 md_mirror: [ID 842313 kern.info] NOTICE: md: d81:
>>B_FAILF
>>AST I/O retry
>>Dec 18 06:50:11 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: rea
>>d error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:11 ccs001 last message repeated 1 time
>>Dec 18 06:50:11 ccs001 md_stripe: [ID 241980 kern.notice] NOTICE: md:
>>d82: hotsp
>>ared device /dev/dsk/c4t6d0s2 with /dev/dsk/c3t3d0s2
>>Dec 18 06:50:12 ccs001 scsi: [ID 107833 kern.warning] WARNING:
>>/sbus at a,0/SUNW,so
>>cal at d,10000/sf at 1,0/ssd at w2100002037d0a281,0 (ssd15):
>>Dec 18 06:50:12 ccs001 transport rejected (-2)
>>Dec 18 06:50:12 ccs001 md_mirror: [ID 842313 kern.info] NOTICE: md: d81:
>>B_FAILF
>>AST I/O retry
>>Dec 18 06:50:12 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: wri
>>te error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:12 ccs001 last message repeated 1 time
>>Dec 18 06:50:12 ccs001 md_mirror: [ID 842313 kern.info] NOTICE: md: d81:
>>B_FAILF
>>AST I/O retry
>>Dec 18 06:50:12 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: wri
>>te error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:12 ccs001 last message repeated 1 time
>>Dec 18 06:50:12 ccs001 ufs_log: [ID 702911 kern.warning] WARNING: Error
>>writing
>>ufs log
>>Dec 18 06:50:12 ccs001 ufs_log: [ID 127457 kern.warning] WARNING: ufs
>>log for /e
>>xport/work changed state to Error
>>Dec 18 06:50:12 ccs001 ufs_log: [ID 616219 kern.warning] WARNING: Please
>>umount(
>>1M) /export/work and run fsck(1M)
>>Dec 18 06:50:12 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: rea
>>d error on /dev/dsk/c4t5d0s2
>>Dec 18 06:50:12 ccs001 md_mirror: [ID 842313 kern.info] NOTICE: md: d81:
>>B_FAILF
>>AST I/O retry
>>Dec 18 06:50:12 ccs001 md_stripe: [ID 641072 kern.warning] WARNING: md:
>>d81: rea
>>d error on /dev/dsk/c4t5d0s2
>>
>>
>>
>>metstat:-
>>
>>d80: Mirror
>> Submirror 0: d81
>> State: Needs maintenance
>> Submirror 1: d82
>> State: Needs maintenance
>> Pass: 1
>> Read option: roundrobin (default)
>> Write option: parallel (default)
>> Size: 142245693 blocks
>>
>>d81: Submirror of d80
>> State: Needs maintenance
>> Invoke: after replacing "Maintenance" components:
>> metareplace d80 c4t5d0s2 <new device>
>> Hot spare pool: hsp001
>> Size: 142245693 blocks
>> Stripe 0: (interlace: 32 blocks)
>> Device Start Block Dbase State Hot Spare
>> c3t2d0s2 0 No Okay
>> c4t5d0s2 2889 No Last Erred
>>
>>
>>d82: Submirror of d80
>> State: Needs maintenance
>> Invoke: metareplace d80 c4t7d0s2 <new device>
>> Hot spare pool: hsp001
>> Size: 142245693 blocks
>> Stripe 0: (interlace: 32 blocks)
>> Device Start Block Dbase State Hot Spare
>> c4t6d0s2 0 No Okay c3t3d0s2
>> c4t7d0s2 2889 No Maintenance
>>
>>
>>hsp001: 1 hot spare
>> c3t3d0s2 In use 71127180 blocks
>>
>>
>>
>
>
More information about the SunHELP
mailing list