Thursday, November 28, 2013

Process to change the faulty hardisk on SUN Blade X6250 server. (ZFS filesystem)

Process to change the faulty hardisk on SUN Blade X6250 server. (ZFS filesystem)

c0t1d0s0  is faulty and need replacement.
root@now-rfe36.gbl:/root# zpool status
pool: rpool
state: DEGRADED
status: One or more devices could not be opened.  Sufficient replicas exist for
the pool to continue functioning in a degraded state.
action: Attach the missing device and online it using 'zpool online'.
see: http://www.sun.com/msg/ZFS-8000-2Q
scan: none requested
config:

NAME          STATE     READ WRITE CKSUM
rpool         DEGRADED     0     0     0
mirror-0    DEGRADED     0     0     0
c0t1d0s0  UNAVAIL      0     0     0  cannot open
c0t0d0s0  ONLINE       0     0     0

Errors: No known data errors

root@now-rfe36.gbl:/root# echo |format
Searching for disks...done

Disk not visible at OS level.
AVAILABLE DISK SELECTIONS:
       0. c0t0d0 <Sun    -STK RAID REM   -V1.0 cyl 17830 alt 2 hd 255 sec 63>
          /pci@0,0/pci8086,2690@1c/pci108e,7aac@0/sd@0,0
Specify disk (enter its number): Specify disk (enter its number):
root@now-rfe36.gbl:/root#
The below mention file is available in my home dir .. /export/home/pchitte1 on  ssh5.oam server

Unzip the file
root@now-rfe36.gbl:/root#
root@now-rfe36.gbl:/tmp# unzip solaris_x86_raidman-arcconf_v6.10.0-17557.zip
Archive:  solaris_x86_raidman-arcconf_v6.10.0-17557.zip
inflating: cmdline/arcconf
inflating: manager/Apache-LICENSE-2.0.txt
inflating: manager/StorMan.pkg
inflating: License_v2.pdf
inflating: Release Notes.Txt
inflating: books/readme.txt
inflating: books/Sun_StorageTek_Raid_Manager_Software_Users_Guide.pdf
inflating: books/Uniform_Command-Line_Interface_Users_Guide.pdf

root@now-rfe36.gbl:/tmp# cd cmdline



Run the ./arcconf getconfig 1 AL

root@now-rfe36.gbl:/tmp/cmdline# ./arcconf getconfig 1 AL
Controllers found: 1
----------------------------------------------------------------------
Controller information
----------------------------------------------------------------------
Controller Status                        : Optimal
Channel description                      : SAS/SATA
Controller Model                         : Sun STK RAID REM
Controller Serial Number                 : 133C9D
Physical Slot                            : 32
Temperature                              : 58 C/ 136 F (Normal)
Installed memory                         : 256 MB
Copyback                                 : Disabled
Background consistency check             : Disabled
Automatic Failover                       : Enabled
Global task priority                     : High
Defunct disk drive count                 : 0
Logical devices/Failed/Degraded          : 1/0/0
--------------------------------------------------------
Controller Version Information
--------------------------------------------------------
BIOS                                     : 5.2-0 (15872)
Firmware                                 : 5.2-0 (15872)
Driver                                   : 2.2-4 (1)
Boot Flash                               : 5.2-0 (15872)
--------------------------------------------------------
Controller Battery Information
--------------------------------------------------------
Status                                   : Optimal
Over temperature                         : No
Capacity remaining                       : 99 percent
Time remaining (at current draw)         : 3 days, 1 hours, 11 minutes

----------------------------------------------------------------------
Logical device information
----------------------------------------------------------------------
Logical device number 0
Logical device name                      : VOL0
RAID level                               : Simple_volume
Status of logical device                 : Optimal
Size                                     : 139890 MB
Read-cache mode                          : Enabled
Write-cache mode                         : Disabled (write-through)
Write-cache setting                      : Disabled (write-through)
Partitioned                              : Yes
Protected by Hot-Spare                   : No
Bootable                                 : Yes
Failed stripes                           : No
--------------------------------------------------------
Logical device segment information
--------------------------------------------------------
Segment 0                                : Present (0,0) 219969S9            3NM969S9


----------------------------------------------------------------------
Physical Device information
----------------------------------------------------------------------
Device #0
Device is a Hard drive
State                              : Online
Supported                          : Yes
Transfer Speed                     : SAS 3.0 Gb/s
Reported Channel,Device            : 0,0
Reported Location                  : Enclosure 0, Slot 0
Reported ESD                       : 2,0
Vendor                             : SEAGATE
Model                              : ST914602SSUN146G
Firmware                           : 0603
Serial number                      : 219969S9            3NM969S9
World-wide name                    : 5000C50012EB16CC
Size                               : 140009 MB
Write Cache                        : Disabled (write-through)
FRU                                : None
S.M.A.R.T.                         : No
Device #1
Device is a Hard drive
State                              : Ready   # Newly inserted disk will be in ready state.
Supported                          : Yes
Transfer Speed                     : SAS 3.0 Gb/s
Reported Channel,Device            : 0,1
Reported Location                  : Enclosure 0, Slot 1
Reported ESD                       : 2,0
Vendor                             : SEAGATE
Model                              : ST914602SSUN146G
Firmware                           : 0703
Serial number                      : 084998FKW2        3NM8FKW2
World-wide name                    : 5000C5003C9E42E0
Size                               : 140009 MB
Write Cache                        : Unknown
FRU                                : None
S.M.A.R.T.                         : No
Device #2
Device is an Enclosure services device
Reported Channel,Device            : 2,0
Enclosure ID                       : 0
Type                               : SES2
Vendor                             : ADAPTEC
Model                              : Virtual SGPIO
Firmware                           : 0001
Status of Enclosure services device


Command completed successfully.

root@now-rfe36.gbl:/tmp/cmdline#
Execute the  following command
./arcconf task start 1 device 0 1 initialize        # this will initialize the drive, please ensure the channel and device ID are correct. 0 is the channel and 1 is the device id.

./arcconf create 1 logicaldrive MAX volume 0 1 # This will create the simple volume on the drive at channel 0 and device id 1 with maximum available space.

root@now-rfe36.gbl:/tmp/cmdline# ./arcconf task start 1 device 0 1 initialize
Controllers found: 1
Initializing Channel 0, Device 1.

Command completed successfully.
root@now-rfe36.gbl:/tmp/cmdline# format
Searching for disks...done


AVAILABLE DISK SELECTIONS:
       0. c0t0d0 <Sun    -STK RAID REM   -V1.0 cyl 17830 alt 2 hd 255 sec 63>
          /pci@0,0/pci8086,2690@1c/pci108e,7aac@0/sd@0,0
Specify disk (enter its number): ^D
root@now-rfe36.gbl:/tmp/cmdline# ./arcconf create 1 logicaldrive MAX volume 0 1
Controllers found: 1

Do you want to add a logical device to the configuration?
Press y, then ENTER to continue or press ENTER to abort: y

Creating logical device: Device 1

Command completed successfully.
root@now-rfe36.gbl:/tmp/cmdline#

root@now-rfe36.gbl:/root# format
Searching for disks...done



You are able to see disk at OS level.

AVAILABLE DISK SELECTIONS:
       0. c0t0d0 <Sun    -STK RAID REM   -V1.0 cyl 17830 alt 2 hd 255 sec 63>
          /pci@0,0/pci8086,2690@1c/pci108e,7aac@0/sd@0,0
       1. c0t1d0 <Sun    -STK RAID REM   -V1.0 cyl 17830 alt 2 hd 255 sec 63>
          /pci@0,0/pci8086,2690@1c/pci108e,7aac@0/sd@1,0
Specify disk (enter its number):

root@now-rfe36.gbl:/root#


Label the disk

prtvtoc /dev/rdsk/c0t0d0s2| fmthard -s - /dev/rdsk/c0t1d0s2

root@now-rfe36.gbl:/tmp/cmdline#

oot@now-rfe36.gbl:/tmp/cmdline# zpool attach -f rpool c0t0d0s0 c0t1d0s0
Make sure to wait until resilver is done before rebooting.
root@now-rfe36.gbl:/tmp/cmdline# zpool status
  pool: rpool
 state: ONLINE
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
 scan: resilver in progress since Fri Nov  8 12:27:55 2013
    849M scanned out of 49.6G at 42.4M/s, 0h19m to go
    845M resilvered, 1.67% done
config:

        NAME          STATE     READ WRITE CKSUM
        rpool         ONLINE       0     0     0
          mirror-0    ONLINE       0     0     0
            c0t0d0s0  ONLINE       0     0     0
            c0t1d0s0  ONLINE       0     0     0  (resilvering)

errors: No known data errors
root@now-rfe36.gbl:/tmp/cmdline#

Finally the zpool status as below

root@now-rfe36.gbl:/root# zpool status
  pool: rpool
 state: ONLINE
 scan: resilvered 49.6G in 0h20m with 0 errors on Fri Nov  8 12:48:54 2013
config:

        NAME          STATE     READ WRITE CKSUM
        rpool         ONLINE       0     0     0
          mirror-0    ONLINE       0     0     0
            c0t0d0s0  ONLINE       0     0     0
            c0t1d0s0  ONLINE       0     0     0

errors: No known data errors
root@now-rfe36.gbl:/root#

No comments:

Post a Comment