HP 3PAR Disk Replacement
HP 3PAR Disk Replacement
HP 3PAR Disk Replacement
Regmen
The exact working principle will be explained in the other article. However for now, it
is good to know, that 3PAR allows to build RAID groups based on some capabilities:
Deep investigation is not needed to see that the numeration of spare chunklets on
each drive has high number, hence chunklets designated for spares are from inner
zones of disks, which is obviously good.
But thats not all. The amount of chunklets that are candidates for spares are
determined by policy, which can be chosen by following vendor recommendation
from presents or chosen by us. We can distinguish below sparing algorithms:
default: amount of one full disk for every 40 disks, with required 4 disks as
minimum.
minimum: same as default, but without required minimum drives.
maximum: amount of one full disk per drive cage so called cage level high
availability.
custom: defined by user, but administrator should remember to add spares
while adding new drives.
SizeMB
WThru MapV
5 log0.0
1 normal
0/-/-/-
20480
0 log
0 ---
6 log1.0
1 normal
1/-/-/-
20480
0 log
0 ---
7 log2.0
1 normal
2/-/-/-
20480
0 log
0 ---
8 log3.0
1 normal
3/-/-/-
20480
0 log
0 ---
Column Use: log in cell under this column means that logical disk is used as a
logging logical disk.
Column Lgct: The number of chunklets that are in logging mode in the logical
disk.
Column LgId: The ID of the logging disk that is being used for logging by the
logical disk.
Important information
Logging logical disk is entity that is entirely created and managed by 3PAR
system.
04:36:38
GMT
Informational
Disk
event
scsistat:0x02
asc/ascq:0x5d/0x0
(Check
(Failure
cmd_spec:0x0,
condition),
prediction
snskey:0x01
threshold
sns_spec:0x50000,
CDB:2A00562EE38800001800
(Recovered
exceeded),
info:0x0,
host:0x0,
(Write10),
blk:0x562ee388,
error),
abort:0,
blkcnt
0x18,
04:37:41
GMT
Degraded
Disk
abort
04:41:49
GMT
Informational
Disk
event
scsistat:0x02
asc/ascq:0x5d/0x0
cmd_spec:0x0,
(Check
(Failure
condition),
prediction
snskey:0x01
threshold
sns_spec:0x50000,
(Recovered
exceeded),
host:0x0,
error),
info:0x0,
abort:0,
2015-12-06
04:47:42
GMT
Informational
Disk
state
change
Lets check the situation with marked as failed disk. If you are uncertain about failed
drive you can see which PDs are failed with using command showpd with -failed
option. With showpd command information about systems physical disks can be
shown.
3PAR-cluster cli% showpd -failed
-Size(MB)-- ----Ports---Id CagePos Type RPM State Total Free A
7 0:7:0? FC
10 failed 838656
Capacity(GB)
0 ----- -----
900
----------------------------------------------------------------1 total
838656
667 152 0
---------------------------------------------------------------------------------------1 total
819 0
667 152 0
To see detailed information about chunklets within disk you can use command
showpdch.
3PAR-cluster cli% showpdch 7
Id CagePos State
SAS
----------------------------------------------------------------------------------------------------------------------1 total
After new disk will be replaced, event log will record it.
2015-12-08 12:03:06 GMT 1 Informational Disk state change sw_pd:160 pd 160
wwn 5000CCA0714AC3D3 changed state from new to valid because disk was
admitted successfully.
2015-12-08
12:03:06
GMT
Informational
Disk
state
change
12:03:06
GMT
Informational
Object
added
Important information
Remember that the PD_ID of replaced drive will be different from failed drive.
In this example pd_id of failed drive is 7, and replaced drive has been
assigned to the first available id, which is 160.
The array will still signal degraded status, until reallocation will be completed.
After reallocation, pd_id assigned to new drive disappear and replaced disk
will be visible under pd_id assigned previously to failed drive.
However all chunklets that resided on failed drive must be reallocated from logging
drive and spare area back to replaced disk. To see the progress use service mag
command or monitor the chunklets.
3PAR-cluster cli% servicemag status
Cage 0, magazine 7:
The magazine is being brought online due to a servicemag resume.
The last status update was at Tue Dec 8 12:04:07 2015.
Chunklets relocated: 404 in 3 hours, 17 minutes and 25 seconds
Chunklets remaining: 762
Unfortunately above method is not 100% reliable, and from time to time, the
amount of chunklets and time for that operations are wrongly displayed. To check
how the process looks like you can use showpdch command with -mov parameter.
At the end of output sum of chunklets remaining for move is shown.
3PAR-cluster cli% showpdch -mov 160
The log parameter determines that write operations to chunklets of specific drive
are committed to logging disk. The pdid obviously stands for disk ID. Also you could
add wait parameter to servicemag start. Task will not run in background and you
will have visibility on the whole process.
Command to use: servicemag start -log -pdid 8, which is shown on below image.
According to documentation:
Any I/O on the chunklets marked normal,smag, changes the states to logging and
I/O is written to the logging logical disks.
2. Now you can initiate the movement process for data from specified physical
disk to location chosen by system, which is one of the main steps in terms of
disk replacement.
The suitable command is movepdtospare with -vacate option. Vacate option
makes moves pernament and removes source tags after recolocation. The -f
parameter means that no confirmation is required.
where:
-perm: chunklet are moved pernamently and original location will be forgot.
-ovrd: allows to move chunklet to some destination even if it will have impact
on quality. Option is necessary with -perm parameter.
3. Time to see whether we have any spare chunklets on disk designated for
removal, as previous step only moved data chunklets.
To display chunklets marked as spare use showpdch -spr command.
3PAR-cluster cli% showpdch -spr <pd_id>
4. Time to see whether we have any spare chunklets on disk designated for
removal, as previous step only moved data chunklets.
Command designated for that kind of task is shown below. It will remove all
spare chunklets off the disk. After execution check again whether any spares
exist.
3PAR-cluster cli% removespare <pd_id>:a
5. After all previous steps you can safely remove physical disk definition from
system. Hold on with physical disk replacement at this step.
3PAR-cluster cli% dismisspd <pd_id>
6. Check if dismissed disk shown us new. If yes, then it can be safely remove
from magazine.
7. In case you put in new disk and disk will not be automatically added to the
system, you have to do it manually
First thing is to determine the WWN of disk. Check this with showpd -i
command.
3PAR-cluster cli% showpd -i <pd_id>
After that use admitpd command to make new disk operational for system.
3PAR-cluster cli% admitpd <disk_wwn>