Session #1
Understanding Metrocluster with Continuous Access P9000/XP
© Copyright 2011 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. HP Confidential
Presenters
Deepthi Velisetti
Ananth Arumugam
Key Takeaways
Objectives for this session
AGENDA
• High Availability and Disaster Recovery Solutions
• Overview of Continuous Access
• Overview of Metrocluster CA P9000/XP
• Configuring Metrocluster CA P9000/XP
• Break 15 min
• Failover/Failback Scenarios
• Best Practices
• Troubleshooting
• Documentation for further reading
• Q&A
1. Open the Questions window
2. Enter your question and click Submit
3. Use Me Too if interested in other questions
4. Click the arrows to view the responses
Overview of High Availability and Disaster Recovery Solutions
Terminology
High Availability and Disaster Recovery Solutions
• High Availability: Local cluster with one site in a single data center
• Disaster Recovery Solutions: Stretched or redundant clusters across different data centers with data replication
Local Serviceguard Cluster
• Redundant networks carrying cluster heartbeats
• Mirroring (1 cluster)
HP Metrocluster
Stretched Cluster with Data Replication
• Redundant networks carrying cluster heartbeats (1 cluster)
• Synchronous or asynchronous replication
HP Continentalclusters
Redundant Clusters with Data Replication
• Serviceguard Cluster 1 and Serviceguard Cluster 2
• Each cluster has redundant networks carrying cluster heartbeats (1 cluster each)
Serviceguard Disaster Recovery Solutions Products
DRS product | Extended Distance Cluster | Metrocluster | Continentalclusters
Distance | Up to 100 km | Up to 300 km | More than 300 km
Replication type | Host-based mirroring | Array-based replication | Array-based or software-based replication
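As a rough illustration of how the distance column in the table above can be read, the following Python sketch returns the products whose stated distance range covers a given site separation. The function name is hypothetical and the thresholds are simply the figures from the table; an actual product choice depends on more than distance.

```python
def candidate_products(distance_km):
    """Return the DRS products whose stated distance range covers distance_km.

    Thresholds are taken from the table above (100 km, 300 km); the function
    name and the list-based result are illustrative assumptions.
    """
    if distance_km < 0:
        raise ValueError("distance must be non-negative")
    products = []
    if distance_km <= 100:
        products.append("Extended Distance Cluster")
    if distance_km <= 300:
        products.append("Metrocluster")
    if distance_km > 300:
        products.append("Continentalclusters")
    return products


if __name__ == "__main__":
    for km in (50, 200, 500):
        print(km, candidate_products(km))
```

For two data centers 200 km apart, only Metrocluster falls within the listed range.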
Continuous Access - Device Groups
A Device Group is a set of XP logical disks for replication, with CA defined between them.
Continuous Access – Types of Replication Mode
Continuous Access Synchronous Replication
Fence Levels in Synchronous Replication
Understanding Async replication further
Understanding Continuous Access Async
A copy of the data with a sequence number is saved in an internal buffer, known as the side file, for later transmission to the remote XP disk array.
Understanding Continuous Access Journal
• Uses “disk-based journaling” and “pull-style replication”
• The pull-style replication engine contributes to resource optimization
• The journal contains metadata, such as a time stamp and a sequence number, to enable correct write ordering, ensuring data consistency and integrity
Fence Level in Asynchronous Replication
Continuous Access Fence Level
Fence Level/Replication Mode | DATA | NEVER | ASYNC
Continuous Access Pair States
• PSUE / PDUB: Affinity between the pairs is suspended due to a hardware error
Overview of Metrocluster CA P9000/XP
Metrocluster CA P9000/XP
Solution Configuration
•Software*:
• Metrocluster with Continuous Access for P9000/XP A.11.00
• Serviceguard A.11.20
• MCDREnabler A.10.01
• RAID Manager 01.24.13 (P9000), 01.23.08 (XP)
•Hardware
• Equal number of nodes on both sites (unequal numbers of nodes are not supported)
• Quorum Server or arbitrator nodes on a 3rd site
• XP or P9000 series disk arrays
• DWDM or SONET/SDH links between the sites
Serviceguard requires:
• Round-trip latency not exceeding 200 milliseconds
Synchronous replication:
• Typically ~3 milliseconds
Asynchronous replication:
• Typically ~10 milliseconds
Metrocluster CA P9000/XP Solution Support
Supported arrays, servers, and workloads on HP-UX
• Supported arrays: XP512/48, XP1024/128, XP10000, XP12000, XP20000, XP24000, P9500
• Supported servers: HP 9000, HP Integrity
• Supported volume managers: VxVM/VxFS/CVM/CFS 5.0/5.0.1/5.1 SP1; LVM 1.0/LVM 2.x/SLVM 1.0/SLVM 2.x
• Supported workloads: Oracle 10g R2/11g R1/11g R2 RAC *; SAP using SGeSAP; HP Integrity VM; ECMT applications
Metrocluster CA P9000/XP - Solution Architecture
[Diagram: two main data centers (Site 1 and Site 2), a 3rd location (Site 3), and clients]
• 3rd location (Site 3): Quorum Service protected in a separate cluster
• Each main data center (sites 1 and 2) must have the same number of nodes
• Data Center 1 (Site 1): Serviceguard sub-cluster A, up to 8 nodes, IP subnet K, volume manager SLVM or CVM, Ethernet network
• Data Center 2 (Site 2): Serviceguard sub-cluster B, up to 8 nodes, IP subnet K or L (different subnets supported), volume manager SLVM or CVM, Ethernet network
• Sites up to 300 km apart with DWDM; TCP/IP and data replication run over DWDM channels
• Bi-directional data replication using Continuous Access XP or EVA, or EMC SRDF
Metrocluster CA P9000/XP – Solution Components
• RM: RAID Manager, configured to start as part of boot
• CMD: Command device, presented to all nodes in the site
• MC: Metrocluster software; package with Metrocluster module
[Diagram: every node at both sites runs SG, RM, and MC; the sites are connected over DWDM]
Metrocluster CA P9000/XP – Components
Metrocluster Software Bundle
•Metrocluster Binary
• Continuous Access storage preparation logic
•Metrocluster Module
• dts/mcxpca
• Glue that binds Continuous Access replication handling logic with Serviceguard
•Commands and Utilities
• cmdrprev
• Disk Group monitor service
• Sample RC scripts to start RAID Manager
Metrocluster P9000/XP - How it Works
Participants: Serviceguard, Metrocluster Module, Metrocluster Binary, RAID Manager, Command Device
1. Serviceguard starts the package.
2. The Metrocluster module calls the MC binary.
3. The MC binary queries the device group status (request and response between RAID Manager and the command device).
4. RAID Manager returns the device group status.
5. The MC binary issues the appropriate takeover command (request and response between RAID Manager and the command device).
6. RAID Manager returns the command status.
7. The MC binary returns its decision.
Metrocluster CA P9000/XP - Features
Solution Capabilities
•Traditional Serviceguard Failover
• Preferred and Adoptive node Failover Model
•Automatic Site Aware Failover *
• Failover within the site before failing over to remote site
•Manual Site Failover *
• Package will remain down if there are no nodes to run in the current site
• Operator must start it manually on desired site after fixing the problem
•Device Group Monitor
• A service to monitor replication and protect data at remote site.
•Data Replication preview
• Check what will happen to replication if package is to failover to the node where the command is run
•Site Aware Disaster Tolerant Architecture
• Site Controller
•Failover complex workloads configured using multiple interdependent multi-node packages across sites
• Sub Cluster
•Allow upper layer clusters to be created in a site local fashion
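The site-aware failover behavior listed above (fail over within the site first, then to the remote site, or stay down for manual site failover) can be sketched as a small node-selection function. This is a minimal illustration and not Serviceguard's implementation: the function name and the `(node, site)` tuple layout are hypothetical, and the policy strings `site_preferred` and `site_preferred_manual` are assumed to be the Serviceguard failover policy names behind the two starred features.

```python
def choose_failover_node(up_nodes, current_site, policy="site_preferred"):
    """Pick the next node for a package, preferring nodes in the current site.

    up_nodes: list of (node_name, site_name) tuples for nodes that are up and
              eligible, in order of preference (hypothetical data layout).
    Returns the chosen node name, or None if the package should stay down.
    """
    # 1. Fail over within the site before considering the remote site.
    same_site = [node for node, site in up_nodes if site == current_site]
    if same_site:
        return same_site[0]
    # 2. Manual site failover: the package stays down until an operator starts it.
    if policy == "site_preferred_manual":
        return None
    # 3. Automatic site-aware failover: move to the remote site.
    other_site = [node for node, site in up_nodes if site != current_site]
    return other_site[0] if other_site else None


if __name__ == "__main__":
    nodes = [("N2", "SITE1"), ("N3", "SITE2"), ("N4", "SITE2")]
    print(choose_failover_node(nodes, "SITE1"))
    print(choose_failover_node(nodes[1:], "SITE1"))
    print(choose_failover_node(nodes[1:], "SITE1", "site_preferred_manual"))
```

Returning `None` models the case where the operator must start the package manually on the desired site after fixing the problem.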
Site Aware Failover – Automatic Inter Site Failover
[Diagram: automatic failover of package P across nodes N1 to N4, steps 1 to 4]
Site Aware Failover - Manual Inter Site
• Manual inter site failover (site_preferred_manual)
• Manual package start
[Diagram: package P across nodes N1 to N4, steps 1 to 4]
Device Group Monitor
Monitor and Automate Replication Management
• Monitor the status of the P9000 or XP Continuous Access device group used in a package and raise notifications
Configuring Metrocluster CA P9000/XP
Configuring Metrocluster CA P9000/XP
High Level Steps
• Set up hardware and install required software
• Configure Serviceguard cluster
• Configure CA Device Group in RAID Manager
• Install and configure application software (place data on the replicated disks)
• Configure application as Metrocluster packages
Configuring RAID Manager Instance
Perform the steps on all nodes
• Install RAID Manager
• Select a RAID Manager instance number to use with Metrocluster
• Add an entry for the RAID Manager instance in /etc/services
– horcm<instance-number> <port-number>/udp
• Identify command devices to use with the RAID Manager instance
– ioscan -fnC disk | grep OPEN-CM
• Create the RAID Manager configuration file with the identified instance number
– Use the template RAID Manager configuration file
– # cp /etc/horcm.conf /etc/horcm0.conf
• Specify the following sections in the RAID Manager configuration file:
– HORCM_CMD: Command device name (more than one is allowed)
– HORCM_MON: hostname
• Start the RAID Manager instance
– # horcmstart.sh <instance number>
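The per-instance names used in the steps above all derive from the chosen instance number. The sketch below builds them in Python, assuming the service name is `horcm` followed directly by the instance number (as in the `/etc/horcm0.conf` file name). The helper names are hypothetical and port 11000 is only an illustrative value, not a required one.

```python
def services_entry(instance, port):
    """Build an /etc/services line for a RAID Manager instance (UDP)."""
    if not 0 < port < 65536:
        raise ValueError("port must be in 1..65535")
    return "horcm{0}\t{1}/udp".format(instance, port)


def horcm_config_path(instance):
    """Configuration file name for the instance, e.g. /etc/horcm0.conf."""
    return "/etc/horcm{0}.conf".format(instance)


def horcm_start_command(instance):
    """Command line that starts the instance, as shown in the steps above."""
    return "horcmstart.sh {0}".format(instance)


if __name__ == "__main__":
    instance = 0  # instance number selected for Metrocluster (example value)
    print(services_entry(instance, 11000))  # port 11000 is an assumption
    print(horcm_config_path(instance))
    print(horcm_start_command(instance))
```

Keeping the instance number in one place helps ensure that the same names are used on every node.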
RAID Manager Configuration File
Basic RAID Manager without any CA Device Groups
Configure CA Device Group with RAID Manager instance
Perform the steps on all nodes
• Specify LUNs for the CA Device Group using the HORCM_DEV keyword in the RAID Manager configuration file
– ls /dev/rdsk/* | raidscan -find -fx -I<instance number>
• Specify the remote node IP and RAID Manager port in the RAID Manager configuration file under the HORCM_INST section
• Restart the RAID Manager instance
– # horcmstart.sh <instance number>
RAID Manager Configuration File
With CA Device Groups Configured
Creating Device Groups
• To create a CA Async Device Group:
# paircreate -g <devgroup> -f async -vl -c 15
• To create a CA Async Journal Device Group:
# paircreate -g <device_group> -f async -vl -c 15 -jp <id> -js <id>
Metrocluster Cluster deployment
Configuration steps
Traditional approach
• Hardware configuration: Install OS; connect the network interfaces between nodes; connect the storage HW.
• Security configuration: Populate entries in cmclnodelist file.
• System configuration: Configure /etc/nsswitch.conf, /etc/hosts, and /etc/inetd.conf.
• Final configuration: Generate cluster ASCII file using cmquerycl; edit cluster ASCII file; apply configuration file.
Easy deployment approach
• Hardware configuration: Install OS; connect the network interfaces between nodes; connect the storage HW.
• Easy deployment: cmdeploycl command in CLI.
Metrocluster Cluster Configuration
Traditional Approach
• CROSS SUBNET:
cmquerycl -w full -n <node 1> -n <node 2> … -q <quorum server name> -C <cluster configuration file name>
•Customize the cluster configuration file
• Specify the quorum server IP address
• Specify the arbitrator nodes using the NODE_NAME attribute
• When using Site Aware failover policies, specify site definitions (optional)
•Check and apply the cluster configuration file
cmapplyconf -C <cluster configuration file>
Metrocluster – Cluster configuration file
• CLUSTER_NAME mcia07_cluster
• QS_HOST fep3.ind.hp.com
• QS_POLLING_INTERVAL 300000000
• SITE_NAME SITE1
• SITE_NAME SITE2
• NODE_NAME mcia07
– SITE SITE1
– NETWORK_INTERFACE lan0
– HEARTBEAT_IP 15.154.63.34
• NODE_NAME mcia08
– SITE SITE2
– NETWORK_INTERFACE lan2
– HEARTBEAT_IP 15.154.63.43
• MEMBER_TIMEOUT 14000000
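Because Metrocluster expects the same number of nodes at both sites, a quick sanity check on a cluster configuration file like the one above is to count nodes per SITE. The following Python sketch does that on the text of the file. The function names are hypothetical, the parsing is deliberately simplistic (whitespace-separated keyword and value), and nodes without a SITE entry, such as arbitrators, are ignored.

```python
from collections import Counter


def nodes_per_site(config_text):
    """Count NODE_NAME entries per SITE in a cluster configuration file."""
    counts = Counter()
    current_node = None
    for raw_line in config_text.splitlines():
        tokens = raw_line.split()
        if len(tokens) < 2:
            continue
        if tokens[0] == "NODE_NAME":
            current_node = tokens[1]
        elif tokens[0] == "SITE" and current_node is not None:
            counts[tokens[1]] += 1
            current_node = None  # count each node at most once
    return counts


def sites_balanced(config_text):
    """True when exactly two sites are used and both have the same node count."""
    counts = nodes_per_site(config_text)
    return len(counts) == 2 and len(set(counts.values())) == 1


SAMPLE = """
SITE_NAME SITE1
SITE_NAME SITE2
NODE_NAME mcia07
  SITE SITE1
  NETWORK_INTERFACE lan0
  HEARTBEAT_IP 15.154.63.34
NODE_NAME mcia08
  SITE SITE2
  NETWORK_INTERFACE lan2
  HEARTBEAT_IP 15.154.63.43
"""

if __name__ == "__main__":
    print(dict(nodes_per_site(SAMPLE)))
    print(sites_balanced(SAMPLE))
```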
Cluster Deployment Examples
Using cmdeploycl command
Attribute | Meaning
DEVICE_GROUP | The RAID Manager device group for this package. This device group is defined in the /etc/horcm<#>.conf file.
HORCMINST | The RAID Manager instance that the script communicates with. This instance of RAID Manager must be started on all nodes before this package can be started successfully.
Package Attributes
AUTO_FENCEDATA_SPLIT
Attribute AUTO_FENCEDATA_SPLIT
Local State Remote State Fence Level Values
Package Attribute
AUTO_NONCURDATA FENCE: NEVER or DATA
Attribute AUTO_NONCURDATA
• Value = 1 or FORCEFLAG=yes: Starts with a warning message about non-current data in the package’s control log file.
Local state SVOL_PAIR with remote state PVOL_PSUE, EX_ENORMT, or EX_CMDIOE:
• Value = 0 (default): Do not start. Exit with 1.
• Value = 1 or FORCEFLAG=yes: Perform SVOL takeover, which changes SVOL to PSUS (SSWS). After the takeover succeeds, the package starts with a warning message about non-current data in the package’s control log file.
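The SVOL_PAIR rows above amount to a small decision rule. The Python sketch below models only those rows, reading the first state column as local and the second as remote (the column order used on the other attribute slides). The function name, return shape, and action strings are hypothetical, and the real decision is made by the Metrocluster binary.

```python
# Remote states listed next to a local SVOL_PAIR state in the table above.
REMOTE_STATES = {"PVOL_PSUE", "EX_ENORMT", "EX_CMDIOE"}


def noncurdata_decision(local_state, remote_state, auto_noncurdata=0, forceflag=False):
    """Return (action, exit_code) for the SVOL_PAIR rows of the table.

    action is one of the hypothetical labels "do_not_start",
    "svol_takeover_then_start_with_warning", or "not_covered".
    """
    if local_state != "SVOL_PAIR" or remote_state not in REMOTE_STATES:
        return ("not_covered", None)  # other rows are outside this sketch
    if auto_noncurdata == 1 or forceflag:
        # SVOL takeover changes the SVOL to PSUS (SSWS); the package then
        # starts and a non-current-data warning is logged.
        return ("svol_takeover_then_start_with_warning", 0)
    # Default (value 0): do not start, exit with 1.
    return ("do_not_start", 1)


if __name__ == "__main__":
    print(noncurdata_decision("SVOL_PAIR", "PVOL_PSUE"))
    print(noncurdata_decision("SVOL_PAIR", "EX_ENORMT", auto_noncurdata=1))
    print(noncurdata_decision("SVOL_PAIR", "EX_CMDIOE", forceflag=True))
```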
Package Attribute
AUTO_NONCURDATA FENCE: Async Journal
Attribute AUTO_NONCURDATA
Package Attribute
AUTO_NONCURDATA and FENCE: Async
Attribute AUTO_NONCURDATA
Package Attributes
Attribute AUTO_SVOLPSUS
Local State Remote State Fence Level Values
Package Attributes
AUTO_PSUEPSUS
Attribute AUTO_PSUEPSUS
Local State Remote State Fence Level Values
Package Attributes
Attribute AUTO_PSUSSSWS
Local State Remote State Fence Level Values
Package Attributes
Attribute AUTO_SVOLPFUS
Local State Remote State Fence Level Values
Package Attributes
Attribute AUTO_SVOLPSUE
Local State Remote State Fence Level Values
Package Attributes
Attribute | Meaning
HORCTIMEOUT | In asynchronous mode, the timeout value passed to the horctakeover command (-t <timeout>): the time to wait while horctakeover re-synchronizes the delta data from the PVOL to the SVOL.
• Set the HORCTIMEOUT variable to a value greater than the Continuous Access link timeout value.
• The package startup timeout value must be greater than the HORCTIMEOUT value.
Pkg Startup Timeout > HORCTIMEOUT > Continuous Access link timeout value
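The ordering rule above is easy to check mechanically before applying a package configuration. The sketch below assumes all three values are expressed in the same unit (seconds here); the function name and the example numbers are illustrative only.

```python
def check_timeouts(pkg_startup_timeout, horctimeout, ca_link_timeout):
    """Return a list of violations of
    Pkg Startup Timeout > HORCTIMEOUT > Continuous Access link timeout.

    All three values must use the same unit (assumed to be seconds).
    """
    problems = []
    if not horctimeout > ca_link_timeout:
        problems.append("HORCTIMEOUT must be greater than the CA link timeout")
    if not pkg_startup_timeout > horctimeout:
        problems.append("package startup timeout must be greater than HORCTIMEOUT")
    return problems


if __name__ == "__main__":
    # Example values only; they are not recommendations.
    print(check_timeouts(600, 360, 300))
    print(check_timeouts(300, 360, 300))
```

An empty list means the three values satisfy the required ordering.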
Device Group Monitor
Setup Device Group Monitor for a Metrocluster Package
Configure DG Monitor Service in the Metrocluster Package
Attribute Values
SERVICE_NAME Specify unique Service Name
SERVICE_CMD /usr/sbin/DRMonitorXPCADevGrp
SERVICE_RESTART Unlimited
Device Group Monitor
Only Monitor and Notify CA Device Group Failures
Specify DG Monitor Attributes as below
Attribute Value
MON_POLL_INTERVAL 10 Minutes
MON_NOTIFICATION_FREQUENCY 3
MON_NOTIFICATION_EMAIL [email protected]
MON_NOTIFICATION_SYSLOG 1
MON_NOTIFICATION_CONSOLE 1
AUTO_RESYNC 0
Device Group Monitor
Notify on CA Device Group Failures and Auto Resync on Link Recovery
Specify DG Monitor Attributes as below
Attribute Value
MON_POLL_INTERVAL 10 Minutes
MON_NOTIFICATION_FREQUENCY 3
MON_NOTIFICATION_EMAIL [email protected]
MON_NOTIFICATION_SYSLOG 1
MON_NOTIFICATION_CONSOLE 1
AUTO_RESYNC 1: Automatically resync PVOL to SVOL when links are restored
2: Initiate resync only if the package directory has a MON_RESYNC file
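The three AUTO_RESYNC settings shown on these slides (0, 1, and 2) can be summarized as a single predicate. This Python sketch is an illustration rather than the monitor's actual code: the function name is hypothetical, and it assumes that for value 2 the links must also be restored before a resync is attempted.

```python
import os


def should_resync(auto_resync, links_restored, package_dir):
    """Decide whether the device group monitor would start a resync.

    auto_resync: 0 = monitor and notify only, 1 = resync automatically,
                 2 = resync only when <package_dir>/MON_RESYNC exists.
    """
    if not links_restored:
        return False  # nothing to resync over while the links are down
    if auto_resync == 1:
        return True
    if auto_resync == 2:
        return os.path.isfile(os.path.join(package_dir, "MON_RESYNC"))
    return False  # 0 or any other value: notify only


if __name__ == "__main__":
    # The directory below is a made-up example path.
    print(should_resync(1, True, "/tmp/example_pkg_dir"))
    print(should_resync(0, True, "/tmp/example_pkg_dir"))
```

With value 2, an operator opts in to each resync by creating the MON_RESYNC file in the package directory.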
Device Group Monitor
Protect Data at SVOL side during a Full Copy when Synchronous replication
[Diagram: mirroring with BC at each site]
Failover/Failback Scenarios
Failover scenario 1
Reason for failover
• Failover to recovery site due to application failure on all nodes in primary site
Configuration
• Fence level = any supported value
• Replication type = any
Metrocluster Behavior
• Swaps the replication role by issuing horctakeover
Impact of Auto Variables or Forceflag
• None
Operator Intervention
Location State
Site 1 PVOL_PAIR
Site 2 SVOL_PAIR
Location State
Site 1 SVOL_PAIR
[Diagram: application (APP) over a PVOL/SVOL pair]
Failback scenario 1
Reason for failback
• Issues at the primary site have been fixed. The application can now start up in the primary site.
Configuration
• Fence level = any supported value
• Replication type = any
Metrocluster Behavior
• Swaps the replication role by issuing horctakeover
Impact of Auto Variables or Forceflag
• None
Operator Intervention
Location State
Site 1 PVOL_PAIR
Site 2 SVOL_PAIR
Location State
Site 1 SVOL_PAIR
[Diagram: application (APP) over a PVOL/SVOL pair]
Failover scenarios
Reason for failover
Failure of all nodes in the primary site
Configuration
•Fence level = any supported value
•Replication type = any except CAJ
Metrocluster Behavior
Do not start the package
Site2 PVOL_PAIR
Failover scenarios
Reason for failover
• Failure of all nodes in the primary site
Configuration
• Fence level = async
• Replication type = CAJ
Site2 PVOL_PAIR
Failback scenarios
Reason for failback
All nodes in the primary site are up
Configuration
•Fence level = any supported value
•Replication type = any
Metrocluster Behavior
Issue horctakeover - swap takeover
Impact of Auto Variables or Forceflag
None
Operator Intervention
• None
Location State
Site 2 PVOL_PAIR
Site 1 SVOL_PAIR
Location State
Site 2 SVOL_PAIR
Site 1 PVOL_PAIR
Failback scenarios
Reason for failback
All nodes in the primary site are up
Configuration
•Fence level = any supported value
•Replication type = Sync
Metrocluster Behavior
Issue pairresync -swapp and issue horctakeover - swap takeover
Impact of Auto Variables or Forceflag
None
Operator Intervention
• None
Location State
Site 1 PVOL_PAIR
Site 2 SVOL_SSWS
Location State
Site 2 SVOL_PAIR
Site 1 PVOL_PAIR
Failover scenarios
Reason for failover
• 1 Link failure followed by
• 2 application failure in all nodes
Configuration
• Fence level = any
• Replication type = any except CAJ
Metrocluster Behavior
• do not start package
Impact of Auto Variables or Forceflag
• Forceflag set, then start package
• If AUTO_SVOLPSUE=1, start package when recovery state is SVOL_PSUE and issue SVOL takeover
Location State
Site1 PVOL_PSUE
Site2 SVOL_PSUE
OR
Location State
Site1 PVOL_PAIR
Site2 SVOL_PAIR
[Diagram states: PVOL_PSUE / SVOL_PAIR, or PVOL_PSUE / SVOL_PSUE]
Configuration
• Fence level = any
• Replication type = any
Configuration
• Fence level = any
• Replication type = any except CAJ
Metrocluster Behavior
• do not start package
Impact of Auto Variables or Forceflag
Operator Intervention
Location State
Site1 EX_ENORMT
[Diagram states: ENORMT / SVOL, or ENORMT / SVOL_PSUE]
Configuration
• Fence level = async
• Replication type = CAJ
Metrocluster Behavior
• do not start package
PVOL_PSUE SVOL_SSWS
Configuration
• Fence level = any
• Replication type = any
Metrocluster Behavior
• do not start package
Impact of Auto Variables or Forceflag
Operator Intervention
• Touch FORCEFLAG in dts_pkg_dir if AUTO_SVOLPSUS=0
Location State
Site1 PVOL_PSUS
Location State
Site1 PVOL_PSUS
Site2 SVOL_SSWS
[Diagram states: PVOL_PSUS / SVOL_PSUS]
Failover/Failback scenarios
Reason for failover
• 1 Manual suspend followed by
• 2 failure of all nodes in primary side
Configuration
• Fence level = any
• Replication type = any
Metrocluster Behavior
• do not start package
[Diagram states: ENORMT / SVOL_PSUS]
Configuration
• Fence level = any
• Replication type = any
Best practices
Remote Command Device
• Increases the availability of remote array RAID Manager
[Diagram: nodes connected over the SAN to the local and remote P9000 arrays; each array has command devices (CMD) and remote command devices (RCMD), with CA pairs between the arrays]
Best practices
Cluster Verification
• Prevent errors due to periodic configuration changes
• Checks consistency between environment file and package
• Checks if all the disks are part of the device group
• Checks the RAID Manager version
When to run
• Periodically
• After any array configuration software, cluster software or Metrocluster software upgrades/patches
How to run
• Run as a cron job and save output to a file to analyze errors
Use supplied RC scripts to start RAID Manager instances during machine boot
Troubleshooting – Check Replication Status
Troubleshooting – where to look for errors and warnings
RAID Manager Log File
• /HORCM/log<HORCM instance #>/HORCC_<node name>.log
• e.g. /HORCM/log0/HORCC_node1.log
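The log file name follows directly from the instance number and node name. A small helper such as the one below (a hypothetical name, written in Python) can build the path when scripting log collection; it reproduces the example path shown above.

```python
def horcc_log_path(instance, node_name):
    """Build the RAID Manager command log path for an instance and node."""
    return "/HORCM/log{0}/HORCC_{1}.log".format(instance, node_name)


if __name__ == "__main__":
    print(horcc_log_path(0, "node1"))
```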
Troubleshooting – Looking out for Pair states that require Manual Intervention
• When the CA device group is in unknown state: Check if the RAID Manager instance is down and bring it up manually if it is down
• When pairs are in SMPL state: Pairs must be manually recreated if both the primary and secondary P9000 or XP Series disk arrays are in SMPL (simplex) state
Q&A
Miscellaneous Information
HA Quarterly Update Webcast
Takeaways
Serviceguard Solutions portfolio
•Demo of new features
•Overview of upcoming releases
Documentation for further reading
• Metrocluster with Continuous Access for P9000 and XP Manuals and White Papers
http://www.hp.com/go/hpux-serviceguard-docs > Select “HP Serviceguard Metrocluster with Continuous Access for P9000 and XP”
• Serviceguard Solutions Support Page
http://haweb.ind.hp.com/Support/
• Serviceguard Support Knowledge Base
http://haweb.ind.hp.com/Support/cgi-bin/QueryKDB.cgi
• Documentation for HP storage division
http://spock.corp.hp.com
Who to contact when
• Triage Issues (Call HP Response Center)
• Support Related Queries (Serviceguard Solutions Support [email protected])
– Which X is supported with Metrocluster?
– Which version of Y is supported with Serviceguard?
Backup
Troubleshooting – Status Checks
Pair and Journal Group Information
• pairdisplay -g oradb -fe
Group Seq#, LDEV# P/S, Status, Fence, %, P-LDEV# M CTG JID AP EM E-Seq# E-LDEV#
oradb 30053 64 P-VOL PAIR Never, 75 C8 - 1 0 2
oradb 30054 C8 S-VOL PAIR Never, 64 - - 1 0
Journal Volumes Information
• raidvchkscan -v jnl 0