Cluster Server 7.4 Agent for EMC SRDF Configuration Guide - Windows
- Introducing the agent for EMC SRDF
- Configuring the agent for EMC SRDF
- Testing VCS disaster recovery support with EMC SRDF
- How VCS recovers from various disasters in an HA/DR setup with EMC SRDF
- Setting up fire drill
Failure scenarios in global clusters
The following table lists the failure scenarios in a global cluster configuration and describes the behavior of VCS and the agent in response to the failure.
Table: Failure scenarios in a global cluster configuration with the VCS agent for EMC SRDF
Failure | Description and VCS response |
---|---|
Application failure | Application cannot start successfully on any hosts at the primary site. VCS response at the secondary site:
The agent Write enables the devices at the secondary site. For dynamic RDF devices, if the value of the SwapRoles attribute of the SRDF resource is 1, the agent does the following :
See Performing failback after a node failure or an application failure. |
Host failure | All hosts at the primary site fail. VCS response at the secondary site:
The agent write enables the devices at the secondary site. For dynamic RDF devices, if the value of the SwapRoles attribute of the SRDF resource is 1, the agent does the following:
See Performing failback after a node failure or an application failure. |
Site failure | All hosts and the storage at the primary site fail. A site failure renders the devices on the array at the secondary site in the PARTITIONED state. VCS response at the secondary site:
Agent response: The agent does the following on the secondary site in case of a manual failover based on the value of the AutoTakeover attribute of the SRDF resource:
|
Replication link failure | Replication link between the arrays at the two sites fails. A replication link failure renders the SRDF devices in the PARTITIONED state. When the link is restored, the SRDF devices attain the SUSPENDED state. VCS response: No action. Agent response: No action. The agent does not monitor the replication link status and cannot detect link failures. After the link is restored, you must resynchronize the SRDF devices. To resynchronize the SRDF devices after the link is restored:
If you initiate a failover to the secondary site when resync is in progress, the online function of the EMC SRDF agent waits for the resync to complete and then initiates a takeover of the R2 devices. Note: If you did not configure BCV or target devices and if disaster occurs when resync is in progress, then the data at the secondary site becomes inconsistent. Veritas recommends configuring BCV or target devices at both the sites. |
Network failure | The network connectivity and the replication link between the sites fail. VCS response at the secondary site:
To resynchronize the data after the network link is restored:
Agent response: Similar to the site failure. |
Storage failure | The array at the primary site fails. A storage failure at the primary site renders the devices on the array at the secondary site in the PARTITIONED state. VCS response at the secondary site:
Agent response: The agent does the following on the secondary site in case of a manual failover based on the value of the AutoTakeover attribute of the SRDF resource:
|