Veritas™ Resiliency Platform 2.2 Solutions for VMware

Last Published:
Product(s): Resiliency Platform & CloudMobility (2.2)

Predefined risks in Resiliency Platform

The following table lists the predefined risks available in Resiliency Platform. These risks are reflected in the current risk report and the historical risk report.

Table: Predefined risks

| Risk | Description | Risk detection time | Risk type | Affected operation | Fix if violated |
|---|---|---|---|---|---|
| Veritas InfoScale Operations Manager disconnected | Checks the connection state between Veritas InfoScale Operations Manager and Resiliency Manager | 1 minute | Error | All operations | Check Veritas InfoScale Operations Manager reachability. Try to reconnect Veritas InfoScale Operations Manager. |
| vCenter Password Incorrect | Checks if the vCenter password is incorrect | 5 minutes | Error | On primary site: start or stop operations; on secondary site: migrate or takeover operations | In case of a password change, resolve the password issue and refresh the vCenter configuration |
| VM tools not installed | Checks if VM tools are not installed. This may affect IP customization and VM shutdown. | Real time, when resiliency group is created | Error | Migrate; Stop | For VMware, install VMware Tools; for Hyper-V, install Hyper-V Integration Tools |
| Snapshot removed from Virtual Machine | Checks if a snapshot has been removed from the virtual machine | 5 minutes | Error | Resiliency Platform Data Mover replication | Edit the resiliency group to refresh configuration |
| Snapshot reverted on Virtual Machine | Checks if a snapshot has been reverted on the virtual machine | 5 minutes | Error | Resiliency Platform Data Mover replication | Edit the resiliency group to remove the virtual machine and then add it back |
| Data Mover Daemon Crash | Checks if the VM Data Mover filter is not able to connect to its counterpart in ESX | 5 minutes | Error | Resiliency Platform Data Mover replication | To continue the replication, move (vMotion) the VM to a different ESX node in the cluster, and then either troubleshoot the issue with the original ESX node or raise a support case with Veritas |
| Snapshot created on Virtual Machine | Checks if a snapshot has been created on the virtual machine | 5 minutes | Error | Resiliency Platform Data Mover replication | Edit the resiliency group to refresh configuration |
| DataMover virtual machine in noop mode | Checks if the VM Data Mover filter is not able to connect to its counterpart in ESX | 5 minutes | Error | Resiliency Platform Data Mover replication | To continue the replication, move (vMotion) the VM to a different ESX node in the cluster, and then either troubleshoot the issue with the original ESX node or raise a support case with Veritas |
| Resiliency group configuration drift | Checks if the disk configuration of any of the assets in the resiliency group has changed | 30 minutes | Error | Migrate; Resync | Edit the resiliency group to first remove the impacted virtual machine and then add it back |
| Global user deleted | Checks if there are no global users. In this case, the user cannot customize the IP for Windows machines in a VMware environment. | Real time | Warning | Migrate; Takeover | Edit the resiliency group or add a Global user |
| Missing heartbeat from Resiliency Manager | Checks for heartbeat failure from a Resiliency Manager | 5 minutes | Error | All operations | Fix the Resiliency Manager connectivity issue |
| Infrastructure Management Server disconnected | Checks the connection state between the Infrastructure Management Server (IMS) and Resiliency Manager (RM) | 1 minute | Error | All operations | Check IMS reachability. Try to reconnect IMS. |
| Storage Discovery Host down | Checks if the discovery daemon is down on the storage discovery host | 15 minutes | Error | Migrate | Resolve the discovery daemon issue |
| DNS removed | Checks if DNS is removed from the resiliency group where DNS customization is enabled | Real time | Warning | Migrate; Takeover | Edit the resiliency group and disable DNS customization |
| IOTap driver not configured | Checks if the IOTap driver is not configured | 2 hours | Error | None | Configure the IOTap driver. This risk is removed when the workload is configured for disaster recovery. |
| VMware Discovery Host Down | Checks if the discovery daemon is down on the VMware Discovery Host | 15 minutes | Error | Migrate | Resolve the discovery daemon issue |
| VM restart is pending | Checks if the VM has not been restarted after the add host operation | 2 hours | Error | Configure DR | Restart the VM after the add host operation |
| New VM added to replication storage | Checks if a virtual machine that is added to a Veritas Replication Set on a primary site is not a part of the resiliency group | 5 minutes | Error | Migrate; Takeover; Rehearsal | Add the virtual machine to the resiliency group |
| Replication lag exceeding RPO | Checks if the replication lag exceeds the thresholds defined for the resiliency group. This risk affects the SLA for the services running on your production data center. | 5 minutes | Warning | Migrate; Takeover | Check if the replication lag exceeds the RPO that is defined in the Service Objective |
| Replication state broken/critical | Checks if the replication is not working or is in a critical condition for each resiliency group | 5 minutes | Error | Migrate; Takeover | Contact the enclosure vendor |
| Remote mount point already mounted | Checks if the mount point is not available for mounting on the target site for either of the following reasons: the mount point is already mounted; the mount point is being used by other assets | Native (ext3, ext4, NTFS): 30 minutes; Virtualization (VMFS, NFS): 6 hours | Warning | Migrate; Takeover | Unmount the mount point that is already mounted or is being used by other assets |
| Disk utilization critical | Checks if at least 80% of the disk capacity is being utilized. The risk is generated for all the resiliency groups associated with that particular file system. | Native (ext3, ext4, NTFS): 30 minutes; Virtualization (VMFS, NFS): 6 hours | Warning | Migrate; Takeover; Rehearsal | Delete or move some files, or uninstall some non-critical applications, to free up disk space |
| ESX not reachable | Checks if the ESX server is in a disconnected state | 5 minutes | Error | On primary site: start or stop operations; on secondary site: migrate or takeover operations | Resolve the ESX server connection issue |
| vCenter Server not reachable | Checks if the virtualization server is unreachable or if the password for the virtualization server has changed | 5 minutes | Error | On primary site: start or stop operations; on secondary site: migrate or takeover operations | Resolve the virtualization server connection issue. In case of a password change, resolve the password issue. |
| Insufficient compute resources on failover target | Checks if there are insufficient CPU resources on the failover target in a virtual environment | 6 hours | Warning | Migrate; Takeover | Reduce the number of CPUs assigned to the virtual machines on the primary site to match the available CPU resources on the failover target |
| Host not added on recovery data center | Checks if the host is not added to the IMS on the recovery data center | 30 minutes | Error | Migrate | Check the following and fix: the host is up on the recovery data center; the host is accessible from the recovery data center IMS; time is synchronized between the host and the recovery data center IMS |
| NetBackup Notification channel disconnected | Checks the connection state of the NetBackup Notification channel | 5 minutes | Error | Restore | Check if the NetBackup Notification channel is added to the NetBackup master server |
| Backup image violates the defined RPO | Checks if the backup image violates the defined RPO | 30 minutes | Warning | No operation | Check the connection state of the NetBackup Notification channel; check for issues due to which backup images are not available |
| NetBackup master server disconnected | Checks if the NetBackup master server is disconnected or not reachable | 5 minutes | Error | Restore | Check if IMS is added as an additional server to the NetBackup master server |
| Assets do not have copy policy | Checks if the assets do not have a copy policy | 3 hours | Warning | No operation | Set up a copy policy and then refresh the NetBackup master server |
| Target replication is not configured | Checks if the target replication is not configured | 3 hours | Warning | No operation | Configure target replication and then refresh the NetBackup master server |
| Disabled NetBackup Policy | Checks if the NetBackup policy associated with the virtual machine is disabled | 3 hours | Warning | No operation | Fix the disabled policy |
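
When planning an operation, it can help to look up which predefined risks list that operation as affected. The following is a minimal sketch of that lookup, not a Resiliency Platform API: the `Risk` class, the `RISKS` list, and the `risks_affecting` function are hypothetical names introduced for illustration, and the data is a hand-transcribed subset of the table above.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Risk:
    """One row of the predefined risks table (hypothetical model, illustration only)."""
    name: str
    detection_minutes: Optional[int]  # None stands for "Real time"
    risk_type: str                    # "Error" or "Warning"
    affected_operations: Tuple[str, ...]


# A subset of rows copied by hand from the table; not the complete list.
RISKS = [
    Risk("VM tools not installed", None, "Error", ("Migrate", "Stop")),
    Risk("Resiliency group configuration drift", 30, "Error", ("Migrate", "Resync")),
    Risk("Global user deleted", None, "Warning", ("Migrate", "Takeover")),
    Risk("Storage Discovery Host down", 15, "Error", ("Migrate",)),
    Risk("Replication lag exceeding RPO", 5, "Warning", ("Migrate", "Takeover")),
    Risk("NetBackup master server disconnected", 5, "Error", ("Restore",)),
]


def risks_affecting(operation: str, risk_type: Optional[str] = None) -> List[str]:
    """Return names of risks that list `operation`, optionally filtered by risk type."""
    return [
        risk.name
        for risk in RISKS
        if operation in risk.affected_operations
        and (risk_type is None or risk.risk_type == risk_type)
    ]


if __name__ == "__main__":
    # Risks to review before a takeover
    print(risks_affecting("Takeover"))
    # Only the Error-type risks that affect a migrate
    print(risks_affecting("Migrate", risk_type="Error"))
```

Rows whose affected operation depends on the site (for example, vCenter Server not reachable) or whose detection time depends on the file system type are left out of this sketch to keep the model simple.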