InfoScale™ 9.0 Disaster Recovery Implementation Guide - Linux

Last Published:
Product(s): InfoScale & Storage Foundation (9.0)
Platform: Linux
  1. Section I. Introducing Storage Foundation and High Availability Solutions for disaster recovery
    1. About supported disaster recovery scenarios
      1.  
        About disaster recovery scenarios
      2. About campus cluster configuration
        1.  
          VCS campus cluster requirements
        2.  
          How VCS campus clusters work
        3.  
          Typical VCS campus cluster setup
      3. About replicated data clusters
        1.  
          How VCS replicated data clusters work
      4. About global clusters
        1.  
          How VCS global clusters work
        2.  
          User privileges for cross-cluster operations
        3. VCS global clusters: The building blocks
          1.  
            Visualization of remote cluster objects
          2.  
            About global service groups
          3. About global cluster management
            1.  
              About the wide-area connector process
            2.  
              About the wide-area heartbeat agent
            3.  
              Sample configuration for the wide-area heartbeat agent
          4. About serialization - The Authority attribute
            1.  
              About the Authority and AutoStart attributes
          5.  
            About resiliency and "Right of way"
          6.  
            VCS agents to manage wide-area failover
          7.  
            About the Steward process: Split-brain in two-cluster global clusters
          8.  
            Secure communication in global clusters
      5.  
        Disaster recovery feature support for components in the Veritas InfoScale product suite
      6.  
        Virtualization support for InfoScale 9.0 products in replicated environments
    2. Planning for disaster recovery
      1. Planning for cluster configurations
        1.  
          Planning a campus cluster setup
        2.  
          Planning a replicated data cluster setup
        3.  
          Planning a global cluster setup
      2. Planning for data replication
        1.  
          Data replication options
        2.  
          Data replication considerations
  2. Section II. Implementing campus clusters
    1. Setting up campus clusters for VCS and SFHA
      1. About setting up a campus cluster configuration
        1.  
          Preparing to set up a campus cluster configuration
        2.  
          Configuring I/O fencing to prevent data corruption
        3.  
          Configuring VxVM disk groups for campus cluster configuration
        4.  
          Configuring VCS service group for campus clusters
        5.  
          Setting up campus clusters for VxVM and VCS using Veritas InfoScale Operations Manager
      2.  
        Fire drill in campus clusters
      3.  
        About the DiskGroupSnap agent
      4. About running a fire drill in a campus cluster
        1.  
          Configuring the fire drill service group
        2.  
          Running a successful fire drill in a campus cluster
    2. Setting up campus clusters for SFCFSHA, SFRAC
      1.  
        About setting up a campus cluster for disaster recovery for SFCFSHA or SF Oracle RAC
      2.  
        Preparing to set up a campus cluster in a parallel cluster database environment
      3.  
        Configuring I/O fencing to prevent data corruption
      4.  
        Configuring VxVM disk groups for a campus cluster in a parallel cluster database environment
      5.  
        Configuring VCS service groups for a campus cluster for SFCFSHA and SF Oracle RAC
      6.  
        Tuning guidelines for parallel campus clusters
      7.  
        Best practices for a parallel campus cluster
  3. Section III. Implementing replicated data clusters
    1. Configuring a replicated data cluster using VVR
      1. About setting up a replicated data cluster configuration
        1.  
          About typical replicated data cluster configuration
        2.  
          About setting up replication
        3.  
          Configuring the service groups
        4.  
          Configuring the service group dependencies
      2. About migrating a service group
        1.  
          Switching the service group
      3.  
        Fire drill in replicated data clusters
    2. Configuring a replicated data cluster using third-party replication
      1.  
        About setting up a replicated data cluster configuration using third-party replication
      2.  
        About typical replicated data cluster configuration using third-party replication
      3.  
        About setting up third-party replication
      4.  
        Configuring the service groups for third-party replication
      5.  
        Fire drill in replicated data clusters using third-party replication
  4. Section IV. Implementing global clusters
    1. Configuring global clusters for VCS and SFHA
      1.  
        Installing and Configuring Cluster Server
      2. Setting up VVR replication
        1.  
          About configuring VVR replication
        2.  
          Best practices for setting up replication
        3. Creating a Replicated Data Set
          1. Creating a Primary RVG of an RDS
            1.  
              Prerequisites for creating a Primary RVG of an RDS
            2.  
              Example - Creating a Primary RVG containing a data volume
            3.  
              Example - Creating a Primary RVG containing a volume set
          2. Adding a Secondary to an RDS
            1.  
              Best practices for adding a Secondary to an RDS
            2.  
              Prerequisites for adding a Secondary to an RDS
          3. Changing the replication settings for a Secondary
            1. Setting the mode of replication for a Secondary
              1.  
                Example - Setting the mode of replication to asynchronous for an RDS
              2.  
                Example - Setting the mode of replication to synchronous for an RDS
            2.  
              Setting the latency protection for a Secondary
            3.  
              Setting the SRL overflow protection for a Secondary
            4.  
              Setting the network transport protocol for a Secondary
            5. Setting the packet size for a Secondary
              1.  
                Example - Setting the packet size between the Primary and Secondary
            6. Setting the bandwidth limit for a Secondary
              1.  
                Example: Limiting network bandwidth between the Primary and the Secondary
              2.  
                Example: Disabling Bandwidth Throttling between the Primary and the Secondary
              3.  
                Example: Limiting network bandwidth used by VVR when using full synchronization
        4. Synchronizing the Secondary and starting replication
          1. Methods to synchronize the Secondary
            1.  
              Using the network to synchronize the Secondary
            2.  
              Using block-level tape backup to synchronize the Secondary
            3.  
              Moving disks physically to synchronize the Secondary
          2. Using the automatic synchronization feature
            1.  
              Notes on using automatic synchronization
          3.  
            Example for setting up replication using automatic synchronization
          4.  
            About SmartMove for VVR
          5.  
            About thin storage reclamation and VVR
          6.  
            Determining if a thin reclamation array needs reclamation
        5. Starting replication when the data volumes are zero initialized
          1.  
            Example: Starting replication when the data volumes are zero initialized
      3.  
        Setting up third-party replication
      4. Configuring clusters for global cluster setup
        1.  
          Configuring global cluster components at the primary site
        2.  
          Installing and configuring VCS at the secondary site
        3.  
          Securing communication between the wide-area connectors
        4.  
          Configuring remote cluster objects
        5.  
          Configuring additional heartbeat links (optional)
        6.  
          Configuring the Steward process (optional)
      5. Configuring service groups for global cluster setup
        1.  
          Configuring VCS service group for VVR-based replication
        2.  
          Configuring a service group as a global service group
      6.  
        Fire drill in global clusters
    2. Configuring a global cluster with Storage Foundation Cluster File System High Availability, Storage Foundation for Oracle RAC, or Storage Foundation for Sybase CE
      1.  
        About global clusters
      2.  
        About replication for parallel global clusters using Storage Foundation and High Availability (SFHA) Solutions
      3.  
        About setting up a global cluster environment for parallel clusters
      4.  
        Configuring the primary site
      5. Configuring the secondary site
        1.  
          Configuring the Sybase ASE CE cluster on the secondary site
      6.  
        Setting up replication between parallel global cluster sites
      7.  
        Testing a parallel global cluster configuration
    3. Configuring global clusters with VVR and Storage Foundation Cluster File System High Availability, Storage Foundation for Oracle RAC, or Storage Foundation for Sybase CE
      1.  
        About configuring a parallel global cluster using Volume Replicator (VVR) for replication
      2. Setting up replication on the primary site using VVR
        1.  
          Creating the data and SRL volumes on the primary site
        2.  
          Setting up the Replicated Volume Group on the primary site
      3. Setting up replication on the secondary site using VVR
        1.  
          Creating the data and SRL volumes on the secondary site
        2.  
          Editing the /etc/vx/vras/.rdg files
        3.  
          Setting up IP addresses for RLINKs on each cluster
        4.  
          Setting up the disk group on secondary site for replication
      4.  
        Starting replication of the primary site database volume to the secondary site using VVR
      5. Configuring Cluster Server to replicate the database volume using VVR
        1.  
          Modifying the Cluster Server (VCS) configuration on the primary site
        2.  
          Modifying the VCS configuration on the secondary site
        3.  
          Configuring the Sybase ASE CE cluster on the secondary site
      6.  
        Replication use cases for global parallel clusters
  5. Section V. Reference
    1. Appendix A. Sample configuration files
      1. Sample Storage Foundation for Oracle RAC configuration files
        1.  
          sfrac02_main.cf file
        2.  
          sfrac07_main.cf and sfrac08_main.cf files
        3.  
          sfrac09_main.cf and sfrac10_main.cf files
        4.  
          sfrac11_main.cf file
        5.  
          sfrac12_main.cf and sfrac13_main.cf files
        6.  
          Sample fire drill service group configuration
      2. About sample main.cf files for Storage Foundation (SF) for Oracle RAC
        1.  
          Sample main.cf for Oracle 10g for CVM/VVR primary site
        2.  
          Sample main.cf for Oracle 10g for CVM/VVR secondary site
      3. About sample main.cf files for Storage Foundation (SF) for Sybase ASE CE
        1.  
          Sample main.cf for a basic Sybase ASE CE cluster configuration under VCS control with shared mount point on CFS for Sybase binary installation
        2.  
          Sample main.cf for a basic Sybase ASE CE cluster configuration with local mount point on VxFS for Sybase binary installation
        3.  
          Sample main.cf for a primary CVM VVR site
        4.  
          Sample main.cf for a secondary CVM VVR site

Modifying the VCS configuration on the secondary site

The following are highlights of the procedure to modify the existing VCS configuration on the secondary site:

  • Add the log owner and Replicated Volume Group (RVG) service groups.

  • Add a service group to manage the database and the supporting resources.

  • Define the replication objects and agents, such that the cluster at the secondary site can function as a companion to the primary cluster.

The following steps are similar to those performed on the primary site.

To modify VCS on the secondary site

  1. Log into one of the nodes on the secondary site as root.
  2. Use the following command to save the existing configuration to disk, and make the configuration read-only while making changes:
    # haconf -dump -makero
  3. Use the following command to make a backup copy of the main.cf file:
    # cd /etc/VRTSvcs/conf/config
    # cp main.cf main.orig
  4. Use vi or another text editor to edit the main.cf file. Edit the CVM group on the secondary site.

    Review the sample configuration file after the VCS installation to see the CVM configuration.

    See “To view sample configuration files for SF Oracle RAC”.

    See “To view sample configuration files for SF Sybase CE”.

    In our example, the secondary site has clus2 consisting of the nodes sys3 and sys4. To modify the CVM service group on the secondary site, use the CVM group on the primary site as your guide.

  5. Add a failover service group using the appropriate values for your cluster and nodes. Include the following resources:

    • RVGLogowner resource. The node on which the group is online functions as the log owner (node connected to the second cluster for the purpose of replicating data).

    • IP resource

    • NIC resources

    Example RVGLogowner service group:
    group rlogowner ( 
        SystemList = { sys3 = 0, sys4 = 1 } 
        AutoStartList = { sys3, sys4 } 
        )
    
        IP logowner_ip (
           Device = eth0
           Address = "10.11.9.102"   
           NetMask = "255.255.255.0"   
           ) 
    NIC nic (
           Device = eth0
    							NetworkHosts = { "10.10.8.1" }
    							NetworkType = ether
    							) 
    RVGLogowner logowner (   
           RVG = dbdata_rvg   
           DiskGroup = dbdatadg   
           )
    requires group RVGgroup online local firm
    logowner requires logowner_ip
    logowner_ip requires nic
  6. Add the RVG service group using the appropriate values for your cluster and nodes.

    The following is an example RVGgroup service group:

    group RVGgroup (  
        SystemList = { sys3 = 0, sys4 = 1 }  
        Parallel = 1  
        AutoStartList = { sys3, sys4 }  
        )
    
    RVGShared dbdata_rvg (  
        RVG = dbdata_rvg  
        DiskGroup = dbdatadg  
        )
    
        CVMVolDg dbdata_voldg (
            CVMDiskGroup = dbdatadg
            CVMActivation = sw   
            )
    
    requires group cvm online local firm
    dbdata_rvg requires dbdata_voldg
  7. It is advisible to modify "OnlineRetryLimit" & "OfflineWaitLimit" attribute of IP resource type to 1 on both the clusters:
    # hatype -modify IP  OnlineRetryLimit  1
    # hatype -modify IP  OfflineWaitLimit  1
  8. Add an database service group. Use the database service group on the primary site as a model for the database service group on the secondary site.

    • Define the database service group as a global group by specifying the clusters on the primary and secondary sites as values for the ClusterList group attribute.

    • Assign this global group the same name as the group on the primary site. For example, database_grp.

    • Include the ClusterList and ClusterFailOverPolicy cluster attributes. Veritas recommends using the Manual value.

    • Add the RVGSharedPri resource to the group configuration.

    • Remove the CVMVolDg resource, if it has been configured in your previous configuration. This resource is now part of the RVG service group.

    • Specify the service group to depend (online, local, firm) on the RVG service group.

    See configuration examples below.
  9. Save and close the main.cf file.
  10. Use the following command to verify the syntax of the /etc/VRTSvcs/conf/config/main.cf file:
      # hacf -verify /etc/VRTSvcs/conf/config
    				
  11. Stop and restart VCS.
    # hastop -all -force

    Wait for port h to stop on all nodes, and then restart VCS with the new configuration on all primary nodes one at a time.

    # hastart
    				
  12. Verify that VCS brings all resources online. On one node, enter the following command:
    # hagrp -display
    				

    The database, RVG, and CVM groups are online on both nodes of the primary site. The RVGLogOwner and ClusterService groups are online on one node of the cluster. If either the RVG group or the RVGLogOwner group is partially online, manually bring the groups online using the hagrp -online command. This information applies to the secondary site, except for the database group which must be offline.

  13. Verify the service groups and their resources that are brought online. On one node, enter the following command:
    # hagrp -display

    The database service group is offline on the secondary site, but the ClusterService, CVM, RVG log owner, and RVG groups are online.

    This completes the setup for a global cluster using VVR for replication. Veritas recommends testing a global cluster before putting it into production.

Example of the Oracle RAC database group on the secondary site:

group database_grp ( 
    SystemList = { sys3 = 0, sys3 = 1 }
	   ClusterList = { clus2 = 0, clus1 = 1 } 
    Parallel = 1 
    OnlineRetryInterval = 300 
    ClusterFailOverPolicy = Manual 
    Authority = 1 
    AutoStartList = { sys3, sys4 } 
    )

				RVGSharedPri dbdata_vvr_shpri (
								RvgResourceName = rdbdata_rvg
								OnlineRetryLimit = 0
								)
 
    CFSMount dbdata_mnt (  
        MountPoint = "/dbdata"  
        BlockDevice = "/dev/vx/dsk/dbdatadg/dbdata_vol"  
        Critical = 0 
								)
			RVGSharedPri dbdata_vvr_shpri (  
        RvgResourceName = dbdata_rvg  
        OnlineRetryLimit = 0  
        ) 

Oracle rac_db (  
     Sid @sys3 = vrts1  
     Sid @sys4 = vrts2  
     Owner = Oracle  
     Home = "/oracle/orahome"  
     Pfile @sys3 = "/oracle/orahome/dbs/initvrts1.ora"  
     Pfile @sys4 = "/oracle/orahome/dbs/initvrts2.ora"  
     StartUpOpt = SRVCTLSTART  
     ShutDownOpt = SRVCTLSTOP  
     )

requires group RVGgroup online local firm
dbdata_mnt requires dbdata_vvr_shpri
rac_db requires dbdata_mnt

RVGSharedPri dbdata_vvr_shpri (  
        RvgResourceName = dbdata_rvg  
        OnlineRetryLimit = 0  
        ) 

requires group RVGgroup online local firm
dbdata_mnt requires dbdata_vvr_shpri

Example of the Sybase ASE CE database group on the secondary site:

.
group sybase ( 
    SystemList = { sys3 = 0, sys4 = 1 }
	   ClusterList = { clus2 = 0, clus1 = 1 } 
    Parallel = 1 
    OnlineRetryInterval = 300 
    ClusterFailOverPolicy = Manual 
    Authority = 1 
  # AutoStart = 0 here so faulting will not happen
    AutoStartList = { sys3, sys4 } 
    )
 
    CFSMount dbdata_mnt (  
        MountPoint = "/dbdata"  
        BlockDevice = "/dev/vx/dsk/dbdatadg/dbdata_vol"  
        )
 
RVGSharedPri dbdata_vvr_shpri (  
        RvgResourceName = dbdata_rvg  
        OnlineRetryLimit = 0  
        )

									CFSMount quorum_101_quorumvol_mnt (
													MountPoint = "/quorum"
													BlockDevice = "/dev/vx/dsk/quorum_101/quorumvol"
													)

								 CVMVolDg quorum_101_voldg (
													CVMDiskGroup = quorum_101
													CVMVolume = { quorumvol }
													CVMActivation = sw
													)

Sybase ase (  
     Sid @sys3 = ase1  
     Sid @sys4 = ase2  
     Owner = sybase  
     Home = "/sybase" 
			 	Version = 15
     SA = sa
	    Quorum_dev = "/quorum/q.dat"  
     )

requires group RVGgroup online local firm
dbdata_mnt requires dbdata_vvr_shpri
ase requires vxfend
ase requires dbdata_mnt
ase requires quorum_101_quorumvol_mnt
quorum_101_quorumvol_mnt requires quorum_101_voldg