Storage Foundation and High Availability Solutions 8.0.2 HA and DR Solutions Guide for Microsoft SQL Server - Windows

Product(s): InfoScale & Storage Foundation (8.0.2)
Platform: Windows

Notes and recommendations

Note the following prerequisites before configuring application monitoring:

  • Verify that the boot sequence of the virtual machine is such that the boot disk (OS hard disk) is placed before the removable disks.

    If the sequence places the removable disks before the boot disk, the virtual machine may not reboot after an application failover. The reboot may halt with an "OS not found" error.

    This issue occurs because during the application failover the removable disks are detached from the current virtual machine and are attached on the failover target system.

  • Verify that VMware Tools is installed on the virtual machine.

    Install a version that is the same as or later than the one available with VMware ESX 4.1.

  • Verify that all the systems on which you want to configure application monitoring belong to the same domain.

  • Verify that the ESX/ESXi host user account has administrative privileges or is a root user.

    If the ESX/ESXi user account does not have administrative privileges and is not a root user, the disk detach and attach operations may fail in the event of a failure.

    If you do not want to use the administrator user account or the root user, you must create a role, add the required privileges to that role, and then add the ESX user to the role.

    See Assigning privileges for non-administrator ESX/ESXi user account.

  • Verify that the SQL Server instances that you want to monitor are installed on a non-shared local disk that can be deported from the system and imported to another system.

  • If you have configured a firewall, ensure that your firewall settings allow access to ports used by Veritas High Availability installer, wizard, and services.

  • You must run the Veritas High Availability Configuration wizard from the system to which the disk residing on the shared datastore is attached (first system on which you installed SQL Server).

  • After configuring SQL Server databases for monitoring, if you create another database or service, then these new components are not monitored as part of the existing configuration.

    In this case, you can either use the VCS commands to add the components to the configuration or unconfigure the existing configuration and then run the wizard again to configure the required components.

  • If you configure the VMwareDisks agent resource manually, do not add the operating system disk to the configuration. The VMwareDisks agent does not block this operation, and adding the operating system disk might lead to a system crash during failover.

  • If VMware vMotion is triggered at the same time as an application fails over, the VMwareDisks resource may either fail to go offline or report an unknown status. The resource eventually fails over and reports online after the vMotion succeeds and the application is online on the target system.

  • VMware snapshot operations may fail if the VMwareDisks agent is configured for a physical RDM type of disk. Currently, only virtual RDM disks are supported.

  • Non-shared disks partitioned using GUID Partition Table (GPT) are not supported. Currently, only Master Boot Record (MBR) partitions are supported.

  • The VMwareDisks agent does not support disks attached to the virtual machine using IDE controllers. The agent resource reports an unknown state if IDE disks are configured.

  • If VMware HA is disabled and the ESX host itself faults, VCS moves the application to the target failover system on another ESX host. The VMwareDisks agent registers the faulted system on the new ESX host. When you try to power on the faulted system, you may see the following message in the vSphere Client:

    This virtual machine might have been moved or copied. 
    In order to configure certain management and networking features, 
    VMware ESX needs to know if this virtual machine was moved or copied. 
    If you don't know, answer "I copied it".

    You must select "I moved it" (instead of the default "I copied it") on this message prompt.

  • You must not restore a snapshot on a virtual machine where an application is currently online, if the snapshot was taken when the application was offline on that virtual machine. Doing this may cause an unwanted failover.

    The reverse also applies: do not restore a snapshot that was taken when the application was online on a virtual machine where the application is currently offline. Doing so may lead to a misconfiguration in which the application is online on multiple systems simultaneously.

  • If you want to suspend a system on which an application is currently online, then you must first switch the application to a failover target system.

    If you suspend the system without switching the application, then VCS moves the disks along with the application to another system.

    Later, when you try to restore the suspended system, VMware does not allow the operation because the disks that were attached before the system was suspended are no longer with the system.

  • While creating a VCS cluster in a virtual environment, you must configure one of the cluster communication links over a public adapter in addition to the link configured over a private adapter. To reduce VCS cluster communication over the link that uses the public adapter, you may assign it low priority. This keeps the VCS cluster communication intact even if the private network adapters fail. If the cluster communication is configured over the private adapters only, the cluster systems may fail to communicate with each other in case of a network failure. In this scenario, each system considers that the other system has faulted and then tries to gain access to the disks, thereby leading to an application fault.

  • VMware Fault Tolerance does not support adding or removing non-shared disks between virtual machines. During a failover, disks that contain application data cannot be moved to alternate failover systems. Applications that are being monitored therefore cannot be brought online on the failover systems.

  • For cluster communication, you must not select the teamed network adapter or the independently listed adapters that are a part of the teamed NIC.

    A teamed network adapter is a logical NIC, formed by grouping several physical NICs together. All NICs in a team have an identical MAC address. As a result, you may experience the following issues:

    • The application monitoring configuration wizard may fail to discover the specified network adapters

    • The application monitoring configuration wizard may fail to discover/validate the specified system name
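
Several of the notes above are simple yes/no conditions on the disks and the boot order, so they can be checked before a VMwareDisks resource is configured manually. The following is a minimal sketch of such a pre-flight check, not part of the product. It does not call any VMware or VCS API: the `DiskInfo` type, its fields, and both function names are hypothetical, and the disk details are assumed to have been collected by other means.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DiskInfo:
    """Hypothetical description of one virtual disk (not a VMware or VCS type)."""
    label: str
    is_os_disk: bool
    controller: str       # for example "scsi" or "ide"
    partition_style: str  # "mbr" or "gpt"
    rdm_mode: str         # "none", "virtual", or "physical"


def check_disk(disk: DiskInfo) -> List[str]:
    """Return the problems found for one disk, based on the notes above."""
    problems = []
    if disk.is_os_disk:
        problems.append("operating system disk must not be added to the configuration")
    if disk.controller.lower() == "ide":
        problems.append("disks attached through IDE controllers are not supported")
    if disk.partition_style.lower() == "gpt":
        problems.append("GPT-partitioned non-shared disks are not supported; use MBR")
    if disk.rdm_mode.lower() == "physical":
        problems.append("physical RDM disks may cause VMware snapshot operations to fail")
    return problems


def boot_disk_precedes(boot_order: List[str], boot_disk: str,
                       removable: List[str]) -> bool:
    """True if the boot disk comes before every removable disk in the boot order."""
    if boot_disk not in boot_order:
        return False
    boot_pos = boot_order.index(boot_disk)
    return all(boot_order.index(d) > boot_pos
               for d in removable if d in boot_order)


if __name__ == "__main__":
    # Illustrative values only; the labels are made up for this example.
    candidates = [
        DiskInfo("data1", False, "scsi", "mbr", "virtual"),
        DiskInfo("data2", False, "ide", "gpt", "none"),
    ]
    for disk in candidates:
        for problem in check_disk(disk):
            print(f"{disk.label}: {problem}")
    print("boot order ok:",
          boot_disk_precedes(["os", "data1", "data2"], "os", ["data1", "data2"]))
```

A check like this only mirrors the documented restrictions; it does not replace the validation performed by the Veritas High Availability Configuration wizard.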