Veritas NetBackup™ for Hadoop Administrator's Guide

Last Published:
Product(s): NetBackup (8.3.0.1)
  1. Introduction
    1.  
      Protecting Hadoop data using NetBackup
    2.  
      Backing up Hadoop data
    3.  
      Restoring Hadoop data
    4.  
      NetBackup for Hadoop terminologies
    5.  
      Limitations
  2. Verifying the pre-requisites and best practices for the Hadoop plug-in for NetBackup
    1.  
      About deploying the Hadoop plug-in
    2. Pre-requisites for the Hadoop plug-in
      1.  
        Operating system and platform compatibility
      2.  
        NetBackup server and client requirements
      3.  
        License for Hadoop plug-in for NetBackup
    3.  
      Preparing the Hadoop cluster
    4.  
      Best practices for deploying the Hadoop plug-in
  3. Configuring NetBackup for Hadoop
    1.  
      About configuring NetBackup for Hadoop
    2. Managing backup hosts
      1.  
        Whitelisting a NetBackup client on NetBackup master server
      2.  
        Configure a NetBackup Appliance as a backup host
    3.  
      Adding Hadoop credentials in NetBackup
    4. Configuring the Hadoop plug-in using the Hadoop configuration file
      1.  
        Configuring NetBackup for a highly-available Hadoop cluster
      2.  
        Configuring a custom port for the Hadoop cluster
      3.  
        Configuring number of threads for backup hosts
      4. Configuring communication between NetBackup and Hadoop clusters that are SSL-enabled (HTTPS)
        1.  
          ECA_TRUST_STORE_PATH for NetBackup servers and clients
        2.  
          ECA_CRL_PATH for NetBackup servers and clients
        3.  
          HADOOP_SECURE_CONNECT_ENABLED for servers and clients
        4.  
          HADOOP_CRL_CHECK for NetBackup servers and clients
        5.  
          Example values for the parameters in the bp.conf file
    5.  
      Configuration for a Hadoop cluster that uses Kerberos
    6. Configuring NetBackup policies for Hadoop plug-in
      1. Creating a BigData backup policy
        1. Creating BigData policy using the NetBackup Administration Console
          1.  
            Using the Policy Configuration Wizard to create a BigData policy for Hadoop clusters
          2.  
            Using the NetBackup Policies utility to create a BigData policy for Hadoop clusters
        2.  
          Using NetBackup Command Line Interface (CLI) to create a BigData policy for Hadoop clusters
    7.  
      Disaster recovery of a Hadoop cluster
  4. Performing backups and restores of Hadoop
    1. About backing up a Hadoop cluster
      1.  
        Pre-requisite for running backup and restore operations for a Hadoop cluster with Kerberos authentication
      2.  
        Best practices for backing up a Hadoop cluster
      3.  
        Backing up a Hadoop cluster
    2. About restoring a Hadoop cluster
      1.  
        Best practices for restoring a Hadoop cluster
      2. Restoring Hadoop data on the same Hadoop cluster
        1.  
          Using the Restore Wizard to restore Hadoop data on the same Hadoop cluster
        2.  
          Using the bprestore command to restore Hadoop data on the same Hadoop cluster
      3.  
        Restoring Hadoop data on an alternate Hadoop cluster
  5. Troubleshooting
    1.  
      About troubleshooting NetBackup for Hadoop issues
    2.  
      About NetBackup for Hadoop debug logging
    3. Troubleshooting backup issues for Hadoop data
      1.  
        Backup operation fails with error 6609
      2.  
        Backup operation failed with error 6618
      3.  
        Backup operation fails with error 6647
      4.  
        Extended attributes (xattrs) and Access Control Lists (ACLs) are not backed up or restored for Hadoop
      5.  
        Backup operation fails with error 6654
      6.  
        Backup operation fails with bpbrm error 8857
      7.  
        Backup operation fails with error 6617
      8.  
        Backup operation fails with error 6616
      9.  
        NetBackup configuration and certificate files do not persist after the container-based NetBackup appliance restarts
      10.  
        Unable to see incremental backup images during restore even though the images are seen in the backup image selection
      11.  
        One of the child backup jobs goes in a queued state
    4. Troubleshooting restore issues for Hadoop data
      1.  
        Restore fails with error code 2850
      2.  
        NetBackup restore job for Hadoop completes partially
      3.  
        Extended attributes (xattrs) and Access Control Lists (ACLs) are not backed up or restored for Hadoop
      4.  
        Restore operation fails when Hadoop plug-in files are missing on the backup host
      5.  
        Restore fails with bpbrm error 54932
      6.  
        Restore operation fails with bpbrm error 21296
      7.  
        Configuration file is not recovered after a disaster recovery
  6.  
    Index

Using the NetBackup Policies utility to create a BigData policy for Hadoop clusters

Use the following procedure to create a BigData policy with the NetBackup Policies utility.

To create a BigData policy with the NetBackup Policies utility

  1. In the NetBackup Administration Console, in the left pane, expand NetBackup Management > Policies.
  2. On the Actions menu, click New > Policy.
  3. Type a unique name for the new policy in the Add a New Policy dialog box.

    Click OK.

  4. On the Attributes tab, select BigData as the policy type.
  5. On the Attributes tab, select the storage unit for BigData policy type.
  6. On the Schedules tab, click New to create a new schedule.

    You can create a schedule for a Full Backup, Differential Incremental Backup, or Cumulative Incremental Backup for your BigData policy. Once you set the schedule, Hadoop data is backed up automatically as per the set schedule without any further user intervention.

  7. On the Clients tab, enter the IP address or the host name of the NameNode.
  8. On the Backup Selections tab, enter the following parameters and their values as shown:
    • Application_Type=hadoop

      The parameter values are case-sensitive.

    • Backup_Host=IP_address or hostname

      The backup host must be a Linux computer. The backup host can be a NetBackup client or a media server.

      You can specify multiple backup hosts.

    • File path or the directory to back up

      You can specify multiple file paths.

      Note:

      The directory or folder specified for backup selection while defining BigData Policy with Application_Type=hadoop must not contain space or comma in their names.

  9. Click OK to save the changes.

For more information on using NetBackup for big data applications, refer to the Veritas NetBackup documentation page.