NetBackup™ for Hadoop Administrator's Guide

Last Published:
Product(s): NetBackup & Alta Data Protection (10.3)
  1. Introduction
    1.  
      Protecting Hadoop data using NetBackup
    2.  
      Backing up Hadoop data
    3.  
      Restoring Hadoop data
    4.  
      NetBackup for Hadoop terms
    5.  
      Limitations
  2. Prerequisites and best practices for the Hadoop plug-in for NetBackup
    1.  
      About deploying the Active Directory plug-in
    2. Prerequisites for the Hadoop plug-in
      1.  
        Operating system and platform compatibility
      2.  
        License for Hadoop plug-in for NetBackup
    3.  
      Preparing the Hadoop cluster
    4.  
      Best practices for deploying the Hadoop plug-in
  3. Configuring NetBackup for Hadoop
    1.  
      About configuring NetBackup for Hadoop
    2. Managing backup hosts
      1.  
        Including a NetBackup client on NetBackup primary server allowed list
      2.  
        Configure a NetBackup Appliance as a backup host
    3.  
      Adding Hadoop credentials in NetBackup
    4. Configuring the Hadoop plug-in using the Hadoop configuration file
      1.  
        Configuring NetBackup for a highly-available Hadoop cluster
      2.  
        Configuring a custom port for the Hadoop cluster
      3.  
        Configuring number of threads for backup hosts
      4.  
        Configuring number of streams for backup hosts
      5.  
        Configuring distribution algorithm and golden ratio for backup hosts
      6. Configuring communication between NetBackup and Hadoop clusters that are SSL-enabled (HTTPS)
        1.  
          ECA_TRUST_STORE_PATH for NetBackup servers and clients
        2.  
          ECA_CRL_PATH for NetBackup servers and clients
        3.  
          HADOOP_SECURE_CONNECT_ENABLED for servers and clients
        4.  
          HADOOP_CRL_CHECK for NetBackup servers and clients
        5.  
          Example values for the parameters in the bp.conf file
    5.  
      Configuration for a Hadoop cluster that uses Kerberos
    6.  
      Hadoop.conf configuration for parallel restore
    7.  
      Create a BigData policy for Hadoop clusters
    8.  
      Disaster recovery of a Hadoop cluster
  4. Performing backups and restores of Hadoop
    1. About backing up a Hadoop cluster
      1.  
        Prerequisites for running backup and restore operations for a Hadoop cluster with Kerberos authentication
      2.  
        Best practices for backing up a Hadoop cluster
      3.  
        Backing up a Hadoop cluster
    2. About restoring a Hadoop cluster
      1.  
        Best practices for restoring a Hadoop cluster
      2. Restoring Hadoop data on the same Hadoop cluster
        1.  
          Restore Hadoop data on the same Hadoop cluster
      3.  
        Restoring Hadoop data on an alternate Hadoop cluster
    3.  
      Best practice for improving performance during backup and restore
  5. Troubleshooting
    1.  
      About troubleshooting NetBackup for Hadoop issues
    2.  
      About NetBackup for Hadoop debug logging
    3. Troubleshooting backup issues for Hadoop data
      1.  
        Backup operation fails with error 6609
      2.  
        Backup operation failed with error 6618
      3.  
        Backup operation fails with error 6647
      4.  
        Extended attributes (xattrs) and Access Control Lists (ACLs) are not backed up or restored for Hadoop
      5.  
        Backup operation fails with error 6654
      6.  
        Backup operation fails with bpbrm error 8857
      7.  
        Backup operation fails with error 6617
      8.  
        Backup operation fails with error 6616
      9.  
        Backup operation fails with error 84
      10.  
        NetBackup configuration and certificate files do not persist after the container-based NetBackup appliance restarts
      11.  
        Unable to see incremental backup images during restore even though the images are seen in the backup image selection
      12.  
        One of the child backup jobs goes in a queued state
    4. Troubleshooting restore issues for Hadoop data
      1.  
        Restore fails with error code 2850
      2.  
        NetBackup restore job for Hadoop completes partially
      3.  
        Extended attributes (xattrs) and Access Control Lists (ACLs) are not backed up or restored for Hadoop
      4.  
        Restore operation fails when Hadoop plug-in files are missing on the backup host
      5.  
        Restore fails with bpbrm error 54932
      6.  
        Restore operation fails with bpbrm error 21296
      7.  
        Hadoop with Kerberos restore job fails with error 2850
      8.  
        Configuration file is not recovered after a disaster recovery
  6.  
    Index

ECA_TRUST_STORE_PATH for NetBackup servers and clients

The ECA_TRUST_STORE_PATH option specifies the file path to the certificate bundle file that contains all trusted root CA certificates.

This certificate file should have one or more certificates in PEM format.

Do not specify the ECA_TRUST_STORE_PATH option if you use the Windows certificate store.

The trust store supports certificates in the following formats:

  • PKCS #7 or P7B file having certificates of the trusted root certificate authorities that are bundled together. This file may either be PEM or DER encoded.

  • A file containing the PEM encoded certificates of the trusted root certificate authorities that are concatenated together.

This option is mandatory for file-based certificates.

The root CA certificate in Cloudera distribution can be obtained from the Cloudera administrator. It may have a manual TLS configuration or an Auto-TLS enabled for the Hadoop cluster. For both cases, NetBackup needs a root CA certificate from the administrator.

The root CA certificate from the Hadoop cluster can validate the certificates for all nodes and allow NetBackup to run the backup and restore process in case of the secure (SSL) cluster. This root CA certificate is a bundle of certificates that has been issued to all such nodes.

Certificate from root CA must be configured under ECA_TRUST_STORE_PATH in case of self-signed, third party CA or Local/Intermediate CA environments. For example: In case of AUTO-TLS enabled Cloudera environments, you can typically find the root CA file named with cm-auto-global_cacerts.pem at path /var/lib/cloudera-scm-agent/agent-cert. For more details, refer Cloudera documentation.

Table: ECA_TRUST_STORE_PATH information

Usage

Description

Where to use

On NetBackup servers or clients.

If certificate validation is required for VMware, Red Hat Virtualization servers, or Nutanix AHV, this option must be set on the NetBackup primary server and respective access hosts, irrespective of the certificate authority that NetBackup uses for host communication (NetBackup CA or external CA).

How to use

Use the nbgetconfig and the nbsetconfig commands to view, add, or change the option.

For information about these commands, see the NetBackup Commands Reference Guide.

Use the following format:

ECA_TRUST_STORE_PATH = Path to the external CA certificate

For example: c:\rootCA.pem

If you use this option on a Flex Appliance application instance, the path must be /mnt/nbdata/hostcert/.

Equivalent UI property

No equivalent exists.