NetBackup™ for Hadoop Administrator's Guide
- Introduction
- Prerequisites and best practices for the Hadoop plug-in for NetBackup
- Configuring NetBackup for Hadoop
- Managing backup hosts
- Configuring the Hadoop plug-in using the Hadoop configuration file
- Configuring communication between NetBackup and Hadoop clusters that are SSL-enabled (HTTPS)
- Performing backups and restores of Hadoop
- Troubleshooting
- Troubleshooting backup issues for Hadoop data
- Troubleshooting restore issues for Hadoop data
Best practices for deploying the Hadoop plug-in
Consider the following when you deploy Hadoop plug-in and configure NetBackup for Hadoop:
Use consistent conventions for hostnames of backup hosts, media servers, and primary server. For example, if you are using the host name as hadoop.veritas.com (FQDN format) use the same everywhere.
Add the entries of all the NameNodes and DataNodes to the
/etc/hosts
file on all the backup hosts. You must add the hostname in FQDN format.Or
Add the appropriate DNS entries in the
/etc/resolv.conf
file.Always specify the NameNode and DataNodes in FQDN format.
Ping all the nodes (use FQDN) from the backup hosts.
Hostname and port of the NameNode must be the same as you have specified with the http address parameter in the
core-site.xml
of the Hadoop cluster.Canceling a parent job in a compound restore job does not cancel the child restore job. You must manually cancel the child restore jobs.
Ensure the following for a Hadoop cluster that is enabled with SSL (HTTPS):
A valid certificate exists on the backup host that contains the public keys from all the nodes of the Hadoop cluster.
For a Hadoop cluster that uses CRL, ensure that the CRL is valid and not expired.