Veritas NetBackup™ for Hadoop Administrator's Guide
Best practices for deploying the Hadoop plug-in
Consider the following when you deploy the Hadoop plug-in and configure NetBackup for Hadoop:
Use consistent conventions for the hostnames of backup hosts, media servers, and the master server. For example, if you use the hostname hadoop.veritas.com (FQDN format), use that same form everywhere.
Add the entries of all the NameNodes and DataNodes to the /etc/hosts file on all the backup hosts. You must add the hostnames in FQDN format.
Or
Add the appropriate DNS entries in the /etc/resolv.conf file.
Always specify the NameNode and DataNodes in FQDN format.
Ping all the nodes (use FQDN) from the backup hosts.
The hostname and port of the NameNode must be the same as those specified with the http address parameter in the core-site.xml file of the Hadoop cluster.
Ensure the following for a Hadoop cluster that is enabled with SSL (HTTPS):
A valid certificate exists on the backup host that contains the public keys from all the nodes of the Hadoop cluster.
For a Hadoop cluster that uses CRL, ensure that the CRL is valid and not expired.
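The hostname checks above can be partly automated on a backup host. The following is a minimal sketch, not part of NetBackup: the function names looks_like_fqdn and check_nodes are hypothetical, and the FQDN test is only a rough syntactic check (at least two dot-separated labels). It confirms that each name resolves through /etc/hosts or DNS. It does not replace the ping test.

```python
import socket


def looks_like_fqdn(name: str) -> bool:
    """Rough syntactic check: at least two non-empty, dot-separated labels."""
    labels = name.rstrip(".").split(".")
    return len(labels) >= 2 and all(labels)


def check_nodes(nodes):
    """Map each node name to a short status string.

    Names that do not look like FQDNs are reported without a lookup.
    All other names are resolved with the system resolver, which
    consults /etc/hosts and DNS according to the host configuration.
    """
    results = {}
    for node in nodes:
        if not looks_like_fqdn(node):
            results[node] = "not an FQDN"
            continue
        try:
            socket.getaddrinfo(node, None)
            results[node] = "resolves"
        except socket.gaierror:
            results[node] = "does not resolve"
    return results
```

To use the sketch, call check_nodes with the list of NameNode and DataNode names for the cluster and review any entry whose status is not "resolves".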