NetBackup™ for Hadoop Administrator's Guide
- Introduction
- Prerequisites and best practices for the Hadoop plug-in for NetBackup
- Configuring NetBackup for Hadoop
- Managing backup hosts
- Configuring the Hadoop plug-in using the Hadoop configuration file
- Configuring communication between NetBackup and Hadoop clusters that are SSL-enabled (HTTPS)
- Performing backups and restores of Hadoop
- Troubleshooting
- Troubleshooting backup issues for Hadoop data
- Troubleshooting restore issues for Hadoop data
Hadoop.conf configuration for parallel restore
TBD
"application_servers": { "punnbuucsm5b29-v14.vxindia.veritas.com": { "port": 9000, "distro_algo": 4, "num_streams": 2, "golden_ratio": 80, "additionalBackupHosts": ["bh1.vxindia.veritas.com", "bh2.vxindia.veritas.com"] } }, "number_of_threads": 10 } ------------
num_stream: To enhance the restore performance, you can configure the number of streams that each backup host can allow. Default value is 1.
additionalBackupHosts: To enhance the restore performance, you can configure additional backup host details. You can specify the hostname of additional backup hosts.
Notes:
You must keep
additionalBackupHosts
empty, if no additional backup hosts are available.The
hadoop.conf
configuration must be same on all the backup hosts.The
num_stream
configuration must be same for backup and restore process.Hadoop setups and NetBackup setups must be in the same timezone.
If you increase streams, adjust the maximum number of jobs per client, update the
stu
setting for multiple threads, and update the client timeout to avoid abrupt failures.