NetBackup™ Deduplication Guide
Tuning the MSDP configuration from the deduplication shell
The default MSDP configuration should work for most installations. However, if you need to make adjustments, use the following commands to set or view the parameters.
Parameter | Description | Commands |
---|---|---|
AllocationUnitSize | The allocation unit size for the data on the server | To set the parameter: setting set-MSDP-param allocation-unit-size value=<number of MiB> To view the parameter: setting get-MSDP-param allocation-unit-size |
DataCheckDays | The number of days to check the data for consistency | To set the parameter: setting set-MSDP-param data-check-days value=<number of days> To view the parameter: setting get-MSDP-param data-check-days |
LogRetention | The length of time to keep logs | To set the parameter: setting set-MSDP-param log-retention value=<number of days> To view the parameter: setting get-MSDP-param log-retention |
MaxCacheSize | The maximum size of the NetBackup Deduplication Engine (spoold) fingerprint cache | To set the parameter: setting set-MSDP-param max-cache-size value=<number of GB> To view the parameter: setting get-MSDP-param max-cache-size |
MaxRetryCount | The maximum number of times to retry a failed transmission | To set the parameter: setting set-MSDP-param max-retry-count value=<number of retry times> To view the parameter: setting get-MSDP-param max-retry-count |
SpadLogging | The log level for the NetBackup Deduplication Manager (spad) | To set the parameter: setting set-MSDP-param spad-logging log_level=<value> See Setting the MSDP log level from the deduplication shell. To view the parameter: setting get-MSDP-param spad-logging |
SpooldLogging | The log level for the NetBackup Deduplication Engine (spoold) | To set the parameter: setting set-MSDP-param spoold-logging log_level=<value> See Setting the MSDP log level from the deduplication shell. To view the parameter: setting get-MSDP-param spoold-logging |
WriteThreadNum | The number of threads for writing data to the data container in parallel | To set the parameter: setting set-MSDP-param write-thread-num value=<number of threads> To view the parameter: setting get-MSDP-param write-thread-num |
CloudDataCacheSize | The default data cache size when the cloud LSU is added. Decrease this value if sufficient free space is not available. | To set the parameter: setting set-MSDP-param cloud-data-cache-size value=<number> To view the parameter: setting get-MSDP-param cloud-data-cache-size |
CloudMapCacheSize | The default map cache size when the cloud LSU is added. Decrease this value if sufficient free space is not available. | To set the parameter: setting set-MSDP-param cloud-map-cache-size value=<number> To view the parameter: setting get-MSDP-param cloud-map-cache-size |
CloudMetaCacheSize | The default meta cache size when the cloud LSU is added. Decrease this value if sufficient free space is not available. | To set the parameter: setting set-MSDP-param cloud-meta-cache-size value=<number> To view the parameter: setting get-MSDP-param cloud-meta-cache-size |
CloudUploadCacheSize | The default upload cache size when the cloud LSU is added. The minimum value is 12 GiB. | To set the parameter: setting set-MSDP-param cloud-upload-cache-size value=<number> To view the parameter: setting get-MSDP-param cloud-upload-cache-size |
EnableLocalPredictiveSamplingCache | Enables or disables the local predictive sampling cache. Both spoold and spad have this parameter, and the two settings should be kept in sync. | To set the parameter: setting set-MSDP-param enable-local-predictive-sampling-cache value=<true/false> To view the parameter: setting get-MSDP-param enable-local-predictive-sampling-cache |
MaxPredictiveCacheSize | The maximum size of the spoold predictive cache. | To set the parameter: setting set-MSDP-param max-predictive-cache-size value=<number of bytes/%> To view the parameter: setting get-MSDP-param max-predictive-cache-size |
MaxSamplingCacheSize | The maximum size of the spoold sampling cache. | To set the parameter: setting set-MSDP-param max-sampling-cache-size value=<number of bytes/%> To view the parameter: setting get-MSDP-param max-sampling-cache-size |
UsableMemoryLimit | The maximum usable memory size in spoold. | To set the parameter: setting set-MSDP-param usable-memory-limit value=<number of bytes/%> To view the parameter: setting get-MSDP-param usable-memory-limit |
MaxCacheSize(Cluster) | The maximum size of the spoold fingerprint cache for all nodes in a cluster. | To set the parameter: setting set-MSDP-param max-cache-size-cluster value=<number> To view the parameter: setting get-MSDP-param max-cache-size-cluster |
MaxPredictiveCacheSize(Cluster) | The maximum size of the spoold predictive cache for all nodes in a cluster. | To set the parameter: setting set-MSDP-param max-predictive-cache-size-cluster value=<number of bytes> To view the parameter: setting get-MSDP-param max-predictive-cache-size-cluster |
MaxSamplingCacheSize(Cluster) | The maximum size of the spoold sampling cache for all nodes in a cluster. | To set the parameter: setting set-MSDP-param max-sampling-cache-size-cluster value=<number of bytes> To view the parameter: setting get-MSDP-param max-sampling-cache-size-cluster |
UsableMemoryLimit(Cluster) | The maximum usable memory size in spoold for all nodes in a cluster. | To set the parameter: setting set-MSDP-param usable-memory-limit-cluster value=<number> To view the parameter: setting get-MSDP-param usable-memory-limit-cluster |
EnableLocalPredictiveSamplingCache(Cluster) | Enables or disables the local predictive sampling cache for all nodes in the cluster. Both spoold and spad have this parameter, and the two settings should be kept in sync. | To set the parameter: setting set-MSDP-param enable-local-predictive-sampling-cache-cluster value=<true/false> To view the parameter: setting get-MSDP-param enable-local-predictive-sampling-cache-cluster |
VpfsCloudFPIndexRemovalThreshold(Cluster) | The threshold to remove the cloud fingerprint index file for all nodes in the cluster. When the number of deleted data containers in a fingerprint index file exceeds the threshold, the fingerprint index file is removed. | To set the parameter: setting set-MSDP-param vpfs-cloud-fpindex-removal-threshold value=<%> To view the parameter: setting get-MSDP-param vpfs-cloud-fpindex-removal-threshold |
VpfsPCacheReloadThreshold(Cluster) | The threshold at which spoold reloads fingerprints from the fingerprint index file, based on the fingerprints in the pcache that have been replaced. This applies to all nodes in the cluster. | To set the parameter: setting set-MSDP-param vpfs-pcache-reload-threshold-cluster value=<%> To view the parameter: setting get-MSDP-param vpfs-pcache-reload-threshold-cluster |
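
The commands in the table follow a regular pattern: a view command takes only the parameter name, and a set command takes the parameter name plus value=<...>, except for the two logging parameters, which take log_level=<value>. The following Python sketch shows that pattern by building the command strings. It is only an illustration: the helper names set_command and view_command and the sample values are hypothetical, and the script only formats text. It does not connect to the deduplication shell or change any setting.

```python
# Hypothetical helper: formats the command strings listed in the table above.
# It does not contact an MSDP server; it only builds text to paste into the
# deduplication shell.

# Parameters whose set command takes log_level=<value> instead of value=<...>
# (see the SpadLogging and SpooldLogging rows of the table).
LOG_LEVEL_PARAMS = {"spad-logging", "spoold-logging"}


def view_command(param: str) -> str:
    """Return the command that displays the current value of a parameter."""
    return f"setting get-MSDP-param {param}"


def set_command(param: str, value) -> str:
    """Return the command that sets a parameter to the given value."""
    key = "log_level" if param in LOG_LEVEL_PARAMS else "value"
    return f"setting set-MSDP-param {param} {key}={value}"


if __name__ == "__main__":
    # Illustrative values only; choose values that suit your environment.
    print(set_command("log-retention", 30))
    print(view_command("log-retention"))
    print(set_command("enable-local-predictive-sampling-cache", "true"))
```

Viewing a parameter before and after you set it is a simple way to confirm that the change took effect.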