NetBackup™ for Cassandra Administrator's Guide

Last Published:
Product(s): NetBackup & Alta Data Protection (10.0)

NetBackup for Cassandra terminologies

The following table defines the terms you come across using NetBackup to protect Cassandra cluster.

Table: NetBackup terminologies

Terminology

Definition

Cassandra Backup Recovery component

The NetBackup thin client which gets deployed on data staging servers and Cassandra cluster to aid in backup and restore operations.

Data staging servers

NetBackup requires a set of servers for backup of Cassandra cluster in addition to the NetBackup primary, and backup hosts. These servers are typically 5% of the total number of servers in the Cassandra cluster. These servers are used to deduplicate the data from Cassandra cluster during backup and optimize the backup process. They are also used as staging-server for the data to be backed up and restored.

Parallel streams

The NetBackup parallel streaming framework allows data blocks from multiple nodes to be backed up using multiple backup hosts simultaneously.

Backup host

The backup host acts as a proxy client. All the backup and the restore operations are executed through the backup host.

You can configure media servers, clients, or a primary server as a backup host.

The backup host is also used as destination client during restores.

BigData policy

The BigData policy is introduced to:

  • Specify the application type.

  • Allow backing up distributed multi-node environments.

  • Associate backup hosts.

  • Perform workload distribution.

Application Cluster

  • Application cluster is the Cassandra production cluster name.

  • Cluster name must be a single word with no white spaces in between words and must be the actual cluster name used in the Cassandra.yaml file on the production nodes.