Search <book_title>...

Important Update: Cohesity Products Documentation

All Cohesity product documentation are now managed via the Cohesity Docs Portal: https://docs.cohesity.com/HomePage/Content/home.htm. Some documentation available here may not reflect the latest information or may no longer be accessible.

Cluster Server 7.4.1 Administrator's Guide - Linux

Last Published: 2019-10-17

Product(s): InfoScale & Storage Foundation (7.4.1)

Platform: Linux

Section I. Clustering concepts and terminology
Section II. Administration - Putting VCS to work
Section III. VCS communication and operations
Section IV. Administration - Beyond the basics
Section V. Veritas High Availability Configuration wizard
1. Introducing the Veritas High Availability Configuration wizard
2. Administering application monitoring from the Veritas High Availability view
  1. Administering application monitoring from the Veritas High Availability view
  2. Administering application monitoring settings
Section VI. Cluster configurations for disaster recovery
Section VII. Troubleshooting and performance
1. VCS performance considerations
2. Troubleshooting and recovery for VCS
Section VIII. Appendixes

Stale key detection to avoid a false preexisting split brain condition

When a cluster starts, all the cluster nodes register their unique keys on the coordination point servers (CP servers). The key is unique to the cluster and the node, and is based on the LLT cluster ID and the LLT node ID. When the network connectivity between the cluster nodes and the CP servers is lost, as the nodes go offline, some stale keys may be left on one or more CP servers. The presence of a key on a CP server indicates that the corresponding node is alive. Thus, after the network connectivity is restored, the presence of a stale key of a node that freshly comes online creates a false preexisting split brain in the cluster. When a false preexisting split brain is detected, the node fails to start, and manual intervention is required to bring the cluster online.

Cluster Server provides a reliable method to identify a false preexisting split brain condition that is caused by the presence of such stale keys. The CP server functionality detects such stale keys and provide this information to the vxfen module. The vxfen module performs its arbitration and brings the cluster online.

This feature is available only on Linux when the following criteria are met:

vxfen_mode is set to customized.
Majority of the coordination points are servers.
detect_false_pesb attribute is set to 1.

To enable stale key detection in a server-based fencing configuration

Stop VCS on all nodes.
# hastop -all
Make sure that the port h is closed on all the nodes. Run the following command on each node to verify that the port h is closed:
# gabconfig -a
Port h must not appear in the output.
Stop I/O fencing on all nodes. Run the following command on each node:
# /etc/init.d/vxfen stop
If you have any applications that run outside of VCS control that have access to the shared storage, shut down all the other nodes in the cluster that have access to the shared storage. This prevents data corruption.
Open the /etc/vxfenmode file, add detect_false_pesb=1 at the appropriate location, and save and close the file.
Start the fencing module on all the nodes.
# /etc/init.d/vxfen start
Start VCS on all nodes.
# hastart

See About I/O fencing configuration files.