Veritas InfoScale™ 7.3.1 Troubleshooting Guide - Linux
- Introduction
- Section I. Troubleshooting Veritas File System
- Section II. Troubleshooting Veritas Volume Manager
- Recovering from hardware failure
- Failures on RAID-5 volumes
- Recovery from failure of a DCO volume
- Recovering from instant snapshot failure
- Recovering from failed vxresize operation
- Recovering from boot disk failure
- VxVM boot disk recovery
- Recovery by reinstallation
- Managing commands, tasks, and transactions
- Backing up and restoring disk group configurations
- Troubleshooting issues with importing disk groups
- Recovering from CDS errors
- Logging and error messages
- Troubleshooting Veritas Volume Replicator
- Recovery from configuration errors
- Errors during an RLINK attach
- Errors during modification of an RVG
- Recovery on the Primary or Secondary
- Recovering from Primary data volume error
- Primary SRL volume error cleanup and restart
- Primary SRL header error cleanup and recovery
- Secondary data volume error cleanup and recovery
- Troubleshooting issues in cloud deployments
- Recovering from hardware failure
- Section III. Troubleshooting Dynamic Multi-Pathing
- Section IV. Troubleshooting Storage Foundation Cluster File System High Availability
- Troubleshooting Storage Foundation Cluster File System High Availability
- Troubleshooting CFS
- Troubleshooting fenced configurations
- Troubleshooting Cluster Volume Manager in Veritas InfoScale products clusters
- Troubleshooting interconnects
- Troubleshooting Storage Foundation Cluster File System High Availability
- Section V. Troubleshooting Cluster Server
- Troubleshooting and recovery for VCS
- VCS message logging
- Gathering VCS information for support analysis
- Troubleshooting the VCS engine
- Troubleshooting Low Latency Transport (LLT)
- Troubleshooting Group Membership Services/Atomic Broadcast (GAB)
- Troubleshooting VCS startup
- Troubleshooting issues with systemd unit service files
- Troubleshooting service groups
- Troubleshooting resources
- Troubleshooting I/O fencing
- System panics to prevent potential data corruption
- Fencing startup reports preexisting split-brain
- Troubleshooting CP server
- Troubleshooting server-based fencing on the Veritas InfoScale products cluster nodes
- Issues during online migration of coordination points
- Troubleshooting notification
- Troubleshooting and recovery for global clusters
- Troubleshooting licensing
- Licensing error messages
- VCS message logging
- Troubleshooting and recovery for VCS
- Section VI. Troubleshooting SFDB
Replacing a failed boot disk mirror
Messages such as the following may be displayed while booting from the primary boot disk if a mirror of the boot disk fails:
Starting rootvol, swapvol... VxVM vxconfigd WARNING V-5-1-122 Detaching plex mirrootvol-01 from volume rootvol VxVM vxconfigd WARNING V-5-1-122 Detaching plex mirswapvol-01 from volume swapvol VxVM vxconfigd WARNING V-5-1-546 Disk rootmir in group bootdg: Disk device not found
Failure of a mirror of the root disk is discovered at boot time when the volumes on the root disk are started. To maintain the integrity of your system, replace the failed disk at the earliest possible opportunity.
Use the vxprint -d command to confirm that the root disk mirror has failed:
# vxprint -d TY NAME ASSOC KSTATE LENGTH PLOFFS STATE TUTIL0 PUTIL0 dm rootdisk sda - 16450497 - - - - dm rootmir - - - - NODEVICE - -
In this example, the boot disk mirror, rootmir, is shown with the state NODEVICE.
To reconnect a disconnected root mirror
- Shut down the system, and then power it down.
- Reconnect the disk.
- Power up the system, and select vxvm_root at the GRUB or LILO boot prompt.
- Use the vxprint -d command to confirm that the disk is now active:
# vxprint -d TY NAME ASSOC KSTATE LENGTH PLOFFS STATE TUTIL0 PUTIL0 dm rootdisk sda - 16450497 - - - - dm rootmir sdb - 16450497 - - - -
- Use the vxprint -p command to view the state of the plexes. One or more of the plexes on the mirror disk are shown with the state STALE until their contents are recovered. You can use the vxtask command to monitor how the recovery and reattachment of the stale plexes is progressing, as shown in this example:
# vxtask list TASKID PTID TYPE/STATE PCT PROGRESS 160 PARENT/R 0.00% 2/0(1) VXRECOVER 161 161 ATCOPY/R 41.78% 0/12337857/5155232 PLXATT rootvol mirrootvol
- Use the vxplex command to remove the plex records that were on the failed disk:
# vxplex -o rm dis mirrootvol-01 mirswapvol-01
This example removes the plexes mirrootvol-01, and mirswapvol-01 that are configured on the mirror disk. You may need to modify the list of plexes according to your system configuration.
To replace a failed root mirror
- Shut down the system, and then power it down.
- Replace the failed disk with a disk of the same or larger capacity.
- Power up the system, and select vxvm_root at the GRUB or LILO boot prompt.
- Use the fdisk command to ensure that the root disk and the replacement mirror disk have the same geometry. See the fdisk(8) manual page for details.
- Run the following command to mirror the volumes on root disk onto the replacement disk:
# /etc/vx/bin/vxrootmir sdb rootmir
This example assumes that the disk media name of the replacement mirror disk is sdb. You may need to modify this name according to your system configuration.