Veritas SCSI-3 Fencing disks are not supported with Solaris 10 LDOM Guest with or without Solaris MPGROUPs

Article: 100052433
Last Published: 2022-02-25
Ratings: 7 3
Product(s): InfoScale & Storage Foundation

Problem


Solaris MPGROUPs must be used to present LDOM Virtual Disks in connection with Solaris 10 LDOM Guests.

Veritas does NOT support SCSI-3 Fencing disks with or without the use of MPGROUPs in LDOM Solaris 10 Guest environments.

NOTE: Veritas SCSI-3 Fencing disks have never been supported with MPGROUPs.

Clustered Volume Manager (CVM) is also impacted and cannot use traditional SCSI-3 Fencing disks for Solaris 10 Guests.

MPGROUPs is the only LDOM presentation vehicle (interface) to manage and handle Virtual disks presented to Solaris 10 LDOM Guests.

The use of MPGROUPs is NOT supported with Solaris 11.3 onwards and the Veritas Product Suite.
 

 

Error Message

 

Impact:

Commands will hang in the Solaris 10 LDOM Guest when the disk communication (access) is lost to the I/O domain. This can be a result of one or more I/O domains being rebooted or has encountered a panic.

Solaris 11.3 onwards provides additional functionality to enable Veritas Dynamic Multi-pathing (vxdmp) in LDOM environments without the use of MPGROUPs (which is NOT supported with Solaris 11 onwards).

Users will see commands hanging in the LDOM Solaris 10 Guests:

Whenever, there are disk mappings from an IO Domain to a Solaris 10 LDOM Guest via MPGROUPs, Veritas observed command hangs. This is seen with and without SCSI-3 Fencing disks.

VxVM commands which access disks (vxdisk scandisks, vxdisk list, etc) and OS native commands like “echo | format” will hang when the I/O domain is rebooted or it panics. This behaviour is common with both layered DMP device and OS native device mapping configurations.

Veritas has raised the below SR with Oracle to troubleshoot & investigate:

SR 3-28492730891 : OS native disk commands hang inside Sol 10 LDOM on IO domain failure in Native OS device + mpgroup
 

Cause

 

Due to the missing vdc.conf functionality limited to Solaris 11.3 onwards. Solaris 10 LDOM Guests is missing this key Solaris LDOM functionality, so Veritas is forced to use MPGROUPs to avoid hangs within the Solaris 10 LDOM guests

Solaris 11.3+ LDOM Guests are more than stable surrounding the reduction of hangs etc whilst I/O domains are panic’ing or being rebooted. 


Solaris 11 Sparc Enhancement

Solaris 11.3 SRU 18.0.6 provides the timeout-noretry-list parameter, which is not currently available with Solaris 10.

File location: /platform/sun4v/kernel/drv/vdc.conf

The vdc.conf file needs to be modified in the GUEST domain, to reduce the chances of specific commands hanging in the logical domain when access to the Control/Service I/O domain is lost.
 
Sample vdc.conf update:# cat /platform/sun4v/kernel/drv/vdc.conf
#
# Copyright (c) 2011, Oracle and/or its affiliates. All rights reserved.
#
#
# Associate the driver with devid resolution.
#
ddi-devid-registrant=1;
timeout-noretry-list=0,1,2,3,4,5;
 
NOTE:

The “timeout-noretry-list” setting has been added to the vdc.conf file shown above.

The Virtual Disk IDS can be extracted from the below LDOM command (for LDOM Guest "scooby"):
 

Sample output (executed from Primary/Control I/O Domain)

# ldm list -o disk scooby
NAME
scooby
  
DISK
    NAME         VOLUME                                  TOUT   ID   DEVICE  SERVER         MPGROUP
    scoobydisk-pri scoobyboot@primary-vds0  25         0    disk@0    primary
    scoobydisk-alt scoobyboot@altio-vds0        25        1    disk@1     altio
    scoobycds-pri scoobycds@primary-vds0    25         2    disk@2    primary
    scoobycds-alt scoobycds@altio-vds0          25        3    disk@3     altio
    scoobysli-pri scoobysli@primary-vds0         25        4    disk@4     primary
    scoobysli-alt scoobysli@altio-vds0               25        5    disk@5     altio


NOTE: The Virtual Disk IDs 0 to 5 in this instance, will be populated in the /platform/sun4v/kernel/drv/vdc.conf file in relation to the "timeout-noretry-list=0,1,2,3,4,5;" attribute.

The Virtual Disk Client (VDC) for virtual disk instances 0,1,2,3,4 & 5 will NOW return an I/O error immediately (for each I/O), after the defined VDC timeout from the time the I/O domain is not accessible.


Veritas QA identified a series of product roadblocks preventing Veritas Fencing working correctly with and without SCSI-3 Fencing disks in Solaris 10 LDOM environments.

Veritas development will continue to work with Oracle Solaris Support to see what can be achieved for Solaris 10 LDOM Fencing Support in the future.

 

Solution

 

To use Veritas Fencing in Solaris 10 LDOMs environments, we recommend the use of Veritas CPS configurations to avoid the use of traditional SCSI-3 based Fencing disks.
 

The only solution we currently have for Solaris 10 LDOM Fencing support, is the deployment and use of Veritas CPS servers.
 

NOTE: Veritas SCSI-3 Fencing disks have never been supported with MPGROUPs and does not work correctly.

 

Was this content helpful?