After upgrade of InfoScale Enterprise 7.3.1 to 7.4.2 on Solaris 11, CVM resource on cluster node fails to come online

book

Article ID: 100053513

calendar_today

Updated On:

Description

Error Message

From the CVMCluster_A.log:

2022/07/17 12:07:18 VCS ERROR V-16-20006-1005 CVMCluster:cvm_clus:monitor:node - state: out of cluster
reason: error in processing members

The following error may be present in that log:

ERROR: VxVM vxclustadm ERROR V-5-1-9743 errno 3

In addition, the following error may be present in the OS message log:

vxio: NOTICE: VxVM vxio V-5-0-1379 Node failed to join because of protocol version mismatch with the running cluster

Cause

During the upgrade, the cluster protocol version may not have changed. This mismatch will ultimately prevent the cluster from communicating and functioning properly. The mismatch can be verified by a comparison between the protocol versions from the volboot file on the problem node and the working node.

In the following example, the CVM resource is online on node1 but failing to online on node2. Note the difference between the protocol version numbers in the volboot files on both nodes:

root@node1# cat etc/vx/volboot

volboot 3.1 0.3 200
hostid
hostguid {}
request_threads 2
Command_Shipping 1
end

root@node2# cat etc/vx/volboot

volboot 3.1 0.2 240
hostid
hostguid {}
request_threads 2
Command_Shipping 1
end

 

Resolution

1. Stop the cluster locally:

# hastop -local -force

2. Update cluster protocol version on problem node to match working node:

# vxdctl setversion 200

3. Start cluster.

# hastart

4. Check both the volboot file version and the cluster protocol version itself to ensure they have been updated successfully after cluster is back online locally. 

# cat /etc/vx/volboot

# vxdctl support

Check cluster status to ensure the CVM service group has come online.

# hastatus -sum

Issue/Introduction

After upgrade of InfoScale Enterprise 7.3.1 to 7.4.2 on Solaris 11, CVM resource faults on one of the cluster nodes after maintaining a "waiting to come online" (W_ONLINE) status for extended period of time.