High availability fire drill check may fail while checking disk groups with a large number of disks
Article ID: 100025645
Resolution
The default value of the DiskGroup ActionTimeout attribute is 30 seconds, which may not be enough time to complete the check when the disk group contains a large number of disks.
To resolve the issue, increase ActionTimeout to 60 seconds:
haconf -makerw                               # Open the VCS configuration for writing
hatype -modify DiskGroup ActionTimeout 60    # The current value is 30 seconds
haconf -dump -makero                         # Save and close the VCS configuration
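The steps above can also be sketched as a small script that reads the value back before the configuration is closed. This is a minimal sketch, not a supported procedure: `NEW_TIMEOUT`, `DRY_RUN`, `run`, and `set_dg_action_timeout` are illustrative names that are not part of VCS, and the read-back assumes that `hatype -value DiskGroup ActionTimeout` is available in your VCS release. By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch: raise the DiskGroup ActionTimeout and read the new value back.
# NEW_TIMEOUT and DRY_RUN are hypothetical helper variables, not VCS settings.
NEW_TIMEOUT="${NEW_TIMEOUT:-60}"
DRY_RUN="${DRY_RUN:-1}"    # 1 = print the commands only, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Open the configuration, change the type-level attribute, read it back,
# then save and close the configuration. Stops at the first failing command,
# in which case the configuration may still be open for writing.
set_dg_action_timeout() {
    run haconf -makerw &&
    run hatype -modify DiskGroup ActionTimeout "$1" &&
    run hatype -value DiskGroup ActionTimeout &&
    run haconf -dump -makero
}

set_dg_action_timeout "$NEW_TIMEOUT"
```

Run the script as is to review the commands, then set `DRY_RUN=0` on a cluster node to apply them. Because ActionTimeout is modified at the type level here, the new value applies to all DiskGroup resources that do not override the attribute.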
Issue/Introduction
The high availability fire drill detects discrepancies between the VCS configuration and the underlying infrastructure on a node.
These discrepancies might prevent a service group from going online on a specific node.
For DiskGroup resources, the high availability fire drill checks for:
- The Veritas Volume Manager license
- Visibility from host for all disks in the disk group
- The same disks for the disk group on cluster nodes
- Equal number of plexes on all sites for the disk group in a campus cluster setup
The issue occurs after the following command is run:
2011/07/04 15:18:36 VCS INFO V-16-1-50135 User root fired command: hares -action sap_dg checkudid 0 punt1000-c4n from localhost
If the disk group contains a large number of disks, the action might fail with the following output:
2011/07/04 15:18:37 VCS INFO V-16-2-13716 (punt1000-c4n) Resource(sap_dg): Output of the completed operation (actions)
==============================================
UDIDs for device <vmax_709 do not match on cluster nodes
==============================================