RHS.exe process crashes, causing the VMDg resource to fault in a Microsoft Windows Server 2008 R2 Failover Cluster after installing CP17 or CP18 for SFW 5.1 SP2

Article ID: 100010293

Description

Error Message

System Event Log
ERROR    1230(0x000004ce) Microsoft-Windows-FailoverClustering
Cluster resource 'X' (resource type '', DLL 'vxres.dll') either crashed or deadlocked. The Resource Hosting Subsystem (RHS) process will now attempt to terminate, and the resource will be marked to run in a separate monitor. 

Application Event Log
08/03/2013 09:50:00 PM ERROR    1000(0x000003e8) Application Error Server1.FQDN
Faulting application name: rhs.exe, version: 6.1.7601.17514, time stamp: 0x4ce79820
Faulting module name: ntdll.dll, version: 6.1.7601.17725, time stamp: 0x4ec4aa8e
Exception code: 0xc0000005
Fault offset: 0x00000000000532d0 
Faulting process id: 0x51ac
Faulting application start time: 0x01ce90aa9bb5021f
Faulting application path: C:\Windows\Cluster\rhs.exe
Faulting module path: C:\Windows\SYSTEM32\ntdll.dll
Report Id: 2c04c2f7-fca8-11e2-bfbf-001b78427272

Cluster Logs
000051ac.000030a0::2013/08/04-01:49:56.360 INFO  [RES] Volume Manager Disk Group : +++++++ Entering LDM_RESResourceControl +++++++
000051ac.000030a0::2013/08/04-01:49:57.826 ERR   [RES] Volume Manager Disk Group : ClusterResourceStorageGetMountPoints: GetVolumePathNamesForVolumeName failed Error: 234.
000051ac.000030a0::2013/08/04-01:49:57.826 INFO  [RES] Volume Manager Disk Group : +++++++ Exiting LDM_RESResourceControl +++++
000051ac.00001ccc::2013/08/04-01:49:57.826 INFO  [RES] Volume Manager Disk Group : +++++++ Entering LDM_RESResourceControl +++++++
00001a08.000037b4::2013/08/04-01:50:00.275 WARN  [RCM] HandleMonitorReply: FAILURENOTIFICATION for 'SQL2K8DEV08_DG', gen(6) result 5.
000051ac.00001ccc::2013/08/04-01:50:00.275 ERR   [RHS]: caught exception c0000005 in call RESOURCECONTROL for SQL2K8DEV08_DG.
00001a08.000037b4::2013/08/04-01:50:00.275 INFO  [RCM] rcm::RcmResource::HandleMonitorReply: Resource 'SQL2K8DEV08_DG' consecutive failure count 1.
00001a08.00002d78::2013/08/04-01:50:03.629 WARN  [RCM] Canceling pending control STORAGE_GET_MOUNTPOINTS for resource 'SQL2K8DEV08_DG' due to monitor crash.
00001a08.00003ad4::2013/08/04-01:50:03.629 WARN  [RCM] ResourceControl(STORAGE_GET_MOUNTPOINTS) to SQL2K8DEV08_DG returned 5038.
00001a08.00002d78::2013/08/04-01:50:03.629 INFO  [RCM] TransitionToState(SQL2K8DEV08_DG) Online-->ProcessingFailure.
00001a08.00002d78::2013/08/04-01:50:03.629 ERR   [RCM] rcm::RcmResource::HandleFailure: (SQL2K8DEV08_DG)
00001a08.00002d78::2013/08/04-01:50:03.629 INFO  [RCM] TransitionToState(SQL2K8DEV08_DG) ProcessingFailure-->[WaitingToTerminate to DelayRestartingResource]. 
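To check whether a cluster log shows the same signature, the following minimal sketch scans log lines for the two entries seen above: the `GetVolumePathNamesForVolumeName failed Error: 234` message from the Volume Manager Disk Group resource, followed by an RHS `c0000005` exception in the same process. The function name `find_signature` and the regular expression are illustrative assumptions based only on the line layout shown in this article; they are not part of any Veritas or Microsoft tool.

```python
import re

# Line layout as it appears in this article's cluster log excerpt:
# <pid>.<tid>::<yyyy/mm/dd-hh:mm:ss.mmm> <LEVEL> <message>
LINE_RE = re.compile(
    r"^(?P<pid>[0-9a-f]{8})\.(?P<tid>[0-9a-f]{8})::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<level>[A-Z]+)\s+(?P<msg>.*)$"
)
CRASH_RE = re.compile(
    r"caught exception c0000005 in call RESOURCECONTROL for (\S+?)\.?$"
)

def find_signature(lines):
    """Return resources whose RHS process logged error 234 and then crashed."""
    pids_with_error_234 = set()
    crashed = []
    for raw in lines:
        m = LINE_RE.match(raw.strip())
        if not m:
            continue  # skip lines that do not follow the expected layout
        msg = m.group("msg")
        if "GetVolumePathNamesForVolumeName failed Error: 234" in msg:
            pids_with_error_234.add(m.group("pid"))
        hit = CRASH_RE.search(msg)
        if hit and m.group("pid") in pids_with_error_234:
            crashed.append(hit.group(1))
    return crashed

# Two lines copied from the excerpt above, plus one from a different process.
SAMPLE = [
    "000051ac.000030a0::2013/08/04-01:49:57.826 ERR   [RES] Volume Manager Disk Group : "
    "ClusterResourceStorageGetMountPoints: GetVolumePathNamesForVolumeName failed Error: 234.",
    "00001a08.000037b4::2013/08/04-01:50:00.275 WARN  [RCM] HandleMonitorReply: "
    "FAILURENOTIFICATION for 'SQL2K8DEV08_DG', gen(6) result 5.",
    "000051ac.00001ccc::2013/08/04-01:50:00.275 ERR   [RHS]: caught exception c0000005 "
    "in call RESOURCECONTROL for SQL2K8DEV08_DG.",
]

if __name__ == "__main__":
    print(find_signature(SAMPLE))
```

Matching on the process ID ties the buffer error to the RHS instance that crashed, which avoids flagging unrelated access violations in other monitor processes.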

Cause

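The issue occurs when the cluster sends the CLUSCTL_RESOURCE_STORAGE_GET_MOUNTPOINTS control code to the VMDg resource. In response, the resource DLL (vxres.dll) returns an incorrect required output buffer size for the GetVolumePathNamesForVolumeNameW() API. In the cluster log, the API call fails with error 234 (ERROR_MORE_DATA) shortly before RHS reports exception c0000005 (access violation). The issue results from a buffer-size change introduced in CP17.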

Resolution

Veritas has identified the issue, and the fix will be available for download via the Patch Finder on the SORT website.

This issue has been resolved by modifying the affected vxres.dll binary so that the VMDg resource returns the required output buffer size converted from WCHARs to BYTEs, even if the GetVolumePathNamesForVolumeNameW function fails. The fix is included in SFW 5.1 SP2 Cumulative Patch 19 (CP19).
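The size conversion described above can be illustrated with a short sketch. GetVolumePathNamesForVolumeNameW reports the required length in characters (WCHARs), while cluster resource control codes exchange buffer sizes in bytes. The code below is not the vxres.dll implementation; `multi_sz_chars`, `get_mount_points`, and the sample mount paths are hypothetical names and data used only to show why a WCHAR count must be multiplied by the WCHAR size before it is returned as a byte count.

```python
WCHAR_SIZE = 2          # size of one WCHAR (UTF-16 code unit) on Windows
ERROR_SUCCESS = 0
ERROR_MORE_DATA = 234   # the error code seen in the cluster log above

def multi_sz_chars(paths):
    """Length in WCHARs of a NUL-separated, double-NUL-terminated path list."""
    per_path = (len(p.encode("utf-16-le")) // WCHAR_SIZE + 1 for p in paths)
    return sum(per_path) + 1  # +1 for the final terminating NUL

def get_mount_points(paths, out_buffer_bytes):
    """Hypothetical stand-in for a mount point control handler.

    Returns (status, required_bytes). The required size is always reported
    in bytes, including when the caller's buffer is too small.
    """
    required_bytes = multi_sz_chars(paths) * WCHAR_SIZE  # WCHAR -> BYTE
    if out_buffer_bytes < required_bytes:
        return ERROR_MORE_DATA, required_bytes
    return ERROR_SUCCESS, required_bytes

if __name__ == "__main__":
    paths = ["C:\\Mount\\Data\\", "E:\\"]  # illustrative mount paths
    chars = multi_sz_chars(paths)
    # A caller that allocates only `chars` bytes gets a buffer half the needed size.
    print(chars, get_mount_points(paths, chars))
    print(get_mount_points(paths, chars * WCHAR_SIZE))
```

If a handler reported the WCHAR count unconverted, a caller that retried with that many bytes would supply a buffer only half the required size, which is consistent with the buffer-size fault described in the Cause section.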


Applies To

  • Veritas Storage Foundation for Windows (SFW) 5.1 SP2 with CP17 or CP18
  • Microsoft Windows Server 2008 R2
  • Microsoft Failover Cluster (FOC)

Issue/Introduction

After installing CP17 or CP18 for SFW 5.1 SP2 in a Microsoft Windows Server 2008 R2 Failover Cluster, the Resource Hosting Subsystem (RHS) process crashes when the Volume Manager Disk Group (VMDg) resource is brought online.

Additional Information

ETrack: 3276701