LLT module fails to load (SFHA 6.1 cluster on Red Hat Enterprise Linux 6)

Article ID: 100011856

Description

Error Message

 OS messages:

 Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol ib_create_cq 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol ib_create_cq 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol rdma_resolve_addr 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol rdma_resolve_addr 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol ib_dereg_mr 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol ib_dereg_mr 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol rdma_reject 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol rdma_reject 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol rdma_disconnect 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol rdma_disconnect 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol rdma_resolve_route 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol rdma_resolve_route 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol rdma_bind_addr 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol rdma_bind_addr 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol rdma_create_qp 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol rdma_create_qp 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol ib_destroy_cq 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol ib_destroy_cq 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol rdma_create_id 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol rdma_create_id 

Feb 12 19:20:24 falcon01 kernel: llt: disagrees about version of symbol rdma_listen 

Feb 12 19:20:24 falcon01 kernel: llt: Unknown symbol rdma_listen 
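The messages above come in pairs, one "disagrees about version" line and one "Unknown symbol" line per symbol. A minimal sketch for reducing such a log to the unique list of affected symbols is shown below. The function name `llt_mismatched_symbols` is hypothetical, and `/var/log/messages` is assumed to be the syslog file, as is the default on RHEL 6.

```shell
# List the unique kernel symbols for which llt reported a version mismatch.
# Reads syslog-format lines on stdin; the symbol name is the last field.
llt_mismatched_symbols() {
    grep 'llt: disagrees about version of symbol' |
        awk '{ print $NF }' |
        sort -u
}

# Example (log path is an assumption; adjust for your syslog configuration):
#   llt_mismatched_symbols < /var/log/messages
```

If every symbol in the result starts with `ib_` or `rdma_`, as it does here, the mismatch is confined to the RDMA stack.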

 

Cause

This problem is tracked as Etrack incident 3410309: "LLT should load in non-RDMA mode if external OFED is installed on the box." The description of the Etrack incident follows.

 
ONE_LINE_ABSTRACT: The LLT driver fails to load and logs a message in syslog. 
 
SYMPTOM: 
The Low Latency Transport (LLT) driver fails to load and logs the following 
messages in the syslog when a mismatch is observed in the RDMA-specific symbols. 
 
llt: disagrees about version of symbol rdma_connect 
llt: Unknown symbol rdma_connect 
llt: disagrees about version of symbol rdma_destroy_id 
llt: Unknown symbol rdma_destroy_id 
 
DESCRIPTION: 
The LLT driver fails to load when an external OFED (OFA or MLNX_OFED) stack is 
installed on a system. The OFED replaces the native RDMA-related drivers 
(shipped with the OS) with the external OFED drivers. Since LLT is built against 
the native RDMA drivers, the load fails with a symbol mismatch when the startup 
script tries to load LLT. 
 
RESOLUTION: 
The LLT startup script is modified to detect whether an external OFED is 
installed on the system. 
If the script detects an external OFED, it loads a non-RDMA version of the LLT 
driver, built without the RDMA symbols. Since this LLT driver does not contain 
RDMA-specific symbols, it loads successfully. However, it does not provide the 
RDMA functionality. In this case, LLT can be used in either Ethernet or UDP 
mode. 
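
The check described in the resolution can be approximated by hand. The sketch below is not the logic of the patched LLT startup script, which this article does not show. It is a heuristic that assumes in-box modules live under `/lib/modules/<kernel-version>/kernel/` and that external OFED stacks install their replacements elsewhere, typically under `updates/` or `extra/`. The function names are hypothetical.

```shell
# Classify a module file path as in-box ("native") or out-of-tree ("external").
# Assumption: distribution modules sit under /lib/modules/<kver>/kernel/.
classify_module_path() {
    [ -n "$1" ] || { echo unknown; return; }
    rest=${1#/lib/modules/}   # drop the fixed prefix
    rest=${rest#*/}           # drop the kernel-version directory
    case "${rest%%/*}" in
        kernel) echo native ;;
        *)      echo external ;;
    esac
}

# Report the origin of the ib_core module that modprobe would load.
# "modinfo -n" prints only the module's file name.
rdma_stack_origin() {
    classify_module_path "$(modinfo -n ib_core 2>/dev/null)"
}
```

A result of `external` from `rdma_stack_origin` suggests that the native RDMA drivers have been replaced, which matches the condition described above. A result of `unknown` means `modinfo` could not locate `ib_core` at all.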

 

Resolution

Apply patch vcs-rhel6_x86_64-VRTSllt-6.1.0.100 to fix the issue. The patch can be downloaded from the Veritas Services and Operations Readiness Tools (SORT) website:

https://sort.Veritas.com/patch/detail/8338 
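
After installing the patch, the package level can be confirmed with `rpm`. The sketch below assumes that the patched package reports its RPM version as 6.1.0.100, which is inferred from the patch name and from the GA package name VRTSllt-6.1.0.000 rather than confirmed by the patch documentation. The function names are hypothetical, and `sort -V` requires GNU coreutils.

```shell
# Succeed if installed version $1 is at least the required version $2.
# Uses GNU "sort -V" (version sort) to pick the lower of the two.
version_at_least() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Report whether the installed VRTSllt package is at the patched level.
check_llt_patch() {
    # rpm -q prints a "not installed" message on stdout when the package
    # is missing, so fall back to an empty string in that case.
    installed=$(rpm -q --qf '%{VERSION}\n' VRTSllt 2>/dev/null) || installed=""
    if [ -n "$installed" ] && version_at_least "$installed" 6.1.0.100; then
        echo "VRTSllt $installed is at or above 6.1.0.100"
    else
        echo "VRTSllt ${installed:-(not installed)} is below 6.1.0.100"
    fi
}

# Example:
#   check_llt_patch
```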

 


Applies To

 [root@falcon01 init.d]# uname -a 

Linux falcon01 2.6.32-279.el6.x86_64 #1 SMP Wed Jun 13 18:24:36 EDT 2012 x86_64 x86_64 x86_64 GNU/Linux 
[root@falcon01 init.d]# 
[root@falcon01 init.d]# cat /etc/redhat-release 
Red Hat Enterprise Linux Server release 6.3 (Santiago) 
[root@falcon01 init.d]# 
[root@falcon01 init.d]# rpm -aq|grep -i llt 
VRTSllt-6.1.0.000-GA_RHEL6.x86_64 

Issue/Introduction

 [root@falcon01 init.d]# /etc/init.d/llt start 
Starting LLT: 
LLT: loading module... 
Module with exact version failed to load. 
[root@falcon01 init.d]# 
 
[root@falcon01 init.d]# cat /etc/llttab 
set-node falcon01 
set-cluster 43062 
link em3 eth-00:24:e8:55:dc:40 - ether - - 
link em4 eth-00:24:e8:55:dc:42 - ether - - 
[root@falcon01 init.d]# 
[root@falcon01 init.d]# 
[root@falcon01 init.d]# ifconfig -a 
em1       Link encap:Ethernet  HWaddr 00:24:E8:55:DC:3C  
          inet addr:192.168.0.25  Bcast:192.168.0.255  Mask:255.255.255.0
          inet6 addr: fe80::224:e8ff:fe55:dc3c/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:26782 errors:0 dropped:0 overruns:0 frame:0
          TX packets:1960 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:2133429 (2.0 MiB)  TX bytes:277904 (271.3 KiB)
          Interrupt:36 Memory:d6000000-d6012800 
 
em2       Link encap:Ethernet  HWaddr 00:24:E8:55:DC:3E  
          UP BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)
          Interrupt:48 Memory:d8000000-d8012800 
 
em3       Link encap:Ethernet  HWaddr 00:24:E8:55:DC:40  
          inet6 addr: fe80::224:e8ff:fe55:dc40/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:1 errors:0 dropped:0 overruns:0 frame:0
          TX packets:5 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:64 (64.0 b)  TX bytes:418 (418.0 b)
          Interrupt:32 Memory:da000000-da012800 
 
em4       Link encap:Ethernet  HWaddr 00:24:E8:55:DC:42  
          inet6 addr: fe80::224:e8ff:fe55:dc42/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:1 errors:0 dropped:0 overruns:0 frame:0
          TX packets:4 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:64 (64.0 b)  TX bytes:324 (324.0 b)
          Interrupt:42 Memory:dc000000-dc012800 
 
lo        Link encap:Local Loopback  
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:583 errors:0 dropped:0 overruns:0 frame:0
          TX packets:583 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0 
          RX bytes:99072 (96.7 KiB)  TX bytes:99072 (96.7 KiB)
 
p4p1      Link encap:Ethernet  HWaddr 00:16:31:F0:42:C0  
          inet addr:10.10.10.1  Bcast:10.255.255.255  Mask:255.0.0.0
          inet6 addr: fe80::216:31ff:fef0:42c0/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:275 errors:0 dropped:0 overruns:0 frame:0
          TX packets:345 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:22559 (22.0 KiB)  TX bytes:32067 (31.3 KiB)
 
p4p2      Link encap:Ethernet  HWaddr 00:16:31:F0:42:C1  
          inet addr:11.11.11.1  Bcast:11.255.255.255  Mask:255.0.0.0
          inet6 addr: fe80::216:31ff:fef0:42c1/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:274 errors:0 dropped:0 overruns:0 frame:0
          TX packets:339 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:22500 (21.9 KiB)  TX bytes:31725 (30.9 KiB)
[root@falcon01 init.d]#
 
 
NICs em3 and em4 are not RDMA devices; they are ordinary physical Ethernet NICs.
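
The configured link type of each LLT link can be read straight from /etc/llttab. The sketch below assumes that the link type is the fifth whitespace-separated field of each `link` directive, as it is in the llttab shown above. The function name `llt_link_types` is hypothetical.

```shell
# Print "<tag> <link-type>" for every "link" directive in an llttab file.
# Defaults to /etc/llttab when no file is given.
llt_link_types() {
    awk '$1 == "link" { print $2, $5 }' "${1:-/etc/llttab}"
}

# Example:
#   llt_link_types /etc/llttab
```

For the llttab in this article, both links are reported as `ether`, so the failure to load is not caused by an RDMA link definition.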

Additional Information

ETrack: 3410309