VCS WARNING V-16-1-10023 Agent NotifierMngr not sending alive messages since Fri Aug 3 23:56:12 2012
VCS WARNING V-16-1-53025 Agent NotifierMngr has faulted; ipm connection was lost; restarting the agent
The NotifierMngr resource failed to come online and logged the errors above after a new entry was added to its SNMPConsoles attribute.
The newly added SNMPConsoles IP address was not reachable from the server, so the agent was unable to send the trap.
Warning queue length :
Notifier State: disconnect Host: tkp-lclprd-01 Severity: Warning Queue: 30
The following steps can also be used to troubleshoot NotifierMngr resource issues :
a) To enable NotifierMngr agent logging :
root# haconf -makerw
root# hatype -modify $typename LogDbg DBG_AGINFO DBG_AGDEBUG
root# hatype -modify $typename LogDbg -add DBG_1
root# hatype -modify $typename LogDbg -add DBG_2
root# hatype -modify $typename LogDbg -add DBG_3
root# hatype -modify $typename LogDbg -add DBG_4
root# haconf -dump -makero
To disable the logging :
root# haconf -makerw
root# hatype -modify $typename LogDbg -delete -keys
root# haconf -dump -makero
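The enable and disable sequences above can be wrapped in two small shell functions so that they are easy to repeat. This is only a sketch: TYPENAME, DRY_RUN, run, enable_agent_debug and disable_agent_debug are illustrative names that are not part of VCS, and the default value NotifierMngr for $typename is an assumption. With DRY_RUN=1 the commands are printed rather than executed.

```shell
#!/bin/sh
# Sketch: toggle agent debug logging for one VCS resource type.
# TYPENAME and DRY_RUN are illustrative variables, not VCS settings.
TYPENAME="${TYPENAME:-NotifierMngr}"   # assumed value for $typename
DRY_RUN="${DRY_RUN:-1}"                # 1 = only print the commands

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Same command sequence as the "enable" steps above.
enable_agent_debug() {
    run haconf -makerw
    run hatype -modify "$TYPENAME" LogDbg DBG_AGINFO DBG_AGDEBUG
    for level in DBG_1 DBG_2 DBG_3 DBG_4; do
        run hatype -modify "$TYPENAME" LogDbg -add "$level"
    done
    run haconf -dump -makero
}

# Same command sequence as the "disable" steps above.
disable_agent_debug() {
    run haconf -makerw
    run hatype -modify "$TYPENAME" LogDbg -delete -keys
    run haconf -dump -makero
}
```

Calling enable_agent_debug with DRY_RUN=1 lists the commands for review; setting DRY_RUN=0 as root runs them against the cluster.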
b) Check the SMTP server configuration. One way to do this is as follows:
- Open a session with port 25 of the SmtpServer:
telnet $SmtpServer 25
This will show the SmtpServer's name.
- Identify yourself to the server; some servers require this before accepting other commands:
helo {node's FQDN}
You will get back a "nice to meet you" message.
- Verify the email address. Note the numeric reply code:
VRFY $SmtpRecipient
The following reply codes are acceptable: 250, 251, 252, 502
- Quit the session:
quit
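If the VRFY check is repeated against several recipients, the reply code test can be expressed as a small shell function. This is a sketch only: vrfy_reply_ok is a hypothetical helper name, and it assumes the first line of the server's reply is passed in as a single argument.

```shell
# Sketch: decide whether an SMTP VRFY reply line carries an acceptable code.
# vrfy_reply_ok is a hypothetical helper, not a VCS or SMTP utility.
vrfy_reply_ok() {
    # The reply code is the first three characters of the reply line.
    code=$(printf '%s' "$1" | cut -c1-3)
    case "$code" in
        250|251|252|502) return 0 ;;   # codes listed as OK in the steps above
        *)               return 1 ;;   # anything else needs investigation
    esac
}
```

For example, `vrfy_reply_ok "$reply" && echo "recipient accepted"` prints the message only when the reply starts with one of the four listed codes.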
c) Try sending email with the hanotify command rather than mailx to see whether that works.
Create an Error-level notification event using either of the following commands.
- Enter command:
hanotify -i .1.3.6.1.4.1.1302.3.8.10.2.2 -l Error -n vcs -T 5 -t 1 -m " test "
- Enter command:
notifier -t m={SMTPSERVER.EN_US},e={RECIPIENT.EN_US},l=Information
d) Run tcpdump and check whether the node is using the VCS VIP address to communicate with the SMTP server.
-> On both nodes:
+ Start a tcpdump to capture the SMTP traffic (port 25) with:
# tcpdump -vv -x -X -s 1500 -w /var/tmp/tcpdump_port_25.out -i eth0 'port 25'
and keep it running.
-> On the same node, run the following command three times:
/opt/VRTS/bin/hanotify -i .1.3.6.1.4.1.1302.3.8.10.2.2 -l Error -n vcs -T 5 -t 1 -m "test"
-> Then press Ctrl-C to stop the tcpdump, and collect and send the "/var/tmp/tcpdump_port_25.out" file.
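Before sending the capture, it can be inspected locally to see which source address the node used toward port 25. The following is a minimal sketch, assuming IPv4 traffic and the one-line text format that tcpdump prints when a capture is read back with -nn and without -v; smtp_source_ips is a hypothetical helper name.

```shell
# Sketch: list the distinct source IPv4 addresses that sent packets to port 25.
# smtp_source_ips is a hypothetical helper; it reads tcpdump's one-line text
# output (for example from "tcpdump -nn -r <file>") on standard input.
smtp_source_ips() {
    awk '$2 == "IP" && $4 == ">" && $5 ~ /\.25:$/ {
        src = $3
        sub(/\.[0-9]+$/, "", src)   # drop the trailing source port
        print src
    }' | sort -u
}
```

A possible invocation is `tcpdump -nn -r /var/tmp/tcpdump_port_25.out | smtp_source_ips`; the addresses printed can then be compared with the VCS VIP address and the node's base address.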
Applies To
OS : Solaris 10
SFHA Version : 5.0