Unsolved
This post is more than 5 years old
4 Posts
0
32827
October 5th, 2015 18:00
OME 2.1 and Erroneous System Down Alerts
I have a freshly installed OME 2.1 installation and 4 brand new R730s. I added a discovery range with their iDRAC IP addresses and configured them for WS-MAN.
Everything looks fine after the initial discovery and inventory. After the first polling cycle, I keep getting erroneous system down alerts.
The systems are on the same physical network (different VLAN) with less than 1ms latency. I don't see any issues.
Any ideas?
No Events found!
DELL-Vijay B
183 Posts
0
October 6th, 2015 00:00
Hi,
One thing to check when you are getting system down alerts. Are you able to ping the iDRAC IP from the OME system when during the polling cycle you are observing this?
Thanks,
Vijay
pgfitzgerald
4 Posts
0
October 6th, 2015 06:00
I have observed no connectivity issues. Pings are consistent, with zero packet loss and less than 1ms latency.
DELL-Rob C
3 Apprentice
•
2.8K Posts
0
October 6th, 2015 07:00
Is your status poll still set to the original default value? I think it might be 60 minutes.
Also, there are timeout values and retries in the discovery wizard on the WSMan page. Perhaps bumping those up a bit might help.
Regards,
Rob
pgfitzgerald
4 Posts
0
October 6th, 2015 07:00
Yes, status polling schedule is the default 60 minutes.
The WS-Man configuration for the discovery range is still the default:
Is there a log anywhere that would indicate the reason for the system down alert? Is the status check truly timing out?
DELL-Vijay B
183 Posts
0
October 6th, 2015 22:00
Hi,
You can enable the schedule poll logs by modifying dconfig.ini file for following flag:
[TASK_AUDIT]
AUDIT_ENABLED=true
After you modify this setting in dconfig.ini, you need to restart OME Services for settings to take effect. Right click the status poll that executed for that range/device and see the detailed logs.
Note: Please disable the audit logs after you have analyzed/collected the logs as performance will be degraded with enabled audit logs.
Also, can you confirm whether you are enabling both WSMAN/SNMP or only WSMAN for that device discovery?
Thanks,
Vijay
pgfitzgerald
4 Posts
0
October 15th, 2015 08:00
Just wanted to follow up...
I haven't changed anything, but it seems to have settled down about a week ago (several days after I last posted). I'm not sure why additional time would have had any impact on anything. I'm going to keep an eye on things nonetheless.
Thanks!