I’m troubleshooting an SNMP issue while monitoring switches with The Dude. The switches I’m monitoring (five Netgear FSM726s) seem to stop responding to SNMP queries every minute or so.
The result is that The Dude is constantly adding and removing SNMP data for the switches, which is very frustrating when mapping and monitoring links. At first I thought there was a software issue with the switches. However, if I disable SNMP on The Dude, the devices start responding to snmpwalks from local systems within 30 seconds.
That said, an snmpwalk on these switches takes 60-120 seconds or more to dump all of the data (they aren’t that slow; there’s just a ton of data, it seems). I believe The Dude starts an snmpwalk of a device, and that walk is only partially complete by the time The Dude starts the next one. This snowballs until the switch can no longer handle any more concurrent SNMP queries, at which point The Dude starts timing out the SNMP data it discovered in earlier queries.
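To make the hypothesis concrete, here is a rough Python sketch of what I think is happening. It is only a toy model, not a description of how The Dude or the FSM726 actually work: the function name `simulate`, the 30-second poll interval, and the assumption that the switch splits its SNMP capacity evenly across concurrent walks are all guesses of mine. The 90-second walk cost is taken from the middle of the 60-120 second range I measured.

```python
def simulate(poll_interval, walk_cost, total_time):
    """Toy model: a new walk starts every poll_interval seconds.

    walk_cost is how many seconds one walk takes when it runs alone.
    The agent is assumed (hypothetically) to do one second of work per
    second, shared equally among all walks in progress.
    Returns the number of unfinished walks at the end of each second.
    """
    remaining = []  # work left, in agent-seconds, for each walk in progress
    history = []
    for t in range(total_time):
        if t % poll_interval == 0:
            remaining.append(float(walk_cost))  # poller starts a new walk
        if remaining:
            share = 1.0 / len(remaining)  # capacity split across walks
            remaining = [r - share for r in remaining]
            remaining = [r for r in remaining if r > 1e-9]  # drop finished walks
        history.append(len(remaining))
    return history


if __name__ == "__main__":
    # Polling faster than a walk can finish: unfinished walks pile up.
    print("30 s interval, walks in progress after 10 min:", simulate(30, 90, 600)[-1])
    # Polling slower than a walk takes: never more than one walk at a time.
    print("120 s interval, peak concurrent walks:", max(simulate(120, 90, 600)))
```

In this model, whenever the poll interval is shorter than the walk duration, each new walk slows down the ones already running, so the pile of unfinished walks keeps growing. If that matches reality, then making The Dude poll less often, or walk less of the tree, should stop the timeouts.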
First, is my hypothesis correct? If so, is there any way I can prevent this from happening with the current Dude version?
Thanks in advance.
Andrew