Hello, I have searched the forums for ideas to identify when a redundant link goes down and have not been able to find any.
If a tower has two high capacity redundant links available, and one goes down I need a way to become notified. Sure, if a BH literally dies nagios will alert us, but what if the antenna gets blown out of alignment, or something else causes the link to be obstructed but both radios are still reachable via the redundant link. Is there any software out there, opensource or otherwise that can alert you if traffic patterns change greatly based on the statistical past? Or even a way to monitor via SNMP and manually set alerts for bandwidth going below or above a set amount?
A few times now we have had this happen, and discovered it only when a second link fails and the tower goes down completed, who knows how long it was using the backup link?
Let me know how you handle this problem! Thanks in advance!
With cacti (a monitoring server) you can use the Thold plugin to send alerts on threshold breaches, be it going over or under a set amount. There is also the traffic monitor built into MikroTik that you can use for alerts from the router itself, once again available for going above or below a certain amount. You can also probably script things inside of the MikroTik to monitor and alert on a problem depending how you have your links setup.
Thanks for the info guys, I got busy with other projects and forgot I had this post. To answer the one question everything is completely routed. Each BH is on it’s own /30 which resides on the router of the tower they are physically at. I didn’t realize the traffic charts in routeros could be set to send alerts, and I will definitely check out the cacti plugin. Any advice on which one seems to work the best? Perhaps cacti since it can be completely managed from a single front end?