We currently use The Dude to monitor and send notifications for our entire network. Our current polling is set up as follows:
Probe Interval: 30s
Probe Timeout: 10s
Probe Down Count: 5
So a page gets sent after 3 minutes and 20 seconds of continuous downtime, i.e. 5 x (30s + 10s). Unfortunately, this does not catch a link that is rapidly flapping (disassociating and reassociating).
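To illustrate the gap, here is a rough Python sketch of how I understand the down-count logic. It assumes each failed probe costs the full interval plus the timeout and that a single successful probe resets the count; The Dude's actual scheduling may differ, and the function names and the flapping pattern are made up for illustration.

```python
PROBE_INTERVAL_S = 30
PROBE_TIMEOUT_S = 10
PROBE_DOWN_COUNT = 5


def seconds_until_page():
    # Assumption: every failed probe takes one interval plus one timeout.
    return PROBE_DOWN_COUNT * (PROBE_INTERVAL_S + PROBE_TIMEOUT_S)


def would_page(probe_results):
    """probe_results: iterable of booleans, True means the probe succeeded."""
    consecutive_failures = 0
    for ok in probe_results:
        # Assumption: one success resets the consecutive-failure counter.
        consecutive_failures = 0 if ok else consecutive_failures + 1
        if consecutive_failures >= PROBE_DOWN_COUNT:
            return True
    return False


solid_outage = [False] * 5            # five failed probes in a row
flapping = [False, False, True] * 20  # hypothetical flap: down, down, up, repeated
```

Under these assumptions the solid outage pages after 200 seconds, while the flapping link never reaches five consecutive failures and so never pages.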
We don't want to make our probes so sensitive that we get paged for every blip, but we do need to be made aware of such flapping. Is there a way to count outages that did not trigger a page, or to page once a link has had a given number of outages within 24 hours? Any and all suggestions are welcome. Thank you.
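To make the request concrete, this is roughly the behaviour I am after, sketched in Python as a sliding-window counter. It is not a The Dude feature as far as I know; the class name, the threshold, and the idea of feeding it timestamps from an external script are all assumptions for illustration.

```python
from collections import deque


class FlapCounter:
    """Counts outages inside a sliding time window (hypothetical helper)."""

    def __init__(self, max_outages, window_s=24 * 60 * 60):
        self.max_outages = max_outages
        self.window_s = window_s
        self._outages = deque()  # timestamps (seconds) of recent outages

    def record_outage(self, timestamp_s):
        """Record one outage; return True if a page should be sent."""
        self._outages.append(timestamp_s)
        cutoff = timestamp_s - self.window_s
        # Drop outages that have aged out of the window.
        while self._outages and self._outages[0] <= cutoff:
            self._outages.popleft()
        return len(self._outages) >= self.max_outages
```

For example, `FlapCounter(max_outages=10)` would return True from `record_outage` on the tenth short outage within any 24-hour span, even if none of them lasted long enough to page on its own.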