I’m running different email notifications in the dude for fault purposes. The email notifications work perfectly fine when setting any of the statuses, in this situation, up–>down and delayed message to 10 minutes/I get an email.
Now, I’d like to be able to set this option, if service comes back before the designated email time(10 minutes) to stop that email from coming and that way I only get an email when the fault passes the 10 minutes. Is there any setting in the dude to set this up?
Good evening, I’m in the same need. Did you manage to have it working?
As suggested in http://forum.mikrotik.com/t/send-mail-after-long-outage/89149/1 one way to accomplish that is to create a probe that actually fails (e.g. goes in DOWN state) after 10 minutes.
As far as I know Device statuses can be UP or UNSTABLE or DOWN. Once in DOWN the notification is raised (i.e. the mail is sent) regardless the optionally configured delay… if you set 10 minutes delay you will simply receive the notification with a delay of 10 minutes BUT in any case you will receive it, regardless the fact that the error condition has been cleared during those 10 minutes. So this is NOT the way!
You can get what you want playing into the UNSTABLE filed: you can create a new probe that actually takes 10 minutes to fail bringing in DOWN the status of the device. If the error condition clears before 10 minutes then the status of the device moves from UNSTABLE to UP and no notification is raised.
Everything turns around the way you define the probe itself (retry count, retry interval) and the way you use the probe in the service (probe interval, probe timeout, probe down-count): they definitely are not the same thing!
It took years of misunderstandings to me to figure out how to properly mix the two things.
Regards.
f.
If I understand you correctly, you only want to be notified if the error condition last beyond 10 minutes.
If that is the case, we solved our problem by increasing the Probe interval and Probe down count.
In theory, a Probe interval of 5 minutes & Probe down count of 3, should result in an email after 15 minutes of down count.
I too am inundated with this up, down emails and have been using this method to prevent unnecessary emails. The above values do not work across the board & I simply keep increasing the values for a specific device till I hit a sweet spot. Not sure if it is the correct approach , but I am at wits end.