I use Netwatch with an UP and DOWN script that emails me notifications when a new UP or DOWN condition occurs.
The non-default netwatch settings include:
packet-count=270
packet-interval=1s
thr-avg=400ms
thr-loss-percent=95%
This is the script entry:
/tool netwatch
add comment=Netwatch-192.168.0.11 disabled=no down-script=netwatch-5-2025 host=192.168.0.11 http-codes="" interval=5m name=Netwatch-192.168.0.11 packet-count=270 packet-interval=1s test-script="" thr-avg=400ms thr-loss-percent=95% timeout=4s type=icmp up-script=netwatch-5-2025
I am occassionally getting emails such as these:
2025-11-15 17:58:18 Netwatch-192.168.0.11 629hAPac3 down to 192.168.0.11 with rtt-avg of 76 and loss-percent of 1%. The thresholds are thr-avg of 400ms and loss-percent of 95%.
rtt-avg and loss-percent were clearly not the triggers of the DOWN condition.
So I added a logging topic for netwatch:
/system logging
add topics=netwatch
And waited until the next DOWN email arrived. Checked the log and this is what I found:
[FAIL] rtt-max: 3757.254 ms [ > 1000.000 ms ]
[FAIL] rtt-jitter: 3736.313 ms [ > 1000.000 ms ]
[ OK ] rtt-avg: 78.987 ms [ <= 400.000 ms ]
[FAIL] rtt-stdev: 303.947 ms [ > 250.000 ms ]
[ OK ] loss count: 3 [ <= 4294967295 ]
[ OK ] loss: 1.1% [ <= 95.0% ]
Seems that the rtt-max, rtt-jitter, and rtt-stdev occassionally are above the default values.
What would you do?
Explicitly set those parameters to higher levels, or leave as is and accept what, for my purposes, are false-positives (at least insofar as actionable (i.e., worryable) events)? Or something else?