Hi all,
I have a script through API Rest that performs pings and suddenly it has stopped working and only returns “timeout”. From the terminal it doesn’t work either but what does work perfectly is the traceroute.
The log in the rules filter indicates that there is both output and input icmp traffic at the time I perform the ping but incomprehensibly responds timeout.
I suspect that if I restart the router it will work again but it is not something that I can do whenever I want.
It could not ping anything but I could telnet to ports on the destinations..
i.e. ping 192.168.1.1 failed with timeout, but telnet 192.168.1.1 I could login to the router.
This was not just for one device it was for all devices on the network.. routers, switches, servers, etc… icmp not working but telnet/ssh/http ok to the destinations.
It’s still happening on 7.3 but now it recovers a few hours later with no intervention… so still an issue but hard to replicate and with 7.3 it lasts 4-5hours before it comes good again.
Looks like Mikrotik managed to reproduce this. They told me this is due to valid packets being seen as invalid and thus discarded.
Let’s hope we will see a fix soon.
I have this bug only on x86 platforms. After 36/48h i got the ping tool not working. So all the route with check-gateway=ping are disabled.
To workaround it I reboot the router each night.
Now with 7.4 there is a differnt behavior:
netwatch works correctly, this mean that this tool correctly identifies if an host is up (also if the regular ping tool don’t get reply due to this bug) and down, in fact blocking icmp for an host it get the host down.
the check-gateway=ping in route follow not the netwatch but the ping tool, so during the issue, the routes with check-gateway=ping got disabled and at the same time the netwatch report the gateway up…..
this pushes me to thing that ping tool during the issue works on a wrong vrf or socket or something like that.
I updated my ticket SUP-67221 about it.
regards
Ros