L009UiGS - Sudden packet loss on all ISP's

Hello,

We have VRRP setup between two L009UiGS’s to allow failover of routers with some scripts to automatically move around public facing IPs, and both routers are setup with ISP failover as well. Each router connects to both ISP supplied routers allowing us to have multi isp failover and router failover if one of the L009UiGS’s should die.

Today something strange happened. We started getting packet loss on all our connected ISP’s. I’m working remotely, so I wasn’t on site but was able to get access through to the router via a SSH tunnel (took a few attempt because the connection kept dropping). Finally managed to get on long enough to reboot the router then everything returned to normal.

These routers are set to graph cpu, memory, disk and interface information, so I took a look at the graphs and there doesn’t seem to be anything untoward (no high cpu/memory usage).

Obviously, rebooting the router has fixed the issue but as a result, I’ve lost the ability to see what could have been causing it. My other concern is that these run VRRP for failover, but obviously the router was alive enough to maintain a master role but performing badly. Had a quick look at the change log of RouterOS to see if there was any performance related issues with the firmware the routers were running but couldn’t really see anything.

Has anyone else experienced any issues with the L009UiGS needing to be rebooted, or performance degrading after months of uptime? In this particular case, the router had been up and running since September last year without a reboot. Obviously there are newer firmware/router os versions now which I will upgrade them both too, but I have to do these sparingly as there is a high availability cluster running behind these which is accessed 24/7, so if there isn’t a security related reason to update the firmware, it wont be done.

Routers were running router os version 7.15.3 when this happened.