Community discussions

MikroTik App
 
gtb
just joined
Topic Author
Posts: 6
Joined: Tue Jan 19, 2016 8:44 am

RB4011 gradually stops accepting traffic on LAN Gateway bridge

Mon May 13, 2024 10:44 pm

We have solved this issue after a 15 hour day by 2 engineers. Fix took 5 minutes, so just in case it helps someone else. Fix was to change the IP Address of the affected Bridge then change Gateway settings on affected servers to match the new Gateway IP.

SITUATION
  • RB4011 with several LAN bridges
  • Firewall rules for Input Chain and Forward Chain both end with a Drop All. All Dropped traffic is logged.
  • Mix of IP \Firewall \ Address Lists and Interface \ Lists are used for refining various (simple) IP Firewall Filter and IP Firewall NAT rules
  • Been working fine for years.
SYMPTOMS
  • Saturday night 1030pm, approx at peak of massive Solar Storm, RDS (Remote Desktop Server), one of 3 Virtual servers on the LAN, stops reporting in to our monitoring software
  • no worries, not in use over weekend.
  • Sunday morning, small investigation finds the server is up and running, just not responding on internet
  • restart of affected server resolves issue, with moderate testing of all basic functions all running clean
  • Monday morning 7am staff arrive and cannot sign in to RDS, ven though all servers are all running in the VMHost, and all are showing up online to our monitoring system
INVESTIGATION
  • None of the traffic in question ever showed up in RB4011 logs so we repeatedly concluded "its not the Mikrotik blocking traffic".
  • initial investigation showed the RDS and other servers could SOMETIMES ping google.com but at same time COULD NOT PING 8.8.8.8
  • after a while, none of the Virtual servers could ping each other, or the gateway, or any IP or URL upstream of the gateway
  • MANY fixes tried, in Windows and ESET firewalls, NLAsvc configuration, Network Adapter configurations, etc;
  • Found from VM BB8-DC at 192.168.0.17 can ping Host VMswitch 192.168.0.250 but cannot ping Mikrotik Gateway at 192.168.0.1. Somehow Mikrotik, or something on the VMHOST blocking traffic to 192.168.0.1. Tried but could not find what was blocking the traffic.
  • moved one of the VMs to another VMHOST, exactly same issue.
FIXED BY
  • Fixed by crazy "jiggle it" fix: change Mikrotik SERVER-VMS-bridge name back to old name LANbridge, and address from 192.168.0.1 to 192.168.0.254.
QUESTIONS ARISING
  • wtf?? Any ideas anyone why on earth that fix worked?
  • anyone doing a PhD in the whole class of "jiggle it" fixes in IT?
Last edited by gtb on Mon May 13, 2024 11:05 pm, edited 1 time in total.
 
User avatar
anav
Forum Guru
Forum Guru
Posts: 20818
Joined: Sun Feb 18, 2018 11:28 pm
Location: Nova Scotia, Canada
Contact:

Re: RB4011 gradually stops accepting traffic on LAN Gateway bridge

Mon May 13, 2024 11:01 pm

Do you jiggle up and down or back and forth?
 
gtb
just joined
Topic Author
Posts: 6
Joined: Tue Jan 19, 2016 8:44 am

Re: RB4011 gradually stops accepting traffic on LAN Gateway bridge

Tue May 14, 2024 4:46 am

Mate we jiggled everything, every direction!
All running nice and fast and stable today - we were dreading it might return since we don't know what triggered the issue.
 
jaclaz
Forum Guru
Forum Guru
Posts: 1468
Joined: Tue Oct 03, 2023 4:21 pm

Re: RB4011 gradually stops accepting traffic on LAN Gateway bridge

Tue May 14, 2024 12:21 pm

Mate we jiggled everything, every direction!
All running nice and fast and stable today - we were dreading it might return since we don't know what triggered the issue.
I thought you were attributing it to the solar storm:
Saturday night 1030pm, approx at peak of massive Solar Storm, RDS (Remote Desktop Server), one of 3 Virtual servers on the LAN, stops reporting in to our monitoring software
maybe it was just a coincidence.

After you have "jiggled" and restored connection, what happens if you change back to the previous bridge name and previous IP (changing one setting at the time)?

I mean, is the specific IP address the issue, or the bridge name or - while "jiggling" you did *something else* that may have reset *some other setting* ?

Who is online

Users browsing this forum: Bing [Bot] and 36 guests