Hi,
I'm wondering if anyone has experienced issues similar to this with 2.4G wifi networking. As summary of the issue is that after an initial reset and clean start with the attached config, 2.4G wifi works. However, after a time (or maybe some other event that is not-obvious), the 2.4G wifi stops. 5G and ethernet are unaffected.
It also appears to be a l3 issue. I also seems to have started since an upgrade to v7.19.4. I've not tried downgrading or the test branch yet.
I've spent a few days trying to debug this, so I hope someone has an idea.
Setup
- Mikrotik Chateau series, v7.19.4
- 4 bridges, with one ethernet and 2 wifi interfaces bound to each (see config).
- ether1 is WAN
What’s working
- After a reset, everything. Then after a time, the 2.4G wifi stops working.
What’s not working
- 2.4G wifi, after a time.
Debugging and fault finding
Starting with a fresh restart with the attached config. everything works. After a time, when the 2.4G wifi (wifi2 interface) stops working, this is what I've done. I had a separate test machine which I conducted tests from, which is an up-to-date OSX.
After each test below, I rebooted the router, and switched the client wifi off and on again.
- Confirm no issue with the 5G network
% ping -n -W 5000 192.168.0.1
PING 192.168.0.1 (192.168.0.1): 56 data bytes
64 bytes from 192.168.0.1: icmp_seq=0 ttl=64 time=5.734 ms
64 bytes from 192.168.0.1: icmp_seq=1 ttl=64 time=4.023 ms
64 bytes from 192.168.0.1: icmp_seq=2 ttl=64 time=5.114 ms
64 bytes from 192.168.0.1: icmp_seq=3 ttl=64 time=3.196 ms
64 bytes from 192.168.0.1: icmp_seq=4 ttl=64 time=4.778 ms
64 bytes from 192.168.0.1: icmp_seq=5 ttl=64 time=6.309 ms
All okay so far.
- Disable the 5G networks, forcing all devices to the 2.4G network
/interface/wifi/set wifi1 disabled=yes
This will disable all secondary interfaces as well. The client now reports a 5Mbps TX rate on the 2.4G network.
% ping -n -i 6 -W 5000 192.168.0.1
PING 192.168.0.1 (192.168.0.1): 56 data bytes
Request timeout for icmp_seq 0
Request timeout for icmp_seq 1
Request timeout for icmp_seq 2
64 bytes from 192.168.0.1: icmp_seq=3 ttl=64 time=221.404 ms
<snip>
52 packets transmitted, 2 packets received, 96.2% packet loss
round-trip min/avg/max/stddev = 221.404/2402.196/4582.988/2180.792 ms
While not a complete outage, it is very slow.
- Removed all interfaces from bridges, except for wifi2 (2G wifi primary interface)
No change in the ping statistics. However, as I was watching with a tcpdump, I noticed that l2 traffic was seemingly okay. So I tried an arp ping.
arping -c 10 192.168.0.1
ARPING 192.168.0.1
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=0 time=54.049 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=1 time=6.130 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=2 time=134.100 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=3 time=60.035 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=4 time=50.867 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=5 time=93.288 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=6 time=7.524 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=7 time=6.767 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=8 time=70.833 msec
42 bytes from f6:1e:57:6e:cd:78 (192.168.0.1): index=9 time=46.205 msec
--- 192.168.0.1 statistics ---
10 packets transmitted, 10 packets received, 0% unanswered (0 extra)
rtt min/avg/max/std-dev = 6.130/52.980/134.100/38.758 ms
A bit slower than expected, but layer 2 traffic seems to be good, indicating a connection issue between the wifi interface and the bridge.... maybe.
- Removed all virtual wifi interfaces for 5g interfaces
/interface/wifi/remove [ /interface/wifi/find where master-interface=wifi1 ]
From here on, I've put a summary of the results, I hope that it suffices.
- L2 ping okay
- L3 ping not okay
- Disabled all virtual interfaces
/interface/wifi/set [ /interface/wifi/find where master-interface=wifi2 ] disabled=yes
- L2 ping okay
- L3 ping not okay
- Removed all virtual wifi interfaces for 2.4G interfaces
/interface/wifi/remove [ /interface/wifi/find where master-interface=wifi2 ]
- L2 ping okay
- L3 ping not okay
- Reset the 5G network
/interface/wifi/reset wifi1
- L2 ping okay
- L3 ping not okay
- Removed all /interface/wifi configs except for those effecting wifi2
- L2 ping okay
- L3 ping not okay
- Removed the wifi1 datapath, channel settings, security.ft settings
- L2 ping okay
- L3 ping not okay
- Hard power off
- L2 ping okay
- L3 ping not okay
- Removed country settings from wifi1
- L2 ping okay
- L3 ping not okay
- Removed all bridges except bridge 0
- L2 ping okay
- L3 ping not okay
At this point, you'll see from my config that I effectively locked myself out of the router. So I did a full reset, and setup from the config file
- Full reset, then quickly turned off wifi1 (the 5G network) to force all clients to 2.4G
/system reset-configuration no-defaults=yes keep-users=no skip-backup=yes run-after-reset=myconfig.rsc
...
/interface/set wifi1 disabled=yes
- L2 ping okay
- L3 ping okay
So, working with the default config. The client also shows a 130Mbps TX rate. However, it's a matter of time until this happens again.
current.rsc (24.8 KB)