Hello,
I have an Mikrotik infrastructure with a main router RB1100AHx4 (RouterOS 6.49.13), an hEX secondary router (RouterOS 7.17rc3).
The main router acts as a router, firewall and a CapsMan Controller.
The secondary router acts as a CapsMan2 Controller, for tests and migration from Router 6 to RouterOS 7 and wireless to wifi-qcom-ac.
Downstream I have about 10 switches CRS354-48G-4S+2Q+RM with MSTP configured, 14 cAP ac and two hAPax^2.
The entire network is separated with VLANS and each customer has it’s own VLAN and SSID.
All devices are spread on multiple floors, some adjacent, two of them are not.
When using the wireless driver and Capsman, I made sure that the OFDM rates were configured like below:
/caps-man rates
add basic=24Mbps name=OFDM supported=24Mbps,36Mbps,48Mbps,54Mbps
This made roaming near perfect by using the signal quality as a promoter/deterrent of sticky clients or clients that would connect to a far away AP.
This also forces clients whenever a radar is detected on a DFS channel to roam to a nearby 2.4GHz or 5GHz radio with minimal disruption, even though there is no Fast Roaming or 802.11r/k/v.
On another floor where I have a similar solution, but a separated setup, that I migrated to wifi-qcom-ac and CapsMan2 with the following features configured:
WNM&RRM enabled (also the defaults)
FT and ft-over-ds, unique ft-mobility-domain per SSID and a ft-r0-key-lifetime of 12 hours (Windows 10 default for WPA2-Enterprise)
Neighbor groups for each SSID
WPA2-Enterprise with CCMP encryption and group encryption (default settings).
Tx Power limit at 20dBm (including antenna gain) for 2.4GHz and 23dBm (including antenna-gain) for 5GHz. I tried lower Tx power but maximum speed could not be reached and so Wi-Fi cell capacity would be negatively impacted. I couldn’t convince the building owners to double the number of APs per floor, this way I could of used a much lower Tx power per radio.
I didn’t use WPA3 because I wanted to roll back with minimum customer disruption.
There are two Wi-Fi networks, one is using WPA2-Enterprise, the other WPA2-Personal configured in the same way except the authentication.
Everything is working perfectly, both Android and Windows 10 devices roam from one AP to another on either the WPA2-Enterprise or WPA2-Personal network.
Seeing that everything is OK, I migrated the rest of the building APs from wireless to wifi-qcom-ac and enrolled them into CapsMan2 with a very similar setup like the previous described, the major difference being is that the building uses only WPA2-Personal (at least for now).
Also, each floor has it’s neighbor group for every SSID so that the clients would have AP neighbors recommendations within the floor and not adjacent floors.
Soon I started having issues similar with Capsman, before limiting the caps-man rates. Some Windows 10 devices would completely avoid close APs and connect to APs with a very low signal, like -88dBm - -90dBm, of course at this signal the connection is unstable and after 20-30s the client would roam to another AP, also far away or low signal (like an AP from the upper or lower floor), eventually remaining connected, even though the connection would be slow. After a while it would roam to a closer AP where it would stay for about 30-40 minutes (sometimes for hours) and then
start the whole roaming all over again, distrupting the user’s work.
It is possible that WNM has the disassociation imminent bit (but I couldn’t verify), triggered by high air time use of the 2.4GHz or 5GHz radio, or some other reason.
The same problematic clients would sometime connect to 2.4Ghz and then disconnect, only to connect to 2.4GHz again, manually removing the connection from the controller would make them switch to 5GHz immediately but not always.
Not all clients would behave this way, some of them would remain connected to the same AP that some other clients would actively avoid it for no apparent reason. I’ve seen that mostly Intel AX adapters (2xx series) seem to have this issue but only on WPA2-Personal.
It is as if the neighbor groups are completely ignored and some clients make bogus roaming decisions that make no objective sense. Usually the problematic clients have at least another close AP on both 2.4GHz and 5GHz radios so there are better roaming candidates.
That’s way is PARAMOUNT that higher (at least 24Mbps) minimum bitrates must be used and configurable in the wifi-qcom-ac driver, the DSSS and low OFDM modulations must be manually disabled, since they can pass thru walls and even between floors with enough integrity that they are decoded and used to connect to sub-optimal APs.
I really like the benefits added by wifi-qcom-ac but without more fine grained settings it’s hard to use it a multi AP setup without WPA2-Enterprise or WPA3-Enterprise.
I understand, roaming is a client decision but signal quality can be used to force a client to connect to the closest AP regardless of 802.11 roaming extensions.
I’m willing to collaborate with Mikrotik in order so solve this issues, until then I had to rollback to the wireless and CapsMan, there were just too many bad roaming decisions that even access lists could not solve, actually made them worse in some situations.