We run a network of MikroTik devices serving some of our clients. There are a few legacy clients connected with Alvarion devices to an AU a few hops away from us, and until this weekend we had been using an EoIP tunnel to bridge the traffic from their AU to our local network here. Everything worked fine in terms of forwarding the traffic, including full-sized Ethernet frames from one side to the other.
After a minor upgrade (nothing software based, just added redundant power), the Alvarion network has suddenly been hamstrung by an MTU of 1458 (pinging with anything larger times out). We have checked every connection between the two points: all Ethernet, bridge and wireless interfaces have an MTU of 1500, as does the EoIP tunnel. When we ping between adjacent points, or even across multiple hops, there is no problem with 1500 bytes. A 1500 byte ping between the two tunnel endpoints (10.10.1.1 and 10.11.1.2) works perfectly. When you try to ping a machine connected on the remote side of the tunnel, the 1458 MTU limit kicks in. It also happens between the secondary addresses added to the EoIP interfaces (the ones the LANs connect through).
The network looks like this:
(EoIP 192.168.252.45) 10.10.1.1 <-wireless link-> 10.10.1.2 [router] 10.11.1.1 <-wireless link-> 10.11.1.2 [router] (EoIP 192.168.252.46) <-ethernet cable-> (switch) 192.168.252.128/25
The EoIP tunnel is established between 10.10.1.1 and 10.11.1.2, which ping each other just fine at 1500 bytes, as do any pings that bypass the EoIP tunnel.
192.168.252.46 — 1500 byte ping — 192.168.252.128/25 WORKS
192.168.252.45 — 1500 byte ping — 192.168.252.46 FAILS (maximum size that passes is 1458)
192.168.252.45 — 1500 byte ping — 192.168.252.128/25 FAILS (maximum size that passes is 1458)
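For anyone wanting to reproduce the "maximum size that passes" figure without stepping through sizes by hand, here is a minimal sketch of the search. It assumes that if a given size passes then every smaller size passes too. The `probe` callable and the `simulated_probe` stand-in are hypothetical names for illustration; a real probe would send a don't-fragment ping of the given size and report whether a reply came back.

```python
from typing import Callable


def largest_passing_size(probe: Callable[[int], bool],
                         low: int = 28, high: int = 1500) -> int:
    """Binary-search the largest size in [low, high] for which probe(size) is True.

    Assumes passing is monotonic: if a size passes, all smaller sizes pass.
    Returns low - 1 when even the smallest size fails.
    """
    best = low - 1
    while low <= high:
        mid = (low + high) // 2
        if probe(mid):
            best = mid       # mid passes, so look for something larger
            low = mid + 1
        else:
            high = mid - 1   # mid fails, so the limit is below it
    return best


def simulated_probe(size: int) -> bool:
    """Stand-in that mimics the behaviour seen through the tunnel."""
    return size <= 1458


print(largest_passing_size(simulated_probe))
```

With the simulated probe this needs about a dozen probes instead of one per size.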
This worked just fine before the weekend; in fact, it has been up and running for 7 or 8 months.
The only other minor change made was the addition of a PPPoE server on an unrelated interface, so that a technician could connect a VoIP telephone and call to confirm the installation. The server has since been removed. We even restarted the devices to ensure that, if something had gone wrong, they were reset to the configuration without the PPPoE server.
Does anyone have any idea why something like this is happening?
AFAIK the overhead for a GRE encapsulated packet is only 28 bytes.
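A quick check of the arithmetic, offered as a sketch rather than a diagnosis. It assumes EoIP carries the whole inner Ethernet frame, so the overhead is a 20-byte outer IP header plus an 8-byte GRE header plus the 14-byte inner Ethernet header (the figures MikroTik's documentation gives for EoIP). The constant and function names below are only illustrative.

```python
# Assumed per-packet overhead components for EoIP, in bytes.
OUTER_IP_HEADER = 20        # outer IPv4 header without options
GRE_HEADER = 8              # GRE header as used by EoIP
INNER_ETHERNET_HEADER = 14  # header of the bridged Ethernet frame


def eoip_overhead() -> int:
    """Total bytes added around each inner IP packet."""
    return OUTER_IP_HEADER + GRE_HEADER + INNER_ETHERNET_HEADER


def max_inner_packet(path_mtu: int = 1500) -> int:
    """Largest inner IP packet that fits in one unfragmented outer packet."""
    return path_mtu - eoip_overhead()


print(eoip_overhead())         # 42
print(max_inner_packet(1500))  # 1458
```

Under those assumptions the overhead is 42 bytes rather than 28, and 1500 - 42 lands exactly on the 1458 limit we are seeing. That would be consistent with the encapsulated packets no longer being fragmented along the 1500-byte path, though that is only a guess.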