Understandable and I hear you. Unfortunately even those incredibly expensive paid tools don’t really do the job properly. And i’m not actually interested in a full routing topology overview, just simple point-to-point link states
Let me explain with a practical (and very real world for me) example of a simple yet scalable topology involving just 2 routers. And explaining why most tools utterly fail at this and where it could be done much better
Lets start with RouterA and RouterB which connect together - but not directly. In between them are a pair of LHG 60G radios connected to Ether1. These are invisible to both routers at Layer3+, but are there at Layer1 and Layer2. Now having a program that can map out a Layer2 network and show those devices being present is very useful, but i’m mostly interested in network topologies at Layer1 and Layer4
The problem we discover is the link goes down in rain, but of course none of the Ethernet links do, they are in the Up and Running state. It’s the 60ghz wireless interface (invisible to both RouterA and RouterB) that goes down. More crucially they can actually be flapping, or it might even be Up but its so bad that effectively no traffic is going across the link. Here lies the biggest problem with most network monitoring software - they concern themselves with the state of the Interface, not whether or not its actually passing traffic or a viable path. Ergo they don’t report this link as being down or degraded in any way since as far as RouterA/B are concerned, they have an active Ethernet interface so happy days
This comes back to ideally monitoring the OSPF/BGP neighbors. It’s not the only way and a cheaper/easier implementation would be to use CDP/LLDP/MNDP and see if a neighbor merely exists on an interface. It’s not as fast or accurate but if there is an interruption to data flow then a neighbor will eventually timeout and disappear
Now expanding a bit more, we got sick of the weather killing the link so we install a 5ghz backup radio between RouterA and RouterB on Ether2 that never goes down in the rain but its nowhere near as fast. Herein lies the 2nd problem with most software. They either don’t acknowledge direct backup links, or they simply can’t draw it. What i’d like to see on a map is 2 discrete lines between RouterA and RouterB, each corresponding to their relevant interfaces. And some people might think well does it really matter as long as they are still technically up and working? Yes it does because we have our 3rd development in how this network is evolving:
An obstruction happens (building in the way, tree grows, mast falls over, whatever), meaning the 60ghz radio is now permanently interrupted and does not link, yet the 5ghz is still able to. We have a permanently degraded connection between routers operating at ~150mbps 24/7 instead of operating at ~1gbps 99% of the time. Nobody has been aware of it because the link is still ‘Up’ due to the 5ghz maintaining an uninterrupted connection
So what i’d like to see is a program that can dynamically build and show adjacencies between routers and display all of the links between them (in this case Ether1 and Ether2). I’d be happy with a piece of software that will automatically display a link if a neighbor ever pops on an interface, and if it disappears it still remembers that a neighbor ‘should’ be there on that interface but isn’t. Thus draws the link as red/broken rather than automatically removing it, so its a very clear visual indicator that something is wrong. Allowing me to then go and manually delete that link if it was either drawn in error, interfaces have changed or the link has been decommissioned
In an ideal world i’d like to see an industry supported extension to OSPF/IS-IS that can utilize transit metrics and not just interface state (i.e. bandwidth, latency, packet loss) and it can steer traffic dynamically through an entire network in the best possible way rather than just relying on fixed costs. Until such a pipe dream ever happens i’m still reliant on manually intervening and having to periodically look over the network and make sure we don’t have any heavily degraded or failed links. Anything that can make that job easier is a very welcome addition to the tools arsenal