BGP Crashes ROS 3.28 with routing and routing-test

Hello,

My BGP peers keep crashing (3 full routing tables). This happens with both the routing-test and routing packages, and has since ROS 3.7; I’m currently using 3.28.
Is MK looking into a resolution for this? Is any work under way?
Are there particular hardware issues with x86? I use hardware from the approved list.
Could someone answer whether MK is looking for a solution to the problem? (This one is for Normis.)
I’ve been having crashing problems with BGP since ROS 3.7.

Thanks,

Sotiris

Just wondering: why do you need 3 full routing tables?

This happens with a single peer too; I tested that.

Is the hardware crashing, or are you just dropping the peer sessions? What exactly happens, and how far into it? A little more info should help us troubleshoot.

The peers crash. They go from established to idle and the entire routing table is lost. Everything then has to start all over again, downloading all the routes.
When I log into the box and try to view the peers, the print takes a considerable amount of time to display.
This happens once every couple of days.

Thanks,
Sotiris

Sotiris,

What Ethernet cards are in your setup?

Are you using Marvell PCI-E cards, maybe?

Hello exe,

No, I’m using an RB44 gigabit at 1Gbps. I don’t know what other information you might need. I’ve been struggling with this for some time now.


Thanks,

Sotiris

Check your memory modules with memtestx. Make sure that you don’t have any IRQs that aren’t being used, i.e. disable all peripherals in the BIOS that you are not using. Try turning off multi-cpu. How much memory do you have and how much is free?

Post a /system resource pci print

Also, does the log have anything about it? I wonder if it’s the remote side terminating the session, or tcp/179 packets getting blocked, etc. Is the hold timer just expiring, or is it an abrupt close?
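
To rule out tcp/179 being blocked somewhere on the path, a quick reachability check from another machine can help. This is only a minimal sketch: the function name `tcp_port_open` and its defaults are made up for illustration, and a refused connection is not conclusive on its own, since a BGP speaker may reject connections from hosts that are not configured as peers.

```python
import socket


def tcp_port_open(host: str, port: int = 179, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    Hypothetical helper: it only tests that the TCP handshake completes,
    it does not speak BGP or send an OPEN message.
    """
    try:
        # create_connection performs the TCP handshake and raises OSError
        # (including timeouts and refusals) if it cannot connect.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `tcp_port_open("<peer address>")` from a host that the peer accepts connections from shows whether the port is reachable at all; it says nothing about why an established session later drops.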

Hello guys,

It is an abrupt close, because all the peers go down at the same time. I have 2GB of memory in the system and 600MB is used. Anything that can be disabled in the BIOS and is not in use has been disabled. The IRQs look fine; they all correspond to peripherals I have on the server (i.e. Ethernet ports).

The only thing I haven’t done so far is disable multi-cpu.
In the logs I see that the connection has terminated, and the system then waits for the BGP peers to get re-established. The firewall allows TCP and UDP connections on port 179, so that shouldn’t be the problem.

Thanks,

Sotiris
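
The "all peers go down at the same time" observation can be checked against the logs by grouping the session-down times. The sketch below assumes you have already extracted (peer name, timestamp) pairs by hand or with your own parser; the peer names, the 5-second window, and the function name are all hypothetical, and no RouterOS log format is assumed.

```python
from datetime import datetime, timedelta


def group_simultaneous_drops(events, window_seconds=5):
    """Group session-down events that fall within window_seconds of the
    first event in each group.

    events: iterable of (peer_name, datetime) pairs (hypothetical input).
    Returns a list of groups, each a list of peer names in time order.
    """
    window = timedelta(seconds=window_seconds)
    groups = []
    current = []
    group_start = None
    for peer, ts in sorted(events, key=lambda e: e[1]):
        if group_start is not None and ts - group_start <= window:
            current.append(peer)
        else:
            if current:
                groups.append(current)
            current = [peer]
            group_start = ts
    if current:
        groups.append(current)
    return groups


# Made-up example data: three peers dropping within two seconds,
# then a single drop an hour later.
t0 = datetime(2009, 1, 1, 12, 0, 0)
example = [
    ("peer-a", t0),
    ("peer-b", t0 + timedelta(seconds=1)),
    ("peer-c", t0 + timedelta(seconds=2)),
    ("peer-a", t0 + timedelta(hours=1)),
]
print(group_simultaneous_drops(example))
```

A group containing every configured peer points to something local to the router (such as the routing process itself) rather than to one remote side closing its session.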

v3.28 and earlier versions had a route redistribution bug. Send a supout file to support at mikrotik.com.

What kind of bug? Could you please be more precise?

Route redistribution crash. It shows up especially if there are multiple BGP peers.

v3.28 is the current stable version. Would you suggest using 4.0 devel if I plan to deploy on a similar architecture?

I’m about to switch all my routers to Mikrotik, so I don’t want to run into the same problem :)

synologic, are you using BGP full view? If not, then 3.28 is good enough, I believe. Or you may want to wait for 3.29.

Yes, I will have 2 global BGP tables and 3 local peering tables of about 6k routes each. Any idea when 3.29 is going to be released?

Guys, are you Tier 1s? =) Why do you need full view? %-)

MT has fixed the bug in 3.30.
I am testing it and it seems more stable: no crashes after 3 days of operation.

Regards,
Ros

Hopefully 3.30 will be stable enough for BGP.

Sotiris

rpingar, how many active BGP peers do you have on your testing platform, and how many route entries?


Sotiris