I am getting a boot loop on my switch every few weeks. I have no idea why this is happening. Either the switch reboots on its on, or when I reboot it, the switch will go into an infinite boot loop. Looks like it is crashing when loading kernel. I have to netboot and restore the image and backup to get the switch working again. I am then good for a few weeks until I have to do the process all over again. I have not been able to get the boot msgs from the console yet. I am working on that. Could anyone give me any insight on why this might be happening somewhat randomly. Let me know what information I could provide that may be helpful. Should I try formating the flash storage? Could the storage have some sort of corruption? Should I consider just replacing the switch.
I formatted the NAND (Flash Memory) and re-loaded RouterOS. After less than a week I started to notice some slowdown on the internet performance. This seems to indicate the start of the potential corruption of the OS where the boot loop with start. I rebooted the router to check to see if the OS was OK, and to fix the performance issue. Luckily the boot loop did not occur. I did however notice that the bad blocks on my flash has increased from 4% to 4.2% in less that a week. Could this be the culprit to my issues? See the attached screen cap.
It happened again. I woke up this morning, and the router was beeping every 30 seconds. It was in a boot loop again. It seems this is happening every month or so.
Here are the steps I took to get it running again.
Did a netinstall of 6.43.8
Restored my latest backup
Everything seemed to be working at that point.
I was not thinking, and forgot to check the console output when it was boot looping. I think I just wanted to get it back up. I will try and remember next time. One odd thing to note, is that after the netinstall I checked the resources, and bad blocks are at 0.0%. Before it crashed bad blocks were at 6.0%. I would figure once a block is marked bad it would stay that way.
Any insight into what is going on would really be appreciated. I would like to know if this is hardware/software or configuration.