x86 failure with strange/unknown error, please help

Just wondering if anyone has seen this. One of our x86 routers failed with the following error repeating on the console screen:

“tty _release_dev: pts14: read/write wait queue active!”

When this happened the router was not pingable nor responsive in any way.. funny thing was that the backup router never came on line due to vrrps on it not going from backup to master. So the failed router must have been communicating something to hold the vrrp interfaces as master. Once we unplugged the router having issues the vrrp’s all swtiched over to the backup router and everything came up.

When we plugged in the primary router again it booted properly and all the vrrps and routes transfered back to it and it’s been running fine ever since. Before the server had been running solidly for 2 years since we first installed it (other then trying to find a version of ROS that didn’t have an issue with our Gig Intel network cards). There’s nothing in the logs nor is there any indication that there was a problem except in our monitoring software there was a spike (80-90%, they usually run around 2-3%) of all 16 cpu’s on the machine just before going offline. As I’ve said the only thing I have to go by for diagnostics at this time is while it wasn’t responding the error “tty _release_dev: pts14: read/write wait queue active!” was repeating on the screen when I plugged in a monitor and keyboard and I dont know for the life of me what this error means… any help would be thankfully received

Thanks again
Jonathan

When your x86 ROS router quits working …

If you can … Try the console or serial port. Can you still login via console or serial port ?
If you are able to login, then can you ping your internal loopback IP address of 127.0.0.1 ?
If your x86 ROS can not ping 127.0.0.1 - then you may be experiencing the same problem I am with my x86 ROS public speedtest 207.32.195.2 btest server. The only solution is to reset or reboot.

Is your x86 ROS running on a PC computer type motherboard ? -or- Is your x86 ROS running as a guest hosted virtual server like VMware ESXi or some other type of hypervisor ?

I have a question for you - when your x86 ROS is running normally, what do you get when you perform a UDP send btest to 127.0.0.1 ?

North Idaho Tom Jones

No I couldn’t login via serial or Console. The only thing showing on the console screen is the error “tty _release_dev: pts14: read/write wait queue active!” Keyboard was unresponsive