[ULTRAMONKEY-USERS] LVS + Ldirectord causing trouble
Heiko Neuhaus
webmaster at euros4click.de
Mon Oct 22 18:32:23 EST 2007
Hi Mailing-list :-)
I have a problem that I've been struggling with for quite some time:
I'm running 2 directors on FC7_64 (Intel Q6600 quad-core and 4 GB RAM each). I use ldirectord to check for service uptime (httpd). In case of a total failure of the director, the backup director takes its place correctly. However, in my real-world downtimes the first server often becomes slow and unresponsive (= ssh login not working, webserver hardly working), but heartbeat still seems to send its pings and considers itself alive, even though it's not! "ipvsadm -Ln" still shows the correct routing table, but it's just not forwarding anything anymore. I tried reducing the deadtime to crazy low values (1-3 sec), which led to a split-brain and didn't solve the problem. It seems that the pings always arrive in time, so a shorter deadtime can't be the way to detect this.
So basically my question is: Is there a way to detect such a situation? I was thinking about writing a script that checks whether ssh login is working and stops the heartbeat service in case it's not. But I strongly suspect that there must be a more professional approach to this problem?
Thanks a lot & bw,
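To make the script idea concrete, here is a rough sketch of the kind of watchdog I have in mind. It is only an illustration and not tested on the directors: it checks whether the local sshd still sends its version banner in time (a cheap stand-in for a full ssh login) and stops heartbeat after a few consecutive misses. The timeout, the failure threshold, the check interval and the init script path /etc/init.d/heartbeat are all assumptions of mine, and the function names are made up for this example.

```python
#!/usr/bin/env python3
"""Local watchdog sketch: stop heartbeat when this director stops answering."""
import socket
import subprocess
import time

CHECK_HOST = "127.0.0.1"   # assumption: probe the local sshd
CHECK_PORT = 22
TIMEOUT_S = 3.0            # assumption: max time a healthy sshd needs to greet
MAX_FAILURES = 3           # assumption: consecutive misses before giving up
INTERVAL_S = 5.0           # assumption: pause between checks
STOP_CMD = ["/etc/init.d/heartbeat", "stop"]  # assumption: init script path


def ssh_banner_ok(host, port, timeout):
    """Return True if an SSH server sends its version banner within `timeout`."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.settimeout(timeout)
            return sock.recv(64).startswith(b"SSH-")
    except OSError:
        # Covers refused connections as well as connect/read timeouts.
        return False


def next_failure_count(current, healthy):
    """Reset the counter on success, otherwise count one more failure."""
    return 0 if healthy else current + 1


def main():
    failures = 0
    while True:
        healthy = ssh_banner_ok(CHECK_HOST, CHECK_PORT, TIMEOUT_S)
        failures = next_failure_count(failures, healthy)
        if failures >= MAX_FAILURES:
            # Stopping heartbeat should let the backup director take over.
            subprocess.call(STOP_CMD)
            break
        time.sleep(INTERVAL_S)


if __name__ == "__main__":
    main()
```

Requiring several misses in a row is meant to avoid a failover on a single slow answer. The obvious weakness is that the script runs on the same sick machine it is supposed to judge, which is part of why I suspect there is a better way.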
Heiko Neuhaus