Need help to run UM3

Lin Wang lwang at downstate.edu
Fri Oct 28 00:32:55 EST 2005



I am not a Linux expert, but I managed to keep UM2.01 working on rh3 for two
years.  I recently upgraded to UM3 with the rh kernel and ran into trouble.
Basically, HA dies on startup and reports the error "Cannot locate resource
script dmccm1".  I reinstalled everything and got the same result.  I must
have missed something and need help.  I am using advanced high
availability and load balancing and have two Linux directors, dmccm1 and
dmccm2.  The following is shown in the log on dmccm1, which is the primary:

heartbeat: 2005/10/25_17:53:11 info: Configuration validated. Starting
heartbeat 1.2.3.cvs.20050404
heartbeat: 2005/10/25_17:53:11 info: heartbeat: version 1.2.3.cvs.20050404
heartbeat: 2005/10/25_17:53:12 info: Heartbeat generation: 2
heartbeat: 2005/10/25_17:53:12 info: UDP Broadcast heartbeat started on
port 694 (694) interface eth1
heartbeat: 2005/10/25_17:53:12 info: pid 10665 locked in memory.
heartbeat: 2005/10/25_17:53:12 info: Local status now set to: 'up'
heartbeat: 2005/10/25_17:53:13 info: pid 10668 locked in memory.
heartbeat: 2005/10/25_17:53:13 info: pid 10669 locked in memory.
heartbeat: 2005/10/25_17:53:13 info: pid 10670 locked in memory.
heartbeat: 2005/10/25_17:53:13 info: Link dmccm1:eth1 up.
heartbeat: 2005/10/25_17:54:12 WARN: node dmccm2: is dead
heartbeat: 2005/10/25_17:54:12 info: Local status now set to: 'active'
heartbeat: 2005/10/25_17:54:12 WARN: No STONITH device configured.
heartbeat: 2005/10/25_17:54:12 WARN: Shared disks are not protected.
heartbeat: 2005/10/25_17:54:12 info: Resources being acquired from dmccm2.
heartbeat: 2005/10/25_17:54:12 info: Running /etc/ha.d/rc.d/status status
heartbeat: 2005/10/25_17:54:12 info: /usr/lib/heartbeat/mach_down:
nice_failback: foreign resources acquired
heartbeat: 2005/10/25_17:54:12 info: mach_down takeover complete.
heartbeat: 2005/10/25_17:54:12 info: Initial resource acquisition complete
(mach_down)
heartbeat: 2005/10/25_17:54:12 info: mach_down takeover complete for node
dmccm2.
heartbeat: 2005/10/25_17:54:15 info: Local Resource acquisition completed.
heartbeat: 2005/10/25_17:54:15 info: Running /etc/ha.d/rc.d/ip-request-resp
ip-request-resp
heartbeat: 2005/10/25_17:54:15 received ip-request-resp
ldirectord::ldirectord.cf OK yes
heartbeat: 2005/10/25_17:54:15 info: Acquiring resource group: dmccm1
ldirectord::ldirectord.cf LVSSyncDaemonSwap::master
IPaddr2::x.x.x.25/23/eth0/x.x.x.255 dmccm1 ldirectord::ldirectord.cf
LVSSyncDaemonSwap::master IPaddr2::x.x.x.51/23/eth0/x.x.x.255 dmccm1
ldirectord::ldirectord.cf LVSSyncDaemonSwap::master
IPaddr2::x.x.x.52/23/eth0/x.x.x.255 dmccm1 ldirectord::ldirectord.cf
LVSSyncDaemonSwap::master IPaddr2::x.x.x.53/23/eth0/x.x.x.255
heartbeat: 2005/10/25_17:54:15 info: Running
/etc/ha.d/resource.d/ldirectord ldirectord.cf start
heartbeat: 2005/10/25_17:54:16 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master start
heartbeat: 2005/10/25_17:54:16 info: ipvs_syncbackup down
heartbeat: 2005/10/25_17:54:16 info: ipvs_syncmaster up
heartbeat: 2005/10/25_17:54:16 info: ipvs_syncmaster obtained
heartbeat: 2005/10/25_17:54:16 info: Running /etc/ha.d/resource.d/IPaddr2
x.x.x.25/23/eth0/x.x.x.255 start
heartbeat: 2005/10/25_17:54:16 info: /sbin/ip -f inet addr add x.x.x.25/23
brd x.x.x.255 dev eth0
heartbeat: 2005/10/25_17:54:16 info: /sbin/ip link set eth0 up
heartbeat: 2005/10/25_17:54:16 /usr/lib/heartbeat/send_arp -i 200 -r 5 -p
/var/lib/heartbeat/rsctmp/send_arp/send_arp-x.x.x.25 eth0 x.x.x.25 auto
x.x.x.25 ffffffffffff
heartbeat: 2005/10/25_17:54:17 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:17 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:17 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:17 CRIT: Giving up resources due to failure of
dmccm1
heartbeat: 2005/10/25_17:54:17 info: Releasing resource group: dmccm1
ldirectord::ldirectord.cf LVSSyncDaemonSwap::master
IPaddr2::x.x.x.25/23/eth0/x.x.x.255 dmccm1 ldirectord::ldirectord.cf
LVSSyncDaemonSwap::master IPaddr2::x.x.x.51/23/eth0/x.x.x.255 dmccm1
ldirectord::ldirectord.cf LVSSyncDaemonSwap::master
IPaddr2::x.x.x.52/23/eth0/x.x.x.255 dmccm1 ldirectord::ldirectord.cf
LVSSyncDaemonSwap::master IPaddr2::x.x.x.53/23/eth0/x.x.x.255
heartbeat: 2005/10/25_17:54:17 info: Running /etc/ha.d/resource.d/IPaddr2
x.x.x.53/23/eth0/x.x.x.255 stop
heartbeat: 2005/10/25_17:54:17 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/25_17:54:17 info: ipvs_syncmaster down
heartbeat: 2005/10/25_17:54:17 info: ipvs_syncbackup up
heartbeat: 2005/10/25_17:54:17 info: ipvs_syncmaster released
heartbeat: 2005/10/25_17:54:17 info: Running
/etc/ha.d/resource.d/ldirectord ldirectord.cf stop
heartbeat: 2005/10/25_17:54:17 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:17 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:18 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:19 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:19 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:20 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:20 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:20 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:21 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:21 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:21 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:22 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:22 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:22 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:23 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:23 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:23 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:24 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:24 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:24 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:24 info: Local Resource acquisition completed.
(none)
heartbeat: 2005/10/25_17:54:24 info: local resource transition completed.
heartbeat: 2005/10/25_17:54:25 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:25 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:25 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:26 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:26 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:26 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:27 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:27 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:27 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:28 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:28 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:28 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:28 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:28 ERROR: Resource script for dmccm1 probably
not LSB-compliant.
heartbeat: 2005/10/25_17:54:28 WARN: it (dmccm1) MUST succeed on a stop
when already stopped
heartbeat: 2005/10/25_17:54:28 WARN: Machine reboot narrowly avoided!
heartbeat: 2005/10/25_17:54:28 info: Running /etc/ha.d/resource.d/IPaddr2
x.x.x.52/23/eth0/x.x.x.255 stop
heartbeat: 2005/10/25_17:54:28 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/25_17:54:28 info: ipvs_syncmaster released
heartbeat: 2005/10/25_17:54:28 info: Running
/etc/ha.d/resource.d/ldirectord ldirectord.cf stop
heartbeat: 2005/10/25_17:54:29 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:29 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:30 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:30 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:30 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:31 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:31 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:31 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:32 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:32 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:32 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:33 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:33 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:33 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:34 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:34 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:34 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:35 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:35 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:35 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:36 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:36 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:36 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:37 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:37 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:37 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:38 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:38 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:38 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:39 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:39 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:39 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:39 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:39 ERROR: Resource script for dmccm1 probably
not LSB-compliant.
heartbeat: 2005/10/25_17:54:39 WARN: it (dmccm1) MUST succeed on a stop
when already stopped
heartbeat: 2005/10/25_17:54:39 WARN: Machine reboot narrowly avoided!
heartbeat: 2005/10/25_17:54:39 info: Running /etc/ha.d/resource.d/IPaddr2
x.x.x.51/23/eth0/x.x.x.255 stop
heartbeat: 2005/10/25_17:54:39 info: Running
/etc/ha.d/resource.d/LVSSyncDaemonSwap master stop
heartbeat: 2005/10/25_17:54:39 info: ipvs_syncmaster released
heartbeat: 2005/10/25_17:54:39 info: Running
/etc/ha.d/resource.d/ldirectord ldirectord.cf stop
heartbeat: 2005/10/25_17:54:40 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:40 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:41 info: Retrying failed stop operation
[dmccm1]
heartbeat: 2005/10/25_17:54:41 ERROR: Cannot locate resource script dmccm1
heartbeat: 2005/10/25_17:54:41 ERROR: Cannot locate resource script dmccm1
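
The "Acquiring resource group" entry in the log above is easier to read once
it is split into tokens.  The following is only a rough Python sketch, not
part of heartbeat.  It assumes the heartbeat 1.x haresources convention that
the first token of a group is the node name and every later token is a
resource written as script::argument.  The names split_group and
scripts_named_like_node are hypothetical, and the group text is copied from
the log with the e-mail line wrapping removed.

```python
# Sketch: split one heartbeat 1.x resource-group entry into node + resources.
# GROUP is the text after "Acquiring resource group:" in the log, unwrapped.
GROUP = (
    "dmccm1 ldirectord::ldirectord.cf LVSSyncDaemonSwap::master "
    "IPaddr2::x.x.x.25/23/eth0/x.x.x.255 dmccm1 ldirectord::ldirectord.cf "
    "LVSSyncDaemonSwap::master IPaddr2::x.x.x.51/23/eth0/x.x.x.255 dmccm1 "
    "ldirectord::ldirectord.cf LVSSyncDaemonSwap::master "
    "IPaddr2::x.x.x.52/23/eth0/x.x.x.255 dmccm1 ldirectord::ldirectord.cf "
    "LVSSyncDaemonSwap::master IPaddr2::x.x.x.53/23/eth0/x.x.x.255"
)


def split_group(group):
    """Return (node, [(script, args), ...]) for one resource-group entry."""
    # Assumption: first token is the node, the rest are resources.
    node, *resources = group.split()
    parsed = []
    for res in resources:
        # Assumption: "::" separates the script name from its arguments.
        script, *args = res.split("::")
        parsed.append((script, args))
    return node, parsed


def scripts_named_like_node(group):
    """1-based positions of resources whose script name equals the node name."""
    node, parsed = split_group(group)
    return [i for i, (script, _) in enumerate(parsed, start=1) if script == node]


if __name__ == "__main__":
    node, parsed = split_group(GROUP)
    print("node:", node)
    for script, args in parsed:
        print("  resource script:", script, "args:", args)
    print("resources named like the node:", scripts_named_like_node(GROUP))
```

Read this way, the name dmccm1 appears three more times after the first
token, in positions where a resource script name is expected.  That would be
consistent with the "Cannot locate resource script dmccm1" errors, but
whether the four haresources lines are really being joined into one group is
only a guess.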



Lin Wang
Information Services
SUNY, Downstate Medical Center