[prev in list] [next in list] [prev in thread] [next in thread] 

List:       linux-ha
Subject:    [Linux-HA] Error when testing DRBD with Heartbeat
From:       Fabrice Durand <durand.fabrice () gmail ! com>
Date:       2005-08-30 12:27:37
Message-ID: 12afb699050830052751dcd9f5 () mail ! gmail ! com
[Download RAW message or body]

Hello all,

I am testing an active/passive Heartbeart cluster with DRBD for testing the good
mirroring of data on DRBD filesystems of the 2 nodes. Here is the
haresource file :

EEPCLU1 135.9.216.51 \
		drbddisk \
		Filesystem::/dev/drbd0::/montagedrbd::ext3:: \
		wu-ftpd

Here is the ha.cf file :
  
bcast        eth1,eth0  

debugfile    /var/log/ha-debug  
logfile      /var/log/ha-log  
logfacility  local0  

keepalive    1
deadtime     3    
warntime     6     
initdead     60    

udpport      694  

node         EEPCLU1  
node         EEPCLU2

auto_failback  on  

respawn      hacluster    /usr/lib/heartbeat/ipfail  
ping         EEPNFS

Now when I shutdown heartbeat on the primary node (/etc/init.d/heartbeat stop),
the node is properly giving up its resources but at the end there is
an error message
saying heartbeat cannot open the /var/lib/heartbeat/fifo file. Here is
the log on node EEPCLU1 :

heartbeat: 2005/07/29_15:21:03 info: Heartbeat shutdown in progress. (1416)
heartbeat: 2005/07/29_15:21:03 info: Giving up all HA resources.
heartbeat: 2005/07/29_15:21:03 info: Core process 1419 exited. 7 remaining
heartbeat: 2005/07/29_15:21:03 info: Core process 1420 exited. 6 remaining
heartbeat: 2005/07/29_15:21:03 info: Core process 1421 exited. 5 remaining
heartbeat: 2005/07/29_15:21:03 info: Core process 1422 exited. 4 remaining
heartbeat: 2005/07/29_15:21:03 info: Core process 1423 exited. 3 remaining
heartbeat: 2005/07/29_15:21:03 info: Core process 1424 exited. 2 remaining
heartbeat: 2005/07/29_15:21:03 info: Core process 1425 exited. 1 remaining
heartbeat: 2005/07/29_15:21:03 info: Heartbeat shutdown complete.
heartbeat: 2005/07/29_15:21:04 info: Releasing resource group: eepclu1
135.9.216.51 drbddisk Filesystem::/dev/drbd0::/montagedrbd::ext3::
wu-ftpd
heartbeat: 2005/07/29_15:21:04 info: Running /etc/ha.d/resource.d/wu-ftpd  stop
heartbeat: 2005/07/29_15:21:04 info: Running
/etc/ha.d/resource.d/Filesystem /dev/drbd0 /montagedrbd ext3  stop
heartbeat: 2005/07/29_15:21:04 info: Running /etc/ha.d/resource.d/drbddisk  stop
heartbeat: 2005/07/29_15:21:04 info: Running
/etc/ha.d/resource.d/IPaddr 135.9.216.51 stop
heartbeat: 2005/07/29_15:21:04 info: /sbin/route -n del -host 135.9.216.51
heartbeat: 2005/07/29_15:21:04 info: /sbin/ifconfig eth0:0 down
heartbeat: 2005/07/29_15:21:04 info: IP Address 135.9.216.51 released
heartbeat: 2005/07/29_15:21:04 info: killing /usr/lib/heartbeat/ipfail
process group 1430 with signal 15
heartbeat: 2005/07/29_15:21:04 info: All HA resources relinquished.
heartbeat: 2005/07/29_15:21:04 ERROR: send_cluster_msg: cannot open
/var/lib/heartbeat/fifo: No such device or address

I have "enlarged" the rights of the file, but this makes no changes.
Do you have any explaination of this error ?

Thanks a lot,
Fabrice
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic