[prev in list] [next in list] [prev in thread] [next in thread] 

List:       linux-ha-dev
Subject:    [Linux-ha-dev] VirtualDomain issue
From:       Dominik Klein <dominik.klein () googlemail ! com>
Date:       2011-06-23 5:51:48
Message-ID: 4E02D474.70509 () gmail ! com
[Download RAW message or body]

Hi

code snippet from
http://hg.linux-ha.org/agents/raw-file/7a11934b142d/heartbeat/VirtualDomain
(which I believe is the current version)

VirtualDomain_Validate_All() {
<snip>
     if [ ! -r $OCF_RESKEY_config ]; then
	if ocf_is_probe; then
	    ocf_log info "Configuration file $OCF_RESKEY_config not readable
during probe."
	else
	    ocf_log error "Configuration file $OCF_RESKEY_config does not exist
or is not readable."
	    return $OCF_ERR_INSTALLED
	fi
     fi
}
<snip>
VirtualDomain_Validate_All || exit $?
<snip>
if ocf_is_probe && [ ! -r $OCF_RESKEY_config ]; then
     exit $OCF_NOT_RUNNING
fi

So, say one node does not have the config, but the cluster decides to
run the vm on that node. The probe returns NOT_RUNNING, so the cluster
tries to start the vm, that start returns ERR_INSTALLED, the cluster has
to try to recover from the start failure, so stop it, but that stop op
returns ERR_INSTALLED as well, so we need to be stonith'd.

I think this is wrong behaviour. I read the comments about
configurations being on shared storage which might not be available at
certain points in time and I see the point. But the way this is
implemented clearly does not work for everybody. I vote for making this
configurable. Unfortunately, due to several reasons, I am not able to
contribute this patch myself at the moment.

Regards
Dominik
_______________________________________________________
Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
Home Page: http://linux-ha.org/
[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic