[prev in list] [next in list] [prev in thread] [next in thread] 

List:       linux-ha
Subject:    [Linux-HA] drdb StandAlone state not handled properly
From:       "Serge Dubrouski" <sergeyfd () gmail ! com>
Date:       2008-07-30 16:33:43
Message-ID: 868cbbaa0807300933j7b0d3e8byea8d98e6f67632fa () mail ! gmail ! com
[Download RAW message or body]

Hello -

My test cluster has 3 network connections: service, heartbeat,
drbd-sync. While playing with drbd link I found kind of a strange
behavior that's most probably cause by a bug in the drbd OCF RA. When
drbd link is broken drbd puts both nodes to StandAlone state but
Heartbeat doesn't see it and still reports both nodes as healthy. That
is cause by the following code in drbd OCF RA:

        DRBD_STATE=$(do_drbdadm state $RESOURCE)
        DRBD_STATE_LOCAL=$(echo $DRBD_STATE | sed -e 's#/.*##')
        DRBD_STATE_REMOTE=$(echo $DRBD_STATE | sed -e 's#.*/##')


Here OCF RA relays on "drbdadm state" to get state but doesn't check
cstate which can be StandAlone in the case of a broken link. So OCF RA
always sees Primary or Master and doesn't process StandAlone case. In
my opinion it's a kind of a critical bug:

#cat /proc/drbd
version: 8.2.6 (api:88/proto:86-88)
GIT-hash: 3e69822d3bb4920a8c1bfdf7d647169eba7d2eb4 build by
sdubrov@home, 2008-06-02 09:31:28
 0: cs:StandAlone st:Secondary/Unknown ds:UpToDate/DUnknown   r---
    ns:4 nr:1 dw:1 dr:4 al:0 bm:1 lo:0 pe:0 ua:0 ap:0 oos:0

# drbdadm state drbd0
Secondary/Unknown

# drbdadm cstate drbd0
StandAlone


crm_mon:

============
Last updated: Wed Jul 30 12:33:02 2008
Current DC: fc-node2 (ad6f19b7-228a-48b7-bae0-f95a838bde2a)
2 Nodes configured.
3 Resources configured.
============

Node: fc-node1 (b88f98c6-50f2-463a-a6eb-51abbec645a9): online
Node: fc-node2 (ad6f19b7-228a-48b7-bae0-f95a838bde2a): online

Clone Set: DoFencing
    child_DoFencing:0   (stonith:external/xen0):        Started fc-node1
    child_DoFencing:1   (stonith:external/xen0):        Started fc-node2
Master/Slave Set: ms-drbd0
    drbd0:0     (ocf::heartbeat:drbd):  Master fc-node1
    drbd0:1     (ocf::heartbeat:drbd):  Started fc-node2
Resource Group: myGroup
    myIP        (ocf::heartbeat:IPaddr):        Started fc-node1
    fs0 (ocf::heartbeat:Filesystem):    Started fc-node1
    myPgsql     (ocf::heartbeat:pgsql): Started fc-node1


-- 
Serge Dubrouski.
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic