
List:       ocfs2-users
Subject:    Re: [Ocfs2-users] Reg: ocfs2 two node cluster crashed, node2 crashed,
From:       Sunil Mushran <sunil.mushran () oracle ! com>
Date:       2010-10-26 2:08:22
Message-ID: 4CC63816.5090002 () oracle ! com

This means the reboot is not shutting down the services in order.
Ensure the ocfs2 file system is unmounted before the network is shut down.
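
One way to check the ordering is sketched below. It assumes a SysV-style
init in which the reboot runlevel directory holds kill links named
K<NN><service> that are run in ascending order. The helper names and the
sample link names are hypothetical, for illustration only; they are not
taken from your system.

```python
import re

# Assumption: kill links look like "K19ocfs2" and run in ascending order.
KILL_LINK = re.compile(r"^K(\d{2})(.+)$")


def stop_order(link_names):
    """Return service names in the order init would stop them."""
    parsed = []
    for name in link_names:
        match = KILL_LINK.match(name)
        if match:
            parsed.append((int(match.group(1)), match.group(2)))
    parsed.sort()
    return [service for _, service in parsed]


def stops_before(link_names, first, second):
    """True if `first` is stopped before `second`, None if either is missing."""
    order = stop_order(link_names)
    if first not in order or second not in order:
        return None
    return order.index(first) < order.index(second)


# Hypothetical sample; on a real node pass os.listdir() of the reboot
# runlevel directory instead.
sample = ["K90network", "K19ocfs2", "K20o2cb", "S01reboot"]
print(stop_order(sample))
print(stops_before(sample, "ocfs2", "network"))
```

If the second call reports False or None on the node being rebooted, the
ocfs2 volumes are probably still mounted when the network goes down.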

On 10/23/2010 11:27 AM, veeraa bose wrote:
> Hi All,
> 
> We have a two-node ocfs2 cluster running Oracle 11g RAC.
> 
> node2 crashed on its own when I rebooted node1 for maintenance.
> 
> Please check the log from node2 from just before it crashed.
> 
> Oct 23 15:42:25 node2 kernel: ocfs2_dlm: Nodes in domain ("029C02C993E44E90879922E268FB161A"): 2
> Oct 23 15:42:29 node2 kernel: ocfs2_dlm: Node 1 leaves domain 2AB2C04A99BD482A89A7FCE9D3C9319A
> Oct 23 15:42:29 node2 kernel: ocfs2_dlm: Nodes in domain ("2AB2C04A99BD482A89A7FCE9D3C9319A"): 2
> Oct 23 15:42:33 node2 kernel: ocfs2_dlm: Node 1 leaves domain B239262A386C465AA7DEE81C05F2EB93
> Oct 23 15:42:33 node2 kernel: ocfs2_dlm: Nodes in domain ("B239262A386C465AA7DEE81C05F2EB93"): 2
> Oct 23 15:42:38 node2 kernel: ocfs2_dlm: Node 1 leaves domain C54B4F6991954F98AA6A37C4F3901CD8
> Oct 23 15:42:38 node2 kernel: ocfs2_dlm: Nodes in domain ("C54B4F6991954F98AA6A37C4F3901CD8"): 2
> Oct 23 15:42:58 node2 kernel: ocfs2_dlm: Node 1 leaves domain D96AC8E8BDD54913AE6D8EC0EB539603
> Oct 23 15:42:58 node2 kernel: ocfs2_dlm: Nodes in domain ("D96AC8E8BDD54913AE6D8EC0EB539603"): 2
> Oct 23 15:44:06 node2 kernel: o2net: connection to node node1 (num 1) at 192.168.3.1:7777 has been idle for 60.0 seconds, shutting it down.
> Oct 23 15:44:06 node2 kernel: (swapper,0,15):o2net_idle_timer:1503 here are some times that might help debug the situation: (tmr 1287848586.872368 now 1287848646.872227 dr 1287848586.872346 adv 1287848586.872376:1287848586.872376 func (fb860756:513) 1287848578.874476:1287848578.874487)
> Oct 23 15:44:06 node2 kernel: o2net: no longer connected to node node1 (num 1) at 192.168.3.1:7777
> Oct 23 15:45:06 node2 kernel: (o2net,14590,15):o2net_connect_expired:1664 ERROR: no connection established with node 1 after 60.0 seconds, giving up and returning errors.
> Oct 23 15:46:06 node2 kernel: (o2net,14590,15):o2net_connect_expired:1664 ERROR: no connection established with node 1 after 60.0 seconds, giving up and returning errors.
> Oct 23 15:51:34 node2 syslogd 1.4.1: restart.
> 
> Please advise what the issue could be.
> 
> Thanks
> Veera.
> 
> 
> 
> _______________________________________________
> Ocfs2-users mailing list
> Ocfs2-users@oss.oracle.com
> http://oss.oracle.com/mailman/listinfo/ocfs2-users




