[prev in list] [next in list] [prev in thread] [next in thread]
List: ocfs2-users
Subject: [Ocfs2-users] Re: Kernel panic on OCFS2 1.2.6-6 for EL5
From: Daniel <daniel.anderzen () gmail ! com>
Date: 2007-08-28 17:06:11
Message-ID: 8371bf3f0708281006w3ab66b2csdc5ba1fa57f99e33 () mail ! gmail ! com
[Download RAW message or body]
[Attachment #2 (multipart/alternative)]
Hello
filed as bugzilla number 912.
On 8/28/07, Sunil Mushran <Sunil.Mushran@oracle.com> wrote:
>
> Please file a bugzilla. It is very hard to track issue via email.
> Attach the trace below. You should also see a corresponding
> message in one of the other nodes. Specifically node 0. Add
> that too in the bugzilla.
>
> Daniel wrote:
> > Hello
> >
> > I'm still having weekly panics on my system, but now I've at least got
> > something to report back from the netconsole.
> >
> > To summarize system: 2x Dell 1950 connected to a EMC CX3-20 SAN.
> > Centos 5 x86_64 2.6.18-8.1.8.el5 #1 SMP.
> >
> > Tonight both servers locked up - both while idling afaik. But this
> > time tilesrv2 reported the following via netconsole before it went dead.
> >
> > (4225,2):dlm_drop_lockres_ref:2289 ERROR: while dropping ref on
> > 359E1C1D38374654BC5E5896EB7D5187:M0000000000000009cb578f4d7803fc
> > (master=0) got -22.
> > (4225,2):dlm_print_one_lock_resource:294 lockres:
> > M0000000000000009cb578f4d7803fc, owner=0, state=64
> > (4225,2):__dlm_print_one_lock_resource:309 lockres:
> > M0000000000000009cb578f4d7803fc, owner=0, state=64
> > (4225,2):__dlm_print_one_lock_resource:311 last used: 4354492857, on
> > purge list: yes
> > (4225,2):dlm_print_lockres_refmap:277 refmap nodes: [ ], inflight=0
> > (4225,2):__dlm_print_one_lock_resource:313 granted queue:
> > (4225,2):__dlm_print_one_lock_resource:328 converting queue:
> > (4225,2):__dlm_print_one_lock_resource:343 blocked queue:
> > ----------- [cut here ] --------- [please bite here ] ---------
> > Kernel BUG at ...mushran/BUILD/ocfs2-1.2.6/fs/ocfs2/dlm/dlmmaster.c:2291
> > invalid opcode: 0000 [1] SMP
> > last sysfs file: /devices/pci0000:00/0000:00:
> > 04.0/0000:0c:00.0/host1/rport-1:0-1/target1:0:1/1:0:1:4/vendor
> > CPU 2
> > Modules linked in: netconsole autofs4 hidp ocfs2(U) nfs lockd fscache
> > nfs_acl rfcomm l2cap bluetooth ocfs2_dlmfs(U) ocfs2_dlm(U)
> > ocfs2_nodemanager(U) configfs sunrpc ipt_REJECT ip6t_REJECT xt_tcpudp
> > ip6table_filter ip6_tables x_tables dm_emc dm_round_robin dm_multipath
> > video sbs i2c_ec i2c_core button battery asus_acpi acpi_memhotplug ac
> > ipv6 parport_pc lp parport joydev shpchp bnx2 sr_mod ide_cd serio_raw
> > cdrom sg pcspkr dm_snapshot dm_zero dm_mirror dm_mod usb_storage
> > qla2xxx scsi_transport_fc megaraid_sas sd_mod scsi_mod ext3 jbd
> > ehci_hcd ohci_hcdPid: 4225, comm: dlm_thread Not tainted
> > 2.6.18-8.1.8.el5 #1
> > [<ffffffff884d60d3>] :ocfs2_dlm:dlm_drop_lockres_ref+0x1d3/0x1ec
> > RDX: 00000000ffffffff RSI: 0000000000000000 RDI: ffffffff802da65c
> > R13: ffff81012d087000 R14: ffff8100435c5f60 R15: ffffffff8009b4f6
> > CR2: 000000001ec07000 CR3: 00000001289c9000 CR4: 00000000000006e0
> > 303030303030304d
> > 0000000000000000 0000000000000000 ffff81001defe648
> > [<ffffffff884e9031>] :ocfs2_dlm:dlm_purge_lockres+0x175/0x34a
> > [<ffffffff8009b6b9>] autoremove_wake_function+0x0/0x2e
> > [<ffffffff884e93c2>] :ocfs2_dlm:dlm_thread+0x0/0x579
> > [<ffffffff80032189>] kthread+0xfe/0x132
> > [<ffffffff8005bfe5>] child_rip+0xa/0x11
> > [<ffffffff8009b4f6>] keventd_create_kthread+0x0/0x61
> > [<ffffffff8005bfdb>] child_rip+0x0/0x11
> > 0f d6 c2 83 d8 5c [<ffffffff884d60d3>]
> > :ocfs2_dlm:dlm_drop_lockres_ref+0x1d3/0x1ec
> > <0>Kernel panic - not syncing: Fatal exception
> >
> > I'd be happy to provide more info or open a bug-report. Just tell me
> > what you need. I hope this is a better report than last time :)
> >
> > Daniel
>
>
[Attachment #5 (text/html)]
Hello<br><br>filed as bugzilla number 912.<br><br><div><span class="gmail_quote">On \
8/28/07, <b class="gmail_sendername">Sunil Mushran</b> <<a \
href="mailto:Sunil.Mushran@oracle.com">Sunil.Mushran@oracle.com</a>> wrote: \
</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, \
204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Please file a bugzilla. It is \
very hard to track issue via email.<br>Attach the trace below. You should also see a \
corresponding <br>message in one of the other nodes. Specifically node 0. Add<br>that \
too in the bugzilla.<br><br>Daniel wrote:<br>> Hello<br>><br>> I'm still \
having weekly panics on my system, but now I've at least got <br>> something \
to report back from the netconsole.<br>><br>> To summarize system: 2x Dell 1950 \
connected to a EMC CX3-20 SAN.<br>> Centos 5 x86_64 2.6.18-8.1.8.el5 #1 \
SMP.<br>><br>> Tonight both servers locked up - both while idling afaik. But \
this <br>> time tilesrv2 reported the following via netconsole before it went \
dead.<br>><br>> (4225,2):dlm_drop_lockres_ref:2289 ERROR: while dropping ref \
on<br>> 359E1C1D38374654BC5E5896EB7D5187:M0000000000000009cb578f4d7803fc <br>> \
(master=0) got -22.<br>> (4225,2):dlm_print_one_lock_resource:294 lockres:<br>> \
M0000000000000009cb578f4d7803fc, owner=0, state=64<br>> \
(4225,2):__dlm_print_one_lock_resource:309 lockres:<br>> \
M0000000000000009cb578f4d7803fc, owner=0, state=64 <br>> \
(4225,2):__dlm_print_one_lock_resource:311 last used: 4354492857, \
on<br>> purge list: yes<br>> (4225,2):dlm_print_lockres_refmap:277 \
refmap nodes: [ ], inflight=0<br>> \
(4225,2):__dlm_print_one_lock_resource:313 granted queue: <br>> \
(4225,2):__dlm_print_one_lock_resource:328 converting queue:<br>> \
(4225,2):__dlm_print_one_lock_resource:343 blocked queue:<br>> \
----------- [cut here ] --------- [please bite here ] ---------<br>> Kernel BUG at \
...mushran/BUILD/ocfs2- 1.2.6/fs/ocfs2/dlm/dlmmaster.c:2291<br>> invalid opcode: \
0000 [1] SMP<br>> last sysfs file: /devices/pci0000:00/0000:00:<br>> \
04.0/0000:0c:00.0/host1/rport-1:0-1/target1:0:1/1:0:1:4/vendor<br>> CPU 2<br>> \
Modules linked in: netconsole autofs4 hidp ocfs2(U) nfs lockd fscache <br>> \
nfs_acl rfcomm l2cap bluetooth ocfs2_dlmfs(U) ocfs2_dlm(U)<br>> \
ocfs2_nodemanager(U) configfs sunrpc ipt_REJECT ip6t_REJECT xt_tcpudp<br>> \
ip6table_filter ip6_tables x_tables dm_emc dm_round_robin dm_multipath <br>> video \
sbs i2c_ec i2c_core button battery asus_acpi acpi_memhotplug ac<br>> ipv6 \
parport_pc lp parport joydev shpchp bnx2 sr_mod ide_cd serio_raw<br>> cdrom sg \
pcspkr dm_snapshot dm_zero dm_mirror dm_mod usb_storage <br>> qla2xxx \
scsi_transport_fc megaraid_sas sd_mod scsi_mod ext3 jbd<br>> ehci_hcd ohci_hcdPid: \
4225, comm: dlm_thread Not tainted<br>> 2.6.18-8.1.8.el5 \
#1<br>> [<ffffffff884d60d3>] \
:ocfs2_dlm:dlm_drop_lockres_ref+0x1d3/0x1ec <br>> RDX: 00000000ffffffff RSI: \
0000000000000000 RDI: ffffffff802da65c<br>> R13: ffff81012d087000 R14: \
ffff8100435c5f60 R15: ffffffff8009b4f6<br>> CR2: 000000001ec07000 CR3: \
00000001289c9000 CR4: 00000000000006e0 \
<br>> 303030303030304d<br>> 0000000000000000 \
0000000000000000 ffff81001defe648<br>> [<ffffffff884e9031>] \
:ocfs2_dlm:dlm_purge_lockres+0x175/0x34a<br>> [<ffffffff8009b6b9>] \
autoremove_wake_function+0x0/0x2e <br>> [<ffffffff884e93c2>] \
:ocfs2_dlm:dlm_thread+0x0/0x579<br>> [<ffffffff80032189>] \
kthread+0xfe/0x132<br>> [<ffffffff8005bfe5>] \
child_rip+0xa/0x11<br>> [<ffffffff8009b4f6>] \
keventd_create_kthread+0x0/0x61 <br>> [<ffffffff8005bfdb>] \
child_rip+0x0/0x11<br>> 0f d6 c2 83 d8 \
5c [<ffffffff884d60d3>]<br>> \
:ocfs2_dlm:dlm_drop_lockres_ref+0x1d3/0x1ec<br>> <0>Kernel panic - not \
syncing: Fatal exception <br>><br>> I'd be happy to provide more info or \
open a bug-report. Just tell me<br>> what you need. I hope this is a better report \
than last time :)<br>><br>> Daniel<br><br></blockquote></div><br>
_______________________________________________
Ocfs2-users mailing list
Ocfs2-users@oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic