
List:       freedesktop-xorg
Subject:    Re: Random occasional graphics freeze on Intel chipset
From:       Alex Villacís Lasso <a_villacis () palosanto ! com>
Date:       2013-10-15 22:36:10
Message-ID: 525DC35A.9010704 () palosanto ! com

On 14/10/13 10:16, Alex Villacís Lasso wrote:
> On 02/10/13 10:01, Chris Wilson wrote:
> > On Wed, Oct 02, 2013 at 09:22:18AM -0500, Alex Villacís Lasso wrote:
> > > On 02/10/13 05:19, Chris Wilson wrote:
> > > > On Tue, Oct 01, 2013 at 06:15:00PM -0500, Alex Villacís Lasso wrote:
> > > > > I have seen this graphics freeze under stock 3.10.x from the Fedora
> > > > > 18 x86_64 distro, and also with vanilla compiled 3.11 and 3.12-rc3.
> > > > > After a few hours of working, the screen stops updating. The mouse
> > > > > pointer moves around and changes if moved over different parts of
> > > > > the screen, but the display itself does not change anymore. If I
> > > > > check /sys/kernel/debug/dri/0/i915_error_state right then (via a
> > > > > remote ssh), there is no error captured. However, if I do "echo 1 >
> > > > > /sys/kernel/debug/dri/0/i915_wedged", after a few moments an error
> > > > > is captured, as well as messages in the kernel log, both of which
> > > > > are attached. If I try to restart the gnome-shell session, I get the
> > > > > KMS console, and then the start of the graphic login, but then the
> > > > > graphic login itself freezes again.
> > > > > 
> > > > > Is the attached information enough to diagnose the issue?
> > > > Afaict it was a userspace hang, the GPU was rightfully idle. Only on the
> > > > reset did it actually die.
> > > If I do "echo 1 > /sys/kernel/debug/dri/0/i915_wedged" when the display is not
> > > frozen, I only get the following in dmesg, and the system keeps working
> > > normally:
> > > [  323.441616] [drm] Manually setting wedged to 1
> > > [  323.441622] [drm] capturing error event; look for more information in /sys/class/drm/card0/error
> > > [  348.955655] [drm] Manually setting wedged to 0
> > > 
> > > Is it to be expected that a "userspace hang" will escalate into a failed reset
> > > when setting i915_wedged to 1, without anything actually being wrong on the
> > > kernel side, at least at first?
> > Yes, your chipset is notorious for not being able to restart the rings.
> > We've added a few attempts to work around the issue, but I'm not
> > surprised if it still occasionally fails.
> > 
> > > > I'd suggest looking at the stacktraces of the usual suspects and see who
> > > > is waiting upon whom, or if there is a more obvious lockup. Then begin
> > > > the painful process of tracing the interoperation of those two processes
> > > > to try and catch the breakdown.
> > > > -Chris
> > > > 
> > > I think Xorg is one of the "usual suspects". Should gnome-shell be one too?
> > > This is a Fedora 18 desktop with gnome-shell as installed from the DVD.
> > X and gnome-shell are the two responsible for working together and
> > presenting your desktop, so would definitely be the first to check for
> > an error.
> > -Chris
> > 
> I got the hang again in kernel-3.10.13-101.fc18.x86_64. I switched to the KMS text
> console with Ctrl-Alt-F2, downloaded all of the debuginfo packages, and got this
> backtrace with "gdb -batch -ex bt -p `pidof gnome-shell`".
> 
> [New LWP 2217]
> [New LWP 2182]
> [New LWP 2035]
> [New LWP 2034]
> [New LWP 2033]
> [New LWP 2032]
> [Thread debugging using libthread_db enabled]
> Using host libthread_db library "/lib64/libthread_db.so.1".
> 0x00007febbcca7a5d in poll () at ../sysdeps/unix/syscall-template.S:81
> 81    T_PSEUDO (SYSCALL_SYMBOL, SYSCALL_NAME, SYSCALL_NARGS)
> #0  0x00007febbcca7a5d in poll () at ../sysdeps/unix/syscall-template.S:81
> #1  0x00007febbb943f42 in poll (__timeout=-1, __nfds=1, __fds=0x7fffbc0cefd0) at /usr/include/bits/poll2.h:46
> #2  _xcb_conn_wait (c=c@entry=0x19d4130, cond=cond@entry=0x7fffbc0cf040, vector=vector@entry=0x0, count=count@entry=0x0) at xcb_conn.c:414
> #3  0x00007febbb94543e in wait_for_reply (c=c@entry=0x19d4130, request=154086, e=e@entry=0x0) at xcb_in.c:399
> #4  0x00007febbb94564b in xcb_wait_for_reply (c=c@entry=0x19d4130, request=154086, e=e@entry=0x0) at xcb_in.c:429
> #5  0x00007febbbb5a435 in xcb_dri2_wait_msc_reply (c=c@entry=0x19d4130, cookie=..., e=e@entry=0x0) at dri2.c:1619
> #6  0x00007febc1f55b32 in dri2WaitForMSC (pdraw=0x310b180, target_msc=0, divisor=2, remainder=<optimized out>, ust=0x7fffbc0cf158, msc=0x7fffbc0cf160, sbc=0x7fffbc0cf168) at dri2_glx.c:473
> #7  0x00007febc1f3074b in __glXWaitVideoSyncSGI (divisor=2, remainder=0, count=0x7fffbc0cf1ac) at glxcmds.c:1850
> #8  0x00007febbf396fcd in _cogl_winsys_wait_for_vblank () at winsys/cogl-winsys-glx.c:1143
> #9  0x00007febbf397ffc in _cogl_winsys_onscreen_swap_region (onscreen=<optimized out>, user_rectangles=0x7fffbc0cf2a0, n_rectangles=1) at winsys/cogl-winsys-glx.c:1270
> #10 0x00007febbf38f978 in cogl_onscreen_swap_region (onscreen=0x310af10, rectangles=rectangles@entry=0x7fffbc0cf2a0, n_rectangles=n_rectangles@entry=1) at ./cogl-onscreen.c:181
> #11 0x00007febbfa18571 in clutter_stage_cogl_redraw (stage_window=0x19c0cb0) at cogl/clutter-stage-cogl.c:482
> #12 0x00007febbfa8341d in clutter_stage_do_redraw (stage=0x3108a40 [ClutterStage]) at ./clutter-stage.c:1170
> #13 _clutter_stage_do_update (stage=0x3108a40 [ClutterStage]) at ./clutter-stage.c:1228
> #14 0x00007febbfa67d3d in master_clock_update_stages (stages=0x5d37910 = {...}, master_clock=0x2f6f4a0 [ClutterMasterClock]) at ./clutter-master-clock.c:386
> #15 clutter_clock_dispatch (source=source@entry=0x30961b0, callback=<optimized out>, user_data=<optimized out>) at ./clutter-master-clock.c:520
> #16 0x0000003b5d847a55 in g_main_dispatch (context=0x19a95d0) at gmain.c:2715
> #17 g_main_context_dispatch (context=context@entry=0x19a95d0) at gmain.c:3219
> #18 0x0000003b5d847d88 in g_main_context_iterate (context=0x19a95d0, block=block@entry=1, dispatch=dispatch@entry=1, self=<optimized out>) at gmain.c:3290
> #19 0x0000003b5d848182 in g_main_loop_run (loop=0x19b0020) at gmain.c:3484
> #20 0x00007febc3979467 in meta_run () at core/main.c:545
> #21 0x0000000000401e2c in main ()
> 
> _______________________________________________
> xorg@lists.x.org: X.Org support
> Archives: http://lists.freedesktop.org/archives/xorg
> Info: http://lists.x.org/mailman/listinfo/xorg
> Your subscription address: a_villacis@palosanto.com
Still happening with self-compiled 3.12-rc5 kernel and self-compiled xf86-video-intel 2.21.15.
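
For anyone trying to reproduce this: the backtrace quoted above covers only the main thread of gnome-shell, while Chris suggested looking at both Xorg and gnome-shell to see who is waiting on whom. Below is a minimal sketch of how traces for every thread of both processes could be gathered in one go from the text console. The helper names (gdb_command, pids_of, collect_backtraces) are made up for illustration, and the use of "thread apply all bt" instead of plain "bt" is my own assumption about what would be more useful, not something requested in the thread. Attaching to Xorg will normally require root.

```python
import shutil
import subprocess

# Process names taken from the thread ("the usual suspects").
SUSPECTS = ("Xorg", "gnome-shell")


def gdb_command(pid):
    """Build a gdb invocation that dumps a backtrace for every thread of pid."""
    return ["gdb", "-batch", "-ex", "thread apply all bt", "-p", str(pid)]


def pids_of(name):
    """Return the PIDs reported by pidof for name (empty list if none)."""
    if shutil.which("pidof") is None:
        return []
    result = subprocess.run(["pidof", name], capture_output=True, text=True)
    return [int(p) for p in result.stdout.split()]


def collect_backtraces(names=SUSPECTS):
    """Map (process name, pid) to the text gdb printed for that process."""
    traces = {}
    if shutil.which("gdb") is None:
        return traces  # nothing can be collected without gdb installed
    for name in names:
        for pid in pids_of(name):
            result = subprocess.run(
                gdb_command(pid), capture_output=True, text=True
            )
            traces[(name, pid)] = result.stdout
    return traces
```

Calling collect_backtraces() during a freeze and writing each value to its own file would give one trace per suspect to compare side by side.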

