[prev in list] [next in list] [prev in thread] [next in thread] 

List:       netbsd-port-amd64
Subject:    Re: sometimes resumeable freeze
From:       "Jonathan A. Kollasch" <jakllsch () kollasch ! net>
Date:       2011-07-20 1:43:12
Message-ID: 20110720014312.GB2553 () tarantulon ! kollasch ! net
[Download RAW message or body]

Following up,

It turns out that this issue is caused by the 8259 PICs not being masked
in IOAPIC mode, causing spurious interrupts in conjunction with real
ones.  I should get a PR filed soon.

On Tue, Apr 12, 2011 at 01:17:32AM +0000, Jonathan A. Kollasch wrote:
> Hi,
> 
> My quad core AMD box has an issue where it will lock up under load.
> 
> Often, when the machine is otherwise unresponsive, I'll press
> Num Lock on the pckbd(4) and it will awaken at the point in time
> it went comatose.  Eventually, pressing num lock doesn't bring it
> back and I have to force a reboot.
> 
> If I `cpuctl offline` ¾ of the CPUs the machine is acceptably stable.
> The machine is also acceptably stable under Linux with all cores
> enabled and performing a similar workload (build.sh release).
> 
> acpicpu(4) is present in the kernel and I have verified that C1E
> is not enabled.  The machine was significantly less stable when
> C1E was enabled.
> 
> I've tried to wire in a Conventional PCI SERR# trigger, but
> the chipset doesn't propagate that into an NMI and I lack
> details on the chipset required to enable that if possible.
> 
> Can anyone think of ways to debug this issue?
> 
> 	Jonathan Kollasch
[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic