List: mesos-dev
Subject: Re: [MESOS-10007] random "Failed to get exit status for Command" for short-lived commands
From: Benjamin Mahler <bmahler () apache ! org>
Date: 2019-10-22 1:57:56
Message-ID: CAFp_NitPuPiaJMr+wNFqKc9XO2RtVLpR3xo4C1+YLRoEBM_W4g () mail ! gmail ! com
Hi Charles, thanks for the thorough ticket and for surfacing it here for
attention; it didn't get spotted amid the JIRA noise.
I replied on the ticket with a patch that should fix the issue; we can
discuss further there.
Ben
On Sat, Oct 19, 2019 at 7:35 AM Charles-François Natali <cf.natali@gmail.com> wrote:
> Hi,
>
> I'm wondering if there's anything I could do to help
> https://issues.apache.org/jira/browse/MESOS-10007 move forward?
>
> Basically it's a race condition in libprocess/command executor causing
> spurious errors to be reported for short-lived tasks.
> I've got a detailed code path of the race and a repro; however, I'm not
> sure what the best way to fix it is. Any suggestions?
>
> Cheers,
>
> Charles
>