List:       mesos-dev
Subject:    Re: [MESOS-10007] random "Failed to get exit status for Command" for short-lived commands
From:       Benjamin Mahler <bmahler () apache ! org>
Date:       2019-10-22 1:57:56
Message-ID: CAFp_NitPuPiaJMr+wNFqKc9XO2RtVLpR3xo4C1+YLRoEBM_W4g () mail ! gmail ! com


Hi Charles, thanks for the thorough ticket and for surfacing it here for
attention; it didn't get spotted amongst the JIRA noise.

I replied on the ticket with a patch that should fix the issue. We can
discuss further in the ticket.

Ben

On Sat, Oct 19, 2019 at 7:35 AM Charles-François Natali <cf.natali@gmail.com>
wrote:

> Hi,
>
> I'm wondering if there's anything I could do to help
> https://issues.apache.org/jira/browse/MESOS-10007 move forward?
>
> Basically it's a race condition in libprocess/command executor causing
> spurious errors to be reported for short-lived tasks.
> I've got a detailed code path of the race and a repro, however I'm not
> sure what's the best way to fix it - any suggestion?
>
> Cheers,
>
> Charles
>

