[prev in list] [next in list] [prev in thread] [next in thread] 

List:       mesos-user
Subject:    Re: Reconnected slaves not sending resource offers?
From:       Thomas Petr <tpetr () hubspot ! com>
Date:       2016-04-25 21:55:46
Message-ID: CAJRB3TH24r9EYU3moagTHbG2HTaX5y1Oi4EM=fDPUdMmBfvbFQ () mail ! gmail ! com
[Download RAW message or body]

Ah, thanks for the clarification. I can't find any logs from the framework
indicating that we got the initial offer, so it looks like it could have
been dropped. We haven't set --offer-timeout on our masters, so your
explanation makes sense. Thanks!

On Mon, Apr 25, 2016 at 4:17 PM, Vinod Kone <vinodkone@apache.org> wrote:

>
> I0421 21:03:32.014999 17071 master.cpp:4290] Sending 1 offers to
>> framework sy3x4 (sy3x4) at
>> scheduler-6bb2bcf0-d060-4072-a25b-917d8007fb1c@172.16.13.243:56861
>>
>
> This shows that the slaves resources were sent to a framework. Looks like
> the framework is holding on to the offer for a long time?
>
>
>> I0421 21:03:32.019800 17076 hierarchical.hpp:588] Slave
>> 20151116-203437-35000492-5050-17068-S70 (lively-rice) updated with
>> oversubscribed resources  (total: mem(*):217609; cpus(*):210;
>> ports(*):[2048-3048]; disk(*):639829, allocated: mem(*):217609;
>> cpus(*):210; ports(*):[2048-3048]; disk(*):639829)
>>
>
> This says that from the view point of master/allocator, all the resources
> are allocated. This is because the framework hasn't replied to the offer.
> Did the framework receive the offer or was it dropped by the network due to
> the networking issues?
>
>

[Attachment #3 (text/html)]

<div dir="ltr">Ah, thanks for the clarification. I can&#39;t find any logs from the \
framework indicating that we got the initial offer, so it looks like it could have \
been dropped. We haven&#39;t set --offer-timeout on our masters, so your explanation \
makes sense. Thanks!</div><div class="gmail_extra"><br><div class="gmail_quote">On \
Mon, Apr 25, 2016 at 4:17 PM, Vinod Kone <span dir="ltr">&lt;<a \
href="mailto:vinodkone@apache.org" \
target="_blank">vinodkone@apache.org</a>&gt;</span> wrote:<br><blockquote \
class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc \
solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div \
class="gmail_quote"><span class=""><br><blockquote class="gmail_quote" \
style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div>I0421 \
<span><span>21:03:32.014999</span></span> 17071 master.cpp:4290] Sending 1 offers to \
framework sy3x4 (sy3x4) at <a \
href="http://scheduler-6bb2bcf0-d060-4072-a25b-917d8007fb1c@172.16.13.243:56861" \
target="_blank">scheduler-6bb2bcf0-d060-4072-a25b-917d8007fb1c@172.16.13.243:56861</a></div></blockquote><div><br></div></span><div>This \
shows that the slaves resources were sent to a framework. Looks like the framework is \
holding on to the offer for a long time?</div><span class=""><div>  </div><blockquote \
class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc \
solid;padding-left:1ex"><div>I0421 <span><span>21:03:32.019800</span></span> 17076 \
hierarchical.hpp:588] Slave 20151116-203437-35000492-5050-17068-S70 (lively-rice) \
updated with oversubscribed resources   (total: mem(*):217609; cpus(*):210; \
ports(*):[2048-3048]; disk(*):639829, allocated: mem(*):217609; cpus(*):210; \
ports(*):[2048-3048]; \
disk(*):639829)<br></div><div></div></blockquote></span></div><div \
class="gmail_extra"><br></div>This says that from the view point of master/allocator, \
all the resources are allocated. This is because the framework hasn&#39;t replied to \
the offer. Did the framework receive the offer or was it dropped by the network due \
to the networking issues?<br><br></div></div> </blockquote></div><br></div>



[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic