[prev in list] [next in list] [prev in thread] [next in thread] 

List:       hadoop-user
Subject:    Re: [E] Re: Increased DN heap usage during Hadoop 3 upgrade
From:       Kihwal Lee <kihwal () verizonmedia ! com ! INVALID>
Date:       2020-10-07 15:46:18
Message-ID: CAKYKF1Wo-e0LALxy1HvwhcGH9N3rf-RLTLsykm7u8XZ+s-uEqg () mail ! gmail ! com
[Download RAW message or body]

We haven't experienced anything like that up to 2.8. We are still in the
process of stabilizing 2.10 as we upgrade some of the bigger clusters. We
will know soon how 2.10 datanodes behave under heavy load and storage
utilization.

If you are seeing a significant change, it might be something post-2.8 or
even post-2.10.

Kihwal

On Tue, Oct 6, 2020 at 5:09 PM Wei-Chiu Chuang <weichiu@cloudera.com> wrote:

> Sorry for not being specific.
> I was referring to HDFS-8791
> <https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jira_browse_ \
> HDFS-2D8791&d=DwMFaQ&c=sWW_bEwW_mLyN3Kx2v57Q8e-CRbmiT9yOhqES_g_wVY&r=dAJ657NT-13Zjdb \
> 3zsUQxFoymNFB0SJd_2OTmE5mCR4&m=M36liML4Z0UBfc0vLFzg_C0fN_jTaH_ZbUGM_0Mnwjo&s=ukaowpvXdF0_o7i-UHB4046_L5Qyd0ZkEP9D778DM9c&e=> \
> (block ID-based DN storage layout can be very slow for datanode on ext4) where it
> is in 2.8 and above.
> 
> As I understand it, the increased heap usage only occurs during upgrade.
> No issue afterwards.
> 
> My experience was based on CDH5 to CDH6 upgrade (Hadoop 2.6 -> Hadoop 3.0)
> and HDP2 to HDP3 (Hadoop 2.7 -> Hadoop 3.1) upgrade. It is nearly
> impossible to tell which commit increases heap usage worse during upgrade.
> 
> 
> 
> On Tue, Oct 6, 2020 at 3:01 PM Kihwal Lee <kihwal@verizonmedia.com> wrote:
> 
> > Which layout change are you referring to? The only layout change I know
> > of was done in 2.7, IIRC. We backported that to 2.6 and did not see any
> > adverse effects at that time.
> > 
> > Is datanode using more heap all the time? Or is it running into trouble
> > when generating full block reports?
> > 
> > Kihwal
> > 
> > On Mon, Oct 5, 2020 at 1:40 PM Wei-Chiu Chuang
> > <weichiu@cloudera.com.invalid> wrote:
> > 
> > > We experienced this issue on CDH6 and HDP3, so roughly Hadoop 3.0.x and
> > > 3.1.x.
> > > Hermanth experienced the same issue on Hadoop 3.1.1 as well (HDFS-15569
> > > <
> > > https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jira_brow \
> > > se_HDFS-2D15569&d=DwIBaQ&c=sWW_bEwW_mLyN3Kx2v57Q8e-CRbmiT9yOhqES_g_wVY&r=b6gUZYe \
> > > wojO-9YMJdyeI_g&m=itpohwgKPN5qoauYyyMxhGSnasaP3LLbbMVezETEenA&s=kgWYVv2utuAyPWBhv0KVH8ZZGJqQBMvUM7dZ8J0jaa8&e=
> > > 
> > > > )
> > > 
> > > On Mon, Oct 5, 2020 at 11:03 AM Igor Dvorzhak <idv@google.com> wrote:
> > > 
> > > > What Hadoop 3 version do you use?
> > > > 
> > > > On Mon, Oct 5, 2020 at 10:03 AM Wei-Chiu Chuang <weichiu@apache.org>
> > > > wrote:
> > > > 
> > > > > I have anecdotally learned of multiple data points where during the
> > > > > upgrading from Hadoop 2 to Hadoop 3, DN heap usage increases to the
> > > point
> > > > > where it goes OOM.
> > > > > 
> > > > > Don't have much logs for this issue, but I suspect it's caused by the
> > > > > layout change added in Hadoop 2.8.0.
> > > > > 
> > > > > Does anyone else observe the same issue and how do you mitigate this?
> > > For
> > > > > now we suggested increasing DN heap size prior to upgrade as part of
> > > > > pre-upgrade checklist.
> > > > > 
> > > > > Thanks,
> > > > > Wei-Chiu
> > > > > 
> > > > 
> > > 
> > 


[Attachment #3 (text/html)]

<div dir="ltr"><div class="gmail_default" \
style="font-family:verdana,sans-serif;font-size:large">We haven&#39;t experienced \
anything like that up to 2.8. We are still in the process of stabilizing 2.10 as we \
upgrade some of the bigger clusters. We will know soon how 2.10 datanodes behave  \
under heavy load and storage utilization.  </div><div class="gmail_default" \
style="font-family:verdana,sans-serif;font-size:large"><br></div><div \
class="gmail_default" style="font-family:verdana,sans-serif;font-size:large">If you \
are seeing a significant change, it might be something post-2.8 or even \
post-2.10.</div><div class="gmail_default" \
style="font-family:verdana,sans-serif;font-size:large"><br></div><div \
class="gmail_default" \
style="font-family:verdana,sans-serif;font-size:large">Kihwal</div><div \
class="gmail_default" \
style="font-family:verdana,sans-serif;font-size:large"></div></div><br><div \
class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Oct 6, 2020 at 5:09 PM \
Wei-Chiu Chuang &lt;<a \
href="mailto:weichiu@cloudera.com">weichiu@cloudera.com</a>&gt; \
wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px \
0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Sorry \
for not being specific.<div>I was referring to  <a \
href="https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jira_brow \
se_HDFS-2D8791&amp;d=DwMFaQ&amp;c=sWW_bEwW_mLyN3Kx2v57Q8e-CRbmiT9yOhqES_g_wVY&amp;r=dA \
J657NT-13Zjdb3zsUQxFoymNFB0SJd_2OTmE5mCR4&amp;m=M36liML4Z0UBfc0vLFzg_C0fN_jTaH_ZbUGM_0Mnwjo&amp;s=ukaowpvXdF0_o7i-UHB4046_L5Qyd0ZkEP9D778DM9c&amp;e=" \
id="gmail-m_1371779981432574560gmail-key-val" rel="12845740" \
style="font-family:-apple-system,system-ui,&quot;Segoe \
UI&quot;,Roboto,Oxygen,Ubuntu,&quot;Fira Sans&quot;,&quot;Droid \
Sans&quot;,&quot;Helvetica Neue&quot;,sans-serif;font-size:14px;color:rgb(0,101,255)" \
target="_blank">HDFS-8791</a>  (block ID-based DN storage layout can be very slow for \
datanode on ext4) where it is in 2.8 and above.</div><div><br></div><div>As I \
understand it, the increased heap usage only occurs during upgrade. No issue \
afterwards.</div><div><br></div><div>My experience was based on CDH5 to CDH6 upgrade \
(Hadoop 2.6 -&gt; Hadoop 3.0) and HDP2 to HDP3 (Hadoop 2.7 -&gt; Hadoop 3.1) upgrade. \
It is nearly impossible to tell which commit increases heap usage worse during \
upgrade.  </div><div><br></div><div><br></div></div><br><div class="gmail_quote"><div \
dir="ltr" class="gmail_attr">On Tue, Oct 6, 2020 at 3:01 PM Kihwal Lee &lt;<a \
href="mailto:kihwal@verizonmedia.com" target="_blank">kihwal@verizonmedia.com</a>&gt; \
wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px \
0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div \
class="gmail_default" style="font-family:verdana,sans-serif;font-size:large">Which \
layout change are you referring to? The only layout change I know of was done in 2.7, \
IIRC. We backported that to 2.6 and did not see any adverse effects at that time.  \
</div><div class="gmail_default" \
style="font-family:verdana,sans-serif;font-size:large"><br></div><div \
class="gmail_default" style="font-family:verdana,sans-serif;font-size:large">Is \
datanode using more heap all the time? Or is it running into trouble when generating \
full block reports?</div><div class="gmail_default" \
style="font-family:verdana,sans-serif;font-size:large"><br></div><div \
class="gmail_default" \
style="font-family:verdana,sans-serif;font-size:large">Kihwal</div></div><br><div \
class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Oct 5, 2020 at 1:40 PM \
Wei-Chiu Chuang &lt;weichiu@cloudera.com.invalid&gt; wrote:<br></div><blockquote \
class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid \
rgb(204,204,204);padding-left:1ex">We experienced this issue on CDH6 and HDP3, so \
roughly Hadoop 3.0.x and<br> 3.1.x.<br>
Hermanth experienced the same issue on Hadoop 3.1.1 as well (HDFS-15569<br>
&lt;<a href="https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_ji \
ra_browse_HDFS-2D15569&amp;d=DwIBaQ&amp;c=sWW_bEwW_mLyN3Kx2v57Q8e-CRbmiT9yOhqES_g_wVY& \
amp;r=b6gUZYewojO-9YMJdyeI_g&amp;m=itpohwgKPN5qoauYyyMxhGSnasaP3LLbbMVezETEenA&amp;s=kgWYVv2utuAyPWBhv0KVH8ZZGJqQBMvUM7dZ8J0jaa8&amp;e=" \
rel="noreferrer" target="_blank">https://urldefense.proofpoint.com/v2/url?u=https-3A__ \
issues.apache.org_jira_browse_HDFS-2D15569&amp;d=DwIBaQ&amp;c=sWW_bEwW_mLyN3Kx2v57Q8e- \
CRbmiT9yOhqES_g_wVY&amp;r=b6gUZYewojO-9YMJdyeI_g&amp;m=itpohwgKPN5qoauYyyMxhGSnasaP3LLbbMVezETEenA&amp;s=kgWYVv2utuAyPWBhv0KVH8ZZGJqQBMvUM7dZ8J0jaa8&amp;e=</a> \
&gt;)<br> <br>
On Mon, Oct 5, 2020 at 11:03 AM Igor Dvorzhak &lt;<a href="mailto:idv@google.com" \
target="_blank">idv@google.com</a>&gt; wrote:<br> <br>
&gt; What Hadoop 3 version do you use?<br>
&gt;<br>
&gt; On Mon, Oct 5, 2020 at 10:03 AM Wei-Chiu Chuang &lt;<a \
href="mailto:weichiu@apache.org" target="_blank">weichiu@apache.org</a>&gt;<br> &gt; \
wrote:<br> &gt;<br>
&gt;&gt; I have anecdotally learned of multiple data points where during the<br>
&gt;&gt; upgrading from Hadoop 2 to Hadoop 3, DN heap usage increases to the \
point<br> &gt;&gt; where it goes OOM.<br>
&gt;&gt;<br>
&gt;&gt; Don&#39;t have much logs for this issue, but I suspect it&#39;s caused by \
the<br> &gt;&gt; layout change added in Hadoop 2.8.0.<br>
&gt;&gt;<br>
&gt;&gt; Does anyone else observe the same issue and how do you mitigate this? \
For<br> &gt;&gt; now we suggested increasing DN heap size prior to upgrade as part \
of<br> &gt;&gt; pre-upgrade checklist.<br>
&gt;&gt;<br>
&gt;&gt; Thanks,<br>
&gt;&gt; Wei-Chiu<br>
&gt;&gt;<br>
&gt;<br>
</blockquote></div>
</blockquote></div>
</blockquote></div>



[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic