[prev in list] [next in list] [prev in thread] [next in thread]
List: hadoop-user
Subject: Help with LeaseExpiredException
From: Michael Stack <stack () archive ! org>
Date: 2006-09-20 21:35:34
Message-ID: 4511B426.4030604 () archive ! org
[Download RAW message or body]
Dear Hadoopers:
I'm using hadoop 0.5.0 (My job is a derivative of the nutch fetch job).
I've had success in the past with older versions of hadoop but now jobs
keep failing because one of the reduces invariably encounters 4
instances of the below:
org.apache.hadoop.ipc.RemoteException: org.apache.hadoop.dfs.LeaseExpiredException: \
No lease on /user/stack/nla/2005-outputs/segments/20060920054847-nla2005/crawl_fetch/part-00018/data
at org.apache.hadoop.dfs.FSNamesystem.getAdditionalBlock(FSNamesystem.java:454)
at org.apache.hadoop.dfs.NameNode.addBlock(NameNode.java:228)
at sun.reflect.GeneratedMethodAccessor16.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:332)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:468)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:159)
I've been playing making the jobs smaller in size -- shrinking from
multi-day to single-day, and on down -- but they continue to fail with
the above. I was going to try the Konstantin suggestion from here --
http://mail-archives.apache.org/mod_mbox/lucene-hadoop-dev/200607.mbox/%3C331ED54F-9FA7-48FE-A604-017CC54DA524@yahoo-inc.com%3E \
-- lowering the ipc timeout down to about 20 seconds from 60 but am a
little worried that'll provoke issues elsewhere.
Was wondering if anyone else is running into this issue or if pointers
on things to try.
Thanks,
St.Ack
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic