[prev in list] [next in list] [prev in thread] [next in thread] 

List:       nutch-general
Subject:    Re: [Nutch-general] upgrading to hadoop-0.4
From:       "Zaheed Haque" <zaheed.haque () gmail ! com>
Date:       2006-07-01 9:30:49
Message-ID: 116aa12e0607010230y52d07a6esc8aa298eb6dced32 () mail ! gmail ! com
[Download RAW message or body]

After creating the directories crawldb and current by hand I could
perform an injection. Is this a bug should I file a JIRA issue?

Zaheed

On 7/1/06, Zaheed Haque <zaheed.haque@gmail.com> wrote:
> Forgot to mention I was doing some URL injection bin/nutch inject crawldb urls
> Cheers
>
> On 7/1/06, Zaheed Haque <zaheed.haque@gmail.com> wrote:
> > Hi:
> >
> > Everything was working good with hadoop 3.2, but now after upgrading
> > to hadoop-0.4 I am getting the following error
> >
> > 2006-07-01 11:12:44,989 INFO  conf.Configuration
> > (Configuration.java:loadResource(397)) - parsing
> > jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/hadoop-default.xml
> > 2006-07-01 11:12:45,006 INFO  conf.Configuration
> > (Configuration.java:loadResource(397)) - parsing
> > file:/usr/local/java/nutch-0.8-dev/conf/nutch-default.xml
> > 2006-07-01 11:12:45,040 INFO  conf.Configuration
> > (Configuration.java:loadResource(397)) - parsing
> > jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/mapred-default.xml
> > 2006-07-01 11:12:45,058 INFO  conf.Configuration
> > (Configuration.java:loadResource(397)) - parsing
> > jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/mapred-default.xml
> > 2006-07-01 11:12:45,120 INFO  conf.Configuration
> > (Configuration.java:loadResource(397)) - parsing
> > file:/usr/local/java/nutch-0.8-dev/conf/hadoop-site.xml
> > 20
> > 2006-07-01 11:12:46,379 ERROR mapred.JobClient
> > (JobClient.java:submitJob(273)) - Input directory
> > /usr/local/java/nutch-0.8-dev/crawldb/current in local is invalid.
> > Exception in thread "main" java.io.IOException: Input directory
> > /usr/local/java/nutch-0.8-dev/crawldb/current in local is invalid.
> >         at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
> >         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
> >         at org.apache.nutch.crawl.Injector.inject(Injector.java:146)
> >         at org.apache.nutch.crawl.Injector.main(Injector.java:164)
> >
> > I am wondering if this is a known fact or do I need to do something
> > with my configuration?
> >
> > Thanks
> > Zaheed
> >
>

Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
Nutch-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-general
[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic