List:       nutch-developers
Subject:    [Nutch-dev] [jira] Updated: (NUTCH-311) Page with tens of thousands
From:       "stack () archive ! org (JIRA)" <jira () apache ! org>
Date:       2006-06-23 4:08:31
Message-ID: 7679918.1151035711777.JavaMail.jira () brutus

     [ http://issues.apache.org/jira/browse/NUTCH-311?page=all ]

stack@archive.org updated NUTCH-311:
------------------------------------

    Attachment: too-many-links.patch

Adds a configurable upper bound to the link field in CrawlDatum.
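
For illustration only, here is a minimal Java sketch of the general idea of bounding a link list while it is being collected. It is not the contents of too-many-links.patch and does not use any Nutch classes. The class name BoundedLinkCollector and the maxLinks parameter are hypothetical; in practice the limit would presumably be read from configuration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not taken from the attached patch or from Nutch.
final class BoundedLinkCollector {

    // Upper bound on the number of links kept (assumed to come from configuration).
    private final int maxLinks;
    private final List<String> links = new ArrayList<>();

    BoundedLinkCollector(int maxLinks) {
        if (maxLinks < 0) {
            throw new IllegalArgumentException("maxLinks must be >= 0");
        }
        this.maxLinks = maxLinks;
    }

    /** Keeps the link if the bound has not been reached; returns false if it was dropped. */
    boolean add(String url) {
        if (links.size() >= maxLinks) {
            return false; // bound reached: drop the link instead of growing the list
        }
        links.add(url);
        return true;
    }

    /** Read-only view of the links kept so far. */
    List<String> links() {
        return Collections.unmodifiableList(links);
    }

    public static void main(String[] args) {
        // 1000 is an arbitrary example value, not a default from the patch.
        BoundedLinkCollector collector = new BoundedLinkCollector(1000);
        for (int i = 0; i < 50_000; i++) {
            collector.add("http://example.com/page" + i);
        }
        System.out.println(collector.links().size());
    }
}
```

Dropping links at collection time, rather than trimming a full list afterwards, keeps memory use proportional to the bound instead of to the number of links on the page.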

> Page with tens of thousands of links OOME'd.
> --------------------------------------------
>
>          Key: NUTCH-311
>          URL: http://issues.apache.org/jira/browse/NUTCH-311
>      Project: Nutch
>         Type: Bug

>     Versions: 0.8-dev
>     Reporter: stack@archive.org
>     Priority: Minor
>  Attachments: too-many-links.patch
>
> Came across a page that caused an OOME because there is no upper bound on the link count in a CrawlDatum.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers