[prev in list] [next in list] [prev in thread] [next in thread]
List: solr-user
Subject: Re: Split one string into many fields
From: "Ryan McKinley" <ryantxu () gmail ! com>
Date: 2007-01-22 20:28:28
Message-ID: 176776ee0701221228k7571b025pbf106eec53f5bfea () mail ! gmail ! com
[Download RAW message or body]
looks like we wont save the discussion for later :)
>
> At this point though, I can't for the life of me remeber what Ryan said to
> convince me that it made sense to have a DocumentParser concept that
> UpdateHandlers could delegate to -- as opposed to the UpdateHandler doing
> it directly :)
>
We were discussing a handler that crawls an svn repository and another
that may accept a single file. They should be able to share the logic
of parsing a single ContentStream into a Document.
Essentially, I was suggesting making a standard DocumentHandler
framework (like the one in LIA that gets pointed to at least once a
week for people wondering how to parse XML/PDF/TXT/etc into lucene a
Document)
With SOLR-104, this will be straight forward to implement. I totally
agree it probably belongs in a 'tools' or 'plugins' directory along
with other things that are useful, but not the focus of solr.
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic