[prev in list] [next in list] [prev in thread] [next in thread]
List: lucene-user
Subject: Re: XML support in Lucene
From: Otis Gospodnetic <otis_gospodnetic () yahoo ! com>
Date: 2003-11-25 16:18:19
[Download RAW message or body]
Check out the Resources page on Lucene's site. Look for a link to the
article about Lucene and Digester. You can also look at Lucene's
Sandbox for some ideas.
Otis
--- ambiesense@gmx.de wrote:
> Hello group,
>
> does Lucene offer an effective and flexible way to treat XML files. I
> know
> that as soon as an InputStream is provided Lucene can basically index
> (evtl.
> after clearning) everything. How is it with XML files?
>
> If there is a way is it possbile to have one big XML file with many
> individual parts in it. This should be considered as docuemnts and
> the repeative XML
> tags as fields.
>
> Here an example:
>
> <MySMSList>
> <SMS>
> <From>Tim</From>
> <Content>How are you? Tom</Content>
> </SMS>
> <SMS>
> <From>Linda</From>
> <Content>bla bla bla</Content>
> <SMS>
> </MySMSList>
>
>
> Does somebody has already developed classes which go though this XML
> file,
> create TWO documents with the fields "From" and "Content" and fill in
> the text
> between the tags ? The Indexing business should then be the same
> since it is
> abstract against the Document object. The same for the search
> process. The
> search process however could be optimised with stuctural information
> (i.e.
> only search in "Content")...
>
> Cheers,
> Ralph
>
> --
> NEU FÜR ALLE - GMX MediaCenter - für Fotos, Musik, Dateien...
> Fotoalbum, File Sharing, MMS, Multimedia-Gruß, GMX FotoService
>
> Jetzt kostenlos anmelden unter http://www.gmx.net
>
> +++ GMX - die erste Adresse für Mail, Message, More! +++
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
__________________________________
Do you Yahoo!?
Free Pop-Up Blocker - Get it now
http://companion.yahoo.com/
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic