List: lucene-user
Subject: RE: Lucene indexing PPT
From: "mcarcelen" <mcarcelen () isoco ! com>
Date: 2006-06-30 12:03:20
Message-ID: 20060630120358.94949C069 () mad ! isoco ! net
Hello Nick!
Thanks for your help, it's useful to me.
Bye
-----Original Message-----
From: Nick Burch [mailto:nick@torchbox.com]
Sent: Friday, 30 June 2006 12:19
To: java-user@lucene.apache.org
Subject: Re: Lucene indexing PPT
On Fri, 30 Jun 2006, mcarcelen wrote:
> I'm trying to build an index with PPT files. I have downloaded the POI
> API, "poi.bin.3.0" and "poi.src.3.0", but I don't know where I should
> unzip them. I'd like to build the index from the command line, the same
> way as
I don't know about the Lucene demo, but I can help with your POI issue.
You only need the poi bin package, but you do need to unpack it. In there
you'll find three jar files - for PowerPoint stuff, you'll just need to
put the poi-3.0 and poi-scratchpad-3.0 jars on your classpath.
You can then use org.apache.poi.hslf.extractor.PowerPointExtractor to do
your text extraction.
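
As a rough illustration of the two steps above (jars on the classpath, then text extraction), here is a minimal sketch. The class name PptTextSketch is hypothetical. It loads the extractor by reflection so that it compiles even without the POI jars, and it assumes that PowerPointExtractor has a public constructor taking a file name and a getText() method; check the Javadoc of the POI version you unpacked before relying on either.

```java
import java.lang.reflect.Method;

// Sketch only: uses reflection so it compiles without the POI jars present.
class PptTextSketch {

    // Class named in the message above; it ships in the scratchpad jar.
    static final String EXTRACTOR =
            "org.apache.poi.hslf.extractor.PowerPointExtractor";
    // A core class from the main poi jar, used here only as a presence check.
    static final String POIFS =
            "org.apache.poi.poifs.filesystem.POIFSFileSystem";

    // Returns true if the named class can be found on the current classpath.
    static boolean isOnClasspath(String className) {
        try {
            Class.forName(className, false, PptTextSketch.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    // Builds the extractor from a file name and returns the extracted text.
    // Assumption: a public PowerPointExtractor(String) constructor and a
    // no-argument getText() method exist in the POI version in use.
    static String extractText(String pptPath) throws Exception {
        Class<?> type = Class.forName(EXTRACTOR);
        Object extractor = type.getConstructor(String.class).newInstance(pptPath);
        Method getText = type.getMethod("getText");
        return (String) getText.invoke(extractor);
    }

    public static void main(String[] args) throws Exception {
        if (!isOnClasspath(POIFS) || !isOnClasspath(EXTRACTOR)) {
            System.err.println("POI jars not found; add both to the classpath.");
            return;
        }
        if (args.length != 1) {
            System.err.println("usage: java PptTextSketch <file.ppt>");
            return;
        }
        System.out.println(extractText(args[0]));
    }
}
```

It could be run with something like `java -cp poi-3.0.jar:poi-scratchpad-3.0.jar:. PptTextSketch slides.ppt` (use `;` instead of `:` on Windows). The jar file names here are guessed from the names mentioned above, so substitute the actual file names from the unpacked package. Once the classpath is known to be right, calling the extractor directly rather than through reflection is the simpler choice.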
Perhaps someone can advise you on how to integrate this into the demo.
Nick
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
---------------------------------------------------------------------