[prev in list] [next in list] [prev in thread] [next in thread] 

List:       lucene-user
Subject:    RE: Lucene indexing PPT
From:       "mcarcelen" <mcarcelen () isoco ! com>
Date:       2006-06-30 12:03:20
Message-ID: 20060630120358.94949C069 () mad ! isoco ! net
[Download RAW message or body]

Hello Nick!
Thanks for your help, it´s useful for me
Bye


-----Mensaje original-----
De: Nick Burch [mailto:nick@torchbox.com] 
Enviado el: viernes, 30 de junio de 2006 12:19
Para: java-user@lucene.apache.org
Asunto: Re: Lucene indexing PPT

On Fri, 30 Jun 2006, mcarcelen wrote:
> I´m trying to build a index with PPT files. I have downloaded the api
> POI, "poi.bin.3.0" and "poi.src.3.0", but I don´t know where may I have
> to unzip them. I´d like to build the index by the command line, the same
> way as

I don't know about the lucene demo, but I can help with your POI issue.
You only need the poi bin package, but you do need to unpack it. In there
you'll find three jar files - for PowerPoint stuff, you'll just need to
put the poi-3.0 and poi-scratchpad-3.0 jars on your classpath.

You can then use org.apache.poi.hslf.extractor.PowerPointExtractor to do
your text extraction.


Perhaps someone can advise you on how to integrate this into the demo.

Nick

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic