[prev in list] [next in list] [prev in thread] [next in thread]
List: lucene-dev
Subject: Re: New "Did you mean" feature: How to approach?
From: Dave Spencer <dave-lucene-dev () tropo ! com>
Date: 2005-03-29 11:16:50
Message-ID: 42493922.7080800 () tropo ! com
[Download RAW message or body]
Otis Gospodnetic wrote:
>Maybe the spellchecker at the bottom of the following URL will help:
>
> http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/
>
>
Yeah, I did this, the "ngram based spelling corrector".
You build a normal lucene index as you always do
then run NGramSpeller, analyzes your index to determine which ngrams are
used, and saves this in a separate Lucene index
then you call NGramSpeller.suggestUsingNGrams() if a users query
doesn't return too many results
weblog entry here w/ more info and a test page:
http://www.searchmorph.com/weblog/index.php?id=23
--
Some chance you'll be instested in the "more like this" similarity query
generator - see the "similar" tree in the sandbox
-- Dave
>Otis
>
>
>--- "Stefan F. Keller" <sfkeller@gmail.com> wrote:
>
>
>>We would like to add "Did you mean..." to our Lucene-based search
>>engine www.geometa.info. Doug mentioned in his recent interview that
>>this feature would be not too complicated to implement.
>>
>>First I considered integrating a spelling checker (through JADT-API)
>>but one would rather expect "nearby" words which really exist in the
>>document pool. Some people have mentioned this feature here (or on
>>the
>>java-user-list).
>>
>>=> Is anyone aware of any real developments in this area?
>>
>>Ideally, one would combine the data already maintained by the
>>IndexReader class with an existing similarity search algorithm (like
>>trigram)...
>>
>>=> Any ideas?
>>
>>Stefan
>>
>>---------------------------------------------------------------------
>>To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
>>For additional commands, e-mail: java-dev-help@lucene.apache.org
>>
>>
>>
>>
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
>For additional commands, e-mail: java-dev-help@lucene.apache.org
>
>
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic