[prev in list] [next in list] [prev in thread] [next in thread] 

List:       lucene-user
Subject:    Re: Stemming german words
From:       Markus Fischer <markus () fischer ! name>
Date:       2006-01-31 21:32:59
Message-ID: 43DFD78B.4060300 () fischer ! name
[Download RAW message or body]

Jonathan,

what should I say, I'm feeling like an idiot now. Of course you're 
right. This actually solves the issue ;)

thanks and sorry for wasting time,
- Markus

Jonathan O'Connor wrote:
> Markus,
> As I'm sure you know, "sucht" is also an inflection of "suchen", e.g. 
> "er sucht etwas". Sadly, you may be able to fix this one problem, but 
> there will be hundreds of other problems too. Stemmers are never 
> perfect. You just have to live with it.
> 
> Most users won't have a problem with that. If they want want to search 
> for addiction, then they will probably add "drug" or "alcohol", etc... 
> to the search.
> Ciao,
> Jonathan O'Connor
> XCOM Dublin
> Inactive hide details for Markus Fischer <markus@fischer.name>Markus 
> Fischer <markus@fischer.name>
> 
> 
>                         *Markus Fischer <markus@fischer.name>*
> 
>                         31/01/2006 12:49
>                         Please respond to
>                         java-user@lucene.apache.org
> 
> 	
> 
> To
> 	
> java-user@lucene.apache.org
> 
> cc
> 	
> 
> Subject
> 	
> Stemming german words
> 
> 	
> 
> 
> Hi,
> 
> I'm currently using the GermanStemmer and it works well. However today
> I've found two words which get stemmed to the same stemm-word.
> 
> "Suche" and "Sucht" both get stemmed to the same "such" it seems,
> however they've completely different meanings in german (Suche = the
> Search, Sucht => addicttion).
> 
> Is there a way to tune the stemmer or are there alternatives available
> or should I look for another stemmer for the german language?
> 
> thanks for any pointers,
> - Markus
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
> 
> 
> 
> 
> 
> *** XCOM AG Legal Disclaimer ***
> 
> Diese E-Mail einschliesslich ihrer Anhaenge ist vertraulich und ist 
> allein für den Gebrauch durch den vorgesehenen Empfaenger bestimmt. 
> Dritten ist das Lesen, Verteilen oder Weiterleiten dieser E-Mail 
> untersagt. Wir bitten, eine fehlgeleitete E-Mail unverzueglich 
> vollstaendig zu loeschen und uns eine Nachricht zukommen zu lassen.
> 
> This email may contain material that is confidential and for the sole 
> use of the intended recipient. Any review, distribution by others or 
> forwarding without express permission is strictly prohibited. If you are 
> not the intended recipient, please contact the sender and delete all copies.
> 
> Hauptsitz: Bahnstrasse 33, D-47877 Willich, USt-IdNr.: DE 812 885 664
> Kommunikation: Telefon +49 2154 9209-70, Telefax +49 2154 9209-900, 
> www.xcom.de
> Handelsregister: Amtsgericht Krefeld, HRB 10340
> Vorstand: Matthias Albrecht, Renate Becker-Grope, Marco Marty
> Vorsitzender des Aufsichtsrates: Stephan Steuer
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic