[prev in list] [next in list] [prev in thread] [next in thread]
List: lucene-dev
Subject: Re: QueryParser Strips "++" out of my word "c++"
From: "Mike Klaas" <mike.klaas () gmail ! com>
Date: 2007-01-27 3:00:43
Message-ID: 3d2ce8cb0701261900m4bf315d9s5ecbbc7b010444f2 () mail ! gmail ! com
[Download RAW message or body]
On 1/26/07, Joe Tang <joe.tang@workmetro.com> wrote:
>
> Thanks for you reply Doron. It works partly on me.
>
> How should I customize the Analyzer so as to have the functionality of
> StandardAnalyzer as well as not stripping out some of the charactors?
Joe,
See nutch's version of StandardAnalyzer: it adds a few rules to
special case C++ and C#.
(nutch-trunk/src/java/org/apache/nutch/analysis/NutchAnalysis.jj).
You may be able to pluck those rules and insert them into Lucene's
StandardAnalyzer (after recompiling with javaCC).
-Mike
---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic