List:       lucene-dev
Subject:    Re: QueryParser Strips "++" out of my word "c++"
From:       "Mike Klaas" <mike.klaas () gmail ! com>
Date:       2007-01-27 3:00:43
Message-ID: 3d2ce8cb0701261900m4bf315d9s5ecbbc7b010444f2 () mail ! gmail ! com

On 1/26/07, Joe Tang <joe.tang@workmetro.com> wrote:
>
> Thanks for your reply, Doron. It partly works for me.
>
> How should I customize the Analyzer so that it keeps the functionality
> of StandardAnalyzer but does not strip out some of the characters?

Joe,

See Nutch's version of StandardAnalyzer
(nutch-trunk/src/java/org/apache/nutch/analysis/NutchAnalysis.jj):
it adds a few rules that special-case C++ and C#. You may be able to
pluck those rules out, insert them into the grammar for Lucene's
StandardAnalyzer, and then regenerate the tokenizer with JavaCC.
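
To illustrate the general idea of special-casing such terms ahead of the
generic word rule, here is a rough standalone sketch. It is not the Nutch
grammar and does not use Lucene or JavaCC at all; the class name
SpecialCaseTokenizer, the tokenize method, and the regular expression are
made up for illustration, and the lowercasing is only an assumption about
what you would want downstream.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Lucene or Nutch.
public class SpecialCaseTokenizer {

    // Alternatives are tried left to right at each position, so the two
    // special cases are listed before the generic "run of letters/digits"
    // rule. Anything that matches none of them (spaces, punctuation,
    // stray '+' or '#') is skipped.
    private static final Pattern TOKEN = Pattern.compile(
            "c\\+\\+"               // special case: C++
            + "|c#"                 // special case: C#
            + "|[\\p{L}\\p{N}]+",   // generic word
            Pattern.CASE_INSENSITIVE);

    public static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        Matcher m = TOKEN.matcher(text);
        while (m.find()) {
            // Lowercase each token (an assumption made for this sketch).
            tokens.add(m.group().toLowerCase(Locale.ROOT));
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [i, know, c++, and, c#, not, c]
        System.out.println(tokenize("I know C++ and C#, not c+"));
    }
}
```

The ordering matters: if the generic word rule came first, "C++" would
be reduced to "c" again, which is the behaviour you are seeing.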

-Mike
