[prev in list] [next in list] [prev in thread] [next in thread]
List: solr-dev
Subject: [jira] Updated: (SOLR-14) Add the ability to preserve the original
From: "Geoffrey Young (JIRA)" <jira () apache ! org>
Date: 2008-06-30 14:37:45
Message-ID: 1628313577.1214836665352.JavaMail.jira () brutus
[Download RAW message or body]
[ https://issues.apache.org/jira/browse/SOLR-14?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel \
]
Geoffrey Young updated SOLR-14:
-------------------------------
Attachment: SOLR-14.patch
this new patch addresses three additional cases
o words prefixed with delimiters
o words postfixed with delimiters
o words that are all delimiters
there's a special place for people who name themselves !!! and expect to be found.
the input string
{noformat}
test 404-123 $foo bar& beer !!! *foo baz's biff
{noformat}
produces
{noformat}
test 404-123 404 123 404123 foo $foo bar bar& beer !!! foo *foo baz baz's biff
{noformat}
using options
{noformat}
org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=1, \
generateNumberParts=1, catenateWords=1, generateWordParts=1, catenateAll=1, \
catenateNumbers=1} {noformat}
> Add the ability to preserve the original term when using WordDelimiterFilter
> ----------------------------------------------------------------------------
>
> Key: SOLR-14
> URL: https://issues.apache.org/jira/browse/SOLR-14
> Project: Solr
> Issue Type: Improvement
> Components: search
> Reporter: Richard "Trey" Hyde
> Attachments: SOLR-14.patch, SOLR-14.patch, SOLR-14.patch, TokenizerFactory.java, \
> WordDelimiterFilter.patch, WordDelimiterFilter.patch
>
> When doing prefix searching, you need to hang on to the original term othewise \
> you'll miss many matches you should be making.
> Data: ABC-12345
> WordDelimiterFitler may change this into
> ABC 12345 ABC12345
> A user may enter a search such as
> ABC\-123*
> Which will fail to find a match given the above scenario.
> The attached patch will allow the use of the "preserveOriginal" option to \
> WordDelimiterFilter and will analyse as ABC 12345 ABC12345 ABC-12345
> in which case we will get a postive match.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic