[prev in list] [next in list] [prev in thread] [next in thread] 

List:       solr-dev
Subject:    [jira] Updated: (SOLR-14) Add the ability to preserve the original
From:       "Geoffrey Young (JIRA)" <jira () apache ! org>
Date:       2008-06-30 14:37:45
Message-ID: 1628313577.1214836665352.JavaMail.jira () brutus
[Download RAW message or body]


     [ https://issues.apache.org/jira/browse/SOLR-14?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel \
]

Geoffrey Young updated SOLR-14:
-------------------------------

    Attachment: SOLR-14.patch

this new patch addresses three additional cases

  o words prefixed with delimiters
  o words postfixed with delimiters
  o words that are all delimiters

there's a special place for people who name themselves  !!! and expect to be found.

the input string

{noformat}
  test 404-123 $foo bar& beer !!! *foo baz's biff
{noformat}

produces

{noformat}
test 404-123 404 123 404123 foo $foo bar bar& beer !!! foo *foo baz baz's biff
{noformat}

using options

{noformat}
org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=1, \
generateNumberParts=1, catenateWords=1, generateWordParts=1, catenateAll=1, \
catenateNumbers=1} {noformat}



> Add the ability to preserve the original term when using WordDelimiterFilter
> ----------------------------------------------------------------------------
> 
> Key: SOLR-14
> URL: https://issues.apache.org/jira/browse/SOLR-14
> Project: Solr
> Issue Type: Improvement
> Components: search
> Reporter: Richard "Trey" Hyde
> Attachments: SOLR-14.patch, SOLR-14.patch, SOLR-14.patch, TokenizerFactory.java, \
> WordDelimiterFilter.patch, WordDelimiterFilter.patch 
> 
> When doing prefix searching, you need to hang on to the original term othewise \
>                 you'll miss many matches you should be making.
> Data: ABC-12345
> WordDelimiterFitler may change this into
> ABC 12345 ABC12345
> A user may enter a search such as 
> ABC\-123*
> Which will fail to find a match given the above scenario.
> The attached patch will allow the use of the "preserveOriginal" option to \
> WordDelimiterFilter and will analyse as ABC 12345 ABC12345  ABC-12345 
> in which case we will get a postive match.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic