List: lucene-dev
Subject: [jira] [Commented] (SOLR-12284) WordBreakSolrSpellChecker incorrectly adds parenthesis when breaking
From: "ASF subversion and git services (JIRA)" <jira () apache ! org>
Date: 2018-04-30 18:55:00
Message-ID: JIRA.13155713.1524836673000.17898.1525114500140 () Atlassian ! JIRA
[ https://issues.apache.org/jira/browse/SOLR-12284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16458885#comment-16458885 ]
ASF subversion and git services commented on SOLR-12284:
--------------------------------------------------------
Commit 162a077ef3da3e7a4f8aab6c5e693c0fc812dfdf in lucene-solr's branch refs/heads/branch_7x from jdyer1
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=162a077 ]
SOLR-12284: Stop adding parenthesis to word-break suggestions, unless query uses boolean operators.
> WordBreakSolrSpellChecker incorrectly adds parenthesis when breaking words
> --------------------------------------------------------------------------
>
> Key: SOLR-12284
> URL: https://issues.apache.org/jira/browse/SOLR-12284
> Project: Solr
> Issue Type: Improvement
> Security Level: Public(Default Security Level. Issues are Public)
> Components: spellchecker
> Affects Versions: 7.3
> Reporter: James Dyer
> Assignee: James Dyer
> Priority: Minor
> Attachments: SOLR-12284.patch
>
>
> When WordBreakSolrSpellChecker breaks a single word into multiple words, the
> collation queries wrap the replacement for the original term in parentheses.
> In some cases this causes required terms to become optional, and users get
> spurious, nonsensical collation results. For instance, if I search:
>     +eward +smith
> ...and +ward +smith is a match, it might give a collation like:
>     (+e +ward) +smith
> ...This requires either "e" or "ward" to be present, but not necessarily both.
> Users are more likely to want both terms to be required, so it would be better
> not to add the parentheses. This might be the cause of SOLR-5995 and of this
> Stack Overflow question:
> https://stackoverflow.com/questions/23849747/solr-wordbreak-spellchecker-breaking-words-into-letters-excessive-breaking
>
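The behaviour described in the commit message can be illustrated with a small, self-contained Java sketch. This is not the code from the attached patch or from Solr itself: the class WordBreakCollationSketch and the methods usesBooleanOperators and collate are hypothetical names invented for illustration, and treating "boolean operators" as the keywords AND, OR and NOT is an assumption. The sketch only shows the idea of wrapping the broken-up term in parentheses when the query uses such operators, and leaving it unwrapped otherwise so that a required (+) term stays required for each part.

```java
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only; NOT Solr's actual implementation.
final class WordBreakCollationSketch {

    // Assumption: "boolean operators" means the keywords AND, OR, NOT.
    static boolean usesBooleanOperators(String query) {
        for (String token : query.trim().split("\\s+")) {
            if (token.equals("AND") || token.equals("OR") || token.equals("NOT")) {
                return true;
            }
        }
        return false;
    }

    // Replaces the misspelled term with its word-break parts.
    // A leading '+' on the original term is copied onto every part.
    // Parentheses are added only when the query uses boolean operators.
    static String collate(String query, String misspelled, List<String> parts) {
        boolean wrap = usesBooleanOperators(query);
        StringBuilder out = new StringBuilder();
        for (String token : query.trim().split("\\s+")) {
            if (out.length() > 0) {
                out.append(' ');
            }
            String prefix = token.startsWith("+") ? "+" : "";
            String bare = token.substring(prefix.length());
            if (!bare.equals(misspelled)) {
                out.append(token); // unrelated token: keep as-is
                continue;
            }
            StringBuilder broken = new StringBuilder();
            for (String part : parts) {
                if (broken.length() > 0) {
                    broken.append(' ');
                }
                broken.append(prefix).append(part);
            }
            out.append(wrap ? "(" + broken + ")" : broken.toString());
        }
        return out.toString();
    }

    public static void main(String[] args) {
        List<String> parts = Arrays.asList("e", "ward");
        System.out.println(collate("+eward +smith", "eward", parts));
        System.out.println(collate("eward AND smith", "eward", parts));
    }
}
```

With the query from the issue description, the sketch yields a collation in which "e", "ward" and "smith" are all required, rather than a parenthesised group that the query parser may treat as optional.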
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org