[prev in list] [next in list] [prev in thread] [next in thread] 

List:       lucene-dev
Subject:    [jira] [Commented] (LUCENE-627) highlighter problems with overlapping tokens
From:       "David Smiley (JIRA)" <jira () apache ! org>
Date:       2014-12-31 22:01:13
Message-ID: JIRA.12345948.1152835819000.123282.1420063273404 () Atlassian ! JIRA
[Download RAW message or body]


    [ https://issues.apache.org/jira/browse/LUCENE-627?page=com.atlassian.jira.plugin. \
system.issuetabpanels:comment-tabpanel&focusedCommentId=14262458#comment-14262458 ] 

David Smiley commented on LUCENE-627:
-------------------------------------

[~ryguasu] HighlighterTest was affected by r422302.

I think that _either_ {{<B>Hi</B>-<B>Speed</B>10 foo}} OR {{<B>Hi-Speed</B>10 foo}} \
are acceptable results from a boolean query of "hi" and "speed".

> highlighter problems with overlapping tokens
> --------------------------------------------
> 
> Key: LUCENE-627
> URL: https://issues.apache.org/jira/browse/LUCENE-627
> Project: Lucene - Core
> Issue Type: Bug
> Components: core/other
> Affects Versions: 2.1
> Reporter: Yonik Seeley
> Fix For: 2.1
> 
> Attachments: Highlighter.java.diff, highlight_overlap.diff
> 
> 
> The lucene highlighter has problems when tokens that overlap are generated.
> For example, if analysis of iPod generates the tokens "i", "pod", "ipod" (with pod \
> and ipod in the same position), then the highlighter will output this as iipod, \
> regardless of if any of those tokens are highlighted. Discovered via \
> http://issues.apache.org/jira/browse/SOLR-24



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic