[prev in list] [next in list] [prev in thread] [next in thread] 

List:       solr-dev
Subject:    [jira] [Updated] (LUCENE-2899) Add OpenNLP Analysis capabilities as a module
From:       "Em (JIRA)" <jira () apache ! org>
Date:       2012-09-30 17:22:08
Message-ID: 62106700.145909.1349025728837.JavaMail.jiratomcat () arcas
[Download RAW message or body]


     [ https://issues.apache.org/jira/browse/LUCENE-2899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel \
]

Em updated LUCENE-2899:
-----------------------

    Attachment: OpenNLPFilter.java
                OpenNLPTokenizer.java

Some Attributes were not reset (i.e. "first"-Attribute in OpenNLPTokenizer and \
"indexToken" in OpenNLPFilter) correctly.

Since I had trouble applying your patch, I'd like to provide the working source code. \
Please, create a patch from the current Trunk.   
> Add OpenNLP Analysis capabilities as a module
> ---------------------------------------------
> 
> Key: LUCENE-2899
> URL: https://issues.apache.org/jira/browse/LUCENE-2899
> Project: Lucene - Core
> Issue Type: New Feature
> Components: modules/analysis
> Reporter: Grant Ingersoll
> Assignee: Grant Ingersoll
> Priority: Minor
> Attachments: LUCENE-2899.patch, LUCENE-2899.patch, LUCENE-2899.patch, \
> LUCENE-2899.patch, LUCENE-2899.patch, LUCENE-2899.patch, OpenNLPFilter.java, \
> OpenNLPTokenizer.java, opennlp_trunk.patch 
> 
> Now that OpenNLP is an ASF project and has a nice license, it would be nice to have \
> a submodule (under analysis) that exposed capabilities for it. Drew Farris, Tom \
>                 Morton and I have code that does:
> * Sentence Detection as a Tokenizer (could also be a TokenFilter, although it would \
>                 have to change slightly to buffer tokens)
> * NamedEntity recognition as a TokenFilter
> We are also planning a Tokenizer/TokenFilter that can put parts of speech as either \
> payloads (PartOfSpeechAttribute?) on a token or at the same position. I'd propose \
> it go under: modules/analysis/opennlp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic