From lucene-user Mon Jan 11 15:40:38 2016 From: Michal Hlavac Date: Mon, 11 Jan 2016 15:40:38 +0000 To: lucene-user Subject: identifier n-gram tokenizer Message-Id: <5772664.ZvmmFlCMAI () hlavki> X-MARC-Message: https://marc.info/?l=lucene-user&m=145252685305669 Hello, I published some token filters that can be used to tokenize some kind of identifiers into punctation delimited n-grams (e.g. ip address). I think it needs some optimization, but it works for now. https://github.com/hlavki/lucene-analyzers You can find example of usage in unit test: https://github.com/hlavki/lucene-analyzers/blob/master/src/test/java/eu/hlavki/lucene/analysis/identifier/IdentifierNGramFilterTest.java m. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org