[prev in list] [next in list] [prev in thread] [next in thread]
List: lucene-user
Subject: WebLucene 0.3 release:support CJK, use sax based indexing, docID based result sorting and xml format
From: "Che Dong" <chedong () hotmail ! com>
Date: 2003-11-30 13:57:18
[Download RAW message or body]
http://sourceforge.net/projects/weblucene/
WebLucene:
Lucene search engine XML interface, provided sax based indexing, indexing sequence \
based result sorting and xml output with highlight support.The CJKTokenizer support \
Chinese Japanese and Korean with Westen language simultaneously.
The key features:
1 The bi-gram based CJK support: org/apache/lucene/analysis/cjk/CJKTokenizer
2 docID based result sorting: org/apache/lucene/search/IndexOrderSearcher
3 xml output: com/chedong/weblucene/search/DOMSearcher
4 sax based indexing: com/chedong/weblucene/index/SAXIndexer
5 token based highlighter:
reverse StopTokenzier:
org/apache/lucene/anlysis/HighlightAnalyzer.java
HighlightFilter.java
with abstract:
com/chedong/weblucene/search/WebluceneHighlighter
6 A simplified query parser:
google like syntax with term limit
org/apache/lucene/queryParser/SimpleQueryParser
modified from early version of Lucene :)
Regards
Che, Dong
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic