List:       lucene-user
Subject:    WebLucene 0.3 release:support CJK, use sax based indexing, docID based result sorting and xml format
From:       "Che Dong" <chedong () hotmail ! com>
Date:       2003-11-30 13:57:18

http://sourceforge.net/projects/weblucene/

WebLucene:
An XML interface for the Lucene search engine. It provides SAX based indexing,
result sorting based on indexing sequence, and XML output with highlight support.
The CJKTokenizer supports Chinese, Japanese and Korean together with Western
languages in the same text.

The key features:
1 The bi-gram based CJK support: org/apache/lucene/analysis/cjk/CJKTokenizer

2 docID based result sorting: org/apache/lucene/search/IndexOrderSearcher

3 XML output: com/chedong/weblucene/search/DOMSearcher

4 SAX based indexing: com/chedong/weblucene/index/SAXIndexer

5 token based highlighter:
    reverse StopTokenizer:
    org/apache/lucene/analysis/HighlightAnalyzer.java
                               HighlightFilter.java
    with abstract:
    com/chedong/weblucene/search/WebluceneHighlighter

6 A simplified query parser:
    Google-like syntax with a term limit
    org/apache/lucene/queryParser/SimpleQueryParser
    modified from an early version of Lucene :)
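
For readers unfamiliar with the bi-gram approach named in feature 1, here is a
rough sketch of the general idea. It is not the code of CJKTokenizer and does
not use the Lucene API. The class name BigramSketch, the choice of Unicode
blocks, and the handling of a single isolated CJK character are all assumptions
made for illustration. Runs of CJK characters become overlapping two-character
tokens, while Western text is split into lower-cased words.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration of bi-gram tokenizing; not the WebLucene source.
class BigramSketch {

    // Assumption: only these four Unicode blocks are treated as CJK here.
    static boolean isCjk(char c) {
        Character.UnicodeBlock b = Character.UnicodeBlock.of(c);
        return b == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS
            || b == Character.UnicodeBlock.HIRAGANA
            || b == Character.UnicodeBlock.KATAKANA
            || b == Character.UnicodeBlock.HANGUL_SYLLABLES;
    }

    static boolean isWestern(char c) {
        return Character.isLetterOrDigit(c) && !isCjk(c);
    }

    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            int start = i;
            if (isCjk(text.charAt(i))) {
                // Consume the whole CJK run, then emit overlapping pairs.
                while (i < text.length() && isCjk(text.charAt(i))) i++;
                if (i - start == 1) {
                    tokens.add(text.substring(start, i)); // lone character
                } else {
                    for (int j = start; j + 1 < i; j++) {
                        tokens.add(text.substring(j, j + 2));
                    }
                }
            } else if (isWestern(text.charAt(i))) {
                // Consume a Western word and lower-case it.
                while (i < text.length() && isWestern(text.charAt(i))) i++;
                tokens.add(text.substring(start, i).toLowerCase(Locale.ROOT));
            } else {
                i++; // skip whitespace and punctuation
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [lucene, 中文, 文搜, 搜索, test]
        System.out.println(tokenize("Lucene 中文搜索 test"));
    }
}
```

Because the pairs overlap, a query for a two-character word matches wherever
those two characters are adjacent in the text, without needing a dictionary
for word segmentation.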

Regards

Che, Dong

