[prev in list] [next in list] [prev in thread] [next in thread] 

List:       solr-dev
Subject:    [jira] Created: (SOLR-321) misleading comment about spellchecker's
From:       "Daniel Naber (JIRA)" <jira () apache ! org>
Date:       2007-07-29 10:23:52
Message-ID: 22035364.1185704632818.JavaMail.jira () brutus
[Download RAW message or body]

misleading comment about spellchecker's termSourceField in solrconfig.xml
-------------------------------------------------------------------------

                 Key: SOLR-321
                 URL: https://issues.apache.org/jira/browse/SOLR-321
             Project: Solr
          Issue Type: Bug
          Components: documentation
            Reporter: Daniel Naber


The config file comment says this about "termSourceField":

"the field in your schema that you want to be able to build
your spell index on. This should be a field that uses a very
simple FieldType without a lot of Analysis (ie: string)"

I think this is wrong or at least misleading: the Lucene spellchecker uses a TermEnum \
to access the terms of this field, so the only requirement is that the field needs to \
be indexed. Isn't the common usecase of the spellchecker to use all of your terms in \
e.g. "body" as candidates for spellchecking? Then the field given for termSourceField \
should be e.g. "body", which is usually indexed and tokenized.

Of course, if you want "new yorc" to be corrected to "new york" this won't work with \
a tokenized field. I suggest this text for the comment:

The field in your schema that you want to be able to build your spell index on. This \
must be a field that is indexed. If it is of type "text" all the terms in that field \
will be used as separate candidates for spellchecking, if it is of type "string" the \
complete content of that field is considered a single term. This might me useful if \
you have a field whose only content is something like 'new york' and the text you \
want to have spell checked is 'new yrok'.

(besied that, spellchecking more than one term doesn't seem to be supported, I'll see \
if I add a comment about that to the wiki)



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic