[prev in list] [next in list] [prev in thread] [next in thread] 

List:       lucene-dev
Subject:    [jira] [Commented] (LUCENE-2025) Ability to turn off the store for an index
From:       "Robert Muir (JIRA)" <jira () apache ! org>
Date:       2012-08-31 14:41:08
Message-ID: 230011395.22499.1346424068450.JavaMail.jiratomcat () arcas
[Download RAW message or body]


    [ https://issues.apache.org/jira/browse/LUCENE-2025?page=com.atlassian.jira.plugin \
.system.issuetabpanels:comment-tabpanel&focusedCommentId=13446003#comment-13446003 ] 

Robert Muir commented on LUCENE-2025:
-------------------------------------

One simple way to do this today is to just use a codec that has a NoStoredFieldsImpl,
Throws exception in its writer impl if you ask it to actually write any stored fields \
 (e.g. startDocument\(n\) is called where n > 0), and does nothing in its reader \
impl.

I think for the typical case its fairly uncommon, i looked into seeing if we could
optimize this case for Lucene40's impl, but it introduces a lot of scary situations
for things like bulk merge.

So for now I really think this is a simple safe way at the moment, if someone wants \
to  turn it off they just set this as their codec on indexwriter.

                
> Ability to turn off the store for an index
> ------------------------------------------
> 
> Key: LUCENE-2025
> URL: https://issues.apache.org/jira/browse/LUCENE-2025
> Project: Lucene - Core
> Issue Type: New Feature
> Components: core/index
> Reporter: Michael Busch
> Assignee: Michael Busch
> Priority: Minor
> Labels: gsoc2011, gsoc2012, lucene-gsoc-11, lucene-gsoc-12, mentor
> Fix For: 4.1
> 
> 
> It would be really good in combination with parallel indexing if the
> Lucene store could be turned off entirely for an index. 
> The reason is that part of the store is the FieldIndex (.fdx file),
> which contains an 8 bytes pointer for each document in a segment, even
> if a document does not contain any stored fields.
> With parallel indexing we will want to rewrite certain parallel
> indexes to update them, and if such an update affects only a small
> number of documents it will be a waste if you have to write the .fdx
> file every time.
> So in the case where you only want to update a data structure in the
> inverted index it makes sense to separate your index into multiple
> parallel indexes, where the ones you want to update don't contain any
> stored fields.
> It'd be also great to not only allow turning off the store but to make
> it customizable, similarly to what flexible indexing wants to achieve
> regarding the inverted index.
> As a start I'd be happy with the ability to simply turn off the store and to
> add more flexibility later.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic