[prev in list] [next in list] [prev in thread] [next in thread]
List: lucene-dev
Subject: [jira] Updated: (LUCENE-769) [PATCH] Performance improvement for
From: "Artem Vasiliev (JIRA)" <jira () apache ! org>
Date: 2007-01-31 22:20:05
Message-ID: 32484946.1170282005820.JavaMail.jira () brutus
[Download RAW message or body]
[ https://issues.apache.org/jira/browse/LUCENE-769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel \
]
Artem Vasiliev updated LUCENE-769:
----------------------------------
Attachment: selfContained.patch
Removed several inner classes and documents cache from StoredFieldSortFactory. Now \
the whole class is pretty clean and simple. Checked the timings in the test - they \
remain pretty much the same.
Checked it in Sharehound with more large index (~1mln documents), it sorts resultset \
with 4000 docs in ~0,2s now taking 70M RAM, that's fine to me.
With standart new Sort(field, false) it takes (for the first search on a field) about \
30-40s and quite a lot of memory (after several sorted searches different fields it \
took about 500M).
> [PATCH] Performance improvement for some cases of sorted search
> ---------------------------------------------------------------
>
> Key: LUCENE-769
> URL: https://issues.apache.org/jira/browse/LUCENE-769
> Project: Lucene - Java
> Issue Type: Improvement
> Affects Versions: 2.0.0
> Reporter: Artem Vasiliev
> Attachments: DocCachingSorting.patch, DocCachingSorting.patch, \
> IndexReaderUtils.java, QueryFilter.java, selfContained.patch, selfContained.patch, \
> selfContained.patch, selfContained.patch, StoredFieldSorting.patch
>
> It's a small addition to Lucene that significantly lowers memory consumption and \
> improves performance for sorted searches with frequent index updates and relatively \
> big indexes (>1mln docs) scenario. This solution supports only single-field sorting \
> currently (which seem to be quite popular use case). Multiple fields support can be \
> added without much trouble. The solution is this: documents from the sorting set \
> (instead of given field's values from the whole index - current FieldCache \
> approach) are cached in a WeakHashMap so the cached items are candidates for GC. \
> Their fields values are then fetched from the cache and compared while sorting.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic