[prev in list] [next in list] [prev in thread] [next in thread] 

List:       wikitech-l
Subject:    Re: [Wikitech-l] [Foundation-l] EN Wikipedia Editing Statistics
From:       "Thomas Dalton" <thomas.dalton () gmail ! com>
Date:       2008-11-30 20:58:36
Message-ID: a4359dff0811301258x659ea06ate56aaf152a08b358 () mail ! gmail ! com
[Download RAW message or body]

> I saw this the other day as well and found it odd. While enwiki dumps
> do take the longest, this does seem like an _incredibly_ long time for
> "All pages with complete page edit history (.bz2)" to finish (May 2009).

Do you know how many pages enwiki has and how much edit history they
each have? It's a lot!

I think the dumps work by starting with the last successful dump and
just adding in anything that's changed, but because there haven't been
any successful dumps of the whole of enwiki in a long time, it
basically has to start from scratch, which is going to take a long
time (and means it probably won't succeed - ie. we have a catch-22).
It seems to me that (if my understanding of the problem is correct),
the answer is to devote a more powerful computer to the dump for just
this one so that we can get things moving again - I'm sure if we asked
around someone could lend us a really powerful computer for a few
weeks to do the dump on.

_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic