[prev in list] [next in list] [prev in thread] [next in thread] 

List:       ceph-devel
Subject:    RE: Mon backing store
From:       Somnath Roy <Somnath.Roy () sandisk ! com>
Date:       2014-06-05 18:40:22
Message-ID: 755F6B91B3BE364F9BCA11EA3F9E0C6F25CFE02D () SACMBXIP01 ! sdcorp ! global ! sandisk ! com
[Download RAW message or body]

Yeah :-)...We are also waiting for wip-rocksdb to merge in mainstream.

Thanks & Regards
Somnath

-----Original Message-----
From: Mark Nelson [mailto:mark.nelson@inktank.com] 
Sent: Thursday, June 05, 2014 11:33 AM
To: Somnath Roy; Samuel Just; ceph-devel@vger.kernel.org
Subject: Re: Mon backing store

Hi Somnath,

Sorry to get your hopes up, no ceph+rocksdb benchmarks (yet!).  I was referring to \
the benchmarks that the rocksdb developers published here:

https://github.com/facebook/rocksdb/wiki/Performance-Benchmarks

Sounds like we need to start some performance testing on wip-rocksdb going though! :)

Mark

On 06/05/2014 01:26 PM, Somnath Roy wrote:
> Mark,
> Could you please share the performance benchmark result with Rocksdbstore + ceph \
> and leveldbstore+ceph as you mentioned below ? BTW, have you measured the WA \
> induced by Rocksdbstore and leveldbstore in the process since that is also a very \
> important factor while backend is flash ? 
> Thanks & Regards
> Somnath
> 
> -----Original Message-----
> From: ceph-devel-owner@vger.kernel.org 
> [mailto:ceph-devel-owner@vger.kernel.org] On Behalf Of Mark Nelson
> Sent: Thursday, June 05, 2014 11:18 AM
> To: Samuel Just; ceph-devel@vger.kernel.org
> Subject: Re: Mon backing store
> 
> On 06/05/2014 12:42 PM, Samuel Just wrote:
> > I am starting to wonder whether using leveldb for the mon is actually 
> > introducing an excessive amount unnecessary complexity and 
> > non-determinism.  Given that the monitor workload is read mostly, 
> > except for failure conditions when it becomes write latency 
> > sensitive, might we do better with a strict b-tree style backing db 
> > such as berkely db even at the cost of some performance?  It seems 
> > like something like that might provide more reliable latency properties.
> 
> I'm not against trying it, but I'm not convinced it's the right solution.  If the \
> 99th percentile latency is significantly better, that's obviously a win, but I \
> think we are indeed going to take a big performance hit overall.  I'm more in favor \
> of trying rocksdb first. I'm certainly not as well versed in the leveldb interface \
> as you or Joao are, but it appears much of our code in LevelDBStore would be \
> reusable. I don't know that rocksdb won't have the same issues that leveldb does, \
> but the rocksdb developers specifically mention leveldb's bad 99th percentile \
> latencies as a driver for it's development: 
> "By contrast, we've published the RocksDB benchmark results for server side \
> workloads on Flash. We also measured the performance of LevelDB on these \
> server-workload benchmarks and found that RocksDB solidly outperforms LevelDB for \
> these IO bound workloads. We found that LevelDB's single-threaded compaction \
> process was insufficient to drive server workloads. We saw frequent write-stalls \
> with LevelDB that caused 99-percentile latency to be tremendously large. We found \
> that mmap-ing a file into the OS cache introduced performance bottlenecks for \
> reads. We could not make LevelDB consume all the IOs offered by the underlying \
> Flash storage." 
> Compaction performance and high mmap/page fault/kswapd utilization during reads are \
> two big issues we've hit in leveldb, so I'm inclined to think that rocksdb is at \
> least worthy of some attention. 
> Here's the benchmark results on their wiki:
> 
> https://github.com/facebook/rocksdb/wiki/Performance-Benchmarks
> 
> > 
> > Thoughts?
> > -Sam
> > --
> > To unsubscribe from this list: send the line "unsubscribe ceph-devel"
> > in the body of a message to majordomo@vger.kernel.org More majordomo 
> > info at  http://vger.kernel.org/majordomo-info.html
> > 
> 
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" 
> in the body of a message to majordomo@vger.kernel.org More majordomo 
> info at  http://vger.kernel.org/majordomo-info.html
> 
> ________________________________
> 
> PLEASE NOTE: The information contained in this electronic mail message is intended \
> only for the use of the designated recipient(s) named above. If the reader of this \
> message is not the intended recipient, you are hereby notified that you have \
> received this message in error and that any review, dissemination, distribution, or \
> copying of this message is strictly prohibited. If you have received this \
> communication in error, please notify the sender by telephone or e-mail (as shown \
> above) immediately and destroy any and all copies of this message in your \
> possession (whether hard copies or electronically stored copies). 

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic