[prev in list] [next in list] [prev in thread] [next in thread] 

List:       cassandra-user
Subject:    Fwd: Re: bigger data density with Cassandra 4.0?
From:       onmstester onmstester <onmstester () zoho ! com>
Date:       2018-08-29 9:55:31
Message-ID: 16585110d64.11c1966ad17752.4319763406247145388 () zoho ! com
[Download RAW message or body]

Thanks Kurt, Actually my cluster has > 10 nodes, so there is a tiny chance to stream \
a complete SSTable. While logically any Columnar noSql db like Cassandra, needs \
always to re-sort grouped data for later-fast-reads and having nodes with big amount \
of data (> 2 TB) would be annoying for this background process, How is it possible \
that some of these databases like HBase and Scylla db does not emphasis on small \
nodes (like Cassandra do)? Sent using Zoho Mail ============ Forwarded message \
============ From : kurt greaves <kurt@instaclustr.com> To : \
"User"<user@cassandra.apache.org> Date : Wed, 29 Aug 2018 12:03:47 +0430 Subject : \
Re: bigger data density with Cassandra 4.0? ============ Forwarded message \
============ My reasoning was if you have a small cluster with vnodes you're more \
likely to have enough overlap between nodes that whole SSTables will be streamed on \
major ops. As   N gets >RF you'll have less common ranges and thus less likely to be \
streaming complete SSTables. Correct me if I've misunderstood.


[Attachment #3 (text/html)]

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"><html><head><meta \
content="text/html;charset=UTF-8" http-equiv="Content-Type"></head><body ><div \
style='font-size:10pt;font-family:Verdana,Arial,Helvetica,sans-serif;color:#00000;'><div>Thanks \
Kurt,<br></div><div>Actually my cluster has &gt; 10 nodes, so there is a tiny chance \
to stream a complete SSTable.<br></div><div>While logically any Columnar noSql db \
like Cassandra, needs always to re-sort grouped data for later-fast-reads and having \
nodes with big amount of data (&gt; 2 TB) would be annoying for this background \
process, How is it possible that some of these databases like HBase and Scylla db \
does not emphasis on small nodes (like Cassandra do)?<br></div><div><br></div><div \
id=""><p style=""><span class="colour" style="color:rgb(42, 42, 42)">Sent using <a \
target="_blank" href="https://www.zoho.com/mail/" style="color:#598fde;">Zoho \
Mail</a></span><br></p></div><div><br></div><div class="zmail_extra"><div \
id="1"><div><br></div><div>============ Forwarded message \
============<br></div><div>From : kurt greaves \
&lt;kurt@instaclustr.com&gt;<br></div><div>To : \
"User"&lt;user@cassandra.apache.org&gt;<br></div><div>Date : Wed, 29 Aug 2018 \
12:03:47 +0430<br></div><div>Subject : Re: bigger data density with Cassandra \
4.0?<br></div><div>============ Forwarded message \
============<br></div></div><div><br></div><blockquote style="border-left: 1px solid \
#cccccc; padding-left: 6px; margin:0 0 0 5px"><div><div dir="ltr">My reasoning was if \
you have a small cluster with vnodes you're more likely to have enough overlap \
between nodes that whole SSTables will be streamed on major ops. As&nbsp; N gets \
&gt;RF you'll have less common ranges and thus less likely to be streaming complete \
SSTables. Correct me if I've \
misunderstood.<br></div></div></blockquote></div><div><br></div></div><br></body></html>




[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic