[prev in list] [next in list] [prev in thread] [next in thread]
List: cassandra-user
Subject: Fwd: Re: bigger data density with Cassandra 4.0?
From: onmstester onmstester <onmstester () zoho ! com>
Date: 2018-08-29 9:55:31
Message-ID: 16585110d64.11c1966ad17752.4319763406247145388 () zoho ! com
[Download RAW message or body]
Thanks Kurt, Actually my cluster has > 10 nodes, so there is a tiny chance to stream \
a complete SSTable. While logically any Columnar noSql db like Cassandra, needs \
always to re-sort grouped data for later-fast-reads and having nodes with big amount \
of data (> 2 TB) would be annoying for this background process, How is it possible \
that some of these databases like HBase and Scylla db does not emphasis on small \
nodes (like Cassandra do)? Sent using Zoho Mail ============ Forwarded message \
============ From : kurt greaves <kurt@instaclustr.com> To : \
"User"<user@cassandra.apache.org> Date : Wed, 29 Aug 2018 12:03:47 +0430 Subject : \
Re: bigger data density with Cassandra 4.0? ============ Forwarded message \
============ My reasoning was if you have a small cluster with vnodes you're more \
likely to have enough overlap between nodes that whole SSTables will be streamed on \
major ops. As N gets >RF you'll have less common ranges and thus less likely to be \
streaming complete SSTables. Correct me if I've misunderstood.
[Attachment #3 (text/html)]
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"><html><head><meta \
content="text/html;charset=UTF-8" http-equiv="Content-Type"></head><body ><div \
style='font-size:10pt;font-family:Verdana,Arial,Helvetica,sans-serif;color:#00000;'><div>Thanks \
Kurt,<br></div><div>Actually my cluster has > 10 nodes, so there is a tiny chance \
to stream a complete SSTable.<br></div><div>While logically any Columnar noSql db \
like Cassandra, needs always to re-sort grouped data for later-fast-reads and having \
nodes with big amount of data (> 2 TB) would be annoying for this background \
process, How is it possible that some of these databases like HBase and Scylla db \
does not emphasis on small nodes (like Cassandra do)?<br></div><div><br></div><div \
id=""><p style=""><span class="colour" style="color:rgb(42, 42, 42)">Sent using <a \
target="_blank" href="https://www.zoho.com/mail/" style="color:#598fde;">Zoho \
Mail</a></span><br></p></div><div><br></div><div class="zmail_extra"><div \
id="1"><div><br></div><div>============ Forwarded message \
============<br></div><div>From : kurt greaves \
<kurt@instaclustr.com><br></div><div>To : \
"User"<user@cassandra.apache.org><br></div><div>Date : Wed, 29 Aug 2018 \
12:03:47 +0430<br></div><div>Subject : Re: bigger data density with Cassandra \
4.0?<br></div><div>============ Forwarded message \
============<br></div></div><div><br></div><blockquote style="border-left: 1px solid \
#cccccc; padding-left: 6px; margin:0 0 0 5px"><div><div dir="ltr">My reasoning was if \
you have a small cluster with vnodes you're more likely to have enough overlap \
between nodes that whole SSTables will be streamed on major ops. As N gets \
>RF you'll have less common ranges and thus less likely to be streaming complete \
SSTables. Correct me if I've \
misunderstood.<br></div></div></blockquote></div><div><br></div></div><br></body></html>
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic