When does Cassandra hit Amdahl's law?
  • Time: 2012-01-12 17:26:09
  • Tags:
  • cassandra

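I am trying to understand the claim that Cassandra scales linearly with the number of nodes. In a quick search of the 'net, I have not seen much treatment of this topic. Surely Cassandra has serial processing elements that must limit the speedup gained as N grows. Any thoughts, pointers, or links on this subject are welcome.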

Edit to provide perspective:
I am working on a project with a current request for a Cassandra infrastructure of 1,000+ nodes. I did not come up with this spec. I find myself proposing that N be reduced to somewhere between 200 and 500, with each node at least twice as fast for serial computation. This is easy to achieve without a per-node cost penalty by making simple changes to the server configuration.

Best answer

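Cassandra's scaling is better described by Gustafson's law, not Amdahl's Law. Gustafson scaling looks at how much more data you can process as the number of nodes increases. That is, if you have N times as many nodes, you can process a dataset N times larger in the same time.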

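This is possible because Cassandra uses very little cluster-wide coordination, apart from schema and ring changes. Most operations involve only a number of nodes equal to the replication factor, and that number stays constant as the dataset grows, hence the nearly linear scale-out.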
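To illustrate why the per-operation fan-out stays constant, here is a minimal sketch of a toy token ring. It is not Cassandra's actual partitioner or replication strategy: the hash choice, the one-token-per-node layout, and the names `token`, `build_ring`, and `replicas_for` are all assumptions made for illustration, and it ignores virtual nodes and rack or datacenter awareness.

```python
import bisect
import hashlib


def token(value: str) -> int:
    """Map a string to a ring position (hash choice is illustrative only)."""
    return int.from_bytes(hashlib.md5(value.encode()).digest()[:8], "big")


def build_ring(num_nodes: int) -> list[tuple[int, str]]:
    """Give each hypothetical node one token and sort by ring position."""
    return sorted((token(f"node-{i}"), f"node-{i}") for i in range(num_nodes))


def replicas_for(key: str, ring: list[tuple[int, str]], replication_factor: int) -> list[str]:
    """Walk clockwise from the key's token and take the next RF nodes."""
    tokens = [t for t, _ in ring]
    start = bisect.bisect_right(tokens, token(key)) % len(ring)
    return [ring[(start + i) % len(ring)][1] for i in range(replication_factor)]


# The number of nodes touched by one key does not depend on cluster size.
for n in (10, 200, 1000):
    ring = build_ring(n)
    print(n, len(replicas_for("user:42", ring, replication_factor=3)))
```

Whatever the cluster size, a single-key read or write in this model involves only the replication-factor nodes, which is the property the answer relies on.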

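By contrast, Amdahl scaling looks at how much faster you can process a fixed dataset as the number of nodes increases. That is, if you have N times as many nodes, can you process the same dataset N times faster?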

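Clearly, at some point adding more nodes will not make your requests any faster, because there is a minimum amount of time needed to service a request. Cassandra does not scale linearly in this sense.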
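The difference between the two laws can be made concrete with their textbook formulas. The sketch below is only illustrative: the serial fraction of 0.01 is an assumed number, not a measurement of Cassandra, and the two laws define the serial fraction against different baselines (fixed workload for Amdahl, scaled workload for Gustafson), so the columns are not directly comparable.

```python
def amdahl_speedup(n: int, serial_fraction: float) -> float:
    """Speedup on a fixed-size problem: 1 / (s + (1 - s) / N)."""
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / n)


def gustafson_speedup(n: int, serial_fraction: float) -> float:
    """Scaled speedup when the problem grows with N: s + (1 - s) * N."""
    return serial_fraction + (1.0 - serial_fraction) * n


s = 0.01  # assumed serial fraction, purely illustrative
for n in (200, 500, 1000):
    print(
        f"N={n:5d}  "
        f"Amdahl={amdahl_speedup(n, s):7.1f}  "
        f"Gustafson={gustafson_speedup(n, s):7.1f}"
    )
```

Under Amdahl's formula the speedup can never exceed 1/s no matter how many nodes are added, while the Gustafson figure keeps growing roughly in proportion to N.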

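In your case, it sounds like you are asking whether it is better to have 1,000 slow nodes or 200 fast ones. How big is your dataset? It depends on your workload, but the usual recommendation is that the optimal size is around 1TB of data per node, making sure you have enough RAM and CPU to match (see cassandra node limitations).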
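As a rough back-of-the-envelope sketch of that sizing rule: the 1TB-per-node figure comes from the recommendation above, while the 100TB dataset, the replication factor of 3, and the function name `nodes_needed` are hypothetical values chosen only to show the arithmetic. It ignores compaction headroom, compression, and growth.

```python
import math


def nodes_needed(raw_dataset_tb: float,
                 replication_factor: int = 3,
                 per_node_tb: float = 1.0) -> int:
    """Estimate node count from total replicated data and a per-node target."""
    return math.ceil(raw_dataset_tb * replication_factor / per_node_tb)


# Hypothetical example: 100 TB of raw data, replicated 3 times, ~1 TB per node.
print(nodes_needed(100))  # 300
```

If the estimate lands well below the requested node count, that suggests the cluster is being sized for something other than data volume, such as request throughput, and that workload should drive the decision.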
