English 中文(简体)
Cassandra/HBase or just MySQL: Potential problems doing the next thing
原标题:

Say I have "user". It s the key. And I need to keep "user count". I am planning to have record with key "user" and value "0" to "9999+ ;-)" (as many as I ll have).

What problems I will drive in if I use Cassandra, HBase or MySQL for that? Say, I have thousand of new updates to this "user" key, where I need to increment the value. Am I in trouble? Locked for writes? Any other way of doing that?

Why this is done -- there will be a lot of "user"-like keys. Different other cases. But the idea is the same. Why keep it this way -- because I ll have more reads, so I can always get "counted value" very fast.

问题回答

I would just update the user count as a batch operation every N minutes rather than updating it in realtime. If there s only one process updating it, you don t need to worry about contention by definition.

Alternatively cassandra has a contrib/mutex for adding lock support via ZooKeeper.

MongoDB has update-in-place and a special inc operator for counters. http://blog.mongodb.org/post/171353301/using-mongodb-for-real-time-analytics

MongoDB and HBase have this built-in (as do most other databases which guarantee consistency).

One fairly simple trick with Cassandra is to have a particular row for usercount, and then insert an unique ID (e.g. a random UUID) column name with an empty value to it every single time an user is added. With regular intervals count the number of columns and put them in a total counter - removing the columns you ve just counted.

At any time, your total user count is therefore [total counter]+[number of columns on your usercount row]. You can get these with essentially two reads, and if you have row cache enabled, it s going to be fast.

HBase has an incrementColumnValue method for a fast, atomic read/write operation.





相关问题
SQL SubQuery getting particular column

I noticed that there were some threads with similar questions, and I did look through them but did not really get a convincing answer. Here s my question: The subquery below returns a Table with 3 ...

please can anyone check this while loop and if condition

<?php $con=mysql_connect("localhost","mts","mts"); if(!con) { die( unable to connect . mysql_error()); } mysql_select_db("mts",$con); /* date_default_timezone_set ("Asia/Calcutta"); $date = ...

php return a specific row from query

Is it possible in php to return a specific row of data from a mysql query? None of the fetch statements that I ve found return a 2 dimensional array to access specific rows. I want to be able to ...

Character Encodings in PHP and MySQL

Our website was developed with a meta tag set to... <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" /> This works fine for M-dashes and special quotes, etc. However, I ...

Pagination Strategies for Complex (slow) Datasets

What are some of the strategies being used for pagination of data sets that involve complex queries? count(*) takes ~1.5 sec so we don t want to hit the DB for every page view. Currently there are ~...

Averaging a total in mySQL

My table looks like person_id | car_id | miles ------------------------------ 1 | 1 | 100 1 | 2 | 200 2 | 3 | 1000 2 | 4 | 500 I need to ...

热门标签