Can someone explain how MapReduce works with Cassandra 0.6? I've read through the word count example, but I don't quite follow what's happening on the Cassandra end vs. the "client" end.
https://svn.apache.org/repos/asf/cassandra/trunk/contrib/word_count/
For instance, let's say I'm using Python and Pycassa: how would I load in a new MapReduce function and then call it? Does my MapReduce function have to be Java that's installed on the Cassandra server? If so, how do I call it from Pycassa?
There's also mention of Pig making this all easier, but I'm a complete Hadoop noob, so that didn't really help.
Your answer can use Thrift or whatever; I just mentioned Pycassa to denote the client side. I'm just trying to understand the difference between what runs in the Cassandra cluster vs. the actual server making the requests.