English 中文(简体)
How to Analyze and Query big chunks of data
原标题:

I need to:

1. Analyze big files of http logs

  • I m thinking in using mapreduce but I m not sure where to host it. Shall I use App Engine Mapper or EC2+MapReduce or simply use it in my VPS?
  • Other suggestions besides MapReduce?

2.Save and query the results

  • After analyzing all the data I need to save it and present to the user in the best possible way. How should I save the results? Is MongoDB a good solution for this?

Thanks.

Regards.

问题回答

Have you seen BigQuery? It looks like exactly what you need.





相关问题
How to make logging.debug work on Appengine?

I m having a tough time getting the logging on Appengine working. the statement import logging is flagged as an unrecognized import in my PyDev Appengine project. I suspected that this was just an ...

gqlQuery returns object, want list of keys

Is there a way to convert the GqlQuery object to an array of keys, or is there a way to force the query to return an array of keys? For example: items = db.GqlQuery("SELECT __key__ FROM Items") ...

Integrating Google AppEngine with a Thick Client

I want to make a multi-user client-server solution with Java Swing thick client as a front-end and Google AppEngine (Java one) as a back-end. The problem is that GAE provides only web-based forms for ...

sorl.thumbnail : thumbnail is not a valid tag library?

I am trying to install sorl.thumbnail but am getting the following error message: thumbnail is not a valid tag library: Could not load template library from django.templatetags.thumbnail, No module ...

热门标签