good links about hadoop ecosystem


Hive - A Petabyte Scale Data Warehouse using Hadoop
http://www.facebook.com/note.php?note_id=89508453919

How Facebook uses Scribe, Hadoop, and Hive for Analytics, Ad hoc
analysis, Spam detection and Ad Optimization
http://axonflux.com/how-facebook-uses-scribe-hadoop-and-hive-for

Presentation: Data analysis with Hadoop and Hive
http://www.slideshare.net/jseidman/data-analysis-with-hadoop-and-hive-chicagodb-2212011

Tracking Trends with Hadoop and Hive on EC2, Here they summarize pageviews for millions of wikipedia hits.
http://www.cloudera.com/blog/2009/07/tracking-trends-with-hadoop-and-hive-on-ec2/

Log analytics with Hadoop and Hive
http://help.papertrailapp.com/kb/analytics/log-analytics-with-hadoop-and-hive

Exploring apache log files using hive and hadoop
http://www.johnandcailin.com/blog/cailin/exploring-apache-log-files-using-hive-and-hadoop

More apache log analysis code.
 https://gist.github.com/1556097

0 pensamientos:

Post a Comment

feedback!