Machine Learning on Big Data with MapReduce
http://machinelearningbigdata.pbworks.com/w/page/37651454/FrontPage
Hbase & NoSQL Comparison :
http://kkovacs.eu/cassandra-vs-mongodb-vs-couchdb-vs-redis
http://ria101.wordpress.com/2010/02/24/hbase-vs-cassandra-why-we-moved/
http://natishalom.typepad.com/nati_shaloms_blog/hbase/
http://natishalom.typepad.com/nati_shaloms_blog/2011/07/real-time-analytics-for-big-data-an-alternative-approach.html
http://natishalom.typepad.com/nati_shaloms_blog/2011/07/real-time-analytics-for-big-data-an-alternative-approach-to-facebooks-new-realtime-analytics-system.html How to develop Big Data Pipelines for Hadoop : Quora Suggests : http://www.quora.com/MapReduce/Whats-the-best-way-to-come-up-to-speed-on-MapReduce-Hadoop-and-Hive Advanced Concepts : Hadoop -ing and Hive -ing with Parallel DB, Fast Joins and Optimized Quries: http://engineering.linkedin.com/hadoop/recap-improving-hadoop-performance-1000x http://database.cs.brown.edu/sigmod09/benchmarks-sigmod09.pdf Hadoop Security : http://hbase.apache.org/book/hadoop.html#2.3.1. Hadoop Security http://hortonworks.com/blog/the-role-of-delegation-tokens-in-apache-hadoop-security/
How Facebook , Twitter solving the Analytics problem using Hbase : http://www.slideshare.net/cloudera/building-realtime-big-data-services-at-facebook-with-hadoop-and-hbase-jonathan-gray-facebook http://www.slideshare.net/parallellabs/sigmod-realtime-hadooppresentation http://www.slideshare.net/ydn/hive-with-h-base http://www.slideshare.net/brizzzdotcom/facebook-messages-hbase http://www.slideshare.net/giganati/real-time-analytics-for-big-dataa-twitter-casestudy-v3-i-pad Just as a side note : Hadoop is not just the only technique for Map-Reduce : There are other less-memory-intensive, less-complex, less-resource-hungry techniques : http://mapreduce.sandia.gov/doc/Manual.html : MapReduce-MPI https://github.com/erikfrey/bashreduce http://code.google.com/p/cloudmapreduce/ Why should a SMB embrace OSS ? http://www.slideshare.net/lusciouspear/building-a-business-on-hadoop-hbase-and-open-source-distributed-computing All HBase Links : http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/HTableInterface.html#batch%28java.util.List%29 http://www.mail-archive.com/user@hbase.apache.org/msg18782.html http://grokbase.com/p/hbase/user/117sr49gtp/fanning-out-hbase-queries-in-parallel https://blogs.apache.org/hbase/entry/hbase_project_management_committee_meeting http://www.cloudera.com/resource/hbasecon-2012-real-time-analytics-with-hbase/ http://blog.sematext.com/2012/04/27/hbase-real-time-analytics-rollbacks-via-append-based-updates-part-2/ http://blog.sematext.com/2010/12/16/deferring-processing-updates-to-increase-hbase-write-performance/ http://www.slideshare.net/cloudera/5-healthcare-at-explorys-doug-meil-explorys-final-2 http://blog.sematext.com/2012/04/22/hbase-real-time-analytics-rollbacks-via-append-based-updates/ https://blogs.apache.org/hbase/entry/coprocessor_introduction http://svn.apache.org/repos/asf/hbase/trunk/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java http://stackoverflow.com/questions/6833892/realtime-querying-aggregating-millions-of-records-hadoop-hbase-cassandra https://issues.apache.org/jira/browse/HBASE-3131 https://issues.apache.org/jira/browse/HBASE-3220 https://issues.apache.org/jira/browse/HBASE-1512 https://issues.apache.org/jira/browse/HBASE-1002 https://issues.apache.org/jira/browse/HBASE-2469 https://issues.apache.org/jira/browse/HBASE-1845 https://issues.apache.org/jira/browse/HBASE-2000 http://grokbase.com/t/hbase/user/10a6t1qvmt/parallel-computing-on-hbase http://www.slideshare.net/cloudera/3-h-base-coprocessors-hbase-con-may-2012 http://hbase-coprocessor-experiments.blogspot.com/2011/05/extending.html http://hbase.apache.org/book/architecture.html More Hadoop Links : http://www.cloudera.com/resources/hadoop-world/ http://www.cloudera.com/resource/hadoop-world-2011-presentation-slides-raptor-real-time-analytics-on-hadoop/ http://news.ycombinator.com/item?id=2588185 http://gigaom.com/2012/03/03/hadoop-jumps-through-hoops-becomes-mainstream/ http://pro.gigaom.com/2012/01/how-amazons-dynamodb-is-rattling-the-big-data-and-cloud-markets/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=493240+hadoop-jumps-through-hoops-becomes-mainstream&utm_content=aprilkilcrease http://www.cloudera.com/resource/hadoop-world-2011-presentation-slides-building-realtime-big-data-services-at-facebook-with-hadoop-and-hbase/ http://hadapt.squarespace.com/storage/Hadapt-Handout-v2.pdf http://www.slideshare.net/cloudera/building-realtime-big-data-services-at-facebook-with-hadoop-and-hbase-jonathan-gray-facebook?from=ss_embed http://www.slideshare.net/reedshea/boston-hug-cloudera-presentation/download http://www.slideshare.net/dacort/mongodb-realtime-data-collection-and-stats-generation/download http://t.co/M6Z5O19x Real-time Bigdata Links (MongoDB, GridGain In-Memory Distributed MapR ) : http://www.slideshare.net/dacort/mongodb-realtime-data-collection-and-stats-generation/download http://www.slideshare.net/craigsdickson/java-paas-vendor-survey-september-2011 http://pradyutsarma.blogspot.com/2011/06/extending-orion-navigator.html http://www.gridgain.com/data_grid.html https://github.com/aloiscochard/spring-batch-integration-gridgain http://aniefer.blogspot.com/2011/02/embedding-orion-editor_02.html http://www.java.net/external?url=http://aloiscochard.blogspot.com/2010/04/spring-batch-integration-module-for.html http://static.springsource.org/spring-batch/reference/html-single/index.html#whatsNewPartitioning http://static.springsource.org/spring-batch/reference/html/scalability.html
http://university.cloudera.com/training/apache_hbase/hbase.html