Apache Hadoop



(Back to docs.huihoo.com/apache)

Introduction

Apache Hadoop is a free Java software framework that supports data intensive distributed applications running on large clusters of commodity computers. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google's MapReduce and Google File System (GFS) papers.

Documents

• Hadoop 0.16.1 Documentation
• 雅虎郑皓: 推动云计算应用 - Hadoop开源平台 (2010)
• Hadoop Operations: Managing Big Data Clusters (2009)
• Introduction to Hadoop (2007)
• Hadoop Distributed File System (2007)
• Meet Hadoop (2007)
• Yahoo! Experience with Hadoop (2007)
• Hadoop Map/Reduce (2006)
• Scalable Computing with Hadoop (2006)

Links

• http://hadoop.apache.org/
• http://wiki.apache.org/hadoop/
• http://wiki.apache.org/pig/
• http://en.wikipedia.org/wiki/Hadoop
• http://download.huihoo.com/apache/hadoop/