views:

196

answers:

4

I'm interested in learning techniques for distributed computing. As a Java developer, I'm probably willing to start with Hadoop. Could you please recommend some books/tutorials/articles to begin with?

+3  A: 

Maybe you can read some papers related to MapReduce and distributed computing first, to gain a better understanding of it. Here are some I would like to recommand:

On the other hand, if you want to know better of Hadoop, maybe you can start reading Hadoop MapReduce framework source code.

ZelluX
+1  A: 

Here are some resources from Yahoo! Developer Network

a tutorial:

http://developer.yahoo.com/hadoop/tutorial/

an introductory course (requires Siverlight, sigh):

http://yahoo.hosted.panopto.com/CourseCast/Viewer/Default.aspx?id=281cbf37-eed1-4715-b158-0474520014e6

David M
+2  A: 

Currently, bookwise I would check out - Hadoop A Definitive Guide. Its written by Tom White who has worked on Hadoop for a good while now, and works at Cloudera with Doug Cutting (Hadoop creator).

Also on the free side, Jimmy Lin from UMD has written a book called: Data-Intensive Text Processing with MapReduce. Here's a link to the final pre-production verison (link provided by the author on his website).

Binary Nerd
+1  A: 

The All Things Hadoop Podcast http://allthingshadoop.com/podcast has some good content and good guests. A lot of it is geared to getting started with Distributed Computing.

Joe Stein