IT Management and Cloud Blog

hadoop


Distributed computing with Linux and Hadoop

Thursday, December 11th, 2008

Every day, people rely on search engines to find specific content among the many terabytes of data on the Internet, but have you ever wondered how that search is actually performed? One approach is Apache Hadoop, a software framework that enables distributed processing of vast amounts of data. One application of [...]

Free White Paper on Hadoop from Sun

Wednesday, December 10th, 2008

What is Hadoop?

Writing An Hadoop MapReduce Program In Python

Saturday, November 29th, 2008

Here is another great tutorial on using Hadoop MapReduce: Hadoop MapReduce Program In Python Tutorial
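For readers who have not seen the MapReduce model before, here is a minimal word-count sketch in plain Python. It is not taken from the linked tutorial, and it does not call any Hadoop API; the function names (map_words, reduce_counts, word_count) are made up for illustration. It only mimics the three phases on a single machine: a map step that emits (word, 1) pairs, a sort that stands in for the shuffle between map and reduce, and a reduce step that sums the counts per word.

```python
from itertools import groupby


def map_words(lines):
    """Map step: emit a (word, 1) pair for every whitespace-separated word."""
    for line in lines:
        for word in line.split():
            yield word, 1


def reduce_counts(pairs):
    """Reduce step: sum the counts for each word.

    Assumes the pairs arrive sorted by word, so that all pairs for the
    same word are adjacent (groupby only merges adjacent items).
    """
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        yield word, sum(count for _, count in group)


def word_count(lines):
    """Run map, sort (a local stand-in for the shuffle), and reduce."""
    mapped = sorted(map_words(lines))
    return dict(reduce_counts(mapped))
```

Calling word_count(["a b a", "b c"]) gives a count of 2 for "a", 2 for "b", and 1 for "c". On a real cluster the map and reduce steps would run as separate tasks on different machines, with the framework handling the sort and data movement between them.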

The Commoditization of Massive Data Analysis

Thursday, November 20th, 2008

There is a debate brewing among data systems cognoscenti as to the best way to do data analysis at this scale. The old guard in the Enterprise IT camp tends to favor relational databases and the SQL language, while the web upstarts have rallied around the MapReduce programming model popularized at Google, and cloned in [...]

IBM MapReduce Tools for Eclipse

Friday, October 17th, 2008

IBM MapReduce Tools for Eclipse
