hadoop
« Previous EntriesCloud Cafe #38 Haddop and Cascading With Flightcaster
Wednesday, August 19th, 2009This is a great podcast with Bradford Cross a senior architect with Flightcaster. He has been working with Flightcast for three months and wait to you hear the really cool stuff he has done with “Big Data”. Here is a list of the topics and technologies we discuss in this podcast:
Freaking Flight Dealys
Rails on Heruko
Amazon [...]
Building a Business on Hadoop, HBase, and Open Source
Wednesday, July 29th, 2009Building a Business on Hadoop, HBase, and Open Source Distributed Computing
View more presentations from Bradford Stephens.
Hadoop and Cascading Slides from AJUG
Saturday, July 25th, 2009Hadoop and Cascading At AJUG July 2009
View more presentations from Christopher Curtin.
Link to the video post…
Hadoop and Cascading – AJUG – 07/21/09
Wednesday, July 22nd, 2009Sorry for the shaken hand-held production. My tripod broke… Look at it this way Content is King and Chris Curtin’s content was brilliant.
Velocity 09 – Chris Wensel on Cascading
Saturday, June 27th, 2009Big 4 Little 4 – Private Clouds
Thursday, June 25th, 2009As lists go, here goes…
Big 4
IBM Blue Cloud
IBM’s latest announcement, IMHO, finally puts them on the map as far as private cloud infrastructures go. I still believe they have a long way to go, but they have a very powerful infrastructure with their Cloudburst architecture backed by TSAM, ITPM, and ITM.
VMware vSphere
VMware has gone [...]
Cloudera’s Distribution for Hadoop
Monday, March 16th, 2009Looks like the Cloudera boys have come out of stealth mode…
Here is a list of features of the Cloudera distribution of Hadoop:
RPM Deployment – Never again wonder which files go in which directories and if your component versions are compatible. RPM was designed for this. In addition to Hadoop, we have RPMs for compatible versions [...]
Cloud Droplet #70 – If the sky should tumble and fall
Monday, March 16th, 2009Drop #70
Cloudera Raises 5M
Mosso and PCI
Heterogeneous computing
Accelereyes
37 Countries 1 song
Listen Here…
Awsome MapReduce and Hadoop Presentation
Wednesday, March 11th, 2009Last night at our Awsome meetup Don Brown of Twitpay gave a great presentation on Map Reduce and Hadoop.
Awsome This Tuesday Should Be Awesome
Sunday, March 8th, 2009If you are in the Atlanta area this Tuesday night (3/10/09) you might want to stop by and attend our Awsome meetup. We have two great presenters as follows:
7:00 Hadoop
Don Brown of TwitPay will be giving a presentation on Hadoop and Map Reduce. For those of you who missed Don’s presentation at Cloud Camp [...]
Distributed computing with Linux and Hadoop
Thursday, December 11th, 2008Every day people rely on search engines to find specific content in the many terabytes of data that exist on the Internet, but have you ever wondered how this search is actually performed? One approach is Apache’s Hadoop, which is [...]
Free White Paper on Hadoop from Sun
Wednesday, December 10th, 2008What is Hadoop?
Writing An Hadoop MapReduce Program In Python
Saturday, November 29th, 2008Here is another great tutorial on using Hadoop MapReduce.
Hadoop MapReduce Program In Python Tutorial
The Commoditization of Massive Data Analysis
Thursday, November 20th, 2008There is a debate brewing among data systems cognoscenti as to the best way to do data analysis at this scale. The old guard in the Enterprise IT camp tends to favor relational databases and the SQL language, while the web upstarts have rallied around the MapReduce programming model popularized at Google, and cloned in [...]
IBM MapReduce Tools for Eclipse
Friday, October 17th, 2008IBM MapReduce Tools for Eclipse
Think Red Hat for Hadoop
Thursday, October 16th, 2008A group of former Google, Yahoo and Facebook heavy weights have started the first Red Hat style support infrastructure for Hadoop.
Cloudera
Hadoop + Python = Happy
Thursday, September 25th, 2008Now we have something to smile about… Happy is a framework that allows Hadoop jobs to be written and run in Python 2.2 using Jython. It is an easy way to write map-reduce programs for Hadoop, and includes some new useful features as well. The current release supports Hadoop 0.17.2.
Happy Overview
Some Hadoop Links
Thursday, September 11th, 2008 Write a Hadoop MapReduce job in any programming language
MapReduce is a method for writing software that can be parallelized across thousands of machines to process enormous amounts of data. For instance, let’s say you want to count the number of referrals, by domain, in all the world’s Apache server logs.
Cascading – Answer For The [...]
Elastic Hadoop Clusters with Amazon’s Elastic Block Store
Tuesday, August 26th, 2008Elastic Hadoop Clusters with Amazon’s Elastic Block Store
Do you hadoop?
Making Hadoop More Modular
Thursday, July 24th, 2008Pluggable Hadoop
I especially like the discussion about instrumentation. Hadoop currently has a metrics API that proves disk stats like diskused and disk busy. More metrics and instrumentation for IT management would be a really cool thing.

