IT Management and Cloud Blog

hadoop

« Previous Entries

Cloud Cafe #38 Haddop and Cascading With Flightcaster

Wednesday, August 19th, 2009

This is a great podcast with Bradford Cross a senior architect with Flightcaster.  He has been working with Flightcast for three months and wait to you hear the really cool stuff he has done with “Big Data”. Here is a list of the topics and technologies we discuss in this podcast:

Freaking Flight Dealys
Rails on Heruko
Amazon [...]

Building a Business on Hadoop, HBase, and Open Source

Wednesday, July 29th, 2009

Building a Business on Hadoop, HBase, and Open Source Distributed Computing
View more presentations from Bradford Stephens.

Hadoop and Cascading Slides from AJUG

Saturday, July 25th, 2009

Hadoop and Cascading At AJUG July 2009
View more presentations from Christopher Curtin.
Link to the video post…

Hadoop and Cascading – AJUG – 07/21/09

Wednesday, July 22nd, 2009

Sorry for the shaken hand-held production. My tripod broke… Look at it this way Content is King and Chris Curtin’s content was brilliant.

Velocity 09 – Chris Wensel on Cascading

Saturday, June 27th, 2009

Big 4 Little 4 – Private Clouds

Thursday, June 25th, 2009

As lists go, here goes…
Big 4
IBM Blue Cloud
IBM’s latest announcement, IMHO, finally puts them on the map as far as private cloud infrastructures go. I still believe they have a long way to go, but they have a very powerful infrastructure with their Cloudburst architecture backed by TSAM, ITPM, and ITM.
VMware vSphere
VMware has gone [...]

Cloudera’s Distribution for Hadoop

Monday, March 16th, 2009

Looks like the Cloudera boys have come out of stealth mode…
Here is a list of features of the Cloudera distribution of Hadoop:

RPM Deployment – Never again wonder which files go in which directories and if your component versions are compatible. RPM was designed for this. In addition to Hadoop, we have RPMs for compatible versions [...]

Cloud Droplet #70 – If the sky should tumble and fall

Monday, March 16th, 2009

Drop #70

Cloudera Raises 5M

Mosso and PCI

Heterogeneous computing
Accelereyes

37 Countries 1 song

Listen Here…

Awsome MapReduce and Hadoop Presentation

Wednesday, March 11th, 2009

Last night at our Awsome meetup Don Brown of Twitpay gave a great presentation on Map Reduce and Hadoop.

Awsome This Tuesday Should Be Awesome

Sunday, March 8th, 2009

If you are in the Atlanta area this Tuesday night (3/10/09) you might want to stop by and attend our Awsome meetup. We have two great presenters as follows:
7:00 Hadoop
Don Brown of TwitPay will be giving a presentation on Hadoop and Map Reduce. For those of you who missed Don’s presentation at Cloud Camp [...]

Distributed computing with Linux and Hadoop

Thursday, December 11th, 2008

Every day people rely on search engines to find specific content in the many terabytes of data that exist on the Internet, but have you ever wondered how this search is actually performed? One approach is Apache’s Hadoop, which is [...]

Free White Paper on Hadoop from Sun

Wednesday, December 10th, 2008

What is Hadoop?

Writing An Hadoop MapReduce Program In Python

Saturday, November 29th, 2008

Here is another great tutorial on using Hadoop MapReduce.
Hadoop MapReduce Program In Python Tutorial

The Commoditization of Massive Data Analysis

Thursday, November 20th, 2008

There is a debate brewing among data systems cognoscenti as to the best way to do data analysis at this scale. The old guard in the Enterprise IT camp tends to favor relational databases and the SQL language, while the web upstarts have rallied around the MapReduce programming model popularized at Google, and cloned in [...]

IBM MapReduce Tools for Eclipse

Friday, October 17th, 2008

IBM MapReduce Tools for Eclipse

Think Red Hat for Hadoop

Thursday, October 16th, 2008

A group of former Google, Yahoo and Facebook heavy weights have started the first Red Hat style support infrastructure for Hadoop.
Cloudera

Hadoop + Python = Happy

Thursday, September 25th, 2008

Now we have something to smile about… Happy is a framework that allows Hadoop jobs to be written and run in Python 2.2 using Jython. It is an easy way to write map-reduce programs for Hadoop, and includes some new useful features as well. The current release supports Hadoop 0.17.2.
Happy Overview

Some Hadoop Links

Thursday, September 11th, 2008

Write a Hadoop MapReduce job in any programming language
MapReduce is a method for writing software that can be parallelized across thousands of machines to process enormous amounts of data. For instance, let’s say you want to count the number of referrals, by domain, in all the world’s Apache server logs.
Cascading – Answer For The [...]

Elastic Hadoop Clusters with Amazon’s Elastic Block Store

Tuesday, August 26th, 2008

Elastic Hadoop Clusters with Amazon’s Elastic Block Store
Do you hadoop?

Making Hadoop More Modular

Thursday, July 24th, 2008

Pluggable Hadoop
I especially like the discussion about instrumentation.  Hadoop currently has a metrics API that proves disk stats like diskused and disk busy.  More metrics and instrumentation for IT management would be a really cool thing.

« Previous Entries