IT Management and Cloud Blog

hadoop


Elastic Hadoop Clusters with Amazon’s Elastic Block Store

Tuesday, August 26th, 2008

Elastic Hadoop Clusters with Amazon’s Elastic Block Store
Do you hadoop?

Making Hadoop More Modular

Thursday, July 24th, 2008

Pluggable Hadoop
I especially like the discussion about instrumentation. Hadoop currently has a metrics API that provides disk stats like diskused and disk busy. More metrics and instrumentation for IT management would be a really cool thing.

Do You Hadoop?

Thursday, July 3rd, 2008

Maybe you should. Yahoo entered and won a sort contest using 1000 lines of Java code with the open source Hadoop. They used a 900-node cluster to sort 10 billion 100-byte records (1 terabyte) in 209 seconds. Here is a PDF of the details.
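To put those numbers in perspective, here is a quick back-of-the-envelope calculation in Python. It uses only the figures quoted above (10 billion records, 100 bytes each, 900 nodes, 209 seconds) and assumes decimal units (1 TB = 10^12 bytes). The per-node figure is a simple average, not a measured value.

```python
# Figures taken from the post: 10 billion records of 100 bytes,
# sorted on a 900-node cluster in 209 seconds.
records = 10_000_000_000
record_bytes = 100
nodes = 900
seconds = 209

# Total data volume: 10**12 bytes, i.e. 1 TB in decimal units.
total_bytes = records * record_bytes

# Average sort throughput for the whole cluster and per node.
cluster_rate = total_bytes / seconds   # bytes per second
per_node_rate = cluster_rate / nodes   # bytes per second per node

print(f"total:    {total_bytes / 1e12:.1f} TB")
print(f"cluster:  {cluster_rate / 1e9:.2f} GB/s")
print(f"per node: {per_node_rate / 1e6:.2f} MB/s")
```

That works out to roughly 4.8 GB/s across the cluster, or a little over 5 MB/s per node on average.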

Do we need a cloud standard or just one good old IT management standard?

Tuesday, July 1st, 2008

It is my belief that what we today call the “cloud” will really just evolve into a complex IT infrastructure of the future, and in the end, will just be referred to as infrastructure. There is no doubt the traditional IT landscape of the last 20 years is going through a substantial transformation on […]

Hadoop Query Languages

Saturday, June 21st, 2008

Tom White discusses some query language options for Hadoop. Do you Hadoop? Maybe you should.
Hadoop Query Languages

RAM is the new disk…

Thursday, June 19th, 2008

This is a great article that discusses many of the “New Development” issues an architect needs to consider when creating a new application. The article includes discussions of Hadoop vs. RDBMS, seek rates for SSDs vs. hard drives, and RDBMS clustering vs. In-Memory Data Grid (IMDG). Here is a quote from the article…
Whether […]

A Ruby MapReduce Framework

Sunday, June 15th, 2008

Skynet
Skynet is an open source Ruby implementation of Google’s MapReduce framework, created at Geni. With Skynet, one can easily convert a time-consuming serial task, such as a computationally expensive Rails migration, into a distributed program running on many computers.
http://skynet.rubyforge.org/doc/index.html

Hadoop at Facebook

Friday, June 6th, 2008

We have come a long way from those initial days. Facebook has multiple Hadoop clusters deployed now - with the biggest having about 2500 cpu cores and 1 PetaByte of disk space. We are loading over 250 gigabytes of compressed data (over 2 terabytes uncompressed) into the Hadoop file system every day and have hundreds […]

Drupal Barcamp NYC - Podcast #1 Cloud Talk Presentation

Monday, March 31st, 2008


Listen to the podcast here…

Yahoo, Tata Subsidiary In Research Pact

Monday, March 24th, 2008

Yahoo, Tata Subsidiary In Research Pact

Powered by Hadoop

Monday, March 24th, 2008

Applications and organizations using Hadoop

Multicore boom needs new developer skills

Saturday, March 22nd, 2008

Multicore boom needs new developer skills
More than charity lies behind Microsoft and Intel’s announcement this week that they will donate US$20 million to a pair of U.S. colleges in the hope of spurring advances in parallel, or multicore, programming research.
64-Core Desktop Processors by 2012
“Expect x86 servers with as many as 64 processor cores in […]

Cloud Computing Vendors A to Z

Thursday, March 20th, 2008

I HAVE A REVISED VERSION OF THIS POST…
Cloud Vendors A to Z (Revised)

More Hadoop Summit Seats Available! New Venue too.

Thursday, March 13th, 2008

They have had to add more seats due to the tremendous interest in this event. I wish I could attend; however, I have to teach a class in NYC that week. It sounds like it is going to rock.
Hadoop Summit

Can Your Programming Language Do This? - MapReduce

Monday, March 10th, 2008

Can Your Programming Language Do This?
Joel on Software does a nice MapReduce tutorial.

CouchDB from 10,000 Feet

Thursday, March 6th, 2008

CouchDB from 10,000 Feet
CouchDB views allow you to filter, collate, and aggregate data. Views are powered by Map/Reduce. The map stage processes key/value pairs to produce intermediate values, and reduce then combines the intermediate values for a particular key. Map/Reduce is inherently parallelizable, making it useful on clusters of machines.
Interesting Note:
Damien Katz, the CouchDB creator […]
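The map and reduce stages described in this excerpt can be illustrated with a minimal sketch in plain Python. This is not CouchDB's view API; the documents, field names, and the `map_doc`, `reduce_values`, and `map_reduce` functions are all hypothetical and only show the general shape of the technique.

```python
from collections import defaultdict

# Hypothetical documents; the field names are made up for illustration.
docs = [
    {"tag": "hadoop", "words": 120},
    {"tag": "couchdb", "words": 80},
    {"tag": "hadoop", "words": 200},
]

def map_doc(doc):
    """Map stage: emit intermediate (key, value) pairs for one document."""
    yield doc["tag"], doc["words"]

def reduce_values(key, values):
    """Reduce stage: combine all intermediate values for one key."""
    return sum(values)

def map_reduce(documents):
    # Group the intermediate values by key. Each document is mapped
    # independently of the others, which is what makes the map stage
    # easy to spread across many machines.
    grouped = defaultdict(list)
    for doc in documents:
        for key, value in map_doc(doc):
            grouped[key].append(value)
    # Reduce each key's values; keys are also independent of each other.
    return {key: reduce_values(key, values) for key, values in grouped.items()}

result = map_reduce(docs)
print(result)
```

Here everything runs in a single process. In a real cluster the map calls and the per-key reduce calls would be distributed, but the contract between the two stages stays the same.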

February 2008 - Review Post

Tuesday, March 4th, 2008

Doug McClure started doing an end-of-month review and I think it is a great idea. In fact, it was Doug who first tutored me on how to get my blog site started, so it makes sense that I would continue to copy him. Here’s a quick look at Feb 08:

Hadoop: It’s No Longer a Niche

Friday, February 29th, 2008

Hadoop presentation from Yahoo!/OSCON

Wednesday, February 20th, 2008

Hadoop presentation from Yahoo!/OSCON

Announcing the Hadoop Summit at Yahoo, March 25th, 2008

Wednesday, February 20th, 2008

Announcing the Hadoop Summit at Yahoo, March 25th, 2008
