hadoop
« Previous EntriesElastic Hadoop Clusters with Amazon’s Elastic Block Store
Tuesday, August 26th, 2008Elastic Hadoop Clusters with Amazon’s Elastic Block Store
Do you hadoop?
Making Hadoop More Modular
Thursday, July 24th, 2008Pluggable Hadoop
I especially like the discussion about instrumentation. Hadoop currently has a metrics API that proves disk stats like diskused and disk busy. More metrics and instrumentation for IT management would be a really cool thing.
Do You Hadoop?
Thursday, July 3rd, 2008Maybe you should. Yahoo entered and won a sort contest using 1000 lines of Java code with the open source Hadoop. They were able to use a 900 node cluster and sort 10 billion 100 byte records (1 TeraByte) in 209 seconds. Here is a PDF of the details.
Do we need a cloud standard or just one good old IT management standard?
Tuesday, July 1st, 2008It is my belief that what we today call the “cloud” will really just evolve into a complex IT infrastructure of the future, and in the end, will just be referred to as infrastructure. There is no doubt the traditional IT landscape of the last 20 years is going through a substantial transformation on […]
Hadoop Query Languages
Saturday, June 21st, 2008Tom White discusses some Querly language options with Hadoop. Do you Hadooop? Maybe you should.
Hadoop Query Languages
RAM is the new disk…
Thursday, June 19th, 2008This is a great article that discusses a lot of the relevant “New Development” issues an architect needs to consider when creating a new application. The article includes discussions on Hadoop vs. RDBMS, Seek rates for SSD vs. Hard Drives, and RDBMS Clustering vs. In-Memory Data Grid (IMDG). Here is a quote from the article…
Whether […]
A Ruby MapReduce Framework
Sunday, June 15th, 2008Skynet
Skynet is an open source Ruby implementation of Google’s MapReduce framework, created at Geni. With Skynet, one can easily convert a time-consuming serial task, such as a computationally expensive Rails migration, into a distributed program running on many computers.
http://skynet.rubyforge.org/doc/index.html
Hadoop at Facebook
Friday, June 6th, 2008We have come a long way from those initial days. Facebook has multiple Hadoop clusters deployed now - with the biggest having about 2500 cpu cores and 1 PetaByte of disk space. We are loading over 250 gigabytes of compressed data (over 2 terabytes uncompressed) into the Hadoop file system every day and have hundreds […]
Drupal Barcamp NYC - Podcast #1 Cloud Talk Presentation
Monday, March 31st, 2008| View | Upload your own
Listen to the podcast here…
…
Yahoo, Tata Subsiduary In Research Pact
Monday, March 24th, 2008Yahoo, Tata Subsiduary In Research Pact
Powered by Hadoop
Monday, March 24th, 2008Applications and organizations using Hadoop
Multicore boom needs new developer skills
Saturday, March 22nd, 2008Multicore boom needs new developer skills
More than charity lies behind Microsoft and Intel’s announcement this week that they will donate US$20 million to a pair of U.S. colleges in the hope of spurring advances in parallel, or multicore, programming research
64-Core Desktop Processors to by 2012
“Expect x86 servers with as many as 64 processor cores in […]
Cloud Computing Vendors A to Z
Thursday, March 20th, 2008I HAVE A REVISED VERSION OF THIS POST…
Cloud Vendors A to Z (Revised)
More Hadoop Summit Seats Available! New Venue too.
Thursday, March 13th, 2008They have had to add more seats due to the tremendous interest in this event. I wish I could attend it however I have to teach a class in NYC that week. It sounds like it is going to rock.
Hadoop Summit
Can Your Programming Language Do This? - MapReduce
Monday, March 10th, 2008Can Your Programming Language Do This?
Joel on Software does a nice MapReduce tutorial.
CouchDB from 10,000 Feet
Thursday, March 6th, 2008CouchDB from 10,000 Feet
CouchDB views allow you to filter, collate, and aggregate data. Views are powered by Map/Reduce. The map stage processes key/value pairs to produce intermediate values and reduce then combines intermediate values for particular key. Map/Reduce is inherently parallelizable making it useful on clusters of machines.
Interesting Note:
Damien Katz, the CouchDB creator […]
February 2008 - Review Post
Tuesday, March 4th, 2008Doug McClure started doing an end month review and I think it is a great idea. In fact it is Doug who first tutored me on how to get my blog site started so it makes sense that I would continue to copy him. Here’s a quick look at Feb 08:
Hadoop It’s no Longer a Niche
Friday, February 29th, 2008Hadoop presentation from Yahoo!/OSCON
Wednesday, February 20th, 2008Hadoop presentation from Yahoo!/OSCON
Announcing the Hadoop Summit at Yahoo, March 25th, 2008
Wednesday, February 20th, 2008Announcing the Hadoop Summit at Yahoo, March 25th, 2008
« Previous Entries
