« Cloud Droplet #70 – If the sky should tumble and fall | Home | The Rackspace/Mosso PCI Debate »
Cloudera’s Distribution for Hadoop
By John | March 16, 2009
Looks like the Cloudera boys have come out of stealth mode…
Here is a list of features of the Cloudera distribution of Hadoop:
- RPM Deployment – Never again wonder which files go in which directories and if your component versions are compatible. RPM was designed for this. In addition to Hadoop, we have RPMs for compatible versions of Hive and Pig in this release.
- Standard Linux Service Management – Your IT staff knows how to work with RPMs and init level services. Now they know how to work with Hadoop.
- Public YUM Repository – We’ll make sure it’s easy to stay up to date with the latest stable version of Hadoop.
- Simple Web Based Configuration Assistance – Do you know what the optimal setting for mapred.child.ulimit is? another.arcane.parameter? Well, we have some ideas, and are always learning more. To share that, we’ve created a configurator that asks a few important questions about your hardware and computes sensible values for all of your configuration parameters.
Topics: cloudera, hadoop | No Comments »

