mapreduce
Metrics on eBay’s main Teradata data warehouse include: >2 petabytes of user data; 10s of 1000s of users; millions of queries per day; 72 nodes; >140 GB/sec of I/O, or 2 GB/node/sec, or maybe that’s a peak when the workload is scan-heavy; 100s of production databases being fed in. Metrics on eBay’s Greenplum data warehouse [...]
Last night at our Awsome meetup, Don Brown of Twitpay gave a great presentation on MapReduce and Hadoop.
Ultraparallel Computing.
Here is another great tutorial on using Hadoop MapReduce: Hadoop MapReduce Program In Python Tutorial