ApacheCon NA 2010 Session

Top 10 Lessons Learned from Deploying Hadoop in a Private Cloud

Hadoop, HBase, and friends are built from the ground up to support Big Data, but that doesn't make them easy. Just like with any other relatively new and complex technologies, there are some rough edges and growing pains to manage. I've learned some hard lessons while deploying HBase tables containing billions of rows and dozens of terabytes on OpenLogic's Hadoop infrastructure. Come to this session to learn about some of the "gotchas" you might run into when deploying Hadoop and HBase in your own private cloud and how to avoid them. Here are some general areas we'll explore: * Hard-to-find configuration problems and debugging techniques * Under-documented yet critical features * Deployment recommendations for particular use cases * Advice on how to import Big Data * Using JRuby/Ruby to make life with Hadoop and HBase easier