Welcome to HBase!
News
June 30th, HBase Contributor Workshop (Day after Hadoop Summit)
May 10th, 2010: HBase graduates from Hadoop sub-project to Apache Top Level Project
Signup for HBase User Group Meeting, HUG10 hosted by Trend Micro, April 19th, 2010
HBase User Group Meeting, HUG9 hosted by Mozilla, March 10th, 2010
Sign up for the HBase User Group Meeting, HUG8, January 27th, 2010 at StumbleUpon in SF
September 8th, 2010: HBase 0.20.0 is faster, stronger, slimmer, and sweeter tasting than any previous HBase release. Get it off the Releases page.
ApacheCon in Oakland: November 2-6th, 2009: The Apache Foundation will be celebrating its 10th anniversary in beautiful Oakland by the Bay. Lots of good talks and meetups including an HBase presentation by a couple of the lads.
HBase at Hadoop World in NYC: October 2nd, 2009: A few of us will be talking on Practical HBase out east at Hadoop World: NYC.
HUG7 and HBase Hackathon: August 7th-9th, 2009 at StumbleUpon in SF: Sign up for the HBase User Group Meeting, HUG7 or for the Hackathon or for both (all are welcome!).
June, 2009 -- HBase at HadoopSummit2009 and at NOSQL: See the presentations
March 3rd, 2009 -- HUG6: HBase User Group 6
January 30th, 2009 -- LA Hbackathon:HBase January Hackathon Los Angeles at Streamy in Manhattan Beach
Overview
HBase is the Hadoop database. Use it when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.
HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop. HBase includes:
- Convenient base classes for backing Hadoop MapReduce jobs with HBase tables
- Query predicate push down via server side scan and get filters
- Optimizations for real time queries
- A high performance Thrift gateway
- A REST-ful Web service gateway that supports XML, Protobuf, and binary data encoding options
- Cascading, hive, and pig source and sink modules
- Extensible jruby-based (JIRB) shell
- Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX
- No HBase single point of failure
- Rolling restart for configuration changes and minor upgrades
- Random access performance on par with open source relational databases such as MySQL
Getting Started
Start by installing HBase on a single machine or a very small cluster.
- Learn about HBase by reading the documentation.
- Download HBase from the release page.
Getting Involved
HBase is an open source volunteer project under the Apache Software Foundation. It is a subproject of Hadoop. We encourage you to learn about the project and contribute your expertise. Here are some starter links:
- See our How to Contribute to HBase page.
- Give us feedback: What can we do better?
- Join the mailing list: Meet the community.
- Come hang with us on IRC.
- Become an HBase Fan up on facebook.
- Or up on twitter, HBase.