Apache HBase

7

Transcript of Apache HBase

Page 1: Apache HBase
Page 2: Apache HBase

Google file system

HDFS

Page 3: Apache HBase

Features

• Seamless integration with Hadoop• Distributed operation– Fault tolerance– Load balancing– Easily add/remove nodes

• Non-technical reasons– Large community– Large scale online user cases

Page 4: Apache HBase

When should I use ?

• Not for all problems• Hundreds of millions or billions or rows• Live without extra features provided by

RDBMS (typed columns, secondary indexes, transactions)

• Enough hardware

Page 5: Apache HBase

History

• 2006: BigTable paper published by Google.• 2006 (end of year): HBase development starts.• 2008: HBase becomes Hadoop sub-project.• 2010: HBase becomes Apache top-level

project.

Page 6: Apache HBase

http://hortonworks.com/hadoop/pig/

http://hortonworks.com/hadoop/hbase/

http://pig.apache.org

http://hbase.apache.org

Page 7: Apache HBase

slideshare.net/jefersonm

twitter.com/jefersonm

github.com/jefersonm