HBase is an open source, non-relational, distributed database modeled after Google's BigTable and written in Java. It is developed as part of the Apache Hadoop project and runs on top of HDFS, providing BigTable-like capabilities for Hadoop. That is, it provides a fault-tolerant way of storing large quantities of sparse data: small amounts of information caught within a large collection of empty or unimportant data. Examples include finding the 50 largest items in a group of 2 billion records, or finding the non-zero items that make up less than 0.1% of a huge collection.
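To make the idea of sparse storage more concrete, here is a minimal sketch in plain Java. It is a toy in-memory model and not the HBase API: the class name `SparseTable`, its methods, and the sample row keys and column names are all hypothetical and chosen only for illustration. The point it tries to show is that only cells that were actually written take up space, while a missing cell simply has no entry.

```java
import java.util.Map;
import java.util.TreeMap;

// Toy in-memory model of a sparse table. This is NOT the HBase API;
// all names here are hypothetical and exist only for illustration.
class SparseTable {
    // Row key -> (column name -> value); both levels are kept sorted by key.
    private final TreeMap<String, TreeMap<String, String>> rows = new TreeMap<>();

    // Store a single cell; the row entry is created on first write.
    void put(String rowKey, String column, String value) {
        rows.computeIfAbsent(rowKey, k -> new TreeMap<>()).put(column, value);
    }

    // Return the cell value, or null if that cell was never written.
    String get(String rowKey, String column) {
        Map<String, String> row = rows.get(rowKey);
        return row == null ? null : row.get(column);
    }

    // Count only the cells that actually hold a value.
    int storedCells() {
        int total = 0;
        for (Map<String, String> row : rows.values()) {
            total += row.size();
        }
        return total;
    }

    public static void main(String[] args) {
        SparseTable table = new SparseTable();
        table.put("user#1001", "info:name", "Ada");
        table.put("user#1001", "info:email", "ada@example.com");
        table.put("user#2002", "info:name", "Linus");

        System.out.println(table.get("user#1001", "info:name"));  // prints "Ada"
        System.out.println(table.get("user#2002", "info:email")); // prints "null"
        System.out.println(table.storedCells());                  // prints "3"
    }
}
```

In this sketch the second user has no email cell at all, so nothing is stored for it. A fixed-schema relational row would typically still reserve a (null) column for that value.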
HBase features compression, in-memory operation, and Bloom filters on a per-column basis, as outlined in the original BigTable paper. Tables in HBase can serve as the input and output for MapReduce jobs run in Hadoop, and can be accessed through the Java API as well as through REST, Avro, or Thrift gateway APIs.
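For readers unfamiliar with Bloom filters, the following is a minimal sketch of the general technique in plain Java. It is not HBase's implementation: the class name `SimpleBloomFilter`, the bit-array size, the number of hashes, and the simple double-hashing scheme built on `String.hashCode()` are all assumptions made for illustration. A Bloom filter can answer "definitely not present" or "possibly present", which lets a storage engine skip files that cannot contain a requested key.

```java
import java.util.BitSet;

// Minimal Bloom filter sketch. Sizes and the hash scheme are illustrative
// assumptions; this is not how HBase implements its Bloom filters.
class SimpleBloomFilter {
    private final BitSet bits;
    private final int size;
    private final int hashCount;

    SimpleBloomFilter(int size, int hashCount) {
        this.bits = new BitSet(size);
        this.size = size;
        this.hashCount = hashCount;
    }

    // Derive the i-th bit position from two base hashes (double hashing).
    private int position(String key, int i) {
        int h1 = key.hashCode();
        int h2 = (h1 >>> 16) | 1; // force an odd, non-zero step
        return Math.floorMod(h1 + i * h2, size);
    }

    // Record a key by setting all of its bit positions.
    void add(String key) {
        for (int i = 0; i < hashCount; i++) {
            bits.set(position(key, i));
        }
    }

    // false means the key was definitely never added;
    // true means it may have been added (false positives are possible).
    boolean mightContain(String key) {
        for (int i = 0; i < hashCount; i++) {
            if (!bits.get(position(key, i))) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        SimpleBloomFilter filter = new SimpleBloomFilter(1024, 3);
        filter.add("row-0001");
        System.out.println(filter.mightContain("row-0001")); // prints "true"
    }
}
```

A key that was added always reports `true`. A key that was never added usually reports `false`, but may occasionally report `true`, so a positive answer still has to be confirmed by reading the actual data.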
HBase is not a direct replacement for a classic SQL database, although its performance has improved recently, and it now serves several data-driven websites, including Facebook's Messaging Platform.
In this course we will teach you everything that's needed to build an application using HBase as the database. Things you will learn:
We are sure that this course will help you learn more about how big companies manage and process large amounts of data.
Andrei has spent the past seven years working with cutting-edge technologies such as Big Data, Cloud Computing, Software Defined Data Centers, and Open Computer for top-class telecoms. He also has a strong background and outstanding experience in system administration, design, and programming. He is here to share his knowledge because of his dedication to and passion for the industry.