Apache Hadoop in 2011
2011 was a breakthrough year for Apache Hadoop, as many more mainstream organizations, large and small, turned to Hadoop to manage and process Big Data, while enterprise software and hardware vendors have...
Apache HBase 0.92.0 has been released
Today the Apache HBase community has proudly released Apache HBase 0.92.0, a major new version of the scalable distributed data store inspired by Google’s BigTable. Over 670 issues were addressed, so...
CDH3, update 3 now available
In keeping with our release policy for Cloudera’s Distribution Including Apache Hadoop (CDH), I’m pleased to announce the availability of update 3 for CDH3. As a reminder, we ship updates for our most...
Apache HBase Write Path
Apache HBase is the Hadoop database, and is based on the Hadoop Distributed File System (HDFS). HBase makes it possible to randomly access and update data stored in HDFS, but files in HDFS can only be...
Apache HBase Log Splitting
In the recent blog post about the Apache HBase Write Path, we talked about the write-ahead log (WAL), which plays an important role in preventing data loss should an HBase region server failure occur....
Apache HBase Replication Overview
Apache HBase Replication is a way of copying data from one HBase cluster to a different and possibly distant HBase cluster. It works on the principle that the transactions from the originating cluster...
Apache HBase Replication: Operational Overview
This is the second blog post about Apache HBase replication. The previous blog post, HBase Replication Overview, discussed use cases, architecture and different modes supported in HBase replication. This...
HBase at ApacheCon Europe 2012
Apache HBase will have a notable profile at ApacheCon Europe next month. Clouderan and HBase committer Lars George has two sessions on the schedule: HBase Sizing and Schema Design. Abstract: This talk...
Guide to Using Apache HBase Ports
For those people new to Apache HBase (version 0.90 and later), the configuration of network ports used by the system can be a little overwhelming. In this blog post, you will learn all the TCP ports...
What are HBase znodes?
Apache ZooKeeper is a client/server system for distributed coordination that exposes an interface similar to a filesystem, where each node (called a znode) may contain data and a set of children. Each...