AkbarAhmed.com

Engineering Leadership

Introduction You will need to know the location of binaries, configuration files, and libraries when working with HBase. Directories Configuration /etc/hbase/conf is the location for all of HBase’s configuration files. HBase uses Debian Alternatives, so there are a number of symlinks to the configuration files. /etc/hbase/conf is a symlink to /etc/alternatives/hbase-conf. /etc/alternatives/hbase-conf is a symlink …

Continue reading

Introduction You will need to know the location of binaries, configuration files, and libraries when working with Zookeeper. Zookeeper 3.4.3 is a part of Cloudera Distribution Hadoop (CDH4). Directories /etc/zookeeper/conf /etc/zookeeper/conf is the location for all of Zookeeper’s configuration files. Zookeeper uses Debian Alternatives, so there are a number of symlinks to the configuration files. …

Continue reading

Introduction If you are running Hadoop on a development machine, then it’s likely that you’ll run into a situation where multiple services require port 8080. I recently ran into this issue where both the Pentaho User Console and the Hadoop MapReduce ShuffleHandler were trying to use port 8080. One solution is to change the port …

Continue reading

Introduction These instructions cover a manual installation of the Cloudera CDH4 packages on Ubuntu 12.04 LTS and are based on my following the Cloudera CDH4 Quick Start Guide (CDH4_Quick_Start_Guide_4.0.0.pdf). Installation prerequisites sudo apt-get install curl Verify that Java is installed correctly First, check that Java is setup correctly for your account. echo $JAVA_HOME The output …

Continue reading