Month: June 2012

Google Wallet on Android

comment 1
Google Android

What is the Google Wallet Android app? I have finally had an occasion to use the Google Wallet app on my Galaxy Nexus. So what is the Google Wallet Android app? It’s basically an Android app that acts like a digital credit card that you can use to purchase coffee at Peet’s, medicine at CVS and so on. There is a special reader in the store that you tap your phone against when checking out […]

Install JDK 7 u5 on Ubuntu 12.04 LTS (as a secondary JDK)

comments 3
Linux / Software Development

Introduction I had installed JDK 6.0 update 31 in an earlier post. However, I now need to write a Java application that requires the features available in JDK 7. In this post, I will install JDK 7 update 5 as a secondary JDK, while JDK 6.0 u31 will be the primary JDK. It’s perfectly normal to have multiple JDKs on a single machine to support the requirements of different applications. Fortunately, it’s easy to use […]

Create a DB connection to MySQL in Kettle

comments 4
MySQL / Pentaho

Introduction Creating a DB connection from Kettle to MySQL involves creating a MySQL use who can access the DB in question, installing the JDBC driver, and creating a connection. Install the MySQL JDBC driver Download the MySQL JDBC driver from http://dev.mysql.com/downloads/connector/j/. Login to mysql.com, then click Download. cd ~/Downloads tar -xzf mysql-connector-java-5.1.20.tar.gz cd mysql-connector-java-5.1.20 cp mysql-connector-java-5.1.20-bin.jar ~/bin/data-integration/libext/ Create a MySQL user In this post, I am going to create a connection to the Sakila DB. […]

Create a Kettle repository

Leave a comment
Pentaho

Introduction Open Kettle cd ~/bin/data-integration To run Spoon: ./spoon.sh Create a new repository Run the following in a terminal. mkdir ~/kettle The steps below are performed within the PDI UI. In the Repository Connection dialog box, click the small green plus symbol. In the Select the repository type dialog box, select Kettle file repository. Click OK In the File repository settings dialog box, enter the following information: Base directory: /home/akbar/kettle Read-only repository?: Leave unchecked Hide […]

Install MySQL 5.5 on Ubuntu 12.04 LTS

Leave a comment
Linux / MySQL

Introduction Installing the MySQL package on Ubuntu is extremely simple. Installation Open a terminal and enter the following commands. sudo apt-get install mysql-client mysql-navigator mysql-server Type Y to accept the additional packages. Press Enter. After downloading and during installation, the MySQL configuration dialogs will display in the terminal. In the first dialog, press Enter. Enter a password for the MySQL root user. Press Enter. Reenter the root password. Press Enter. That’s it, MySQL is now […]

Change the Hadoop MapReduce v2 (YARN) ShuffleHandler Port

comment 1
Hadoop

Introduction If you are running Hadoop on a development machine, then it’s likely that you’ll run into a situation where multiple services require port 8080. I recently ran into this issue where both the Pentaho User Console and the Hadoop MapReduce ShuffleHandler were trying to use port 8080. One solution is to change the port used by the Hadoop MapReduce ShuffleHandler, which is what I’m going to configure below. Configuration sudo vi /etc/hadoop/conf/mapred-site.xml Add the […]

Install Sqoop 1.4.1 for Cloudera Hadoop (CHD4) on Ubuntu 12.04 LTS

comment 1
Hadoop

Introduction Sqoop is a tool to import data from an SQL database into Hadoop and/or export data from Hadoop into an SQL database. Sqoop can import/export from HDFS, HBase and Hive. It’s extremely common to use SQL databases as part of the setup in for Hadoop. Often, a SQL database will serve as an upstream datasource, such as a persistence layer for an MQ server, and as a downstream repository, such as a datamart in […]

Install Cloudera Hadoop (CDH4) with YARN (MRv2) in Pseudo mode on Ubuntu 12.04 LTS

comments 25
Hadoop

Introduction These instructions cover a manual installation of the Cloudera CDH4 packages on Ubuntu 12.04 LTS and are based on my following the Cloudera CDH4 Quick Start Guide (CDH4_Quick_Start_Guide_4.0.0.pdf). Installation prerequisites sudo apt-get install curl Verify that Java is installed correctly First, check that Java is setup correctly for your account. echo $JAVA_HOME The output should be: "/usr/lib/jvm/jdk1.6.0_31" Next, check that the JAVA_HOME environment variable is setup correctly for the sudo user. sudo env | […]