Introduction
sed is short for Stream EDitor, a utility that allows you to parse and transform text one line at a time. Along with grep and awk, sed is a useful tool for manipulating text files. It is often overlooked when working with Hadoop, although sed, awk and grep can help speed up processing by preprocessing text before it is sent to a MapReduce job.
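As a rough illustration of the kind of preprocessing described above, the sketch below uses sed to drop empty lines and swap the field delimiter before the data would be handed to a job. The sample records, the temporary file, and the choice of `|` as the new delimiter are assumptions made for the example, not details from the post.

```shell
# Hypothetical sample input: two comma-separated records with an empty line between them.
input=$(mktemp)
printf 'alpha,1\n\nbeta,2\n' > "$input"

# First expression deletes empty lines; second replaces every comma with a pipe.
cleaned=$(sed -e '/^$/d' -e 's/,/|/g' "$input")

printf '%s\n' "$cleaned"
rm -f "$input"
```

In a real workflow the cleaned output would typically be written to a file and then copied into HDFS, for example with `hadoop fs -put`.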
Introduction
I installed JDK 6.0 update 31 in an earlier post. However, I now need to write a Java application that requires features available in JDK 7. In this post, I will install JDK 7 update 5 as a secondary JDK, while JDK 6.0 u31 remains the primary JDK. It’s perfectly normal to have multiple JDKs on a single machine to support the requirements of different applications. Fortunately, it’s easy to use […]
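One common way to pick a JDK per shell session is to point `JAVA_HOME` at the desired install and put its `bin` directory first on `PATH`. The sketch below is not necessarily the approach the post goes on to describe. The install paths and the `use_jdk` helper are hypothetical.

```shell
# Hypothetical install locations; adjust to wherever the JDKs actually live.
JDK6_HOME=/usr/lib/jvm/jdk1.6.0_31
JDK7_HOME=/usr/lib/jvm/jdk1.7.0_05

# use_jdk is an illustrative helper: it selects a JDK for the current shell only.
use_jdk() {
    JAVA_HOME=$1
    PATH="$JAVA_HOME/bin:$PATH"
    export JAVA_HOME PATH
}

# Switch this shell to the secondary JDK; other shells keep the primary one.
use_jdk "$JDK7_HOME"
```

Because the change is limited to the current shell, the system-wide primary JDK is left untouched.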
Introduction
Installing the MySQL packages on Ubuntu is extremely simple.
Installation
Open a terminal and enter the following command.
sudo apt-get install mysql-client mysql-navigator mysql-server
Type Y to accept the additional packages and press Enter. After the download, the MySQL configuration dialogs display in the terminal during installation. In the first dialog, press Enter. Enter a password for the MySQL root user and press Enter. Reenter the root password and press Enter. That’s it, MySQL is now […]
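After the installation finishes, a quick sanity check is to confirm that the client and server binaries can be found on the `PATH`. The sketch below assumes the binaries are named `mysql` and `mysqld`; the `check_cmd` helper is hypothetical and only reports whether a command is visible to the shell.

```shell
# check_cmd is an illustrative helper: prints "installed" if the command is on PATH.
check_cmd() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "installed"
    else
        echo "missing"
    fi
}

# Report the status of the MySQL client and server binaries (names assumed).
for cmd in mysql mysqld; do
    printf '%s: %s\n' "$cmd" "$(check_cmd "$cmd")"
done
```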
Introduction
This is my personal .bash_aliases file, which I use mainly with Cloudera CDH4 (Hadoop) and Pentaho. As a result, many of my aliases are specific to these software packages. I plan to update this post as my .bash_aliases file expands. I will also push the file to Git to make it easier to keep track of changes.
How to create a .bash_aliases file
vi ~/.bash_aliases
Paste the following into the […]
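To give a feel for what such a file contains, here is a minimal sketch. The two aliases are hypothetical examples, not the author's actual entries, and a temporary file stands in for `~/.bash_aliases` so the example leaves the real file alone.

```shell
# Hypothetical aliases for illustration; these are not the author's actual entries.
aliases_file=$(mktemp)
cat > "$aliases_file" <<'EOF'
alias hls='hadoop fs -ls'
alias ll='ls -alF'
EOF

# Load the definitions into the current shell, then remove the temporary file.
. "$aliases_file"
rm -f "$aliases_file"
```

On Ubuntu, the default `~/.bashrc` sources `~/.bash_aliases` when it exists, so aliases saved there are picked up by each new interactive shell.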
The following are instructions for creating an Ubuntu 10.04 (LTS) server instance in the Rackspace cloud.
1. Log in to your Cloud Control Panel account.
2. In the left pane, click Hosting, then click Cloud Servers.
3. Click Add Server.
4. Click Select next to Ubuntu 10.04 LTS (Lucid).
5. Enter a Server Name.
6. Select a Server Size.
7. Click Create Server.