Posts

Showing posts from January, 2014

Common Hadoop Problems

Today we will look at some common problems that people face while installing Hadoop. A few of them are listed below.

1. Problem with ssh configuration.
    error: connection refused to port 22
2. Namenode not reachable
    error: Retrying to connect 127.0.0.1

1. Problem with ssh configuration: In this case you may face many kinds of errors, but the most common one while installing Hadoop is "connection refused" on port 22. Here you should check that the machine you are trying to log in to has an ssh server installed.

    If you are using Ubuntu/Lubuntu, you can install the ssh server using the following command.
        sudo apt-get install openssh-server
    On CentOS or Red Hat you can install the ssh server using the yum package manager.
        sudo yum install openssh-server
    After installing the ssh server, make sure you have configured the keys properly and shared your public key with the machine that you want to log in to. If the problem persists then check for configurat
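
For the key setup mentioned above, here is a minimal sketch of how a public key could be added to an authorized_keys file without creating duplicate entries. The function name add_public_key and the paths in the usage comment are assumptions for illustration; they do not come from the original post.

```shell
# Hypothetical helper (not from the original post): append a public key to an
# authorized_keys file only if that exact line is not already present.
add_public_key() {
    local pubkey_file="$1"
    local auth_file="$2"

    [ -r "$pubkey_file" ] || { echo "cannot read $pubkey_file" >&2; return 1; }

    # Keep the directory and file private; with StrictModes (the sshd default)
    # keys in files that other users can write to may be rejected.
    mkdir -p "$(dirname "$auth_file")"
    chmod 700 "$(dirname "$auth_file")"
    touch "$auth_file"
    chmod 600 "$auth_file"

    local key
    key="$(cat "$pubkey_file")"

    # -q quiet, -x match the whole line, -F treat the key as a fixed string
    if ! grep -qxF -- "$key" "$auth_file"; then
        printf '%s\n' "$key" >> "$auth_file"
    fi
}

# Example usage (paths are assumptions):
#   ssh-keygen -t rsa -f "$HOME/.ssh/id_rsa"
#   add_public_key "$HOME/.ssh/id_rsa.pub" "$HOME/.ssh/authorized_keys"
#   ssh localhost
```

Running the helper twice with the same key leaves a single entry, so it is safe to repeat while troubleshooting.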

HBase Installation On Ubuntu / Lubuntu

Hi Everybody, today we will learn how to install HBase. HBase can be installed in two modes.

1. Standalone Mode
2. Distributed Mode
      Distributed Mode can be of two types: pseudo distributed mode and fully distributed mode.

In this tutorial we will discuss the pseudo distributed mode installation of HBase. I will soon share another post on how to install HBase in fully distributed mode.

UPDATE: Video for this topic is available  here

Following are the steps for HBase installation.

1. Before installing HBase you should already have installed Java and Hadoop. If you have not installed Hadoop, please follow the link  here .
2. Next, choose an HBase version that is compatible with your Hadoop installation. I am using Hadoop 1.0.3, so I am installing HBase 0.94.8 .
3. Download HBase from  HBase 0.94.8  and extract it into "$HOME/hbase". In my case it is "/home/hduser/hbase".
4. Edit $HOME/hbase/conf/hbase-env.sh with following command
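
Step 3 can also be done from a terminal. The following is only a sketch: the function name extract_release, the Downloads location and the archive file name hbase-0.94.8.tar.gz are assumptions, so adjust them to whatever you actually downloaded.

```shell
# Hypothetical helper (not from the original post): unpack a release tarball
# so that its contents land directly inside the destination folder.
extract_release() {
    local archive="$1"
    local dest="$2"

    [ -f "$archive" ] || { echo "archive not found: $archive" >&2; return 1; }

    mkdir -p "$dest"

    # --strip-components=1 drops the top-level folder stored in the archive,
    # so conf/ and bin/ end up directly under $dest
    tar -xzf "$archive" -C "$dest" --strip-components=1
}

# Example usage (archive name and location are assumptions):
#   extract_release "$HOME/Downloads/hbase-0.94.8.tar.gz" "$HOME/hbase"
#   ls "$HOME/hbase/conf"
```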

Hadoop series: Hadoop Introduction

Big Data ... Big Data everywhere... well, that has been the state of the industry as of now. Everybody is talking about big data, hiring big data professionals and starting projects on big data. People who know big data tools are getting very good salaries.

So what exactly is this big data? For me it is a problem. Consider a scenario before smartphones or internet-enabled phones. For shopping we used to go to markets (at least in India), and for paying utility bills, transferring money, booking rail tickets etc. we had to stand in queues or physically go to the place. Now, with more and more usage of the internet (thanks to smartphones), we are doing everything online: shopping, bill payments, investment in the share market, connecting with friends, applying for a passport or driving licence etc. All these things have suddenly increased the amount of digital data we produce. I think big data has been there for quite some time, but now it has come in form of digi

Installing Hive

Hi Guys, today we will learn how to install Hive. It is very easy and needs only a few steps. Before you start installing Hive, you should have already installed Hadoop. If not, please check out the  hadoop installation  post.

Steps to install Hive.

1. Download Hive from  Hive 0.11.0 . If you want to install another version of Hive, check its compatibility with the Hadoop version that you have already installed. In my case I am using Hadoop 1.0.3.
2. Go to the downloads folder, right click and extract hive-0.11.0.tar.gz.
3. Copy the extracted folder into /home/hduser/hive.
4. Edit /etc/bash.bashrc and export HADOOP_HOME if it is not set.
       on Lubuntu
              sudo leafpad /etc/bash.bashrc
       on Ubuntu
              sudo gedit /etc/bash.bashrc
       For people new to Linux, leafpad and gedit are two editors for text files, similar to Notepad in Windows.
5. Insert the following statement in the file.
               export HADOOP_HOME=/home/hduser/hadoop
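
If you prefer the terminal over a text editor, steps 4 and 5 could be sketched as below. The function name ensure_export is an assumption made for this example; it adds the export line only when the file does not already export that variable.

```shell
# Hypothetical helper (not from the original post): add "export NAME=VALUE"
# to a shell rc file unless the file already exports NAME.
ensure_export() {
    local rc_file="$1"
    local name="$2"
    local value="$3"

    touch "$rc_file" || return 1

    # Look for an existing line of the form: export NAME=...
    if grep -Eq "^[[:space:]]*export[[:space:]]+${name}=" "$rc_file"; then
        echo "$name is already exported in $rc_file"
    else
        printf 'export %s=%s\n' "$name" "$value" >> "$rc_file"
    fi
}

# Example usage (writing to /etc/bash.bashrc needs root, so run this from a
# root shell, or try it on your own ~/.bashrc first):
#   ensure_export /etc/bash.bashrc HADOOP_HOME /home/hduser/hadoop
#   source /etc/bash.bashrc && echo "$HADOOP_HOME"
```

An existing export is left untouched rather than overwritten, so a value you set earlier is not silently changed.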

Hadoop series: Pseudo Distributed Mode Hadoop Installation

In this tutorial, we will learn the steps required to set up Hadoop on a single node, also called pseudo distributed mode. However, as part of this series we will also set up Hadoop on multiple machines. We will also learn Map Reduce, Hive, Pig etc. So stay tuned, here it comes .... :)

We need the following things to start with.

Ubuntu: I always prefer Ubuntu as my Linux flavour. However, if you are using a very low end machine with very little RAM, you can install Lubuntu instead. Lubuntu is very lightweight and should work on any low end machine. If you have a Windows machine, you can install Hadoop using Cygwin, VirtualBox or VMware Player.

Here we go... Steps for installation of Hadoop. If you are using Windows, follow steps 1 to 4.

1. Download and install VirtualBox from https://www.virtualbox.org/wiki/Downloads
2. Download Ubuntu from  http://www.ubuntu.com/download/desktop . If you have a low configuration machine then you can use Lubuntu. Download Lubuntu from