Hadoop Tutorial 5 - Steps to Install Hadoop on a Personal Computer (Windows/OS X)

This video explains the steps to install Hadoop on a PC running any operating system, such as Windows or OS X.

Hello. In this session I will talk about the steps to install Hadoop on a personal computer. This is Hassan Mir from zerotoprotraining.com.

We understand that Hadoop is Linux-based software and that it runs on distributed computers. The best way to get good at Hadoop is to do hands-on exercises with it, and to do that you need access to Hadoop. One way of getting access is to have it installed on your personal computer. Most computers at home run either the Windows operating system or OS X, so how do we go about installing Linux-based software on Windows or OS X?

The approach we are going to take will let you install the Hadoop tools on any operating system: Windows, OS X, Linux, or even Solaris. First we will install virtual machine software on our personal computer, which could be a laptop or a desktop. Once the virtual machine software is installed, we will import a virtual machine in which the Linux operating system is running. So now Linux is running on a virtual machine, and the virtual machine is running on your personal computer. This way we have taken the operating system of the personal computer out of the equation, because the virtual machine software is available for most operating systems, and within Linux we will have the Hadoop tools running.

Using a virtual machine is beneficial not only because it takes the host operating system out of the equation, but also because it is very easy to manage the software running within the virtual machine. You can take snapshots, and you can copy or clone virtual machines easily, which makes management and maintenance very easy.

The virtual machine software that I will be using is Oracle VirtualBox. You also have the choice of using VMware Server. Oracle VirtualBox is easier to manage than VMware, but both are good options.

We will take an easy approach: we will not be installing Linux on the virtual machine and then installing Hadoop on Linux. We will simply download an appliance and import the appliance into the virtual machine software. Now that we are in the virtual machine era, we can simply download a whole virtual machine on which somebody else has already installed Linux as well as Hadoop, and it can be downloaded as a file. The term "appliance" technically refers to hardware and software together, and it is also used for soft copies of virtual machines that are downloaded as files, because when you import such a file into virtual machine software you end up with a computer that has software already installed on it. This is a virtual computer, of course.

So the Hadoop services will be running within Linux, and Linux will be running within the virtual machine. On the personal computer we will open up a browser and connect to the virtual machine, and that is how we will access Hadoop.

In terms of hardware requirements, you need 3 to 4 GB of RAM on your host computer. 4 GB is best, because 2 GB will be occupied by the virtual machine, so only 1 to 2 GB will be left for the host computer. If you have only 2 GB you can still try it, but you have to downsize the virtual machine; I'll show in the videos how to do that. Downsizing means you tell the virtual machine to use only 1 GB rather than 2 GB; by default it is designed to use 2 GB. Not much hard disk space is required: 10 GB or so of extra space on your computer will do the job. 5 GB of disk space will be used by the virtual machine itself, 2 GB will be occupied when you download the Hadoop file, and some extra space is required for downloading the virtual machine software itself.

Here are the steps that we will be following. We will download the VirtualBox software and install it. We will then download the Hadoop appliance, import it into VirtualBox, and configure the virtual machine before we start it. Then we will finally start the virtual machine and test the connection from our host computer through the browser.
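
The memory rule of thumb and the import, configure, and start steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure shown in the video: it uses VirtualBox's `VBoxManage` command-line tool to express the steps, and the appliance file name `hadoop-appliance.ova` and VM name `hadoop-vm` are hypothetical placeholders. The 3 GB threshold is one reading of the "3 to 4 GB" guidance. The script only builds and prints the commands; it does not run them.

```python
import shutil
from typing import List

DEFAULT_VM_MEMORY_MB = 2048    # the appliance's default, per the transcript
DOWNSIZED_VM_MEMORY_MB = 1024  # fallback when the host has only about 2 GB
REQUIRED_FREE_DISK_GB = 10     # "10 GB or so" of extra space


def choose_vm_memory_mb(host_ram_gb: float) -> int:
    """Pick the VM memory size from the host's RAM.

    Assumption: hosts with at least 3 GB keep the 2 GB default,
    smaller hosts get the downsized 1 GB setting.
    """
    if host_ram_gb >= 3:
        return DEFAULT_VM_MEMORY_MB
    return DOWNSIZED_VM_MEMORY_MB


def has_enough_disk(path: str = ".",
                    required_gb: float = REQUIRED_FREE_DISK_GB) -> bool:
    """Check whether the drive holding `path` has the suggested free space."""
    free_gb = shutil.disk_usage(path).free / 1024 ** 3
    return free_gb >= required_gb


def virtualbox_commands(appliance_file: str, vm_name: str,
                        host_ram_gb: float) -> List[List[str]]:
    """Build the import, configure, and start commands in order.

    `appliance_file` and `vm_name` are hypothetical; use the names of the
    appliance you actually downloaded and the VM it creates.
    """
    memory_mb = choose_vm_memory_mb(host_ram_gb)
    return [
        ["VBoxManage", "import", appliance_file],
        ["VBoxManage", "modifyvm", vm_name, "--memory", str(memory_mb)],
        ["VBoxManage", "startvm", vm_name],
    ]


if __name__ == "__main__":
    if not has_enough_disk():
        print("Warning: less than 10 GB free on this drive")
    # Example: a host with only 2 GB of RAM gets the downsized VM.
    for cmd in virtualbox_commands("hadoop-appliance.ova", "hadoop-vm",
                                   host_ram_gb=2):
        print(" ".join(cmd))
```

On a host with 4 GB of RAM the same call would keep the 2 GB default. The memory setting has to be applied while the virtual machine is powered off, which matches the order given above: configure first, then start.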
