Hadoop on Windows 8

18
How to install Hadoop on Windows 8 in 10 minutes! Vala Ali Rohani, PhD of Software Engineering [email protected] [email protected] April 2015

Transcript of Hadoop on Windows 8

Page 1: Hadoop on Windows 8

How to install

Hadoop on Windows 8in 10 minutes!

Vala Ali Rohani, PhD of Software Engineering

[email protected]@um.edu.my

April 2015

Page 2: Hadoop on Windows 8

Here, you will learn:

by: Dr. Vala Ali Rohani

Easily install and run a personal and portable Hadoop on your PC

In doing so,

we will install Hortonworks Sandbox VERSION 2.2 – VirtualBox on Windows

Page 3: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

About me

• Postdoctoral scholar in Social Network Analysis

• PhD in Software Engineering (Recommender Systems)

• Social Network Analysis from University of Michigan

Professional Certificates:

• University Lecturer for more than 10 years

• Mining Massive Datasets from Stanford University

• Pattern Discovery in Data Mining from Illinois University

• Process Mining from Eindhoven University of Technology

• Statistical Analysis using SPSS & SAS from University of Malaya

• MongoDB for DBAs from MongoDB

[email protected]

[email protected]

Vala Ali Rohani

Page 4: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

The easiest way to get started with Enterprise Hadoop

Sandbox is a personal, portable Hadoop environment that comes with a dozen

interactive Hadoop tutorials.

Using Sandbox, we can have Hadoop on your PC with Windows OS.

Sandbox includes many of the most exciting developments from the latest HDP

distribution, packaged up in a virtual environment that you can get up and running in 15

minutes!

http://hortonworks.com/products/hortonworks-sandbox/#overview

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

To run the Sandbox you must install one of the supported virtual machine environments on your host machine. Here, we will use Oracle VirtualBox.

April 2015

Page 5: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

Prerequisites:

Hosts: o A 64-bit machine with a chip that supports virtualization. o A BIOS that has been set to enable virtualization support.

• Host Operating Systems: o Windows 7, 8

• Supported Browsers: o Internet Explorer 9 (The Sandbox will not work with Internet Explorer 10)o Firefox – latest stable release o Google Chrome – latest stable release

• At least 4 GB of RAM

• Virtual Machine Environments: o Oracle VirtualBox, version 4.2 or later

To use the Hortonworks Sandbox on Windows you must have the following resources available to you:

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 6: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

How to enable ‘virtualization’ in Bios?• Restart your PC or Laptop• Go to BIOS setup (usually you need to press and keep the ‘Delete’ button)

• In most of the mother boards, ‘virtualization’ option can be found under‘Advanced Chipset settings’

• Enable the ‘virtualization’ option• It’s recommended to shutdown the computer for minimum 10 seconds and

restart it (Cold Restart) to take effect.

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 7: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

by: Dr. Vala Ali Rohani

Installing Oracle VirtualBoxVirtualBox is a powerful x86 and AMD64/Intel64 virtualization product for enterprise as well as home use that is freely available as Open Source Software.

https://www.virtualbox.org/

It can be directly downloaded from https://www.virtualbox.org/wiki/Downloads

(For this tutorial, we downloaded the VirtualBox 4.3.26 for Windows hosts)

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 8: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

Now, that you already installed Oracle VirtualBox as one of the supported virtual machine

environments, you can run the Sandbox on your host machine.

First, you need to download it from Hortonworks website:

http://hortonworks.com/products/hortonworks-sandbox/#install

>> Please download HDP Sandbox 2.2 for Virtualbox

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 9: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

1. Open the Oracle VM VirtualBox Manager by Double clicking its icon in your desktop:

To install Sandbox:

2. The Oracle VM Virtualization Manager window opens.

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

http://hortonworks.com/wp-content/uploads/unversioned/pdfs/InstallingHortonworksSandbox2onWindowsusingVB.pdf

Page 10: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

3. Change the Auto-Capture preference. File->Preferences and select Input in the left navigation bar. Uncheck Auto-Capture Keyboard

To install Sandbox:

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 11: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

4. Import the Sandbox appliance file: File->Import Appliance

To install Sandbox:

The Import Virtual Appliance screen opens . To open the file browser, click on icon.

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 12: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

5. In file browser screen, you make sure to select the correct appliance as shown in the picture.

To install Sandbox:

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 13: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

6. You return to the Import Virtual Appliance screen. Click Next

To install Sandbox:

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 14: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

7. The Appliance settings screen appears. The default settings work. If you have more then 4Gb of physical RAM installed, you may wish to allocate more RAM to the VM – 4GB of RAM in the Virtual Appliance will improve the performance. Click Import.

To install Sandbox:

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 15: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

9. Turn on the Sandbox. Select the Sandbox appliance and the right click on it. Then, click on Start icon in displayed menu.

A console window opens and displays an information screen. Click OK to clear the info screen.

To install Sandbox:

8. The appliance will be imported.

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 16: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

10. Wait while the VM boots up. When the process is complete, the console displays the login instructions for the Sandbox.

To install Sandbox:

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 17: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

11. To initiate your Hortonworks Sandbox session, please open a browser and enter this address:

To install Sandbox:

http://127.0.0.1:8888

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015

Page 18: Hadoop on Windows 8

Contents

• About me• What is Sanbox?• Prerequisites • Installing VirtualBox• Installing Sandbox

12. The Sandbox GUI is displayed.

Congratulations! You successfully installed Hortonworks Sandbox V2.2 on your machine.

To install Sandbox:

By: Vala Ali Rohani, PhD

[email protected]

[email protected]

April 2015