0% found this document useful (0 votes)

39 views51 pages

CC Lab Manual-New

The document outlines a laboratory record for a Cloud Computing course, detailing various experiments and procedures related to cloud technologies, including installation of VirtualBox, Google App Engine, and Hadoop. Each experiment includes aims, procedures, and results, demonstrating practical applications of cloud computing concepts. The document serves as a guide for students in the Department of Artificial Intelligence and Data Science for the academic year 2023-2024.

Uploaded by

zachjebaraj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views51 pages

CC Lab Manual-New

Uploaded by

zachjebaraj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 51

CCS335 – CLOUD COMPUTING

LABORATORY

LABORATORY RECORD

DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA

SCIENCE
ACADEMIC YEAR 2023-2024 (ODD SEMESTER)
TABLE OF CONTENT

PAGE
S.NO DATE NAME OF THE EXPERIMENT MARKS SIGN
NO

Install Virtualbox/VMware/ Equivalent open source cloud

1 Workstation with different flavours of Linux or Windows
OS on top of windows 8 and above.

Install a C compiler in the virtual machine created using a

2
virtual box and execute Simple Programs

Install Google App Engine. Create a hello world app and

3
other simple web applications using python/java.

4 Use the GAE launcher to launch the web applications.

Simulate a cloud scenario using CloudSim and run a

5
scheduling algorithm that is not present in CloudSim.

Find a procedure to transfer the files from one virtual

6
machine to another virtual machine.

Install Hadoop single node cluster and run simple

7
applications like wordcount.

Creating and Executing Your First Container Using

8
Docker

9 Run a Container from Docker Hub

EX NO. : 1

DATE:
Install Virtualbox / VMware Workstation with different flavours of linux or
windows OS on top of windows7 or 8.

Aim:
To Install Virtualbox / VMware Workstation with different flavours of linux or windows
OS on top of windows7 or 8.

PROCEDURE:

Steps to install Virtual Box:

1. Download the Virtual box exe and click the exe file…and select next button..

2. Click the next button..

3. Click the next button

4. Click the YES button..

5. Click the install button…

6. Then installation was completed..the show virtual box icon on desktop screen….
Steps to import Open nebula sandbox:
1. Open Virtual box
2. File import Appliance
3. Browse OpenNebula-Sandbox-5.0.ova file
4. Then go to setting, select Usb and choose USB 1.1
5. Then Start the Open Nebula
6. Login using username: root, password:opennebula
Steps to create Virtual Machine through opennebula
1. Open Browser, type localhost:9869
2. Login using username: oneadmin, password: opennebula
3. Click on instances, select VMs then follow the steps to create Virtaul machine
a. Expand the + symbol
b. Select user oneadmin
c. Then enter the VM name,no.of instance, cpu.
d. Then click on create button.
e. Repeat the steps the C,D for creating more than one VMs.
APPLICATIONS:
There are various applications of cloud computing in today’s network world. Many search engines and social
websites are using the concept of cloud computing like www.amazon.com, hotmail.com, facebook.com, linkedln.com
etc. the advantages of cloud computing in context to scalability is like reduced risk , low cost testing
,ability to segment the customer base and auto-scaling based on application load.

RESULT:
Thus the procedure to run the virtual machine of different configuration.
EX.NO.:2
DATE:
Install a C compiler in the virtual machine created using virtual box and
execute Simple Programs

Aim:
To Install a C compiler in the virtual machine created using virtual box and
execute Simple Programs`

PROCEDURE:

Steps to import .ova file:

1. Open Virtual box
2. File import Appliance
3. Browse ubuntu_gt6.ova file
4. Then go to setting, select Usb and choose USB 1.1
5. Then Start the ubuntu_gt6
6. Login using username: dinesh, password:99425.
Steps to run c program:

1. Open the terminal

2. Type cd /opt/axis2/axis2-1.7.3/bin then press enter
3. gedit hello.c
4. gcc hello.c
5. ./a.out

1. Type cd /opt/axis2/axis2-1.7.3/bin then press enter

2. Type gedit first.c

3. Type the c program

4. Running the C program

5. Display the output:

APPLICATIONS:
Simply running all programs in grid environment.

RESULT:

Thus the simple C programs executed successfully.

EX NO.:3
DATE:
Install Google App Engine. Create hello world app and other simple web
applications using python/java.

Aim:
To Install Google App Engine. Create hello world app and other simple web
applications using python/java.
Procedure:

1. Install Google Plugin for Eclipse

Read this guide – how to install Google Plugin for Eclipse. If you install the Google App Engine
Java SDK together with “Google Plugin for Eclipse“, then go to step 2, Otherwise, get the Google
App Engine Java SDK and extract it.

2. Create New Web Application Project

In Eclipse toolbar, click on the Google icon, and select “New Web Application Project…”

Figure – New Web Application Project

Figure – Deselect the “Google Web ToolKit“, and link your GAE Java SDK via the “configure
SDK” link.
Click finished, Google Plugin for Eclipse will generate a sample project automatically.

3. Hello World
Review the generated project directory.
Nothing special, a standard Java web project structure.

HelloWorld/
src/
...Java source code...
META-INF/
...other configuration...
war/
...JSPs, images, data files...
WEB-INF/
...app configuration...
lib/
...JARs for libraries...
classes/
...compiled classes...
Copy
The extra is this file “appengine-web.xml“, Google App Engine need this to run and deploy the
application.

File : appengine-web.xml

<?xml version="1.0" encoding="utf-8"?>

<appengine-web-app xmlns="http://appengine.google.com/ns/1.0">
<application></application>
<version>1</version>

<system-properties>
<property name="java.util.logging.config.file" value="WEB-INF/logging.properties"/>
</system-properties>

</appengine-web-app>
Copy

4. Run it local
Right click on the project and run as “Web Application“.

Eclipse console :

//...
INFO: The server is running at http://localhost:8888/
30 Mac 2012 11:13:01 PM com.google.appengine.tools.development.DevAppServerImpl start
INFO: The admin console is running at http://localhost:8888/_ah/admin
Copy
Access URL http://localhost:8888/, see output
and also the hello world servlet – http://localhost:8888/helloworld

5. Deploy to Google App Engine

In this demonstration, I created an application ID, named “mkyong123”, and put it in appengine-
web.xml.

File : appengine-web.xml

<?xml version="1.0" encoding="utf-8"?>

<appengine-web-app xmlns="http://appengine.google.com/ns/1.0">
<application>mkyong123</application>
<version>1</version>

<system-properties>
<property name="java.util.logging.config.file" value="WEB-INF/logging.properties"/>
</system-properties>

</appengine-web-app>
Copy
To deploy, see following steps:

Figure 1.1 – Click on GAE deploy button on the toolbar.

Figure 1.2 – Sign in with your Google account and click on the Deploy button.
Figure 1.3 – If everything is fine, the hello world web application will be deployed to this URL –
http://mkyong123.appspot.com/

Result:

Thus the simple application was created successfully.

EX. NO.:4
DATE:
Simulate a cloud scenario using CloudSim and run a scheduling
algorithm that is not present in CloudSim.

Aim:
To simulate a cloud scenario using CloudSim and run a scheduling
algorithm that is not present in CloudSim.
Steps:

How to use CloudSim in Eclipse

CloudSim is written in Java. The knowledge you need to use CloudSim is basic Java
programming and some basics about cloud computing. Knowledge of programming IDEs
such as Eclipse or NetBeans is also helpful. It is a library and, hence, CloudSim does not have
to be installed. Normally, you can unpack the downloaded package in any directory, add it to
the Java classpath andit is ready to be used. Please verify whether Java is available on your
system.

To use CloudSim in Eclipse:

1. Download CloudSim installable files
from https://code.google.com/p/cloudsim/downloads/list and unzip
2. Open Eclipse
3. Create a new Java Project: File -> New
4. Import an unpacked CloudSim project into the new Java Project
The first step is to initialise the CloudSim package by initialising the CloudSim
library, as follows

CloudSim.init(num_user, calendar, trace_flag)

5. Data centres are the resource providers in CloudSim; hence, creation
of data centres is a second step. To create Datacenter, you need the
DatacenterCharacteristics object that stores the properties of a data centre
such as architecture, OS, list of machines, allocation policy that covers the
time or spaceshared, the time zone and its price:
Datacenter datacenter9883 = new Datacenter(name, characteristics, new
VmAllocationPolicySimple(hostList), s
6. The third step is to create a broker:
DatacenterBroker broker = createBroker();
7. The fourth step is to create one virtual machine unique ID of the VM,
userId ID of the VM’s owner, mips, number Of Pes amount of CPUs,
amount of RAM, amount of bandwidth, amount of storage, virtual
machine monitor, and cloudletScheduler policy for cloudlets:
Vm vm = new Vm(vmid, brokerId, mips, pesNumber, ram, bw, size,
vmm, new CloudletSchedulerTimeShared())
8. Submit the VM list to the broker:
broker.submitVmList(vmlist)
9. Create a cloudlet with length, file size, output size, and utilisation model:
Cloudlet cloudlet = new Cloudlet(id, length, pesNumber, fileSize, outputSize, utilizationModel,
utilizationMode
10. Submit the cloudlet list to the broker:
broker.submitCloudletList(cloudletList)
Sample Output from the Existing Example:
Starting
CloudSimExample1...
Initialising...
Starting CloudSim
version 3.0 Datacenter_0
is starting...
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>null
Broker is
starting...
Entities started.
: Broker: Cloud Resource List received with 1
resource(s) 0.0: Broker: Trying to Create VM #0
in Datacenter_0
: Broker: VM #0 has been created in Datacenter #2, Host #0
0.1: Broker: Sending cloudlet 0 to VM #0
400.1 : Broker: Cloudlet 0 received
: Broker: All Cloudlets executed.
Finishing ... 400.1: Broker: Destroying
VM #0
Broker is shutting down...
Simulation: No more future
events
CloudInformationService: Notify all CloudSim entities for
shutting down. Datacenter_0 is shutting down...
Broker is shutting
down ... Simulation
completed.
Simulation completed.
========== OUTPUT ==========
Cloudlet ID STATUS Data center ID VM ID Time Start Time
Finish Time 0 SUCCESS 2 0 400
0.1 400.1
*****Datacenter:
Datacenter_0***** User id
Debt
3 35.6

CloudSimExample1 finished!

RESULT:

The simulation was successfully executed.

EX.NO.:5
DATE:
Use GAE launcher to launch the web applications.

Aim:
To Use GAE launcher to launch the web applications.

Steps:

MakingyourFirstApplication

Nowyouneedtocreateasimpleapplication. Wecouldusethe “+”optiontohave the

launcher make us an application – but instead we willdo it byhand to get a better sense of
what is going on.

Makeafolder for your Google App Engineapplications. I amgoing tomakethe Folder

on my Desktop called “apps” – the pathtothis folder is:

C:\Documents and Settings\csev\Desktop\apps

And then make a sub-•‐folder in within apps called “ae-•01-•trivial” – the path to this
folder wouldbe:
C:\ Documents and Settings \csev\Desktop\apps\ae-•01-•trivial
Using a text editor such as JEdit (www.jedit.org), create a file called app.yaml in the
ae-•01-•trivial folder with the following contents:
application: ae-01-trivial
version: 1
runtime: python api_version: 1
handlers:- url: /.*
script: index.py
Note: Please do notcopyandpaste these lines into yourtexteditor– youmightend up
with strange characters – simply type them into your editor.
Then create a file in the ae-•01-•trivial folder called index.py with three lines in it:
print 'Content-Type: text/plain'
print ' '
print 'Hello there Chuck'
Then start the GoogleAppEngineLauncher program that can be found
under Applications. Use the File -•> Add Existing Application command
and navigate into the apps directory and select the ae-•01-•trivial folder. Once
you have added the application, select it so that youcancontrolthe application
usingthelauncher.
Once youhave selected yourapplicationand press Run. Aftera few moments your
application willstart and the launcher willshow a little green icon next to your
application. Then press Browse to open a browser pointing at your application
which is running at http://localhost:8080/

Paste http://localhost:8080 into your browser and you should see your
application as follows:

Justforfun, edittheindex.pytochange thename“Chuck” to yourownname

and press Refresh in the browser to verify your updates.

Watching the Log

Youcanwatchthe internal logofthe actions that the webserver is performing

when youare interactingwithyourapplication in the browser. Select your
application in the Launcher and press the Logs button to bring up a log window:

EachtimeyoupressRefreshinyourbrowser–youcanseeitretrievingthe
output with a GET request.
Dealing With Errors

With two files to edit, there are two general categories of errors that you may
encounter. If youmake a mistake onthe app.yamlfile, the App Engine willnotstart
and your launcher will show a yellow icon near your application:

To get more detail on what is going wrong, take a look at the log for the application:
In this instance – the mistake is mis--‐indenting the last line in the app.yaml (line 8).
Ifyoumake asyntaxerror in the index.pyfile, a Pythontrace backerrorwillappear in
yourbrowser.

The error you need to see is likely to be the last few lines of the output – in this
case I made a Python syntax error on line one of our one-•‐line application.
Reference: http://en.wikipedia.org/wiki/Stack_trace
When you make a mistake in the app.yaml file – you must the fix the mistake
and attempt to start the application again.
If you make a mistake in a file like index.py, you can simply fix the file and
press refresh in your browser – there is no need to restart the server.

Shutting Down the Server

To shut down the server, use the Launcher, select your application and press the
Stop button.

Result:

Thus the GAE web application was created.

EX.NO:6
DATE:
Find a procedure to transfer the files from one virtual machine
to another virtual machine.

Aim:
To Find a procedure to transfer the files from one virtual machine
to another virtual machine.

Steps:

1. You can copy few (or more) lines with copy & paste mechanism.
For this you need to share clipboard between host OS and guest OS, installing
Guest Addition on both the virtual machines (probably setting bidirectional
and restarting them). You copy from guest OS in the clipboard that is shared
with the host OS.
Then you paste from the host OS to the second guest OS.
2. You can enable drag and drop too with the same method (Click on the
machine, settings, general, advanced, drag and drop: set to bidirectional )
3. You can have common Shared Folders on both virtual machines and
use one of the directory shared as buffer to copy.
Installing Guest Additions you have the possibility to set Shared Folders too.
As you put a file in a shared folder from host OS or from guest OS, is
immediately visible to the other. (Keep in mind that can arise some problems
for date/time of the files when there are different clock settings on the
different virtual machines).
If you use the same folder shared on more machines you can exchange files
directly copying them in this folder.
4. You can use usual method to copy files between 2 different computer with
client-server application. (e.g. scp with sshd active for linux, winscp... you can
get some info about SSH servers e.g. here)
You need an active server (sshd) on the receiving machine and a client on
the sending machine. Of course you need to have the authorization setted
(via password or, better, via an automatic authentication method).
Note: many Linux/Ubuntu distribution install sshd by default: you can see if
it is running with pgrep sshd from a shell. You can install with sudo apt-get
install openssh-server.
5. You can mount part of the file system of a virtual machine via NFS or
SSHFS on the other, or you can share file and directory with Samba.
You may find interesting the article Sharing files between guest and
host without VirtualBox shared folders with detailed step by step
instructions.
You should remember that you are dialling with a little network of machines
with different operative systems, and in particular:
 Each virtual machine has its own operative system running on and acts
as a physical machine.
 Each virtual machine is an instance of a program owned by an user in the
hosting operative system and should undergo the restrictions of the user in the
hosting OS.
E.g Let we say that Hastur and Meow are users of the hosting machine, but
they did not allow each other to see their directories (no read/write/execute
authorization). When each of them run a virtual machine, for the hosting OS
those virtual machine are two normal programs owned by Hastur and Meow
and cannot see the private directory of the other user. This is a restriction due
to the hosting OS. It's easy to overcame it: it's enough to give authorization to
read/write/execute to a directory or to chose a different directory in which both
users can read/write/execute.
 Windows likes mouse and Linux fingers. :-)
I mean I suggest you to enable Drag & drop to be cosy with the Windows
machines and the Shared folders or to be cosy with Linux.
When you will need to be fast with Linux you will feel the need of ssh-keygen and
to Generate once SSH Keys to copy files on/from a remote machine without writing
password anymore. In this way it functions bash auto-completion remotely too!

PROCEDURE:
Steps:
1. Open Browser, type localhost:9869
2. Login using username: oneadmin, password: opennebula
3. Then follow the steps to migrate VMs
a. Click on infrastructure
b. Select clusters and enter the cluster name
c. Then select host tab, and select all host
d. Then select Vnets tab, and select all vnet
e. Then select datastores tab, and select all datastores
f. And then choose host under infrastructure tab
g. Click on + symbol to add new host, name the host then click on create.
4. on instances, select VMs to migrate then follow the stpes
a. Click on 8th icon ,the drop down list display
b. Select migrate on that ,the popup window display
c. On that select the target host to migrate then click on migrate.
Before migration
Host:SACET

Host:one-sandbox
After Migration:

Host:one-sandbox
Host:SACET

APPLICATIONS:
Easily migrate your virtual machine from one pc to another.

Result:
Thus the file transfer between VM was successfully completed…..
EX NO.:7
DATE :
Install Hadoop single node cluster and run simple
applications like wordcount.

Aim:
To Install Hadoop single node cluster and run simple applications like wordcount.

Steps:

Install Hadoop

Step 1: Click here to download the Java 8 Package. Save this file in your home
directory.

Step 2: Extract the Java Tar File.

Command: tar -xvf jdk-8u101-linux-i586.tar.gz

Fig: Hadoop Installation – Extracting Java Files

Step 3: Download the Hadoop 2.7.3 Package.

Command: wget https://archive.apache.org/dist/hadoop/core/hadoop-2.7.3/hadoop-

2.7.3.tar.gz

Fig: Hadoop Installation – Downloading Hadoop

Step 4: Extract the Hadoop tar File.

Command: tar -xvf hadoop-2.7.3.tar.gz

Fig: Hadoop Installation – Extracting Hadoop Files Step

5: Add the Hadoop and Java paths in the bash file (.bashrc). Open. bashrc

file. Now, add Hadoop and Java Path as shown below.

Command: vi .bashrc

Fig: Hadoop Installation – Setting Environment Variable

Then, save the bash file and close it.

For applying all these changes to the current Terminal, execute the source command.
Command: source .bashrc

Fig: Hadoop Installation – Refreshing environment variables

To make sure that Java and Hadoop have been properly installed on your system and can be
accessed through the Terminal, execute the java -version and hadoop version commands.

Command: java -version

Fig: Hadoop Installation – Checking Java Version
Command: hadoop version

Fig: Hadoop Installation – Checking Hadoop Version

Step 6: Edit the Hadoop Configuration files.

Command: cd hadoop-2.7.3/etc/hadoop/

Command: ls

All the Hadoop configuration files are located in hadoop-2.7.3/etc/hadoop directory as you can
see in the snapshot below:

Fig: Hadoop Installation – Hadoop Configuration Files

Step 7: Open core-site.xml and edit the property mentioned below inside configuration
tag:

core-site.xml informs Hadoop daemon where NameNode runs in the cluster. It contains
configuration settings of Hadoop core such as I/O settings that are common to HDFS &
MapReduce.

Command: vi core-site.xml

Step 8: Edit hdfs-site.xml and edit the property mentioned below inside
configuration tag:

hdfs-site.xml contains configuration settings of HDFS daemons (i.e. NameNode, DataNode,

Secondary NameNode). It also includes the replication factor and block size of HDFS.

Command: vi hdfs-site.xml
Fig: Hadoop Installation – Configuring hdfs-site.xml

1
2 <?xml version="1.0" encoding="UTF-8"?>
3 <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
4 <property>
5 <name>dfs.replication</name>
6 <value>1</value>
7 </property>
<property>
8 <name>dfs.permission</name>
9 <value>false</value>
10 </property>
11 </configuration>

Step 9: Edit the mapred-site.xml file and edit the property mentioned below
inside configuration tag:

mapred-site.xml contains configuration settings of MapReduce application like number of JVM

that can run in parallel, the size of the mapper and the reducer process, CPU cores available for a
process, etc.

In some cases, mapred-site.xml file is not available. So, we have to create the mapred- site.xml
file using mapred-site.xml template.

Command: cp mapred-site.xml.template mapred-site.xml

Command: vi mapred-site.xml.

Step 10: Edit yarn-site.xml and edit the property mentioned below inside
configuration tag:

yarn-site.xml contains configuration settings of ResourceManager and NodeManager like

application memory management size, the operation needed on program & algorithm, etc.

Command: vi yarn-site.xml

Fig: Hadoop Installation – Configuring yarn-site.xml

Step 11: Edit hadoop-env.sh and add the Java Path as mentioned below:
1
2 <?xml version="1.0">
3 <configuration>
4 <property>
5 <name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
6 </property>
7 <property>
8 <name>yarn.nodemanager.auxservices.mapreduce.shuffle.class</
9 name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
1 </property>
0
1

hadoop-env.sh contains the environment variables that are used in the script to run Hadoop
like Java home path, etc.
Command: vi hadoop–env.sh

Fig: Hadoop Installation – Configuring hadoop-env.sh Step

12: Go to Hadoop home directory and format the NameNode.

Command: cd

Command: cd hadoop-2.7.3

Command: bin/hadoop namenode -format

Fig: Hadoop Installation – Formatting NameNode

This formats the HDFS via NameNode. This command is only executed for the first time.
Formatting the file system means initializing the directory specified by the dfs.name.dir
variable.

Never format, up and running Hadoop filesystem. You will lose all your data stored in the
HDFS.

Step 13: Once the NameNode is formatted, go to hadoop-2.7.3/sbin directory and start all the daemons.

Command: cd hadoop-2.7.3/sbin

Either you can start all daemons with a single command or do it individually.

Command: ./start-all.sh

The above command is a combination of start-dfs.sh, start-yarn.sh & mr-jobhistory-

daemon.sh

Or you can run all the services individually as below:

Start NameNode:

The NameNode is the centerpiece of an HDFS file system. It keeps the directory tree of all files
stored in the HDFS and tracks all the file stored across the cluster.

Command: ./hadoop-daemon.sh start namenode

Fig: Hadoop Installation – Starting NameNode

Start DataNode:
On startup, a DataNode connects to the Namenode and it responds to the requests from
the Namenode for different operations.

Command: ./hadoop-daemon.sh start datanode

Fig: Hadoop Installation – Starting DataNode

Start ResourceManager:

ResourceManager is the master that arbitrates all the available cluster resources and thus
helps in managing the distributed applications running on the YARN system. Its work
is to manage each NodeManagers and the each application’s ApplicationMaster.

Command: ./yarn-daemon.sh start resourcemanager

Fig: Hadoop Installation – Starting ResourceManager

Start NodeManager:

The NodeManager in each machine framework is the agent which is responsible for
managing containers, monitoring their resource usage and reporting the same to the
ResourceManager.

Command: ./yarn-daemon.sh start nodemanager

See Batch Details

Fig: Hadoop Installation – Starting NodeManager

Start JobHistoryServer:

JobHistoryServer is responsible for servicing all job history related requests from client.

Command: ./mr-jobhistory-daemon.sh start historyserver

Step 14: To check that all the Hadoop services are up and running, run the below
command.

Command: jps

Fig: Hadoop Installation – Checking Daemons

Step 15: Now open the Mozilla browser and go
to localhost:50070/dfshealth.html to check the NameNode interface.

Fig: Hadoop Installation – Starting WebUI

Congratulations, you have successfully installed a single node Hadoop cluster

Result:
Thus the Hadoop one cluster was installed and simple applications executed
successfully.
VIVA QUESTIONS AND ANSWERS

1. Define Cloud Computing with example.

Cloud computing is a model for enabling convenient, on-demand network access to a
shared pool of configurable computing resources (e.g., networks, servers, storage, applications,
and services) that can be rapidly provisioned and released with minimal management effort or
service provider interaction.

2. What is the working principle of Cloud Computing?

The cloud is a collection of computers and servers that are publicly accessible via the
Internet. This hardware is typically owned and operated by a third party on a consolidated
basis in one or more data center locations. The machines can run any combination of
operating systems.

3. What are the advantages and disadvantages of Cloud Computing?

Advantages
Lower-Cost Computers for Users
Improved Performance
Lower IT Infrastructure Costs
Fewer Maintenance Issues
Lower Software Costs
Instant Software Updates
Increased Computing Power
Unlimited Storage Capacity
Increased Data Safety
Improved Compatibility Between Operating Systems
Improved Document Format Compatibility
Easier Group Collaboration
Universal Access to Documents
Latest Version Availability
Removes the Tether to Specific Devices
Disadvantages
Requires a Constant Internet Connection
Doesn’t Work Well with Low-Speed Connections
Can Be Slow
Features Might Be Limited
Stored Data Might Not Be Secure
If the Cloud Loses Your Data, You’re Screwed
4. What is distributed system?
A distributed system is a software system in which components located on networked computers
communicate and coordinate their actions by passing messages. The components interact with each other in
order to achieve a common goal.
Three significant characteristics of distributed systems are:
 Concurrency of components
 Lack of a global clock
 Independent failure of components
 What is cluster?
 Acomputingclusterconsistsofinterconnectedstand-
alonecomputerswhichworkcooperativelyasasingleintegratedcomputingresource.Inthepast,clus
teredcomputersystemshavedemonstrated
5. What is grid computing?
Grid Computing enables virtuals organizations to share geographically distributed resources as they
pursue common goals, assuming the absence of central location, central control, omniscience, and an
existing trust relationship.
(or)
 Gridtechnologydemandsnewdistributedcomputingmodels,software/middlewaresupport,networkp
rotocols,andhardwareinfrastructures.
 Nationalgridprojectsarefollowedbyindustrialgridplat-
formdevelopmentbyIBM,Microsoft,Sun,HP,Dell,Cisco,EMC,PlatformCo mputing,andothers.
Newgridserviceproviders(GSPs)andnewgridapplicationshaveemergedrapidly,similartothegrowtho
fInternetandwebservicesinthepasttwodecades.
 gridsystemsareclassifiedinessentiallytwocategories:computationalordatagridsandP2Pgrids.
6. What are the business areas needs in Grid computing?
 Life Sciences
 Financial services
 Higher Education
 Engineering Services
 Government
 Collaborative games

7. List out the Grid Applications:

 Application partitioning that involves breaking the problem into discrete pieces
 Discovery and scheduling of tasks and workflow
 Data communications distributing the problem data where and when it is required
 Provisioning and distributing application codes to specific system nodes
 Autonomic features such as self-configuration, self-optimization, self-recovery and self-
management
8. List some grid computing toolkits and frameworks?
 Globus Toolkit Globus Resource Allocation Manager(GRAM)
 Grid Security Infrastructure(GSI)
 Information Services
 Legion, Condor and Condor-G
 NIMROD, UNICORE, NMI.
9. What are Desktop Grids?
These are grids that leverage the compute resources of desktop computers.
Because of the true (but unfortunate) ubiquity of Microsoft® Windows® operating
system in corporations, desktop grids are assumed to apply to the Windows environment.
The Mac OS™ environment is supported by a limited number of vendors.
10. What are Server Grids?
 Some corporations, while adopting Grid Computing , keep it limited to server resources that are
within the purview of the IT department.
 Special servers, in some cases, are bought solely for the purpose of creating an internal “utility
grid” with resources made available to various departments.
 No desktops are included in server grids. These usually run some flavor of the Unix/Linux
operating system.
11. Define Opennebula.
OpenNebula is an open source management tool that helps virtualized data centers oversee private clouds,
public clouds and hybrid clouds ..... OpenNebula is vendor neutral, as well as platform- and API-agnostic. It
can use KVM, Xen or VMware hypervisors.

12. Define Eclipse.

Eclipse is an integrated development environment (IDE) used in computer programming, and is the most
widely used Java IDE. It contains a base workspace and an extensible plug-in system for customizing the
environment.
13. Define Netbeans.
NetBeans is an open-source integrated development environment (IDE) for developing with Java, PHP,
C++, and other programming languages. NetBeans is also referred to as a platform of modular components
used for developing Java desktop applications.

14. Define Apache Tomcat.

Apache Tomcat (or Jakarta Tomcat or simply Tomcat) is an open source servlet container developed by
the Apache Software Foundation (ASF). Tomcat implements the Java Servlet and the JavaServer Pages
(JSP) specifications from Sun Microsystems, and provides a "pure Java" HTTP web server environment for
Java code to run."

15. What is private cloud?

The private cloud is built within the domain of an intranet owned by a single organization. Therefore,
they are client owned and managed. Their access is limited to the owning clients and their partners. Their
deployment was not meant to sell capacity over the Internet through publicly accessible interfaces. Private
clouds give local users a flexible and agile private infrastructure to run service workloads within their
administrative domains.

16. What is public cloud?

A public cloud is built over the Internet, which can be accessed by any user who has paid for the
service. Public clouds are owned by service providers. They are accessed by subscription. Many companies
have built public clouds, namely Google App Engine, Amazon AWS, Microsoft Azure, IBM Blue Cloud,
and Salesforce Force.com. These are commercial providers that offer a publicly accessible remote interface
for creating and managing VM instances within their proprietary infrastructure.

17. What is hybrid cloud?

A hybrid cloud is built with both public and private clouds, Private clouds can also support
a hybrid cloud model by supplementing local infrastructure with computing capacity from an external public
cloud. For example, the research compute cloud (RC2) is a private cloud built by IBM.

18. What is a Community Cloud ?

A community cloud in computing is a collaborative effort in which infrastructure is shared between
several organizations from a specific community with common concerns (security, compliance, jurisdiction,
etc.), whether managed internally or by a third-party and hosted internally or externally. Thisis controlled
and used by a group of organizations that have shared interest. The costs are spread over fewer users than a
public cloud (but more than a private cloud

19. Define IaaS?

The IaaS layer offers storage and infrastructure resources that is needed to deliver the Cloud services.
It only comprises of the infrastructure or physical resource. Top IaaS Cloud Computing Companies: Amazon
(EC2), Rackspace, GoGrid, Microsoft, Terremark and Google.

20. Define PaaS?

PaaS provides the combination of both, infrastructure and application. Hence, organisations
using PaaS don’t have to worry for infrastructure nor for services. Top PaaS Cloud Computing
Companies: Salesforce.com, Google, Concur Technologies, Ariba, Unisys and Cisco..

21. Define SaaS?

In the SaaS layer, the Cloud service provider hosts the software upon their servers. It can be defined as
a in model in which applications and softwares are hosted upon the server and made available to
customers over a network. Top SaaS Cloud Computing Companies: Amazon Web Services, AppScale,
CA Technologies, Engine Yard, Salesforce and Windows Azure.
22. What is meant by virtualization?
Virtualizationisacomputerarchitecturetechnologybywhichmultiplevirtualmachines (VMs)are
multipl exedin the same hardwar emachine.Theideaof VMs canbe dated back to the 1960s. The purpose
of a VM is to enhance resource sharing by many users and improve computer performance interms of
resource utilization and application flexibility.

23. What are the implementation levels of virtualization?

The virtualization types are following
1. OS-level virtualization
2. ISA level virtualization
3. User-ApplicationLevel virtualization
4. hardware level virtualization
5. library level virtualization

24. List the requirements of VMM?

There are three requirements for a VMM.
First, a VMM should provide an environment for programs which is essentially identical to the
original machine.
Second, programs run in this environment should show, at worst, only minor decreases in speed.
Third, a VMM should be in complete control of the system resources.

25. Explain Host OS and Guest OS?

A comparison of the differences between a host system, a guest system, and a virtual machine within
a virtual infrastructure.
A host system (host operating system) would be the primary & first installed operating system. If you
are using a bare metal Virtualization platform like Hyper-V or ESX, there really isn’t a host operating
system besides the Hypervisor. If you are using a Type-2 Hypervisor like VMware Server or Virtual Server,
the host operating system is whatever operating system those applications are installed into.
A guest system (guest operating system) is a virtual guest or virtual machine (VM) that is installed
under the host operating system. The guests are the VMs that you run in your virtualization platform.

26. Write the steps for live VM migration?

The five steps for live VM migration is
Stage 0: Pre-Migration
Active VM on Host A
Alternate physical host may be preselected for migration
Block devices mirrored and free resources maintained
Stage 1: Reservation
Initialize a container on the target host
Stage 2: Iterative pre-copy
Enable shadow paging
Copy dirty pages in successive rounds.
Stage 3: Stop and copy
Suspend VM on host A
Generate ARP to redirect traffic to Host B
Synchronize all remaining VM state to Host B
Stage 4: Commitment
VM state on Host A is released
Stage 5: Activation
VM starts on Host B
Connects to local devices
Resumes normal operation

27. .Define Globus Toolkit: Grid Computing Middleware

 Globus is open source grid software that addresses the most challenging problmes in distributed
resources sharing.
 The Globus Toolkit includes software services and libraries for distributed security, resource
management, monitoring and discovery, and data management.

28. Define Blocks in HDFS

 A disk has a block size, which is the minimum amount of data that it can read or write. Filesystems for
a single disk build on this by dealing with data in blocks, which are an integral multiple of the disk block
size. Filesystem blocks are typically a few kilobytes in size, while disk blocks are normally 512 bytes.
This is generally transparent to the filesystem user who is simply reading or writing a file—of whatever
length.
29. Define Namenodes and Datanodes
 An HDFS cluster has two types of node operating in a master-worker pattern:
 a namenode (the master) and
 a number of datanodes(workers).
 The namenode manages the filesystem namespace. It maintains the filesystem tree and the metadata
for all the files and directories in the tree. This information is stored persistently on the local disk in
the form of two files: the namespace image and the edit log.
 The namenode also knows the datanodes on which all the blocks for a given file are located, however,
it does not store block locations persistently, since this information is reconstructed from datanodes
when the system starts.

30. Define HADOOP.

Hadoop is an open source, Java-based programming framework that supports the processing and storage of
extremely large data sets in a distributed computing environment. It is part of the Apache project sponsored
by the Apache Software Foundation.

31. Define HDFS.

Hadoop Distributed File System (HDFS) is a Java-based file system that provides scalable and reliable data
storage that is designed to span large clusters of commodity servers. HDFS, MapReduce, and YARN form
the core of Apache™ Hadoop®.

32. Write about HADOOP.

Hadoop was created by Doug Cutting and Mike Cafarella in 2005. Cutting, who was working at Yahoo! at
the time, named it after his son's toy elephant. It was originally developed to support distribution for the
Nutch search engine project.

33. Definition of Grid Portal:

A Grid Portal provides an efficient infrastructure to put Grid-empowered applications on corporate
Intranet/Internet.

34. Define GAE.

Google App Engine (often referred to as GAE or simply App Engine) is a Platform as a Service and cloud
computing platform for developing and hosting web applications in Google-managed data centers.
Applications are sandboxed and run across multiple servers. App Engine offers automatic scaling for web
applications—as the number of requests increases for an application, App Engine automatically allocates
more resources for the web application to handle the additional demand.
35. What is Cloudsim?
CloudSim is a simulation toolkit that supports the modeling and simulation of the core functionality of
cloud, like job/task queue, processing of events, creation of cloud entities(datacenter, datacenter brokers,
etc), communication between different entities, implementation of broker policies, etc. This toolkit allows
to:

 Test application services in a repeatable and controllable environment.

 Tune the system bottlenecks before deploying apps in an actual cloud.
 Experiment with different workload mix and resource performance scenarios on simulated
infrastructure for developing and testing adaptive application provisioning techniques

36. Core features of CloudSim are:

 The Support of modeling and simulation of large scale computing environment as federated cloud
data centers, virtualized server hosts, with customizable policies for provisioning host resources to
virtual machines and energy-aware computational resources
 It is a self-contained platform for modeling cloud’s service brokers, provisioning, and allocation
policies.
 It supports the simulation of network connections among simulated system elements.
 Support for simulation of federated cloud environment, that inter-networks resources from both
private and public domains.
 Availability of a virtualization engine that aids in the creation and management of multiple
independent and co-hosted virtual services on a data center node.
 Flexibility to switch between space shared and time shared allocation of processing cores to
virtualized services.

37. Uses of Cloudsim.

 Load Balancing of resources and tasks

 Task scheduling and its migrations
 Optimizing the Virtual machine allocation and placement policies
 Energy-aware Consolidations or Migrations of virtual machines
 Optimizing schemes for Network latencies for various cloud scenarios

38. Define OpenStack.

OpenStack is a cloud operating system that controls large pools of compute, storage, and networking
resources throughout a datacenter, all managed and provisioned through APIs with common authentication
mechanisms.A dashboard is also available, giving administrators control while empowering their users to
provision resources through a web interface.
39. Define Trystack.
TryStack is a great way to take OpenStack for a spin without having to commit to a full
deployment.
This free service lets you test what the cloud can do for you, offering networking, storage and compute
instances, without having to go all in with your own hardware.
It’s a labor of love spearheaded by three Red Hat OpenStack experts Will Foster, Kambiz
Aghaiepour and Dan Radez.
TryStack’s set-up must bear the load of anyone who wants to use it, but instead of an equally
boundless budget and paid staff, it was originally powered by donated equipment and volunteers from
Cisco, Dell, Equinix, NetApp, Rackspace and Red Hat who pulled together for this OpenStack Foundation
project.

40. Define Hadoop.

Hadoop is an open-source software framework for storing data and running applications on clusters
of commodity hardware. It provides massive storage for any kind of data, enormous processing power and
the ability to handle virtually limitless concurrent tasks or jobs.

CC Lab Record
No ratings yet
CC Lab Record
48 pages
CCS335 Lab Manual FINAL
No ratings yet
CCS335 Lab Manual FINAL
47 pages
Ccs335-Cloud Computing Lab Record
No ratings yet
Ccs335-Cloud Computing Lab Record
73 pages
CCS335 - Cloud Computing Lab Manual
No ratings yet
CCS335 - Cloud Computing Lab Manual
49 pages
Cloud Lab
No ratings yet
Cloud Lab
59 pages
Cloud Computing Lab Guide
No ratings yet
Cloud Computing Lab Guide
59 pages
CS8711-CC Lab Finish
No ratings yet
CS8711-CC Lab Finish
48 pages
Cloud Computing Lab Record
No ratings yet
Cloud Computing Lab Record
31 pages
Lab File Cloud Computing
No ratings yet
Lab File Cloud Computing
22 pages
Cloud Computing Lab Manual
No ratings yet
Cloud Computing Lab Manual
50 pages
Ccs335-Cloud Computing Laboratory-778800227-Cc Lab Manual
No ratings yet
Ccs335-Cloud Computing Laboratory-778800227-Cc Lab Manual
55 pages
CCS335 - Cloud Computing Lab Manual
No ratings yet
CCS335 - Cloud Computing Lab Manual
49 pages
CC Manual - Ref
No ratings yet
CC Manual - Ref
61 pages
CCS340 Cyber Security Lab
No ratings yet
CCS340 Cyber Security Lab
60 pages
Ccs335-Cloud Computing Laboratory
No ratings yet
Ccs335-Cloud Computing Laboratory
61 pages
Cloud Computing Lab Guide
67% (3)
Cloud Computing Lab Guide
61 pages
Cs8711-Cloud Computing Laboratory-778800227-Cc Lab Manual
No ratings yet
Cs8711-Cloud Computing Laboratory-778800227-Cc Lab Manual
55 pages
Cloud Computing Record
No ratings yet
Cloud Computing Record
48 pages
Cloud Computing Laboratory
No ratings yet
Cloud Computing Laboratory
56 pages
CCLab Manual 1
No ratings yet
CCLab Manual 1
57 pages
Cloud Computing CC Lab Manual - 240125 - 135558
No ratings yet
Cloud Computing CC Lab Manual - 240125 - 135558
51 pages
Cs8711-Cloud Computing Laboratory-778800227-Cc Lab Manual
No ratings yet
Cs8711-Cloud Computing Laboratory-778800227-Cc Lab Manual
58 pages
Cloud Computing Final 12-29
No ratings yet
Cloud Computing Final 12-29
18 pages
Madha Institute of Engineering and Technology
No ratings yet
Madha Institute of Engineering and Technology
60 pages
CC Lab Manual-3
No ratings yet
CC Lab Manual-3
76 pages
CC Lab Manual
No ratings yet
CC Lab Manual
40 pages
CS8711 Cloud Computing Lab Manual
No ratings yet
CS8711 Cloud Computing Lab Manual
47 pages
CC Lab Manual
No ratings yet
CC Lab Manual
90 pages
Cloud Computing Lab Manual Fibal
No ratings yet
Cloud Computing Lab Manual Fibal
53 pages
Cloud Computing Lab Manual
100% (1)
Cloud Computing Lab Manual
61 pages
CC Lab
No ratings yet
CC Lab
46 pages
CC Manual Final
No ratings yet
CC Manual Final
56 pages
Ccs335 CC Lab Manual Cce &csbs
No ratings yet
Ccs335 CC Lab Manual Cce &csbs
49 pages
CC Lab Manual
No ratings yet
CC Lab Manual
58 pages
Labfile Cryptography
No ratings yet
Labfile Cryptography
27 pages
CC Lab Manual
No ratings yet
CC Lab Manual
57 pages
CCS335 Cloud Computing Programs
No ratings yet
CCS335 Cloud Computing Programs
55 pages
CC Manual Final
No ratings yet
CC Manual Final
55 pages
Cloud Computing File
No ratings yet
Cloud Computing File
56 pages
Cloud Computing Lab Manual
No ratings yet
Cloud Computing Lab Manual
77 pages
Cloud Computing Lab Manual Final
No ratings yet
Cloud Computing Lab Manual Final
72 pages
CC Lab Manual 1-6
No ratings yet
CC Lab Manual 1-6
55 pages
Gaurav Kumar
No ratings yet
Gaurav Kumar
38 pages
CC Lab Record Print-May2025
No ratings yet
CC Lab Record Print-May2025
51 pages
Cloudcomputinglabmanual Final
No ratings yet
Cloudcomputinglabmanual Final
66 pages
CC Record - 2023-1 (New2)
No ratings yet
CC Record - 2023-1 (New2)
44 pages
CC New Lab Manual
No ratings yet
CC New Lab Manual
54 pages
Cloud Lab Manual
No ratings yet
Cloud Lab Manual
61 pages
Cloudcomputinglabmanual Cse
No ratings yet
Cloudcomputinglabmanual Cse
64 pages
KCS751A Cloudcomputinglabmanual
No ratings yet
KCS751A Cloudcomputinglabmanual
69 pages
Department of Computer Science and Engineering: Ccs335 - Cloud Computing Laboratory Record Note
No ratings yet
Department of Computer Science and Engineering: Ccs335 - Cloud Computing Laboratory Record Note
52 pages
CCS335 Lab Manual
No ratings yet
CCS335 Lab Manual
51 pages
CC Lab 1 Updated 2
No ratings yet
CC Lab 1 Updated 2
74 pages
Cloud Computing Lab Manual Ccs335
50% (2)
Cloud Computing Lab Manual Ccs335
56 pages
KCS751A Cloudcomputinglabmanual
No ratings yet
KCS751A Cloudcomputinglabmanual
65 pages
Ccs335 Lab Manual-2
No ratings yet
Ccs335 Lab Manual-2
67 pages
CC Lab 2
No ratings yet
CC Lab 2
39 pages
CC Print Only Observation
No ratings yet
CC Print Only Observation
33 pages
MarkzTools2 Manual
No ratings yet
MarkzTools2 Manual
30 pages
Report of Gym Website
100% (2)
Report of Gym Website
33 pages
Type T Array (1..10) of Integer S T A: T B: T C: S D: Array (1..10) of Integer
No ratings yet
Type T Array (1..10) of Integer S T A: T B: T C: S D: Array (1..10) of Integer
3 pages
StarCraft Manual
54% (13)
StarCraft Manual
8 pages
Multi-Agent Systems: Tom Holvoet, Hoang Tung Dinh
No ratings yet
Multi-Agent Systems: Tom Holvoet, Hoang Tung Dinh
4 pages
Course Name: IAA202 Student Name: Chế Công Đại Instructor Name: Mai Hoang Dinh Lab Due Date: 9/2/2022
No ratings yet
Course Name: IAA202 Student Name: Chế Công Đại Instructor Name: Mai Hoang Dinh Lab Due Date: 9/2/2022
8 pages
Microprocessor Programming Solution
No ratings yet
Microprocessor Programming Solution
39 pages
Computer Architecture - Memory System
100% (1)
Computer Architecture - Memory System
22 pages
Compiler Building Tutorial: Jack W. Crenshaw
No ratings yet
Compiler Building Tutorial: Jack W. Crenshaw
306 pages
NorDig-Unified Test Specification Ver 2.0
No ratings yet
NorDig-Unified Test Specification Ver 2.0
231 pages
Ict Action Plan 2023-2024
No ratings yet
Ict Action Plan 2023-2024
1 page
Data Structures for CS Students
No ratings yet
Data Structures for CS Students
3 pages
Asianic Notebooks Pricelist
No ratings yet
Asianic Notebooks Pricelist
1 page
Program Mandate Overview
No ratings yet
Program Mandate Overview
5 pages
Merged OSEI 041P
No ratings yet
Merged OSEI 041P
13 pages
Ade Ci - CD
No ratings yet
Ade Ci - CD
15 pages
VxRail Appliance - VxRail Installation Procedures-VxRail E660 - E660F - E660N
100% (1)
VxRail Appliance - VxRail Installation Procedures-VxRail E660 - E660F - E660N
102 pages
Role of Data For Emerging Technologies
87% (15)
Role of Data For Emerging Technologies
12 pages
Aau Cs Model Exit 2025
No ratings yet
Aau Cs Model Exit 2025
18 pages
NOTE - Elevator Controls - Manualzz
100% (1)
NOTE - Elevator Controls - Manualzz
107 pages
Building A Modern Data Center Principles and Strategies of Design by Scott D Lowe
No ratings yet
Building A Modern Data Center Principles and Strategies of Design by Scott D Lowe
255 pages
Curriculum Vitae of Rifat
No ratings yet
Curriculum Vitae of Rifat
2 pages
TROOPERS13-Dirty Use of USSD Codes in Cellular-Ravi Borgaonkor
100% (1)
TROOPERS13-Dirty Use of USSD Codes in Cellular-Ravi Borgaonkor
35 pages
Forensic Magicube (Magiq-2) : Hard Drive Duplication
No ratings yet
Forensic Magicube (Magiq-2) : Hard Drive Duplication
2 pages
Automobile CAD - Lab - Manual - CBGS - 17-18
No ratings yet
Automobile CAD - Lab - Manual - CBGS - 17-18
24 pages
2023 IEEE Machine-Generated - Text - A - Comprehensive - Survey - of - Threat - Models - and - Detection - Methods
No ratings yet
2023 IEEE Machine-Generated - Text - A - Comprehensive - Survey - of - Threat - Models - and - Detection - Methods
26 pages
TETRA System Release 7.0: Mobility Management
No ratings yet
TETRA System Release 7.0: Mobility Management
40 pages
Computer Asssignment 6
No ratings yet
Computer Asssignment 6
3 pages
Ger-4900a - Ur Version 6 Cpu Module & Compatibe Firmware
No ratings yet
Ger-4900a - Ur Version 6 Cpu Module & Compatibe Firmware
3 pages
SQL PLSQL
No ratings yet
SQL PLSQL
227 pages

CC Lab Manual-New

Uploaded by

CC Lab Manual-New

Uploaded by

CCS335 – CLOUD COMPUTING

DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA

Install Virtualbox/VMware/ Equivalent open source cloud

Install a C compiler in the virtual machine created using a

Install Google App Engine. Create a hello world app and

4 Use the GAE launcher to launch the web applications.

Simulate a cloud scenario using CloudSim and run a

Find a procedure to transfer the files from one virtual

Install Hadoop single node cluster and run simple

Creating and Executing Your First Container Using

9 Run a Container from Docker Hub

Steps to install Virtual Box:

2. Click the next button..

4. Click the YES button..

Steps to import .ova file:

1. Open the terminal

1. Type cd /opt/axis2/axis2-1.7.3/bin then press enter

2. Type gedit first.c

4. Running the C program

Thus the simple C programs executed successfully.

1. Install Google Plugin for Eclipse

2. Create New Web Application Project

Figure – New Web Application Project

<?xml version="1.0" encoding="utf-8"?>

5. Deploy to Google App Engine

<?xml version="1.0" encoding="utf-8"?>

Figure 1.1 – Click on GAE deploy button on the toolbar.

Thus the simple application was created successfully.

How to use CloudSim in Eclipse

To use CloudSim in Eclipse:

CloudSim.init(num_user, calendar, trace_flag)

The simulation was successfully executed.

Nowyouneedtocreateasimpleapplication. Wecouldusethe “+”optiontohave the

Makeafolder for your Google App Engineapplications. I amgoing tomakethe Folder

C:\Documents and Settings\csev\Desktop\apps

Justforfun, edittheindex.pytochange thename“Chuck” to yourownname

Watching the Log

Youcanwatchthe internal logofthe actions that the webserver is performing

Shutting Down the Server

Thus the GAE web application was created.

Step 2: Extract the Java Tar File.

Command: tar -xvf jdk-8u101-linux-i586.tar.gz

Fig: Hadoop Installation – Extracting Java Files

Step 3: Download the Hadoop 2.7.3 Package.

Command: wget https://archive.apache.org/dist/hadoop/core/hadoop-2.7.3/hadoop-

Fig: Hadoop Installation – Downloading Hadoop

Step 4: Extract the Hadoop tar File.

Command: tar -xvf hadoop-2.7.3.tar.gz

file. Now, add Hadoop and Java Path as shown below.

Fig: Hadoop Installation – Setting Environment Variable

Fig: Hadoop Installation – Refreshing environment variables

Command: java -version

Fig: Hadoop Installation – Checking Hadoop Version

Step 6: Edit the Hadoop Configuration files.

Fig: Hadoop Installation – Hadoop Configuration Files

Fig: Hadoop Installation – Configuring core-site.xml

hdfs-site.xml contains configuration settings of HDFS daemons (i.e. NameNode, DataNode,

mapred-site.xml contains configuration settings of MapReduce application like number of JVM

Command: cp mapred-site.xml.template mapred-site.xml

Fig: Hadoop Installation – Configuring mapred-site.xml

yarn-site.xml contains configuration settings of ResourceManager and NodeManager like

Fig: Hadoop Installation – Configuring yarn-site.xml

Fig: Hadoop Installation – Configuring hadoop-env.sh Step

12: Go to Hadoop home directory and format the NameNode.

Command: bin/hadoop namenode -format

Fig: Hadoop Installation – Formatting NameNode

The above command is a combination of start-dfs.sh, start-yarn.sh & mr-jobhistory-

Or you can run all the services individually as below:

Command: ./hadoop-daemon.sh start namenode

Fig: Hadoop Installation – Starting NameNode

Command: ./hadoop-daemon.sh start datanode

Fig: Hadoop Installation – Starting DataNode

Command: ./yarn-daemon.sh start resourcemanager

Fig: Hadoop Installation – Starting ResourceManager