0% found this document useful (0 votes)

33 views23 pages

Cloud Computing Chapter4

Chapter 4 of 'Cloud Computing: Theory and Practice' discusses the challenges and architectural styles for cloud applications, emphasizing performance isolation, reliability, and latency issues. It introduces workflows and coordination mechanisms, including the MapReduce programming model and a case study on the GrepTheWeb application, which demonstrates scalable, on-demand infrastructure. The chapter also covers ZooKeeper for distributed coordination and various workflow patterns essential for managing complex processes in cloud environments.

Uploaded by

nairakash2004

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views23 pages

Cloud Computing Chapter4

Uploaded by

nairakash2004

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

Chapter 4 – Cloud Computing

Applications and Paradigms

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 1
Contents
 Challenges for cloud computing.
 Architectural styles for cloud applications.
 Workflows - coordination of multiple activities.
 Coordination based on a state machine model.
 The MapReduce programming model.
 A case study: the GrepTheWeb application.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 2
Challenges for cloud application development
 Performance isolation - nearly impossible to reach in a real system,
especially when the system is heavily loaded.

 Reliability - major concern; server failures expected when a large

number of servers cooperate for the computations.

 Cloud infrastructure exhibits latency and bandwidth fluctuations

which affect the application performance.

 Performance considerations limit the amount of data logging; the

ability to identify the source of unexpected results and errors is
helped by frequent logging.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 3
Architectural styles for cloud applications
 Based on the client-server paradigm.
 Stateless servers - view a client request as an independent
transaction and respond to it; the client is not required to first
establish a connection to the server.
 Often clients and servers communicate using Remote Procedure
Calls (RPCs).
 Simple Object Access Protocol (SOAP) - application protocol for
web applications; message format based on the XML. Uses TCP
or UDP transport protocols.
 Representational State Transfer (REST) - software architecture
for distributed hypermedia systems. Supports client
communication with stateless servers, it is platform independent,
language independent, supports data caching, and can be used
in the presence of firewalls.
Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 4
Workflows
 Process description - structure describing the tasks to be
executed and the order of their execution. Resembles a flowchart.

 Case - an instance of a process description.

 State of a case at time t - defined in terms of tasks already

completed at that time.

 Events - cause transitions between states.

 The life cycle of a workflow - creation, definition, verification, and

enactment; similar to the life cycle of a traditional program
(creation, compilation, and execution).
Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 5
Dynamic Workflows Static Workflows Static Programs Dynamic Programs
Workflow
Programming
Description Component
Component Language
Language Libraries
Database
User User
Planning Automatic
Engine Programming

Workflow Computer
Description Program

Verification
Engine Compiler

Workflow Object
Workflow Program
Description Code
Database Libraries

Case Activation Record Data

Processor
Enactment
Running
Engine Run-Time Program
Unanticipated Exception the Process
Handling Modification Requests

(a) Workflow (b) Program

Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 6
Safety and liveness

 Desirable properties of workflows.

 Safety  nothing “bad” ever happens.

 Liveness  something “good” will eventually happen.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 7
Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 8
Basic workflow patterns
 Workflow patterns - the temporal relationship among the tasks of a process
 Sequence - several tasks have to be scheduled one after the completion of
the other.
 AND split - both tasks B and C are activated when task A terminates.
 Synchronization - task C can only start after tasks A and B terminate.
 XOR split - after completion of task A, either B or C can be activated.
 XOR merge - task C is enabled when either A or B terminate.
 OR split - after completion of task A one could activate either B, C, or both.
 Multiple Merge - once task A terminates, B and C execute concurrently;
when the first of them, say B, terminates, then D is activated; then, when C
terminates, D is activated again.
 Discriminator – wait for a number of incoming branches to complete before
activating the subsequent activity; then wait for the remaining branches to
finish without taking any action until all of them have terminated. Next,
resets itself.
Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 9
Basic workflow patterns (cont’d)
 N out of M join - barrier synchronization. Assuming that M tasks
run concurrently, N (N<M) of them have to reach the barrier before
the next task is enabled. In our example, any two out of the three
tasks A, B, and C have to finish before E is enabled.
 Deferred Choice - similar to the XOR split but the choice is not
made explicitly; the run-time environment decides what branch to
take.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 10
B A

A B C A AND AND C
C B

a b c

B A B

A XOR XOR C A OR

C B C

d e f

B B

A AND XOR D A AND DIS D

C C

g h

B B

A XOR X
A AND C 2/3 E
C

i j

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 11
Coordination - ZooKeeper
 Cloud elasticity  distribute computations and data across multiple
systems; coordination among these systems is a critical function in a
distributed environment.
 ZooKeeper
 Distributed coordination service for large-scale distributed systems.
 High throughput and low latency service.
 Implements a version of the Paxos consensus algorithm.
 Open-source software written in Java with bindings for Java and C.
 The servers in the pack communicate and elect a leader.
 A database is replicated on each server; consistency of the replicas is
maintained.
 A client connect to a single server, synchronizes its clock with the
server, and sends requests, receives responses and watch events
through a TCP connection.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 12
Server Server Server Server Server

Client Client Client Client Client Client Client Client

(a)

Follower
Replicated
database Follower
Write
processor Leader
Follower

Follower
Atomic broadcast Follower

WRITE READ WRITE

(b) (c)
Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 13
Zookeeper communication
 Messaging layer  responsible for the election of a new leader
when the current leader fails.

 Messaging protocols use:

 Packets - sequence of bytes sent through a FIFO channel.
 Proposals - units of agreement.
 Messages - sequence of bytes atomically broadcast to all
servers.
 A message is included into a proposal and it is agreed upon
before it is delivered.
 Proposals are agreed upon by exchanging packets with a
quorum of servers, as required by the Paxos algorithm.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 14
Zookeeper communication (cont’d)

 Messaging layer guarantees:

 Reliable delivery: if a message m is delivered to one server, it will

be eventually delivered to all servers.

 Total order: if message m is delivered before message n to one

server, it will be delivered before n to all servers.

 Causal order: if message n is sent after m has been delivered by

the sender of n, then m must be ordered before n.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 15
Shared hierarchical namespace similar to a
file system; znodes instead of inodes

/a /b /c

/a/1 /a/2 /b/1 /c/1 /c/2

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 16
ZooKeeper service guarantees
 Atomicity - a transaction either completes or fails.

 Sequential consistency of updates - updates are applied

strictly in the order they are received.

 Single system image for the clients - a client receives the same
response regardless of the server it connects to.

 Persistence of updates - once applied, an update persists until

it is overwritten by a client.

 Reliability - the system is guaranteed to function correctly as

long as the majority of servers function correctly.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 17
Zookeeper API
 The API is simple - consists of seven operations:

 Create - add a node at a given location on the tree.

 Delete - delete a node.

 Get data - read data from a node.

 Set data - write data to a node.

 Get children - retrieve a list of the children of the node.

 Synch - wait for the data to propagate.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 18
Elasticity and load distribution
 Elasticity  ability to use as many servers as necessary to optimally
respond to cost and timing constraints of an application.
 How to divide the load
 Transaction processing systems  a front-end distributes the incoming
transactions to a number of back-end systems. As the workload
increases new back-end systems are added to the pool.
 For data-intensive batch applications two types of divisible workloads are
possible:
 modularly divisible  the workload partitioning is defined a priori.

 arbitrarily divisible  the workload can be partitioned into an

arbitrarily large number of smaller workloads of equal, or very close

size.
 Many applications in physics, biology, and other areas of
computational science and engineering obey the arbitrarily divisible
load sharing model.
Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 19
MapReduce philosophy
1. An application starts a master instance, M worker instances for the
Map phase and later R worker instances for the Reduce phase.
2. The master instance partitions the input data in M segments.
3. Each map instance reads its input data segment and processes
the data.
4. The results of the processing are stored on the local disks of the
servers where the map instances run.
5. When all map instances have finished processing their data, the R
reduce instances read the results of the first phase and merge the
partial results.
6. The final results are written by the reduce instances to a shared
storage server.
7. The master instance monitors the reduce instances and when all of
them report task completion the application is terminated.

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 20
Application

Master instance

1 1 7

Map
Segment 1
instance 1 Local disk
Reduce
Segment 12 Map instance 1
Segment instance 2 Local disk Shared
Map Reduce storage
Segment 3 instance 3 Local disk instance 2

Shared
storage
Reduce
3 4 5 instance R 6
Map
Segment M instance M Local disk

Input data Map phase Reduce phase

Cloud Computing: Theory and Practice.

Dan C. Marinescu Chapter 4 21
Case study: GrepTheWeb
 The application illustrates the means to
 create an on-demand infrastructure.
 run it on a massively distributed system in a manner that allows
it to run in parallel and scale up and down, based on the number
of users and the problem size.
 GrepTheWeb
 Performs a search of a very large set of records to identify
records that satisfy a regular expression.
 It is analogous to the Unix grep command.
 The source is a collection of document URLs produced by the
Alexa Web Search, a software system that crawls the web every
night.
 Uses message passing to trigger the activities of multiple
controller threads which launch the application, initiate
processing, shutdown the system, and create billing records.
Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 22
Input records
(a) The simplified workflow Regular
expression SQS

showing the inputs: Controller

- the regular expression. Output
EC2
- the input records generated Status Simple
DB Cluster S3

by the web crawler. (a)

- the user commands to report
the current status and to Billing
queue
terminate the processing. Launch
queue
Monitor
queue

Shutdown
(b) The detailed workflow. queue
Billing

The system is based on

service

message passing between Controller

several queues; four Launch Monitor Shutdown Billing

controller controller controller controller
controller threads
periodically poll their
Put file
associated input queues, Status
Output

retrieve messages, and DB

HDHS Input
Get file
carry out the required Hadoop Cluster on

actions Amazon SimpleDB Amazon SE2 Amazon S3

(b)
Cloud Computing: Theory and Practice.
Dan C. Marinescu Chapter 4 23

Cloud Computing Chapter4
100% (1)
Cloud Computing Chapter4
23 pages
Cloud Computing Applications Guide
No ratings yet
Cloud Computing Applications Guide
36 pages
Cloud Applications-Edited-2
No ratings yet
Cloud Applications-Edited-2
30 pages
Chapter 4
No ratings yet
Chapter 4
36 pages
Module 3-2
No ratings yet
Module 3-2
26 pages
Dynamic Parallel Data Processing in Heterogeneous Cloud
No ratings yet
Dynamic Parallel Data Processing in Heterogeneous Cloud
30 pages
Chapter 1
No ratings yet
Chapter 1
31 pages
Introduction: Cloud Computing: Theory and Practice. Dan C. Marinescu
No ratings yet
Introduction: Cloud Computing: Theory and Practice. Dan C. Marinescu
31 pages
Chapter 1
No ratings yet
Chapter 1
42 pages
Cloud Computing Chapter1
No ratings yet
Cloud Computing Chapter1
35 pages
CloudComputing Module2
No ratings yet
CloudComputing Module2
17 pages
CC Unit-1 Lecture Notes
No ratings yet
CC Unit-1 Lecture Notes
143 pages
Module-1 Cloud Notes
No ratings yet
Module-1 Cloud Notes
83 pages
Chapter 2
No ratings yet
Chapter 2
54 pages
Chapter 3
No ratings yet
Chapter 3
60 pages
Cloud Computing 1
No ratings yet
Cloud Computing 1
59 pages
Cloud Computing: Theory and Practice 3rd Edition Dan C. Marinescu Digital Download
No ratings yet
Cloud Computing: Theory and Practice 3rd Edition Dan C. Marinescu Digital Download
150 pages
CC1A2
No ratings yet
CC1A2
10 pages
ECS781P 2 CloudNetworking
No ratings yet
ECS781P 2 CloudNetworking
59 pages
Module 1
No ratings yet
Module 1
83 pages
Cloud Computing Notes
No ratings yet
Cloud Computing Notes
3 pages
Mod1 1
No ratings yet
Mod1 1
22 pages
Cloud Computing: Theory and Practice 3rd Edition Dan C. Marinescu Available Any Format
No ratings yet
Cloud Computing: Theory and Practice 3rd Edition Dan C. Marinescu Available Any Format
164 pages
Cloud Computing: Theory and Practice 3rd Edition Dan C. Marinescu Online Reading
No ratings yet
Cloud Computing: Theory and Practice 3rd Edition Dan C. Marinescu Online Reading
174 pages
Cloud Computing Course Overview
No ratings yet
Cloud Computing Course Overview
33 pages
Cloud Computing Tutorial
100% (1)
Cloud Computing Tutorial
16 pages
Cloud Computing
No ratings yet
Cloud Computing
32 pages
CC - Unit1 Notes
No ratings yet
CC - Unit1 Notes
5 pages
Chapter 1
No ratings yet
Chapter 1
113 pages
Chapter 1
No ratings yet
Chapter 1
29 pages
Module 1.0
No ratings yet
Module 1.0
48 pages
Cloud Computing Ben
No ratings yet
Cloud Computing Ben
44 pages
Module1st Cloudcomputing
No ratings yet
Module1st Cloudcomputing
114 pages
Cloud Computing Introduction PDF
100% (1)
Cloud Computing Introduction PDF
28 pages
Ch1 Introduction To Cloud
No ratings yet
Ch1 Introduction To Cloud
238 pages
Cloud Computing Evolution
No ratings yet
Cloud Computing Evolution
38 pages
Mca - Sem I Cloud Computing-1
No ratings yet
Mca - Sem I Cloud Computing-1
34 pages
CSE 6145 Lecture-No.1 (Spring 2019) - Introduction
No ratings yet
CSE 6145 Lecture-No.1 (Spring 2019) - Introduction
44 pages
CSE 6145 Lecture-No.2 (Spring 2019) - Introduction
No ratings yet
CSE 6145 Lecture-No.2 (Spring 2019) - Introduction
60 pages
Cloud Computing: Key Concepts & Types
No ratings yet
Cloud Computing: Key Concepts & Types
10 pages
Cloud Computing Chapter3
100% (3)
Cloud Computing Chapter3
42 pages
A Journey Through Cloud Computing
No ratings yet
A Journey Through Cloud Computing
3 pages
Cloud Computing Overview2
No ratings yet
Cloud Computing Overview2
18 pages
Lec 1 4 Introduction To Cloud Computing
No ratings yet
Lec 1 4 Introduction To Cloud Computing
41 pages
CIM M1 - Ch-4
No ratings yet
CIM M1 - Ch-4
16 pages
FCC Notes
No ratings yet
FCC Notes
43 pages
Cloud Rajkumar
No ratings yet
Cloud Rajkumar
115 pages
Cloud Computing Seminar Report
No ratings yet
Cloud Computing Seminar Report
25 pages
Introduction To Cloud Computing L8 S1 1730885830
No ratings yet
Introduction To Cloud Computing L8 S1 1730885830
5 pages
Cloud Computing: Presentation By
No ratings yet
Cloud Computing: Presentation By
20 pages
AWS For Cloud
No ratings yet
AWS For Cloud
14 pages
Cloud Computing Questions-1
No ratings yet
Cloud Computing Questions-1
38 pages
Cloud Computing Chapter7 (UNIT 3) Modified According To Syllabus
No ratings yet
Cloud Computing Chapter7 (UNIT 3) Modified According To Syllabus
30 pages
Cloud Computing
100% (2)
Cloud Computing
29 pages
Cloud & Edge Computing Insights
No ratings yet
Cloud & Edge Computing Insights
7 pages
Cloud Unit 1
No ratings yet
Cloud Unit 1
11 pages
CADABRA NFT Marketplace Trademark
No ratings yet
CADABRA NFT Marketplace Trademark
2 pages
Log 2
No ratings yet
Log 2
7 pages
Quiz - 6
No ratings yet
Quiz - 6
3 pages
Workato Connector SDK Guide
No ratings yet
Workato Connector SDK Guide
1 page
Database Objective Type Questions
No ratings yet
Database Objective Type Questions
17 pages
LDAP and Proxy Configuration Guide
No ratings yet
LDAP and Proxy Configuration Guide
3 pages
Unit 1 - Microprocessor 8085
No ratings yet
Unit 1 - Microprocessor 8085
18 pages
Save Wizard Code Info
No ratings yet
Save Wizard Code Info
7 pages
XN120 Consolidated Manual
No ratings yet
XN120 Consolidated Manual
200 pages
Advanced Java Exam Paper 2017
No ratings yet
Advanced Java Exam Paper 2017
3 pages
International Home of Openbim: Ifc4 Poised For Wider Reach As Iso 16739 Launched
No ratings yet
International Home of Openbim: Ifc4 Poised For Wider Reach As Iso 16739 Launched
4 pages
Document Control Procedure Guide
No ratings yet
Document Control Procedure Guide
16 pages
BRKSPG 2381
No ratings yet
BRKSPG 2381
60 pages
ITU-Trends in Telecommunication Reform 2006
No ratings yet
ITU-Trends in Telecommunication Reform 2006
240 pages
Dynamic SQL with EXECUTE IMMEDIATE
No ratings yet
Dynamic SQL with EXECUTE IMMEDIATE
5 pages
UT35A/UT32A Display Parts Guide
No ratings yet
UT35A/UT32A Display Parts Guide
2 pages
Ssssss
No ratings yet
Ssssss
5 pages
MIS - Chapter 1
No ratings yet
MIS - Chapter 1
55 pages
Sd2bbc User Guide v1.2
No ratings yet
Sd2bbc User Guide v1.2
2 pages
Exams 2024 Python For Beginners
No ratings yet
Exams 2024 Python For Beginners
22 pages
Creating A Vowel Chart in Excel
No ratings yet
Creating A Vowel Chart in Excel
3 pages
Configuring Cisco Mobility Express Controller: CLI Setup Wizard
No ratings yet
Configuring Cisco Mobility Express Controller: CLI Setup Wizard
16 pages
OSAMA KHAN-Software Engineer
No ratings yet
OSAMA KHAN-Software Engineer
1 page
Introduction To Wireless Security
No ratings yet
Introduction To Wireless Security
3 pages
Securityscorecard Aravo Transforming Insights Into Cyber Resilience
No ratings yet
Securityscorecard Aravo Transforming Insights Into Cyber Resilience
12 pages
Conventional Software Management
No ratings yet
Conventional Software Management
19 pages
Deedy
No ratings yet
Deedy
9 pages
AFPX-COM5 Ethernet Communication Guide
No ratings yet
AFPX-COM5 Ethernet Communication Guide
34 pages
4rf LR Aprisa Utility v1.1 Radioenlace
No ratings yet
4rf LR Aprisa Utility v1.1 Radioenlace
2 pages
ControlNet Counters, Warnings and Cable Redundancy
No ratings yet
ControlNet Counters, Warnings and Cable Redundancy
3 pages

Cloud Computing Chapter4

Uploaded by

Cloud Computing Chapter4

Uploaded by

Chapter 4 – Cloud Computing

Applications and Paradigms

Cloud Computing: Theory and Practice.

Cloud Computing: Theory and Practice.

 Reliability - major concern; server failures expected when a large

 Cloud infrastructure exhibits latency and bandwidth fluctuations

 Performance considerations limit the amount of data logging; the

Cloud Computing: Theory and Practice.

 Case - an instance of a process description.

 State of a case at time t - defined in terms of tasks already

 Events - cause transitions between states.

 The life cycle of a workflow - creation, definition, verification, and

Case Activation Record Data

(a) Workflow (b) Program

 Desirable properties of workflows.

 Safety  nothing “bad” ever happens.

 Liveness  something “good” will eventually happen.

Cloud Computing: Theory and Practice.

Cloud Computing: Theory and Practice.

A AND XOR D A AND DIS D

Cloud Computing: Theory and Practice.

Cloud Computing: Theory and Practice.

Client Client Client Client Client Client Client Client

WRITE READ WRITE

 Messaging protocols use:

Cloud Computing: Theory and Practice.

 Messaging layer guarantees:

 Reliable delivery: if a message m is delivered to one server, it will

 Total order: if message m is delivered before message n to one

 Causal order: if message n is sent after m has been delivered by

Cloud Computing: Theory and Practice.

/a/1 /a/2 /b/1 /c/1 /c/2

Cloud Computing: Theory and Practice.

 Sequential consistency of updates - updates are applied

 Persistence of updates - once applied, an update persists until

 Reliability - the system is guaranteed to function correctly as

Cloud Computing: Theory and Practice.

 Create - add a node at a given location on the tree.

 Delete - delete a node.

 Get data - read data from a node.

 Set data - write data to a node.

 Get children - retrieve a list of the children of the node.

 Synch - wait for the data to propagate.

Cloud Computing: Theory and Practice.

 arbitrarily divisible  the workload can be partitioned into an

arbitrarily large number of smaller workloads of equal, or very close

Cloud Computing: Theory and Practice.

Input data Map phase Reduce phase

Cloud Computing: Theory and Practice.

showing the inputs: Controller

by the web crawler. (a)

The system is based on

message passing between Controller

several queues; four Launch Monitor Shutdown Billing

retrieve messages, and DB

actions Amazon SimpleDB Amazon SE2 Amazon S3

You might also like