KEMBAR78
Chapter 1-Introduction To Distributed Systems | PDF | Client–Server Model | Database Transaction
0% found this document useful (0 votes)
558 views59 pages

Chapter 1-Introduction To Distributed Systems

This document provides an introduction to distributed systems. It discusses how distributed systems became feasible after the 1980s due to cheaper computers and computer networks. A distributed system is defined as a collection of independent computers that appear as a single system to users. Distributed systems allow for resource and data sharing, availability, scalability, and performance. They have characteristics like transparency, openness, and scalability. The document outlines various aspects of distributed systems including their organization, forms of transparency, techniques for scaling, and advantages and disadvantages.

Uploaded by

Hiziki Tare
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
558 views59 pages

Chapter 1-Introduction To Distributed Systems

This document provides an introduction to distributed systems. It discusses how distributed systems became feasible after the 1980s due to cheaper computers and computer networks. A distributed system is defined as a collection of independent computers that appear as a single system to users. Distributed systems allow for resource and data sharing, availability, scalability, and performance. They have characteristics like transparency, openness, and scalability. The document outlines various aspects of distributed systems including their organization, forms of transparency, techniques for scaling, and advantages and disadvantages.

Uploaded by

Hiziki Tare
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 59

Chapter 1-Introduction to Distributed

Systems

SE3062 : Distributed Systems


Department of Software Engineering
WCU
Lecture by: Mesay A. M.(MSc)
Introduction
 Before the mid-80s, computers were
• very expensive (hundred of thousands or even millions of
dollars)
• very slow (a few thousand instructions per second)
• not connected among themselves
 After the mid-80s: two major developments
• cheap and powerful microprocessor-based computers appeared
• computer networks
• LANs at speeds ranging from 10 to 1000 Mbps
• WANs at speed ranging from 64 Kbps to gigabits/sec
 Consequence
• feasibility of using a large network of computers to work for the
same application; this is in contrast to the old centralized systems
where there was a single computer with its peripherals.
2
Definition of a Distributed System
 Distributed System
 A distributed system :is a collection of independent computers that
appears to its users as a single coherent system - computer
(Tanenbaum & Van Steen)
 This definition has two aspects:
1. hardware: autonomous machines
2. software: a single system view for the users
 Other Definitions
• A distributed system :is a system designed to support the development of
applications and services which can exploit a physical architecture consisting
of multiple, autonomous processing elements that do not share primary
memory but cooperate by sending asynchronous messages over a
communication network (Blair & Stefani)

3
Why Distributed?
 Resource and Data Sharing
• printers, databases, multimedia servers, ...
 Availability, Reliability
• the loss of some instances can be hidden
 Scalability, Extensibility
• the system grows with demand (e.g., extra servers)
 Performance
• huge power (CPU, memory, ...) available
 Inherent distribution, communication
• organizational distribution, e-mail, video

4
Characteristics of Distributed Systems
 Differences between the computers and the ways they
communicate are hidden from users
 Users and applications can interact with a distributed
system in a consistent and uniform way regardless of
location
 Distributed systems should be easy to expand and scale
 A distributed system is normally continuously available,
even if there may be partial failures
 Users and applications should not notice that parts are being
replaced or fixed, or that new parts are added to serve more
users or applications

5
Organization of a Distributed System
 To support heterogeneous computers and networks with a single-
system view, a distributed system is often organized by means of a
layer of software called middleware that extends over multiple
machines.

A distributed system organized as middleware; note that the middleware layer


extends over multiple machines 6
 Goals of a distributed system: a distributed system should
 make resources accessible(printers, computers, storage facilities,
data, files, Web pages, ...)
 reasons: economics, to collaborate and exchange information
 be transparent: hide the fact that the resources and processes are
distributed across multiple computers.
 be open
 be scalable

Transparency in a Distributed System


 a distributed system that is able to present itself to users and
applications as if it were only a single computer system is said to be
transparent

7
 Different forms of transparency in a distributed system
Transparency Description
Access Hide differences in data representation
and how a resource is accessed
Location Hide where a resource is physically located; where
is http://www.prenhall.com/index.html? (naming)
Migration Hide that a resource may move to another location
Relocation Hide that a resource may be moved to another location
while in use; e.g., mobile users using their wireless laptops
Replication Hide that a resource is replicated
Concurrency Hide that a resource may be shared by several competitive
users; a resource must be left in a consistent state
Failure Hide the failure and recovery of a resource
Persistence Hide whether a (software) resource is in memory
or on disk

8
 Openness in a Distributed System
 A distributed system should be open we need well-defined interfaces
 Interoperability
 components of different origin can communicate
 Portability
 components work on different platforms
 Another goal of an open distributed system is that it should be flexible
and extensible; easy to configure the system out of different
components; easy to add new components, replace existing ones
 An Open Distributed System is a system that offers services according
to standard rules that describe the syntax and semantics of those
services; e.g., protocols in networks
 In distributed systems, such services are often specified through
interfaces often described using an Interface Definition Language
(IDL)
 specify only syntax: the names of the functions, types of
parameters, return values, possible exceptions, ...
9
 Scalability in Distributed Systems
 A distributed system should be scalable
 size: adding more users and resources to the system
 geographically: users and resources may be far apart
 administratively: should be easy to manage even if it spans
many administrative organizations
 scalability problems: performance problems caused by limited
capacity of servers and networks

examples of scalability limitations


10
 Scaling Techniques
 how to solve scaling problems
 the problem is mainly performance, and arises as a result of limitations
in the capacity of servers and networks (for geographical scalability)
 three possible solutions: hiding communication latencies, distribution,
and replication
 Hide Communication Latencies
 try to avoid waiting for responses to remote service requests
 let the requester do other useful job
 i.e., construct requesting applications that use only asynchronous
communication instead of synchronous communication; when a
reply arrives the application is interrupted
 good for batch processing and parallel applications but not for
interactive applications
 for interactive applications, move part of the job to the client to
reduce communication; e.g. filling a form and checking the entries

11
(a) a server checking the correctness of field entries
(b) a client doing the job
 e.g. Shipping code is now supported in Web applications using Java
Applets

12
 Distribution
 e.g. DNS - Domain Name System
 divide the name space into zones
 for details, see later in Chapter 4 - Naming

an example of dividing the DNS name space into zones


13
 Replication
 replicate components across a distributed system to increase
availability and for load balancing, leading to better performance
 decided by the owner of a resource
 caching (a special form of replication) also reduces communication
latency; decided by the user
 but, caching and replication may lead to consistency problems (see
Chapter 6 - Consistency and Replication)

14
Pros and Cons of Distributed Systems
 Advantages of Distributed Systems
 Performance: Very often a collection of processors can provide higher
performance (and better price/performance ratio) than a centralized
computer.
 Distribution: many applications involve, by their nature, spatially
separated machines (banking, commercial, automotive system).
 Reliability (fault tolerance): if some of the machines crash, the system
can survive.
 Incremental growth: as requirements on processing power grow, new
machines can be added incrementally.
 Sharing of data/resources: shared data is essential to many applications
(banking, computer supported cooperative work, reservation systems);
other resources can be also shared (e.g. expensive printers).
 Communication: facilitates human-to-human communication.
15
Pros and Cons of Distributed Systems(cont.’)
 Disadvantages of Distributed Systems

 Difficulties of developing distributed software: how should


operating systems, programming languages and applications look
like?

 Networking problems: several problems are created by the


network infrastructure, which have to be dealt with: loss of
messages, overloading, ...

 Security problems: sharing generates the problem of data


security.

16
Hardware and Software Concepts in Distributed System
 Hardware Concepts of Distributed System
 different classification schemes exist
 multiprocessors - with shared memory
 multicomputers - that do not share memory
 can be homogeneous or heterogeneous

17
 a single
backbone

different basic organizations of processors and memories in distributed


systems
Parallel system?
18
 Multiprocessors - Shared Memory
 The shared memory has to be coherent - the same value written by one
processor must be read by another processor
 Performance problem for bus-based organization since the bus will be
overloaded as the number of processors increases
 The solution is to add a high-speed cache memory between the
processors and the bus to hold the most recently accessed words; may
result in incoherent memory

a bus-based multiprocessor
 bus-based multiprocessors are difficult to scale even with caches
 two possible solutions: crossbar switch and omega network
19
 Crossbar switch
 divide memory into modules and connect them to the processors with
a crossbar switch
 at every intersection, a cross point switch is opened and closed to
establish connection
 problem: expensive; with n CPUs and n memories, n2 switches are
required

20
 Omega network
 use switches with multiple input and output lines
 drawback: high latency because of several switching stages between
the CPU and memory

21
 Homogeneous --Multicomputer Systems
 Also referred to as System Area Networks (SANs)
 the nodes are mounted on a big rack and connected through a high-
performance network
 could be bus-based or switch-based

 bus-based
 shared multiaccess network such as Fast Ethernet can be used and
messages are broadcasted
 performance drops highly with more than 25-100 nodes (contention)

22
 switch-based
 messages are routed through an interconnection network
 two popular topologies: meshes (or grids) and hypercubes

Hypercube
Grid

23
 Heterogeneous --Multicomputer Systems
 most distributed systems are built on heterogeneous multicomputer
systems
 the computers could be different in processor type, memory size,
architecture, power, operating system, etc. and the interconnection
network may be highly heterogeneous as well
 the distributed system provides a software layer to hide the
heterogeneity at the hardware level; i.e., provides transparency

24
 Software Concepts of Distributed System
 OSs in relation to distributed systems
 tightly-coupled systems, referred to as distributed OSs (DOS)
 the OS tries to maintain a single, global view of the resources it
manages
 used for multiprocessors and homogeneous multicomputers
 loosely-coupled systems, referred to as network OSs (NOS)
 a collection of computers each running its own OS; they work
together to make their services and resources available to others
 used for heterogeneous multicomputers
 Middleware: to enhance the services of NOSs so that a better
support for distribution transparency is provided

25
 Summary of main issues
Description Main Goal
Tightly-coupled operating system for Hide and manage
DOS multi-processors and homogeneous hardware
multicomputers resources
Loosely-coupled operating system for Offer local
NOS heterogeneous multicomputers (LAN and services to remote
WAN) clients
Provide
Additional layer atop of NOS
Middleware distribution
implementing general-purpose services
transparency

an overview of DOSs, NOSs, and middleware

26
 Distributed Operating Systems
 Two types
 multiprocessor operating system: to manage the resources of a
multiprocessor
 multicomputer operating system: for homogeneous
multicomputers
 Uniprocessor Operating Systems
 separating applications from operating system code through a
microkernel

27
 Multiprocessor Operating Systems
 extended uniprocessor operating systems to support multiple
processors having access to a shared memory
 a protection mechanism is required for concurrent access to
guarantee consistency
 two synchronization mechanisms: semaphores and monitors
 semaphore: an integer with two atomic operations down (if s=0
then sleep; s := s-1) and up (s := s+1; wakeup a sleeping process
if any)
 monitor: a programming language construct consisting of
procedures and variables that can be accessed only by the
procedures of the monitor; only a single process at a time is
allowed to execute a procedure

28
 Multicomputer Operating Systems
 processors can not share memory; instead communication is through
message passing
 each node has its own
 kernel for managing local resources
 separate module for handling inter-processor communication

29
general structure of a multicomputer operating system
 Distributed Shared Memory Systems
 how to emulate shared memories on distributed systems to provide a
virtual shared memory
 page-based distributed shared memory (DSM) - use the virtual
memory capabilities of each individual node

pages of address space distributed among four machines


30
situation after CPU 1 references page 10

 read-only pages can be easily replicated

situation if page 10 is read only and replication is used


31
 Network Operating Systems
 possibly heterogeneous underlying hardware
 constructed from a collection of uniprocessor systems, each with its own
operating system and connected to each other in a computer network

general structure of a network operating system


32
 Services offered by network operating systems
 remote login (rlogin)
 remote file copy (rcp)
 shared file systems through file servers

two clients and a server in a network operating system

33
 Middleware
 a distributed operating system is not intended to handle a collection of
independent computers but provides transparency and ease of use

 a network operating system does not provide a view of a single


coherent system but is scalable and open

 combine the scalability and openness of network operating systems


and the transparency and ease of use of distributed operating systems

 this is achieved through a middleware, another layer of software

34
general structure of a distributed system as middleware

35
 Different middleware models exist
 treat every resource as a file; just as in UNIX
 through Remote Procedure Calls (RPCs) - calling a procedure on a
remote machine

 distributed object invocation


 (details later in Chapter 4 - Communication)
 middleware services
 access transparency: by hiding the low-level message passing
 naming: such as a URL in the WWW
 distributed transactions: by allowing multiple read and write
operations to occur atomically

 security
36
 Middleware and Openness
 In an open middleware-based distributed system, the protocols used
by each middleware layer should be the same, as well as the interfaces
they offer to applications

37
 A comparison between multiprocessor operating systems,
multicomputer operating systems, network operating systems, and
middleware-based distributed systems

Distributed OS
Network Middleware-
Item
Multiproc Multicomp OS based OS

Degree of transparency Very High High Low High


Same OS on all nodes Yes Yes No No
Number of copies of OS 1 N N N
Shared Model
Basis for communication Messages Files
memory specific
Global, Global,
Resource management Per node Per node
central distributed
Scalability No Moderately Yes Varies
Openness Closed Closed Open Open

38
The Client-Server Model

 how are processes organized in a system


 thinking in terms of clients requesting services from servers

general interaction between a client and a server

39
 Application Layering
 no clear distinction between a client and a server; for instance a
server for a distributed database may act as a client when it forwards
requests to different file servers

 three levels exist


 the user-interface level: implemented by clients and contains
all that is required by a client; usually through GUIs, but not
necessarily

 the processing level: contains the applications


 the data level: contains the programs that maintain the actual
data dealt with

40
 the general organization of an Internet search engine into three different
layers

 Client-Server Architectures
 how to physically distribute a client-server application across several
machines
 Multitiered Architectures 41
Two-tiered architecture: alternative client-server organizations
a) put only terminal-dependent part of the user interface on the client
machine and let the applications remotely control the presentation
b) put the entire user-interface software on the client side
c) move part of the application to the client, e.g. checking correctness in
filling forms
d) and e) are for powerful client machines
42
three tiered architecture: an example of a server acting as a client

43
 Modern Architectures
 vertical distribution: when the different tiers correspond directly with
the logical organization of applications
 horizontal distribution: physically split up the client or the server into
logically equivalent parts. e.g. Web server

an example of horizontal distribution of a Web service 44


TYPES OF DISTRIBUTED SYSTEMS
 Main distributed system types:
1. Distributed computing systems
– Focus on computation
– Goal: High performance computing tasks
2. Distributed information systems
– Focus on interoperability (the ability to exchange and use
information)
– Goal: Distribute information across several servers
3. Distributed pervasive systems
– Focus on mobile, embedded, communicating systems
– Goal: Spread a real-life environment with a large variety of
smart devices.

45
Distributed Computing Systems
 Cluster Computing
Essentially a group of systems connected through a LAN.
 Homogeneous o Same OS, near-identical hardware
 Single managing node
 Tightly coupled systems
 Centralized job management & scheduling system

46
Distributed Computing Systems

 Grid Computing
Lots of nodes (including clusters across multiple subnets) from
everywhere.
 Heterogeneous
 Diversity and dynamism (it can handle nodes dropping in and out at any
point of time)
 Dispersed across several organizations
 Can easily span a wide-area network
 To allow for collaborations, grids generally use virtual organizations
(grouping of users that will allow for authorization on resource
allocation).
 Loosely coupled (decentralization)
 Distributed job management & scheduling

47
Distributed Computing Systems

Example of Grid Computing

48
Distributed Computing Systems
 Cloud Computing
Web-based tools or applications that users can access and use through a
web browser as if it were a program installed locally on their own
computer.
 Internet-based computing
 offers dynamically scalable and virtualized resources that make up
services for users to use over the internet
 The only thing the user's computer needs to be able to run is the cloud
computing system's interface software

49
Distributed Computing Systems

Example of Cloud Computing


50
Distributed Computing Systems
 Distributed Information Systems
The vast amount of distributed systems in use today is in the form
of traditional information systems.
Example: Transaction processing systems
BEGIN TRANSACTION(server, transaction);
READ(transaction, file-1, data);
WRITE(transaction, file-2, data);
newData := MODIFIED(data);
IF WRONG(newData) THEN
ABORT TRANSACTION(transaction);
ELSE
WRITE(transaction, file-2, newData);
END TRANSACTION(transaction);
END IF;

51
Distributed Computing Systems

 Distributed Information Systems


Note
• All READ and WRITE operations are executed, i.e. their
effects are made permanent at the execution of END
TRANSACTION.
• Transactions form an atomic operation.

52
Distributed Computing Systems-Distributed Information Systems
 Transaction processing systems
A transaction is a collection of operations on the state of an object (database,
object composition, etc.) that satisfies the following properties (ACID):
 Atomicity: All operations either succeed, or all of them fail.
- When the transaction fails, the state of the object will remain unaffected by
the transaction.
 Consistency: A transaction establishes a valid state transition.
- This does not exclude the possibility of invalid,
intermediate states during the transaction’s execution.
 Isolation: Concurrent transactions do not interfere with each other.
- It appears to each transaction T that other transactions occur either
before T, or after T, but never both.
 Durability: After the execution of a transaction, its effects are
made permanent:
- Changes to the state survive failures.
53
Distributed Computing Systems
 Distributed Pervasive Systems
A next-generation of distributed systems emerging in which the nodes
are small, wireless, battery-powered, mobile (e.g. PDAs, smart phones,
wireless surveillance cameras, portable ECG monitors, etc.), and often
embedded as part of a larger system.
 Some requirements:
 Contextual change: The system is part of an environment in
which changes should be immediately accounted for.
 Ad hoc composition: Each node may be used in a very different ways by different
users.
- Requires ease-of-configuration.
 Sharing is the default: Nodes come and go, providing sharable services and
information.
- Calls again for simplicity.

54
Distributed Computing Systems
Distributed Pervasive Systems: Examples
Electronic Health Systems
 Devices are physically close to a person
 Where and how should monitored data be stored?
 How can we prevent loss of crucial data?
 What infrastructure is needed to generate and propagate alerts?
 How can security be enforced?
 How can physicians provide online feedback?

55
Distributed Pervasive Systems: Examples
Electronic Health Systems

56
Distributed Computing Systems
 Sensor Networks
 Consists of spatially distributed autonomous sensors to
cooperatively monitor physical or environmental conditions,
such as temperature, sound, vibration, pressure, motion or
pollutants, etc.
 The nodes to which sensors are attached are:
• Many (10s-1000s)
• Simple (i.e., hardly any memory,CPU power, or
communication
facilities)
• Often battery-powered (or even battery-less)

57
Distributed Pervasive Systems: Examples
Sensor Networks

58
Thank You!!!

59

You might also like