BCAR-102 (Operating System & Software Installation)
BCAR-102 (Operating System & Software Installation)
BABASAHEB AMBEDKAR
OPEN UNIVERSITY
BCA
BACHELOR OF COMPUTER APPLICATION
BCAR- 102
Operating System & Software Installation
OPERATING SYSTEM AND
SOFTWARE INSTALLATION
ISBN 978-81-943679-7-0
Edition : 2020
Acknowledgment
We sincerely hope this book will help you in every way you
expect.
Unit 15 Protection
Block Objectives :
The objective of this block is to define an Operating System and principles
and objectives of the operating system with the corresponding aims they have.
To describe the evolution stages of operating systems and general categories of
OS, different software structures of operating systems, services provided by the
operating systems, objetives of Operating System, evolution of Windows OS,
features of Windows OS and discuss new features of windows 10
Block Structure :
Unit 1 : Introduction to Operating System
Unit 2 : Structure and Services of Operating System
Unit 3 : Windows Operating System
Unit
INTRODUCTION TO
01 OPERATING SYSTEM
UNIT STRUCTURE
1.0 Learning Objectives
1.1 Introduction
1.2 Definition of Operating System
1.3 Evolution of Operating Systems
1.3.1 Wiring–up Plug–boards
1.3.2 Serial Processing
1.3.3 Batch Processing
1.3.4 Spooling Batch Processing
1.3.5 Multi–Programming
1.3.6 Time Sharing
1.3.7 Real Time Processing
1.3.8 Network Processing
1.3.9 Distributed Processing
1.4 General Categories of Operating System
1.5 Let Us Sum Up
1.6 Check Your Progress : Possible Answers
1.7 Assignment
1.8 Activity
1.9 Case Study
1.10 Further Reading
1.1 Introduction :
An operating system (OS) is a program that manages the computer
hardware. It also provides a basis for application programs and acts as an
intermediary between the computer user and the computer hardware. In this
unit we provide a general overview of the major components of an operating
system.
This unit introduces definition, evolution, structure and services of operating
system. The operating system is a system software. This unit addresses the
following questions :
1
Operating System and • What are operating systems functionalities ?
Software Installation
• What are the activities performed by the modern operating systems ?
• What kind of software structure does it have ?
2
Evolution of operating systems over the past decades has shown number Introduction to
of progresses from several perspectives. Starting from the point where machine Operating System
does not have operating systems to a distributed processing capability. The
foundation for the modern operating system was built over different parameters.
The following shows the major evolution of Operating Systems :
1.3.1 Wiring–Up Plug–Boards :
Low level programming language called machine language was used to
write computer instructions through wiring up plug–boards Plug–boards control
the basic functions of the computer. No Operating Systems software were
introduced. No programming languages to develop applications. No operators
were required. Complex programming and machine underutilization.
1.3.2 Serial Processing :
The technique is to running a single Job. Following are the working of
serial processing :
• Developer creates his/her program and punches it on cards
• He/she submits the card deck to an operator in the input room
• he/she reserves machine time on a sign–up sheet
• Operator set–up the job as scheduled
• Operator caries over the returned output to the output room
• The developer will collect the output later
No Operating Systems software were introduced. No programming
languages to develop applications.
Features :
• Users get access in series
• Program writing was improved Disadvantages
• Wasted time due to scheduling and setup
• Wasted time while operators walk around the machine room
• Large machine/processor idle time
1.3.3 Batch Processing :
The users of batch operating system do not interact with the computer
directly. Each user prepares his job on an off–line device like punch cards
and submits it to the computer operator. To speed up processing, jobs with
similar needs are batched together and run as a group. Thus, the programmers
left their programs with the operator. The operator then sorts programs into
batches with similar requirements. The technique is to running a batch of Job.
Advantages :
• Once the data process is started, the computer can be left running without
supervision.
• Batch processing allows an organization to increase efficiency because
a large amount of transactionscan be combined into a batch rather than
processing them each individually.
Disadvantages :
• It is very difficult to maintain the priority between the batches.
3
Operating System and • There is no direct interaction of user with computer. Lack of interaction
Software Installation between the user and job.
• CPU is often idle, because the speeds of the mechanical I/O devices is
slower than CPU.
• Difficult to provide the desired priority.
• With batch processing there is a time delay before the work is processed
and returned.
1.3.4 Spooling Batch Processing :
Technique :
• Adds up Spooling technique to simple batch processing
• Spooling – Simultaneous Peripheral Operation Online
• It is the ability to read jobs/print job outputs while the processor executes
jobs
• Read jobs – from cards to disk
• Print job outputs – from disk to printer
• Used in input and output operations
With spooling, every time a currently executing task is completed, a new
task is shipped from storage to the now–empty panel and be executed by the
OS. Spooling is the first attempt of multiprogramming
Advantages :
• Avoids CPU idle time between batches of jobs
• Improve turn–around time : Output of a job was available as soon as
the job completed, rather than only after all jobs in the current cycle
were finished.
Disadvantages :
• Large turn–around time
• Large CPU idle time on heavily I/O bound jobs
1.3.5 Multi–Programming
In a multiprogramming system there are one or more programs loaded
in main memory which are ready to execute. Only one program at a time is
able to get the CPU for executing its instructions (i.e., there is at most one
process running on the system) while all the others are waiting their turn.
The main idea of multiprogramming is to maximize the use of CPU time.
Indeed, suppose the currently running process is performing an I/O task (which,
by definition, does not need the CPU to be accomplished). Then, the OS may
interrupt that process and give the control to one of the other in–main–memory
program that are ready to execute (i.e. process context switching). In this way,
no CPU time is wasted by the system waiting for the I/O task to be completed,
and a running process keeps executing until either it voluntarilyreleases the
CPU or when it blocks for an I/O operation. Therefore, the ultimate goal of
multi programming is to keep the CPU busy as long as there are processes
ready to execute.
Note that in order for such a system to function properly, the OS must
be able to load multiple programs into separate areas of the main memory and
4
provide the required protection to avoid the chance of one process being Introduction to
modified by another one. Other problems that need to be addressed when having Operating System
multiple programs in memory is fragmentation as programs enter or leave the
main memory. Another issue that needs to be handled as well is that large
programs may not fit at once in memory which can be solved by using
pagination and virtual memory.
Advantages :
• Increase CPU utilization and reduce CPU idle time.
• It decreases total read time needed to execute a job as the jobs are in
main memory. Disadvantages of Multi Programming Operating System
Disadvantages :
• Long response time : the elapsed time to return back a result for a given
job was often several hours
• Poor interactivity between programmer and his program
1.3.6 Time Sharing :
Multiple jobs are executed by the CPU by switching between them, but
the switches occur so frequently. Thus, the user can receives an immediate
response. For example, in a transaction processing, processor execute each user
program in a short burst or quantum of computation. That is if n users are
present, each user can get time quantum. When the user submits the command,
the response time is in few seconds at most.
Operating system uses CPU scheduling and multiprogramming to provide
each user with a small portion of atime. Computer systems that were designed
primarily as batch systems have been modified to time–sharing systems.
Advantages :
• More than one user can execute their task simultaneously.
• Avoids duplication of software
• CPU Idle time is reduced and better utilization of resources.
• Provide advantage of quick response.
Disadvantages :
• Question of securing the security and integrity of user's data and programs.
• Since multiple processes are managed simultaneously, so it requires an
adequate management of main memory
• Problem of reliability.
1.3.7 Real–Time Processing
Real time system is defined as a data processing system in which the
time interval required to process and respond to inputs is so small that it controls
the environment. Real time processing is always on line whereas on line system
need not be real time. The time taken by the system to respond to an input
and display of required updated information is termed as response time. So,
in this method response time is very less as compared to the online processing.
Real–time systems are used when there are rigid time requirements on
the operation of a processor or the flow of data and real–time systems can
be used as a control device in a dedicated application. Real–timeoperating
5
Operating System and system has well–defined, fixed time constraints otherwise system will fail. For
Software Installation example, Scientific experiments, medical imaging systems, industrial control
systems, weapon systems, robots, and home–applicance controllers, Air traffic
control system etc. There are two types of real–time operating systems.
Hard Real–Time Systems : Hard real–time systems guarantee that
critical tasks complete on time. In hard real–time systems secondary storage
is limited or missing with data stored in ROM. In these systems virtual memory
is almost never found.
Soft Real–Time Systems : Soft real time systems are less restrictive.
Critical real–time task gets priority over other tasks and retains the priority
until it completes. Soft real–time systems have limited utility than hard real–
time systems. For example, Multimedia, virtual reality, Advanced Scientific
Projects like undersea exploration and planetary rovers etc.
Advantages :
• Better task scheduling as compared to manual process and time deadlines
achievement is guaranteed in most of the cases.
• Accelerate the process by automanaging the system resources as in the
case of Autopilot airplanes and railway e–ticket booking system.
Disadvantages :
• If time dead lines are missed it may result in severe disastrous situation.
• Complex and need additional kernel, Memory and other resources are
required.
• More vulnerable to security breaches like virus and unauthorized access.
1.3.8 Networked Processing :
Comprise of several computers that are interconnected to one another.
Each of these networked devices has their own local users and executes their
own local OS not basically different from single computer OSs. Users also
know about the presence of the several computers. Additional features needed
by network operating systems are Network interface controller, NIC, A low–
level software to drive it and software for remote login & remote file access.
1.3.9 Distributed Processing :
A distributed system is a collection of physically separate, possibly
heterogeneous computer systems that are networked to provide the users with
access to the various resources that the system maintains. Access to a shared
resource increases computation speed, functionality, data availability, and
reliability. Some operating systems generalize network access as a form of file
access, with the details of networking contained in the network interface's device
driver. Others make users specifically invoke network functions. Generally,
systems contain a mix of the two modes–for example FTP and NFS. The
protocols that create a distributed system can greatly affect that system's utility
and popularity.
Runs on a Multi–Computer System : set of computers, each having
its own memory, storage devices and other I/O modules. This is useful for
distributing the task between these different computers. Existence of multiple
computers is transparent to the user : It appears as a uniprocessor system. It
differs in critical ways from uniprocessor OSs. Examples of distributed operating
systems include LOCUS, MICROS, IRIX, Solaris, Mac/OS, and OSF/1.
6
Advantages : Introduction to
Operating System
• With resource sharing facility user at one site may be able to use the
resources available at another.
• Speedup the exchange of data with one another via electronic mail.
• If one site fails in a distributed system, the remaining sites can potentially
continue operating.
• Better service to the customers.
• Reduction of the load on the host computer.
• Reduction of delays in data processing
Disadvantages :
• Security problem due to sharing
• Some messages can be lost in the network system
• Bandwidth is another problem if there is large data then all network wires
to be replaced which tends to become expensive
• Overloading is another problem in distributed operating systems
• If there is a database connected on local system and many users accessing
that database through remote or distributed way then performance become
slow
• The databases in network operating is difficult to administrate then single
user system
Check Your Progress – 2 :
1. Monitoring utilities were put to memory when required in simple batch
processing systems (True/False)
2. In which type of system, the output of a program is sent to the disk
and is printed only when the job completes ?
(a) Multi–programmed batch systems
(b) Batch systems with spooling
(c) Time sharing systems
(d) Multi–printing systems
3. A system that supports multiple processes per user is called a
(a) Multi–user system (b) Multi–programming system
(c) Multi–tasking system (d) None of the above
4. operating system allows a computer to work independently
as well as provides a means for sharing resources (e.g. files, printers,
etc) to other computers connected to it.
(a) Real–time operating system (b) Symmetric operating system
(c) Local area network (d) Network operating system
8
1.5 Let Us Sum Up : Introduction to
Operating System
An operating system is software that manages the computer hardware
as well as providing an environment for application programs to run. Perhaps
the most visible aspect of an operating system is the interface to the computer
system it provides to the human user.
There are several different strategies for designing a computer system.
Uniprocessor systems have only a single processor while multiprocessor systems
contain two or more processors that share physical memory and peripheral
devices. The most common multiprocessor design is symmetric multiprocessing
(or SMP), where all processors are considered peers and run independently
of one another. Clustered systems are a specialized form of multiprocessor
systems and consist of multiple computer systems connected by a local area
network.
To best utilize the CPU, modem operating systems employ
multiprogramming, which allows several jobs to be in memory at the same
time, thus ensuring the CPU always has a job to execute. Timesharing systems
are an extension of multiprogramming whereby CPU scheduling algorithms
rapidly switch between jobs, thus providing the illusion each job is running
concurrently.
The operating system must ensure correct operation of the computer
system. To prevent user programs from interfering with the proper operation
of the system, the hardware has two modes : user mode and kernel mode.
Various instructions (such as I/0 instructions and halt instructions) are privileged
and can be executed only in kernel mode. The memory in which the operating
system resides must also be protected from modification by the user. A timer
prevents infinite loops. These facilities (dual mode, privileged instructions,
memory protection, and timer interrupt) are basic building blocks used by
operating systems to achieve correct operation.
In this unit we have lean about Definition of Operating System, Evolution
of Operating Systems, operating systems functionalities, activities performed
by the modern operating systems.
1.7 Assignments :
• What is an operating system ?
• Write detailed note on evolution of operating system
• Write note on Simple Batch Processing
• Explain network operating system
• What is real time operating system ?
9
Operating System and 1.8 Activity :
Software Installation
• Write Disadvantages of multiprogramming and timesharing
• Write difference between Network OS and Distributed OS
10
STRUCTURE AND SERVICES
Unit
02 OF OPERATING SYSTEM
UNIT STRUCTURE
2.0 Learning Objectives
2.1 Introduction
2.2 Structure of Operating System
1.3.1 Monolithic OS Structure
1.3.2 Layered OS Structure
1.3.3 Virtual Machines
1.3.4 Exokernels
1.3.5 Client–Server Model
1.3.6 Microkernel
2.3 Services of Operating System
2.4.1 User Interface
2.4.2 System and Utility Function Calls
2.4.3 Process Control
2.4.4 File Management
2.4.5 Communication Management
2.4.6 Information Maintenance
2.4.7 Device Management
2.4.8 Program Execution
2.4.9 Input/Output Operations
2.4.10 File Management
2.4.11 Error Detection
2.4.12 Communication
2.4.13 Resource Allocation
2.4.14 Accounting
2.4.15 System Protection
2.4 Objectives of Operating System
2.4.1 System View as Resource Manager
2.4.2 Users View as Virtual Machine
2.4.3 Ability to Evolve as Software
2.5 Let Us Sum Up
2.6 Check Your Progress : Possible Answers
2.7 Assignment
2.8 Activity
2.9 Case Study
2.10 Further Reading
11
Operating System and 2.0 Learning Objectives :
Software Installation
After studying this unit learner should be able to :
• Describe the different software structures of operating systems
• Determine the services provided by the operating systems
• List the Objetives of Operating System
2.1 Introduction :
This unit providing the services and structure of operating systems as
a whole. Focus on terminologies used, functionalities of an operating systems
have and types of operating systems.
The very foundations of operating system can be viewed from different
perspectives. Mainly, these perspectives focus on user and system level.
Additionally, we can consider the operating system's nature as a software that
requires to evolve through time depending on change on hardware as well as
human needs. The principles that govern operating systems can be viewed from
the following :
• System view as resource manager
• Users view as virtual machine
• Ability to evolve as software
12
Available Hardware : Structure and Services
of Operating System
• Low–level functions of an OS often require hardware support such as
protection mechanisms
• Can limit allowed commands
2.2.1 Monolithic OS Structure :
• It is also referred to as "The Big Mess" due to an absence of any kind
of organization
• It consists of a bundle of procedures calling one another whenever they
need to
• Each procedure has a well–defined interface in order to allow the
programmer to use it in the development of other procedures.
• The object program of the operating system will be constructed by first
compiling each procedure separately and then binding them all together
into one object using the system linker
This structure is basically one big lump with no well separated functionality
levels, no clearly defined internal interfaces and no level of encapsulation as
every procedure is visible to all others. Example : MS–DOS.
The monolithic structure includes a little structure. It has clearly defined
interfaces in that services of the Operating System are demanded by placing
the required arguments in a well– defined area and then executing a trap
instruction. The OS will then get the parameters and determines which service
is being requested and indexes into a table that contains pointers to the
procedures carrying out the services. In this case, the basic structure of the
operating system can be summarized as :
• Main program invoking required system call procedure
• Collection of service modules to perform the system calls
• Collection of helpful programs assisting the service procedures
But the compiled code is still one big executable and changes in some
part requires re–compiling the whole package. Example : most Linux Kernels.
2.2.2 Layered OS Structure :
• Hierarchical organization of an OS where each layer implements a
function based on the layer beneath.
• The lower most level is the hardware and the uppermost level is the
user interface
• High degree of encapsulation
• Easily maintainable and extendable
• With modularity, layers are arranged in such a way that each upper layer
uses services of only the lower level layers
• This was the first OS designed with such structure by Dijkstra and his
students
• It is a batch operating system with 6 layers
13
Operating System and Layer Name Function
Software Installation
5 Operator Interface for the system to operate
4 User programs Enable users to perform an action
3 I/O Management Controls I/O devices and buffers information
stream to and from them
2 Operator console Manages communication between processes
and operator console
1 Memory management Allocate space for processes
0 CPU scheduling Processor allocation and Switching between
processes when an interrupt occurs
2.2.3 Virtual Machines :
• This approach provides several Virtual Machines identical to the underlying
bare hardware without any additional feature such as a file system.
• Considers the hardware and the operating system kernel together as a
hardware
• Is used to separate the multiprogramming and extended machine functions
found in timesharing operating systems
• The VM Controller that runs on the plain machine manages the
multiprogramming and provides multiple virtual machines to the upper
layers
• Virtual machines are created by sharing the physical machine's resources
• Exact copy of actual computer is given to every user process.
• The virtual machines can run different operating systems
• As each virtual machine is totally insulated from all other machines, there
is a complete protection and thus high encapsulation however, resource
sharing is difficult among the VMs
• VM in general, is difficult to implement as it requires an effort to create
exact copy of the underlying machine
2.2.4 Exokernels :
• In such OS design, users are presented with a replica of the actual device
but with only a portion of the device's resources
• The exokernel allocates resources to the available virtual machines and
controls any attempt of accessing others resources.
• It has a main advantage of avoiding address mapping
2.2.5 Client–Server Model :
• Modern operating systems tend to move codes, as much as possible from
the operating system up into upper levels to leave a minimal kernel or
microkernel
• A so–called message passing is used to facilitate communication among
user modules
• A user process called client process sends its request for a service to
a server process
14
• The server process responds to the request by performing the required Structure and Services
service and sending back the answer to the client process of Operating System
• This structure splits the operating system into smaller and manageable
parts each carrying out only one part of the system
• Easily maintainable and extendable as all server processes run in user
mode
• Is more reliable and secure OS structure
• Easy to port the OS to new architectures
• It also has an advantage of being adaptable for distributed systems
• However, there is a performance overhead of user space to kernel space
communication
2.2.6 Microkernel :
• A microkernel OS is structured as a tiny kernel that provides the minimal
services used by cooperating processes. These processes later provide
the higher–level functionalities of operating system.
• The microkernel itself do not have file systems and several other
functionalities which are required to be offered by an operating system.
• Microkernel provides a mechanism to implement the operating system.
Check Your Progress – 1 :
1. MS–DOS has type OS structure ?
(a) Monolithic (b) Client–Server
(c) Virtual Machines (d) Microkernel
2. In OS Structure, users are presented with a replica of the actual
device but with only a portion of the device's resources.
(a) Monolithic (b) Layered (c) Exokernels (d) Microkernel
3. is adaptable for distributed systems
(a) Monolithic (b) Client–Server
(c) Virtual Machines (d) Microkernel
4. The microkernel itself have file systems and several other functionalities
which are required to be offered by an operating system. (True/False)
5. Virtual machines are created by sharing the physical machine's resources
(True/False)
6. In Layered OS structure, the upper most level is the hardware and the
lower most level is the user interface (True/False)
15
Operating System and 2.3.1 User Interface :
Software Installation
Users communicate with a computer system to use its resources through
an interface provided by an operating system
The UI provided can be :
1. Command Line Interface (CLI) : allows direct command entry by
fetching commands from user and executing it. it provides text–based
interaction for the user where by users are forced to remember the label
of each command. The commands range from a built–in to simple
program names.
2. Graphical User Interfaces (GUI) : easy to use, user–oriented interfaces
where users interact with the system through icons. These are user
friendly desktop metaphor interfaces. Users pass their commands through
keyboard, mouse and monitor.
2.3.2 System and Utility Function Calls :
These are programming interfaces to the services provided by the operating
system. Programs access these calls through an Application Program Interface
(API)
They are written and used with a high–level programming language like
C++. When implemented, each system call is associated with a number that
is used for indexing.
The API invokes the required system call in the OS kernel and sends
back the status of the system call along with any return value.
The API hides any detail of the OS from the caller. System calls can
be about :
2.3.3 Process Control :
Creating, terminating and waiting for a process
2.3.4 File Management :
Create, read, write and delete files
2.3.5 Communication Management :
Creating connections, sending message, receiving message, sharing memory
2.3.6 Information Maintenance :
Getting/setting time, system data, file and process attributes
2.3.7 Device Management :
Access, read, write devices
2.3.8 Program Execution :
The OS is capable of loading a program onto main memory and run
the program. It ends the execution of a program normally, if no errors are
found, or terminates abnormally otherwise.
2.3.9 Input/Output Operations :
Running programs mostly require input and output tasks.
The input/output can be either from/to file or I/O devices
It is the task of the OS to facilitate such kinds of requests coming from
programs
16
2.3.10 File Management : Structure and Services
of Operating System
The OS handles reading and writing files and directories
It also manages creation, deletion and searching of items in the files
and directories Access Permission of files and directories is also the service
of an OS.
2.3.11 Error Detection :
Errors might occur in a system in different areas including CPU, memory,
I/O devices or even user programs
The OS ensures correct and consistent operation of the system despite
the errors occurred by taking the appropriate measurement on the detected
errors. To do so, the OS requires to continuously inspect the system and identify
possible errors.
2.3.12 Communication :
Processes found on the same computer or over a networked computer
might need to communicate among themselves
This communication can be carried out either through message passing
or shared memory concept
The OS facilitates such communication among processes through
implementing different communication management mechanisms
2.3.13 Resource Allocation :
A computer system consists of several kinds of assets
Some of these resources such as the CPU and main memory need unique
assignment code while others like the I/O devices might need a general request
and release code to be accessed
It is the job of the OS to identify the type of resource required and
run the appropriate code to access the resource Especially, in a multi–tasking
and multi–user environment, the OS should be able to assign resources for
each of the concurrently running jobs.
2.3.14 Accounting :
The OS also keeps track of the number of users using the system,
It also maintains which users are using which kinds of resources and
their consumption rate of the resource as well
2.3.15 System Protection :
The OS controls all access to a systems' resource
It also enforces authentication and different access control mechanisms
to keep the system free of undesired access attempts.
Check Your Progress – 2 :
1. The UI provided by OS can be
(a) CLI (b) GUI
(c) Either (a) or (b) (d) None of these
2. System calls provided by the OS can provide management of
(a) Process Control (b) File
(c) Device (d) All of these
17
Operating System and 3. This communication can be carried out through
Software Installation
(a) Message Passing (b) Shared Memory Concept
(c) Either (a) or (b) (d) None of these
4. The API hides any detail of the OS from the caller (True / False)
5. The OS handles reading and writing files and directories (True/False)
6. The OS does not require to continuously inspect the system to identify
possible errors (True/False)
18
2.4.3 Ability to Evolve as Software : Structure and Services
of Operating System
The operating system is a system software that bridges hardware with
the rest of the computer components.
The operating system is the software that communicates with the software,
i.e., device drivers, of the hardware which are the products of different vendors.
If there are any changes on the hardware peripherals, then the vendors change
the device drives of the hardware. Thus, these changes need to be incorporated
in the operating system.
The operating system, as discussed in unit 1, is responsible services to
the user. If the user is requiring additional services, the operating system has
to deliver those services, too.
As a software, the operating system might have a bug. Later on, usage
of the operating system will result in error on the operating system. These
bugs must be fixed.
The addition of hardware resources, need for new services, and maintenance
of bugs has to be done on the next version of the operating system. Every
version of the operating system has at least one of the following features :
• Capability to recognize a new hardware
• New service(s)
• Fixed bugs
Check Your Progress – 3 :
1. A computer system has a collection of components for data
(a) Moving (b) Processing (c) Storing (d) All of these
2. An operating system is a system designed to provide
(a) Easy–to–work–in environment
(b) Manage all the possible needs of its users regardless of the resource
scarcity
(c) Both (a) and (b)
(d) None of these
3. As a software, the operating system might have a bug (True / False)
4. From the user point of view, an operating system can be considered as
resource manager and allocator (True/False)
5. If the user is requiring additional services, the operating system has to
deliver those services, too. (True/False)
6. The OS should be designed to guarantee efficient usage and sharing of
all available resources by everyone fairly. (True/False)
2.7 Assignments :
• What are the major issues considered when the internal structure of
operating system is designed ?
• What is the worst–case scenario of the monolithic structure of operating
system ?
• What is a microkernel ?
2.8 Activity :
• List at least four services of the operating system.
20
Unit
WINDOWS OPERATING
03 SYSTEM
UNIT STRUCTURE
3.0 Learning Objectives
3.1 Introduction
3.2 History of Windows OS
3.3 Microsoft OS for Servers and Mobile Devices
3.4 New Features of Windows 10
3.5 Let's Sum up
3.6 Check Your Progress : Possible Answers
3.7 Assignments
3.8 Activities
3.9 Case Study
3.10 Further Reading
3.1 Introduction :
The Windows operating system (Windows OS) are more formally called
Microsoft Windows and is actually a family of operating systems for personal
computers. Windows dominates the personal computer world, running, by some
estimates, more than 90 percent of all personal computers – the remainder
running Linux and Mac operating systems.
Windows provides a graphical user interface (GUI), virtual memory
management, multitasking, and support for many peripheral devices. In addition
to Windows operating systems for personal computers, Microsoft also offers
operating systems for servers and mobile devices. In this unit we will learn
about evolution of today's Windows Operating System and features of Windows
Operating System.
23
Operating System and Windows 10 :
Software Installation
Windows 10 is Microsoft's Windows successor to Windows 8. Windows
10 debuted on July 29, 2015, following a "technical preview" beta release of
the new operating system that arrived in Fall 2014 and a "consumer preview"
beta in early 2015. Microsoft claims Windows 10 features fast start up and
resume, built–in security and the return of the Start Menu in an expanded form.
This version of Windows will also feature Microsoft Edge, Microsoft's new
browser. Any qualified device (such as tablets, PCs, smartphones and Xbox
consoles) can upgrade to Windows 10, including those with pirated copies of
Windows.
24
Windows Phone (November 2010) : Windows Operating
System
Windows OS – Windows Phone
A mobile operating system for smartphones and mobile devices that serves
as the successor to Microsoft's initial mobile OS platform system, Windows
Mobile. Unlike Windows Mobile, Windows Phone 7 (also referred to as
WinPhone7) is targeted more to the consumer market than the enterprise market,
and it replaces the more traditional Microsoft Windows OS look and feel with
a new "Metro" design system user interface.
Windows Phone 7 features a multi–tab Internet Explorer Mobile Web
browser that uses a rendering engine based on Internet Explorer 9 as well
Microsoft Office Mobile, a version of Microsoft Office that's tailored for mobile
devices. Its successors include Windows Phone 8 and Windows 10 Mobile.
3.7 Assignment :
• Write detailed note on history of Windows OS
• Explain new features of Windows 10 Operatiing System
3.8 Activities :
• Explore new features of windows 10 operating system
26
Windows Operating
BLOCK SUMMARY :
System
• An operating system is software that manages the computer hardware as
well as providing an environment for application programs to run
• To best utilize the CPU, modem operating systems employ
multiprogramming, which allows several jobs to be in memory at the same
time, thus ensuring the CPU always has a job to execute.
• Timesharing systems are an extension of multiprogramming whereby CPU
scheduling algorithms rapidly switch between jobs, thus providing the
illusion each job is running concurrently.
• The operating system must ensure correct operation of the computer
system.
• The main five models of operating systems are : monolithic structure,
layered structure, virtual machines, exokernels and client server structure.
• Operating system provides number of services like user interface, program
execution, system protection, accounting, input/output management, file
management, error handling, etc.
BLOCK ASSIGNMENT :
Short Questions
• What is operating system ?
• What is kernel mode ?
• What is Multi–programming ?
• What is time sharing OS ?
• What is network operating system ?
• What is windows operating system ?
• What is Exokernels architecture ?
Long Question
• Write detailed note on General Categories of Operating System
• Explain in details about Services of Operating System
• Describe objectives of operating system
• Write detailed note on windows operating system with its features
27
Dr. Babasaheb Ambedkar BCAR-102/
Open University Ahmedabad DCAR-102
Block Objectives :
The most central notion in any operating system is a process. The objective
of this block is to define process and components of process and how process
is managed, it also describes process life cycle and different process states. Multi–
threading brings many advantages to processes. This block also discuss thread
and types of thread implementation, concurrency problem, Interprocess
Communication mechanism and mutual exclusion.
Scheduling is a means through which fair distribution of execution time
is granted to processes with similar state for a multiprogramming system. The
objective of this block is to define scheduling, schedular and types of schedular,
it describe process scheduling policies and types of scheduling algorithms and
important terminology.
Deadlock is one of the problems of concurrency. In a multiprogramming
environment, processes compute for resources, The objective of this block is to
define dead lock and describe necessary condition for deadlock, a Resource–
Allocation (R–A) Graph and explains Deadlock Handling Meachnism
Block Structure :
Unit 4 : Process Management
Unit 5 : Threads and Concurrency
Unit 6 : Process Scheduling
Unit 7 : Process Synchronization and Deadlocks
Unit
04 PROCESS MANAGEMENT
UNIT STRUCTURE
4.0 Learning Objectives
4.1 Introduction
4.2 Overview of Process
4.3 Components of Process
4.3.1 Address Space
4.3.2 Processor State
4.3.3 OS Resources
4.4 Process States
4.5 Types of Process
4.6 Let's Sum up
4.7 Check Your Progress : Possible Answers
4.8 Assignments
4.9 Activity
4.10 Case study
4.11 Further Reading
4.1 Introduction :
The most central notion in any operating system is a process. A process
is a program currently running and that needs different kinds of resources to
be allocated. A process also interacts with other processes more frequently.
Thus, an operating system is responsible to manage all the requirements of
processes and also coordinate communication among different processes. This
unit covers in detail processes, types of processes, the different states of a
process in process life cycle.
28
Early computer systems were capable of running only one program at Process Management
a time and the program had full control of the systems' resources and thus
no structuring is required. But today's computer systems are designed to support
multiprogramming, running more than one program concurrently, besides the
operating system they have.
This arrangement requires the system to know which is which so that
going back and forth between the programs is possible and thus enforces the
need for structuring. Process is an instance of a program that is being executed.
It is a program that has been loaded from disk onto memory and is running.
While a program is just a passive set of commands, process is the actual
execution of these instructions and one program can be partitioned to more
than one processes. It is the basic unit of execution in an operating system.
A process consumes some amount of assets like CPU time, main memory, and
I/O devices to carry out its operations. It is the duty of the operating system
to manage all these requirements of processes and bring them to termination.
A process is the basic execution unit in an operating system with unique
number assigned as an identifier (PID). Though the CPU is running only one
program at a time, it may also work on multiple programs by switching and
running them for tens or hundreds of milliseconds. This in turn requires more
control and partitioning of programs which resulted in a process. Thus, the
operating system will have many processes running at the same time all
requiring resources.
29
Operating System and
Software Installation
30
Process Management
32
• Waiting : sleeping or blocked state where the process is waiting for Process Management
an event. Even if the CPU is free to handle requests, a process at this
state will not use it as it is waiting for some external events to occur.
Four possible moves can be made by a process between these states as
it executes as shown in figure 1.3 below.
33
Operating System and Process state
Software Installation
Process number, PID
Program counter
Registers
Memory limit
List of open files
Fig. 4.5 : PCB Entries Per Process
Process state : indicate the current status of the process as new, running,
waiting, ready.
Process number : a unique positive integer value associated with each
process during creation and used as an identifier. The getpid() command is
used to return the current process's number, PID. In order to read the parent's
id, we can use the getppid () command.
Program counter : a value indicating the next instruction to be executed
for the process
Registers : including accumulators, stack pointers, and the like maintains
the current value produced when the process runs so that the process resumes
from where it left off in case of interrupts.
Memory limit : holds the value of the base and limit registers, page
table size etc.
4.8 Assignments :
• What distinguishes process from progam ?
• What three basic reasons cause process creation ?
• How does the system differentiate between processes ?
• What are the different possible states that a process can be in ?
• When does a process move from running to waiting/blocked/sleeping
state ?
35
Operating System and 4.9 Activity :
Software Installation
• Open Task Manager in Windows and see the running process and terminate
any process by right clicking on it and tehn click End Task.
36
Unit
THREADS AND
05 CONCURRENCY
UNIT STRUCTURE
5.0 Learning Objectives
5.1 Introduction
5.2 Multithreading
5.3 Thread implementation
5.3.1 User Space Thread
5.3.2 Kernel Space Thread
5.3.3 Hybrid Thread implementation
5.4 Thread Management
5.5 Concurrency and Its Problems
5.5.1 Inter–Process Communication
5.5.2 Concurrency Problem
5.5.3 Mutual Exclusion
5.6 Let's Sum up
5.7 Check Your Progress : Possible Answers
5.8 Assignments
5.9 Activity
5.10 Case Study
5.11 Further Reading
5.1 Introduction :
To perform several jobs at a time multiprogramming is a must. A program
by itself cannot address several problems at a time. But there is a mechanism
through which a program can solve this problem. The answer is thread. What
is thread ? How does it work ?
A process is a means through which interrelated resources such as the
address space, child processes, opened files, etc. are grouped together. Threads
are multiple execution streams within a single process and are used to separate
the resource management task of the process from its actual execution. While
processes are used to group resources together, threads are the objects scheduled
for execution on the CPU. Thread can also be considered as lightweight process
37
Operating System and in that it does small portion of the entire task of a process. Having a thread
Software Installation adds an advantage to a process by enabling multiple independent executions
in the same process environment by sharing the resources grouped together.
The CPU moves back and forth among the threads running and creates an
illusion that all of the threads are running in parallel. A process may create
several threads (multi–threading) that share same address space and work
together to accomplish a job in a sense they are not as such independent as
two processes are. Threads are very important in today's programming to handle
execution of several tasks of a single process independently of the others. This
is mainly true when one of the tasks of the process may block, and it is required
to let the remaining tasks run without blocking.
For instance, in a word processor, a thread running in the background
may validate spelling and grammar while a foreground thread handles user's
input, and yet another third thread fetches images from the local disk, and
even a fourth thread is doing periodic automatic backups of the file being edited
and so on. Like a single thread (one process), threads can be in any of the
states a process can be in that we hace discussed in last unit.
5.2 Multithreading :
In a multi–thread model, a process starts by creating only one thread
first and the thread creating other threads through a create–thread library
procedure call. The newly created thread runs automatically in the address space
of the creating thread and the identifier of the newly created thread will be
passed to the creating thread. The exit_thread library procedure is used to
terminate a thread when it finishes its task. A thread can also wait for another
thread with a call to the library procedure thread_wait. The CPU has no interrupt
towards threads, as with processes, that forces them to surrender their usage
of the CPU. However, threads use the thread_yield procedure call to willingly
give up the CPU for other threads to use. This avoids the monopolized usage
of the system by a single thread.
Multi–threading brings four main advantages to processes. These are :
• Responsiveness : A thread may give results quickly while other threads
are waiting or slowed down performing some serious computations.
• Resource sharing : By default, threads use common code, data, and other
assets, that enables several jobs to be carried out concurrently in one
address space.
• Economy : Creating, destroying and handling threads is much faster than
performing the same operation for processes since no resource is attached
to a thread
• Scalability : A process with one thread can only run on a single CPU,
regardless of the number of CPUs available, while running a process
with several threads may require distributing the threads t amongst
available processors.
Despite these advantages, threads also bring a complication to processes.
One problem is associated with the fork() system call. Assume a parent and
child processes with multi–threading capability. What would happen if a thread
in the parent is waiting for input ? Does a thread in the child also get blocked ?
Who gets a typed input, only the parent ? The child ? Or both ?
38
Another problem arose due to the common resources shared among Threads and
threads. One thread might close or modify a particular resource which is Concurrency
currently being also used by another thread.
In general, since the advantage of multi–threading overweighs its problems,
modern computer systems implement the concept of multi–threading but it
requires a careful thought and design.
41
Operating System and 5.4 Thread Management :
Software Installation
Pthreads are Linux implementation of thread management mechanism.
Pthreads are defined as a set of C language programming types and procedure
calls. Vendors usually provide a Pthreads implementation in the form of a
header/include file and a library, which you link with your program. The two
names used in thread management frequently are : pthread_t for thread objects
and pthread_attr for thread attributes objects.
The pthread_create function is used to create a new thread, and pthread_exit
function for terminating a thread by itself. A thread to wait for termination
of another thread uses the function pthread_join.
Normally when a program starts up and becomes a process, it starts with
a default thread which brings the conclusion that every process has at least
one thread of control. A process can create extra threads using the following
function :
#include <pthread.h>
int pthread_create (pthread_t *restrict tidp, const pthread_attr_t
*restrict attr, void *(*start_rtn) (void), void *restrict arg)
Where :
• The first argument is a pthread_t type address. Once the function is called
successfully, the variable whose address is passed as first argument will
hold the thread ID of the newly created thread.
• The second argument may contain certain attributes which we want the
new thread to contain. It could be priority etc.
• The third argument is a function pointer. This is something to keep in
mind that each thread starts with a function and that functions address
is passed here as the third argument so that the kernel knows which
function to start the thread from.
• As the function may accept some arguments also we can pass these
arguments in form of a pointer to a void type, which is the fourth
argument.
Check Your Progress – 1 :
1. Thread can also be considered as heavyweight process (True/False)
2. A process may create threads
(a) One (b) Two (c) Three (d) Many
3. Which of the following is an advantage of Multi–threading to process ?
(a) Responsiveness (b) Economy
(c) Scalability (d) All of these
4. A system can implement and manage threads through two main ways
namely user thread and kernel thread. (True/False)
5. Thread creation and destroy has smaller costs in this kernel mode.
(True/False)
3. The main difference between a kernel thread and a user thread is the
fact that the kernel thread has no user task memory mapping. (True/False)
42
5.5 Concurrency and Its Problems : Threads and
Concurrency
The most fundamental task of modern operating systems is management
of multiple processes within uniprocessor, multiprocessor or distributed computer
systems. The fundamental design issue in the management of multiple processes
is concurrency : simultaneous execution of multiple processes
Concurrency arises in three different contexts
Multiple applications : concurrently running applications
Structured applications : an application structured as a set of concurrent
processes (threads)
OS structure : OS implemented as a set of processes
Concurrency provides major benefits in processing efficiency and in
program structuring
5.5.1 Inter–Process Communication :
There is frequent interaction among concurrently running processes.
There are three ways in which concurrent processes interact with each other :
Competition for Resources :
• It occurs when independent processes that are not intended to work
together compete for the use of the same or shared resource, e.g. printer,
memory, or file.
• No communication or message exchange exists amongst such competing
processes.
• Processes are unaware of the existence of each other
Cooperation by Sharing Resources :
• It occurs when processes that are not necessarily aware of each other
usage and interaction to shared data without reference to other processes
but suspects that other processes may have access to the same data
• Processes need to collaborate and guarantee that the data they share are
properly managed Processes are aware of the existence of each other
indirectly
Cooperation by Communication :
• It occurs when various processes communicate with each other, for
instance with message passing in order to provide a way to synchronize
or coordinate their various activities.
• There is nothing shared between processes
• Processes are aware of the existence of each other directly
5.5.2 Concurrency Problems :
There are some serious problems associated with the interaction of
concurrently running processes :
Race Condition : A situation that occurs when two or more processes
are reading or writing into some shared data and the final result depends on
who runs precisely.
Deadlock : It is the permanent blocking of a set of processes that either
compete for system resources or communicate with each other. It involves
conflicting needs for resources by two or more processes.
43
Operating System and It refers to a situation in which a set of two or more processes are waiting
Software Installation for other members of the set to complete an operation in order to proceed,
but none of the members is able to proceed.
e.g. Traffic deadlock : consider a situation in which four cars have arrived
at a four–way stop intersection at the same time. The four quadrants of the
intersection are the resources over which control is needed. If all four cars
proceed into the intersection, then each car controls one resource (one quadrant)
but cannot proceed because the required second resource has already been
controlled by another car. Hence deadlock will occur. The main reason for this
deadlock is because each car needs exclusive use of both resources for certain
period of time.
It is a difficult phenomenon to anticipate and there are no easy general
solutions to this problem.
Starvation : It referees to the situation in which a process is ready to
execute but is continuously denied access to a processor in deference to other
processes.
e.g. suppose that there are three processes P1, P2, and P3 and each require
periodic access to resource R. If the operating system grants the resource to
P1 and P2 alternately, P3 may indefinitely be denied access to the resource,
thus starvation may occur.
5.5.3 Mutual Exclusion :
The key to preventing race condition is to enforce mutual exclusion :
It is the ability to exclude (prohibit) all other processes from using a shared
variable or file while one process is using it.
Part of a program where shared resource (critical resource) is accessed
is called critical region or critical section.
5.8 Assignments :
1. What is the importance of using threads ?
2. What are the main differences between user space and kernel threads ?
3. What are the disadvantages of kernel level thread implementation ?
45
Operating System and 4. Which problems of user space thread implementation are solved by using
Software Installation the scheduler activation thread implementation method ?
5.9 Activity :
• Use windows task manager to manage process
46
Unit
06 PROCESS SCHEDULING
UNIT STRUCTURE
6.0 Learning Objectives
6.1 Introduction
6.2 Overview of Process Scheduling
6.3 Schedular
6.8.1 Long–Term Schedular
6.8.2 Short–term scheduler
6.4 Scheduling
6.5 Process scheduling policies
6.8.1 Non Preemptive scheduling
6.8.2 Pre–emptive scheduling
6.6 Context switch and the interrupt handler
6.7 Scheduling criteria
6.8 Scheduling Algorithms
6.8.1 First–Come First–Served (FCFS) Scheduling
6.8.2 Shortest Job First (SJF)Scheduling
6.8.3 Shortest Remaining Time Scheduling
6.8.4 Round Robin Scheduling (RR)
6.8.5 Priority Scheduling
6.8.6 Multilevel Queue Scheduling
6.8.7 Thread Scheduling
6.8.8 Multiple Processor Scheduling
6.9 Let's Sum Up
6.10 Check Your Progress : Possible Answers
6.11 Assignments
6.12 Activities
6.13 Case Study
6.14 Further Reading
47
Operating System and 6.1 Introduction :
Software Installation
Starvation is one of the problems of concurrency. In a multiprogramming
environment, processes compute for the uniprocessor. How does the operating
system execute these processes in turn with minimal starvation ?
Kernel is one part of an operating system that carries out several activities
and operations among which process management is one. The process management
task includes handling process creation, termination, inter–process communication,
scheduling, switching, dispatching and management of process control blocks.
Scheduling is a means through which fair distribution of execution time is
granted to processes with similar state for a multiprogramming system.
Multiprogramming aims to have some process running at all times and maximize
CPU utilization by switching the CPU among processes so frequently that users
can interact with each program while it is running.
Fig. 6.1 : The three processes queue and CPU interaction among them
48
6.3 Schedular : Process Scheduling
6.4 Scheduling :
Scheduling is fundamental to operating system function. Almost all
computer resources are scheduled before use. The scheduler will not have that
much of a task on simple multi– programmed PCs as only one program is
actually running currently and the user also waits for the completion of all
programs. When it comes to high–end networked workstations and servers, the
scheduler has a real job here and scheduling algorithms used significantly affects
the efficient use of the system. When does a system need to schedule is a
question that should be addressed before going deep into the scheduling
process ?
Four different events bring the need for scheduling. These are :
State change from running to waiting : The current process goes from
the running to the waiting state because it issues an I/O request or some
operating system request that cannot be satisfied immediately.
State change from running to ready : For instance, a timer interrupt
causes the scheduler to run and decide that a process has run for its allotted
interval of time and itis time to move it from the running to the ready state.
State change from waiting to ready : For instance, an I/O operation
is complete for a process that requested it and the process now moves from
the waiting to the ready state. The scheduler may then decide to preempt the
currently–running process and move this ready process into the running state.
Terminates : Some processes spend more time in a waiting state as they
have more I/O requests while some processes spend more time in their running
state. The period of computation between I/O requests of a process is called
the CPU burst. In general, processes can be described as I/O bound or CPU
bound. An I/O–bound process is one that spends more of its time doing and
waiting for I/O than it spends doing computations. Such type of processes
generally experiences short CPU bursts. A CPU–bound process, in contrast,
49
Operating System and generates I/O requests infrequently, using more of its time doing computations
Software Installation and exhibit long CPU bursts. It is important that the job scheduler selects a
good process mix of I/O–bound and CPU– bound processes for best performance.
This scheduling algorithm uses the best of each algorithm by having more
than one queue and is used for situations in which processes can easily be
classified into different groups. For example, a common division is made
between foreground (interactive) processes and background (batch) processes.
These two types of processes have different response–time requirements and
so may have different scheduling needs. Ready queue is partitioned into several
separate queues and processes will be permanently placed in these queues based
on some criteria. Each queue then implements its own scheduling algorithm
for the processes inside and commonly a static preemptive scheduling algorithm
is used among the queues. For example, foreground queue may have absolute
priority over background queue.
Therefore, no process in the background queue could run except the
foreground queues are empty. If a process entered the foreground queue while
a process from the background queue is running, the background queue process
will be preempted and the new process from the foreground queue gets the
CPU causing possible starvation for the background queue process. To address
this problem, time slice can be used between the queues where each queue
gets a certain portion of the CPU time which it can then schedule among the
various processes in its queue. On the above example, for instance, the foreground
queue can be given 80% of the CPU time for RR scheduling among its
processes, whereas the background queue receives 20% of the CPU to give
to its processes in FCFS manner.
Multilevel Feedback Scheduling algorithm is another implementation of
multilevel queue where processes have the right to move between the several
queues of the ready queue instead of being placed permanently into queues
at arrival time based on their CPU–burst properties. A process that uses too
much CPU time is degraded to a lower–priority queue and a process that waits
too long is upgraded to a higher–priority queue.
For example, consider a MLFQ scheduler with three queues, Q0 with
time quantum 8 milliseconds, Q1 with time quantum 16 milliseconds and Q2
on FCFS basis only when queues Q0 and Q1 are empty. A new job enters
queue Q0 is served by FCFS and receives 8 milliseconds. If not finished in
8 milliseconds, it is moved to Q1. At Q1 the job is again served by FCFS
and receives 16 milliseconds. If not completed, it is preempted and moved to
Q2 where it is served in FCFS order with any CPU cycles left over from queues
Q0 and Q1.
59
Operating System and 3. Priority scheduling works as policy
Software Installation
(a) Pre–emptive (b) None Pre–emptive
(c) (a) or (b) (d) None of these
4. Aging is used to guard against indefinite postponement. (True/False)
5. In Priority scheduling algorithm, Priority is expressed in terms of fixed
range number such as 0 to 100.
6.11 Assignments :
• Define the differences between pre–emptive and non–pre–emptive
scheduling
• Write note on Multilevel Queue Scheduling
• Explain Thread Scheduling
60
6.12 Activity : Process Scheduling
61
Unit
PROCESS SYNCHRONIZATION
07 AND DEADLOCKS
UNIT STRUCTURE
7.0 Learning Objectives
7.1 Introduction
7.2 Dead lock Conditions
7.3 Resource–Allocation (R–A) Graph
7.4 Deadlock Handling Mechanisms
7.5 Deadlock Detection
7.6 Deadlock Recovery
7.7 Deadlock Avoidance
7.8 Deadlock Prevention
7.9 Let's Sum Up
7.10 Check Your Progress : Possible Answers
7.11 Assignments
7.12 Activity
7.13 Case Study
7.14 Further Reading
7.1 Introduction :
Deadlock is one of the problems of concurrency. In a multiprogramming
environment, processes compute for resources. Processes wait for resources
which are owned by other processes, in turn these processes wait for that process
to release a resource. How does the operating system bring these processes
to an agreement ?
In a multiprogramming system, processes are always in request of the
several finite resources available in the system for their successful execution.
The resources are partitioned into several types like Memory space, CPU cycles,
files, and I/O devices (such as printers and tape drives). A process must request
a resource before using it, and must release the resource after using it. When
two or more processes simultaneously request for a resource, only one process
will be allowed to use the resource at any instant of time to avoid unintended
outcomes or corrupted outputs.
62
But for most programs, a process requires exclusive use of more than Process Synchronization
one resource at a time without exceeding the number of available resources and Deadlocks
in the system. For instance, two processes, A and B, may need to write document
being scanned on a CD. Process A asks for the scanner first and is allowed
to use it while process B asks for the CD recorder first because of its different
programming structure and will also be permitted to use the recorder, After
process A completes the scanning task, it requests for the CD recorder but
will be denied as it is already being in use by process B. Process B now requests
for the scanner without releasing the CD recorder which is requested by A.
such situations causes both processes to be blocked or move to their waiting
state and remain there forever as both are in need of same resource simultaneously
resulting in a deadlock condition.
Deadlock can be defined as a situation whereby two or more processes
get into a state whereby each is holding a resource that the other is requesting
and processes are waiting in a loop for the resource to be released. None of
these waiting processes will ever wake up and start execution as all the
processes are blocked and needs an event to happen by one of these blocked
processes to move to a running state.
At this point, it is important to distinguish between starvation and a
deadlock. In starvation, a process cannot continue simply because it is not
allocated the CPU while in deadlock, processes cannot continue because they
are blocked and would remain in this state even if the CPU was available
to them. Deadlock is a global condition rather than a local one which means
that if were to analyze any process involved in a deadlock; we would not be
able to find any error in the program of that process. The problem does not
lie with any particular process, but with the interaction between a group of
processes that are multitasked together. Deadlock effectively brings a large
portion of the system to a halt.
Normally, a process goes through three consecutive steps to utilize a
resource. These are :
• Request a resource
• Use the resource
• Release the resource
The operating system keeps in a system table the status of each resource
as free or allocated and if allocated, to which process. If a requested resource
is not available, the operating system can add the requesting process to a waiting
queue of processes for this resource or the process itself waits a little while
and checks again the resource as is the case in some systems. A resource can
be either a preemptable or non–preempt able type. A preemptable resource is
a type of resource that can be shared by taking away from a process owning
it without negatively affecting it. A good example of such type is a memory.
A non–preemptable resource, in contrary, is a non–sharable resource that cannot
be taken from a process currently owning it without negatively affecting it.
CD recorders are examples of non–preemptable resources. A deadlock happens
when resources are non–preemptable types and is most common in multitasking
and client/server environments.
63
Operating System and 7.2 Dead Lock Conditions :
Software Installation
For a deadlock to happen between processes, the following four conditions
should hold simultaneously in a system.
Mutual Exclusion : Resources are non–preemptable and once a process
is allocated a resource, it has exclusive access to that resource. This resource
cannot be shared with other processes
Hold and Wait : A process must be holding exclusively at least one
resource and still make a request for another resource that is currently hold
by another process.
No Preemption : Resources can only be released voluntarily by the
process itself rather than by the action of an external event
Circular Wait : Two or more processes must be in a circular chain where
each process waits for a resource that the next process in the chain holds.
All these four conditions are necessary but not sufficient for a deadlock
to occur. Only if the right combinations of unfortunate circumstances arise,
then will deadlock manifest itself.
65
Operating System and To conclude, resource allocation graphs are tools that can be used to
Software Installation identify the occurrence of a deadlock given sequence of request/release. If a
cycle can be seen in the graph, there is a possibility of deadlock none otherwise.
67
Operating System and 7.6 Deadlock Recovery :
Software Installation
Once, the detection algorithm has succeeded in identifying a deadlock,
several ways exist to overcome the situation. One alternative can be through
notifying the operator about the deadlock and let the operator handle it manually.
Another possibility can be letting the system recover from the deadlock
automatically. Two different approaches are used to recover automatically : pre–
emption and abortion.
Recovery through resource pre–emption
This method works by temporarily taking away a resource from its current
owner process and gives it to another process and finally returns back to the
owner. i.e. successively preempt some resources from processes and give these
resources to other processes until the deadlock cycle is broken.
Three important issues need to be addressed if to use this method
• Victim : which resources and which processes are to be pre–empted ?
Cost minimization must be considered while ordering the pre–emption.
The number of resources a deadlock process is holding and the time
thus far used by the deadlocked process during its execution can be taken
as cost factors.
• Rollback : if a process is pre–empted a resource, what is the fate of
this process ? Obviously, the normal execution of the process can't
proceed as it lacks some required resources. Thus, the process must be
rolled back to some safe state and restarted from that state after the
deadlock is solved. Since, in general, it is difficult to determine what
a safe state is, the simplest solution could be a total rollback.
• Starvation : how can we guarantee that resources will not always be
pre–empted from the same process ? Same process may always be picked
as victim which leads to starvation. Including the number of rollback
in the cost factor can be a solution to this problem.
This recovery method is difficult, if not impossible, as picking a process
to pre–empt is highly dependent on the ease of taking back resources now
held.
Recovery through process termination
Killing one or more processes in a system may help to recover from
deadlocks. Choose a victim process from the cycle and abort it so that other
processes continue running by using the freed resources. This process can be
repeated several times in the cycle till breaking the cycle is possible. A process
outside a cycle can also be picked as a prey process where the resources hold
by it are needed by one or more deadlocked processes. For example, two
processes are running one holding a printer and requesting a CD reader while
the other one using a CD reader and waiting for a printer. The two processes
end up being deadlocked. If there is a third process with another identical printer
and CD reader resource granted, killing this process makes the resources free
which can then be allocated to the two processes solving the deadlock situation.
Killing a process may not be easy. If the process was in the midst of updating
a file or printing data on a printer, terminating it will leave the file in an
incorrect state and the system needs to reset the printer to its correct state
before printing the next job. Factors such as the priority of the process, how
many more resources are required to complete the process, etc. affect the process
68
chosen as a victim. In general, It is best if the process chosen as a victim Process Synchronization
is a process that can resume from the beginning with no bad effect or the and Deadlocks
termination cost is minimal.
70
Banker's algorithm useless. Moreover, it is also unrealistic to assume static Process Synchronization
number of processes in a system as this number varies dynamically in most and Deadlocks
systems. The requirement that a process will eventually release all its resources
on termination is insufficient for a practical system as waiting for hours even
days for the resources to be released is unacceptable. In conclusion, there are
few, if any, currently existing systems using this algorithm to avoid deadlocks.
71
Operating System and is a complicated task if not impossible. Attacking this condition is even more
Software Installation difficult than the previous two.
Circular Wait Condition : The last condition leading to a deadlock is
the circular wait. This can be avoided by letting processes to wait but not
in a circular fashion. One way to achieve this could be through assigning a
global precedence number to resources and force processes to request for
resources according to increasing precedence. That is, if a process holds some
resources and the highest precedence of these resources is m, then this process
cannot request any resource with ordering smaller than m. This forces resource
allocation to follow a particular and non– circular ordering, so circular wait
cannot occur. Alternatively, a system can set a rule specifying a process is
entitled to only a single resource at any time. This allows holding only one
resource per process and if a process requests another resource, it must first
free the one it's currently holding (or hold–and–wait).
Check Your Progress – 1 :
1. Deadlock is not the problems of concurrency. (True/False)
2. The deadlock can be described more precisely in terms of a system
Resource allocation graph (True/False)
3. In Resource allocation graph, if cycle exists in the graph, then all the
processes are free of any deadlock situation (True/False)
4. Banker's Algorithm ensures that a circular–wait condition can never exist.
(True/False)
7.11 Assignments :
1. State the necessary conditions for deadlock to occur.
72 2. What are the various methods for handling deadlocks ?
7.12 Activity : Process Synchronization
and Deadlocks
• Using R–A graph, describe deadlocks
73
Operating System and
BLOCK SUMMARY :
Software Installation
• The most central notion in any operating system is a process.
• A process is a program currently running and that needs different kinds of
resources to be allocated.
• A process also interacts with other processes frequently.
• An operating system is responsible to manage all the requirements of
processes and also coordinate communication among different processes.
• A process state determines the current status of the process. There are three
different states a process can be in.
• Every process has important information associated and this information
are stored in a PCB by the operating system
• Thread is a basic unit of CPU utilization, consisting of a program counter,
a stack, a set of registers, and a thread ID.
• A process can have one or more threads which share the same resources
like the address space and opened files of a process.
• Dependency exists between threads and also they work collaboratively to
accomplish a task and also share same resources.
• Threads can be in any of the process states, threads can create child threads,
etc.
• Concurrently running processes compute for resources. These processes
might communicate directly through message or share common variable on
the memory.
• Concurrency have associated problems, namely race condition, starvation
and deadlock. To avoid race condition, mutual exclusion is a solution.
• Process scheduling is a process of granting CPU for processes in the ready
queue.
• The scheduler decides which process to grant the CPU according to a
scheduling policy.
• pre–emptive policy forcibly suspends a process and gives the CPU to other
processes in a ready queue
• non–pre–emptive scheduling policy simply waits for one process to finish
its execution or voluntarily give–up the CPU and assigns another process
to execute.
• FCFS is a non–pre–emptive scheduling algorithm
• SRT is a pre–emptive scheduling algorithm
• Multiprocessor systems have a more complex scheduling requirement that
must be handled by the scheduler.
BLOCK ASSIGNMENT :
Short Questions
• Why is it not possible to move a process from ready to waiting state ?
• What is the reason for a process to stay in its ready state ?
• What kind of solution is test–and–set lock (TSL) provide ?
• How does deadlock happen in a system ?
74
• What are the two scheduling approaches used in a multi–processor Process Synchronization
system ? and Deadlocks
75
Dr. Babasaheb Ambedkar BCAR-102/
Open University Ahmedabad DCAR-102
Block Objectives :
The objective of this block is to elicit the importance of memory and
describe the difference and usage of swapping, and partitions and work with the
techniques of paging and segmentation and effectively address the different page
replacement algorithms and working sets, it also explain the concept of caching.
It describes the Characteristics of I/O Devices, Principles of I/O hardware,
Principles of I/O Software and differentiate various Disk Scheduling Algorithms.
It define : file, file structure, file access, file attribures and types of files. to
explains in details about various file operations, Directories, types of directory
organizations and directory operations. To Explain file system hierarchy, file
storage allocation methods and methods of directory implementation. To discuss
file sharing, file system consitency, file system reliability
Block Structure :
Unit 8 : Memory Management
Unit 9 : Page Replacement Algorithms
Unit 10 : Device Management
Unit 11 : File Systems
Unit 12 : Directories and File System Hierarchy
Unit 13 : File System Implementation
Unit
08 MEMORY MANAGEMENT
UNIT STRUCTURE
8.0 Learning Objectives
8.1 Introduction
8.2 Swapping
8.3 Partitions
8.4 Paging
8.5 Implementation of Paging
8.6 Hierarchical Page Table
8.7 Address Translation With a 2–Level Page Table
8.8 Segmentation
8.9 Check Your Progress
8.10 Let Us Sum Up
8.11 Check Your Progress : Possible Answers
8.12 Assignment
8.13 Activity
8.14 Case Study
8.15 Further Reading
8.1 Introduction :
One of the fundamental functionality of the operating system is memory
management. Processes, opened files, and input/output devices require memory
to operate. How does the operating manage to allocate memory in a
multiprogramming environment ?
Main memory (RAM) is one of the major component of a computer
system. Every program to get executed must be loaded to the memory. Execution
of the program happens once part or all of the program is loaded and the
program becomes process. Hence, there are possibly number of processes
running there will be an input queue of processes on the disk that wait to
be brought to the memory. Obviously, programs needs to pass through several
steps before they start to execute.
A program file or binary file is different from data or ASCII file in such
a way that it has code (text) part, data part, and stack part. These parts are
76
attached to the memory at different stages/times. Attaching these components Memory Management
of a program is known as binding. Address binding of code or text, data, and
stack parts of the program to memory addresses occurs at the following stages :
Compile time : Address binding is done at program compilation time.
Absolute code is generated if the available memory is known in advance. Stating
memory address is changed, the source code must be recompiled.
Load time : If the memory location to bind is not known in advance,
a relocate table code has to be generated at the loading time, i.e., the time
when the program is brought in to the memory.
Execution time : With the support from hardware, for example, using
base and limit registers, address maps can be done at execution time. Address
binding can be delayed until execution time if the process has to be moved
from segment to segment.
The steps of program execution is illustrated in the Fig. 8.1.
memory
8.2 Swapping :
Concurrently running multiple processes on a uni–processor environment
requires larger memory. How large the memory be ? If we set the size always
we want to run additional process that requires more memory. One cannot set
the exact size of the memory that is large enough for all the processes on
the system. The solution for this problem is swapping.
What is Swapping ?
It is impractical to imagine the size of main memory to accommodate
all the processes. Normally, if you check the number of processes Windows
and Linux are executing at a time, more than 60 at start up. Maintaining all
these processes in the primary memory is costly as large area of memory is
needed. A process can be swapped temporarily out of the main memory and
stored in the disk is the simplest strategy.
It is the responsibility of the memory manager to swap ideal processes
fast enough in the memory. If you consider any CPU–scheduling algorithm,
there is a mechanism that identifies a process that finishes its quanta recently,
or has least priority, or least waiting time, or long execution time. In a specific
scheduling algorithms, there are processes that are scheduled last. So, the
memory manager swaps these processes to the disk in order to larger space
in the main memory and be able to execute new processes.
78
Memory Management
8.3 Partitions :
Multiple processes reside in the main memory at a time. Main memory
is partitioned into two : resident operating system, also known as kernel space,
in low memory area and user space where user processes are held in high
memory area.
Single Partition Allocation
Base/relocation register is used to hold the starting physical address for
the user process. They provide protection for the operating system code and
data from being changed by user processes. The logical address generated by
user processes should not be greater than the limit register. Otherwise, the
process is trying to address a memory location which is not part of its own
address space. The partition size is fixed. The early IBM mainframe operating
system, OS/MFT (Multiprogramming with Fixed Number of Tasks), was a
successful single partition operating system.
Multiple Partition Allocation
The user space is divided into multiple partitions. The free partitions
are referred to as holes. Holes have various size, which are scattered throughout
the main memory. A process arriving to the memory will be allocated to a
hole which is large enough to hold it.
The operating system is responsible to keep track of information about
the allocated and free (hole) partitions.
Dynamic Memory Allocation
It is an algorithm that addresses how to allocate a process with n bytes
to a list of free holes. The following are the three different algorithms :
First–fit : Starting from the first hole, scan for a hole which is large
enough for the process to be allocated.
79
Operating System and Best–fit : Scan for the entire holes in the list, allocated the process in
Software Installation a hole that produces a smallest leftover or internal fragmentation. It is best
allocation algorithm but poor performance, since it has to scan all the free
holes.
Next–fit : Scanning starts from the location of the last placement, and
chooses the next available block that is large enough.
Worst–fit : Scan for the entire holes in the list, allocated the process
in a hole that produces a largest leftover or internal fragmentation. It is worst
allocation algorithm with poor performance, since it has to scan all the free
holes.
First–fit and best–fit algorithms are better in terms of storage utilization.
Fragmentation
As a process is not large enough to take the entire space of a hole, there
is a space which is unused. Such spaces are referred as fragmented spaces.
There are two kinds of fragmentations :
• External fragmentation
• Internal fragmentation
External fragmentation
It refers to the total memory space which is left over when holes are
assigned to processes. These spaces are not contiguous.
Internal fragmentation
The assigned process is not large enough to take the entire space of the
hole producing an internal fragmentation.
To reduce external fragmentation a technique called compaction is used.
Compaction moves all allocated memory spaces to one end of the main memory
in order to produce one larger free memory block. It is expensive activity due
to the effort it takes to move the content. Moreover, the relocation should be
dynamic, since compaction is done at execution time.
8.4 Paging :
Both single partition and multiple partitions allocation are inefficient in
the use of memory; the former results in internal fragmentation, the latter in
external fragmentation.
Suppose, however, that main memory is partitioned into equal fixed–size
chunks that are relatively small known as frames or page frames (size is power
of 2, between 512 bytes and 8192 bytes). And that each process is also divided
into small fixed–size chunks of the same size known as pages. To run a program
of size n pages, need to find n free frames and load program. In this approach,
internal fragmentation happens only on a fraction of the last page of a process.
There is no external fragmentation.
While processes occupy some of the frames, there are also free frames.
A list of free frames is maintained by the operating system.
The page table shows a frame location for each page of a process. The
logical address generated by the process consists of a page number and an
offset within the page. A logical address is the location of a word relative
to the beginning of the process.
80
The CPU translates the logical address into a physical address. Logical– Memory Management
to–physical address translation is done by Memory Management Unit (MMU).
The processor uses the page table to convert the logical address logical address
(page number, offset) to produce a physical address (frame number, offset).
Page number (p) – used to identify the page's index in a page table which
contains base address of each page in physical memory.
Page offset (d) – united with start address to define the physical memory
address that is sent to the physical memory.
Fig. 8.4 : Block Diagram for Mapping Logical Address to Physical Address
Assume that the logical memory has four pages (page 0 to page 3). The
physical memory has eight frames. The page table entry shows the mapping
of the logical pages to the frames. Consider page size of 1KB. Let the logical
address generated by the processor is 1300. To determine the frame where this
page is located, divide the logical address by the page size, 1024, resulting
page 1. Locating the frame that corresponds to page 1 is frame 6. The base
address is the frame number times the frame size, which is equal to page size,
results 6144.
To find the page offset, compute the remainder of the 1300 by page
size. The page offset is 276. The physical address is obtained by adding the
base address with the page offset, which is 6420.
Frame
number
page 0 0 5 0
page 1 1 6 1 page 2
page 2 2 1 2 page 3
page 3 3 2 3
logical page 4
memory table 5 page 0
6 page 1
7 physical
memory
Fig. 8.5 : Paging Using Page Table Data Structure
81
Operating System and If the page do not have an entry in the page table, you cannot located
Software Installation the corresponding frame number. That shows the page is not loaded to the
physical memory. In such cases, a page fault is said to happen and the operating
system is notified of a trap. The process which results a page fault is blocked.
Operating system loads the page that creates a page fault and brings the process
to ready state.
82
Memory Management
83
Operating System and Now there is one entry per frame. Again using 4KB pages and 4 byte
Software Installation PTEs, we see that the table would be a constant 0.1% of the size of real memory.
But on a Translation Lookaside Buffer (TLB) miss, the system must
search the inverted page table, which would be hopelessly slow except that
some tricks are employed. Specifically, hashing is used.
A Translation Lookaside Buffer or TLB is an associate memory where
the index field is the page number. The other fields include the frame number,
dirty bit, valid bit, etc.
Note that, unlike the situation with a page table, the page number is
stored in the TLB; indeed it is the index field.
A TLB is small and expensive but at least it is fast. When the page
number is in the TLB, the frame number is returned very quickly.
On a miss, a TLB reload is performed. The page number is looked up
in the page table. The record found is placed in the TLB and a victim is
discarded (not really discarded, dirty and referenced bits are copied back to
the page table entry). There is no placement question since all TLB entries
are accessed at the same time and hence are equally suitable. But there is a
replacement question.
8.8 Segmentation :
A user program is divided into segments. Rather than dividing the
program into unstructured pages, it divides the program based on the different
sections, segments. The size of the segments are not necessarily equal. As with
paging, a logical address using segmentation consists of two parts, in this case
a segment number and an offset.
Since segments are unequal in size, segmentation is similar to dynamic
partitioning. The difference with dynamic partitioning is that with segmentation
a program may occupy more than one partition. Moreover, these partitions need
not be contiguous. Segmentation avoids internal fragmentation but, similar to
dynamic partitioning, it suffers from external fragmentation.
Segmentation gives much more sense for programmers, compared to
paging, in visualizing and organizing programs and data. Typically, the
programmer or compiler will assign programs and data to different segments.
For purposes of modular programming, the program or data may be further
broken down into multiple segments.
84
A segment refers to a logical unit of a program which is code/text, stack, Memory Management
local variables, global variables, etc. Figure 8.7 illustrates the association of
user space and physical memory allocation for segments of a program.
In this case, there is no simple relationship between logical addresses
and physical addresses. Similar to paging, logical address consists of segment
number and offset. A segment table is used to map logical addresses to physical
addresses. In the segment table, each segment table entry would have to give
the starting address in main memory of the corresponding segment using the
base register. The entry should also provide the length of the segment, to assure
that invalid addresses are not used using the limit register.
8.12 Assignment :
• Explain what will happen if a page is not found in the physical memory.
• Why hierarchical pages structure is required ?
• When does the address binding activity is done ?
• What is the purpose of limit register ?
85
Operating System and 8.13 Activity :
Software Installation
• Describe the different parts of a program.
86
Unit
PAGE REPLACEMENT
09 ALGORITHMS
UNIT STRUCTURE
9.0 Learning Objectives
9.1 Introduction
9.2 The Optimal Page Replacement Algorithm (PRA)
9.3 The Not Recently Used (NRU) PRA
9.4 First In, First Out (FIFO) PRA
9.5 Second Chance PRA
9.6 Clock PRA
9.7 Last In, First Out (LIFO) PRA
9.8 Least Recently Used (LRU) PRA
9.9 The Aging PRA
9.10 Check Your Progress
9.11 Let Us Sum Up
9.12 Check Your Progress : Possible Answers
9.13 Assignment
9.14 Activity
9.15 Case Study
9.16 Further Reading
9.1 Introduction :
In the previous activity, we demonstrated how pages are managed. If a
page is found on the physical memory, the instruction continue to execute.
But, if page fault happens, the required page shall be brought to the memory
replacing a victim page. How is the victim page selected ? We require an
algorithm which is efficient enough in such a way that it minimizes subsequent
page faults.
A page is brought into the memory when it is referenced. When a process
first is started, there will be number of page faults. After a while, the number
of page faults should drop to a very low level. The operating system is
responsible to identify a victim page to be dispossessed from memory during
a page fault.
These are solutions to the replacement a page in a memory question.
A better solution take advantage of locality when choosing the victim page
to replace. 87
Operating System and Temporal locality : If a word is referenced now, it is likely to be
Software Installation referenced in the near future. This argues for caching referenced words, i.e.
keeping the referenced word near the processor for a while.
Spatial locality : If a word is referenced now, nearby words are likely
to be referenced in the near future. This argues for pre fetching words around
the currently referenced word.
Temporal and spatial locality are lumped together into locality : If any
word in a page is referenced, each word in the page is likely to be referenced.
So it is good to bring in the entire page on a miss and to keep the page in
memory for a while.
At the very beginning a program has no record to refer for locality. At
this point the paging system is said to be undergoing a cold start.
Pages belonging to processes that have terminated are of course perfect
choices for victims. Pages belonging to processes that have been blocked for
a long time are good choices as well.
93
Operating System and 8. Why does the OS want to know the working set for each process ? Circle
Software Installation the best answer and briefly explain your reasoning.
(a) To determine the best page to replace.
(b) To know when memory is over–committed.
(c) Assessment : Essay Type Questions
9.13 Assignment :
• What is the job of Memory Management Unit (MMU) ?
• What is the basic objective of page replacement algorithm ?
94
9.14 Activity : Page Replacement
Algorithms
• Consider the page replacement policies of OPRA, FIFO, and LRU. Which
of the following statements are true ? Be careful to notice when the phrase
states "better than or equal to" versus "strictly better than".
a. OPT always performs better than or equal to LRU.
b. OPT always performs strictly better than LRU.
c. LRU always performs better than or equal to FIFO.
d. LRU always performs strictly better than FIFO.
e. OPT with n+1 pages of physical memory always performs better
than or equal to OPT with n pages.
f. OPT with n+1 pages of physical memory always performs strictly
better than OPT with n pages.
g. FIFO with n+1 pages of physical memory always performs better
than or equal to FIFO with n pages.
h. FIFO with n+1 pages of physical memory always performs strictly
better than FIFO with n pages.
i. LRU with n+1 pages of physical memory always performs better
than or equal to LRU with n pages.
j. LRU with n+1 pages of physical memory always performs strictly
better than LRU with n pages.
95
Unit
10 DEVICE MANAGEMENT
UNIT STRUCTURE
10.0 Learning Objectives
10.1 Introduction
10.2 Characteristics of I/O Devices
10.3 Principles of I/O hardware
10.4 Principles of I/O Software
10.5 Disk
10.6 Disk Scheduling Algorithms
10.7 Check Your Progress
10.8 Let Us Sum Up
10.9 Check Your Progress : Possible Answers
10.10 Assignment
10.11 Activity
10.12 Case Study
10.13 Further Reading
10.1 Introduction :
I/O device management is a very important activity of the operating
system. I/O is important for the communication of users to computer. The
following are services of the operating system as I/O manager :
• Monitoring all the I/O devices
• Order the I/O devices, capture interrupts and manage bugs related to
I/O
• Avail communication channel between I/O devices and all other hardware
components
Accessing and storing information is the task of every computer application.
A clear and obvious requirement of an operating system is thus, the provision
of a convenient, efficient, and robust information handling system. A process
can use its address space to store some amount of information. However, three
main problems are associated with this method.
96
One is the adequacy of the space to accommodate all information to Device Management
be stored by the application as the size of the address space is determined
by the size of the virtual address space.
The second problem is the data loss as the process terminates the
information kept on its address space is also lost though the information is
required to be retained for long period of time.
The third problem is concurrent accessibility of the information by other
processes as information saved in one process's address space is accessible only
to that process and sometimes there is a need to make this information as whole
or part of it available to other processes as well. Solving these problems by
separately managing information resulted by a process from its process is a
concern of any operating system which is usually done by storing the information
on external media in units called files.
The operating system manages naming, structure, access, use, protection
and implementation of these files. Thus, component of an operating system
and monitors activities related with files is known as a file system which is
to be addressed in this section. File management system consists of system
utility programs that run as privileged applications concerned with secondary
storages.
Figure : Device
I/O Unit :
Indicates the hardware components. There are two major components –
Electronic Component (Device controller/Adapter) and the Mechanical Component
Memory–mapped I/O is a technique for communicating with I/O devices.
In this case a certain portion of the processor's address space is mapped
to the device, and communications occur by reading and writing directly to/
from those memory areas.
Memory–mapped I/O :
It is suitable for devices which must move large quantities of data quickly,
such as graphics cards.
Memory–mapped I/O can be used either instead of or more often in
combination with traditional registers. For example, graphics cards still use
registers for control information such as setting the video mode.
A potential problem exists with memory–mapped I/O, if a process is
allowed to write directly to the address space used by a memory–mapped
I/O device.
Direct Memory Access (DMA) :
It is a technique for moving a data directly between main memory and
I/O devices without the CPU's intervention.
In the absence of DMA reading from a disk is done with the following
steps :
• The controller serially fetches every bit from the block which may have
one or more sectors and maintains the whole data in its internal buffer
• The controller then checks for a read error by computing the checksum
value of the data
• The controller then sends an interrupt signal to the system
• The OS reads the information found in the controller's buffer, which is
the disk block byte by byte or word by word and put it on memory
• Problem : Since the OS controls the loop of reading, it wastes the CPU
time. On the other hand, when DMA is used the Device Controller will
do the counting and address tracking activities.
98
The following diagram shows the steps to use DMA : Device Management
99
Operating System and Layers of I/O software
Software Installation
The following are the I/O software layers
• Interrupt handler (bottom)
• Device driver
• Device independent OS software
• User–level software (top)
Interrupt Handler
Interrupts are undesirable and inevitable situations in life but can be
hidden away. One of the methods used to hide interrupts is through blocking
all processes with an I/O operation till the I/O is finished and an interrupt
happens. The interrupt method will then need to perform various actions and
unblock the process which started it.
Device Driver
All device – dependent code goes in the device driver
Only one device type, or at most, one group of closely linked devices
is handled through each device driver.
Each device driver handles one device type, or at most, one class of
closely related devices.
Each controller has one or more device registers used to give it command
The device driver is responsible to give these commands and ensure their
proper execution
Thus, the disk driver is the only part of the OS that knows how many
registers that disk controller has and what they are used for.
Generally, we can say that a device driver is responsible to get requests
from the software, which is device independent, and issue orders for the
execution of the requests.
Steps in carrying out I/O requests :
• Translate the requests from abstract to concrete terms
• Write the interpreted requests into registers of the device controller's
• The device driver blocks itself until an interrupt comes which awakens
or unblocks the driver
• The Device driver starts its operations by first testing for any error on
the I/O device
• If no error is identified and everything is properly functioning, data is
passed from the driver to the requester software
• If the driver couldn't get any request in the queue, it will move back
to block state and waits until a request comes in.
Device Independent I/O Software
It is large fraction of I/O software
Services it provides are :
Uniform interfacing for device drivers – perform I/O function common
to all drives
100
Device Naming – responsible for mapping symbolic devices names onto Device Management
the proper driver
Device protection – secures devices by blocking illegitimate or non–
allowed access requests of users
Providing device–independent block size – provide uniform block size
to higher layers hiding differences in block sizes
Buffering : if a user process write half a block, the OS will normally
keep the data in buffer until the rest of the data are written. Keyboard inputs
that arrive before it is needed also require buffering.
Storage allocation on block devices : when a file is created and filled
with data, new disk blocks have to be allocated to the file. To perform this
allocation, the OS needs a list of free blocks and used some algorithms for
allocation
Allocating and releasing dedicated devices : The OS is solely responsible
to validate any device request and act accordingly by either granting or denying
the service.
Error reporting : Errors handling, by and large, is done by drivers. Most
errors are device dependent. When a device request comes to the driver, it
attempts to communicate the requested block certain number of times.
If it is unable to read the block during these times, it stops trying and
communicate the requester software the status and finally reports to the caller.
User Space I/O Software
A situation where a small portion of the I/O software is outside the OS
In this case, library procedures are used to perform system calls including
I/O system calls. For instance, if a C program has a function call count=write(fd,
buffer, n bytes); the write procedure from the library will be linked with it
and kept in the binary program which will be loaded to memory during
execution
The library procedures are also responsible to format the I/O requests
The following is an example how the I/O system works during reading
by an application
• Step 1 : system call to do file reading is passed from the user program.
• Step 2 : the device–independent software checks the block cache; if the
requested block is found there, the device driver will be called.
• Step 3 : The device driver issue the request to the hardware and the
requesting user process will be moved to a block state until the disk
operation is finished
• Step 4 : the disk then generates an interrupt once it finishes the operation
• Step 5 : The interrupt handler immediately takes over and investigate
the interrupt i.e., it first checks for the device currently requiring attention.
It then reads the output of the device and unblocks the sleeping process
indicating the I/O process is being completed and let the user process
continue
The following table 1. shows the I/O system layers along with the major
responsibilities of each layer.
101
Operating System and Table 1 : Input Output layers
Software Installation
10.5 Disk :
All real disks are organized into cylinders, each one containing many
tracks. Each of the tracks then will be divided into sectors (equal number of
sectors or different number of sectors)
In the case of equal number of sectors
The data density as closer to the center (hub) is high
The speed increases as the read/write moves to the outer tracks
Modern large hard drives have more sectors per track on outer tracks
e.g. IDE drives
Many controllers, except floppy disk controllers, are capable of doing
a read or write operation on one drive and also seek operation on one or more
other drives simultaneously.
Disk Access Time
Three factors determine the time required to read or write a disk block :
The seek time (the time to move the arm to the proper cylinder)
The rotational delay (the time for the proper sector to rotate under the
head)
The actual data transfer time
For most disks, the seek time dominates the other two times, so reducing
the mean seek time can improve system performance substantially.
Disk requests can come from processes while the arm is doing a seek
operation of another process. A table of waiting disk requests is kept by disk
drivers. The table is indexed by cylinder number and pending requests of each
cylinder is linked through a linked list that is headed by the table entries.
104
10.9 Check Your Progress : Possible Answers : Device Management
10.10 Assignment :
1. Explain the I/O software goals
2. Discuss the two most widely used approaches to improve disk I/O
performance
3. What is DMA ? What advantages does it provide to the operating system ?
4. Explain the I/O software goals
5. What is DMA ? What advantages does it provide to the operating system ?
10.11 Activity :
• Discuss the two most widely used approaches to improve disk I/O
performance
105
Unit
11 FILE SYSTEMS
UNIT STRUCTURE
11.0 Learning Objectives
11.1 Introduction
11.2 File Naming
11.3 File Types
11.4 File Attributes
11.5 File Operations
11.6 Check Your Progress
11.7 Let Us Sum Up
11.8 Check Your Progress : Possible Answers
11.9 Assignment
11.10 Activity
11.11 Case Study
11.12 Further Reading
11.1 Introduction :
A file is a named collection of related information that are treated as
a single entity defined by its creator and kept on secondary storage devices
that can be accessed by users or applications through file management systems.
In this activity, we will discuss how files are handled by the operating system,
the naming scheme, access methods, and operations carried out on a file.
File is a contiguous logical storage unit defined by the operating system
by abstracting the physical properties of storage devices so that the computer
system will be convenient to use by providing a uniform logical view of
information storage. File is the most visible aspect of an OS which is used
to store and retrieve information from the disk
Files are mapped by the operating system onto physical devices. These
storage devices are usually non–volatile making the contents persistent through
power failures and system reboots. Files represent both programs and data where
data files may be numeric, alphabetic, alphanumeric, or binary. Files can also
be either in free form or rigidly formatted. Every file has a name associated
with it which is used to interact with. Operating systems provide a layer of
system–level software, using system–calls to provide services relating to the
provision of files which avoids the need for each application program to manage
106
its own disk allocation and access. File manager component of the operating File Systems
system is responsible for the maintenance of files on secondary storages as
memory manager is responsible for the maintenance of primary memory. Let's
discuss some properties of files from user's point of view.
108
Key length Number of bytes in a key field File Systems
109
Operating System and 3. In a file access, a system reads the bytes in order starting
Software Installation from the beginning without skipping any in between.
4. In a file access, a system can traverse the file directly in any
order.
5. operating system make distinctions between uppercase and
lowercase file names
6. is a binary file consisting of list of files contained in it.
7. files are used for unbuffered data transfer to and from a device
11.9 Assignment :
a. Discuss in detail three file attributes.
b. What is the difference between random access and sequential access ?
c. What are the extensions found in file name used for ?
11.10 Activity :
• Identify different types of files exists on your computer / laptop
• Discuss the problems encountered by a process when keeping information
in its address space
110
11.12 Further Reading : File Systems
111
DIRECTORIES AND FILE
Unit
12 SYSTEM HIERARCHY
UNIT STRUCTURE
12.0 Learning Objectives
12.1 Introduction
12.2 File Organization
12.3 Directory Organization
12.4 File System Hierarchy
12.5 Check Your Progress
12.6 Let Us Sum Up
12.7 Check Your Progress : Possible Answers
12.8 Assignment
12.9 Activity
12.10 Case Study
12.11 Further Reading
12.1 Introduction :
Directory is collection of nodes that contains information about files.
Directories shall be logically organized to achieve efficiency, facilitate convenient
naming, and allow file grouping. This activity covers overview of file directory,
organization of directories and operations that can be carried out on directories
by users or applications.A general purpose computer maintains thousands and
millions of files on secondary storages.
An entire space of a secondary storage device may be used to store a
file or only its portion. Directory is a means provided by the file system to
keep track of files. In most systems, directories, also known as folders, are
files themselves owned by the operating system and have their own organization,
properties and operations. They provide mapping between file names and the
files themselves. Directory contains a number of entries about files such as
attributes, location, ownership etc.. The directory may keep the attributes of
a file within itself, like a table, or may keep them elsewhere and access them
through a pointer. In opening a file, the OS puts all the attributes in main
memory for subsequent usage
112
12.2 File Organization : Directories and
File System Hierarchy
Files can be organized in a single–level directory structure or multi–level
directory structure. In a single level directory structure, all the files in a system
are kept in one directory, which may also be referred as root directory. This
organization scheme is known to be simple and results in fast file search.
However, it encounters a problem when the number of files to be maintained
increases or when used in a multi–user systems as names of each file is required
to be unique. If two or more users create files with same name, this uniqueness
requirement is overridden and in such cases the last file created overwrites
the previous one causing the first file being replaced with another file of same
name.
In a two–level directory structure, a private user level directory is
assigned for each user which elevates the naming conflicts encountered in a
single–level directory structure. The basic implementation of this organization
requires all users to access only their own directory. But with little modification,
it can extend to allow users to also access other user's directories through some
notification mechanism. In this organization, it is possible to provide same
names for files of different user. The system implicitly knows where to search
for a file when asked to open a file since each user is associated with a private
directory. This organization also has its own drawbacks. It creates total isolation
between users which is not required in systems where processes share and
exchange data. It may also not be satisfactory if users have many files. Figure
1 and figure 2 below shows the single–level and two–level directory organizations
respectively.
12.8 Assessment :
• What are the limitations of a two–level directory organization ?
• What does a link operation define on directories ?
• What is the difference between absolute path and relative path of a file ?
• Discuss what a file system is
• What are the different layers found in a file system hierarchy ?
12.9 Activity :
• What advantage has the hierarchical arrangement of a file system ?
• What are the two design issues related with file systems ?
117
Unit
FILE SYSTEM
13 IMPLEMENTATION
UNIT STRUCTURE
13.0 Learning Objectives
13.1 Introduction
13.2 File System Layout
13.3 File Storage Allocation Methods
13.4 Directory Implementation
13.5 File Sharing
13.6 Disk Space Management
13.7 File System Reliability
13.8 File System Consistency
13.9 File System Performance
13.10 Check Your Progress
13.11 Let Us Sum Up
13.12 Check Your Progress : Possible Answers
13.13 Assignment
13.14 Activity
13.15 Case Study
13.16 Further Reading
13.1 Introduction :
In the previous unit, we tried to point out file system from the user's
view as well as kernel's view. Let's now move our discussion to the structures
and operations used to implement file systems. In this activity, files and
directory storages, disk space management, as well as efficient and reliable
file management system implementations will be examined.
A directory is defined with fixed size entries per each file and holds
the file name, structure of the file attributes and disk addresses specifying
locations of disk blocks. It is also possible to store this information in the
i–nodes for systems with i–node space allocation.
A file name can be of fixed length with a pre–defined number of
characters constituting the name or variable length which doesn't restrict the
character numbers in a file name. These variant of file names can be implemented
by setting a limit on file names length, commonly 255 characters. This approach
has simple implementations but wastes directory space as few files have such
lengthy names.
Another possibility can be considering a variant entry size in directories
containing the length of the entry itself, some file attributes and the file name.
Every file name is expected to fill out an integral number of words so that
the next directory entry is started on a word boundary. Figure7.6 depicts this
method of handling variable length file names. During file removal process,,
a variable size break is introduced in the directory which may not be enough
for a newly coming file. A page fault can also occur while reading a file name
as directories may cover multiple pages of memory.
Making constant size directory entries and keeping all the file names
together in a heap found at the end of the directory can also be another solution
towards handling the variant long file names as. This method solves the fitting
into a freed space problem encountered in the previous scheme. But heap
management should be performed and there is no guarantee of page fault
avoidance.
In all these directory implementations, a linear list of directory entries
with pointers to the data blocks is used which makes searching for files linear.
This method is simple to program but time–consuming to execute. Creating
a new file, for instance, requires searching the directory to be sure that no
existing file has the same name and the new entry is added at the end of the
directory. File deletion also requires searching the directory for the named file
and then release the space allocated to it. This linear arrangement of entries
results in a slow search operation especially when the directory is too long.
Caching results of a search or using a hash table of each directory can be
used to speed up the search. The cache is first checked for the file name before
starting the search avoiding the long lookup if it is found in the cache.
Through hash table method, file name is entered by representing it with
value between 0 and n–1 by performing division by n and taking the remainder
of the division or by adding the words in the file name and dividing it by
n where n is the size of the hash table. This hash code is inspected to determine
if it is used or not. If the slot is occupied, a linked list headed at that table
entry is created that threads through all entries with same hash table. But if
the slot is unexploited, a pointer to the file entry that follows the hash table
will be kept there. Searching for a file is done by hashing the file name and
select a hash table entry and checking if the file is present in all the chain
entries. If the file is not found, the file is not available in the directory. While
searching is improved in hash table implementation, administration complexity
is also inevitable
121
Operating System and 13.5 File Sharing :
Software Installation
In a multiuser system, there is a frequent need to share files among users
which demands the need to present common files within different directories
of multiple users at a time. Two important issues needs to be addressed in
association with file sharing. These are access rights and simultaneous access
of files. One thing to note is the difference between shared file (or directory)
and two copies of the file. With two copies of a file, the copy is accessible
by each user not the original, and if one user updates the file, the changes
will not appear in the other's copy. With a shared file, however, only one actual
file exists, making immediately visible any changes made by one person to
the other. Sharing is particularly important for subdirectories as a newly created
file by one user will automatically appear in all the shared subdirectories.
Several implementation possibilities are there for shared files.
A common way, exemplified by many of the UNIX systems, is to create
a new directory entry called a link which is effectively a pointer to another
file or subdirectory. When a shared file or directory exists, the system creates
a new file of type link specifying the path name of the file it is linked to.
Links are easily identified by their format in the directory entry (or by having
a special type on systems that support types) and are effectively indirect pointers
which are neglected by the operating system when traversing directory trees
and thus referred as symbolic links. The advantage of symbolic link
implementation is their use to link to files over networks through the network
address specification of the machine that holds the shared file as link. But
the problem with this implementation is the resulting of two or more paths
of same file. Another problem associated with the symbolic link implementation
is the extra overhead incurred as a result of parsing the path component wise
through extra disc accesses.
Another shared file implementation approach is through simple duplication
of all information about the shared files in both directories resulting in identical
and non–distinguishable entries which is different from the link implementation
discussed earlier. Consistency maintenance is a major problem in this
implementation during file modifications as the changed content is visible only
to the user making the changes which brings down the sharing concept.
124
13.8 File System Consistency : File System
Implementation
A file system is always dealing with blocks through Reading, modifying
and writing blocks. Inconsistency may arise if a file system updates a block
but the system is interrupted before the changes are written. This inconsistency
becomes even worse when the unwritten block is free list, i–node or directory
blocks. To overcome this problem, most systems implement a utility program
to check for file system consistency that is run after each system boot like
the scandisk for windows and fsck for UNIX. The consistency check can be
performed on blocks or files. Two tables are constructed by the utility program
during block consistency checking each table with counters for each block that
is initialized with 0. The first table's counters count the number of times each
block exist in that file while the second table's counters keep track of the number
of times each block is found in the free list. List of all block numbers used
in a file is constructed by the program from the i–node. The counter at the
first table is then incremented for every block read. The program also checks
the bitmap to see blocks not in use and increments the counter of the second
table for every occurrence of a block in the bitmap. The consistency of the
blocks will then be checked by checking the counters from the two tables.
If each block has a 1 value either in the first or second table, the program
returns fine indicating block consistency while a value 0 in both tables
corresponding to a block number indicates file inconsistency.
The utility program then fixes the inconsistency and informs the user.
File consistency checking is also done in same fashion in a directory system.
The inspection starts from the root directory and recursively descends through
the tree. The file's usage counter is incremented for every file in each directory
which will be checked against a list sorted by i–node numbers that indicates
the number of directories where each file is found in. When a file is created,
the counter starts at 1 and increments every time a file is linked. If the counters
value match, the file is determined to be consistent. The inconsistency happens
if the values of the counters do not match in which case the program should
take measures to correct the values.
125
Operating System and on to the cache first and then the request is answered. Due to the large number
Software Installation of blocks found in cache, the system needs to respond quickly for a block's
request. One way to do this can be through hashing the device and disk
addresses and then look for the disk in the hash table. If a block is not found
in the cache and needs to be brought and if the cache is already full, some
blocks must be removed and re–written back to the disk and make some space
for the new comer. This situation is similar with paging and thus the algorithms
discussed for page replacement can also be used for block replacement as well.
Block read ahead is another technique used by file systems to optimize
performance. This method brings blocks into the cache ahead before they are
requested so that a hit rate is increased. With this method, a requested block
and several subsequent blocks which are likely to be requested after current
block is processed are read and cached. Retrieving these data from the disk
in one transfer and caching them saves a considerable amount of time. This
works well for sequentially accessed files where the file system checks if block
a+1 exists in the cache while reading block and makes a schedule to read block
a+1 to the cache.
However, if the file is randomly accessible, block read ahead has a
disadvantage of tying up the disk bandwidth with writing unwanted blocks into
the cache and even dropping possibly needed blocks from the cache.
Arranging blocks which more likely are accessed in sequence together
also minimizes the amount of disk arm motion which in turn enhances disk
access performance.
A file has different organization when seen from the system's perspective.
File system designers are thus required to focus on internal structures as storage
allocation, disk space management, sharing of files, reliability and performance.
Three methods are used to allocate space to file systems among which the
i–node is known to be the optimal way. Directory–management routines must
consider efficiency, performance, and reliability. A hash table is a commonly
used method, as it is fast and efficient. Unfortunately, damage to the table or
a system crash can result in inconsistency between the directory information
and the disk's contents. A consistency checker can be used to repair the damage.
Operating–system backup tools allow disk data to be copied to tape, enabling
the user to recover from data or even disk loss due to hardware failure, operating
system bug, or user error.
A file is a named collection of related information that are treated as
a single entity defined by its creator and kept on secondary storage devices.
File names can be of fixed length or variable length and are used to interact
with it. A file can be structured in bytes, records or trees. Several user programs
or system calls can be carried out to interact with files. Each device in a file
system keeps a volume table of contents or a device directory listing the location
of the files on the device. In addition, it is useful to create directories to allow
files to be organized.
Three directory organization methods are available : a single–level, two–
level and tree directory organization. A tree–structured directory allows a user
to create subdirectories to organize files.
The file system resides permanently on secondary storage devices, mostly
on disks which is designed to hold a large amount of data permanently. Physical
disks may be segmented into partitions to control media use and to allow
multiple, possibly varying, file systems on a single spindle. These file systems
are mounted onto a layered logical file system architecture to make use of
them. The lower levels deal with the physical properties of storage devices
while the upper levels deal with symbolic file names and logical properties
of files. The Intermediate levels map the logical file concepts into physical
device properties. Disk space is allocated to files through three different
mechanisms. Directory–management routines must consider efficiency,
performance, and reliability of file systems.
13.13 Assignment :
• Discuss about the MBR along with its contents.
• What are the file sharing schemes used by a file management system ?
• What advantages and disadvantages do you observe on disk quota
specification ?
127
Operating System and • What is a free block space management ? Why is it necessary for the
Software Installation ystem to identify free blocks ? What schemes are used by the system
to manage these free blocks ?
• What are the mechanisms used by a file system to check for consistency ?
13.14 Activity :
• Discuss in pair cons and pros of the three file storage allocation methods
on hard disks
• Discuss about bitmap and linked list free space management
implementations.
128
File System
BLOCK SUMMARY :
Implementation
• One of the fundamental and complex tasks of an operating system is
memory management.
• Memory management involves treating main memory as a resource to
be allocated to and shared among a number of active processes.
• To use the processor and the I/O facilities efficiently, it is desirable to
maintain as many processes in main memory as possible.
• It is desirable to free programmers from size restrictions in program
development.
• The basic tools of memory management are paging and segmentation.
• With paging, each process is divided into relatively small, fixed–size
pages.
• Segmentation provides for the use of pieces of varying size. It is also
possible to combine segmentation and paging in a single memory
management scheme.
• To use the processor and the I/O facilities efficiently, it is desirable to
maintain as many processes in main memory as possible.
• With virtual memory, all address references are logical references that
are translated at run time to real addresses.
• A virtual memory management scheme requires both hardware and software
support. The hardware support is provided by the processor. The support
includes dynamic translation of virtual addresses to physical addresses
and the generation
• I/O devices are interfaces that communicate users with the computer
system. To manage the communication effectively, the operating system
uses the I/O subsystem, which has a complete layer of hardware and
software.
• The aspect of I/O that has the greatest impact on overall system performance
is disk I/O. Two of the most widely used approaches to improve disk
I/O performance are disk scheduling and the disk cache.
• Disk scheduling minimizes the mechanical seek time of the disk and
hence improves performance.
• A disk cache is a buffer, usually kept in main memory that functions
as a cache of disk blocks between disk memory and the rest of main
memory.
• Because of the principle of locality, the use of a disk cache should
substantially reduce the number of block I/O transfers between main
memory and disk.
• A file is a named collection of related information defined by its creator.
• File is an abstraction used by the kernel to represent and organize the
system's non–volatile storage resources, including hard disks, CD–ROMs,
and optical disks.
• File system is part of the operating system that manages files.
• A file can represent a program or data and can support free form or
rigid form.
129
Operating System and • A file can be structured in byte, record or tree form and can be accessed
Software Installation either sequentially or randomly.
• The operating system uses several types of operations, through system
calls, to interact with files.
• Each device in a file system keeps a volume table of contents or a device
directory listing the location of the files on the device. In addition, it
is useful to create directories to allow files to be organized.
• Three directory organization methods are available : a single–level, two–
level and tree directory organization. A tree–structured directory allows
a user to create subdirectories to organize files.
• Disk space is allocated to files through three different mechanisms.
Directory–management routines must consider efficiency, performance,
and reliability of file systems.
BLOCK ASSIGNMENT :
Short Questions
• What is paging ?
• What is segmentation ?
• What is virtual memory ?
• What is page fault ?
• When does internal fragmentation occurs ?
• What is file ?
• What is a directory ?
• What is a path ?
• What is backup ?
• How can a disk access time be enhanced ?
• What are the three space allocation methods used in file systems ?
• What is the difference between file and block ?
• What are the two file access methods ?
• Do we need a rewind operation in a random access files ?
• What are the open() and close() operations used for ?
• What is caching ?
Long Question
• Discuss the methods of backup
• What are the main issues need to be addressed in disk space management
in relation to file systems ?
• Explain the difference between single versus multiple partition allocation.
• Explain advantages of FAT linked list file allocation
• What are the problems with contiguous file allocation ?
• What are the merits and demerits of the two file sharing methods ?
130
Dr. Babasaheb Ambedkar BCAR-102/
Open University Ahmedabad DCAR-102
UNIT 14 SECURITY
UNIT 15 PROTECTION
Block Objectives :
Block Structure :
Unit 14 : Security
Unit 15 : Protection
14.1 Introduction :
Operating system consists of a collection of objects, hardware or software
each object with unique name that can be accessed through a well–defined
set of operations. The system should allow different users to access different
resources in a shared way and needs to ensure that each object is accessed
correctly and only by those processes that are allowed to do so. Security ensures
the authentication of system users to protect the integrity of the information
stored in the system as well as the physical resources of the computer system.
The security system prevents unauthorized access, malicious destruction or
alteration of data, and accidental introduction of inconsistency. Protection
mechanisms control access to a system by limiting the types of file access
permitted to users. In addition, protection must ensure that only processes that
have gained proper authorization from the operating system can operate on
memory segments, the CPU, and other resources. This unit covers in detail
the concept of operating system security and the mechanisms used to provide
system protection. We start our discussion with different security threats,
intruders, and accidental data losses followed by the protection mechanisms
enforced to overcome the threats.
131
Operating System and 14.2 Key Terms :
Software Installation
Computer Security The prevention and protection of computer resources
from unauthorized access, use, alteration, degradation,
destruction, or any other problem.
Threat Any person, act, or object that poses a danger to computer
security
Intruder Any person, or program that carries out malicious or
unauthorized activity on a system and jeopardize its
safety.
Attack A threat that is carried out or practiced on a system
Cryptography A means used to hide communicated message content
Authentication is a process of validating one's identity
Access Control A method used to define users' interaction with a system
"The most secure computers are those not connected to the Internet and
shielded from any interference". However, in this era of information access
anywhere and at anytime, the phrase violates the goal of communication and
resource sharing.
• How can one then keep itself connected and accessible anytime without
jeopardizing its safety ?
• What are the concerns out there that threatens one's safety ?
• What measures one shall take to overcome these threats ?
This unit builds your knowledge about computer security concerns as
well as counter measures set by the operating system to be taken to secure
all resources of a system.
Current computer systems are designed in a way to handle several
resources and bulk information. As the number of information stored grows,
the need to protect this information from damage, unwelcomed change as well
as unintended access is becoming more important. However, several sophisticated
systems are also being used that opposes the objectives of the operating system
and abuse the information causing catastrophic disasters. Security refers to the
wellbeing of a system that requires adequate protection as well as consideration
of the external environment within which the system operates. A secured system
is thus, a system whose resources are used as intended in all situations. There
are several situations which violates security and causes a system unsecured.
These problems can be external or internal to the system as well as
intended or unintended problems. Computer Security aims in safeguarding a
system from external threats as well as from legitimate users as possible
attackers.
133
Operating System and • Natural phenomenon : accidents like earthquake, fire, flood, etc
Software Installation
• Hardware or software error : malfunctioning of CPU, disk
failures, errors in programs
• User errors : invalid input, wrong program run, disk loss, wrong
disk mount, etc
These problems can be solved mostly by taking appropriate and adequate
backups on devices furthest away from the original location
134
Outside attacks : File System
Implementation
These are attacks caused by an attacker from outside the system mostly
on a network. These attacks make use of codes being transmitted to the targeted
machine over a network and runs on it causing the damage. These codes are
known as malicious codes and can be virus, worm, mobile codes or java applets.
Virus : is a program that replicates by attaching itself with another
program and causing harm on the system's resources. The attacker distributes
the virus by attaching it to an appealing program which is more likely to be
used by users (games, free apps, etc). After penetrating the system, the virus
remains inactive until the victim application is launched and when initiated,
it starts affecting other programs in the system. Viruses are of different types
according to their properties and activation methods.
Preventing a virus from penetrating into a system on the first place is
an ideal solution to protect a system from a virus attack which is very difficult,
if not impossible, to do in the era of networked systems. So the best alternative
towards virus protection could be detection of a virus, once detected identifying
its type, and then trying to remove it from the system or if not possible remove
the entire affected program. These are carried out by special programs known
as antivirus. Antivirus programs are used to protect system from being affected
by viruses. An antivirus maintains definition for viruses which is the pure code
of the viruses. The antivirus then inspects executable files of the system to
check if there is a code matching the virus definition. If so, they try to fix
the program by removing the virus code.
Worm : is similar with virus except it has self–replicating capability
and does not need to attach itself on other programs to duplicate itself. It is
an independent program that spreads via network connections, typically using
either email, remote execution, or remote login to deliver or execute a copy
of itself to or on another system, as well as causing local damage.
14.8 Assignments :
1. What is a threat ?
2. What differences are there between active and passive intruder ?
3. What makes intruders attack a system ?
4. What are the three goals of computer security ?
14.9 Activity :
• Install any anti–virus program on your computer / laptop
136
Unit
15 PROTECTION
UNIT STRUCTURE
15.0 Learning Objectives
15.1 Introduction
15.2 Design Principles of an Operating System
15.3 Cryptography
15.4 User Authentication
15.5 Access Control
15.6 Check Your Progress
15.7 Let Us Sum Up
15.8 Check Your Progress : Possible Answers
15.9 Assignment
15.10 Activity
15.11 Case Study
15.12 Further Reading
15.1 Introduction :
An operating system has the responsibility of enforcing security protection
mechanisms to avoid possible threats from happening and protect the various
objects of a system. Each asset of a system has a name and well defined set
of operations through which access is made. In this section various
countermeasures set by a computer system to protect itself against security
attacks will be addressed along with the design principles that shall be followed
to strengthen system's security.
15.3 Cryptography :
Have you ever heard the word cryptography ? Or have you seen people
communicating with some weird, non–understandable writings where only the
communicating parties know what it means ? Well, in this activity we are going
to address what a cryptography is, which is one of the most popular security
provision ways, its importance in a system security and discuss some cryptography
schemes .Encryption is a process of transforming a message with an intention
of hiding its meaning to avoid possible attacks on it. Encryption is achieved
with a concept of cryptography which converts the original data, termed as
plaintext, into a secreted one, termed as cyphertext that can only be understood
and reverted back by legitimate users of the data. Reverting back from the
cyphertext to the plaintext is known as decryption.
Most of the time encryption and decryption works well through security
by obscurity where the encryption and decryption algorithms are made publicly
accessible while the contents are private.
Encryption and decryption algorithms have a secret parameter known as
key which needs to be known by only authorized users and with which the
text is converted to its required representation (cypher for encryption and plain
for decryption). An encryption function can be stated as C= E(P, Ke) where
a cyphertext C is obtained by an encryption algorithm E implemented on a
plaintext P with an encryption key Ke.
The decryption can also be stated as P=D(C,Kd),where a plaintext P of
an encrypted data is obtained by implementing a general decryption algorithm
D with the cyphertext C and decryption key Kd as parameters.
Figure 1 shows the pictorial representations for encryption and decryption
139
Operating System and • Based on what the user has : Physical key, ticket, passport, token, smart
Software Installation card
• Based on something about the user's context : including Location, time
• Each of these principles set their own authentication requirement with
different security properties and different level of complexity. Let's see
each of the principles separately.
• Authentication with what the user knows (password)
This is probably the oldest authentication mechanism which is also most
widely used by computer systems. This method requires the user to type
username and password on login. For each user, system stores (ID, F(password)),
where F is some transformation (e.g., one–way hash) in a password file. When
user enters the password, system computes F(password) to check if a match
can be found to proof the identity. It is needless to say that a password should
not be visible when typed but different systems use different techniques towards
this. For example windows operating system displays boxes in place of each
character in a password while LINUX displays nothing at all when the user
types the password. Passwords have an advantage in their ease to modify when
compromised but comes also with several vulnerabilities to attack. Inherent
vulnerabilities are : easy to guess or snoop and no control on sharing. Practical
vulnerabilities are visible if unencrypted in distributed and network environment,
and susceptible for replay attacks if encrypted naively. Due to these vulnerabilities,
an intruder can attempt to attack one's password. The possible attacks on
password are :
Guessing attack/dictionary attack
This is a trial and error attack which exploits human nature to use easy
to remember passwords. An intruder needs to know the user or have some
information about the user to perform a guessing attack. Alternatively, the
attacker can do Brute–force attack by trying all possible passwords using
exhaustive search until it matches. Mostly, passwords of shorter length are
susceptible to this brute force attack as it is possible to get the password after
a limited number of guessing attempts.
But it becomes less successful on systems that allow longer passwords
that include both uppercase and lowercase letters, numbers and all punctuation
characters
Social Engineering
Attacker asks for password by masquerading as somebody else (not
necessarily an authenticated user)
Sniffing
Anyone with access to a network on which a computer resides can
seamlessly add a network monitor, allowing him/her to watch all data being
transferred on the network including user IDs and passwords. Encrypting the
data stream containing the password solves this problem. Even such a system
could have passwords stolen, however.
Trojan login
Trojan horses are seemingly normal programs but with hidden activities
of accessing system's resources by making the user believe it is safe. Trojan
140
login is thus, tricking someone into executing a program that does nasty things. Protection
This Trojan program installed on the system captures every keystroke before
sending it on to the application.
Improving password security
Different techniques can be used to enhance the security of a password
by eliminating the security threats and reducing its vulnerability. Some of the
techniques used are :
• Educate users to make better choices. Better passwords can be constructed
by combining letters (upper and lower), digits and symbols and making
them lengthy, using non–guessable phrases, etc.
• Define rules for good password selection and ask users to follow them
• Ask or force users to change their password periodically such as one
time password which forces users to change their password after each
login.
• Actively attempt to break user's passwords and force users to change
broken ones.
• Screen password choices. Some systems use a password screening module
that forbids users from using a vulnerable password.
Challenge–response authentication can be combined with password
authentication where the user provides list of questions with answers at password
creation and the system picks any of the questions randomly and asks the user
during every login attempt to make sure the password is used by the owner.
The problem with this method is, however, several question–answer pair might
be required. Challenge–response is a variant of this method which lets the user
pick an algorithm during registration. The system then gives an argument for
the user on an attempt to login where the user returns the computed result
of the argument by applying the algorithm picked during registration.
The challenge response authentication method, in general, asks the user
questions no one else would know the answer to except the user.
Authentication with what the user is (biometrics)
This authentication method uses physical properties of a user known as
biometrics. These physical properties include Fingerprint, Retina scan, Voice
pattern, Signature, Typing style. A biometric system is constructed from enrollment
and identification part. The enrollment part measures and collects the physical
property and associates it with a user defined username. The identification part
starts its work when the user attempts to login. It requests the user to provide
his/her username and makes the physical property measurement again which
will be checked against the previously collected value by the enrollment. If
they match, the system allows the user to get access.
Biometrics have been hailed as a way to get rid of the problems with
password and token–based authentication though, they have their own problems
mostly cost and rare false readings. The physical property considered should
be picked in such a way that clear distinguishing can be made between two
people and also is durable characteristics that does not change much over time.
For instance using a hair color as a physical characteristic would not be
appropriate as more than one person can have same color. Moreover, voices,
141
Operating System and face figures and the like that varies greatly based on current situations, are
Software Installation also not good candidates to be measured and recorded for a physical property
of a person.
Authentication with what the user has
This is an authentication method that uses a physical object possessed
by a user such as ATM cards, smartcards, etc… inserted into a reader machine
which is capable of reading the content associated with the object from the
central database kept with the object provider. This method uses a personal
identification number (PIN) or even a biometrics data associated with each
object owned by a user to authenticate the user. Smart cards are the most
commonly used physical objects recently. Smart cards are portable devices with
a CPU, I/O ports, and some nonvolatile memory.
Authentication with the user's context
This is also referred as next generation's authentication method. It works
by analyzing various user's attributes such as logon attributes based on a set
of configurable parameters including geographical location, IP address, time
of day, and device recognition and generates a context assurance level. These
context assurance level requires a different level of authentication as administrators
optimize security for any given logon instance. Context based authentication
allows the system to create rules that determine, pre–authentication, whether
and how a given authentication process should proceed based on context.
142
Access Control Matrix Protection
143
Operating System and Each user is given a number of capability tickets specifying authorized
Software Installation objects and operations for a user. This implementation stores the access matrix
by rows where each subject Si is the label of the access matrix rows. a list
of pairs <Oj, A[Si, Oj]>is associated for each object Oj, such that A[Si, Oj]
is not empty. Each row of the CL stores objects and the access rights granted
to the subject on the objects. CL is also advantageous due to the easy
maintenance and simple query system which is subject based. However, the
computational difficulty of identifying the set of subjects that have access right
on a given object exists here as well. Moreover, revoking access rights of
subjects for removed objects and granting rights to a newly created objects
is time taking.
Mandatory Access Control model, MAC
Regulates the flow of information by assuring information goes only to
the intended recipients. When a system mechanism controls access to an object
and an individual user cannot alter that access, the control is a mandatory access
control (MAC). This model is important for environments requiring a much
tighter security such as military system. Once access is defined, altering is not
possible by users. The Bell–La–Padula model and the Bela model are access
control models based on MAC.
Role–Based Access Control model, RBAC
Access rights are defined based on the role(s) a user has instead of his/
her identity (DAC) or his/her clearance (MAC). Access rights are defined based
on subject roles. It is used to express organizational policies with delegation
of authority and separation of duties.
144
in unsecured system. Three general security threats are there violating the Protection
objectives of system security. The threats are initiated by intruders which can
be either active or passive and attack a system through different attacking
mechanisms. Attacks happen from inside a system or outside a system and can
be intentional or non–intentional. System uses different mechanisms to protect
itself from such attacks including cryptography, user authentications, access
control mechanisms, other software methods like use of anti–viruses, etc.
Cryptography, one security enforcement mechanism, is a science and art
of communication where by the communicated information is changed to a
code that can't be perceived by any other entity except the communicating ones.
It encompasses both encryption and decryption schemes to achieve its objective.
Encryption is a way used to hide information communicated between
a sender and a receiver with an objective of avoiding any harm on the
information by illegitimate users. A message encrypted on the sender side needs
to be reverted back to its original representation with a process of decryption.
Cryptography can be symmetric or Asymmetric depending on the code
generation method it utilizes.
Authentication is another important aspect of a system used to assure
security of resources. It is a means by which object's identity is validated for
further service provision. The system uses several methods of authentication
such as authentication by password, question–response, biometrics, physical
objects, etc. The third major security provision method is an access control
mechanism through which every object of a system is associated with each
subject including people in terms of allowable operations. Access control is
thus a way of managing permissions to subjects on objects. Three access control
model are DAC, MAC, and RBAC each with their own implementation.
15.9 Assignment :
• How does the public key cryptography overcome the limitations of private
key cryptography ?
• How does a receiver convert a cyphertext to a plain text in a substitution
private key cryptography ?
• What are the possible vulnerabilities of a password ?
15.10 Activity :
• Implement one of the private key cryptography algorithms using your
preferred programming language
• Implement the RSA cryptography algorithm using your preferred
programming language
145
Operating System and 15.12 Further Reading :
Software Installation
1. Andrew S. Tanenbaum (2008). Modern Operating System. 3rd Edition.
Prentice–Hall.
2. Optional readings and other resources for Encryption, User Authentication,
and types of attacks :
3. William Stallings (2005). Cryptography and Network Security
4. Principles and Practices, 4th edition. Prentice Hall
5. A Silberschatz, Peter B Galvin, G Gagne (2009). Operating
6. System Concepts. 8th Edition. Wiley.
146
OPERATING SYSTEM AND
Unit
16 APPLICATION SOFTWARE
INSTALLATION
UNIT STRUCTURE
16.0 Learning Objectives
16.1 Introduction
16.2 Windows 10 Upgrade Installation
16.3 Windows 10 Clean installation
16.4 Windows 10 Out–Of–Box Experience
16.5 Install Microsoft 365 or Office 2019 on a PC
16.6 Check Your Progress
16.7 Let Us Sum Up
16.8 Check Your Progress : Possible Answers
16.9 Assignment
16.10 Activity
16.11 Case Study
16.12 Further Reading
16.1 Introduction :
There are two approaches you can take for installing Windows 10 in
your computer :
Upgrade or a Clean Install. Typically, when you upgrade your operating
system, all your documents and applications remain unchanged, whereas with
a clean install you would be starting again from scratch. But one important
thing to verify first is if your computer has the necessary hardware to support
Windows 10.
147
Operating System and • At least 16GB space in your hard disk.
Software Installation
• One important thing to note is that the installation process itself will
verify your computer hardware and let you know if it qualifies for a
Windows 10 upgrade or not. If not, then you will need to upgrade your
hardware.
If your computer qualifies for a Windows 10 upgrade, then you can follow
these steps to perform the upgrade :
Step 1 : Look for the Windows 10 notification in the lower–right corner
of the screen.
This is a one–year–only offer that Microsoft is extending to valid users
of Windows 7 and Windows 8.1.
The download required for the upgrade is quite large, so make sure you
have a stable Internet connection and continuous power for your computer to
avoid interruptions during the process.
Step 3 : After the download is complete, it will prompt you to accept
Microsoft's license terms.
148
Operating System and
Application Software
Installation
Step 4 : After agreeing to the terms, it will ask if you want to install
the upgrade at that moment or schedule it for later.
149
Operating System and During this time, your computer will reboot a couple of times, so don't
Software Installation worry. The process itself will take you through the steps to complete the
upgrade.
Step 6 : As the upgrade approaches its end, it will ask you to configure
some basic Windows settings. You can choose to use Express settings, which
will apply the most common or recommended settings, or you can choose to
customize the settings as you please.
Step 7 : After the upgrade finishes, you'll see the Windows welcome
screen.
151
Operating System and Step–8 : Click the Install now button.
Software Installation
152
Operating System and
Application Software
Installation
Step–15 : Select the partition with the current installation (usually "Drive
0"), and click the Delete button.
Step–16 : Click the Yes button.
153
Operating System and Step–18 : Click the Next button.
Software Installation
Once you complete the steps, the setup will install a fresh copy of
Windows 10 on the computer.
155
Operating System and 9. Click the Next button.
Software Installation
10. Confirm your account password.
156
13. Create a four–digit Windows Hello PIN. Operating System and
Application Software
Installation
157
Operating System and 16. Optinally You can provide your phone number and then click the Send
Software Installation button to link your phone with your device. If you want to skip this
option, click the Do it later button in the screen's bottom–left corner.
158
18. Click the No, thanks button to skip the Microsoft 365 setup (as necessary). Operating System and
Application Software
Installation
159
Operating System and 16.5 Install Microsoft 365 or Office 2019 on a PC :
Software Installation
Before you begin, make sure your PC or Mac meets the system
requirements.
If this is the first time you're installing Office you may have some setup
steps to do first. Expand the learn more section below.
However, if you're reinstalling Office, you've already redeemed a product
key, or you're installing Office at work or school and know you have a license,
then go to the section sign in and install Office on your PC or Mac.
Sign in to download Office
1. Go to www.office.com and if you're not already signed in, select Sign
in.
2. Sign in with the account you associated with this version of Office. This
account can be a Microsoft account, or work or school account. I forgot
the account I use with Office
3. After signing in, follow the steps that match the type of account you
signed in with.
If You signed in with a Microsoft account
(a) From the Office home page select Install Office.
160
(b) Select Office 365 apps to begin the installation. Operating System and
Application Software
The 64–bit version is installed by default unless Office detects you
Installation
already have a 32–bit version of Office (or a stand–alone Office app such as
Project or Visio) installed. In this case, the 32–bit version of Office will be
installed instead.
To change from a 32–bit version to a 64–bit version or vice versa, you
need to uninstall Office first (including any stand–alone Office apps you have
such as Project of Visio). Once the uninstall is complete, sign in again to
www.office.com and select Other install options, choose the language and
version you want (64 or 32–bit), and then select Install
4. This completes the download of Office to your device. To complete the
installation, follow the prompts in the "Install Office" section below.
Install Office
1. Depending on your browser, select Run (in Edge or Internet Explorer),
Setup (in Chrome), or Save File (in Firefox).
If you see the User Account Control prompt that says, Do you want
to allow this app to make changes to your device ? select Yes.
The install begins.
2. Your install is finished when you see the phrase, "You're all set! Office
is installed now" and an animation plays to show you where to find
Office applications on your computer. Select Close.
161
Operating System and Activate Office
Software Installation
1. To open an Office app, select the Start button (lower–left corner of your
screen) and type the name of an Office app, like Word.
2. To open the Office app, select its icon in the search results.
3. When the Office app opens, accept the license agreement. Office is
activated and ready to use.
16.9 Assignment :
• Explain step by step process to install Microsoft Windows 10
• Differentiate between upgrade and clean installation of Windows 10
• Explain Activation process for Microsoft Office 365
16.10 Activity :
• Create installation media for Window 10
162
Operating System and
BLOCK SUMMARY :
Application Software
• Security in a computer system focuses on safeguarding the resources Installation
(hardware, software) of a system.
• A system's resource may be damaged by intention or non–intentionally.
• Intruders are people and programs that intentionally damage a system
• Accidental data loss is non–intended system damage as a result of
different phenomenon like natural accidents, hardware/software errors
etc…
• Trojan horse, logic bomb, spoofing, buffer overflow are some of the
attacks from inside the system
• Viruses, worms, mobile codes are attacks from outside the system.
• The system should allow different users to access different resources in
a shared way and needs to ensure that each object is accessed correctly
and only by those processes that are allowed to do so.
• A computer system needs to protect itself from unwanted and illegitimate
access.
• A threat is a potential situation that results in unsecured system.
• The threats are initiated by intruders which can be either active or passive
and attack a system through different attacking mechanisms.
• Attacks happen from inside a system or outside a system and can be
intentional or non–intentional.
• System uses different mechanisms to protect itself from such attacks
including cryptography, user authentications, access control mechanisms
and other software methods like use of anti–viruses, etc.
• Cryptography is a science and art of communication where by the
communicated information is changed to a code that can't be perceived
by any other entity except the communicating ones. It encompasses both
encryption and decryption schemes to achieve its objective.
• Encryption is a way used to hide information communicated between
a sender and a receiver with an objective of avoiding any harm on the
information by illegitimate users.
• A message encrypted on the sender side needs to be reverted back to
its original representation with a process of decryption.
• Cryptography can be symmetric or Asymmetric depending on the code
generation method it utilizes.
• Authentication is a means by which object's identity is validated for
further service provision.
• The system uses several methods of authentication such as authentication
by password, question–response, biometrics, physical objects, etc.
• Access control is a way of managing permissions to subjects on objects.
• Three access control model are DAC, MAC, and RBAC each with their
own implementation.
163
Operating System and BLOCK ASSIGNMENT :
Software Installation
Short Questions
• What is security ?
• What is cryptography ?
• What is authentication ?
• How does the buffer overflow attack a system ?
• What are the solution to protect system from worm attacks ?
• What is a Trojan horse ?
• What is inside attack ?
• What is outside attack ?
• What is the difference between inside attack and outside attack ?
• How does a challenge response authentication works ?
Long Question
• How does Trojan Horse operate and attack a system ?
• Explain two implementations of MAC model
• Explain the difference among a virus and worm
• What advantages does the RBAC has over the other access control
methods ?
• Differentiate DAC model from the MAC model
• Explain the differences between inside attack and outside attack
164
DR.BABASAHEB AMBEDKAR
OPEN UNIVERSITY
'Jyotirmay' Parisar,
Sarkhej-Gandhinagar Highway, Chharodi, Ahmedabad-382 481.
Website : www.baou.edu.in