KEMBAR78
OS Unit 4 | PDF | Computer File | Computer Data Storage
0% found this document useful (0 votes)
17 views74 pages

OS Unit 4

The document provides an overview of disk management, detailing the components of file systems, disk structures, data organization, and formatting processes. It discusses disk scheduling algorithms, protection mechanisms, and user authentication methods, including passwords and biometrics. Additionally, it highlights the importance of managing bad blocks and swap space in operating systems.

Uploaded by

shreyassupe346
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views74 pages

OS Unit 4

The document provides an overview of disk management, detailing the components of file systems, disk structures, data organization, and formatting processes. It discusses disk scheduling algorithms, protection mechanisms, and user authentication methods, including passwords and biometrics. Additionally, it highlights the importance of managing bad blocks and swap space in operating systems.

Uploaded by

shreyassupe346
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 74

Disk Management

File system and its parts


A file system can be viewed logically as consisting of three
parts:

• File system interface: The end user and programmer


interface to the file system.

• File system implementation: The internal data structures


and algorithms used by the operating system to implement
this interface.

• Storage structures: The lowest level of the file system the


secondary storage and tertiary storage(removable disk like
magnetic tape, floppy disks, CD ROM) structures.
Disk Structure
• The classical secondary storage of a computer system is
provided interms of Magnetic Tape(cartridge),Magnetic Drum
and Magnetic Hard Disk.

• The portable secondary storage devices include floppy disk,


disk drive, zip drive, optical disks like CD-ROM,CD-RW,DVD’S ,
pen drive etc.

• Magnetic disks were bulky and obsolete.

• Magnetic tapes were used as an early storage medium for


mass secondary storage.
• The access time of magnetic tapes were much slower than
Magnetic disks.
• With advent in magnetic disk and optical disk technology,
magnetic tapes are currently being used as backup storage
medium for storing voluminous data for large system.
• Magnetic hard disks are metal platters coated on both sides with a
thin film of magnetic material.

• The data is recorded on the surface of a disk as it rotates about its


axis in a circular fashion.

• The disk plates storage surface is logically organized into circular


tracks and a set of tracks on all surfaces of a disk pack into
cylinder.

• Individual tracks are divided into sectors. These individual sectors


represent smallest unit of transfer. Typically 512 bytes are stored
on each sector.

• A disk is a circular platter constructed of nonmagnetic material,


called the substrate, coated with a magnetizable material.

• The substrate has been an aluminium material.


Data Organization and Formatting

• Concentric set of rings, called tracks each track has the


same width as the head.

• There are thousands of tracks per surface.


- Gaps between tracks
- Reduce gap to increase capacity
- Same number of bits per track (variable packing density)

• Tracks divided into sectors


• Minimum block size is one sector
• May have more than one sector per block (track)
Organizing Tracks by Block
• Disk tracks may be divided into user-defined
blocks rather than into sectors.
• Blocks can be fixed or variable length.
• A block is usually organized to hold an integral
number of logical records.
• Blocking Factor number of records stored in a
block.
• No internal fragmentation, no record spanning two
blocks.
• In block-addressing scheme each block of data is
accompanied by one or more subblocks containing
extra information about the block.
Disk Formatting
It is the process of preparing a data storage
device. There are three parts in formatting

• Low level formatting


• Disk partitioning
• High level formatting
Low level formatting
• It involves sub dividing each track on the disk
into one or more blocks which would contain
the user data and associated control
information.
• Low level formatting is generally done by an
OS system utility.
Disk Partitioning
• Partitioning is the process of writing
information into blocks of a storage device or
medium that allows access by an OS.

High Level Formatting


• It is the process of setting up an empty file
system on a disk partition.
• It creates an initial file system data structures
onto the hard disk.
Boot block
• Basically for a computer to start running to get an
instance when it is powered up or rebooted it
need to have an initial program to run. And this
initial program which is known as bootstrap needs
to be simple.

• It must initialize all aspects of the system, from


CPU registers to device controllers and the
contents of the main memory, and then starts the
operating system.
• To do this job the bootstrap program basically
finds the operating system kernel on disk and
then loads the kernel into memory and after this,
it jumps to the initial address to begin the
operating-system execution.

• The full bootstrap program is stored in the boot


blocks at a fixed location on the disk.

• A disk that has a boot partition is called a boot


disk.
Bad block
• Bad Block is an area of storing media that is no longer
reliable for the storage of data because it is completely
damaged or corrupted.

• We know disk have moving parts and have small tolerances,


they are prone to failure.

• In case when the failure is complete, then the disk needs to


be replaced and its contents restored from backup media to
the new disk.

• More frequently, one or more sectors become defective.


More disks even come from the factory named Bad blocks.
This is also referred to as Bad Sector.
Replacement of bad blocks will be performed in one of the
following two ways:
• Sector Sparing or Sector Forwarding
• Sector Slipping

Sector Sparing or Sector Forwarding:

It suggest that bad blocks must be replaced with good number


of random spare sectors, so the disk controller is programmed
to replace bad blocks logically with the corresponding spare
sectors(that are in good condition).

Sector Slipping:

It suggest the replacement of bad blocks by slipping(or sliding


a head) them into the subsequent good sectors that
immediately follows bad blocks.
Swap Space Management
• Swapping is a memory management technique
used in multi-programming to increase the
number of processes sharing the CPU.

• It is a technique of removing a process from the


main memory and storing it into secondary
memory, and then bringing it back into the main
memory for continued execution
• This action of moving a process out from main
memory to secondary memory is called Swap
Out and the action of moving a process out
from secondary memory to main memory is
called Swap In.

• Swap-space management is a technique used


by operating systems to optimize memory
usage and improve system performance.
Disk Scheduling Algorithms in OS
Seek Time- Seek time is the time taken in locating the disk arm to a
specified track where the read/write request will be satisfied.

Rotational Latency - It is the time taken by the desired sector to


rotate itself to the position from where it can access the R/W heads.

Transfer Time - It is the time taken to transfer the data.

Disk Access Time - Disk access time is given as,


Disk Access Time = Rotational Latency + Seek Time + Transfer Time

Disk Response Time - It is the average of time spent by each request


waiting for the IO operation.

Purpose of Disk Scheduling


The main purpose of disk scheduling algorithm is to select a disk
request from the queue of IO requests and decide the schedule when
this request will be processed.
Disk Scheduling Algorithms

• FCFS scheduling algorithm


• SSTF (shortest seek time first) algorithm
• SCAN scheduling
• C-SCAN scheduling
FCFS Scheduling Algorithm

• It is the simplest Disk Scheduling algorithm. It


services the IO requests in the order in which they
arrive. There is no starvation in this algorithm,
every request is serviced.

Disadvantages
• The scheme does not optimize the seek time.
• The request may come from different processes
therefore there is the possibility of inappropriate
movement of the head.
Example

Consider the following disk request sequence for a disk with


100 tracks 45, 21, 67, 90, 4, 50, 89, 52, 61, 87, 25
Head pointer starting at 50 and moving in left direction. Find
the number of head movements in cylinders using FCFS
scheduling.
SSTF Scheduling Algorithm
Shortest seek time first (SSTF) algorithm selects the disk I/O
request which requires the least disk arm movement from its
current position regardless of the direction. It reduces the
total seek time as compared to FCFS.

It allows the head to move to the closest track in the service


queue.

Disadvantages
It may cause starvation for some requests.
Switching direction on the frequent basis slows the working of
algorithm.
It is not the most optimal algorithm.
SCAN and C-SCAN algorithm
Scan Algorithm
It is also called as Elevator Algorithm. In this algorithm, the disk
arm moves into a particular direction till the end, satisfying all the
requests coming in its path, and then it turns back and moves in
the reverse direction satisfying requests coming in its path.

It works in the way an elevator works, elevator moves in a


direction completely till the last floor of that direction and then
turns back.

C-SCAN algorithm
In C-SCAN algorithm, the arm of the disk moves in a particular
direction servicing requests until it reaches the last cylinder, then
it jumps to the last cylinder of the opposite direction without
servicing any request then it turns back and start moving in that
direction servicing the remaining requests.
PROTECTION
Protection
The term Protection can be defined as a mechanism
that provides controlled access to the valuable
hardware and software resources of a computer
system.

Once the protection mechanisms are enforced,


those users, programs or processes having an
authentication and/or authorization from the
operating system can only be allowed to access
system resources such as CPU, memory, segments,
files, system programs etc.
Goals of Protection
• To prohibit any intentional or unintentional violation of access rights by
users(to avoid misuse).

• To verify the validity of resource usages by active program segments


on par with access restrictions as per stated protection policies.

• To distinguish between authorized and unauthorized usage and users


as well.

• To increase reliability of the system by detecting hidden errors at the


interfaces of component subsystems.

• To establish a mechanism for the enforcement of policies that govern


the resource usage.

• To allow users to define and/or manage in their own ways as to protect


their own data files and program codes.
• To extend flexibility in employing a variety of
protection policies as protection policies may vary
with different applications and designs of the
operating systems.

• To develop it as a tool for application programmer


that could protect the misuse of resources generated
by application subsystems.

• To conclude, there is need to differentiate between


policy and mechanism of protection, protection policy
emphasize as “what will be done for ensuring
protection?” where as protection mechanism suggest
as to “how it will be done?”
Domain defines collection of access rights each of which is an
ordered pair<object-name, right-set> <Result,read>

Static protection domain


A process during its lifetime, may be either bound or fixed to a
specific protection domain(static).

Dynamic protection domain


The process may be allowed to switch from one domain to
another(dynamic).

How to implement domain of protection?


A domain can be implemented as
• user domain
• Process domain
• Procedure domain and so on.
User Domain

It suggests that the identity of the users would


decide the set of objects and associated access
rights of that domain.

Accordingly, domain switching(changeover) occurs


whenever user is changed.

A user is said to be changed when one user logs out


of the system and new user logs in with different
user – id and passwords.
Process domain

Each process can be defined to be a domain.


It suggests that the identity of a process would
determine the set of objects and associated access rights
of that domain.

Procedure domain

Each procedure can be defined to be a domain. It


suggests that the local variables defined in procedure
would decide the set of objects to be accessed.

Here, domain switching is said to occur when a process is


invoked.
Authentication
• When a computer user wants to log into a computer
system, the operating system (OS) installed on that
computer system generally wants to determine or check
who the user is. This process is called "user authentication.“

• It is sometimes critical to authenticate the user because the


computer system may contain sensitive information about
the owner.

The user can be authenticated in one of the following ways:


a. User authentication using a password
b. User authentication using a physical object
c. User authentication using biometrics
Authentication

A major security problem for an operating system is


to identify or guarantee the identity of an user that
he or she is a legitimate user of this system?

This process of checking or certifying whether a


user’s identity is authentic refers to authentication.

It also extends to verifying the identity of the user


programs and processes that are currently being
executed in the system.
How does an OS can authenticate?

Authentication in general, is based on one or more of the


following items:

1. A user’s possession
For example a key or an identity card or a registration card.

2. A user’s knowledge
For example login-id or user-id coupled with a password or
PIN.

3. A user’s attribute
For example signature, thumb impression or finger print or
eye-retina pattern.
Passwords
Password facility is used to authenticate a user’s
identity.

Whenever is user is already registered his or her


user-id and password for their individual user
accounts on a single user or multi user computer
systems – an attempt to operate and/or access
the computer system will demand user-id
coupled with passwords.
Passwords could be any combination of letters,
digits and sometimes special characters.

The character combination permitted and the width


ie. Maximum number of characters in a password
sequence vary from one OS to another OS.

Typing password sequence while accessing the


system, would never display the same on the
monitor screen instead * or # appear on the
password text box on the screen.
• The login name typed in is looked up in the list, and the typed
password is then compared to the stored password in this
method.

• If both the login and password match, the login is permitted, or


the user has been successfully authenticated and authorized to
log into that system. If no match is found, the login error is
detected.

How to Improve Password Security ?


Here are four basic and widely used password security methods:

1. Passwords should be at least eight characters long.


2. Upper- and lower-case letters should be used in passwords.
3. At least one digit and one special character should be included
in a password.
4. Don't use dictionary words or well-known names like stick,
mouth, sun, Albert, and so on.
One Time Passwords
One time password scheme ensures that the passwords will be
different at each instance or session of accessing the computer
system.

Therefore any attacker or intruder who can get a password


illegally at one session, will not be successful to reuse it at
another session.

This method involve the use of algorithmic passwords.


Accordingly when a session commences, the OS provides a
randomly generated one part of a password pair.

The end user of that current session must key-in the other part
of the password pair.
• If both password correspond each other for a
clear match, the end user is allowed to access
the system.

• Both end user and the OS share a common


function f(x) to generate the respective part of
their password pair called the secret code.

• 2 parameters, secret and seed are input to a


function in order to compute a password.
password=f(secret,seed);
• Seed – it is an alphanumeric sequence or random
number substituted as an authentication challenge
from the OS.

• As both secret and seed are shared between the


computer and the end-user, they should not be
revealed and/or transmitted over a network in order
to safeguard the overall passwords generation
process.

• However, the OS generates a new seed as and when


the end user needs to be authenticated for the next
time or for the subsequent session of accessing the
system.
Biometric Devices for User Authentication

• The third authentication method is biometric user


authentication. This method measures the user's
physical characteristics, which are extremely difficult
to forge. These are known as biometrics.

• User authentication using biometrics—for example, a


fingerprint, voiceprint, or retina scan reader in the
terminal—could verify the user's identity.

The typical biometric system consists of two


components:
• Enrollment
• Identification
Enrolment

• The user's characteristics are measured and digitized during


enrollment in a biometric system. Then, significant features
are extracted and stored in the user's record.

• The record can be kept or stored in a central or main


database, or it can be stored on a smart card that the user
carries with them and inserts into a remote reader, such as
an ATM machine.

Identification
• During identification, the user appears and enters a login
name or ID. Once again, the system performs the
measurement. If the new values match the ones sampled
during enrollment, the login is accepted; otherwise, the
login attempt is rejected.
Program Threats

Operating system's processes and kernel do the designated


task as instructed. If a user program made these process do
malicious tasks, then it is known as Program Threats.

One of the common example of program threat is a program


installed in a computer which can store and send user
credentials via network to some hacker.

Following is the list of some well-known program threats.


• Trojan Horse
• Trap Door
• Logic Bomb
• Virus
• Trojan Horse − Such program traps user login credentials and stores
them to send to malicious user who can later on login to computer
and can access system resources.

• Trap Door − If a program which is designed to work as required,


have a security hole in its code and perform illegal action without
knowledge of user then it is called to have a trap door.

• Logic Bomb − Logic bomb is a situation when a program misbehaves


only when certain conditions met otherwise it works as a genuine
program. It is harder to detect.

• Virus − Virus as name suggest can replicate themselves on computer


system. They are highly dangerous and can modify/delete user files,
crash systems.

• A virus is generally a small code embedded in a program. As user


accesses the program, the virus starts getting embedded in other
files/ programs and can make system unusable for user
System Threats
System threats refers to misuse of system services and
network connections to put user in trouble.
System threats can be used to launch program threats on
a complete network called as program attack.
System threats creates such an environment that
operating system resources/ user files are misused.

Following is the list of some well-known system threats.


• Worm
• Port Scanning
• Denial of Service
• Worm − A Worm process generates its multiple copies
where each copy uses system resources, prevents all
other processes to get required resources. Worms
processes can even shut down an entire network.

• Port Scanning − Port scanning is a mechanism or


means by which a hacker can detects system
vulnerabilities to make an attack on the system.

• Denial of Service − Denial of service attacks normally


prevents user to make legitimate use of the system.
For example, a user may not be able to use internet if
denial of service attacks browser's content settings.
File Management
Introduction
• A file is collection of specific information
stored in the memory of computer system.

• File management is defined as the process of


manipulating files in computer system, the
management includes the process of
creating, modifying and deleting the files.
Functions of File Management
• Support one or more file systems
• Support one or more file structures
• Define and maintain filed and directory structures.
• Perform mapping of logical file address to physical disk
blocks
• Management of Free disk space, allocation and deal
location
• Provide for backing up of files on secondary storage
devices
Operations of Files

• Create a file
• Open a file
• Read a file
• Write a file
• Append a file
• Delete a file
• Seek a file
• Truncate a file
• Rename a file
• Close a file
Create a file

• initially the file must be created without any data in it.


• The create system call declares that the new file,
afresh is going to exist with some(default) attribute
settings.

File creation operation calls for:


• Allocating free space on the disk file system
• Making a directory entry for this new file.

The directory entry contain the name of this new file and
its exact location in the file system.
• Open a file: Most systems enforce the programmer to
open a file explicitly with an open system call well
before using that file.

• The file opening operation enable the system to fetch


the necessary attributes of that file and a list of disk
addresses into main memory for rapid accessing of
that file at later stages.

• Read a file: the read system call reads data from the
given open file at current file position pointer.

• The caller must specify the file pointer or the name of


the file and how much data to be read from that file.
• Write a file : the write system call writes data to the
given file which is opened in write mode.
• The writing takes place again at current write pointer.

• Append a file : the append system call can only add


the supplementary data to the end of the file. For
which caller must open a file in append mode.

• Delete a file : the delete system call erases the


specified file and releases the file space to the disk.
• Whenever the file is no longer needed, file deletion
operation can be carried out and the deleted file name
will disappear from the directory entry.
• Seek a file (position or reposition within a file):
• The seek system call perform file seek operation
and there by repositions the current file pointer
to a specific value(pace) in the file.
• After seek operation data can be read from or
written to the file from this new position.

• Truncate(contents of) a file: whenever the


structure of the file(its attributes both in number
and field size) is to be preserved and only its
contents are to be erased then truncate system
call will do the same.
• Rename a file: the rename system call allow
users to change the name of an existing file.
Its not file copying operation.

• Close a file: the close system call closes the


file and delinks it from both main memory and
disk addresses and internal descriptors/tables.
• Any other system resource associated with a
file can be released by closing that file.
File Types
Directories – are the system files that maintain the structure
of the file system.

Regular files – are either ASCII files or binary files in general.


ASCII files contain lines of text data.

Character special files – emphasize serial input and output and


are used to model serial I/O(character by character) devices
such as video terminals, printers and network I/O.

Block special files – are used to model disks.

We are interested in regular files since they contain user


programs and data.
File Access methods
• Files store information. When it is used this
information must be accessed and read into
computer memory. The information in the file can
be accessed in several ways.

Some access methods for file:

i. Sequential Access
ii. Direct Access
iii. Other Access Methods
Sequential access method
In sequential access the os read the file word by word.

current

Beginning End

R1 R2 R3 R4 R5 R6 R7 R8 R9

Rewind Read/Write

The figure represents file. The current pointer is pointing to the record currently
being accessed . In the sequential access method, the current pointer cannot
directly jump to any. It has to “cross” every record that comes in its path. Suppose
there are nine records in the file from R1 to R9.The current pointer is at record R6.If
we want to access record R8,we have to first access record R6 and record R7.This is
one of the major disadvantages of the sequential access method.
Direct access method
• In the direct access method , the files are considered as a sequence of blocks or records.
current

B1 B2 B3 B4 B5 B6 B7

• In this method we can access any record or block randomly .


• It is also known as Relative Access Method.

• A direct access file allows random blocks to be read or written. Thus the user may
read block B6, then read B4 and write block7. There is no restrictions on the
order of reading or writing for a direct access file.
• This type of access method is used in database management systems.

The Direct Access Method has the following operations:


• Read n:This operation is used to read the nth block . Read 6 would allow us to
read block B6.
• Write n:This operation is used to write in the nth block.
• Goto n:This operation is used to directly access the nth block.
Other access methods
• These methods generally involves the construction of an index for
the file.
• File access through the index.
Last name Logical record number

Adams
Arthur
Asher Smith , John social-security
age

Smith

Relative file
Index file
Directory Operations:

•Create
•Delete
•Change
•Open or list
•Rename
•Search
Create : A directory is created with commands such as mkdir
or md. Initially, it is empty without any files in it.

Delete : a directory can be erased or removed if it is empty.


That is it should not contain any files and other
sub-directories.

Commands such as rmdir or rd can be used to delete an


existing empty sub-directory.

Change : A working directory can be changed or shifted to new


working directory for a different file access and manipulations
in that new directory.

Commands such as chdir or cd can be used to change from


current directory to new directory (sub-directory).
Open or list : The contents of the directories ie. Listing of
files and other sub-directories can be read or displayed.
On the screen with dir.(dir *., dir *.*, dir *.c) or ls
commands.

Rename : like files the directories can be renamed with


new directory names suggested by users.

Search : Directories can be searched for specific file


entries.
For example the command dir *.C in MS-DOS list out all
file entries, in that directory, ending with file extension
.C.
Logical Structure of a directory:
• Single-Level Directory
• Two-Level Directory
• Tree-Structured Directory
• Acyclic Graph Directory

Single-Level Directory:
• Single-Level Directory is the easiest directory structure.
• There is only one directory in a single-level directory, and that directory is
called a root directory.
• In a single-level directory, all the files are present in one directory that
makes it easy to understand.
Two-Level Directory

• Two-Level Directory is another type of directory structure.


• In this, it is possible to create an individual directory for each of the users.
• There is one master node in the two-level directory that include an
individual directory for every user.
Tree-Structured Directory

• A Tree-structured directory is another type of directory structure in which


the directory entry may be a sub-directory or a file.
• We can group the same type of files into one directory.
• In a tree-structured directory, there is an own directory of each user, and
any user is not allowed to enter into the directory of another user.
Acyclic-Graph Directory

• In the tree-structure directory, the same files cannot exist in the multiple
directories, so sharing the files is the main problem in the tree-structure
directory.
• In the acyclic-graph directory, more than one directory can point to a
similar file or subdirectory.
Allocation Methods:

1. Contiguous allocation
2. Linked list allocation
3. Indexed allocation

Contiguous Allocation
Contiguous allocation means we allocate the block in such a manner, so
that in the hard disk, all the blocks get the contiguous physical block.
Linked List Allocation
• In this file allocation method, each file is treated as a linked list of disks
blocks.
• In the linked list allocation method, it is not required that disk blocks
assigned to a specific file are in the contiguous order on the disk.
• The directory entry comprises of a pointer for starting file block and also
for the ending file block.

Indexed Allocation
• In the index allocation method, we have an additional block, and that
block is known as the index block.
• For each file, there is an individual index block.
• In the index block, the ith entry holds the disk address of the ith file
block.
Types of access Control
The different types of operations that can be given a
controlled access are:

• Reading a file
• Writing a file
• Executing a file
• Renaming a file
• Editing a file
• Appending a file
• Copying a file
• Listing file name and its attributes
Free space management

• The system keeps tracks of the free disk blocks for allocating
space to files when they are created.
• Also, to reuse the space released from deleting the files, free
space management becomes crucial.

The free space list can be implemented mainly as:

1. Bitmap or Bit vector


2. Linked Lists
3. Grouping
4. Counting
1. Bitmap or Bit vector
A Bitmap or Bit Vector is series or collection of bits where each bit
corresponds to a disk block. The bit can take two values: 0 and 1: 0 indicates
that the block is allocated and 1 indicates a free block.

2. Linked List
In this approach, the free disk blocks are linked together i.e. a free block
contains a pointer to the next free block. The block number of the very first
disk block is stored at a separate location on disk and is also cached in
memory.
3. Grouping
• In this, there is a modification of the free-list approach which stores the
address of the n free blocks.
• The first n-1 blocks are free but the last block contains the address of the n
blocks.

4. Counting
When there is an entry in the free space list it consists the address of the disk
and a count variable.

This method of free space management is similar to the method of allocating


blocks.
So the operations like lookup, deletion, insertion are efficient.
Recovery

Files and directories are kept both in main memory and on disk, and
care must taken to ensure that system failure does not result in loss
of data or in data inconsistency.

• Magnetic disks sometimes fail, and care must be taken to ensure


that the data lost in such a failure are not lost forever.

• To this end, system programs can be used to back up data from


disk to another storage device, such as a floppy disk, magnetic
tape, optical disk, or other hard disk.

• Recovery from the loss of an individual file, or of an entire disk,


may then be a matter of restoring the data from backup.

• Copy to a backup medium all files from the disk. This is called a full
backup.

You might also like