OS Full Notes
OS Full Notes
An operating system is a piece of software that manages all the resources of a computer
system, both hardware and software, and provides an environment in which the user can
execute his/her programs in a convenient and efficient manner by hiding underlying
complexity of the hardware and acting as a resource manager.
Why OS?
    1. What if there is no OS?
         a. Bulky and complex app. (Hardware interaction code must be in app’s
               code base)
         b. Resource exploitation by 1 App.
         c. No memory protection.
    2. What is an OS made up of?
         a. Collection of system software.
            User
     Application programs
      Operating system
      Computer hardware
The operating system provides the means for proper use of the resources in the operation of
the computer system.
                                      LEC-2: Types of OS
OS goals –
                                         p
   - Batch-processing operating system     [ATLAS, Manchester Univ., late 1950s – early 1960s]
   - Multiprogramming operating system     [THE, Dijkstra, early 1960s]
   - Multitasking operating system         [CTSS, MIT, early 1960s]
   - Multi-processing operating system     [Windows NT]
                                       el
   - Distributed system                    [LOCUS]
   - Real time OS                          [ATCS]
              eH
od
C
Single process OS, only 1 process executes at a time from the ready queue. [Oldest]
Batch-processing OS,
 1. Firstly, user prepares his job using punch cards.
 2. Then, he submits the job to the computer operator.
 3. Operator collects the jobs from different users and sort the jobs into batches with
    similar needs.
 4. Then, operator submits the batches to the processor one by one.
 5. All the jobs of one batch are executed together.
                                          p
                                        el
           eH
Multiprogramming increases CPU utilization by keeping multiple jobs (code and data)
in thememory so that the CPU always has one to execute in case some job gets busy with
I/O.
    - Single CPU                       on changing states it stores before moving to a new state
    - Context switching for processes.
    - Switch happens when current process goes to wait state.
    - CPU idle time reduced.
multiprogramming.
   - Single CPU
   - Able to run more than one task
        simultaneously.
   - Context switching and time sharing used.
   - Increases responsiveness.
   - CPU idle time is further reduced.
                                          p
                                        el
             eH
od
C
                          LEC-3: Multi-Tasking vs Multi-Threading
Program: A Program is an executable file which contains a certain set of instructions written
to complete the specific job or operation on your computer.
    • It’s a compiled code. Ready to be executed.
    • Stored in Disk
Thread:
   • Single sequence stream within a process.
   • An independent path of execution in a process.
   • Light-weight process.
   • Used to achieve parallelism by dividing a process’s tasks which are independent path
                                               p
      of execution.
   • E.g., Multiple tabs in a browser, text editor (When you are typing in an editor, spell-
      checking, formatting of text and saving the text are done concurrently by multiple
                                             el
      threads.)
 Multi-Tasking                                 Multi-Threading
 The execution of more than one task           A process is divided into several different
                    eH
 simultaneously is called as multitasking.     sub-tasks called as threads, which has its
                                               own path of execution. This concept is
                                               called as multithreading.
 Concept of more than 1 processes being        Concept of more than 1 thread. Threads are
 context switched.                             context switched.
 No. of CPU 1.                                 No. of CPU >= 1. (Better to have more than
                                               1)
  od
process.
Thread Scheduling:
Threads are scheduled for execution based on their priority. Even though threads are
executing within the runtime, all threads are assigned processor time slices by the operating
system.
                                          p
                                        el
                   eH
 od
C
                          LEC-4: Components of OS
   1. Kernel: A kernel is that part of the operating system which interacts directly with
      the hardware andperforms the most crucial tasks.
          a. Heart of OS/Core component
          b. Very first part of OS to load on start-up.
   2. User space: Where application software runs, apps don’t have privileged access to the
      underlying hardware. It interacts with kernel.
          a. GUI
          b. CLI
   A shell, also known as a command interpreter, is that part of the operating system that receives
   commands from the users and gets them executed.
Functions of Kernel:
                                                                  p
     1. Process management:
            a. Scheduling processes and threads on the CPUs.
            b. Creating & deleting both user and system process.
                                                     el
            c. Suspending and resuming processes
            d. Providing mechanisms for process synchronization or process
               communication.
     2. Memory management:
            a. Allocating and deallocating memory space as per need.
              eH
            b. Keeping track of which part of memory are currently being used and by
               which process.
     3. File management:
            a. Creating and deleting files.
            b. Creating and deleting directories to organize files.
            c. Mapping files into secondary storage.
            d. Backup support onto a stable storage media.
     4. I/O management: to manage and control I/O operations and I/O devices
           od
           ii. Buffering
                   1. Within one job.
                   2. Eg. Youtube video buffering
          iii. Caching
                   1. Memory caching, Web caching etc.
 Types of Kernels:
        1. Monolithic kernel
         a. All functions are in kernel itself.
         b. Bulky in size.
         c. Memory required to run is high.
         d. Less reliable, one module crashes -> whole kernel is down.
         e. High performance as communication is fast. (Less user mode, kernel
             mode overheads)
         f. Eg. Linux, Unix, MS-DOS.
      2. Micro Kernel
            a. Only major functions are in kernel.
                     i. Memory mgmt.
                    ii. Process mgmt.
            b. File mgmt. and IO mgmt. are in User-space.
            c. smaller in size.
            d. More Reliable
            e. More stable
            f. Performance is slow.
            g. Overhead switching b/w user mode and kernel mode.
            h. Eg. L4 Linux, Symbian OS, MINIX etc.
      3. Hybrid Kernel:
                     a. Advantages of both worlds. (File mgmt. in User space and rest in Kernel
                        space. )
                     b. Combined approach.
                     c. Speed and design of mono.
                     d. Modularity and stability of micro.
                                                             p
                     e. Eg. MacOS, Windows NT/7/10
                     f. IPC also happens but lesser overheads
      4. Nano/Exo kernels…
                                                el
Q. How will communication happen between user mode and kernel mode?
Ans. Inter process communication (IPC).
            eH
        1. Two processes executing independently, having independent memory space (Memory
            protection), But some may need to communicate to work.
  A system call is a mechanism using which a user program can request a service from the kernel for
  which it does not have the permission to perform.
  User programs typically do not have permission to perform operations like accessing I/O devices and
  communicating other programs.
  System Calls are the only way through which a process can go into kernel mode from user mode.
Types of System Calls:
1) Process Control
   a. end, abort
   b. load, execute
   c. create process, terminate process
   d. get process attributes, set process attributes
   e. wait for time
   f. wait event, signal event
   g. allocate and free memory
2) File Management
   a. create file, delete file
   b. open, close
   c. read, write, reposition
   d. get file attributes, set file attributes
3) Device Management
   a. request device, release device
   b. read, write, reposition
   c. get device attributes, set device attributes
   d. logically attach or detach devices
4) Information maintenance
   a. get time or date, set time or date
   b. get system data, set system data
   c. get process, file, or device attributes
   d. set process, file, or device attributes
5) Communication Management
   a. create, delete communication connection
   b. send, receive messages
   c. transfer status information
   d. attach or detach remote devices
  i. PC On
 ii. CPU initializes itself and looks for a firmware program (BIOS) stored in
     BIOS Chip (Basic input-output system chip is a ROM chip found on
     mother board that allows to access & setup computer system at most
     basic level.)
         1. In modern PCs, CPU loads UEFI (Unified extensible firmware
              interface)
iii. CPU runs the BIOS which tests and initializes system hardware. Bios
     loads configuration settings. If something is not appropriate (like missing
     RAM) error is thrown and boot process is stopped.
     This is called POST (Power on self-test) process.
     (UEFI can do a lot more than just initialize hardware; it’s really a tiny
     operating system. For example, Intel CPUs have the Intel Management
     Engine. This provides a variety of features, including powering Intel’s
     Active Management Technology, which allows for remote management
                                          p
     of business PCs.)
iv. BIOS will handoff responsibility for booting your PC to your OS’s
     bootloader.
                                        el
       1. BIOS looked at the MBR (master boot record), a special boot
            sector at the beginning of a disk. The MBR contains code that
            loads the rest of the operating system, known as a “bootloader.”
            The BIOS executes the bootloader, which takes it from there and
                 eH
            begins booting the actual operating system—Windows or Linux,
            for example.
            In other words,
            the BIOS or UEFI examines a storage device on your system to
            look for a small program, either in the MBR or on an EFI system
            partition, and runs it.
v. The bootloader is a small program that has the large task of booting the
   rest of the operating system (Boots Kernel then, User Space). Windows
od
1.   A 32-bit OS has 32-bit registers, and it can access 2^32 unique memory addresses. i.e., 4GB of
     physical memory.
2.   A 64-bit OS has 64-bit registers, and it can access 2^64 unique memory addresses. i.e.,
     17,179,869,184 GB of physical memory.
3.   32-bit CPU architecture can process 32 bits of data & information.
4.   64-bit CPU architecture can process 64 bits of data & information.
5.   Advantages of 64-bit over the 32-bit operating system:
          a. Addressable Memory: 32-bit CPU -> 2^32 memory addresses, 64-bit CPU -> 2^64
              memory addresses.
          b. Resource usage: Installing more RAM on a system with a 32-bit OS doesn't impact
              performance. However, upgrade that system with excess RAM to the 64-bit version of
              Windows, and you'll notice a difference.
                                               p
          c. Performance: All calculations take place in the registers. When you’re performing math in
              your code, operands are loaded from memory into registers. So, having larger registers
              allow you to perform larger calculations at the same time.
                                             el
              32-bit processor can execute 4 bytes of data in 1 instruction cycle while 64-bit means that
              processor can execute 8 bytes of data in 1 instruction cycle.
              (In 1 sec, there could be thousands to billons of instruction cycles depending upon a
              processor design)
                    eH
          d. Compatibility: 64-bit CPU can run both 32-bit and 64-bit OS. While 32-bit CPU can only
              run 32-bit OS.
          e. Better Graphics performance: 8-bytes graphics calculations make graphics-intensive apps
              run faster.
od
C
                                 Lec-8: Storage Devices Basics
                                                   p
                                                 el
                       eH
    1. Register: Smallest unit of storage. It is a part of CPU itself.
       A register may hold an instruction, a storage address, or any data (such as bit sequence or individual
       characters).
       Registers are a type of computer memory used to quickly accept, store, and transfer data and
       instructions that are being used immediately by the CPU.
    2. Cache: Additional memory system that temporarily stores frequently used instructions and data for
       quicker processing by the CPU.
   od
Comparison
   1. Cost:
           a. Primary storages are costly.
C
                                                    p
                                                  el
                        eH
    5. Attributes of process:
           a. Feature that allows identifying a process uniquely.
           b. Process table
                      i. All processes are being tracked by OS using a table like data structure.
                     ii. Each entry in that table is process control block (PCB).
   od
Registers in the PCB, it is a data structure. When a processes is running and it's time slice expires, the
current value of process specific registers would be stored in the PCB and the process would be swapped
out. When the process is scheduled to be run, the register values is read from the PCB and written to the
CPU registers. This is the main purpose of the registers in the PCB.
                              Lec-10: Process States | Process Queues
1.   Process States: As process executes, it changes state. Each process may be in one of the following
     states.
          a. New: OS is about to pick the program & convert it into process. OR the process is being
             created.
          b. Run: Instructions are being executed; CPU is allocated.
          c. Waiting: Waiting for IO.
          d. Ready: The process is in memory, waiting to be assigned to a processor.
          e. Terminated: The process has finished execution. PCB entry removed from process table.
                                               p
2. Process Queues:
                                             el
                    eH
       a. Job Queue:
                 i. Processes in new state.
                ii. Present in secondary memory.
               iii. Job Schedular (Long term schedular (LTS)) picks process from the pool and
                    loads them into memory for execution.
       b. Ready Queue:
                 i. Processes in Ready state.
od
1.   Swapping
        a. Time-sharing system may have medium term schedular (MTS).
        b. Remove processes from memory to reduce degree of multi-programming.
        c. These removed processes can be reintroduced into memory, and its execution can be continued
            where it left off. This is called Swapping.
        d. Swap-out and swap-in is done by MTS.
        e. Swapping is necessary to improve process mix or because a change in memory requirements has
            overcommitted available memory, requiring memory to be freed up.
        f. Swapping is a mechanism in which a process can be swapped temporarily out of main memory (or
            move) to secondary storage (disk) and make that memory available to other processes. At some
            later time, the system swaps back the process from the secondary storage to main memory.
                                                 p
                                               el
                     eH
2. Context-Switching
       a. Switching the CPU to another process requires performing a state save of the current process and a
           state restore of a different process.
       b. When this occurs, the kernel saves the context of the old process in its PCB and loads the saved
           context of the new process scheduled to run.
       c. It is pure overhead, because the system does no useful work while switching.
       d. Speed varies from machine to machine, depending on the memory speed, the number of registers
           that must be copied etc.
od
3. Orphan process
       a. The process whose parent process has been terminated and it is still running.
       b. Orphan processes are adopted by init process.
       c. Init is the first process of OS.
4. Zombie process / Defunct process
       a. A zombie process is a process whose execution is completed but it still has an entry in the process
C
           table.
       b. Zombie processes usually occur for child processes, as the parent process still needs to read its
           child’s exit status. Once this is done using the wait system call, the zombie process is eliminated
           from the process table. This is known as reaping the zombie process.
       c. It is because parent process may call wait () on child process for a longer time duration and child
           process got terminated much earlier.
       d. As entry in the process table can only be removed, after the parent process reads the exit status of
           child process. Hence, the child process remains a zombie till it is removed from the process table.
                       LEC-12: Intro to Process Scheduling | FCFS | Convoy Effect
1.    Process Scheduling
             a. Basis of Multi-programming OS.
             b. By switching the CPU among processes, the OS can make the computer more productive.
             c. Many processes are kept in memory at a time, when a process must wait or time quantum expires,
                  the OS takes the CPU away from that process & gives the CPU to another process & this pattern
                  continues.
2.    CPU Scheduler
             a. Whenever the CPU become ideal, OS must select one process from the ready queue to be executed.
             b. Done by STS.
3.    Non-Preemptive scheduling
             a. Once CPU has been allocated to a process, the process keeps the CPU until it releases CPU either by
                  terminating or by switching to wait-state.
             b. Starvation, as a process with long burst time may starve less burst time process.
                                                      p
             c. Low CPU utilization.
4.    Preemptive scheduling
             a. CPU is taken away from a process after time quantum expires along with terminating or switching
                                                    el
                  to wait-state.
             b. Less Starvation
             c. High CPU utilization.
5.    Goals of CPU scheduling
             a. Maximum CPU utilization
                       eH
             b. Minimum Turnaround time (TAT).
             c. Min. Wait-time
             d. Min. response time.
             e. Max. throughput of system.
6.    Throughput: No. of processes completed per unit time.
7.    Arrival time (AT): Time when process is arrived at the ready queue.
8.    Burst time (BT): The time required by the process for its execution.
od
9.    Turnaround time (TAT): Time taken from first time process enters ready state till it terminates. (CT - AT)
10.   Wait time (WT): Time process spends waiting for CPU. (WT = TAT – BT)
11.   Response time: Time duration between process getting into ready queue and process getting CPU for the
      first time.
12.   Completion Time (CT): Time taken till process gets terminated.
13.   FCFS (First come-first serve):
             a. Whichever process comes first in the ready queue will be given CPU first.
C
            b. In this, if one process has longer BT. It will have major effect on average WT of diff processes, called
                  Convoy effect.
             c. Convoy Effect is a situation where many processes, who need to use a resource for a short time, are
                  blocked by one process holding that resource for a long time.
                        i. This cause poor resource management.
                                           LEC-13: CPU Scheduling | SJF | Priority | RR
1.   Shortest Job First (SJF) [Non-preemptive]
          a. Process with least BT will be dispatched to CPU first.
          b. Must do estimation for BT for each process in ready queue beforehand, Correct estimation of BT is
               an impossible task (ideally.)
          c. Run lowest time process for all time then, choose job having lowest BT at that instance.
          d. This will suffer from convoy effect as if the very first process which came is Ready state is having a
               large BT.
          e. Process starvation might happen.
          f. Criteria for SJF algos, AT + BT.
2.   SJF [Preemptive]
          a. Less starvation.
          b. No convoy effect.
          c. Gives average WT less for a given set of processes as scheduling short job before a long one
               decreases the WT of short job more than it increases the WT of the long process.
                                                      p
3.   Priority Scheduling [Non-preemptive]
          a. Priority is assigned to a process when it is created.
          b. SJF is a special case of general priority scheduling with priority inversely proportional to BT.
                                                    el
4.   Priority Scheduling [Preemptive]
          a. Current RUN state job will be preempted if next job has higher priority.
          b. May cause indefinite waiting (Starvation) for lower priority jobs. (Possibility is they won’t get
               executed ever). (True for both preemptive and non-preemptive version)
                      i. Solution: Ageing is the solution.
                      eH
                     ii. Gradually increase priority of process that wait so long. E.g., increase priority by 1 every 15
                         minutes.
5.   Round robin scheduling (RR)
          a. Most popular
          b. Like FCFS but preemptive.
          c. Designed for time sharing systems.
          d. Criteria: AT + time quantum (TQ), Doesn’t depend on BT.
od
          e. No process is going to wait forever, hence very low starvation. [No convoy effect]
          f. Easy to implement.
          g. If TQ is small, more will be the context switch (more overhead).
C
                                       LEC-14: MLQ | MLFQ
                                                                         p
         d. System process: Created by OS (Highest priority)
                                                         el
         Interactive process (Foreground process): Needs user input (I/O).
         Batch process (Background process): Runs silently, no user input required.
         e. Scheduling among different sub-queues is implemented as fixed priority preemptive
              scheduling. E.g., foreground queue has absolute priority over background queue.
         f. If an interactive process comes & batch process is currently executing. Then, batch process will
         eH
              be preempted.
         g. Problem: Only after completion of all the processes from the top-level ready queue, the
              further level ready queues will be scheduled.
              This came starvation for lower priority process.
         h. Convoy effect is present.
3. Comparison:
            FCFS       SJF       PSJF      Priority       P-         RR       MLQ       MLFQ
                                                                 p
                                                          Priority
 Design       Simple   Complex   Complex   Complex        Complex    Simple   Complex   Complex
 Preemption   No       No        Yes       No             Yes        Yes      Yes       Yes
                                                el
 Convoy       Yes      Yes       No        Yes            Yes        No       Yes       Yes
 effect
 Overhead     No       No        Yes       No             Yes        Yes      Yes       Yes
       eH
    od
C
                            LEC-15: Introduction to Concurrency
1.   Concurrency is the execution of the multiple instruction sequences at the same time. It happens in
     the operating system when there are several process threads running in parallel.
2.   Thread:
         • Single sequence stream within a process.
         • An independent path of execution in a process.
         • Light-weight process.
         • Used to achieve parallelism by dividing a process’s tasks which are independent path of
              execution.
         • E.g., Multiple tabs in a browser, text editor (When you are typing in an editor, spell
              checking, formatting of text and saving the text are done concurrently by multiple threads.)
3.   Thread Scheduling: Threads are scheduled for execution based on their priority. Even though
     threads are executing within the runtime, all threads are assigned processor time slices by the
                                                p
     operating system.
4.   Threads context switching
         • OS saves current state of thread & switches to another thread of same process.
                                              el
         • Doesn’t includes switching of memory address space. (But Program counter, registers &
              stack are included.)
         • Fast switching as compared to process switching
         • CPU’s cache state is preserved.
                    eH
5.   How each thread get access to the CPU?
         • Each thread has its own program counter.
         • Depending upon the thread scheduling algorithm, OS schedule these threads.
         • OS will fetch instructions corresponding to PC of that thread and execute instruction.
6.   I/O or TQ, based context switching is done here as well
         • We have TCB (Thread control block) like PCB for state storage management while
              performing context switching.
od
                                         p
                    don't know the order in which the threads will attempt to access the
                    shared data. Therefore, the result of the change in data is dependent
                    on the thread scheduling algorithm, i.e., both threads are "racing" to
                                       el
                    access/change the data.
4. Solution to Race Condition
       a. Atomic operations: Make Critical code section an atomic operation, i.e.,
            Executed in one CPU cycle.
                 eH
       b. Mutual Exclusion using locks.
       c. Semaphores
5. Can we use a simple flag variable to solve the problem of race condition?
       a. No.
6. Peterson’s solution can be used to avoid race condition but holds good for only 2
   process/threads.
7. Mutex/Locks
od
       a. Locks can be used to implement mutual exclusion and avoid race condition
            by allowing only one thread/process to access critical section.
       b. Disadvantages:
                 i. Contention: one thread has acquired the lock, other threads will be
                    busy waiting, what if thread that had acquired the lock dies, then all
                    other threads will be in infinite waiting.
C
                ii. Deadlocks
               iii. Debugging
               iv. Starvation of high priority threads.
LEC-17: Conditional Variable and Semaphores for Threads synchronization
 1. Conditional variable
       a. The condition variable is a synchronization primitive that lets the thread wait
           until a certain condition occurs.
       b. Works with a lock
       c. Thread can enter a wait state only when it has acquired a lock. When a
           thread enters the wait state, it will release the lock and wait until another
           thread notifies that the event has occurred. Once the waiting thread enters
           the running state, it again acquires the lock immediately and starts executing.
       d. Why to use conditional variable?
                i. To avoid busy waiting.
       e. Contention is not here.
 2. Semaphores
                                         p
       a. Synchronization method.
       b. An integer that is equal to number of resources
       c. Multiple threads can go and execute C.S concurrently.
                                       el
       d. Allows multiple program threads to access the finite instance of resources
           whereas mutex allows multiple threads to access a single shared resource
           one at a time.
       e. Binary semaphore: value can be 0 or 1.
                  eH
                i. Aka, mutex locks
       f. Counting semaphore
                i. Can range over an unrestricted domain.
               ii. Can be used to control access to a given resource consisting of a finite
                    number of instances.
       g. To overcome the need for busy waiting, we can modify the definition of
           the wait () and signal () semaphore operations. When a process executes the
od
           wait () operation and finds that the semaphore value is not positive, it must
           wait. However, rather than engaging in busy waiting, the process car block
           itself. The block- operation places a process into a waiting queue associated
           with the semaphore, and the state of the process is switched to the Waiting
           state. Then control is transferred to the CPU scheduler, which selects another
           process to execute.
C
                                            p
                                          el
                 eH
1.    We have 5 philosophers.
2.    They spend their life just being in two states:
           a. Thinking
           b. Eating
3.    They sit on a circular table surrounded by 5 chairs (1 each), in the center of table is a bowl of
      noodles, and the table is laid with 5 single forks.
4.    Thinking state: When a ph. Thinks, he doesn’t interact with others.
od
5.    Eating state: When a ph. Gets hungry, he tries to pick up the 2 forks adjacent to him (Left and
      Right). He can pick one fork at a time.
6.    One can’t pick up a fork if it is already taken.
7.    When ph. Has both forks at the same time, he eats without releasing forks.
8.    Solution can be given using semaphores.
           a. Each fork is a binary semaphore.
C
                                            p
                                          el
               eH
od
C
                                      LEC-21: Deadlock Part-1
                                                p
     CPUs.
8.   How a process/thread utilize a resource?
           a. Request: Request the R, if R is free Lock it, else wait till it is available.
                                              el
           b. Use
           c. Release: Release resource instance and make it available for other processes
                    eH
od
                                                p
                 ii. Protocol (A) can be, each process has to request and be allocated all its resources
                     before its execution.
                iii. Protocol (B) can be, allow a process to request resources only when it has none. It
                                              el
                     can request any additional resources after it must have released all the resources
                     that it is currently allocated.
        c. No preemption
                  i. If a process is holding some resources and request another resource that cannot
                    eH
                     be immediately allocated to it, then all the resources the process is currently
                     holding are preempted. The process will restart only when it can regain its old
                     resources, as well as the new one that it is requesting. (Live Lock may occur).
                 ii. If a process requests some resources, we first check whether they are available. If
                     yes, we allocate them. If not, we check whether they are allocated to some other
                     process that is waiting for additional resources. If so, preempt the desired resource
                     from waiting process and allocate them to the requesting process.
od
        d. Circular wait
                  i. To ensure that this condition never holds is to impose a proper ordering of
                     resource allocation.
                 ii. P1 and P2 both require R1 and R1, locking on these resources should be like, both
                     try to lock R1 then R2. By this way which ever process first locks R1 will get R2.
C
                                       LEC-22: Deadlock Part-2
1. Deadlock Avoidance: Idea is, the kernel be given in advance info concerning which resources will
   use in its lifetime.
   By this, system can decide for each request whether the process should wait.
   To decide whether the current request can be satisfied or delayed, the system must consider the
   resources currently available, resources currently allocated to each process in the system and the
   future requests and releases of each process.
        a. Schedule process and its resources allocation in such a way that the DL never occur.
        b. Safe state: A state is safe if the system can allocate resources to each process (up to its
             max.) in some order and still avoid DL.
             A system is in safe state only if there exists a safe sequence.
        c. In an Unsafe state, the operating system cannot prevent processes from requesting
             resources in such a way that any deadlock occurs. It is not necessary that all unsafe states
                                                 p
             are deadlocks; an unsafe state may lead to a deadlock.
        d. The main key of the deadlock avoidance method is whenever the request is made for
             resources then the request must only be approved only in the case if the resulting state is a
                                               el
             safe state.
        e. In a case, if the system is unable to fulfill the request of all processes, then the state of the
             system is called unsafe.
        f. Scheduling algorithm using which DL can be avoided by finding safe state. (Banker
                    eH
             Algorithm)
2. Banker Algorithm
        a. When a process requests a set of resources, the system must determine whether allocating
             these resources will leave the system in a safe state. If yes, then the resources may be
             allocated to the process. If not, then the process must wait till other processes release
             enough resources.
3. Deadlock Detection: Systems haven’t implemented deadlock-prevention or a deadlock avoidance
od
1. In Multi-programming environment, we have multiple processes in the main memory (Ready Queue) to
   keep the CPU utilization high and to make computer responsive to the users.
2. To realize this increase in performance, however, we must keep several processes in the memory; that is, we
   must share the main memory. As a result, we must manage main memory for all the different processes.
3. Logical versus Physical Address Space
        a. Logical Address
                     i. An address generated by the CPU.
                    ii. The logical address is basically the address of an instruction or data used by a process.
                  iii. User can access logical address of the process.
                   iv. User has indirect access to the physical address through logical address.
                    v. Logical address does not exist physically. Hence, aka, Virtual address.
                   vi. The set of all logical addresses that are generated by any program is referred to as Logical
                        Address Space.
                                                   p
                  vii. Range: 0 to max.
        b. Physical Address
                     i. An address loaded into the memory-address register of the physical memory.
                                                 el
                    ii. User can never access the physical address of the Program.
                   iii. The physical address is in the memory unit. It’s a location in the main memory physically.
                   iv. A physical address can be accessed by a user indirectly but not directly.
                    v. The set of all physical addresses corresponding to the Logical addresses is commonly
                        known as Physical Address Space.
                     eH
                   vi. It is computed by the Memory Management Unit (MMU).
                  vii. Range: (R + 0) to (R + max), for a base value R.
        c. The runtime mapping from virtual to physical address is done by a hardware device called the
             memory-management unit (MMU).
        d. The user's program mainly generates the logical address, and the user thinks that the program is
             running in this logical address, but the program mainly needs physical memory in order to
             complete its execution.
od
C
e.
4.   How OS manages the isolation and protect? (Memory Mapping and Protection)
        a. OS provides this Virtual Address Space (VAS) concept.
        b. To separate memory space, we need the ability to determine the range of legal addresses that the
            process may access and to ensure that the process can access only these legal addresses.
        c. The relocation register contains value of smallest physical address (Base address [R]); the limit
            register contains the range of logical addresses (e.g., relocation = 100040 & limit = 74600).
        d. Each logical address must be less than the limit register.
        e.   MMU maps the logical address dynamically by adding the value in the relocation register.
        f.   When CPU scheduler selects a process for execution, the dispatcher loads the relocation and limit
             registers with the correct values as part of the context switch. Since every address generated by the
             CPU (Logical address) is checked against these registers, we can protect both OS and other users’
             programs and data from being modified by running process.
        g.   Any attempt by a program executing in user mode to access the OS memory or other uses’
             memory results in a trap in the OS, which treat the attempt as a fatal error.
        h.   Address Translation
                                                  p
5. Allocation Method on Physical Memory
       a. Contiguous Allocation
       b. Non-contiguous Allocation
                                                el
                    eH
6. Contiguous Memory Allocation
       a. In this scheme, each process is contained in a single contiguous block of memory.
       b. Fixed Partitioning
                 i. The main memory is divided into partitions of equal or different sizes.
od
C
                   ii.
                  iii. Limitations:
                           1. Internal Fragmentation: if the size of the process is lesser then the total size of
                                the partition then some size of the partition gets wasted and remain unused.
                                This is wastage of the memory and called internal fragmentation.
                           2. External Fragmentation: The total unused space of various partitions cannot be
                                used to load the processes even though there is space available but not in the
                                contiguous form.
                           3. Limitation on process size: If the process size is larger than the size of maximum
                                sized partition then that process cannot be loaded into the memory. Therefore, a
                                limitation can be imposed on the process size that is it cannot be larger than the
                                size of the largest partition.
                   4.  Low degree of multi-programming: In fixed partitioning, the degree of
                       multiprogramming is fixed and very less because the size of the partition cannot
                       be varied according to the size of processes.
c.   Dynamic Partitioning
         i. In this technique, the partition size is not declared initially. It is declared at the time of
            process loading.
                                          p
           ii.
          iii. Advantages over fixed partitioning
                                        el
                   1. No internal fragmentation
                   2. No limit on size of process
                   3. Better degree of multi-programming
          iv. Limitation
                   1. External fragmentation
            eH
od
C
                                  LEC-25: Free Space Management
1. Defragmentation/Compaction
       a. Dynamic partitioning suffers from external fragmentation.
       b. Compaction to minimize the probability of external fragmentation.
       c. All the free partitions are made contiguous, and all the loaded partitions are brought together.
       d. By applying this technique, we can store the bigger processes in the memory. The free partitions
           are merged which can now be allocated according to the needs of new processes. This technique is
           also called defragmentation.
       e. The efficiency of the system is decreased in the case of compaction since all the free spaces will be
           transferred from several places to a single place.
2. How free space is stored/represented in OS?
       a. Free holes in the memory are represented by a free list (Linked-List data structure).
3. How to satisfy a request of a of n size from a list of free holes?
       a. Various algorithms which are implemented by the Operating System in order to find out the holes
                                                  p
           in the linked list and allocate them to the processes.
       b. First Fit
                   i. Allocate the first hole that is big enough.
                                                el
                  ii. Simple and easy to implement.
                 iii. Fast/Less time complexity
       c. Next Fit
                   i. Enhancement on First fit but starts search always from last allocated hole.
                  ii. Same advantages of First Fit.
                     eH
       d. Best Fit
                   i. Allocate smallest hole that is big enough.
                  ii. Lesser internal fragmentation.
                 iii. May create many small holes and cause major external fragmentation.
                 iv. Slow, as required to iterate whole free holes list.
       e. Worst Fit
                   i. Allocate the largest hole that is big enough.
od
                                                   p
              logical memory into blocks of same size called Pages. (# Page size = Frame size)
          d. Page size is usually determined by the processor architecture. Traditionally, pages in a system had
             uniform size, such as 4,096 bytes. However, processor designs often allow two or more, sometimes
                                                 el
             simultaneous, page sizes due to its benefits.
          e. Page Table
                    i. A Data structure stores which page is mapped to which frame.
                   ii. The page table contains the base address of each page in the physical memory.
          f. Every address generated by CPU (logical address) is divided into two parts: a page number (p) and
                     eH
             a page offset (d). The p is used as an index into a page table to get base address the corresponding
             frame in physical memory.
od
         g. Page table is stored in main memory at the time of process creation and its base address is stored
C
                                         p
                                       el
            eH
f.   TLB hit, TLB contains the mapping for the requested logical address.
g.   Address space identifier (ASIDs) is stored in each entry of TLB. ASID uniquely identifies each
     process and is used to provide address space protection and allow to TLB to contain entries for
     several different processes. When TLB attempts to resolve virtual page numbers, it ensures that
od
     the ASID for the currently executing process matches the ASID associated with virtual page. If it
     doesn’t match, the attempt is treated as TLB miss.
C
                    LEC-27: Segmentation | Non-Contiguous Memory Allocation
1.    An important aspect of memory management that become unavoidable with paging is separation of user’s
     view of memory from the actual physical memory.
2.   Segmentation is memory management technique that supports the user view of memory.
3.   A logical address space is a collection of segments, these segments are based on user view of logical
     memory.
4.   Each segment has segment number and offset, defining a segment.
     <segment-number, offset> {s,d}
5.   Process is divided into variable segments based on user view.
6.   Paging is closer to the Operating system rather than the User. It divides all the processes into the form of
     pages although a process can have some relative parts of functions which need to be loaded in the same
     page.
7.   Operating system doesn't care about the User's view of the process. It may divide the same function into
     different pages and those pages may or may not be loaded at the same time into the memory. It
                                                   p
     decreases the efficiency of the system.
8.   It is better to have segmentation which divides the process into the segments. Each segment contains the
     same type of functions such as the main function can be included in one segment and the library functions
                                                 el
     can be included in the other segment.
                     eH
od
9.
C
10. Advantages:
        a. No internal fragmentation.
        b. One segment has a contiguous allocation, hence efficient working within segment.
        c. The size of segment table is generally less than the size of page table.
        d. It results in a more efficient system because the compiler keeps the same type of functions in one
            segment.
11. Disadvantages:
        a. External fragmentation.
        b. The different size of segment is not good that the time of swapping.
12. Modern System architecture provides both segmentation and paging implemented in some hybrid
    approach.
            LEC-28: What is Virtual Memory? || Demand Paging || Page Faults
1.  Virtual memory is a technique that allows the execution of processes that are not completely in the
    memory. It provides user an illusion of having a very big main memory. This is done by treating a part of
    secondary memory as the main memory. (Swap-space)
2. Advantage of this is, programs can be larger than physical memory.
3. It is required that instructions must be in physical memory to be executed. But it limits the size of a
    program to the size of physical memory. In fact, in many cases, the entire program is not needed at the
    same time. So, we want an ability to execute a program that is only partially in memory would give
    many benefits:
          a. A program would no longer be constrained by the amount of physical memory that is
              available.
          b. Because each user program could take less physical memory, more programs could be run at
              the same time, with a corresponding increase in CPU utilization and throughput.
          c. Running a program that is not entirely in memory would benefit both the system and the
                                               p
              user.
4. Programmer is provided very large virtual memory when only a smaller physical memory is available.
5. Demand Paging is a popular method of virtual memory management.
                                             el
6. In demand paging, the pages of a process which are least used, get stored in the secondary memory.
7. A page is copied to the main memory when its demand is made, or page fault occurs. There are various
    page replacement algorithms which are used to determine the pages which will be replaced.
8. Rather than swapping the entire process into memory, we use Lazy Swapper. A lazy swapper never
    swaps a page into memory unless that page will be needed.
                 eH
9. We are viewing a process as a sequence of pages, rather than one large contiguous address space, using
    the term Swapper is technically incorrect. A swapper manipulates entire processes, whereas a Pager is
    concerned with individual pages of a process.
10. How Demand Paging works?
          a. When a process is to be swapped-in, the pager guesses which pages will be used.
          b. Instead of swapping in a whole process, the pager brings only those pages into memory. This,
              it avoids reading into memory pages that will not be used anyway.
od
          c. Above way, OS decreases the swap time and the amount of physical memory needed.
          d. The valid-invalid bit scheme in the page table is used to distinguish between pages that are
              in memory and that are on the disk.
                     i. Valid-invalid bit 1 means, the associated page is both legal and in memory.
                    ii. Valid-invalid bit 0 means, the page either is not valid (not in the LAS of the process)
                         or is valid but is currently on the disk.
C
                                    p
e.
f. If a process never attempts to access some invalid bit page, the process will be executed
   successfully without even the need pages present in the swap space.
                                  el
g. What happens if the process tries to access a page that was not brought into memory, access
   to a page marked invalid causes page fault. Paging hardware noticing invalid bit for a
   demanded page will cause a trap to the OS.
h. The procedure to handle the page fault:
       eH
           i. Check an internal table (in PCB of the process) to determine whether the reference
              was valid or an invalid memory access.
          ii. If ref. was invalid process throws exception.
              If ref. is valid, pager will swap-in the page.
         iii. We find a free frame (from free-frame list)
         iv. Schedule a disk operation to read the desired page into the newly allocated frame.
          v. When disk read is complete, we modify the page table that, the page is now in
              memory.
od
         vi. Restart the instruction that was interrupted by the trap. The process can now access
              the page as through it had always been in memory.
C
                                              p
         i.
         j. Pure Demand Paging
                                            el
                  i. In extreme case, we can start executing a process with no pages in memory. When
                 eH
                     OS sets the instruction pointer to the first instruction of the process, which is not in
                     the memory. The process immediately faults for the page and page is brought in the
                     memory.
                 ii. Never bring a page into memory until it is required.
        k. We use locality of reference to bring out reasonable performance from demand paging.
11. Advantages of Virtual memory
        a. The degree of multi-programming will be increased.
        b. User can run large apps with less real physical memory.
od
1.   Whenever Page Fault occurs, that is, a process tries to access a page which is not currently present in a
     frame and OS must bring the page from swap-space to a frame.
2.   OS must do page replacement to accommodate new page into a free frame, but there might be a possibility
     the system is working in high utilization and all the frames are busy, in that case OS must replace one of the
     pages allocated into some frame with the new page.
3.   The page replacement algorithm decides which memory page is to be replaced. Some allocated page is
     swapped out from the frame and new page is swapped into the freed frame.
4.   Types of Page Replacement Algorithm: (AIM is to have minimum page faults)
          a. FIFO
                     i. Allocate frame to the page as it comes into the memory by replacing the oldest page.
                    ii. Easy to implement.
                   iii. Performance is not always good
                              1. The page replaced may be an initialization module that was used long time ago
                                                    p
                                   (Good replacement candidate)
                              2. The page may contain a heavily used variable that was initialized early and is in
                                   content use. (Will again cause page fault)
                                                  el
                   iv. Belady’s anomaly is present.
                              1. In the case of LRU and optimal page replacement algorithms, it is seen that
                                   the number of page faults will be reduced if we increase the number of
                                   frames. However, Balady found that, In FIFO page replacement algorithm, the
                                   number of page faults will get increased with the increment in number of
                      eH
                                   frames.
                              2. This is the strange behavior shown by FIFO algorithm in some of the cases.
          b. Optimal page replacement
                     i. Find if a page that is never referenced in future. If such a page exists, replace this page
                        with new page.
                        If no such page exists, find a page that is referenced farthest in future. Replace this page
                        with new page.
od
                              1. Counters
                                       a. Associate time field with each page table entry.
                                       b. Replace the page with smallest time value.
                             2. Stack
                                       a. Keep a stack of page number.
                                       b. Whenever page is referenced, it is removed from the stack & put on
                                             the top.
                                        c. By this, most recently used is always on the top, & least recently used
                                             is always on the bottom.
                                       d. As entries might be removed from the middle of the stack, so Doubly
                                             linked list can be used.
          d. Counting-based page replacement – Keep a counter of the number of references that have been
              made to each page. (Reference counting)
  i. Least frequently used (LFU)
         1. Actively used pages should have a large reference count.
         2. Replace page with the smallest count.
 ii. Most frequently used (MFU)
         1. Based on the argument that the page with the smallest count was probably just
               brought in and has yet to be used.
iii. Neither MFU nor LFU replacement is common.
                              p
                            el
  eH
od
C
                                             LEC-30: Thrashing
1.   Thrashing
         a. If the process doesn’t have the number of frames it needs to support pages in active use, it will
             quickly page-fault. At this point, it must replace some page. However, since all its pages are in active
             use, it must replace a page that will be needed again right away. Consequently, it quickly faults
             again, and again, and again, replacing pages that it must bring back in immediately.
         b. This high paging activity is called Thrashing.
         c. A system is Thrashing when it spends more time servicing the page faults than executing
             processes.
                                                    p
                                                  el
                      eH
         d. Technique to Handle Thrashing
                 i. Working set model
                         1. This model is based on the concept of the Locality Model.
                         2. The basic principle states that if we allocate enough frames to a process to
od
                            accommodate its current locality, it will only fault whenever it moves to some
                            new locality. But if the allocated frames are lesser than the size of the current
                            locality, the process is bound to thrash.
                ii. Page Fault frequency
                         1. Thrashing has a high page-fault rate.
                         2. We want to control the page-fault rate.
                         3. When it is too high, the process needs more frames. Conversely, if the page-fault
C
                            rate is too low, then the process may have too many frames.
                         4. We establish upper and lower bounds on the desired page fault rate.
                         5. If pf-rate exceeds the upper limit, allocate the process another frame, if pf-rate
                            fails falls below the lower limit, remove a frame from the process.
                         6. By controlling pf-rate, thrashing can be prevented.