Pipeline Processing

This document discusses pipelining in microprocessors. It defines pipelining as a technique where a microprocessor begins executing a second instruction before the first is completed. The pipeline is divided into stages where each stage performs part of the instruction processing concurrently. This allows new instructions to be fetched while other stages are performing operations. Pipelining improves processor throughput and reduces average instruction time compared to an unpipelined processor. However, it requires careful design to balance the stages and avoid issues like bubbles or synchronization problems that reduce its efficiency. Examples are provided to illustrate how speedup is calculated from pipelining.

Uploaded by

anismitaray14

PIPELINING

By
Dr. Mausumi Maitra
Professor
Dept. of Information Technology
Govt. College of Engg. and Ceramic Technology

What is Pipelining
- A technique used in advanced microprocessors where the microprocessor begins executing a second instruction before the first has been completed.
- A pipeline is a series of stages, where some work is done at each stage. The work is not finished until it has passed through all stages.
- With pipelining, the computer architecture allows the next instructions to be fetched while the processor is performing arithmetic operations, holding them in a buffer close to the processor until each instruction operation can be performed.

How Pipeline Works
- The pipeline is divided into segments, and each segment can execute its operation concurrently with the other segments.
- Once a segment completes an operation, it passes the result to the next segment in the pipeline and fetches the next operation from the preceding segment.

Pipeline
- A common analogy for a pipeline is a factory assembly line. Assume that there are three stages:
  1. Welding
  2. Painting
  3. Polishing
- For simplicity, assume that each task takes one hour.

Characteristics Of Pipelining
- If the stages of a pipeline are not balanced and one stage is slower than another, the throughput of the entire pipeline is limited by the slowest stage.
- In terms of a pipeline within a CPU, each instruction is broken up into different stages. Ideally, if the stages are balanced (all stages are ready to start at the same time and take an equal amount of time to execute), the time taken per instruction (pipelined) is defined as:

Time per instruction (pipelined) = Time per instruction (unpipelined) / Number of stages

Characteristics Of Pipelining (Contd.)
- The previous expression is ideal. We will see later that there are many ways in which a pipeline cannot function in a perfectly balanced fashion.
- In terms of a CPU, the implementation of pipelining has the effect of reducing the average instruction time, therefore reducing the average CPI.
- Example: If each instruction in a microprocessor takes 5 clock cycles (unpipelined) and we have a 4-stage pipeline, the ideal average CPI with the pipeline will be 5 / 4 = 1.25.
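
The ideal figure in this example can be checked with a few lines of Python. This is only a minimal sketch: the function name `ideal_pipelined_cpi` is illustrative and not from the slides, and it assumes perfectly balanced stages with no stalls.

```python
def ideal_pipelined_cpi(unpipelined_cycles: float, stages: int) -> float:
    """Ideal average cycles per instruction: unpipelined cycles / number of stages.

    Assumes perfectly balanced stages and no stalls (the ideal case only).
    """
    if stages <= 0:
        raise ValueError("stages must be positive")
    return unpipelined_cycles / stages


print(ideal_pipelined_cpi(5, 4))  # 1.25
```

Real pipelines rarely reach this value, since hazards and unbalanced stages add cycles.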
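
Example

[Figure: four sample instructions (Instruction 1 to Instruction 4) executed linearly, one after another.]

[Figure: four pipelined instructions. The first instruction takes 5 clock cycles to pass through all stages; each following instruction completes 1 cycle later.]

Cycle:  1   2   3   4   5   6   7   8
I1:     IF  ID  EX  M   W
I2:         IF  ID  EX  M   W
I3:             IF  ID  EX  M   W
I4:                 IF  ID  EX  M   W

Four Pipelined Instructions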
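
The overlap of the four instructions can be sketched in Python. This is an illustrative sketch, not part of the original slides: the helper names `pipeline_diagram` and `total_cycles` are hypothetical, and it assumes one instruction enters the pipeline per clock cycle with no stalls.

```python
STAGES = ["IF", "ID", "EX", "M", "W"]


def pipeline_diagram(num_instructions, stages=STAGES):
    """Return one text row per instruction.

    Instruction i (0-based) is shifted right by i clock cycles, because it
    enters the pipeline one cycle after the previous instruction.
    """
    width = max(len(s) for s in stages) + 1  # column width per clock cycle
    rows = []
    for i in range(num_instructions):
        cells = [" " * width] * i + [s.ljust(width) for s in stages]
        rows.append(f"I{i + 1}: " + "".join(cells).rstrip())
    return rows


def total_cycles(num_instructions, num_stages=len(STAGES)):
    """Cycles to finish all instructions: fill the pipeline, then one per cycle."""
    return num_stages + num_instructions - 1


for row in pipeline_diagram(4):
    print(row)
print("total cycles:", total_cycles(4))  # total cycles: 8
```

Executed linearly, the same four instructions would need 4 x 5 = 20 cycles under the same assumptions.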

Instruction Fetch
- The Instruction Fetch (IF) stage is responsible for obtaining the requested instruction from memory. The instruction and the program counter (which is incremented to point to the next instruction) are stored in the IF/ID pipeline register as temporary storage so that they may be used in the next stage at the start of the next clock cycle.

Instruction Decode
- The Instruction Decode (ID) stage is responsible for decoding the instruction and sending out the various control lines to the other parts of the processor. The instruction is sent to the control unit, where it is decoded, and the register operands are read from the register file.

Execution
- The Execution (EX) stage is where any calculations are performed. The main component in this stage is the ALU, which is made up of arithmetic and logic units.

Memory and IO
- The Memory and IO (MEM) stage is responsible for storing and loading values to and from memory. It is also responsible for input or output from the processor.

Write Back
- The Write Back (WB) stage is responsible for writing the result of a calculation, memory access or input into the register file.

Operation Timings
- Estimated timings for each of the stages:

Stage                Time
Instruction Fetch    2 ns
Instruction Decode   1 ns
Execution            2 ns
Memory and IO        2 ns
Write Back           1 ns
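
A short Python sketch shows what these timings imply. It assumes, as the later worked examples in this deck do, that the pipeline clock is set by the slowest stage; the function names are illustrative and not from the slides.

```python
# Stage timings in ns, taken from the table above.
STAGE_TIMES_NS = {
    "Instruction Fetch": 2,
    "Instruction Decode": 1,
    "Execution": 2,
    "Memory and IO": 2,
    "Write Back": 1,
}


def clock_period(stage_times):
    """Pipeline clock period: every stage gets the time of the slowest stage."""
    return max(stage_times.values())


def unpipelined_time(stage_times):
    """Time for one instruction when the stages simply run back to back."""
    return sum(stage_times.values())


def pipelined_latency(stage_times):
    """Time for one instruction to pass through every clocked stage."""
    return len(stage_times) * clock_period(stage_times)


print(clock_period(STAGE_TIMES_NS))       # 2  (ns per clock cycle)
print(unpipelined_time(STAGE_TIMES_NS))   # 8  (ns per instruction, unpipelined)
print(pipelined_latency(STAGE_TIMES_NS))  # 10 (ns for one instruction, pipelined)
```

Under these assumptions a single instruction takes longer in the pipeline (10 ns versus 8 ns), but once the pipeline is full one instruction completes every 2 ns, a steady-state speed up of 8 / 2 = 4 rather than the ideal 5.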

Advantages/Disadvantages
Advantages:
- More efficient use of the processor
- Faster execution of a large number of instructions

Disadvantages:
- Pipelining involves adding hardware to the chip
- Inability to continuously run the pipeline at full speed because of pipeline hazards, which disrupt the smooth execution of the pipeline

[Figure: Instructions in the pipeline stages]

Computer Performance
1. Latency – The amount of time that a single operation takes to execute.
2. Throughput – The rate at which operations get executed, generally expressed as operations/second or operations/cycle.

For Normal Processor
Throughput = 1 / Latency

In Pipeline Architecture
Throughput > 1 / Latency

This is because instruction execution is overlapped, which is called Temporal Parallelism. It is appropriate if:
1. The jobs to be carried out are identical.
2. A job can be divided into many independent tasks, i.e. each task can be done independently of the other tasks.
3. The time taken for performing each task is the same.
4. The time taken to transmit a job from one processing stage to the next is negligible compared to the time needed to execute a task.
5. The number of tasks into which a job is broken up is much smaller than the number of jobs to be carried out.
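
The two throughput relations can be compared numerically. The following is a minimal sketch with illustrative values (an 80 ns job split into 4 equal stages) that are not taken from the slides; it assumes a balanced pipeline that completes one job per clock cycle once it is full.

```python
def unpipelined_throughput(latency_ns: float) -> float:
    """Jobs per ns when each job must finish before the next one starts."""
    return 1.0 / latency_ns


def pipelined_throughput(latency_ns: float, stages: int) -> float:
    """Steady-state jobs per ns for a balanced pipeline.

    Assumes the clock period is latency / stages and that one job
    completes every clock cycle once the pipeline is full.
    """
    clock_ns = latency_ns / stages
    return 1.0 / clock_ns


latency_ns, stages = 80.0, 4  # illustrative values, not from the slides
print(unpipelined_throughput(latency_ns))        # 0.0125 jobs/ns
print(pipelined_throughput(latency_ns, stages))  # 0.05 jobs/ns
```

The latency of a single job is unchanged at 80 ns in this sketch; only the rate at which jobs complete improves.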

To Find the Speed Up of Pipeline:

Consider a pipeline system where
No. of segments in the pipeline = k
Clock cycle = tp
No. of jobs = n

The 1st job requires a time = k tp
The remaining (n - 1) jobs come out at one job per clock cycle = (n - 1) tp
Total time = (k + n - 1) tp

Now consider a system without pipelining:
Time required for each job = tn = k tp
Total time required for n jobs = n tn = n k tp

The speed up of the pipeline process is defined as the ratio
S = (Time taken without pipeline) / (Time taken with pipeline)
  = n k tp / ((k + n - 1) tp)

As the number of jobs increases, n >> k - 1, so
S ≈ n k tp / (n tp) = k

Therefore, the theoretical maximum speed up is k, where k is the number of stages in the pipeline.
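
The limiting step can be written out explicitly. The block below restates the same derivation in LaTeX, using the symbols k, n and t_p defined above; dividing numerator and denominator by n makes the limit visible.

```latex
S = \frac{n\,k\,t_p}{(k + n - 1)\,t_p}
  = \frac{k}{1 + \dfrac{k - 1}{n}},
\qquad
\lim_{n \to \infty} S = k
```

Because (k - 1)/n is positive for any finite n and k > 1, S stays strictly below k and only approaches it as n grows.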

Ex. 1
Let tp = 20 ns, k = 4 and n = 100 jobs.

For the pipeline system, time required
= (k + n - 1) tp = (4 + 99) X 20 ns = 2060 ns

For the non-pipeline system, time required
= n k tp = 100 X 4 X 20 ns = 8000 ns

Speed up ratio S = 8000 / 2060 = 3.88

If n = 1000, what will be the speed up ratio?

As the number of jobs increases, S -> 4, which is equal to the number of stages in the pipeline.
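
The numbers in this example, and the n = 1000 case, can be reproduced with a small Python sketch. The function names are illustrative; the formulas are the ones used in the example, with k stages, n jobs and clock cycle tp.

```python
def pipelined_time(k: int, n: int, tp: float) -> float:
    """Total time with a k-stage pipeline: (k + n - 1) * tp."""
    return (k + n - 1) * tp


def unpipelined_time(k: int, n: int, tp: float) -> float:
    """Total time without pipelining: n * k * tp."""
    return n * k * tp


def speedup(k: int, n: int) -> float:
    """Speed up ratio S = n*k / (k + n - 1); tp cancels out."""
    return n * k / (k + n - 1)


print(pipelined_time(4, 100, 20))    # 2060 (ns)
print(unpipelined_time(4, 100, 20))  # 8000 (ns)
for n in (100, 1000, 10000):
    print(n, round(speedup(4, n), 3))
```

For n = 1000 this gives S = 4000 / 1003, about 3.99, already very close to the limit of 4.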

Ex. 2
A job consists of 4 tasks. The times taken by the 4 tasks are 20 ns, 10 ns, 15 ns and 20 ns respectively. Pipelining is used to reduce processing time. If the number of jobs entering the pipeline is 120, find the efficiency of pipelining.

Time taken by each job = 20 + 10 + 15 + 20 = 65 ns

Without pipelining, time taken by 120 jobs = 120 X 65 = 7800 ns

If pipelining is used, all tasks must be allotted equal time, which should be the maximum time for a task = 20 ns.
Therefore, time taken to complete 120 jobs = (4 X 20) + (119 X 20) = 80 + 2380 = 2460 ns

Therefore, speed up = 7800 / 2460 = 3.17

Ideal speed up = 4 (no. of stages)
Therefore, pipeline efficiency = Actual speed up / Ideal speed up = 3.17 / 4 = 0.7925
% Efficiency = 0.7925 X 100 % = 79.25 %
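
The same calculation can be sketched as a reusable Python function for any set of unequal task times. The name `pipeline_efficiency` is illustrative; the function follows the method of this example, clocking every stage at the slowest task time.

```python
def pipeline_efficiency(task_times_ns, num_jobs):
    """Return (time without pipeline, time with pipeline, speed up, efficiency).

    Every stage is allotted the maximum task time, as in the example.
    """
    k = len(task_times_ns)                        # number of stages
    clock_ns = max(task_times_ns)                 # slowest task sets the clock
    t_without = num_jobs * sum(task_times_ns)     # jobs run one after another
    t_with = (k + num_jobs - 1) * clock_ns        # fill once, then one job per clock
    speedup = t_without / t_with
    return t_without, t_with, speedup, speedup / k


t_without, t_with, s, eff = pipeline_efficiency([20, 10, 15, 20], 120)
print(t_without, t_with)           # 7800 2460
print(round(s, 2), round(eff, 4))  # 3.17 0.7927
```

The unrounded efficiency is about 79.27 %; the 79.25 % in the example comes from rounding the speed up to 3.17 before dividing by 4.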

Problems in Implementing Pipeline
1. Synchronization – Each stage must take an equal amount of time so that jobs can flow between stages without hold-up.
2. Bubbles in Pipeline – If some tasks are absent in a job, "bubbles" form in the pipeline.
3. Fault Tolerance – The system does not tolerate faults. If one of the stages in the pipeline fails for some reason, the entire pipeline is upset.
4. Intertask Communication – The time to transmit a job between pipeline stages should be much smaller than the time taken to do a task.
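
The cost of the synchronization problem can be illustrated by asking how busy each stage is when the clock is set by the slowest one. This is a sketch under that assumption, using the task times from Ex. 2; the name `stage_utilization` is hypothetical.

```python
def stage_utilization(task_times_ns):
    """Fraction of each clock cycle in which a stage does useful work.

    Assumes the clock period equals the slowest task time, so faster
    stages sit idle for the remainder of every cycle.
    """
    clock_ns = max(task_times_ns)
    return [t / clock_ns for t in task_times_ns]


util = stage_utilization([20, 10, 15, 20])
print(util)                   # [1.0, 0.5, 0.75, 1.0]
print(sum(util) / len(util))  # 0.8125
```

The average utilization, 0.8125, is the efficiency this pipeline approaches as the number of jobs grows, since the limiting speed up is 65 / 20 = 3.25 against an ideal of 4.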

Ex. 3
A 4-stage pipelined adder is to be designed. The times taken by the pipeline stages are 3 ns, 3 ns, 10 ns and 5 ns.
a) What should be the clock frequency to drive the adder?
b) What is the time taken to add 100 pairs of operands?
c) If 20 operands are zero (at random), what is the time taken to add 100 pairs of operands?
d) What is the efficiency of pipeline addition if all the operands are non-zero?
e) What is the efficiency in case (c)?

a) We should design for the slowest stage in the pipeline.
Therefore, clock period = 10 ns and clock frequency = 1 / 10 ns = 100 MHz.

b) Time taken = (k + n - 1) tp = (4 + 100 - 1) X 10 ns = 1030 ns

c) Same as (b), as there is no method of detecting zeros in pipelining.

d) Efficiency = Actual speed up / Ideal speed up
Ideal speed up = No. of stages in the pipeline = 4
Time taken per addition without pipeline = 3 + 3 + 10 + 5 = 21 ns
Actual speed up = Time taken without pipeline / Time taken with pipeline = (100 X 21) / 1030 = 2100 / 1030 = 2.04
Therefore, Efficiency = 2100 / (1030 X 4) = 0.5097
% Efficiency = 51 %

e) In case (c), the 20 additions with a zero operand are taken as work that can be skipped without the pipeline, leaving 80 additions (80 X 21 = 1680 ns), while the pipeline still takes 1030 ns.
Actual speed up = (21 X 80) / 1030 = 1.63
Therefore, Efficiency = 1.63 / 4 = 0.408
% Efficiency = 40.8 %
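
Parts (a) to (e) can be reproduced with a short Python sketch. The function name `adder_pipeline` and its `useful_pairs` parameter are illustrative, not from the slides; `useful_pairs` models the example's treatment of case (e), where only 80 of the 100 additions count as work for the non-pipelined adder.

```python
def adder_pipeline(stage_times_ns, num_pairs, useful_pairs=None):
    """Clock frequency, pipelined time, speed up and efficiency for a pipelined adder.

    useful_pairs: additions the non-pipelined adder actually has to perform
    (defaults to num_pairs). The pipeline always processes all num_pairs.
    """
    k = len(stage_times_ns)
    clock_ns = max(stage_times_ns)            # slowest stage sets the clock
    freq_mhz = 1000.0 / clock_ns              # a 1 ns period corresponds to 1000 MHz
    t_pipe = (k + num_pairs - 1) * clock_ns
    useful = num_pairs if useful_pairs is None else useful_pairs
    t_seq = useful * sum(stage_times_ns)
    speedup = t_seq / t_pipe
    return {
        "freq_mhz": freq_mhz,
        "t_pipe_ns": t_pipe,
        "speedup": speedup,
        "efficiency": speedup / k,
    }


stages = [3, 3, 10, 5]
all_nonzero = adder_pipeline(stages, 100)                 # parts (a), (b), (d)
with_zeros = adder_pipeline(stages, 100, useful_pairs=80)  # parts (c), (e)
print(all_nonzero["freq_mhz"], all_nonzero["t_pipe_ns"])  # 100.0 1030
print(round(all_nonzero["efficiency"], 4))                # 0.5097
print(round(with_zeros["efficiency"], 3))                 # 0.408
```

The low efficiency in both cases comes mainly from the 10 ns stage: the other three stages are idle for at least half of every clock cycle.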
Thank You
