Unit-5-Parallel Processing

The document discusses parallel processing techniques aimed at increasing computational speed, including pipeline and vector processing. It outlines Flynn's classification of computer architectures, detailing SISD, SIMD, MISD, and MIMD systems, and explains the concept of pipelining as a method to decompose processes into suboperations for concurrent execution. Additionally, it addresses pipeline conflicts and their resolutions, as well as the implementation of instruction pipelines in RISC architectures.

Uploaded by

ShivuAg


PIPELINE AND VECTOR PROCESSING

Parallel processing:
• Parallel processing is a term for a large class of techniques that provide simultaneous data-processing tasks in order to increase the computational speed of a computer system.

• The system may have two or more ALUs so that it can execute two or more instructions at the same time.

• The system may have two or more processors operating concurrently.

• It can be achieved with multiple functional units that perform the same or different operations simultaneously.

• Example of parallel processing:

– Multiple functional units:

Separate the execution unit into eight functional units operating in parallel.

• Parallel processing can be classified in a variety of ways:

– Internal organization of the processor

– Interconnection structure between processors

– Flow of information through the system

UNIT-V
Architectural Classification:

– Flynn's classification

» Based on the multiplicity of Instruction Streams and Data Streams

» Instruction Stream

• Sequence of Instructions read from memory

» Data Stream

• Operations performed on the data in the processor

• SISD represents an organization containing a single control unit, a processor unit, and a memory unit. Instructions are executed sequentially, and the system may or may not have internal parallel processing capabilities.

• SIMD represents an organization that includes many processing units under the supervision of a common control unit.

• The MISD structure is of only theoretical interest, since no practical system has been constructed using this organization.

• MIMD organization refers to a computer system capable of processing several programs at the same time.

The main difference between a multicomputer system and a multiprocessor system is that the multiprocessor system is controlled by one operating system that provides interaction between processors, and all the components of the system cooperate in the solution of a problem.
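Flynn's four classes follow directly from whether there is one or more than one instruction stream and data stream. A minimal sketch of that mapping, where the function name `flynn_class` and its integer arguments are illustrative assumptions and not part of the original notes:

```python
def flynn_class(instruction_streams: int, data_streams: int) -> str:
    """Return the Flynn class name for the given stream counts.

    Hypothetical helper: 'S' means a single stream, 'M' means multiple.
    """
    if instruction_streams < 1 or data_streams < 1:
        raise ValueError("a machine needs at least one stream of each kind")
    i = "S" if instruction_streams == 1 else "M"
    d = "S" if data_streams == 1 else "M"
    return f"{i}I{d}D"


if __name__ == "__main__":
    # One control unit driving eight processing units: SIMD
    print(flynn_class(1, 8))
    # Several processors, each running its own program on its own data: MIMD
    print(flynn_class(4, 4))
```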

• Parallel processing can be discussed under the following topics:

– Pipeline Processing

– Vector Processing

– Array Processors

PIPELINING:

• Pipelining is a technique of decomposing a sequential process into suboperations, with each suboperation being executed in a special dedicated segment that operates concurrently with all other segments.

• Each segment performs partial processing dictated by the way the task is partitioned.

• The result obtained from each segment is transferred to the next segment.

• The final result is obtained when the data have passed through all segments.

• Suppose we have to perform the following task:

• Each suboperation is performed in a segment within a pipeline. Each segment has one or two registers and a combinational circuit.

OPERATIONS IN EACH PIPELINE STAGE:

• General Structure of a 4-Segment Pipeline

• Space-Time Diagram

The following diagram shows 6 tasks, T1 through T6, executed in 4 segments.
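A space-time diagram of this kind can be generated with a short sketch. It assumes every segment takes exactly one clock cycle and that a new task enters the pipeline on every cycle; the function name `space_time` and the text layout are illustrative choices, not taken from the original figure.

```python
def space_time(k: int, n: int) -> list[str]:
    """Build one text row per segment showing which task occupies it in each clock cycle."""
    total_cycles = k + n - 1
    rows = []
    for segment in range(1, k + 1):
        cells = []
        for clock in range(1, total_cycles + 1):
            # Task t enters segment s at clock cycle t + s - 1
            task = clock - segment + 1
            cells.append(f"T{task}" if 1 <= task <= n else "--")
        rows.append(f"Segment {segment}: " + " ".join(cells))
    return rows


if __name__ == "__main__":
    # 6 tasks in a 4-segment pipeline, as in the diagram described above
    for row in space_time(4, 6):
        print(row)
```

Each row has k + n - 1 = 9 columns, one per clock cycle, and "--" marks a cycle in which the segment is idle.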

PIPELINE SPEEDUP:

Consider the case where a k-segment pipeline is used to execute n tasks.

• n = 6 in the previous example

• k = 4 in the previous example

• Pipelined Machine (k stages, n tasks)

– The first task T1 requires k clock cycles to complete, since there are k segments.

– The remaining n - 1 tasks complete one clock cycle apart, requiring n - 1 more clock cycles.

– Total clock cycles for n tasks = k + (n - 1) (9 in the previous example)

• Conventional Machine (Non-Pipelined)

– Cycles to complete each task without pipelining = k

– Cycles required for n tasks = nk

• Speedup (S)

– S = Non-pipelined time / Pipelined time

– For n tasks: S = nk / (k + n - 1)

– As n becomes much larger than k - 1, k + n - 1 approaches n, so S approaches nk / n = k

PIPELINE AND MULTIPLE FUNCTION UNITS:

Example:

- 4-stage pipeline

- 100 tasks to be executed

- 1 task in the non-pipelined system takes 4 clock cycles

Pipelined System : k + n - 1 = 4 + 99 = 103 clock cycles

Non-Pipelined System : n x k = 100 x 4 = 400 clock cycles

Speedup : S = 400 / 103 = 3.88
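The cycle counts and the speedup formula can be checked with a small sketch. It assumes, as the notes do, that the pipelined and non-pipelined machines use the same clock cycle; the function names are illustrative.

```python
def pipeline_cycles(k: int, n: int) -> int:
    """Clock cycles for n tasks on a k-segment pipeline: k + (n - 1)."""
    return k + (n - 1)


def nonpipeline_cycles(k: int, n: int) -> int:
    """Clock cycles for n tasks when each task takes k cycles: n * k."""
    return n * k


def speedup(k: int, n: int) -> float:
    """S = non-pipelined time / pipelined time = nk / (k + n - 1)."""
    return nonpipeline_cycles(k, n) / pipeline_cycles(k, n)


if __name__ == "__main__":
    print(pipeline_cycles(4, 100))       # 103
    print(nonpipeline_cycles(4, 100))    # 400
    print(round(speedup(4, 100), 2))     # 3.88
    # With many more tasks than segments, the speedup approaches k = 4
    print(speedup(4, 1_000_000))
```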

Types of Pipelining:

• Arithmetic Pipeline

• Instruction Pipeline

ARITHMETIC PIPELINE:

• Pipelined arithmetic units are usually found in very high-speed computers.

• They are used to implement floating-point operations.

• We will now discuss the pipeline unit for floating-point addition and subtraction.

• The inputs to the floating-point adder pipeline are two normalized floating-point numbers.

• A and B are the mantissas, and a and b are the exponents.

• Floating-point addition and subtraction can be performed in four segments.

Floating-point adder:

[1] Compare the exponents

[2] Align the mantissas

[3] Add or subtract the mantissas

[4] Normalize the result

X = A x 10^a = 0.9504 x 10^3

Y = B x 10^b = 0.8200 x 10^2

1) Compare exponents:

3 - 2 = 1

2) Align mantissas:

X = 0.9504 x 10^3

Y = 0.08200 x 10^3

3) Add mantissas:

Z = 1.0324 x 10^3

4) Normalize result:

Z = 0.10324 x 10^4
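The four segments can be traced in software with the sketch below. It is only an illustration of the arithmetic, not of the hardware: it works in base 10 like the example, uses Python's `decimal` module to avoid binary rounding, and the function name `fp_add` is an assumption.

```python
from decimal import Decimal


def fp_add(a_mant: Decimal, a_exp: int, b_mant: Decimal, b_exp: int):
    """Add two decimal floating-point numbers given as (mantissa, exponent) pairs."""
    # Segment 1: compare the exponents
    diff = a_exp - b_exp

    # Segment 2: align the mantissa that belongs to the smaller exponent
    if diff >= 0:
        b_mant = b_mant.scaleb(-diff)   # shift B right by diff digits
        exp = a_exp
    else:
        a_mant = a_mant.scaleb(diff)    # shift A right by -diff digits
        exp = b_exp

    # Segment 3: add the mantissas
    z = a_mant + b_mant

    # Segment 4: normalize so that 0.1 <= |z| < 1 (or z == 0)
    while abs(z) >= 1:
        z = z.scaleb(-1)
        exp += 1
    while z != 0 and abs(z) < Decimal("0.1"):
        z = z.scaleb(1)
        exp -= 1
    return z, exp


if __name__ == "__main__":
    # X = 0.9504 x 10^3, Y = 0.8200 x 10^2
    mant, exp = fp_add(Decimal("0.9504"), 3, Decimal("0.8200"), 2)
    print(f"Z = {mant} x 10^{exp}")
```

Subtraction follows the same path with the sign of the second mantissa negated before segment 3.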

Instruction Pipeline:

• Pipeline processing can occur not only in the data stream but in the instruction stream as well.

• An instruction pipeline reads consecutive instructions from memory while previous instructions are being executed in other segments.

• This causes the instruction fetch and execute segments to overlap and operate simultaneously.

Four Segment CPU Pipeline:

• The FI segment fetches the instruction.

• The DA segment decodes the instruction and calculates the effective address.

• The FO segment fetches the operand.

• The EX segment executes the instruction.

INSTRUCTION CYCLE:

Pipeline processing can also occur in the instruction stream. An instruction pipeline reads consecutive instructions from memory while previous instructions are being executed in other segments.

Six Phases* in an Instruction Cycle

[1] Fetch an instruction from memory

[2] Decode the instruction

[3] Calculate the effective address of the operand

[4] Fetch the operands from memory

[5] Execute the operation

[6] Store the result in the proper place

* Some instructions skip some phases

* Effective address calculation can be done as part of the decoding phase

* Storage of the operation result into a register is done automatically in the execution phase

==> 4-Stage Pipeline

[1] FI: Fetch an instruction from memory

[2] DA: Decode the instruction and calculate the effective address of the operand

[3] FO: Fetch the operand

[4] EX: Execute the operation

Pipeline Conflicts:

There are three major difficulties:

1) Resource conflicts: two segments access memory at the same time. Most of these conflicts can be resolved by using separate instruction and data memories.

2) Data dependency: an instruction depends on the result of a previous instruction, but this result is not yet available.

Example: an instruction with register indirect mode cannot proceed to fetch the operand if the previous instruction is loading the address into the register.

3) Branch difficulties: branch and other instructions (interrupt, ret, ..) that change the value of the PC.

Handling Data Dependency:

• This problem can be solved in the following ways:

– Hardware interlocks: a circuit that detects the conflict situation and delays the instruction by enough clock cycles to resolve the conflict.

– Operand forwarding: special hardware detects the conflict and avoids it by routing the data through a special path between pipeline segments.

– Delayed loads: the compiler detects the data conflict and reorders the instructions as necessary to delay the loading of the conflicting data by inserting no-operation instructions.
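The delayed-load approach can be sketched as a small compiler pass. The sketch assumes a one-cycle load delay, meaning only the instruction immediately after a load can conflict with it, and it uses a made-up instruction format of (opcode, destination register, source registers); neither the format nor the function name `insert_delay_nops` comes from the original notes.

```python
NOP = ("NOP", None, ())


def insert_delay_nops(program):
    """Insert a NOP after any LOAD whose destination is read by the next instruction."""
    scheduled = []
    for instr in program:
        _, _, sources = instr
        if scheduled:
            prev_op, prev_dest, _ = scheduled[-1]
            # Conflict: the previous LOAD has not delivered its data yet
            if prev_op == "LOAD" and prev_dest in sources:
                scheduled.append(NOP)
        scheduled.append(instr)
    return scheduled


if __name__ == "__main__":
    program = [
        ("LOAD", "R1", ()),            # R1 <- M[address 1]
        ("LOAD", "R2", ()),            # R2 <- M[address 2]
        ("ADD", "R3", ("R1", "R2")),   # R3 <- R1 + R2, needs R2 right after its load
        ("STORE", None, ("R3",)),      # M[address 3] <- R3
    ]
    for op, dest, sources in insert_delay_nops(program):
        print(op, dest or "", ",".join(sources))
```

A real compiler would first try to move an independent instruction into the delay slot and fall back to a NOP only when none is available; this sketch shows only the fallback.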

Handling of Branch Instruction:

• Prefetch the target instruction

• Branch target buffer (BTB) included in the fetch segment of the pipeline

• Branch prediction

• Delayed branch

RISC Pipeline:

• The simplicity of the instruction set is used to implement an instruction pipeline with a small number of suboperations, each executed in a single clock cycle.

Since all operations are performed in registers, there is no need for effective address calculation.

Three Segment Instruction Pipeline:

• I: Instruction Fetch

• A: ALU Operation

• E: Execute Instruction

Delayed Load:

Delayed Branch:

Let us consider the program having the following 5 instructions

