William Stallings Computer Organization and Architecture 8 Edition Processor Structure and Function

This document discusses the internal structure and function of CPUs. It covers the following key points:
1. CPUs must fetch instructions, interpret instructions, fetch data, process data, and write data. They use registers for temporary storage and processing.
2. Registers include general purpose registers, data registers, address registers, condition code registers, control/status registers, and other special purpose registers.
3. Instruction cycles involve fetching instructions from memory, then decoding and executing them, which may involve additional data fetches. Pipelining can improve performance by overlapping the stages of instruction processing.
4. Hazards like resource conflicts, data dependencies, and branch mispredictions can stall the pipeline and reduce efficiency unless addressed.

Uploaded by

Rehman Hazrat

William Stallings

Computer Organization
and Architecture
8th Edition
Chapter 12
Processor Structure and
Function

CPU Structure
CPU must:
Fetch instructions
Interpret instructions
Fetch data
Process data
Write data

CPU With Systems Bus

CPU Internal Structure

Registers
CPU must have some working space
(temporary storage)
Called registers
Number and function vary between
processor designs
One of the major design decisions
Top level of memory hierarchy

User Visible Registers

General Purpose
Data
Address
Condition Codes

General Purpose Registers (1)

May be true general purpose

May be restricted
May be used for data or addressing
Data
Accumulator

Addressing
Segment

General Purpose Registers (2)

Make them general purpose

Increase flexibility and programmer options
Increase instruction size & complexity

Make them specialized

Smaller (faster) instructions
Less flexibility

How Many GP Registers?

Typically between 8 and 32
Fewer = more memory references
More does not noticeably reduce memory references
and takes up processor real estate
See also RISC

How big?
Large enough to hold full address
Large enough to hold full word
Often possible to combine two data
registers
C programming
long long int a;
long int a;
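
As a rough illustration of combining two data registers, the sketch below splits a 64-bit value across a pair of registers and joins it again. The 32-bit register width and the names `split_pair` and `join_pair` are assumptions made for this example, not something the slides specify.

```python
MASK32 = 0xFFFFFFFF  # assumed register width: 32 bits

def split_pair(value):
    """Split a 64-bit unsigned value into (high, low) 32-bit halves."""
    return (value >> 32) & MASK32, value & MASK32

def join_pair(high, low):
    """Rebuild the 64-bit value held in a (high, low) register pair."""
    return ((high & MASK32) << 32) | (low & MASK32)

high, low = split_pair(0x1122334455667788)
print(hex(high), hex(low))
print(hex(join_pair(high, low)))
```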

Condition Code Registers

Sets of individual bits
e.g. result of last operation was zero

Can be read (implicitly) by programs

e.g. Jump if zero

Can not (usually) be set by programs
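
The condition code idea above can be sketched in a few lines. This is a minimal model, assuming a 32-bit machine and the common N, Z, C, V flag set; `add_with_flags` is a hypothetical helper, not an instruction of any particular processor.

```python
MASK32 = 0xFFFFFFFF  # assumed word size: 32 bits

def add_with_flags(a, b):
    """Add two 32-bit values and return (result, flags).

    Flags are set as a side effect of the operation, the way hardware
    sets condition codes, rather than being written by the program.
    """
    a &= MASK32
    b &= MASK32
    full = a + b
    result = full & MASK32
    flags = {
        "N": (result >> 31) & 1,                        # sign of result
        "Z": int(result == 0),                          # result was zero
        "C": int(full > MASK32),                        # carry out of bit 31
        "V": (((a ^ result) & (b ^ result)) >> 31) & 1  # signed overflow
    }
    return result, flags

result, flags = add_with_flags(0xFFFFFFFF, 1)
if flags["Z"]:  # models "jump if zero": the branch reads the flag implicitly
    print("branch taken, result =", result)
```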

Control & Status Registers

Program Counter
Instruction Register (IR)
Memory Address Register
Memory Buffer Register

Revision: what do these all do?

Program Status Word

A set of bits
Includes Condition Codes
Sign of last result
Zero
Carry
Equal
Overflow
Interrupt enable/disable
Supervisor

Supervisor Mode

Intel ring zero

Kernel mode
Allows privileged instructions to execute
Used by operating system
Not available to user programs

Other Registers
May have registers pointing to:
Process control blocks (see O/S)
Interrupt Vectors (see O/S)

N.B. CPU design and operating system design are closely linked

Example Register Organizations

Instruction Cycle
Revision
Stallings Chapter 3

Indirect Cycle
May require memory access to fetch
operands
Indirect addressing requires more
memory accesses
Can be thought of as additional instruction
subcycle

Instruction Cycle with Indirect

Instruction Cycle State Diagram

Data Flow (Instruction Fetch)

Depends on CPU design
In general:
Fetch
PC contains address of next instruction
Address moved to MAR
Address placed on address bus
Control unit requests memory read
Result placed on data bus, copied to MBR,
then to IR
Meanwhile PC incremented by 1
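
The fetch sequence above can be sketched as register transfers. This is a toy model, assuming word-addressed memory held in a Python list; the class name `CPU` and its fields are illustrative only, and the PC increment is shown as a sequential step although hardware performs it in parallel with the memory read.

```python
class CPU:
    """Toy model of the registers involved in instruction fetch."""

    def __init__(self, memory):
        self.memory = memory  # assumed: list indexed by word address
        self.pc = 0           # program counter
        self.mar = 0          # memory address register
        self.mbr = 0          # memory buffer register
        self.ir = 0           # instruction register

    def fetch(self):
        self.mar = self.pc                # address moved to MAR
        self.mbr = self.memory[self.mar]  # memory read, result copied to MBR
        self.pc += 1                      # PC incremented by 1
        self.ir = self.mbr                # MBR copied to IR
        return self.ir

cpu = CPU([0x1A, 0x2B, 0x3C])
print(hex(cpu.fetch()), cpu.pc)
```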

Data Flow (Data Fetch)

IR is examined
If indirect addressing, indirect cycle is
performed
Right most N bits of MBR transferred to MAR
Control unit requests memory read
Result (address of operand) moved to MBR
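
A small sketch of the indirect cycle, assuming the address field occupies the rightmost `address_bits` bits of the word in the MBR. The function name and the 12-bit field used in the example are assumptions for illustration.

```python
def indirect_cycle(memory, mbr, address_bits):
    """Perform one indirect cycle and return the new (mar, mbr)."""
    mar = mbr & ((1 << address_bits) - 1)  # rightmost N bits of MBR -> MAR
    mbr = memory[mar]                      # memory read: operand address -> MBR
    return mar, mbr

# Instruction word 0x3005: address field 0x005 points at a location
# that holds the operand's address (0x2A).
memory = {0x005: 0x2A}
print(indirect_cycle(memory, 0x3005, address_bits=12))
```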

Data Flow (Fetch Diagram)

Data Flow (Indirect Diagram)

Data Flow (Execute)

May take many forms
Depends on instruction being executed
May include
Memory read/write
Input/Output
Register transfers
ALU operations

Data Flow (Interrupt)

Simple
Predictable
Current PC saved to allow resumption
after interrupt
Contents of PC copied to MBR
Special memory location (e.g. stack
pointer) loaded to MAR
MBR written to memory
PC loaded with address of interrupt
handling routine
Next instruction (first of interrupt handler)
can be fetched
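
The interrupt data flow above can be sketched the same way. This is a simplified model that assumes the PC is saved on a downward-growing stack; the `state` dictionary, the `sp` field, and `handler_address` are hypothetical names for this example.

```python
def interrupt_cycle(state, memory, handler_address):
    """Save the current PC and redirect execution to the handler."""
    state["mbr"] = state["pc"]           # contents of PC copied to MBR
    state["sp"] -= 1                     # assumed: stack grows downwards
    state["mar"] = state["sp"]           # save location loaded to MAR
    memory[state["mar"]] = state["mbr"]  # MBR written to memory
    state["pc"] = handler_address        # PC loaded with handler address

state = {"pc": 0x120, "sp": 0x200, "mar": 0, "mbr": 0}
memory = {}
interrupt_cycle(state, memory, handler_address=0x40)
print(hex(state["pc"]), {hex(k): hex(v) for k, v in memory.items()})
```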

Data Flow (Interrupt Diagram)

Prefetch
Fetch accesses main memory
Execution usually does not access main
memory
Can fetch next instruction during
execution of current instruction
Called instruction prefetch

Improved Performance
But not doubled:
Fetch usually shorter than execution
Prefetch more than one instruction?

Any jump or branch means that prefetched instructions are not the required instructions

Add more stages to improve performance

Pipelining

Fetch instruction
Decode instruction
Calculate operands (i.e. EAs)
Fetch operands
Execute instructions
Write result

Overlap these operations

Two Stage Instruction Pipeline

Timing Diagram for Instruction Pipeline Operation

The Effect of a Conditional Branch on Instruction Pipeline Operation

Six Stage
Instruction Pipeline

Alternative Pipeline Depiction

Speedup Factors
with Instruction
Pipelining
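
The usual idealized speedup estimate can be worked through numerically. The sketch assumes k stages of equal duration tau and n instructions with no branches or stalls, so the pipelined time is (k + (n - 1)) * tau against n * k * tau without pipelining. The function names are illustrative.

```python
def pipeline_time(n, k, tau=1.0):
    """Time to run n instructions on an ideal k-stage pipeline."""
    return (k + (n - 1)) * tau

def speedup(n, k):
    """Ratio of unpipelined time (n * k) to pipelined time (k + n - 1)."""
    return (n * k) / (k + (n - 1))

print(pipeline_time(9, 6))  # 9 instructions, 6 stages
for n in (1, 10, 100, 1000):
    print(n, round(speedup(n, 6), 2))
```

Under these assumptions the speedup approaches k as n grows, which is why deeper pipelines look attractive until hazards and stage overhead are taken into account.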

Pipeline Hazards
Pipeline, or some portion of pipeline, must
stall
Also called pipeline bubble
Types of hazards
Resource
Data
Control

Resource Hazards

Two (or more) instructions in pipeline need same resource

Executed in serial rather than parallel for part of pipeline
Also called structural hazard
E.g. Assume simplified five-stage pipeline
Each stage takes one clock cycle

Ideal case is new instruction enters pipeline each clock cycle

Assume main memory has single port
Assume instruction fetches and data reads and writes performed
one at a time
Ignore the cache
Operand read or write cannot be performed in parallel with
instruction fetch
Fetch instruction stage must idle for one cycle fetching I3
E.g. multiple instructions ready to enter execute instruction phase
Single ALU
One solution: increase available resources
Multiple main memory ports
Multiple ALUs

Data Hazards

Conflict in access of an operand location

Two instructions to be executed in sequence
Both access a particular memory or register operand
If in strict sequence, no problem occurs
If in a pipeline, operand value could be updated so as to
produce different result from strict sequential execution
E.g. x86 machine instruction sequence:
ADD EAX, EBX    /* EAX = EAX + EBX */
SUB ECX, EAX    /* ECX = ECX - EAX */

ADD instruction does not update EAX until end of stage 5, at clock cycle 5
SUB instruction needs value at beginning of its stage 2, at clock cycle 4
Pipeline must stall for two clock cycles
Without special hardware and specific avoidance
algorithms, results in inefficient pipeline usage

Data Hazard Diagram

Types of Data Hazard

Read after write (RAW), or true dependency
An instruction modifies a register or memory location
Succeeding instruction reads data in that location
Hazard if read takes place before write complete

Write after read (WAR), or antidependency

An instruction reads a register or memory location
Succeeding instruction writes to location
Hazard if write completes before read takes place

Write after write (WAW), or output dependency

Two instructions both write to same location
Hazard if writes take place in reverse of the intended order

Previous example is RAW hazard
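
The three dependency types can be told apart by comparing the read and write sets of two instructions in program order. The sketch below is a simplification: it tracks only the named registers (the flags register is ignored), and it reports potential dependencies, since whether a stall actually occurs depends on pipeline timing. The dictionary layout and function name are assumptions for this example.

```python
def classify_hazards(first, second):
    """Return the dependency types between two instructions in program order.

    Each instruction is a dict with 'reads' and 'writes' sets of register names.
    """
    hazards = []
    if first["writes"] & second["reads"]:
        hazards.append("RAW")  # second reads what first writes
    if first["reads"] & second["writes"]:
        hazards.append("WAR")  # second writes what first reads
    if first["writes"] & second["writes"]:
        hazards.append("WAW")  # both write the same location
    return hazards

# ADD EAX, EBX followed by SUB ECX, EAX
add = {"reads": {"EAX", "EBX"}, "writes": {"EAX"}}
sub = {"reads": {"ECX", "EAX"}, "writes": {"ECX"}}
print(classify_hazards(add, sub))
```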

Resource Hazard Diagram

Control Hazard

Also known as branch hazard
Pipeline makes wrong decision on branch
prediction
Brings instructions into pipeline that must
subsequently be discarded
Dealing with Branches
Multiple Streams
Prefetch Branch Target
Loop buffer
Branch prediction
Delayed branching

Multiple Streams
Have two pipelines
Prefetch each branch into a separate
pipeline
Use appropriate pipeline
Leads to bus & register contention
Multiple branches lead to further pipelines
being needed

Prefetch Branch Target

Target of branch is prefetched in addition
to instructions following branch
Keep target until branch is executed
Used by IBM 360/91

Loop Buffer

Very fast memory

Maintained by fetch stage of pipeline
Check buffer before fetching from memory
Very good for small loops or jumps
cf. cache
Used by CRAY-1

Loop Buffer Diagram

Branch Prediction (1)

Predict never taken
Assume that jump will not happen
Always fetch next instruction
68020 & VAX 11/780
VAX will not prefetch after branch if a page
fault would result (O/S v CPU design)

Predict always taken

Assume that jump will happen
Always fetch target instruction

Branch Prediction (2)

Predict by Opcode
Some instructions are more likely to result in a
jump than others
Can get up to 75% success

Taken/Not taken switch

Based on previous history
Good for loops
Refined by two-level or correlation-based branch
history

Correlation-based
In loop-closing branches, history is good predictor
In more complex structures, branch direction
correlates with that of related branches
Use recent branch history as well
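
A history-based taken/not taken switch is often built as a two-bit saturating counter per branch. The sketch below shows that common scheme; it is not necessarily the exact state machine used in the slides' state diagram, and the class and function names are illustrative.

```python
class TwoBitPredictor:
    """Two-bit saturating counter: 0 and 1 predict not taken, 2 and 3 predict taken."""

    def __init__(self):
        self.counter = 0  # assumed initial state: strongly not taken

    def predict(self):
        return self.counter >= 2

    def update(self, taken):
        """Move one step towards the actual outcome, saturating at 0 and 3."""
        if taken:
            self.counter = min(3, self.counter + 1)
        else:
            self.counter = max(0, self.counter - 1)

def accuracy(outcomes):
    """Fraction of branch outcomes predicted correctly, in order."""
    predictor = TwoBitPredictor()
    correct = 0
    for taken in outcomes:
        correct += predictor.predict() == taken
        predictor.update(taken)
    return correct / len(outcomes)

# A loop-closing branch: taken nine times, then falls through, repeated three times.
loop_branch = ([True] * 9 + [False]) * 3
print(accuracy(loop_branch))
```

Because two wrong guesses in a row are needed to flip the prediction, the single not-taken outcome at loop exit does not disturb the prediction for the next run of the loop.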

Branch Prediction (3)

Delayed Branch
Do not take jump until you have to
Rearrange instructions

Branch Prediction Flowchart

Branch Prediction State Diagram

Dealing With
Branches

Intel 80486 Pipelining

Fetch
From cache or external memory
Put in one of two 16-byte prefetch buffers
Fill buffer with new data as soon as old data consumed
Average 5 instructions fetched per load
Independent of other stages to keep buffers full

Decode stage 1
Opcode & address-mode info
At most first 3 bytes of instruction
Can direct D2 stage to get rest of instruction

Decode stage 2
Expand opcode into control signals
Computation of complex address modes

Execute
ALU operations, cache access, register update

Writeback
Update registers & flags
Results sent to cache & bus interface write buffers

80486 Instruction Pipeline Examples

Pentium 4 Registers

EFLAGS Register

Control Registers

MMX Register Mapping

MMX uses several 64 bit data types
Use 3 bit register address fields
8 registers

No MMX specific registers

Aliasing to lower 64 bits of existing floating
point registers

Mapping of MMX Registers to Floating-Point Registers

Pentium Interrupt Processing

Interrupts
Maskable
Nonmaskable

Exceptions
Processor detected
Programmed

Interrupt vector table

Each interrupt type assigned a number
Index to vector table
256 * 32 bit interrupt vectors

5 priority classes
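
Indexing the vector table is simple arithmetic: with 256 vectors of 32 bits each, entry n sits 4 * n bytes from the start of the table. The sketch assumes a table starting at address 0 by default; `table_base` is a hypothetical parameter added for illustration.

```python
VECTOR_COUNT = 256
VECTOR_SIZE_BYTES = 4  # each vector is 32 bits

def vector_address(number, table_base=0):
    """Byte address of the table entry for a given interrupt number."""
    if not 0 <= number < VECTOR_COUNT:
        raise ValueError("interrupt number must be in 0..255")
    return table_base + number * VECTOR_SIZE_BYTES

print(vector_address(0), vector_address(255))
print(VECTOR_COUNT * VECTOR_SIZE_BYTES, "bytes in the whole table")
```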

ARM Attributes
RISC
Moderate array of uniform registers

More than most CISC, less than many RISC

Load/store model

Operations are performed on operands in registers only

Uniform fixed-length instruction

32 bits for the standard set, 16 bits for Thumb

Shift or rotation can preprocess source registers

Separate ALU and shifter units

Small number of addressing modes

All load/store addresses come from registers and instruction fields

No indirect or indexed addressing involving values in memory

Auto-increment and auto-decrement addressing

Improve loops

Conditional execution of instructions minimizes conditional branches
Pipeline flushing is reduced

Simplified ARM Organization

ARM Processor Organization

Many variations depending on ARM version
Data exchanged between processor and memory
through data bus
Data item (load/store) or instruction (fetch)
Instructions go through decoder before execution
Pipeline and control signal generation in control
unit
Data goes to register file
Set of 32 bit registers
Byte & halfword twos complement data sign extended

Typically two source and one result register

Rotation or shift before ALU

ARM Processor Modes

User
Privileged
6 modes
OS can tailor systems software use
Some registers dedicated to each privileged mode
Swifter context changes

Exception
5 of privileged modes
Entered on given exceptions
Substitute some registers for user registers
Avoid corruption

Privileged Modes
System Mode
Not exception
Uses same registers as User mode
Can be interrupted by

Supervisor mode
OS
Software interrupt used to invoke operating system services

Abort mode
memory faults

Undefined mode
Attempt to execute an instruction not supported by the integer core
or coprocessors

Fast interrupt mode

Interrupt signal from designated fast interrupt source
Fast interrupt cannot be interrupted
May interrupt normal interrupt

Interrupt mode
Interrupt signal from any other interrupt source

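ARM Register Organization Table

Privileged modes: System, Supervisor, Abort, Undefined, Interrupt, Fast Interrupt
Exception modes: Supervisor, Abort, Undefined, Interrupt, Fast Interrupt
A "-" means the mode uses the same register as User mode

User/System  Supervisor  Abort     Undefined  Interrupt  Fast Interrupt
R0 to R7     -           -         -          -          -
R8           -           -         -          -          R8_fiq
R9           -           -         -          -          R9_fiq
R10          -           -         -          -          R10_fiq
R11          -           -         -          -          R11_fiq
R12          -           -         -          -          R12_fiq
R13(SP)      R13_svc     R13_abt   R13_und    R13_irq    R13_fiq
R14(LR)      R14_svc     R14_abt   R14_und    R14_irq    R14_fiq
R15(PC)      -           -         -          -          -
CPSR         -           -         -          -          -
(no SPSR)    SPSR_svc    SPSR_abt  SPSR_und   SPSR_irq   SPSR_fiq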

ARM Register Organization

37 x 32-bit registers
31 general-purpose registers
Some have special purposes
E.g. program counters

Six program status registers

Registers in partially overlapping banks
Processor mode determines bank

16 numbered registers and one or two program status registers visible
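
The banking scheme can be checked with a short sketch that maps a mode and a register number to the physical register in use. The mode abbreviations follow the register suffixes (`svc`, `abt`, `und`, `irq`, `fiq`); `usr` and `sys` and the function name are labels chosen for this example.

```python
# Register numbers that have a private (banked) copy in each exception mode.
BANKED = {
    "fiq": set(range(8, 15)),  # R8 to R14
    "svc": {13, 14},
    "abt": {13, 14},
    "und": {13, 14},
    "irq": {13, 14},
}
MODES = ["usr", "sys", "svc", "abt", "und", "irq", "fiq"]

def physical_register(mode, n):
    """Name of the physical register seen as Rn in the given mode."""
    if mode not in MODES:
        raise ValueError("unknown mode: " + mode)
    if not 0 <= n <= 15:
        raise ValueError("register number must be in 0..15")
    if n in BANKED.get(mode, set()):
        return f"R{n}_{mode}"
    return f"R{n}"  # shared with User mode

general = {physical_register(m, n) for m in MODES for n in range(16)}
status = {"CPSR"} | {f"SPSR_{m}" for m in BANKED}
print(len(general), len(status), len(general) + len(status))
```

Counting the distinct names reproduces the totals on the slide: 31 general-purpose registers plus six program status registers, 37 in all.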

General Register Usage

R13 normally stack pointer (SP)
Each exception mode has its own R13

R14 link register (LR)

Subroutine and exception mode return
address

R15 program counter

CPSR
CPSR is the current program status register
Exception modes have dedicated SPSR

16 msb are user flags

Condition codes (N,Z,C,V)
Q overflow or saturation in some SIMD
instructions
J Jazelle (8 bit) instructions
GE[3:0] SIMD instructions use bits [19:16] as greater than or
equal flags

16 lsb system flags for privileged modes

E endian
Interrupt disable
T Normal or Thumb instruction
Mode

ARM CPSR and SPSR
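
The fields listed for the CPSR can be pulled out of a 32-bit value with shifts and masks. Only the GE position [19:16] is given in the slides; the other bit positions and the mode encodings below follow the ARM architecture documentation as I understand it, and should be checked against the manual for the core in question. The function name is illustrative.

```python
MODE_NAMES = {
    0b10000: "User",
    0b10001: "Fast Interrupt",
    0b10010: "Interrupt",
    0b10011: "Supervisor",
    0b10111: "Abort",
    0b11011: "Undefined",
    0b11111: "System",
}

def decode_cpsr(cpsr):
    """Split a 32-bit CPSR value into its named fields."""
    return {
        "N": (cpsr >> 31) & 1,     # negative
        "Z": (cpsr >> 30) & 1,     # zero
        "C": (cpsr >> 29) & 1,     # carry
        "V": (cpsr >> 28) & 1,     # overflow
        "Q": (cpsr >> 27) & 1,     # saturation
        "J": (cpsr >> 24) & 1,     # Jazelle state
        "GE": (cpsr >> 16) & 0xF,  # SIMD greater than or equal flags
        "E": (cpsr >> 9) & 1,      # endianness
        "A": (cpsr >> 8) & 1,      # imprecise abort disable
        "I": (cpsr >> 7) & 1,      # IRQ disable
        "F": (cpsr >> 6) & 1,      # FIQ disable
        "T": (cpsr >> 5) & 1,      # Thumb state
        "mode": MODE_NAMES.get(cpsr & 0x1F, "reserved"),
    }

print(decode_cpsr(0x600000D3))
```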

ARM Interrupt (Exception) Processing

More than one exception allowed

Seven types
Execution forced from exception vectors
Multiple exceptions handled in priority order
Processor halts execution after current
instruction
Processor state preserved in SPSR for
exception
Address of instruction about to execute put in
link register
Return by moving SPSR to CPSR and R14 to PC

Foreground Reading
Processor examples
Stallings Chapter 12
Manufacturer web sites & specs
