Lecture 2
Computer Abstractions and Technology (Part II)
CPU Time
CPU Time = CPU Clock Cycles × Clock Cycle Time
         = CPU Clock Cycles / Clock Rate

- Performance improved by
  - Reducing number of clock cycles
  - Increasing clock rate
- Hardware designer must often trade off clock rate against cycle count
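A minimal Python sketch of the relation above, using the Computer A figures from the next slide (2 GHz clock, 20 × 10⁹ clock cycles):

```python
# CPU Time = CPU Clock Cycles x Clock Cycle Time = CPU Clock Cycles / Clock Rate
clock_cycles = 20e9          # Computer A on the next slide: 20 billion cycles
clock_rate = 2e9             # 2 GHz clock

cpu_time = clock_cycles / clock_rate
print(f"CPU time = {cpu_time:.0f} s")    # 10 s
```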
CPU Time Example
- Computer A: 2 GHz clock, 10 s CPU time
- Designing Computer B
  - Aim for 6 s CPU time
  - Can use a faster clock, but this causes 1.2 × as many clock cycles
  - How fast must the Computer B clock be?
Clock Rate_B = Clock Cycles_B / CPU Time_B = (1.2 × Clock Cycles_A) / 6 s

Clock Cycles_A = CPU Time_A × Clock Rate_A = 10 s × 2 GHz = 20 × 10⁹

Clock Rate_B = (1.2 × 20 × 10⁹) / 6 s = (24 × 10⁹) / 6 s = 4 GHz
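The same calculation as a short Python sketch, with all values taken from the example above:

```python
# Computer A: 2 GHz clock, 10 s CPU time.
# Computer B: targets 6 s but needs 1.2x as many clock cycles as A.
clock_rate_a = 2e9            # Hz
cpu_time_a = 10.0             # s
cpu_time_b = 6.0              # s (target)
cycle_factor = 1.2

clock_cycles_a = cpu_time_a * clock_rate_a        # 20e9 cycles
clock_cycles_b = cycle_factor * clock_cycles_a    # 24e9 cycles
clock_rate_b = clock_cycles_b / cpu_time_b        # 4e9 Hz
print(f"Computer B needs a {clock_rate_b / 1e9:.0f} GHz clock")
```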
Instruction Count and CPI
Clock Cycles = Instruction Count × Cycles per Instruction (CPI)

CPU Time = Instruction Count × CPI × Clock Cycle Time
         = Instruction Count × CPI / Clock Rate

- Instruction Count for a program
  - Determined by program, ISA and compiler
- Average cycles per instruction (CPI)
  - Determined by CPU hardware
  - If different instructions have different CPI
    - Average CPI affected by instruction mix
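A minimal sketch of the formula above; the instruction count, average CPI, and clock rate here are hypothetical, chosen only for illustration:

```python
# CPU Time = Instruction Count x CPI / Clock Rate
instruction_count = 1e9      # hypothetical: 1 billion instructions
avg_cpi = 2.0                # hypothetical average cycles per instruction
clock_rate = 2e9             # hypothetical: 2 GHz

cpu_time = instruction_count * avg_cpi / clock_rate
print(f"CPU time = {cpu_time:.2f} s")    # 1.00 s
```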
CPI Example
- Computer A: Cycle Time = 250 ps, CPI = 2.0
- Computer B: Cycle Time = 500 ps, CPI = 1.2
- Same ISA
- Which is faster, and by how much?
CPU Time_A = Instruction Count × CPI_A × Cycle Time_A
           = I × 2.0 × 250 ps = I × 500 ps              ← A is faster…

CPU Time_B = Instruction Count × CPI_B × Cycle Time_B
           = I × 1.2 × 500 ps = I × 600 ps

CPU Time_B / CPU Time_A = (I × 600 ps) / (I × 500 ps) = 1.2   ← …by this much
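The comparison as a short Python sketch; the instruction count cancels in the ratio, so any value works:

```python
instruction_count = 1e9                   # cancels out in the ratio
cpi_a, cycle_time_a = 2.0, 250e-12        # Computer A: CPI 2.0, 250 ps
cpi_b, cycle_time_b = 1.2, 500e-12        # Computer B: CPI 1.2, 500 ps

cpu_time_a = instruction_count * cpi_a * cycle_time_a
cpu_time_b = instruction_count * cpi_b * cycle_time_b
print(f"CPU Time B / CPU Time A = {cpu_time_b / cpu_time_a:.1f}")   # 1.2, so A is faster
```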
CPI in More Detail
- If different instruction classes take different numbers of cycles:

  Clock Cycles = Σ (i = 1 to n) CPI_i × Instruction Count_i

- Weighted average CPI:

  CPI = Clock Cycles / Instruction Count
      = Σ (i = 1 to n) CPI_i × (Instruction Count_i / Instruction Count)

  (Instruction Count_i / Instruction Count is the relative frequency of class i)
CPI Example
- Alternative compiled code sequences using instructions in classes A, B, C

  Class              |  A  |  B  |  C
  CPI for class      |  1  |  2  |  3
  IC in sequence 1   |  2  |  1  |  2
  IC in sequence 2   |  4  |  1  |  1

- Sequence 1: IC = 5
  - Clock Cycles = 2×1 + 1×2 + 2×3 = 10
  - Avg. CPI = 10/5 = 2.0
- Sequence 2: IC = 6
  - Clock Cycles = 4×1 + 1×2 + 1×3 = 9
  - Avg. CPI = 9/6 = 1.5
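A minimal Python sketch of the weighted-CPI calculation from the previous slide, applied to the two sequences above:

```python
# Clock Cycles = sum over classes of (CPI_i x IC_i); Avg. CPI = Clock Cycles / total IC
def cycles_and_avg_cpi(class_cpi, instruction_counts):
    cycles = sum(cpi * ic for cpi, ic in zip(class_cpi, instruction_counts))
    return cycles, cycles / sum(instruction_counts)

class_cpi = [1, 2, 3]                                    # classes A, B, C
for name, ic in [("Sequence 1", [2, 1, 2]), ("Sequence 2", [4, 1, 1])]:
    cycles, avg_cpi = cycles_and_avg_cpi(class_cpi, ic)
    print(f"{name}: clock cycles = {cycles}, avg CPI = {avg_cpi:.1f}")
# Sequence 1: clock cycles = 10, avg CPI = 2.0
# Sequence 2: clock cycles = 9, avg CPI = 1.5
```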
Performance Summary
The BIG Picture
CPU Time = (Instructions / Program) × (Clock cycles / Instruction) × (Seconds / Clock cycle)

- Performance depends on
  - Algorithm: affects IC, possibly CPI
  - Programming language: affects IC, CPI
  - Compiler: affects IC, CPI
  - Instruction set architecture: affects IC, CPI, Tc
§1.5 The Power Wall
Power Trends
- In CMOS IC technology:

  Power = Capacitive load × Voltage² × Frequency
  (historical trend: power ×30, supply voltage 5 V → 1 V, clock frequency ×1000)
Reducing Power
- Suppose a new CPU has
  - 85% of capacitive load of old CPU
  - 15% voltage and 15% frequency reduction

  P_new / P_old = [ C_old × 0.85 × (V_old × 0.85)² × F_old × 0.85 ] / [ C_old × V_old² × F_old ]
                = 0.85⁴ ≈ 0.52

- The power wall
  - We can’t reduce voltage further
  - We can’t remove more heat
- How else can we improve performance?
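A minimal Python sketch of the dynamic-power relation P = C × V² × f, using the scaling factors from the example above:

```python
# Dynamic power in CMOS: P = Capacitive Load x Voltage^2 x Frequency
def relative_power(cap_scale, volt_scale, freq_scale):
    """Power of the new design as a fraction of the old one."""
    return cap_scale * volt_scale**2 * freq_scale

# New CPU: 85% of the capacitance, 15% lower voltage, 15% lower frequency
ratio = relative_power(0.85, 0.85, 0.85)
print(f"P_new / P_old = {ratio:.2f}")     # 0.85**4, about 0.52
```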
§1.6 The Sea Change: The Switch to Multiprocessors
Uniprocessor Performance
[Figure: growth in uniprocessor performance over time, constrained by power, instruction-level parallelism, and memory latency]
Multiprocessors
- Multicore microprocessors
  - More than one processor per chip
- Requires explicitly parallel programming
  - Compare with instruction-level parallelism
    - Hardware executes multiple instructions at once
    - Hidden from the programmer
  - Hard to do
    - Programming for performance
    - Load balancing
    - Optimizing communication and synchronization
§1.7 Real Stuff: The AMD Opteron X4
Manufacturing ICs
- Yield: proportion of working dies per wafer
AMD Opteron X2 Wafer
- X2: 300 mm wafer, 117 chips, 90 nm technology
- X4: 45 nm technology
Integrated Circuit Cost
Cost per die = Cost per wafer / (Dies per wafer × Yield)

Dies per wafer ≈ Wafer area / Die area

Yield = 1 / (1 + Defects per area × Die area / 2)²

- Nonlinear relation to area and defect rate
  - Wafer cost and area are fixed
  - Defect rate determined by manufacturing process
  - Die area determined by architecture and circuit design
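A minimal Python sketch of the cost model above; the wafer cost, die area, and defect rate below are hypothetical, chosen only to exercise the formulas:

```python
# Cost per die = Cost per wafer / (Dies per wafer x Yield)
def cost_per_die(wafer_cost, wafer_area, die_area, defects_per_area):
    dies_per_wafer = wafer_area / die_area                        # approximation
    yield_ = 1.0 / (1.0 + defects_per_area * die_area / 2.0) ** 2
    return wafer_cost / (dies_per_wafer * yield_)

# Hypothetical numbers: $5000 wafer, 300 mm wafer (~70,700 mm^2),
# 100 mm^2 die, 0.001 defects per mm^2
print(f"Cost per die = ${cost_per_die(5000, 70_700, 100, 0.001):.2f}")
```

Because die area shrinks the number of dies per wafer and also lowers yield, cost per die grows much faster than linearly with die area, which is the nonlinear relation noted above.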
SPEC CPU Benchmark
- Programs used to measure performance
  - Supposedly typical of actual workload
- Standard Performance Evaluation Corp (SPEC)
  - Develops benchmarks for CPU, I/O, Web, …
- SPEC CPU2006
  - Elapsed time to execute a selection of programs
    - Negligible I/O, so focuses on CPU performance
  - Normalize relative to reference machine
  - Summarize as geometric mean of performance ratios
    - CINT2006 (integer) and CFP2006 (floating-point)

  Geometric mean = ( ∏ (i = 1 to n) Execution time ratio_i )^(1/n)
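A minimal Python sketch of the geometric-mean summary; the ratios used here are a subset of the SPECratios from the table on the next slide:

```python
import math

# Geometric mean = (ratio_1 x ratio_2 x ... x ratio_n) ** (1/n)
def geometric_mean(ratios):
    return math.prod(ratios) ** (1.0 / len(ratios))

spec_ratios = [15.3, 11.8, 11.1, 6.8]    # perl, bzip2, gcc, mcf from the next slide
print(f"Geometric mean = {geometric_mean(spec_ratios):.1f}")
```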
CINT2006 for Opteron X4 2356
Name         Description                    IC×10⁹    CPI   Tc (ns)   Exec time (s)   Ref time (s)   SPECratio
perl         Interpreted string processing   2,118   0.75    0.40          637            9,777          15.3
bzip2        Block-sorting compression       2,389   0.85    0.40          817            9,650          11.8
gcc          GNU C Compiler                  1,050   1.72    0.40          724            8,050          11.1
mcf          Combinatorial optimization        336  10.00    0.40        1,345            9,120           6.8
go           Go game (AI)                    1,658   1.09    0.40          721           10,490          14.6
hmmer        Search gene sequence            2,783   0.80    0.40          890            9,330          10.5
sjeng        Chess game (AI)                 2,176   0.96    0.40          837           12,100          14.5
libquantum   Quantum computer simulation     1,623   1.61    0.40        1,047           20,720          19.8
h264avc      Video compression               3,102   0.80    0.40          993           22,130          22.3
omnetpp      Discrete event simulation         587   2.94    0.40          690            6,250           9.1
astar        Games/path finding              1,082   1.79    0.40          773            7,020           9.1
xalancbmk    XML parsing                     1,058   2.70    0.40        1,143            6,900           6.0
Geometric mean                                                                                           11.7
(The unusually high CPI values, such as mcf’s 10.0, reflect high cache miss rates.)
SPEC Power Benchmark
- Power consumption of server at different workload levels
  - Performance: ssj_ops/sec
  - Power: Watts (Joules/sec)

  Overall ssj_ops per Watt = ( Σ (i = 0 to 10) ssj_ops_i ) / ( Σ (i = 0 to 10) power_i )
SPECpower_ssj2008 for X4
Target Load %    Performance (ssj_ops/sec)    Average Power (Watts)
100%                   231,867                       295
90%                    211,282                       286
80%                    185,803                       275
70%                    163,427                       265
60%                    140,160                       256
50%                    118,324                       246
40%                     92,035                       233
30%                     70,500                       222
20%                     47,126                       206
10%                     23,066                       180
0%                           0                       141
Overall sum          1,283,590                     2,605

∑ssj_ops / ∑power = 493
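A minimal Python sketch of the overall metric, using the measurements from the table above:

```python
# Overall ssj_ops per Watt = sum(ssj_ops_i) / sum(power_i) over the 11 load levels
ssj_ops = [231_867, 211_282, 185_803, 163_427, 140_160, 118_324,
           92_035, 70_500, 47_126, 23_066, 0]            # 100% load down to 0%
power_w = [295, 286, 275, 265, 256, 246, 233, 222, 206, 180, 141]

overall = sum(ssj_ops) / sum(power_w)
print(f"Overall ssj_ops per Watt = {overall:.0f}")        # about 493
```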
§1.8 Fallacies and Pitfalls
Pitfall: Amdahl’s Law
- Improving an aspect of a computer and expecting a proportional improvement in overall performance

  T_improved = T_affected / improvement factor + T_unaffected

- Example: multiply accounts for 80 s out of 100 s total
  - How much improvement in multiply performance is needed to get 5× overall?

    20 = 80/n + 20   →   Can’t be done!

    (5× overall means total time must drop to 100 s / 5 = 20 s, but the unaffected 20 s alone already takes that long, so the multiply time would have to fall to zero.)

- Corollary: make the common case fast
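A minimal Python sketch of Amdahl’s Law applied to the example above:

```python
# T_improved = T_affected / improvement_factor + T_unaffected
def improved_time(t_affected, t_unaffected, factor):
    return t_affected / factor + t_unaffected

t_affected, t_unaffected = 80.0, 20.0     # multiply takes 80 s of the 100 s total
for factor in (2, 10, 100, 1000):
    t = improved_time(t_affected, t_unaffected, factor)
    print(f"multiply sped up {factor}x -> overall speedup {100.0 / t:.2f}x")
# The overall speedup approaches 5x but never reaches it, because the
# unaffected 20 s puts a floor under the total execution time.
```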
Fallacy: Low Power at Idle
- Look back at X4 power benchmark
  - At 100% load: 295 W
  - At 50% load: 246 W (83%)
  - At 10% load: 180 W (61%)
- Google data center
  - Mostly operates at 10% – 50% load
  - At 100% load less than 1% of the time
- Consider designing processors to make power proportional to load
Pitfall: MIPS as a Performance Metric
- MIPS: Millions of Instructions Per Second
  - Doesn’t account for
    - Differences in ISAs between computers
    - Differences in complexity between instructions
MIPS = Instruction count / (Execution time × 10⁶)
     = Instruction count / ( (Instruction count × CPI / Clock rate) × 10⁶ )
     = Clock rate / (CPI × 10⁶)
- CPI varies between programs on a given CPU, so the MIPS rating does too
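A minimal Python sketch showing why MIPS can mislead: the same CPU earns different MIPS ratings for programs with different CPI. The clock rate and CPI values are hypothetical:

```python
# MIPS = Clock rate / (CPI x 10^6): it depends on the program's CPI and
# says nothing about how much work each instruction actually does.
clock_rate = 4e9                                   # hypothetical 4 GHz CPU

for program, cpi in [("program X", 1.0), ("program Y", 2.5)]:   # hypothetical CPIs
    mips = clock_rate / (cpi * 1e6)
    print(f"{program}: CPI = {cpi} -> {mips:.0f} MIPS")
# Same machine, same clock, very different MIPS ratings.
```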
§1.9 Concluding Remarks
Concluding Remarks
- Cost/performance is improving
  - Due to underlying technology development
- Hierarchical layers of abstraction
  - In both hardware and software
- Instruction set architecture
  - The hardware/software interface
- Execution time: the best performance measure
- Power is a limiting factor
  - Use parallelism to improve performance