KEMBAR78
Multi core processor | PPTX
Intel x86 Multi-core Organization

1
A multi-core computer also known as chip
multiprocessor, combines two or more
processors (called cores) on a single piece of
silicon(called a die).

2
Hardware Performance Issues:
• Increase in parallelism
• Power consumption
 Software Performance Issues:
• Performance dependent on effective utilization
of parallel resources
• Even small amounts of serial code impact
performance


3
Main variables in multi-core organization :
 Number of core processors on chip
 Number of levels of cache memory
 Amount of cache memory that is shared

4
5

Intel Core i7

Intel Core
Duo

AMD
Opteron

ARM11
MP Core
Intel Core duo:
 Introduced in 2006, implements two x86
superscalar processor with shared L2 cache
 Each multi-core processor has dedicated L1
cache, 32kb instruction and data cache

6
 Independent Thermal control unit per core

• Manages chip heat dissipation with sensors
• Maximize performance within thermal

constraints

7
 Advanced Programmable Interrupt Controlled

(APIC)
• It can provide Inter-process interrupts
which allow any process to interrupt any
other process
• Routes interrupts to appropriate core
• Includes timer so OS can self-interrupt a core
8
 Power Management Logic

• Monitors thermal conditions and CPU

activity
• Adjusts voltage (and thus power
consumption)
• Can switch on/off individual logic subsystems
to save power
• Split-bus transactions can sleep on one end
9
 2MB shared L2 cache

• Dynamic allocation
• Extended to support multiple Core Duo in

SMP (not SMT)
• L2 data shared between local cores (fast) or
external
 Bus interface is FSB
10
11






November 2008
Four x86 SMT processors
Dedicated L2, shared L3 cache
Speculative pre-fetch for caches
On chip DDR3 memory controller
• Three 8 byte channels (192 bits) giving 32GB/s
• No front side bus
12


Quick Path Interconnection
• Cache coherent point-to-point link
• High speed communications between processor
chips
• 6.4G transfers per second, 16 bits per transfer
• Dedicated bi-directional pairs
• Total bandwidth 25.6GB/s
13
14
15

Multi core processor

  • 1.
    Intel x86 Multi-coreOrganization 1
  • 2.
    A multi-core computeralso known as chip multiprocessor, combines two or more processors (called cores) on a single piece of silicon(called a die). 2
  • 3.
    Hardware Performance Issues: •Increase in parallelism • Power consumption  Software Performance Issues: • Performance dependent on effective utilization of parallel resources • Even small amounts of serial code impact performance  3
  • 4.
    Main variables inmulti-core organization :  Number of core processors on chip  Number of levels of cache memory  Amount of cache memory that is shared 4
  • 5.
    5 Intel Core i7 IntelCore Duo AMD Opteron ARM11 MP Core
  • 6.
    Intel Core duo: Introduced in 2006, implements two x86 superscalar processor with shared L2 cache  Each multi-core processor has dedicated L1 cache, 32kb instruction and data cache 6
  • 7.
     Independent Thermalcontrol unit per core • Manages chip heat dissipation with sensors • Maximize performance within thermal constraints 7
  • 8.
     Advanced ProgrammableInterrupt Controlled (APIC) • It can provide Inter-process interrupts which allow any process to interrupt any other process • Routes interrupts to appropriate core • Includes timer so OS can self-interrupt a core 8
  • 9.
     Power ManagementLogic • Monitors thermal conditions and CPU activity • Adjusts voltage (and thus power consumption) • Can switch on/off individual logic subsystems to save power • Split-bus transactions can sleep on one end 9
  • 10.
     2MB sharedL2 cache • Dynamic allocation • Extended to support multiple Core Duo in SMP (not SMT) • L2 data shared between local cores (fast) or external  Bus interface is FSB 10
  • 11.
  • 12.
         November 2008 Four x86SMT processors Dedicated L2, shared L3 cache Speculative pre-fetch for caches On chip DDR3 memory controller • Three 8 byte channels (192 bits) giving 32GB/s • No front side bus 12
  • 13.
     Quick Path Interconnection •Cache coherent point-to-point link • High speed communications between processor chips • 6.4G transfers per second, 16 bits per transfer • Dedicated bi-directional pairs • Total bandwidth 25.6GB/s 13
  • 14.
  • 15.