CS152: Computer Systems Architecture Moore’s Law

CS152: Computer Systems ArchitectureMoore’s Law

Sang-Woo Jun

Winter 2021

Conventional performance scaling

❑ Traditional model of a computer is simpleo Single, in-order flow of instructions on a processor

o Simple, in-order memory model

❑ Large part of computer architecture research involved maintaining this abstraction while improving performanceo Transparent caches, Transparent superscalar scheduling, …

o Same software runs faster tomorrow

o (Slow software becomes acceptable tomorrow)

❑ Driven largely by continuing march of Moore's law

MemoryProgram

Moore’s Law

❑ What exactly does it mean?

❑ What is it that is scaling?

Moore’s Law

❑ Typically cast as: “Performance doubles every X months”

❑ Actually closer to: “Number of transistors per unit cost doubles every X months”

Moore’s Law

The complexity for minimum component costs has increased at a rate of

roughly a factor of two per year.

Over the longer term, the rate of increase is a bit more uncertain,

although there is no reason to believe it will not remain nearly constant

for at least 10 years.-- Gordon Moore, Electronics, 1965

Why is Moore’s Law conflated with processor performance?

Dennard Scaling: From Moore’s Law to performance

❑ “Power density stays constant as transistors get smaller” o Robert H. Dennard, 1974

❑ Intuitively:o Smaller transistors → shorter propagation delay → faster frequency

o Smaller transistors → smaller capacitance → lower voltage

o 𝑃𝑜𝑤𝑒𝑟 ∝ 𝐶𝑎𝑝𝑎𝑐𝑖𝑡𝑎𝑛𝑐𝑒 × 𝑉𝑜𝑙𝑡𝑎𝑔𝑒2 × 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦

Moore’s law → Faster performance @ Constant power!

Single-core performance scaling projection

What happened?

(Slightly) more accurateprocessor power consumption

𝑃𝑜𝑤𝑒𝑟 = (𝐴𝑐𝑡𝑖𝑣𝑒𝑇𝑟𝑎𝑛𝑠𝑖𝑠𝑡𝑜𝑟𝑠 × 𝐶𝑎𝑝𝑎𝑐𝑖𝑡𝑎𝑛𝑐𝑒 × 𝑉𝑜𝑙𝑡𝑎𝑔𝑒2 × 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦)

+ (𝑉𝑜𝑙𝑡𝑎𝑔𝑒 × 𝐿𝑒𝑎𝑘𝑎𝑔𝑒)

Dynamic power

Static power

𝐿𝑒𝑎𝑘𝑎𝑔𝑒 ∝1

𝑒𝑉𝑜𝑙𝑡𝑎𝑔𝑒EXTREMELY simplified model!

Unfortunately…

Gate-oxidestopped scaling

Stopped scaling due to leakage

https://www.design-reuse.com/articles/20296/power-management-leakage-control-process-compensation.html

Total power consumption with constant frequency

End of Dennard Scaling

❑ Even with smaller transistors, we cannot continue reducing powero What do we do now?

❑ Option 1: Continue scaling frequency at increased power budgeto Chip quickly become too hot to cool!

o Thermal runaway: Hotter chip → increased resistance → hotter chip → …

Option 1: Continue scaling frequency at increased power budget

0.01 μ

Option 2: Stop frequency scaling

Dennard Scaling Ended(~2006)

Danowitz et.al., “CPU DB: Recording Microprocessor History,” Communications of the ACM, 2012

Looking back: change of predictions

Kogge et. al., “Yearly update : exascale projections for 2013,”Sandia National Laboratoris, 2013

But Moore’s Law continues beyond 2006

State of things at this point (2006)

❑ Single-thread performance scaling endedo Frequency scaling ended (Dennard Scaling)

o Instruction-level parallelism scaling stalled … also around 2005

❑ Moore’s law continueso Double transistors every two years

o What do we do with them?

K. Olukotun, “Intel CPU Trends”

Instruction Level Parallelism

Crisis averted with manycores?

What happened?

𝑃𝑜𝑤𝑒𝑟 =

(𝐴𝑐𝑡𝑖𝑣𝑒𝑇𝑟𝑎𝑛𝑠𝑖𝑠𝑡𝑜𝑟𝑠 × 𝐶𝑎𝑝𝑎𝑐𝑖𝑡𝑎𝑛𝑐𝑒 × 𝑉𝑜𝑙𝑡𝑎𝑔𝑒2 × 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦)

+ (𝑉𝑜𝑙𝑡𝑎𝑔𝑒 × 𝐿𝑒𝑎𝑘𝑎𝑔𝑒𝐶𝑢𝑟𝑟𝑒𝑛𝑡)

Dynamic power

Static power

Can’t keep going upGate-oxide

stopped scaling

Regardless of Moore’s Law, a limited amount of gatescan be active at a given time

“Utilization Wall”

Stopped scaling due to thermal

Where To, From Here?

❑ The number of active transistors at a given time is limitedo We won’t get much performance improvements even if Moore’s law continues

o We need to make the best use of those active transistors!

Where To, From Here?

❑ Potential Solution 1: The software solutiono Write efficient software to make the efficient use of hardware resources

o No longer depend entirely on hardware performance scaling

o “Performance engineering” software, using hardware knowledge

❑ Solution 2: The specialized architectural solutiono Chip space is now cheap, but power is expensive

o Stop depending on more complex general-purpose cores

o Use space to build heterogeneous systems, with compute engines well-suited for each application

Taylor, Michael B. "Is dark silicon useful? Harnessing the four horsemen of the coming dark silicon apocalypse." DAC Design Automation Conference 2012. IEEE, 2012.

The Bottom Line: Architecture is No Longer Transparent

❑ Optimized software requires architecture knowledge

❑ Special-purpose “accelerators” (GPU, FPGA, …) programmed explicitly

❑ Even general-purpose processors implement specialized instructionso Single-Instruction Multiple Data (SIMD) instructions such as AVX

o Special-purpose instructions sets such as AES-NI

CS152: Computer Systems Architecture Moore’s Law

Documents

Transcript of CS152: Computer Systems Architecture Moore’s Law

CS 152 Computer Architecture and Engineering Lecture 27 ...cs152/sp05/lecnotes/lec15-2.pdfComputer Architecture and Engineering Lecture 27 – Multiprocessors cs152/ TAs: Ted Hong

CS 152 Computer Architecture and Engineering …cs152/sp10/lectures/L01...1/19/2010 CS152, Spring 2010 11 The “New” CS152 • New CS152 focuses on interaction of software and hardware

CS152 Computer Architecture and Engineering Lecture 8 Designing a Single Cycle Datapath

CS 152 Computer Architecture and Engineering Lecture …cs152/sp11/lectures/L18-Snoopy.pdf · April 13, 2011 CS152, Spring 2011 CS 152 Computer Architecture and Engineering Lecture

Computer Architecture and Engineering CS152 Quiz #1 ...cs152/sp13/handouts/quiz1sol_s2013.pdfPage 1 of 20 Computer Architecture and Engineering CS152 Quiz #1 February 19th, 2013 Professor

CS152 – Computer Architecture and Engineering Lecture 16 – Advanced Pipelining 2

CS 152 Computer Architecture and Engineering Last …inst.cs.berkeley.edu/~cs152/sp08/lectures/L02-SimpleImps.pdf · 1/24/2008 CS152-Spring!08 3 BurroughÕs B5000 Stack Architecture:

CS152 Computer Architecture and Engineering Lecture 20 Caches April 14, 2003 John Kubiatowicz (kubitron) lecture slides: cs152

CS152 – Computer Architecture and Engineering Lecture 10 – Introduction to Pipelining

inst.eecs.berkeley.eduinst.eecs.berkeley.edu/~cs152/sp20/assignments/sp20… · Web viewCS152 Computer Architecture and Engineering CS252 Graduate Computer Architecture Memory Consistency

CS152 Computer Architecture and Engineering Lecture 7 Divide, Floating Point, Pentium Bug

CS 152 Computer Architecture and Engineering Lecture …cs152/sp09/lectures/L02-SimpleImps.pdf · 1/27/2009 CS152-Spring!09 3 Burrough’s B5000 Stack Architecture: An ALGOL Machine,

CS152 / Kubiatowicz Lec3.1 1/23/01©UCB Spring 2001 CS152 Computer Architecture and Engineering Lecture 3 Performance, Technology & Delay Modeling January.

CS152 Lec6.1 CS152 Computer Architecture and Engineering Lecture 7 Divide, Floating Point, Pentium Bug.

CS 152 Computer Architecture and Engineering …cs152/sp12/lectures/L13...March 8, 2012 CS152, Spring 2012 CS 152 Computer Architecture and Engineering Lecture 13 - VLIW Machines and

CS152 Lec9.1 CS152 Computer Architecture and Engineering Lecture 9 Designing Single Cycle Control.

CS152: Computer Systems Architecture The …swjun/courses/2019W-CS152...CS152: Computer Systems Architecture The Hardware/Software Interface Sang-Woo Jun Winter 2019 Large amount of

CS152 – Computer Architecture and Engineering Lecture 13 – Fastest Cache Ever!

CS152 Computer Architecture and Engineering Lecture 6 VHDL, Multiply, Shift

CS152 – Computer Architecture and Engineering Lecture 15 ... fileCS 152 L15 Adv. Pipe. (1) Patterson Fall 2003 © UCB CS152 – Computer Architecture and Engineering Lecture 15 –