
High-Quality, Deterministic Parallel Placement for FPGAs on Commodity Hardware
Adrian Ludwin, Vaughn Betz & Ketan Padalia

FPGA Seminar Presentation

Nov 10, 2009

Overview

Motivation
Review of simulated annealing
Approaches
Summary

Motivation

Simulated Annealing Placement

A probabilistic approach to finding a near-optimal solution.

Behavior: moves through the solution space, both greedily and randomly.

Balance between greediness and randomness is controlled by a temperature

Temperature evolves through time based on a cooling schedule

Simulated Annealing Placement

For a single move:
Compute the change in cost, ΔC.
Accept the move if ΔC < 0, or if ΔC > 0 with probability e^(−ΔC/T).

Repeat while gradually decreasing T and the window size.
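The acceptance rule and cooling loop above can be sketched as follows. This is a generic simulated-annealing skeleton, not the authors' implementation; the `propose` callback and the schedule constants are illustrative assumptions:

```python
import math
import random

def accept_move(delta_c: float, temperature: float, rng: random.Random) -> bool:
    """Metropolis criterion: always accept improving moves (dC < 0);
    accept worsening moves with probability exp(-dC/T)."""
    if delta_c < 0:
        return True
    return rng.random() < math.exp(-delta_c / temperature)

def anneal(initial_cost, propose, temperature=10.0, alpha=0.9,
           iters_per_temp=100, seed=0):
    """Outer loop sketch: repeat moves while gradually decreasing T
    (a real placer also shrinks the move-window size as T drops)."""
    rng = random.Random(seed)
    cost = initial_cost
    while temperature > 0.01:
        for _ in range(iters_per_temp):
            delta_c = propose(rng)       # cost change of a candidate move
            if accept_move(delta_c, temperature, rng):
                cost += delta_c          # commit the move
        temperature *= alpha             # geometric cooling schedule
    return cost
```

At high T the exponential makes bad moves likely to be accepted (randomness dominates); as T falls, only improving moves get through (greediness dominates).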


Constraints

Runs on commodity hardware
Good quality of results
Robust
Deterministic: needed for bug reporting and consistent regression results
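One common way to make parallel annealing deterministic (assumed here for illustration; the slides do not show the authors' mechanism) is to derive every move's random choices from a global seed and the move's logical index, so the outcome is independent of thread scheduling:

```python
import random

def move_rng(global_seed: int, move_index: int) -> random.Random:
    """Private RNG for one move, derived only from the global seed and
    the move's logical index -- never from thread identity or timing."""
    return random.Random(global_seed * 1_000_003 + move_index)

# Two interleavings of the same moves make identical per-move random
# choices, so results are reproducible for bug reports and regressions.
choices_a = {i: move_rng(42, i).random() for i in (0, 1, 2, 3)}
choices_b = {i: move_rng(42, i).random() for i in (3, 1, 0, 2)}
```

Because each move's RNG depends only on stable inputs, any thread can execute any move in any real-time order and the placement still comes out identical.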

Selected Previous Work

Closely related: move acceleration, parallel moves

Other methods: independent sets, partitioned placements, speculative execution

Algorithm #1

Algorithm #2

Objective

Determine efficacy
Analyze runtime and categorize it into: memory, synchronization, infrastructure, evaluation, proposal

Methodology

Parallel-equivalent flow: a serial flow that mimics the parallel flow.
Emulates the behavior of the multithreaded application using only one thread/core.
Useful for comparison: accounts for the infrastructure overhead.
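A minimal sketch of such a parallel-equivalent flow (the structure and names are illustrative assumptions, not the authors' code): moves are partitioned exactly as the parallel scheduler would partition them, then executed round-robin by a single thread, so any slowdown versus the plain serial flow is pure infrastructure overhead rather than a concurrency effect.

```python
from collections import deque

def parallel_equivalent_flow(moves, n_workers=2):
    """Serial flow mimicking an n-worker parallel flow: deal moves to
    per-worker queues as the parallel scheduler would, then have one
    real thread play every worker in turn."""
    queues = [deque() for _ in range(n_workers)]
    for i, move in enumerate(moves):
        queues[i % n_workers].append(move)   # same partition as the parallel run
    order = []
    while any(queues):
        for q in queues:                     # round-robin over the workers
            if q:
                order.append(q.popleft())
    return order
```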

Methodology

Attributing runtime: two types of measurements.

Bottom-up (bu): measure each component of a move.

End-to-end (e2e): measure the runtime of the entire run.
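The two measurement styles can be sketched like this (illustrative names, not the authors' instrumentation): bottom-up timers wrap each move component, an end-to-end wall clock brackets the whole run, and the gap between them is unattributed overhead.

```python
import time
from collections import defaultdict

class MoveTimer:
    """Bottom-up attribution: accumulate time per move component
    (e.g. propose / evaluate), to be compared against an end-to-end
    measurement of the entire run."""
    def __init__(self):
        self.totals = defaultdict(float)

    def timed(self, component, fn, *args):
        start = time.perf_counter()
        result = fn(*args)
        self.totals[component] += time.perf_counter() - start
        return result

timer = MoveTimer()
e2e_start = time.perf_counter()
for _ in range(1000):
    timer.timed("propose", lambda: sum(range(50)))    # stand-in workloads
    timer.timed("evaluate", lambda: sum(range(100)))
e2e = time.perf_counter() - e2e_start
bu_sum = sum(timer.totals.values())
# e2e >= bu_sum: the difference is loop and timing infrastructure overhead
```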


Test sets: 11 Stratix® II FPGA benchmark designs (IP and customer circuits, 10k to 100k logic cells).

Also tested on 40 Stratix II FPGA circuits; obtained similar results.

Results for Algorithm #1

Move attribution

Overhead analysis

Observations

Theoretical speedup: 1.7x. Measured: 1.3x (best).

Increase in evaluation runtime, due to reduced cache locality.

Proposal time is “hidden”

Analysis

Time spent on stalls is negligible.
Evaluation accounts for most of the overhead.
Little to gain by removing determinism: serial equivalency costs less than 3% of runtime.

Summary for Algorithm #1

Speedup: 1–1.3x.
Memory inefficiency is the biggest bottleneck.
Theoretically, the algorithm should scale; however, it is difficult to partition and balance the two stages.
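The difficulty of balancing the two pipeline stages can be made concrete with the standard throughput model (an assumption for illustration, not a formula from the slides): per move, the serial flow costs t1 + t2, while the pipeline's throughput is set by the slower stage.

```python
def pipeline_speedup(t_stage1: float, t_stage2: float) -> float:
    """Ideal two-stage pipeline speedup: serial cost (t1 + t2) divided
    by the slower stage's cost, which limits pipeline throughput."""
    return (t_stage1 + t_stage2) / max(t_stage1, t_stage2)
```

Perfect balance gives 2x, while a 60/40 split already drops to about 1.67x, in the neighborhood of the 1.7x theoretical ceiling reported above (whether the slides' figure comes from this model is an assumption).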

Speedups for Algorithm #2

Attribution on 2 cores

Attribution on 4 cores

Observations

Memory latency due to inter-processor communication; worsens with more cores.

Summary for Algorithm #2

Parallel moves have better scalability than pipelined moves.

The bottleneck is still memory.
Again, serial equivalency costs little.

Take Home Messages

Memory is important. Good algorithms are even more important.