ScalaMeter 2014

ScalaMeter Performance regression testing framework

Aleksandar Prokopec, Josh Suereth, Vlad Ureche, Roman Zoller, Ngoc Duy Pham, Alexey Romanov, Roger Vion, Eugene Platonov, Lukas Rytz,

Paolo Giarusso, Dan Burkert, Christian Krause, and others

Background

Georges A., Buytaert D., Eeckhout L. Statistically rigorous Java performance evaluation. OOPSLA '07

Motivation

List(1 to 100000: _*).map(x => x * x)

Motivation

List(1 to 100000: _*).map(x => x * x)

class List[+T] extends Seq[T] { // implementation 1 }

Motivation

List(1 to 100000: _*).map(x => x * x)

Motivation

List(1 to 100000: _*).map(x => x * x)

First example

def measure() { val buffer = mutable.ArrayBuffer(0 until 2000000: _*) val start = System.currentTimeMillis() var sum = 0 buffer.foreach(sum += _) val end = System.currentTimeMillis() println(end - start) }

First example

def measure() { val buffer = mutable.ArrayBuffer(0 until 2000000: _*) val start = System.currentTimeMillis() var sum = 0 buffer.foreach(sum += _) val end = System.currentTimeMillis() println(end - start) } measure()

First example

def measure() { val buffer = mutable.ArrayBuffer(0 until 2000000: _*) val start = System.currentTimeMillis() var sum = 0 buffer.foreach(sum += _) val end = System.currentTimeMillis() println(end - start) } measure()

The warmup problem

def measure() { val buffer = mutable.ArrayBuffer(0 until 2000000: _*) val start = System.currentTimeMillis() var sum = 0 buffer.foreach(sum += _) val end = System.currentTimeMillis() println(end - start) } measure() measure()

26 ms, 11 ms

The warmup problem

26 ms, 11 ms

Why? Mainly: - JIT compilation - dynamic optimization

The warmup problem

45 ms, 10 ms

26 ms, 11 ms

The warmup problem

def measure2() { val buffer = mutable.ArrayBuffer(0 until 4000000: _*) val start = System.currentTimeMillis() buffer.map(_ + 1) val end = System.currentTimeMillis() println(end - start) }

The warmup problem

241, 238, 235, 236, 234

The warmup problem

241, 238, 235, 236, 234, 429

The warmup problem

241, 238, 235, 236, 234, 429, 209

The warmup problem

241, 238, 235, 236, 234, 429, 209, 194, 195, 195

The warmup problem

Bottomline: benchmark has to be repeated until the running time becomes “stable”. The number of repetitions is not known in advance.

241, 238, 235, 236, 234, 429, 209, 194, 195, 195

Warming up the JVM

241, 238, 235, 236, 234, 429, 209, 194, 195, 195, 194, 194, 193, 194, 196, 195, 195

Warming up the JVM

241, 238, 235, 236, 234, 429, 209, 194, 195, 195, 194, 194, 193, 194, 196, 195, 195

The interference problem

val buffer = ArrayBuffer(0 until 900000: _*) buffer.map(_ + 1) val buffer = ListBuffer(0 until 900000: _*) buffer.map(_ + 1)

Lets measure the first map 3 times with 7 repetitions:

61, 54, 54, 54, 55, 55, 56 186, 54, 54, 54, 55, 54, 53 54, 54, 53, 53, 53, 54, 51

Now, lets measure the list buffer map in between:

61, 54, 54, 54, 55, 55, 56 186, 54, 54, 54, 55, 54, 53 54, 54, 53, 53, 53, 54, 51

59, 54, 54, 54, 54, 54, 54 44, 36, 36, 36, 35, 36, 36 45, 45, 45, 45, 44, 46, 45 18, 17, 18, 18, 17, 292, 16 45, 45, 44, 44, 45, 45, 44

Now, lets measure the list buffer map in between:

61, 54, 54, 54, 55, 55, 56 186, 54, 54, 54, 55, 54, 53 54, 54, 53, 53, 53, 54, 51

59, 54, 54, 54, 54, 54, 54 44, 36, 36, 36, 35, 36, 36 45, 45, 45, 45, 44, 46, 45 18, 17, 18, 18, 17, 292, 16 45, 45, 44, 44, 45, 45, 44

Using separate JVM

Bottomline: always run the tests in a new JVM.

Using separate JVM

Bottomline: always run the tests in a new JVM. This may not reflect a real-world scenario, but it gives a good idea of how different several alternatives are.

Using separate JVM

Bottomline: always run the tests in a new JVM. It results in a reproducible, more stable measurement.

The List.map example

val list = (0 until 2500000).toList list.map(_ % 2 == 0)

The List.map example

37, 38, 37, 1175, 38, 37, 37, 37, 37, …, 38, 37, 37, 37, 37, 465, 35, 35, …

The garbage collection problem

37, 38, 37, 1175, 38, 37, 37, 37, 37, …, 38, 37, 37, 37, 37, 465, 35, 35, …

This benchmark triggers GC cycles!

37, 38, 37, 1175, 38, 37, 37, 37, 37, …, 38, 37, 37, 37, 37, 465, 35, 35, … -> mean: 47 ms

37, 37, 37, 647, 37, 36, 38, 37, 36, …, 36, 37, 36, 37, 36, 37, 534, 36, 33, … -> mean: 39 ms

Solutions: - repeat A LOT of times –an accurate mean, but takes A LONG time

Solutions: - repeat A LOT of times –an accurate mean, but takes A LONG time - ignore the measurements with GC – gives a reproducible value, and less measurements

- how to do this?

- manually - verbose:gc

Automatic GC detection

- manually - verbose:gc - automatically using callbacks in JDK7

37, 37, 37, 647, 37, 36, 38, 37, 36, …, 36, 37, 36, 37, 36, 37, 534, 36, 33, …

Automatic GC detection

- manually - verbose:gc - automatically using callbacks in JDK7

37, 37, 37, 647, 37, 36, 38, 37, 36, …, 36, 37, 36, 37, 36, 37, 534, 36, 33, …

raises a GC event

The runtime problem

- there are other runtime events beside GC – e.g. JIT compilation, dynamic optimization, etc.

- these take time, but cannot be determined accurately

The runtime problem

- these take time, but cannot be determined accurately - heap state also influences memory allocation patterns

and performance

The runtime problem

and performance

val list = (0 until 4000000).toList list.groupBy(_ % 10)

(allocation intensive)

The runtime problem

and performance

120, 121, 122, 118, 123, 794, 109, 111, 115, 113, 110

The runtime problem

and performance

120, 121, 122, 118, 123, 794, 109, 111, 115, 113, 110

affects the mean – 116 ms vs 178 ms

Outlier elimination

120, 121, 122, 118, 123, 794, 109, 111, 115, 113, 110

Outlier elimination

120, 121, 122, 118, 123, 794, 109, 111, 115, 113, 110

109, 110, 111, 113, 115, 118, 120, 121, 122, 123, 794

Outlier elimination

120, 121, 122, 118, 123, 794, 109, 111, 115, 113, 110

109, 110, 111, 113, 115, 118, 120, 121, 122, 123, 794

inspect tail and its variance contribution

109, 110, 111, 113, 115, 118, 120, 121, 122, 123

Outlier elimination

120, 121, 122, 118, 123, 794, 109, 111, 115, 113, 110

109, 110, 111, 113, 115, 118, 120, 121, 122, 123, 794

inspect tail and its variance contribution

109, 110, 111, 113, 115, 118, 120, 121, 122, 123

109, 110, 111, 113, 115, 118, 120, 121, 122, 123, 124

redo the measurement

ScalaMeter

Test info

ScalaMeter

Test info

Scala test

Java test

ScalaMeter

Test info Executor

Scala test

Java test

ScalaMeter

Test info Executor

Scala test

Java test

Warmer

Measurer

ScalaMeter

Test info Executor

Scala test

Java test

Warmer

Measurer

Memory

ScalaMeter

Test info Executor

Scala test

Java test

Warmer

Measurer

Memory

Duration

ScalaMeter

Test info Executor

Scala test

Java test

Warmer

Measurer

Memory

Duration

ScalaMeter

Test info Executor

Scala test

Java test

Warmer

Measurer

Memory

Duration

(box count)

ScalaMeter

Test info Executor

Scala test

Java test

New VM

Warmer

Measurer

ScalaMeter

Test info Executor Reporter(s)

Scala test

Java test

New VM

Warmer

Measurer

ScalaMeter

Scala test

Java test

New VM

Logging

Warmer

Measurer

ScalaMeter

Scala test

Java test

New VM

Logging

Warmer

Measurer

Regression

Historian

Tester

Persistor

ScalaMeter example

object ListTest extends PerformanceTest.Microbenchmark {

A range of predefined benchmark types

Scala test

ScalaMeter example

object ListTest extends PerformanceTest.Microbenchmark { val sizes = Gen.range("size”)(500000, 1000000, 100000)

Generators provide input data for tests

Scala test

Generator(s)

ScalaMeter example

object ListTest extends PerformanceTest.Microbenchmark { val sizes = Gen.range("size”)(500000, 1000000, 100000) val lists = for (sz <- sizes) yield (0 until sz).toList

Generators can be composed a la ScalaCheck

Scala test

Generator(s)

ScalaMeter example

object ListTest extends PerformanceTest.Microbenchmark { val sizes = Gen.range("size”)(500000, 1000000, 100000) val lists = for (sz <- sizes) yield (0 until sz).toList using(lists) in { xs => xs.groupBy(_ % 10) } }

Concise syntax to specify and group tests

Scala test

Generator(s)

Snippet(s)

ScalaMeter example

object ListTest extends PerformanceTest.Microbenchmark { val sizes = Gen.range("size”)(500000, 1000000, 100000) val lists = for (sz <- sizes) yield (0 until sz).toList measure method “groupBy” in { using(lists) in { xs => xs.groupBy(_ % 10) } using(ranges) in { xs => xs.groupBy(_ % 10) } } }

Automatic regression testing

using(lists) in { xs => var sum = 0 xs.foreach(x => sum += x) }

Regression

Historian

Tester

Persistor

using(lists) in { xs => var sum = 0 xs.foreach(x => sum += x) }

[info] Test group: foreach [info] - foreach.Test-0 measurements: [info] - at size -> 2000000, 1 alternatives: passed [info] (ci = <7.28, 8.22>, significance = 1.0E-10)

Regression

Historian

Tester

Persistor

using(lists) in { xs => var sum = 0 xs.foreach(x => sum += math.sqrt(x)) }

Regression

Historian

Tester

Persistor

using(lists) in { xs => var sum = 0 xs.foreach(x => sum += math.sqrt(x)) }

[info] Test group: foreach [info] - foreach.Test-0 measurements: [info] - at size -> 2000000, 2 alternatives: failed [info] (ci = <14.57, 15.38>, significance = 1.0E-10) [error] Failed confidence interval test: <-7.85, -6.60> [error] Previous (mean = 7.75, stdev = 0.44, ci = <7.28, 8.22>) [error] Latest (mean = 14.97, stdev = 0.38, ci = <14.57, 15.38>)

Regression

Historian

Tester

Persistor

Report generation

http://scala-blitz.github.io/home/documentation/benchmarks//chara.html

Online mode

import org.scalameter._ val time = measure { for (i <- 0 until 100000) yield i } println(s"Total time: $time")

Online mode

val time = config( Key.exec.benchRuns -> 20, Key.verbose -> true ) withWarmer { new Warmer.Default } withMeasurer { new Measurer.IgnoringGC } measure { for (i <- 0 until 100000) yield i } println(s"Total time: $time")

Tutorials online!

http://scalameter.github.io

Thank you!

ScalaMeter 2014

Documents

Transcript of ScalaMeter 2014

SSLC MARCH 2014 · SSLC MARCH 2014 . MARCH - 2014

ANNUAL REPORT 2015 PT. VICTORIA … I 2014 1st Quarter 2014 Kuartal II 2014 2nd Quarter 2014 Kuartal III 2014 3rd Quarter 2014 Kuartal IV 2014 3rd Quarter 2014 Total Total Tertinggi

KaTaLoG BkKbN 2014~ bKb Kit 2014~kIE KiT 2014~iUd Kit bkkBn 2014~PubliC AddResS BkkBn 2014~GenRe KIt 2014~SaRaNa PlkB 2014~Dak bkKbn 2014~~ Dak bk kb n 2014 bkkbn katalog

APPENDIX A: Overall Breakdown...APPENDIX A: Overall Breakdown Difference Difference 2014 2016 2014 2016 2014 2016 2014 2016 2014 2016 2014 2016 2014 2016 2014 2016 I have support to

Criminal petition 418/2014, 529/2014, 582/2014, 825/2014 ...ghconline.nic.in/Judgment/Crlpet4182014.pdf · Criminal petition 418/2014, 529/2014, 582/2014, ... The loan proposal ...

PROFICIENCY TESTING 2014 - AFSCA · LAB3 17/09/2014 18/09/2014 19/09/2014 LAB4 15/09/2014 25/09/2014 25/09/2014 LAB5 17/09/2014 22/09/2014 25/09/2014 LAB6 15/09/2014 22-23/09/2014

WARN Report - Employment Development Department · 01-07-2014 · 06/30/2014 06/30/2014 06/30/2014 07/01/2014 07/02/2014 07/08/2014 ... 09/15/2014 07/17/2014 Microsift and Nokia

Board of Directors Meeting - Access Health CT · OCT 2013 NOV 2013 DEC 2013 JAN 2014 FEB 2014 MAR 2014 APR 2014 MAY 2014 JUN 2014 JUL 2014 AUG 2014 SEP 2014 Insured ... (OHA, DSS,

Part 5 - Public Investment Planning: Portfolio Structure and … · 2014. 3. 12. · January-2014 February-2014 March-2014 April-2014 May-2014 June-2014 July-2014 DAY Net Adj Net

6/11/2014 What’s’New’in’Young’AdultLiterature’2014’...Jun 11, 2014 · What’s’New’in’Young’AdultLiterature’2014’ 6/11/2014 This’material’has’been’created’for’the’Infopeople’Project[

· 3248 3248 3248 3248 3248 3248 3248 1388 1388 1388 t 333 1388 1388 1338 1388 1338 3248 3248 2014 2014- 07-_ 2014- 07- 2014 2014- 07- 2014- 07- 2014 2014- 2014- 07-. 2014 2014-

· ASHtrroSH GOKUL NAUKARKAR MD MOHSIN ASMITA PATL Year of Passing Branch 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014

The RAILSPLITTER - Home - Lincoln High School · 2018-12-11 · Amy Luong Brandon Searcy @YoungPacman04 Graphic Designer 2014 2014 2014 2014 2014 2014 2014 2014 2014. 2014 2014 ...

Report ‘Booster ‘Booster ‘Booster –––– 2014’ 2014’ 2014’

Evaluation of CMMI Accountable Care Organization ... · April 1, 2012 Jan. 2014 - Dec. 2014 Jan. 2014 - Dec. 2014 PY2 July 1, 2012 Jan. 2014 - Dec. 2014 Jan. 2014 - Dec. 2014 Providers

Trends in Medication Management 2017 - TELUS Health · Mar 2014. Apr 2014. May 2014. Jun 2014. Jul 2014. Aug 2014. Sep 2014. Oct 2014. Nov 2014. Dec 2014. Jan 2015. Feb 2015. Mar

ATC Alcohol Violations - IN.gov06/04/2014 06/04/2014 06/04/2014 06/06/2014 06/06/2014 06/06/2014 06/10/2014 06/10/2014 06/16/2014 06/16/2014 06/16/2014 150.00 500.00 500.00 750.00

North East Ambulance Service NHS Foundation Trust ... · Sep 2013 Oct 2013 Nov 2013 Dec 2013 Jan 2014 Feb 2014 Mar 2014 Apr 2014 May 2014 Jun 2014 Jul 2014 Aug 2014 Sep 2014 YTD Prv

AUSTRALIAN INFLUENZA SURVEILLANCE REPORT€¦ · 3/01/2014 17/01/2014 31/01/2014 14/02/2014 28/02/2014 14/03/2014 28/03/2014 11/04/2014 25/04/2014 9/05/2014 23/05/2014 6/06/2014 20/06/2014

2014 Annual Report Abridged Version - Credit Suisse€¦ · 2014 2014 2014 2014 2014 2014 2014 2014 2014 2014 2013 2013 2013 2013 2013 2013 2013 2013 2013 2013 Key Figures At year-end,