Reactive On-line Machine Learning with Akka...

1 March 2017

Reactive On-line Machine Learning with Akka Streams

Jan Pustelnik & Kamil Owczarek

We all love retro, don’t we?

Reactive Streams Made Easy

BACKPRESSURE

SOURCE FLOW SINK

Reactive Fast Data with Akka!

Akka lets you put your on-line / streaming data structure or algorithm in context: you don’t need to think about how to handle data flow and backpressure.

Akka’s well-thought-out architecture lets you concentrate on what is relevant to your problem domain.

Akka is geared towards high performance and can be used in an IoT setup, where e.g. Spark Streaming would not fit.

It is easy to port e.g. machine learning algorithms from other streaming setups (like Spark Streaming) to Akka.

Backpressure so retro! (1981)

Retro is future-proof! (2001)

SEDA is fast because it is asynchronous, has buffers, and its stages are single-threaded

On-line algorithms / data structures (1992)

Example: Kadane algorithm (on-line, streaming, 1984)

Source: https://en.wikipedia.org/wiki/Maximum_subarray_problem

(…) the maximum subarray problem is the task of finding the contiguous subarray within a one-dimensional array of numbers which has the largest sum. For example, for the sequence of values −2, 1, −3, 4, −1, 2, 1, −5, 4; the contiguous subarray with the largest sum is 4, −1, 2, 1, with sum 6.
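The on-line nature of Kadane's algorithm can be sketched in a few lines. This is a plain Python illustration, not the Akka stage from the talk; the name `kadane_stream` is made up for this example. It consumes one element at a time and emits the best subarray sum seen so far, keeping only two numbers of state.

```python
from typing import Iterable, Iterator


def kadane_stream(values: Iterable[float]) -> Iterator[float]:
    """After each incoming element, yield the largest subarray sum seen so far."""
    best_ending_here = 0            # best sum of a subarray ending at the current element
    best_so_far = float("-inf")     # best sum over everything seen so far
    for x in values:
        # Either extend the previous subarray or start a new one at x.
        best_ending_here = max(x, best_ending_here + x)
        best_so_far = max(best_so_far, best_ending_here)
        yield best_so_far


# The sequence from the Wikipedia example quoted above.
running = list(kadane_stream([-2, 1, -3, 4, -1, 2, 1, -5, 4]))
print(running[-1])  # 6
```

Because the state is constant-size and updated per element, the same logic fits naturally inside a single stateful stream stage.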

KadaneFlowStage

IN OUT

Akka Plumbing

Stateful Kadane Logic

Flow Shape (plumbing)

Flow Shape – output handler (plumbing)

Better not fail silently.

Flow stage – “Business logic”

Proper Kadane algo logic

Let the flow flow…

Bloom filter (on-line, streaming, 1970)

BLOOM DICT

√ / X? / X

It is easy to create new shapes, but you can (re)use existing ones

Tripod? Just like in the old days

Remember your Topology class? A shape is just a shape…

Bloom filter – CrossShape!

DATA BLOOM

Remember, a shape is just a shape

       +---------+
In1 ~> |  cross  | ~> Out1
       +---------+
            |
            v

BloomFilterCrossStage, ftw!

Two crossing flows… Common shared state… Single thread!
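The idea of two flows sharing one piece of state can be sketched without any Akka plumbing. The following is a minimal Bloom filter in plain Python, written under assumptions: the class name, the default sizes, and the SHA-256-based hashing are illustrative choices, not the implementation shown in the talk. One flow would call `add` (dictionary words coming in), the other would call `might_contain` (data to be checked); because both run on a single thread, the shared bit array needs no locking.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter: may give false positives, never false negatives."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for readability

    def _positions(self, item: str):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos] = 1

    def might_contain(self, item: str) -> bool:
        # False means "definitely not seen"; True means "possibly seen".
        return all(self.bits[pos] for pos in self._positions(item))


bloom = BloomFilter()
for word in ["akka", "streams"]:   # the "dictionary" flow
    bloom.add(word)

print(bloom.might_contain("akka"))  # True: added items are always reported
```

A negative answer is definitive, while a positive answer only means the item may have been added, which is why a Bloom filter is typically placed in front of a slower exact lookup.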

Machine Learning with Akka streams!

Online ML models

ON-LINE MACHINE LEARNING

ADVERSARIAL MODELS STATISTICAL MODELS

Statistical Models

Idea: the input variable (X) and the predicted variable (Y) come from a probability distribution p(X, Y)

Aim: predict Y as well as possible; the prediction is denoted Pr(Y)

Cost function: the cost of an error, V(Y, Pr(Y))

Generalized solution: minimize $E[V(Y, \Pr(Y))] = \int V(Y, \Pr(Y)) \, dp(X, Y)$

Plugging in different V functions gives familiar ML algorithms: Linear Regression, SVM etc.
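To make the role of V concrete, here is a small Python sketch that approximates the expected cost by an average over samples, which is an assumption of this example since the true distribution p(X, Y) is not known in practice. Squared loss is the cost behind least-squares linear regression and hinge loss is the cost behind SVMs; the function names and the toy data are made up for illustration.

```python
def squared_loss(y: float, pred: float) -> float:
    """Cost function behind least-squares linear regression."""
    return (y - pred) ** 2


def hinge_loss(y: float, pred: float) -> float:
    """Cost function behind SVMs; y is expected to be -1 or +1."""
    return max(0.0, 1.0 - y * pred)


def empirical_cost(loss, samples, predict) -> float:
    """Sample average of V(Y, Pr(Y)), standing in for the expectation."""
    costs = [loss(y, predict(x)) for x, y in samples]
    return sum(costs) / len(costs)


# Hypothetical toy data: (x, y) pairs and a fixed linear predictor.
samples = [(1.0, 1.0), (2.0, 1.0), (-1.0, -1.0)]
predict = lambda x: 0.5 * x

squared_cost = empirical_cost(squared_loss, samples, predict)
hinge_cost = empirical_cost(hinge_loss, samples, predict)
print(squared_cost, hinge_cost)
```

The same predictor is scored differently under each V, so minimizing over predictors leads to a different model for each choice of cost.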

Adversarial Models

• Not frequently mentioned outside the scientific community/conferences!

• Problem as a game between the learner and nature:

1. Learner sees input X(i)

2. Learner “makes his move”: predicts output Pr[Y(i)]

3. Nature sees X(i) and Pr[Y(i)] and “makes a move”, emitting the actual output Y(i)

4. Learner “suffers a loss”: V[Y(i), Pr[Y(i)]]

• Important element: nature’s reaction can depend on prediction

• Actual games, asset trading, varying cost evaluation
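The game described above can be sketched as a simple loop. This is an illustrative Python example with made-up names (`play`, `LastOutcomeLearner`, `contrarian_nature`); the learner's update step after each round is an assumption added here so that the learner can actually learn. The nature function deliberately looks at the prediction, which is the element that separates this setting from the statistical one.

```python
def play(learner, nature, loss, inputs) -> float:
    """Run the learner-vs-nature game and return the learner's total loss."""
    total = 0.0
    for x in inputs:
        pred = learner.predict(x)   # steps 1-2: learner sees X(i) and predicts
        y = nature(x, pred)         # step 3: nature sees both and emits Y(i)
        total += loss(y, pred)      # step 4: learner suffers a loss
        learner.update(x, y)        # assumed: learner adapts before the next round
    return total


class LastOutcomeLearner:
    """Hypothetical learner: predicts whatever nature emitted last time."""

    def __init__(self) -> None:
        self.last = 0

    def predict(self, x):
        return self.last

    def update(self, x, y) -> None:
        self.last = y


def contrarian_nature(x, pred):
    """Adversarial nature: always emits the opposite of the prediction."""
    return 1 - pred


def zero_one_loss(y, pred) -> float:
    return 0.0 if y == pred else 1.0


total_loss = play(LastOutcomeLearner(), contrarian_nature, zero_one_loss, range(5))
print(total_loss)  # 5.0: this learner is wrong in every round
```

Against a fully contrarian nature, any deterministic learner loses every round, which is why results in this setting are usually stated relative to the best fixed hypothesis rather than in absolute terms.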

Recursive Least Squares

• We all know the “least squares” metric from school, right?

• O(dn^3) memory complexity

• The formula is recursive:

• Recursive = on-line = O(dn^2) memory complexity
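The recursive formula from the slide did not survive in this transcript. As a stand-in, the following Python sketch implements the textbook recursive least squares update without a forgetting factor; it may differ in detail from what was presented. The class name and the `delta` initialization constant are assumptions of this example. The state kept between samples is one d×d matrix and one weight vector, independent of how many samples have been seen.

```python
class RecursiveLeastSquares:
    """Textbook RLS: refines a linear model one (x, y) sample at a time."""

    def __init__(self, dim: int, delta: float = 1e6) -> None:
        self.w = [0.0] * dim
        # P approximates the inverse of X^T X; a large delta means a weak prior.
        self.P = [[delta if i == j else 0.0 for j in range(dim)] for i in range(dim)]

    def predict(self, x) -> float:
        return sum(wi * xi for wi, xi in zip(self.w, x))

    def update(self, x, y) -> None:
        d = len(self.w)
        Px = [sum(self.P[i][j] * x[j] for j in range(d)) for i in range(d)]
        denom = 1.0 + sum(x[i] * Px[i] for i in range(d))
        gain = [v / denom for v in Px]
        error = y - self.predict(x)          # error of the current model on the new sample
        self.w = [self.w[i] + gain[i] * error for i in range(d)]
        # P stays symmetric, so x^T P equals the transpose of Px.
        self.P = [[self.P[i][j] - gain[i] * Px[j] for j in range(d)] for i in range(d)]


# Noise-free samples generated from y = 2*x1 + 3*x2.
rls = RecursiveLeastSquares(dim=2)
for x, y in [((1.0, 0.0), 2.0), ((0.0, 1.0), 3.0), ((1.0, 1.0), 5.0), ((2.0, 1.0), 7.0)]:
    rls.update(x, y)

print([round(w, 3) for w in rls.w])  # approximately [2.0, 3.0]
```

Each call to `update` is a natural fit for a stateful stream stage: one sample in, one refined model (or prediction) out.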

Recursive Least Squares

Follow The Leader

Adversarial online ML algorithm

Not very complex

Pick the hypothesis that has performed best so far

Paradoxically: good for bounded loss

Careful investment, medical costs evaluation

Can be regularized for a broader set of applications
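A minimal, unregularized Follow The Leader can be sketched as follows. This Python example assumes a finite set of candidate hypotheses and uses made-up names and toy data; it is an illustration of the idea, not the code from the talk. In each round it predicts with the hypothesis whose cumulative loss so far is lowest, then charges every hypothesis for that round.

```python
def follow_the_leader(hypotheses, stream, loss):
    """Return (learner's total loss, cumulative loss of each hypothesis)."""
    cumulative = [0.0] * len(hypotheses)
    total = 0.0
    for x, y in stream:
        # The leader is the hypothesis with the lowest loss so far (ties: first one).
        leader = min(range(len(hypotheses)), key=lambda i: cumulative[i])
        total += loss(y, hypotheses[leader](x))
        # Every hypothesis is scored on this round, whether or not it was followed.
        for i, h in enumerate(hypotheses):
            cumulative[i] += loss(y, h(x))
    return total, cumulative


# Hypothetical candidates and a stream generated by y = 2*x.
hypotheses = [lambda x: 0.0, lambda x: x, lambda x: 2 * x]
stream = [(1, 2), (2, 4), (3, 6)]
squared = lambda y, pred: (y - pred) ** 2

total, cumulative = follow_the_leader(hypotheses, stream, squared)
print(total, cumulative)  # 4.0 [56.0, 14.0, 0.0]
```

The learner pays only in the first round, before any hypothesis has a track record; afterwards it locks on to the best one.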

Follow The Leader

That’s it…

https://en.wikipedia.org/wiki/Banner_Mania
