
Extensions to message-passing inference

S. M. Ali Eslami

September 2014


Outline

Just-in-time learning for message-passing (with Daniel Tarlow, Pushmeet Kohli, John Winn)

Deep RL for ATARI games (with Arthur Guez, Thore Graepel)

Contextual initialisation for message-passing (with Varun Jampani, Daniel Tarlow, Pushmeet Kohli, John Winn)

Hierarchical RL for automated driving (with Diana Borsa, Yoram Bachrach, Pushmeet Kohli and Thore Graepel)

Team modelling for learning of traits (with Matej Balog, James Lucas, Daniel Tarlow, Pushmeet Kohli and Thore Graepel)


Probabilistic programming

• Programmer specifies a generative model

• Compiler automatically creates code for inference in the model
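The split between the two roles can be illustrated with a small sketch. This is plain Python rather than Infer.NET, and both the toy coin-flip model and the grid-based inference routine are assumptions for illustration only: the modeller writes the generative story, and a generic routine returns the posterior.

```python
# Illustration only (plain Python, not Infer.NET): the modeller writes the generative
# story; a generic inference routine (here, a grid approximation) returns the posterior.
import numpy as np

def generative_model(theta, n):
    """Generative story: a coin with unknown bias theta is flipped n times."""
    rng = np.random.default_rng(0)
    return rng.random(n) < theta

def infer_posterior(data):
    """Generic inference: grid-approximate the posterior over theta (uniform prior)."""
    grid = np.linspace(0.001, 0.999, 999)
    k, n = int(data.sum()), len(data)
    log_lik = k * np.log(grid) + (n - k) * np.log(1.0 - grid)
    post = np.exp(log_lik - log_lik.max())
    return grid, post / post.sum()

data = generative_model(theta=0.7, n=100)
grid, post = infer_posterior(data)
print("posterior mean of theta:", float(np.sum(grid * post)))
```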


Probabilistic graphics programming?


Challenges

• Specifying a generative model that is accurate and useful

• Compiling an inference algorithm for it that is efficient


Generative probabilistic models for vision – manually designed inference

FSA (BMVC 2011)

SBM (CVPR 2012)

MSBM (NIPS 2013)


Why is inference hard?

Sampling

Inference can mix slowly. Active area of research.

Message-passing

Computation of messages can be slow (e.g. if using quadrature or sampling). Just-in-time learning (part 1).

Inference can require many iterations and may converge to bad fixed points. Contextual initialisation (part 2).


Just-In-Time Learning for Inference (with Daniel Tarlow, Pushmeet Kohli, John Winn)

NIPS 2014


Motivating example

Ecologists have strong empirical beliefs about the form of the relationship between temperature and yield.

It is important for them that the relationship is modelled faithfully.

We do not have a fast implementation of the Yield factor in Infer.NET.


Problem overview

Implementing a fast and robust factor is not always trivial.

Approach

1. Use general algorithms (e.g. Monte Carlo sampling or quadrature) to compute message integrals.

2. Gradually learn to increase the speed of computations by regressing from incoming to outgoing messages at run-time.
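A minimal sketch of step 1, under assumed forms: a logistic factor with a single Gaussian incoming message, with the message integral estimated by self-normalised importance sampling and projected to a Gaussian as in EP. None of these specific choices come from the talk.

```python
# Sketch only: estimate the moments of proj[ psi(y|x) * m_in(x) ] by Monte Carlo,
# where psi is a logistic factor and m_in is a Gaussian incoming message.
# (The full EP message would also divide out the incoming message; omitted here.)
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def tilted_moments(mu_in, var_in, y, n_samples=100_000):
    rng = np.random.default_rng(0)
    x = rng.normal(mu_in, np.sqrt(var_in), size=n_samples)   # samples from m_in
    w = sigmoid(x) if y == 1 else 1.0 - sigmoid(x)            # factor weights
    w = w / w.sum()                                           # self-normalise
    mean = float(np.sum(w * x))
    var = float(np.sum(w * (x - mean) ** 2))
    return mean, var                                          # Gaussian projection

print(tilted_moments(mu_in=0.0, var_in=4.0, y=1))
```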


Message-passing

[Figure: a factor graph over variables a, b, c, d and e. For a factor connecting a, b and c, each outgoing message (to a, to b or to c) is computed from the group of incoming messages on the factor's other edges.]


Belief and expectation propagation

[Figure: a factor Ψ connected to variables i, k1 and k2, illustrating the message sent from Ψ to i.]
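The picture corresponds to the standard belief propagation and expectation propagation updates. The following is a reconstruction in conventional notation; the three-variable factor Ψ(x_i, x_{k1}, x_{k2}) is read off the figure, and everything else is the textbook form, not copied from the slide.

```latex
% Reconstruction of the standard factor-to-variable updates for a factor
% \Psi(x_i, x_{k_1}, x_{k_2}); notation assumed, not copied from the slide.
\begin{align*}
  \text{BP:}\quad m_{\Psi \to i}(x_i)
    &= \int \Psi(x_i, x_{k_1}, x_{k_2})\,
       m_{k_1 \to \Psi}(x_{k_1})\, m_{k_2 \to \Psi}(x_{k_2})\,
       \mathrm{d}x_{k_1}\, \mathrm{d}x_{k_2} \\
  \text{EP:}\quad m_{\Psi \to i}(x_i)
    &\propto \frac{\operatorname{proj}\!\Big[\displaystyle\int \Psi(x_i, x_{k_1}, x_{k_2})
       \prod_{k \in \{i, k_1, k_2\}} m_{k \to \Psi}(x_k)\,
       \mathrm{d}x_{k_1}\, \mathrm{d}x_{k_2}\Big]}{m_{i \to \Psi}(x_i)}
\end{align*}
```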


How to compute messages for any factor?


Learning to pass messages

The oracle allows us to compute all messages for any factor of interest.

However, sampling can be very slow. Instead, learn a direct, parameterised mapping from incoming to outgoing messages (sketched below).

Heess, Tarlow and Winn (2013)
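In the same notation as the factor-graph reconstruction above (the regressor symbol f_θ is an assumption, following the spirit of Heess, Tarlow and Winn, 2013), the learned mapping sends the group of incoming messages directly to the outgoing message:

```latex
% The learned mapping, in assumed notation: a parametric regressor f_\theta
% replaces the oracle's sampled message.
\[
  m_{\Psi \to i} \;\approx\; f_{\theta}\big(m_{i \to \Psi},\, m_{k_1 \to \Psi},\, m_{k_2 \to \Psi}\big)
\]
```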


Learning to pass messages

Before inference
• Create a dataset of plausible incoming message groups.
• Compute outgoing messages for each group using the oracle.
• Employ a regressor to learn the mapping.

During inference
Given a group of incoming messages:
• Use the regressor to predict the parameters of the outgoing message.

Heess, Tarlow and Winn (2013)
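A minimal sketch of the offline procedure, under assumed stand-ins: messages are Gaussians summarised as (mean, variance), the oracle is a toy Monte Carlo routine for a logistic-style factor, and an off-the-shelf scikit-learn forest plays the role of the regressor.

```python
# Sketch of the "before inference" procedure; the message summaries, the toy
# oracle and the scikit-learn regressor are all illustrative stand-ins.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

def oracle_message(mu, var, n=20_000):
    """Stand-in for the slow oracle: Monte Carlo moments of sigmoid(x) * N(x; mu, var)."""
    x = rng.normal(mu, np.sqrt(var), size=n)
    w = 1.0 / (1.0 + np.exp(-x))
    w /= w.sum()
    m = float(np.sum(w * x))
    return [m, float(np.sum(w * (x - m) ** 2))]

# 1. Create a dataset of plausible incoming message groups (here: one Gaussian message).
X = np.column_stack([rng.normal(0.0, 3.0, size=2000), rng.gamma(2.0, 1.0, size=2000)])
# 2. Compute outgoing messages for each group using the oracle.
Y = np.array([oracle_message(mu, var) for mu, var in X])
# 3. Employ a regressor to learn the incoming -> outgoing mapping.
regressor = RandomForestRegressor(n_estimators=100).fit(X, Y)
# During inference, regressor.predict(...) replaces the expensive oracle call.
```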


Logistic regression


Logistic regression – 4 random UCI datasets


Learning to pass messages – an alternative approach

Before inference
• Do nothing.

During inference
Given a group of incoming messages:
• If unsure:
  • Consult the oracle for the answer and update the regressor.
• Otherwise:
  • Use the regressor to predict the parameters of the outgoing message.

Just-in-time learning
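A sketch of that decision rule in code. The uncertainty measure and the threshold u_max are assumptions (the talk lists both as open questions), and the regressor's predict/update interface is hypothetical.

```python
# Hypothetical interface: `regressor` returns a prediction plus an uncertainty score,
# `oracle` is the slow exact operator, and `u_max` is the consultation threshold.
def jit_outgoing_message(group, regressor, oracle, u_max):
    prediction, uncertainty = regressor.predict_with_uncertainty(group)
    if uncertainty > u_max:              # unsure: consult the oracle for the answer
        target = oracle(group)
        regressor.update(group, target)  # and learn from the consultation
        return target
    return prediction                    # otherwise: trust the learned mapping
```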


Learning to pass messages

Need an uncertainty-aware regressor: one that returns both a predicted outgoing message and a measure of confidence in that prediction.

Then: consult the oracle only when the confidence is too low, and use the prediction otherwise.

Just-in-time learning


Random decision forests for JIT learning

Tree 1, Tree 2, …, Tree T


Random decision forests for JIT learning – prediction model

Tree 1, Tree 2, …, Tree T


Random decision forests for JIT learning

Could take the element-wise average of the parameters predicted by the trees and reverse to obtain the outgoing message.

This is sensitive to the chosen parameterisation.

Instead, compute the moment average of the predicted distributions.

Ensemble model
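A minimal sketch of moment averaging, assuming the trees predict Gaussian outgoing messages summarised as (mean, variance); averaging the first two moments, unlike averaging parameters, does not depend on the chosen parameterisation.

```python
# Moment averaging of per-tree predictions, assuming Gaussian outgoing messages
# summarised as (mean, variance).
import numpy as np

def moment_average(means, variances):
    """Combine T Gaussian predictions by averaging their first and second moments."""
    means, variances = np.asarray(means), np.asarray(variances)
    m = means.mean()                              # average first moment
    second = (variances + means ** 2).mean()      # average second (raw) moment
    return m, second - m ** 2                     # back to (mean, variance)

print(moment_average(means=[0.9, 1.1, 1.0], variances=[0.20, 0.30, 0.25]))
```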


Random decision forests for JIT learning

Use degree of agreement in predictions as a proxy for uncertainty.

If all trees predict the same output, it means that their knowledge about the mapping is similar despite the randomness in their structure.

Conversely, if there is large disagreement between the predictions, then the forest has high uncertainty.

Uncertainty model
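The exact agreement measure is left open in the talk. One plausible choice, consistent with the KL-based training objective described later, is the mean symmetrised KL divergence between each tree's (assumed Gaussian) prediction and the ensemble's moment average; this is an illustrative assumption, not the talk's definition.

```python
# One plausible agreement measure: mean symmetrised KL between each tree's Gaussian
# prediction and the ensemble's moment average. Small = agreement, large = uncertain.
import numpy as np

def kl_gauss(m1, v1, m2, v2):
    """KL( N(m1, v1) || N(m2, v2) ) for univariate Gaussians."""
    return 0.5 * (np.log(v2 / v1) + (v1 + (m1 - m2) ** 2) / v2 - 1.0)

def forest_uncertainty(means, variances, m_avg, v_avg):
    sym = [kl_gauss(m, v, m_avg, v_avg) + kl_gauss(m_avg, v_avg, m, v)
           for m, v in zip(means, variances)]
    return float(np.mean(sym))

print(forest_uncertainty([0.9, 1.1, 1.0], [0.20, 0.30, 0.25], m_avg=1.0, v_avg=0.26))
```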


Random decision forests for JIT learning – 2 feature samples per node, maximum depth 4, regressor degree 2, 1,000 trees


Random decision forests for JIT learning

Compute the moment average of the predicted distributions.

Use the degree of agreement in predictions as a proxy for uncertainty.

Ensemble model


Random decision forests for JIT learning – training objective function

• How good is a prediction? Consider its effect on the induced belief on the target random variable.
• Focus on the quantity of interest: accuracy of posterior marginals.
• Train trees to partition the training data in a way that the relationship between incoming and outgoing messages is well captured by regression, as measured by the symmetrised marginal KL.
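Written out in standard notation (symbols assumed; the slide states the objective only in words), the induced beliefs and the symmetrised-KL score are:

```latex
% b is the belief induced by the oracle's message, \hat{b} the belief induced by the
% regressor's prediction; predictions are scored by the symmetrised marginal KL.
\begin{gather*}
  b(x_i) \propto m_{\Psi \to i}(x_i)\, m_{i \to \Psi}(x_i), \qquad
  \hat{b}(x_i) \propto \hat{m}_{\Psi \to i}(x_i)\, m_{i \to \Psi}(x_i), \\
  \mathcal{L} \;=\; \mathrm{KL}\!\big(b \,\big\|\, \hat{b}\big)
              \;+\; \mathrm{KL}\!\big(\hat{b} \,\big\|\, b\big)
\end{gather*}
```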

Results


Logistic regression


Uncertainty-aware regression of a logistic factor – are the forests accurate?


Uncertainty-aware regression of a logistic factor – are the forests uncertain when they should be?


Just-in-time learning of a logistic factor – oracle consultation rate


Just-in-time learning of a logistic factor – inference time


Just-in-time learning of a logistic factor – inference error


Just-in-time learning of a compound gamma factor


A model of corn yield


Just-in-time learning of a yield factor


Summary

• Speed up message-passing inference using JIT learning:
  • Savings in human time (no need to implement factor operators).
  • Savings in computer time (reduce the amount of computation).

• JIT can even accelerate hand-coded message operators.

Open questions
• Better measure of uncertainty?
• Better methods for choosing u_max?


Contextual Initialisation Machines (with Varun Jampani, Daniel Tarlow, Pushmeet Kohli, John Winn)


Gauss and Ceres – a deceptively simple problem


A point model of circles


A point model of circles – initialisation makes a big difference


What’s going on? A common motif in vision models

Global variables in each layer

Multiple layers

Many variables per layer


Possible solutions – structured inference

• Fully-factorised representation: messages easy to compute, but lots of loops.

• Structure within layers: no loops within layers, but lots of loops across layers and messages difficult to compute.

• Structure across layers too: no loops, but messages difficult to compute and complex messages between layers.


Contextual initialisation – structured accuracy without structured cost

Observations

• Beliefs about global variables are approximately predictable from the layer below.

• Stronger beliefs about global variables lead to increased quality of messages to the layer above.

Strategy

• Learn to send global messages in first iteration.

• Keep using fully factorised model for layer messages.
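As a concrete illustration of the first strategy point, the following is the kind of bottom-up predictor that could produce the first-iteration messages for the global variables in the circle example that follows. The particular choice of an algebraic (Kåsa) circle fit is an assumption for illustration; the talk only says that this prediction is learned.

```python
# Bottom-up guess that could seed beliefs over the global variables (centre, radius)
# of a circle from the observed points. The Kasa fit is an illustrative assumption.
import numpy as np

def initial_circle_guess(points):
    """Least-squares circle fit: returns (centre_x, centre_y, radius)."""
    x, y = points[:, 0], points[:, 1]
    A = np.column_stack([2 * x, 2 * y, np.ones(len(x))])
    b = x ** 2 + y ** 2
    (cx, cy, c), *_ = np.linalg.lstsq(A, b, rcond=None)
    return cx, cy, np.sqrt(c + cx ** 2 + cy ** 2)

# Noisy points on a circle of radius 2 centred at (1, -1):
rng = np.random.default_rng(0)
t = rng.uniform(0.0, 2 * np.pi, size=50)
pts = np.column_stack([1 + 2 * np.cos(t), -1 + 2 * np.sin(t)]) + rng.normal(0, 0.05, size=(50, 2))
print(initial_circle_guess(pts))   # e.g. used to initialise beliefs over centre and radius
```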


A point model of circles


A point model of circles – accelerated inference using contextual initialisation

[Plots: inference progress for Centre and Radius.]


A pixel model of squares


A pixel model of squares – robustified inference using contextual initialisation



[Plots: inference progress for Side length and Center.]


[Plots: inference progress for FG Color and BG Color.]


A generative model of shading (with Varun Jampani)

[Model variables: image X, reflectance R, shading S, normal N, light L.]
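The slide lists only the variables. A standard intrinsic-image factorisation with Lambertian shading that is consistent with that list looks like the following; the exact form used in the model is not shown in the transcript, so this is an assumption.

```latex
% Assumed Lambertian / intrinsic-image style factorisation over the listed variables
% (per pixel p): shading from normals and light, image from reflectance and shading.
\begin{gather*}
  S(p) \;=\; \max\!\big(0,\; N(p) \cdot L\big), \qquad
  X(p) \;=\; R(p)\, S(p) + \varepsilon(p), \qquad
  \varepsilon(p) \sim \mathcal{N}(0, \sigma^2)
\end{gather*}
```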


A generative model of shading – inference progress with and without context


A generative model of shading – fast and accurate inference using contextual initialisation


Summary

• Bridging the gap between Infer.NET and generative computer vision.
• Initialisation makes a big difference.
• The inference algorithm can learn to initialise itself.

Open questions
• What is the best formulation of this approach?
• What are the trade-offs between inference and prediction?

Questions