
Supervised Machine Learning: Learning SVMs and Deep Learning

Klaus-Robert Müller

et al.

Today's Tutorial

Machine Learning

• introduction: ingredients for ML

• Kernel Methods and Deep Networks, with explanations & remarks

Applications of ML to Physics & Materials

• representation

• models

• remarks

Machine Learning in a nutshell

Typical scenario: learning from data

• given data set X and labels Y (generated by some joint probability distribution p(x,y))

• LEARN/INFER underlying unknown mapping

Y = f(X)

Example: left and right imagery...

BUT: how to do this optimally with good performance on unseen data?

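As a toy illustration of this learning-from-data setup (the 1-d regression problem, the ridge-regression learner, and the split sizes below are made up for this sketch): fit f on a training sample drawn from p(x,y), then judge it on held-out data, since performance on unseen data is what matters.

```python
import numpy as np

rng = np.random.default_rng(0)

# toy joint distribution p(x, y): y = sin(x) + noise
X = rng.uniform(-3, 3, size=200)
Y = np.sin(X) + 0.1 * rng.standard_normal(200)

# train / held-out split: generalization is judged on data not used for fitting
X_tr, Y_tr = X[:150], Y[:150]
X_te, Y_te = X[150:], Y[150:]

# simple learner: ridge regression on polynomial features
def features(x, degree=5):
    return np.vander(x, degree + 1)

Phi_tr = features(X_tr)
lam = 1e-3                                   # regularization strength (illustrative)
w = np.linalg.solve(Phi_tr.T @ Phi_tr + lam * np.eye(Phi_tr.shape[1]),
                    Phi_tr.T @ Y_tr)

def f(x):                                    # the learned mapping Y = f(X)
    return features(x) @ w

print("train MSE:", np.mean((f(X_tr) - Y_tr) ** 2))
print("test  MSE:", np.mean((f(X_te) - Y_te) ** 2))
```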

Kernel-based Learning

Basic ideas in learning theory

Basic ideas in learning theory II

Structural Risk Minimization: the picture

VC Dimension: an example

Linear Hyperplane Classifier

VC Theory applied to hyperplane classifiers

Feature Spaces & curse of dimensionality

Margin Distributions – large margin hyperplanes

Feature Spaces & curse of dimensionality

Nonlinear Algorithms in Feature Space

The kernel trick: an example

Kernology

Kernology II


Dual Problem

Kernel Trick

good theory

non-linear decision by implicitly mapping the data into feature space via the SV kernel function K,

resp. K(x,y) = Φ(x) · Φ(y)
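A minimal numerical sketch of this identity (the degree-2 polynomial kernel and the explicit map Φ below are chosen purely for illustration): the kernel value equals the inner product of explicitly mapped points, so algorithms that only need inner products never have to construct Φ.

```python
import numpy as np

def phi(x):
    """Explicit degree-2 monomial feature map for 2-d input x = (x1, x2)."""
    x1, x2 = x
    return np.array([x1 * x1, np.sqrt(2) * x1 * x2, x2 * x2])

def k(x, y):
    """Polynomial kernel: the same inner product, computed without phi."""
    return (x @ y) ** 2

rng = np.random.default_rng(1)
x, y = rng.standard_normal(2), rng.standard_normal(2)

print(phi(x) @ phi(y))   # explicit feature-space inner product
print(k(x, y))           # same value via the kernel trick
```

In an SVM the dual problem only involves such inner products, so the non-linear decision function can be evaluated without ever computing the feature map explicitly.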

Support Vector Machines in a nutshell

Digestion: Use of kernels


[Mika et al. 02]

[SSM et al. 98]

[SSM et al. 98]

[Zien et al. 00, Tsuda et al. 02, Sonnenburg et al. 05]

Kernels for graphs, trees, strings etc.

Remark: Kernelizing linear algorithms

Digestion

Neural Networks

What is a deep network?

Training a (deep) Neural Network

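A minimal numpy sketch of what training such a network involves (the two-hidden-layer architecture, toy data, and learning rate below are illustrative choices, not the tutorial's): a forward pass through stacked affine layers with nonlinearities, a scalar loss, and backpropagated gradient-descent updates of all weights.

```python
import numpy as np

rng = np.random.default_rng(0)

# toy regression data
X = rng.standard_normal((256, 10))
y = np.sin(X).sum(axis=1, keepdims=True)

# two-hidden-layer network: 10 -> 64 -> 64 -> 1
sizes = [10, 64, 64, 1]
W = [rng.standard_normal((m, n)) * np.sqrt(2.0 / m)
     for m, n in zip(sizes[:-1], sizes[1:])]
b = [np.zeros((1, n)) for n in sizes[1:]]
lr = 1e-3

for step in range(2000):
    # forward pass (tanh hidden units, linear output)
    a = [X]
    for l in range(len(W)):
        z = a[-1] @ W[l] + b[l]
        a.append(np.tanh(z) if l < len(W) - 1 else z)
    loss = np.mean((a[-1] - y) ** 2)

    # backward pass: propagate the error signal layer by layer
    delta = 2 * (a[-1] - y) / len(X)
    for l in reversed(range(len(W))):
        gW = a[l].T @ delta
        gb = delta.sum(axis=0, keepdims=True)
        if l > 0:
            # tanh'(z) = 1 - tanh(z)^2, and a[l] = tanh(z) for hidden layers
            delta = (delta @ W[l].T) * (1 - a[l] ** 2)
        W[l] -= lr * gW
        b[l] -= lr * gb

print("final training MSE:", loss)
```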

ML4Physics @IPAM 2011: Part I

Klaus-Robert Müller, Matthias Rupp

Anatole von Lilienfeld and Alexandre Tkatchenko

Machine Learning for chemical compound space

Ansatz: predict the energy directly as a function of the molecular structure,

instead of solving the electronic-structure problem for every compound

[from von Lilienfeld]

GDB-13 database of all organic molecules (within stability & synthetic constraints) of 13 heavy atoms or fewer: 0.9B compounds

Blum & Reymond, JACS (2009)

The data

[from von Lilienfeld]

Coulomb representation of molecules

M_ii = 0.5 * Z_i^2.4

M_ij = Z_i * Z_j / |R_i - R_j|   (i ≠ j)

[Figure: molecule encoded as tuples {Z_1,R_1}, {Z_2,R_2}, {Z_3,R_3}, {Z_4,R_4}, ... of nuclear charges and positions, padded with phantom atoms {0,R_21}, {0,R_22}, {0,R_23}]

Coulomb Matrix (Rupp, Müller et al 2012, PRL)

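A short sketch of assembling the Coulomb matrix from nuclear charges Z_i and positions R_i (the padding size of 23 atoms and the water-like example are illustrative):

```python
import numpy as np

def coulomb_matrix(Z, R, n_max=23):
    """Coulomb matrix: M_ii = 0.5 * Z_i**2.4, M_ij = Z_i*Z_j / |R_i - R_j|,
    padded with 'phantom atoms' (Z = 0) up to n_max atoms."""
    Z = np.asarray(Z, dtype=float)
    R = np.asarray(R, dtype=float)
    n = len(Z)
    M = np.zeros((n_max, n_max))
    for i in range(n):
        for j in range(n):
            if i == j:
                M[i, j] = 0.5 * Z[i] ** 2.4
            else:
                M[i, j] = Z[i] * Z[j] / np.linalg.norm(R[i] - R[j])
    return M

# example: a water-like molecule (charges and positions in atomic units, illustrative)
Z = [8, 1, 1]
R = [[0.0, 0.0, 0.0], [0.0, 1.4, 1.1], [0.0, -1.4, 1.1]]
M = coulomb_matrix(Z, R)

# one permutation-invariant variant of the descriptor: the sorted eigenvalue spectrum
eigenspectrum = np.sort(np.linalg.eigvalsh(M))[::-1]
```

The sorted eigenspectrum is the "kernels + eigenspectrum" variant referred to in the results below.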

Kernel ridge regression

Distances between M define Gaussian kernel matrix K

Predict energy as a sum over weighted Gaussians,

using weights that minimize the error on the training set

Exact solution

As many parameters as molecules, plus 2 global parameters: the characteristic length scale or kT of the system (σ) and the noise level (λ)

[from von Lilienfeld]
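A minimal kernel ridge regression sketch along these lines (descriptor distances, variable names, and the σ and λ values are illustrative): the Gaussian kernel matrix K is built from distances between the descriptors, the exact solution for the weights is α = (K + λI)⁻¹ E, and a new energy is predicted as a weighted sum of Gaussians over the training molecules.

```python
import numpy as np

def gaussian_kernel(D, sigma):
    """Kernel matrix from a matrix of pairwise descriptor distances D."""
    return np.exp(-D ** 2 / (2 * sigma ** 2))

def fit_krr(D_train, E_train, sigma, lam):
    """Exact KRR solution: alpha = (K + lambda*I)^-1 E."""
    K = gaussian_kernel(D_train, sigma)
    return np.linalg.solve(K + lam * np.eye(len(E_train)), E_train)

def predict_krr(D_test_train, alpha, sigma):
    """Predicted energy = sum over training molecules of alpha_i * Gaussian."""
    return gaussian_kernel(D_test_train, sigma) @ alpha

# illustrative usage with random descriptors standing in for Coulomb matrices
rng = np.random.default_rng(0)
X_train, X_test = rng.standard_normal((100, 23)), rng.standard_normal((10, 23))
E_train = rng.standard_normal(100)

D_train = np.linalg.norm(X_train[:, None, :] - X_train[None, :, :], axis=-1)
D_test = np.linalg.norm(X_test[:, None, :] - X_train[None, :, :], axis=-1)

alpha = fit_krr(D_train, E_train, sigma=10.0, lam=1e-8)
E_pred = predict_krr(D_test, alpha, sigma=10.0)
```

σ and λ are the two global parameters mentioned above; in practice they are chosen by model selection, e.g. cross-validation on the training set.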

Remarks on Generalization and Model Selection in ML

Kernel Ridge Regression Model

Results

March 2012 (Rupp et al., PRL): 9.99 kcal/mol (kernels + eigenspectrum)

December 2012 (Montavon et al., NIPS): 3.51 kcal/mol (neural nets + Coulomb sets)

2015 (Tkatchenko): 1.3 kcal/mol

A prediction is considered chemically accurate when the MAE is below 1 kcal/mol

Dataset available at http://quantum-machine.org

Perspectives

Explaining Predictions Pixel-wise
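The idea behind pixel-wise explanations is layer-wise relevance propagation (LRP) [Bach et al. 2015]: the prediction is redistributed backwards through the network, layer by layer, in proportion to each input's contribution. The sketch below (a toy two-layer ReLU network with made-up weights, using the ε-stabilized redistribution rule) only shows the mechanics; the LRP Toolbox cited under Further Reading provides real implementations.

```python
import numpy as np

rng = np.random.default_rng(0)

# a tiny ReLU network standing in for a trained model: 4 inputs -> 6 hidden -> 1 output
W1, b1 = rng.standard_normal((4, 6)), np.zeros(6)
W2, b2 = rng.standard_normal((6, 1)), np.zeros(1)

x = rng.standard_normal(4)               # the input to be explained
a1 = np.maximum(0, x @ W1 + b1)          # hidden activations
out = a1 @ W2 + b2                       # network output (the prediction)

def lrp_eps(a, W, R_upper, eps=1e-6):
    """LRP-epsilon rule: redistribute upper-layer relevance R_upper to the
    inputs a in proportion to their contributions z_jk = a_j * w_jk."""
    z = a[:, None] * W                            # contributions, shape (in, out)
    denom = z.sum(axis=0) + eps * np.sign(z.sum(axis=0))
    return (z / denom) @ R_upper                  # relevance of each input unit

R_out = out                               # start: relevance equals the prediction
R_hidden = lrp_eps(a1, W2, R_out)
R_input = lrp_eps(x, W1, R_hidden)

print("prediction:", out)
print("input relevances (sum approx. equals prediction):", R_input, R_input.sum())
```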

Neural networks vs. kernel methods

Application: Comparing Classifiers

[Bach et al. CVPR 2016]

Understanding models is only possible if we explain them

Neural networks vs. Fisher vectors

Neural Networks for Molecules revisited

Quantum Chemical Insights

[Schütt et al. under review]

Conclusion

• Machine Learning & modern data analysis are of central importance in daily life

• input to ML algorithms can be vectors, matrices, graphs, strings, tensors etc.

• Representation is essential! So are model selection and optimization.

• ML for XC, ML for reaction transitions, ML for formation-energy prediction etc.

• ML challenges from Physics: no noise, high-dimensional systems, functionals …

• challenge: learn about the Physics from the ML representation: towards better understanding

See also: www.quantum-machine.org

Further Reading

Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.-R., & Samek, W. (2015). On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE, 10(7), e0130140.

Bach, S., Binder, A., Montavon, G., Müller, K.-R., & Samek, W. (2016). Analyzing classifiers: Fisher vectors and deep neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273-297.

Lapuschkin, S., Binder, A., Montavon, G., Müller, K.-R., & Samek, W. (2016). The LRP Toolbox for Artificial Neural Networks. Journal of Machine Learning Research, 17(114), 1-5.

Baehrens, D., Schroeter, T., Harmeling, S., Kawanabe, M., Hansen, K., & Müller, K.-R. (2010). How to explain individual classification decisions. Journal of Machine Learning Research, 11, 1803-1831.

Braun, M. L., Buhmann, J. M., & Müller, K.-R. (2008). On relevant dimensions in kernel feature spaces. Journal of Machine Learning Research, 9, 1875-1908.

Müller, K.-R., Mika, S., Rätsch, G., Tsuda, K., & Schölkopf, B. (2001). An introduction to kernel-based learning algorithms. IEEE Transactions on Neural Networks, 12(2), 181-201.

Montavon, G., Braun, M. L., & Müller, K.-R. (2011). Kernel analysis of deep networks. Journal of Machine Learning Research, 12, 2563-2581.

Montavon, G., Orr, G., & Müller, K.-R. (2012). Neural Networks: Tricks of the Trade. Springer LNCS 7700, Berlin Heidelberg.

Montavon, G., Hansen, K., Fazli, S., Rupp, M., Biegler, F., Ziehe, A., Tkatchenko, A., von Lilienfeld, O. A., & Müller, K.-R. (2012). Learning invariant representations of molecules for atomization energy prediction. Advances in Neural Information Processing Systems, pp. 440-448.

Further Reading

Montavon, G., Rupp, M., Gobre, V., Vazquez-Mayagoitia, A., Hansen, K., Tkatchenko, A., Müller, K.-R., & von Lilienfeld, O. A. (2013). Machine learning of molecular electronic properties in chemical compound space. New Journal of Physics, 15(9), 095003.

Montavon, G., Braun, M., Krueger, T., & Müller, K.-R. (2013). Analyzing local structure in kernel-based learning: Explanation, complexity, and reliability assessment. IEEE Signal Processing Magazine, 30(4), 62-74.

Pozun, Z. D., Hansen, K., Sheppard, D., Rupp, M., Müller, K.-R., & Henkelman, G. (2012). Optimizing transition states via kernel-based machine learning. The Journal of Chemical Physics, 136(17), 174101.

Rupp, M., Tkatchenko, A., Müller, K.-R., & von Lilienfeld, O. A. (2012). Fast and accurate modeling of molecular atomization energies with machine learning. Physical Review Letters, 108(5), 058301.

Schölkopf, B., Smola, A., & Müller, K.-R. (1998). Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation, 10(5), 1299-1319.

Schütt, K. T., Glawe, H., Brockherde, F., Sanna, A., Müller, K.-R., & Gross, E. K. U. (2014). How to represent crystal structures for machine learning: Towards fast prediction of electronic properties. Physical Review B, 89, 205118.

Snyder, J. C., Rupp, M., Hansen, K., Müller, K.-R., & Burke, K. (2012). Finding density functionals with machine learning. Physical Review Letters, 108(25), 253002.

Sturm, I., Bach, S., Samek, W., & Müller, K.-R. (2016). Interpretable deep neural networks for single-trial EEG classification. arXiv preprint arXiv:1604.08201.

Vapnik, V. N. (1995). The Nature of Statistical Learning Theory. Springer.