The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence...

70
The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute for Brain Research, Computer Science and Artificial Intelligence Agency, Brain and Cognitive Sciences, MIT

Transcript of The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence...

Page 1: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

The Science and the Engineering of

Intelligence

Tomaso Poggio, The MIT Quest

Center for Brains, Minds & Machines, McGovern Institute for Brain Research,

Computer Science and Artificial Intelligence Agency, Brain and Cognitive Sciences, MIT

Page 2: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

http://www.fujitsu.com/global/about/resources/news/press-releases/2018/1005-01.html

http://pr.fujitsu.com/jp/news/2018/10/5.html 

The CBMM-FLAB partnership

Page 3: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

�3

Page 4: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Overview

• Motivations: the greatest problem in science, CBMM, the MIT Quest

• A bit of history: Neuroscience and AI, Science and Engineering

• CBMM and the Quest

• AI ethics and its neural bases

• Theory: explaining how deep networks work and what are their properties and limitations.

Page 5: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

We aim to make progress in understanding intelligence, that is in understanding how the

brain makes the mind, how the brain works and how to build intelligent machines.

CBMM’s focus is the Science and the Engineering of Intelligence

Page 6: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

CBMM Overview

The Center for Brains, Minds and Machines (CBMM) is a multi-institutional NSF Science and Technology Center dedicated to the study of intelligence -

how the brain produces intelligent behavior and how we may be able to replicate intelligence in machines. We believe in the synergy between the

science and the engineering of intelligence.

Publications 397

Research Institutions ~4

Faculty (CS+BCS+…) ~23

Researchers 223

Educational Institutions 12

Funding 2013-2023 ~$50M

Machine Learning,Computer Science

Science + Engineering of Intelligence

Cognitive Science Neuroscience,Computational

Page 7: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

MIT

Research, Education & Diversity Partners

Boyden, Desimone, DiCarlo, Kanwisher, Katz, McDermott, Poggio, Rosasco, Sassanfar, Saxe, Schulz, Tegmark, Tenenbaum, Ullman, Wilson,

Winston

HarvardBlum, Gershman, Kreiman, Livingstone,

Nakayama, Sompolinsky, Spelke

Hunter College

Chodorow, Epstein,Sakas, Zeigler

Universidad Centraldel Caribe (UCC)

Jorquera

UMass BostonBlaser, Ciaramitaro,

Pomplun, Shukla

Howard U.Chouika, Manaye,

Rwebangira, Salmani

Queens College

BrumbergStanford U.

Goodman

Johns Hopkins U.

YuilleAllen Institute

Koch

Rockefeller U.Freiwald

Wellesley College

Hildreth, Wiest, Wilmer

UPR– Río PiedrasGarcia-Arraras, Maldonado-Vlaar,Megret, Ordóñez, Ortiz-Zuazaga

UPR – Mayagüez

Santiago, Vega-Riveros

University ofCentral Florida

McNair Program

Page 8: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Google

DeepMind

Academic and Corporate Partners

IITCingolani

A*starChuan Poh Lim

Hebrew U.Weiss

MPIBülthoff

Genoa U.Verri, Rosasco

WeizmannUllman

City U. HKSmale

IBM

Honda

Microsoft

BostonDynamics

Orcam

NVIDIA

Siemens

Schlumberger Mobileye Intel

Fujitsu

Page 9: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

EAC Members & MeetingsDemis Hassabis, DeepMind

Charles Isbell, Jr., Georgia Tech

Christof Koch, Allen Institute

Fei-Fei Li, Stanford

Lore McGovern, MIBR, MIT

Joel Oppenheim, NYU

Pietro Perona, Caltech

Marc Raibert, Boston DynamicsJudith Richter, Medinol

Kobi Richter, Medinol

Dan Rockmore, Dartmouth

Amnon Shashua, Mobileye

David Siegel, Two Sigma

Susan Whitehead, MIT Corporation

Page 10: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Summer Course at Woods Hole: Our flagship initiative led by G. Kreiman

Brains, Minds & Machines Summer Course An intensive three-week course gives advanced students a “deep” introduction to the problem of intelligence

A community of scholars is being formed

Page 11: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

The MIT Intelligence Quest

PROJECTSABOUTCONTACTFAQ

Forging connections between human and machine intelligence research, its applications, and its bearing on society.

The MIT Intelligence Quest will advance the science and engineering of both human and machine intelligence. Launched on February 1, 2018, this effort seeks to discover the foundations of human intelligence and drive the development of technological tools that can positively influence virtually every aspect of society.The Institute’s culture of collaboration …we seek to answer the deepest questions about intelligence.

The MIT Quest for Intelligence

Page 12: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Intelligence: The MIT Quest

Page 13: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Overview

• Motivations: the greatest problem in science, CBMM, the MIT Quest

• A bit of history: Neuroscience and AI, Science and Engineering

• CBMM and the Quest

• AI ethics and its neural bases

• Theory: explaining how deep networks work and what are their properties and limitations.

Page 14: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Logical for MIT…

2008 2012 - 2013 2018

“The Golden Age” 1950 - 1970

Marvin Minsky{ }Noam

ChomskyNorbert Wiener

Claude Shannon

Intelligence: The MIT Quest

Page 15: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Just a definition: I use the word science to mean natural science

Page 16: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

We aim to make progress in understanding intelligence, that is in understanding how the

brain makes the mind, how the brain works and how to build intelligent machines. We believe

that the science of intelligence will enable better engineering of intelligence.

CBMM’s focus is the natural Science and the Engineering of Intelligence

Page 17: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute
Page 18: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

�18

Two Main Recent Success Stories in AI

Page 19: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute
Page 20: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute
Page 21: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Real Engineering: Mobileye

Page 22: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Third Annual NSF Site Visit, June 8 – 9, 2016

Moore-like law for ML (1995-2018)

Page 23: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

�23

Two Main Recent Success Stories in AI

Page 24: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

DL and RL come from neuroscience

Minsky’s SNARC

RL

DL

Page 25: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

NSF Site Visit, May 15-16, 2017

Desimone & Ungerleider 1989; vanEssen+Movshon

Background: State-of-the-art Machines (“Deep Learning”) Have Emerged From the Brain’s Visual Processing Architecture

State of the Art ResNets

Brains / MindsMachines(ventral visual stream)

What’s the engineering of the future?…

Page 26: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Riesenhuber & Poggio 1999, 2000; Serre Kouh Cadieu Knoblich Kreiman & Poggio 2005; Serre Oliva Poggio 2007

“Hubel-Wiesel” models include

Hubel & Wiesel, 1959: Fukushima, 1980, Wallis & Rolls, 1997; Mel, 1997; LeCun et al 1998; Riesenhuber & Poggio, 1999; Thorpe, 2002; Ullman et al., 2002; Wersing and Koerner, 2003; Serre et al., 2007; Freeman and Simoncelli, 2011….

Convolutional networks

Page 27: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Overview

• Motivations: the greatest problem in science, CBMM, the MIT Quest

• A bit of history: Neuroscience and AI, Science and Engineering

• CBMM and the Quest

• AI ethics and its neural bases

• Theory: explaining how deep networks work and what are their properties and limitations.

Page 28: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Visual intelligence, video ergo sum

Page 29: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

T. Poggio, J. Dicarlo, M. Livingstone, S. Ullman

J. Tenenbaum, N. Kanwisher, SpelkeG, Kreiman, M. Wilson, B. Desimone

B. Katz, A. Barbu, S. Ullman, J. Tenenbaum

Running routines…

Depository of vision routines…

synthesizing routines as needed

Visual Intelligence

Page 30: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute
Page 31: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Within The CORE Intersection: CBMM + additional “moonshot” projects

• Visual Intelligence (CBMM)

• Development of Intelligence

• New circuits for deep nets in counter streams in cortical areas

• Planning and imagination

• Emotional Intelligence

• Language

Page 32: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Overview

• Motivations: the greatest problem in science, CBMM, the MIT Quest

• A bit of history: Neuroscience and AI, Science and Engineering

• CBMM and the Quest

• AI ethics and its neural bases

• Theory: explaining how deep networks work and what are their properties and limitations.

Page 33: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

AI and ethics• Too much about

- AI more dangerous than nuclear bombs - the trolley problem

• More pressing issues: - What to publish/not publish - Jobs lost to machines

• Future: - how to build ethical machines - can the brain teach us how?

Page 34: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Studies with fMRI revealed that particular areas of the brain are associated with particular cognitive events such as our moral emotions and ethical reasoning.

Neuroscience of ethics

Page 35: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Overview

• Motivations: the greatest problem in science, CBMM, the MIT Quest

• A bit of history: Neuroscience and AI, Science and Engineering

• CBMM and the Quest

• AI ethics and its neural bases

• Theory: explaining how deep networks work and what are their properties and limitations.

Page 36: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

�36

Page 37: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

�37

Page 38: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute
Page 39: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Deep nets : a theory is needed (after alchemy, chemistry)

Page 40: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

�40

Page 41: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Deep nets architecture and SGD training

Page 42: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

�42

Page 43: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Approximation theory: when and why are deep networks better - no curse of dimensionality — than shallow networks?

Optimization: what is the landscape of the empirical risk?

Generalization by SGD: how can overparametrized networks generalize?

DLNNs: three main scientific questions

Work with Hrushikeshl Mhaskar+Lorenzo Rosasco+Fabio Anselmi+Chiyuan Zhang+Qianli Liao +Sasha Rakhlin + Noah G + Xavier B

Page 44: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

When is deep better than shallow

Theorem (informal statement)

g(x) = cii=1

r

∑ < wi , x > +bi +

Suppose that a function of d variables is compositional . Both shallow and deep network can approximate f equally well. The number of parameters of the shallow network depends exponentially on d as with the dimension whereas for the deep network dance is dimension independent, i.e.

O(ε −d )O(ε −2 )

f (x1, x2,..., x8 ) = g3(g21(g11(x1, x2 ),g12 (x3, x4 ))g22 (g11(x5, x6 ),g12 (x7, x8 )))

Mhaskar, Poggio, Liao, 2016

Theory I: Why and when are deep networks better than shallow networks?

Page 45: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

When is deep better than shallow

When can the curse of dimensionality be avoided

Page 46: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

When is deep better than shallow

f (x1, x2,..., x8 ) = g3(g21(g11(x1, x2 ),g12 (x3, x4 ))g22 (g11(x5, x6 ),g12 (x7, x8 )))

Generic functions

Mhaskar, Poggio, Liao, 2016

f (x1, x2,..., x8 )

Compositional functions

Page 47: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

�47

Microstructure of compositionalitytarget function

approximating function/network

Page 48: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

When is deep better than shallow

Theorem (informal statement)

Suppose that a function of d variables is hierarchically, locally, compositional . Both shallow and deep network can approximate f equally well. The number of parameters of the shallow network depends exponentially on d as with the dimension whereas for the deep network dance is

O(ε −d )O(dε −2 )

f (x1, x2,..., x8 ) = g3(g21(g11(x1, x2 ),g12 (x3, x4 ))g22 (g11(x5, x6 ),g12 (x7, x8 )))

Mhaskar, Poggio, Liao, 2016

Hierarchically local compositionality

Page 49: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Binary Tree NN vs Shallow NN 8D

Page 50: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Locality of constituent functions is key not weight sharing: CIFAR

Page 51: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Why are compositional functions important?

Which one of these reasons:

Physics (Max Tegmark)? Neuroscience? <=== tp

Evolution?

What is special about locality of computation?

Locality in “space”? Locality in “time”?

Locality of Computation

Page 52: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

When is deep better than shallow

Theorem (informal statement)

Liao, Poggio, 2017

Theory II: What is the Landscape of the empirical risk?

Replacing the RELUs with univariate polynomial approximation, Bezout theorem implies that the system of polynomial equations corresponding to zero empirical error has a very large number of degenerate solutions. The global zero-minimizers correspond to flat minima in many dimensions (generically unlike local minima). Thus SGD is biased towards finding global minima of the empirical risk.

Page 53: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Observation (theory and experiment): deep polynomial networks show same puzzles as RELU nets

Poggio et al., 2017

Page 54: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

When is deep better than shallow

Page 55: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Bezout theorem

The set of polynomial equations above with k= degree of p(x) has a number of distinct zeros (counting points at infinity, using projective space, assigning an appropriate multiplicity to each intersection point, and excluding degenerate cases) equal to

the product of the degrees of each of the equations. As in the linear case, when the system of equations is underdetermined – as many equations as data points but more unknowns (the weights) – the theorem says that there are an infinite number of global minima, under the form of Z regions of zero empirical error.

Z = kn

p(xi )− yi = 0 for i = 1,...,n

Page 56: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

f (xi )− yi = 0 for i = 1,...,n

Global and local zeros

n equations in W unknowns with W >> n

W equations in W unknowns

Page 57: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Langevin equation

with the Boltzmann equation as asymptotic “solution”

dwdt

= −γ t∇V (w(t), z(t))+ γ t 'dB(t)

p(w) ~ 1Z= e

−V (w)T

Page 58: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

When is deep better than shallowSGD

Page 59: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

GDL selects larger volume minima

Page 60: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

GDL ~ SGD (empirically)

Page 61: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

GDL selects degenerate minima

Page 62: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Concentration because of high dimensionality

Poggio, Rakhlin, Golovitc, Zhang, Liao, 2017

Page 63: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

When is deep better than shallow

• There are may zero minimizer with overparametrized deep networks because of Bezout theorem

• SGDL finds with very high probability large volume, flat zero-minimizers; empirically SGD behaves in a similar way

• Flat minimizers correspond to degenerate zero-minimizers and thus to global minimizers;

SGDL and SGD observation: summary

Poggio, Rakhlin, Golovitc, Zhang, Liao, 2017

Page 64: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Theory III: How can underconstrained solutions generalize?

Page 65: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Classical Generalization Bounds

Page 66: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Very good generalization!

MNIST with different initializations

Page 67: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

• Approximation theorems: for compositional functions deep but not shallow networks avoid the curse of dimensionality.

• Optimization remarks: SGD finds with high probability global minima which are degenerate.

Three Theory Questions: Summary of Answers

• Generalization: The gradient dynamics of deep networks near global minima converges to minimum norm solution for each layer of weights.

Page 68: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

Musings on Near Future Breakthroughs

• new architectures/class of applications from basic DCN block (example GAN + RL/DL + …)

• new semisupervised training framework, avoiding labels: implicit labeling…predicting next “frame”…

• new basic supervised block/circuit ?

• new learning algorithm (Shim) instead of SGD …

Page 69: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

General musings

The evolution of computer science

• there were programmers

• there are now labelers

• there may be schools for bots…

Page 70: The Science and the Engineering of Intelligence...The Science and the Engineering of Intelligence Tomaso Poggio, The MIT Quest Center for Brains, Minds & Machines, McGovern Institute

The first phase (and successes) of ML: supervised learning, big data:

Today’s science, tomorrow’s engineering:learn like children learn

n→∞

The next phase of ML: implicitly supervised learning, learning like children do, small data: n→ 1

from programmers… …to labelers… …to computers that learn like children…