Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and...

68
Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute of Technology Haifa 32000, Israel The research leading to these results has been received funding from the European union's Seventh Framework Program (FP/2007-2013) ERC grant Agreement ERC-SPARSE- 320649

Transcript of Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and...

Page 1: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Sparse Modeling in

Image Processing and Deep Learning

Michael EladComputer Science Department The Technion - Israel Institute of TechnologyHaifa 32000, Israel

The research leading to these results has been received funding from the European union's Seventh Framework Program

(FP/2007-2013) ERC grant Agreement ERC-SPARSE- 320649

Page 2: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

This Lecture is About โ€ฆ

2Michael EladThe Computer-Science DepartmentThe Technion

A Proposed Theory for Deep-Learning (DL)

Explanation:

o DL has been extremely successful in solving a variety of learning problems

o DL is an empirical field, with numerous tricks and know-how, but almost no theoretical foundations

o A theory for DL has become the holy-grail of current research in Machine-Learning and related fields

Page 3: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Who Needs Theory ?

3Michael EladThe Computer-Science DepartmentThe Technion

We All Do !!

โ€ฆ because โ€ฆ A theory

o โ€ฆ could bring the next rounds of ideas to this field, breaking existing barriers and opening new opportunities

o โ€ฆ could map clearly the limitations of existing DL solutions, and point to key features that control their performance

o โ€ฆ could remove the feeling with many of us that DL is a โ€œdark magicโ€, turning it into a solid scientific discipline

o

Understanding is a good thing โ€ฆ but another goal is inventing methods. In the history of science and technology, engineering

preceded theoretical understanding: Lens & telescope Optics Steam engine Thermodynamics Airplane Aerodynamics Radio & Comm. Info. Theory Computer Computer Science

โ€œMachine learning has become alchemyโ€

Ali Rahimi: NIPS 2017

Test-of-Time Award

Yan LeCun

Page 4: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

A Theory for DL ?

4Michael EladThe Computer-Science DepartmentThe Technion

Architecture

Algorithms

Data

Rene Vidal (JHU): Explained the ability to optimize the typical non-convex objective and yet get to a global minima

Naftali Tishby (HUJI): Introduced the Information Bottleneck (IB) concept and demonstrated its relevance to deep learning

Stefano Soattoโ€™s team (UCLA): Analyzed the Stochastic Gradient Descent (SGD) algorithm, connecting it to the IB objective

Stephane Mallat (ENS) & Joan Bruna (NYU): Proposed

the scattering transform (wavelet-based) and

emphasized the treatment of invariances in the input data

Richard Baraniuk & Ankit Patel (RICE): Offered a

generative probabilistic model for the data,

showing that classic architectures relate to it

Raja Giryes (TAU): Studied the architecture of DNN in the context of their ability to give distance-preserving embedding of signals

Gitta Kutyniok (TU) & Helmut Bolcskei (ETH): Studied the ability of DNN architectures to approximate families of functions

Page 5: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

So, is there a Theory for DL ?

5Michael EladThe Computer-Science DepartmentThe Technion

The answer is tricky:

There are already various such attempts, and some of them are

truly impressive

โ€ฆ but โ€ฆ

none of them is complete

Page 6: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Interesting Observations

6Michael EladThe Computer-Science DepartmentThe Technion

o Theory origins: Signal Proc., Control Theory, Info. Theory, Harmonic Analysis, Sparse Represen., Quantum Physics, PDE, Machine learning โ€ฆ

Ron Kimmel: โ€œDL is a dark monster covered with mirrors. Everyone sees his reflection in it โ€ฆโ€

David Donoho: โ€œโ€ฆ these mirrors are taken from Cinderella's story, telling each that he is the most beautifulโ€

o Todayโ€™s talk is on our proposed theory:

โ€ฆ and yes, our theory is the bestVardan PapyanYaniv Romano Jeremias Sulam

Architecture

Algorithms

Data

Page 7: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

ML-CSC Multi-Layered Convolutional Sparse Coding

SparselandSparse

Representation Theory

Another underlying idea that accompanies us

Generative modeling of data sources enableso A systematic algorithm development, & o A theoretical analysis of their performance

CSCConvolutional

Sparse Coding

This Lecture: More Specifically

7

Sparsity-Inspired Models

Michael EladThe Computer-Science DepartmentThe Technion

Deep-Learning

Disclaimer: Being a lecture on the theory

of DL, this lecture is โ€ฆ theoretical โ€ฆ and

mathematically oriented

Page 8: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Multi-Layered Convolutional Sparse Modeling

8Michael EladThe Computer-Science DepartmentThe Technion

Our eventual goal in todayโ€™s talk is to present the โ€ฆ

So, lets use this as our running title, parse it into words,

and explain each of them

Page 9: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Multi-Layered Convolutional Sparse Modeling

9Michael EladThe Computer-Science DepartmentThe Technion

Page 10: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

3D Objects

Medical Imaging

10

Our Data is Structured

o We are surrounded by various diverse sources of massive information

o Each of these sources have an internal structure, which can be exploited

o This structure, when identified, is the engine behind the ability to process data

o How to identify structure?

Voice Signals

Stock Market Biological Signals

Videos

Text Documents

Radar Imaging

Matrix Data

Social Networks

Traffic info

Seismic DataStill Images

Michael EladThe Computer-Science DepartmentThe Technion

Page 11: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

3D Objects

Medical Imaging

11

Our Data is Structured

o We are surrounded by various diverse sources of massive information

o Each of these sources have an internal structure, which can be exploited

o This structure, when identified, is the engine behind the ability to process data

o How to identify structure?

Voice Signals

Stock Market Biological Signals

Videos

Text Documents

Radar Imaging

Matrix Data

Social Networks

Traffic info

Seismic DataStill Images

Michael EladThe Computer-Science DepartmentThe Technion

Using models

Page 12: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

12

Model?

Effective removal of noise (and many other tasks) relies on an proper modeling of the signal

Michael EladThe Computer-Science DepartmentThe Technion

Fact 1:This signal

contains AWGN โ„•(0,1)

Fact 2: The clean signal

is believed to be PWC

Page 13: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

13

Models

o A model: a mathematicaldescription of the underlying signal of interest, describing our beliefs regarding its structure

o The following is a partial list of commonly used models for images

o Good models should be simple while matching the signals

o Models are almost always imperfect

Principal-Component-Analysis

Gaussian-Mixture

Markov Random Field

Laplacian Smoothness

DCT concentration

Wavelet Sparsity

Piece-Wise-Smoothness

C2-smoothness

Besov-Spaces

Total-Variation

Beltrami-Flow

Simplicity Reliability

Michael EladThe Computer-Science DepartmentThe Technion

Page 14: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

14

What this Talk is all About?

Data Models and Their Useo Almost any task in data processing requires a model โ€“

true for denoising, deblurring, super-resolution, inpainting, compression, anomaly-detection, sampling, recognition, separation, and more

o Sparse and Redundant Representations offer a new and highly effective model โ€“ we call it

Sparselando We shall describe this and descendant versions of it that

lead all the way to โ€ฆ deep-learning

Michael EladThe Computer-Science DepartmentThe Technion

Page 15: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Multi-Layered Convolutional Sparse Modeling

15Michael EladThe Computer-Science DepartmentThe Technion

Page 16: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Machine Learning

16

MathematicsSignal

Processing

A New Emerging Model

Sparseland

Wavelet Theory

Signal Transforms

Multi-Scale Analysis

Approximation Theory

Linear Algebra

Optimization Theory

Denoising

Interpolation

PredictionCompression

Inference (solving inverse problems)

Anomaly detectionClustering

Summarizing

Sensor-FusionSource-Separation

Segmentation

Recognition

Semi-Supervised Learning

Identification

Classification

Synthesis

Michael EladThe Computer-Science DepartmentThe Technion

Page 17: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

17

The Sparseland Model

o Task: model image patches of size 8ร—8 pixels

o We assume that a dictionary of such image patches is given,

containing 256 atom images

o The Sparseland model assumption:

every image patch can be described as a linear

combination of few atoms

ฮฑ1 ฮฑ2 ฮฑ3

ฮฃ

Michael EladThe Computer-Science DepartmentThe Technion

Page 18: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

18

The Sparseland Model

o We start with a 8-by-8 pixels patch and represent it using 256 numbers

โ€“ This is a redundant representation

o However, out of those 256 elements in the representation, only 3 are non-zeros

โ€“ This is a sparse representation

o Bottom line in this case: 64 numbers representing the patch are replaced by 6 (3 for the indices of the non-zeros, and 3 for their entries)

Properties of this model:

Sparsity and Redundancyฮฑ1 ฮฑ2 ฮฑ3

ฮฃ

Michael EladThe Computer-Science DepartmentThe Technion

Page 19: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

19

Chemistry of Data

ฮฑ1 ฮฑ2 ฮฑ3

ฮฃ

o Our dictionary stands for the Periodic Table containing all the elements

o Our model follows a similar rationale: Every molecule is built of few elements

We could refer to the Sparselandmodel as the chemistry of information:

Michael EladThe Computer-Science DepartmentThe Technion

Page 20: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

20

Sparseland : A Formal Description

Mm

n

A Dictionary

o Every column in ๐ƒ(dictionary) is a prototype signal (atom)

o The vector is generated with few non-zeros at arbitrarylocations and values

A sparse vector

n

o This is a generative model that describes how (we believe) signals are created

x

ฮฑ๐ƒ

Michael EladThe Computer-Science DepartmentThe Technion

Page 21: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

21

Difficulties with Sparseland

o Problem 1: Given a signal, how can we find its atom decomposition?

o A simple example:

There are 2000 atoms in the dictionary

The signal is known to be built of 15 atoms

possibilities

If each of these takes 1nano-sec to test, this will take ~7.5e20 years to finish !!!!!!

o So, are we stuck?

ฮฑ1 ฮฑ2 ฮฑ3

ฮฃ

20002.4e 37

15

Michael EladThe Computer-Science DepartmentThe Technion

Page 22: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

22

Atom Decomposition Made Formal

Greedy methods

Thresholding/OMP

Relaxation methods

Basis-Pursuit

L0 โ€“ counting number of non-zeros in the vector

This is a projection onto

the Sparseland model

These problems are known to be NP-Hard problem

Approximation Algorithms

minฮฑ ฮฑ 0 s. t. ๐ƒฮฑ โˆ’ y 2 โ‰ค ฮต

minฮฑ ฮฑ 0 s. t. x = ๐ƒฮฑ

m

n ๐ƒ

xฮฑ

Michael EladThe Computer-Science DepartmentThe Technion

Page 23: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

23

Pursuit Algorithms

Michael EladThe Computer-Science DepartmentThe Technion

Basis Pursuit

Change the L0 into L1

and then the problem becomes convex and manageable

Matching Pursuit

Find the support greedily,

one element at a time

Thresholding

Multiply y by ๐ƒ๐“

and apply shrinkage:

เทฮฑ = ๐’ซ๐›ฝ ๐ƒ๐“y

minฮฑ ฮฑ 0 s. t. ๐ƒฮฑ โˆ’ y 2 โ‰ค ฮต

Approximation Algorithms

minฮฑ ฮฑ 1

s. t.๐ƒฮฑ โˆ’ y 2 โ‰ค ฮต

Page 24: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

ฮฑ1 ฮฑ2 ฮฑ3

ฮฃ

24

Difficulties with Sparselando There are various pursuit algorithms

o Here is an example using the Basis Pursuit (L1):

o Surprising fact: Many of these algorithms are often accompanied by theoretical guarantees for their success, if the unknown is sparse enough

Michael EladThe Computer-Science DepartmentThe Technion

Page 25: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

25

The Mutual Coherence

o The Mutual Coherence ฮผ ๐ƒ is the largest off-diagonal entry in absolute value

o We will pose all the theoretical results in this talk using this property, due to its simplicity

o You may have heard of other ways to characterize the dictionary (Restricted Isometry Property - RIP, Exact Recovery Condition - ERC, Babel function, Spark, โ€ฆ)

=o Compute

Assume normalized

columns

๐ƒ

๐ƒT

๐ƒT๐ƒ

Michael EladThe Computer-Science DepartmentThe Technion

Page 26: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

26

Basis-Pursuit Success

Comments: o If =0 เทฮฑ = ฮฑo This is a worst-case

analysis โ€“ better bounds exist

o Similar theorems exist for many other pursuit algorithms

Theorem: Given a noisy signal y = ๐ƒฮฑ + v where v 2 โ‰ค ฮตand ฮฑ is sufficiently sparse,

then Basis-Pursuit: minฮฑ ฮฑ 1 s. t. ๐ƒฮฑ โˆ’ y 2 โ‰ค ฮต

leads to a stable result: เทฮฑ โˆ’ ฮฑ 22 โ‰ค

4๐œ€2

1โˆ’ฮผ 4 ฮฑ 0โˆ’1

Michael EladThe Computer-Science DepartmentThe Technion

Donoho, Elad & Temlyakov (โ€˜06)

เทฮฑ

ฮฑ 0 <1

41 +

1

ฮผ

M

x

ฮฑ

๐ƒ+

y

v 2 โ‰ค ฮต

minฮฑ ฮฑ 1

s. t.๐ƒฮฑ โˆ’ y 2 โ‰ค ฮต

minฮฑ ฮฑ 0

s. t.๐ƒฮฑ โˆ’ y 2 โ‰ค ฮต

Page 27: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

27

Difficulties with Sparseland

ฮฑ1 ฮฑ2 ฮฑ3

ฮฃo Problem 2: Given a family of signals, how do

we find the dictionary to represent it well?

o Solution: Learn! Gather a large set of signals (many thousands), and find the dictionary that sparsifies them

o Such algorithms were developed in the past 10 years (e.g., K-SVD), and their performance is surprisingly good

o We will not discuss this matter further in this talk due to lack of time

Michael EladThe Computer-Science DepartmentThe Technion

Page 28: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

28

Difficulties with Sparseland

ฮฑ1 ฮฑ2 ฮฑ3

ฮฃo Problem 3: Why is this model suitable to

describe various sources? e.g., Is it goodfor images? Audio? Stocks? โ€ฆ

o General answer: Yes, this model is extremely effective in representing various sources

Theoretical answer: Clear connection to other models

Empirical answer: In a large variety of signal and image processing (and later machine learning), this model has been shown to lead to state-of-the-art results

Michael EladThe Computer-Science DepartmentThe Technion

Page 29: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

29

Difficulties with Sparseland ?

o Problem 1: Given an image patch, how can we find its atom decomposition ?

o Problem 2: Given a family of signals, how do we find the dictionary to represent it well?

o Problem 3: Is this model flexible enough to describe various sources? E.g., Is it good for images? audio? โ€ฆ

ฮฑ1 ฮฑ2 ฮฑ3

ฮฃ

Michael EladThe Computer-Science DepartmentThe Technion

Page 30: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

30

Difficulties with Sparseland ?

o Problem 1: Given an image patch, how can we find its atom decomposition ?

o Problem 2: Given a family of signals, how do we find the dictionary to represent it well?

o Problem 3: Is this model flexible enough to describe various sources? E.g., Is it good for images? audio? โ€ฆ

ฮฑ1 ฮฑ2 ฮฑ3

ฮฃ

Michael EladThe Computer-Science DepartmentThe Technion

Page 31: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

31

o Sparseland has a great success in signal &

image processing and machine learning tasks

o In the past 8-9 years, many books were published on this and closely related fields

Michael EladThe Computer-Science DepartmentThe Technion

This Field has been rapidly GROWING โ€ฆ

Page 32: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

32Michael EladThe Computer-Science DepartmentThe Technion

Page 33: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

33

A New Massive Open Online Course

Michael EladThe Computer-Science DepartmentThe Technion

Page 34: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

34

o When handling images, Sparseland is typically deployed on small

overlapping patches due to the desire to train the model to fit the data better

o The model assumption is: each patch in the image is believed to have a sparse representation w.r.t. a common local dictionary

o What is the corresponding global model? This brings us to โ€ฆ the Convolutional Sparse Coding (CSC)

Michael EladThe Computer-Science DepartmentThe Technion

Sparseland for Image Processing

Page 35: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Multi-Layered ConvolutionalSparse Modeling

35Michael EladThe Computer-Science DepartmentThe Technion

Joint work with

Vardan PapyanYaniv Romano Jeremias Sulam

Page 36: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Convolutional Sparse Coding (CSC)

[๐—] =

i=1

๐‘š

di โˆ— [ฮ“i]

๐‘š filters convolved with their sparse representations

An image with ๐‘pixels

i-th feature-map: An image of the same size as ๐— holding the sparse representation related to the i-filter

The i-th filter of small size ๐‘›

29

This model emerged in 2005-2010, developed and advocated by Yan LeCun and others. It serves as the foundation of Convolutional Neural Networks

Page 37: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

oHere is an alternative global sparsity-based model formulation

o๐‚i โˆˆ โ„๐‘ร—๐‘ is a banded and Circulantmatrix containing a single atom with all of its shifts

o๐šชi โˆˆ โ„๐‘ are the corresponding coefficients ordered as column vectors

๐— =

i=1

๐‘š

๐‚i๐šชi

CSC in Matrix Form

๐‘›

๐‘

๐‚i =

=๐‚1 โ‹ฏ ๐‚๐‘š ๐šช1

โ‹ฎ๐šช๐‘š

= ๐ƒ๐šช

30

Page 38: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

The CSC Dictionary

๐‚1 ๐‚2 ๐‚3 =

๐ƒ =๐‘›

๐ƒL

๐‘š

31

Page 39: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

=

๐‘i๐— = ๐›€๐›„i

๐‘›

(2๐‘› โˆ’ 1)๐‘š

๐‘i๐—

๐›„i

Why CSC?

๐— = ๐ƒ๐šชstripe-dictionary stripe vector

32

Page 40: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

=

๐‘i๐— = ๐›€๐›„i

๐‘›

(2๐‘› โˆ’ 1)๐‘š

๐‘i+1๐—

๐›„i+1

Why CSC?

๐— = ๐ƒ๐šชstripe-dictionary

Every patch has a sparse representation w.r.t. to the

same local dictionary (๐›€) just as assumed for images

stripe vector

32

๐‘i+1๐— = ๐›€๐›„i+1

Page 41: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Classical Sparse Theory for CSC ?

Theorem: BP is guaranteed to โ€œsucceedโ€ โ€ฆ. if ๐šช ๐ŸŽ <๐Ÿ

๐Ÿ’๐Ÿ +

๐Ÿ

๐›

min๐šช

๐šช 0 s. t. ๐˜ โˆ’ ๐ƒ๐šช 2 โ‰ค ฮต

oAssuming that ๐‘š = 2 and ๐‘› = 64 we have that [Welch, โ€™74]

ฮผ โ‰ฅ 0.063

o Success of pursuits is guaranteed as long as

๐šช 0 <1

41 +

1

ฮผ(๐ƒ)โ‰ค

1

21 +

1

0.063โ‰ˆ 4.2

oOnly few (4) non-zeros GLOBALLY are allowed!!! This is a very pessimistic result!

33

Page 42: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Classical Sparse Theory for CSC ?

Theorem: BP is guaranteed to โ€œsucceedโ€ โ€ฆ. if ๐šช ๐ŸŽ <๐Ÿ

๐Ÿ’๐Ÿ +

๐Ÿ

๐›

min๐šช

๐šช 0 s. t. ๐˜ โˆ’ ๐ƒ๐šช 2 โ‰ค ฮต

oAssuming that ๐‘š = 2 and ๐‘› = 64 we have that [Welch, โ€™74]

ฮผ โ‰ฅ 0.063

o Success of pursuits is guaranteed as long as

๐šช 0 <1

41 +

1

ฮผ(๐ƒ)โ‰ค

1

21 +

1

0.063โ‰ˆ 4.2

oOnly few (4) non-zeros GLOBALLY are allowed!!! This is a very pessimistic result!

33

Page 43: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

The main question we aim to address is this:

Can we generalize the vast theory of Sparseland to this new notion of local sparsity? For example, could we provide guarantees for success for pursuit algorithms?

๐‘š = 2

Moving to Local Sparsity: Stripes

min๐šช

๐šช 0,โˆžs s. t. ๐˜ โˆ’ ๐ƒ๐šช 2 โ‰ค ฮต

โ„“0,โˆž Norm: ๐šช 0,โˆžs = max

i๐›„i 0

๐šช 0,โˆžs is low all ๐›„i are sparse every

patch has a sparse representation over ๐›€

34

๐›„i๐›„i+1

๐šช

Page 44: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Success of the Basis Pursuit

36

Theorem: For Y = Dฮ“ + E, if ฮป = 4 E 2,โˆžp

, if

๐šช ๐ŸŽ,โˆž๐ฌ <

๐Ÿ

๐Ÿ‘๐Ÿ +

๐Ÿ

๐› ๐ƒ

then Basis Pursuit performs very-well:

1. The support of ฮ“BP is contained in that of ฮ“

2. ฮ“BP โˆ’ ฮ“ โˆž โ‰ค 7.5 E 2,โˆžp

3. Every entry greater than 7.5 E 2,โˆžp

is found

4. ฮ“BP is unique

ฮ“BP = minฮ“

1

2Y โˆ’ Dฮ“ 2

2 + ฮป ฮ“ 1

Papyan, Sulam & Elad (โ€˜17)

This is a much better result โ€“ it allows few

non-zeros locally in each stripe, implying

a permitted O ๐‘non-zeros globally

Page 45: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Multi-Layered ConvolutionalSparse Modeling

40Michael EladThe Computer-Science DepartmentThe Technion

Page 46: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

From CSC to Multi-Layered CSC๐— โˆˆ โ„๐‘

๐‘š1

๐‘›0

๐ƒ1 โˆˆ โ„๐‘ร—๐‘๐‘š1

๐‘›1๐‘š1

๐‘š2

๐ƒ2 โˆˆ โ„๐‘๐‘š1ร—๐‘๐‘š2

๐‘š1

๐šช1 โˆˆ โ„๐‘๐‘š1

๐šช1 โˆˆ โ„๐‘๐‘š1

๐šช2 โˆˆ โ„๐‘๐‘š2

Convolutional sparsity (CSC) assumes an

inherent structure is present in natural

signals

We propose to impose the same structure on the

representations themselves

Multi-Layer CSC (ML-CSC)

45

Page 47: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Intuition: From Atoms to Molecules๐— โˆˆ โ„๐‘ ๐ƒ1 โˆˆ โ„๐‘ร—๐‘๐‘š1 ๐ƒ2 โˆˆ โ„๐‘๐‘š1ร—๐‘๐‘š2

๐šช1 โˆˆ โ„๐‘๐‘š1

๐šช2 โˆˆ โ„๐‘๐‘š2

o We can chain the all the dictionaries into one effective dictionary๐ƒeff = ๐ƒ1๐ƒ2๐ƒ3 โˆ™โˆ™โˆ™ ๐ƒK ๐ฑ = ๐ƒeff ๐šชK

o This is a special Sparseland (indeed, a CSC) model

o However:

A key property in this model: sparsity of the intermediate representations

The effective atoms: atoms molecules cells tissue body-parts โ€ฆ

๐šช1 โˆˆ โ„๐‘๐‘š1

46

Page 48: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

A Small Taste: Model Training (MNIST)

๐ƒ1๐ƒ2๐ƒ3 (28ร—28)

MNIST Dictionary:โ€ขD1: 32 filters of size 7ร—7, with stride of 2 (dense)โ€ขD2: 128 filters of size 5ร—5ร—32 with stride of 1 - 99.09 % sparseโ€ขD3: 1024 filters of size 7ร—7ร—128 โ€“ 99.89 % sparse

๐ƒ1๐ƒ2 (15ร—15)

๐ƒ1 (7ร—7)

47

Page 49: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

A Small Taste: Model Training (MNIST)

๐ƒ1๐ƒ2๐ƒ3 (28ร—28)

MNIST Dictionary:โ€ขD1: 32 filters of size 7ร—7, with stride of 2 (dense)โ€ขD2: 128 filters of size 5ร—5ร—32 with stride of 1 - 99.09 % sparseโ€ขD3: 1024 filters of size 7ร—7ร—128 โ€“ 99.89 % sparse

๐ƒ1๐ƒ2 (15ร—15)

๐ƒ1 (7ร—7)

47

Page 50: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

ML-CSC: Pursuit

o Deepโ€“Coding Problem ๐ƒ๐‚๐ฮป (dictionaries are known):

Find ๐šชj j=1

K๐‘ . ๐‘ก.

๐— = ๐ƒ1๐šช1 ๐šช1 0,โˆžs โ‰ค ฮป1

๐šช1 = ๐ƒ2๐šช2 ๐šช2 0,โˆžs โ‰ค ฮป2

โ‹ฎ โ‹ฎ๐šชKโˆ’1 = ๐ƒK๐šชK ๐šชK 0,โˆž

s โ‰ค ฮปK

o Or, more realistically for noisy signals,

Find ๐šชj j=1

K๐‘ . ๐‘ก.

๐˜ โˆ’ ๐ƒ1๐šช1 2 โ‰ค โ„ฐ ๐šช1 0,โˆžs โ‰ค ฮป1

๐šช1 = ๐ƒ2๐šช2 ๐šช2 0,โˆžs โ‰ค ฮป2

โ‹ฎ โ‹ฎ๐šชKโˆ’1 = ๐ƒK๐šชK ๐šชK 0,โˆž

s โ‰ค ฮปK

48

Page 51: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

A Small Taste: Pursuit

ฮ“1

ฮ“2

ฮ“3

ฮ“0

Y

99.51% sparse(5 nnz)

99.52% sparse(30 nnz)

94.51 % sparse(213 nnz)

49

x=๐ƒ1ฮ“1

x=๐ƒ1๐ƒ2ฮ“2

x=๐ƒ1๐ƒ2๐ƒ3ฮ“3

x

Page 52: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

The simplest pursuit algorithm (single-layer case) is the THR algorithm, which operates on a given input signal ๐˜ by:

๐šช = ๐’ซ๐›ฝ ๐ƒT๐˜

ML-CSC: The Simplest Pursuit

๐˜ = ๐ƒ๐šช + ๐„and ๐šช is sparse

50

Page 53: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

The layered (soft nonnegative) thresholding and the CNN forward pass

algorithm are the very same thing !!!

o Layered thresholding (LT):

oNow letโ€™s take a look at how Conv. Neural Network operates:

๐šช2 = ๐’ซฮฒ2 ๐ƒ2T ๐’ซฮฒ1 ๐ƒ1

T๐˜๐šช2 = ๐’ซฮฒ2 ๐ƒ2T ๐’ซฮฒ1 ๐ƒ1

T๐˜๐šช2 = ๐’ซฮฒ2 ๐ƒ2T ๐’ซฮฒ1 ๐ƒ1

T๐˜

Consider this for Solving the DCP

Estimate ๐šช1 via the THR algorithm

Estimate ๐šช2 via the THR algorithm

๐ƒ๐‚๐ฮปโ„ฐ : Find ๐šชj j=1

K๐‘ . ๐‘ก.

๐˜ โˆ’ ๐ƒ1๐šช1 2 โ‰ค โ„ฐ ๐šช1 0,โˆžs โ‰ค ฮป1

๐šช1 = ๐ƒ2๐šช2 ๐šช2 0,โˆžs โ‰ค ฮป2

โ‹ฎ โ‹ฎ๐šชKโˆ’1 = ๐ƒK๐šชK ๐šชK 0,โˆž

s โ‰ค ฮปK

๐‘“ ๐˜ = ReLU ๐›2 +๐–2T ReLU ๐›1 +๐–1

T๐˜

51

Page 54: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

๐˜

Theoretical Path

M A๐šชi i=1

K๐— = ๐ƒ1๐šช1๐šช1 = ๐ƒ2๐šช2

โ‹ฎ๐šชKโˆ’1 = ๐ƒK๐šชK

๐šชi is ๐‹0,โˆž sparse

๐ƒ๐‚๐ฮปโ„ฐ

Layered THR(Forward Pass)

Maybe other?

๐—

Armed with this view of a generative source model, we may ask new and daring theoretical questions

52

Page 55: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Success of the Layered-THR

Theorem: If ๐šชi 0,โˆžs <

1

21 +

1

ฮผ ๐ƒiโ‹…๐šชimin

๐šชimax โˆ’

1

ฮผ ๐ƒiโ‹…

ฮตLiโˆ’1

๐šชimax

then the Layered Hard THR (with the proper thresholds)

finds the correct supports and ๐šชiLT โˆ’ ๐šชi 2,โˆž

pโ‰ค ฮตL

i , where

we have defined ฮตL0 = ๐„ 2,โˆž

pand

ฮตLi = ๐šชi 0,โˆž

pโ‹… ฮตL

iโˆ’1 + ฮผ ๐ƒi ๐šชi 0,โˆžs โˆ’ 1 ๐šชi

max

The stability of the forward pass is guaranteed if the underlying representations are locally

sparse and the noise is locally bounded

Problems: 1. Contrast2. Error growth3. Error even if no noise

54

Papyan, Romano & Elad (โ€˜17)

Page 56: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Layered Basis Pursuit (BP)

๐šช1LBP = min

๐šช1

1

2๐˜ โˆ’ ๐ƒ1๐šช1 2

2 + ฮป1 ๐šช1 1

๐šช2LBP = min

๐šช2

1

2๐šช1LBP โˆ’ ๐ƒ2๐šช2 2

2+ ฮป2 ๐šช2 1

๐ƒ๐‚๐ฮปโ„ฐ : Find ๐šชj j=1

K๐‘ . ๐‘ก.

๐˜ โˆ’ ๐ƒ1๐šช1 2 โ‰ค โ„ฐ ๐šช1 0,โˆžs โ‰ค ฮป1

๐šช1 = ๐ƒ2๐šช2 ๐šช2 0,โˆžs โ‰ค ฮป2

โ‹ฎ โ‹ฎ๐šชKโˆ’1 = ๐ƒK๐šชK ๐šชK 0,โˆž

s โ‰ค ฮปK

oWe chose the Thresholding algorithm due to its simplicity, but we do know that there are better pursuit methods โ€“ how about using them?

o Lets use the Basis Pursuit instead โ€ฆ

Deconvolutional networks[Zeiler, Krishnan, Taylor & Fergus โ€˜10]

55

Page 57: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Success of the Layered BP

Theorem: Assuming that ๐šช๐ข ๐ŸŽ,โˆž๐ฌ <

๐Ÿ

๐Ÿ‘๐Ÿ +

๐Ÿ

๐› ๐ƒ๐ข

then the Layered Basis Pursuit performs very well:

1. The support of ๐šชiLBP is contained in that of ๐šชi

2. The error is bounded: ๐šชiLBP โˆ’ ๐šชi 2,โˆž

pโ‰ค ฮตL

i , where

ฮตLi = 7.5i ๐„ 2,โˆž

p ฯ‚j=1i ๐šชj 0,โˆž

p

3. Every entry in ๐šชi greater than

ฮตLi / ๐šชi 0,โˆž

pwill be found

56

Papyan, Romano & Elad (โ€˜17)

Problems: 1. Contrast2. Error growth3. Error even if no noise

Page 58: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Layered Iterative Soft-Thresholding:

๐šชjt = ๐’ฎฮพj/cj ๐šชj

tโˆ’1 +๐ƒjT ๐šชjโˆ’1 โˆ’๐ƒj๐šชj

tโˆ’1

Layered Iterative Thresholding

Layered BP: ๐šชjLBP = min

๐šชj

1

2๐šชjโˆ’1LBP โˆ’ ๐ƒj๐šชj 2

2+ ฮพj ๐šชj 1

Can be seen as a very deep recurrent neural network

[Gregor & LeCun โ€˜10]

t

j

j

Note that our suggestion implies that groups of layers share the same dictionaries

57

Page 59: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Where are the Labels?

M๐— = ๐ƒ1๐šช1๐šช1 = ๐ƒ2๐šช2

โ‹ฎ๐šชKโˆ’1 = ๐ƒK๐šชK

๐šชi is ๐‹0,โˆž sparse

๐—

52

L ๐—

Answer 1:

o We do not need labels because everything we show refer to the unsupervised case, in which we operate on signals, not necessarily in the context of recognition

We presented the ML-CSC as a machine that produces signals X

Page 60: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Where are the Labels?

M๐— = ๐ƒ1๐šช1๐šช1 = ๐ƒ2๐šช2

โ‹ฎ๐šชKโˆ’1 = ๐ƒK๐šชK

๐šชi is ๐‹0,โˆž sparse

๐—

Answer 2:

o In fact, this model could be augmented by a synthesis of the corresponding label by:

L ๐— = ๐‘ ๐‘–๐‘”๐‘› c + ฯƒj=1K wj

Tฮ“j

o This assumes that knowing the representations (or maybe their supports?) suffice for identifying the label

o Thus, a successful pursuit algorithm can lead to an accurate recognition if the network is augmented by a FC classification layer

52

L ๐—

We presented the ML-CSC as a machine that produces signals X

Page 61: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

What About Learning?

All these models rely on proper Dictionary Learning Algorithms to fulfil their mission:

Sparseland: We have unsupervised and supervised such algorithms, and a beginning of theory to explain how these work

CSC: We have few and only unsupervised methods, and even these are not fully stable/clear

ML-CSC: One algorithm has been proposed (unsupervised) โ€“ see ArxiV

52

ML-CSC Multi-Layered Convolutional Sparse Coding

SparselandSparse

Representation Theory

CSCConvolutional

Sparse Coding

Page 62: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

Time to Conclude

58

Page 63: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

This Talk Sparseland The desire to model data

We spoke about the importance of models in signal/image

processing and described Sparseland in details

59

Page 64: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

This Talk Sparseland The desire to model data

Novel View of Convolutional Sparse Coding

We presented a theoretical study of the CSC model and how to operate locally while getting global optimality

59

Page 65: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

This Talk

Multi-Layer Convolutional Sparse Coding

Sparseland The desire to model data

Novel View of Convolutional Sparse Coding

We propose a multi-layer extension of CSC, shown to be tightly connected to CNN

59

Page 66: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

This Talk

A novel interpretation and theoretical

understanding of CNN

Multi-Layer Convolutional Sparse Coding

Sparseland The desire to model data

Novel View of Convolutional Sparse Coding

The ML-CSC was shown to enable a theoretical study of CNN, along with new insights

59

Page 67: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

This Talk

A novel interpretation and theoretical

understanding of CNN

Multi-Layer Convolutional Sparse Coding

Sparseland The desire to model data

Novel View of Convolutional Sparse Coding

Take Home Message 1: Generative modeling of data

sources enables algorithm development along with theoretically analyzing

algorithmsโ€™ performance

59

Take Home Message 2: The Multi-Layer

Convolutional Sparse Coding model could be

a new platform for understanding and developing deep-learning solutions

Page 68: Sparse Representations and the Basis Pursuit Algorithm...Sparse Modeling in Image Processing and Deep Learning Michael Elad Computer Science Department The Technion - Israel Institute

Michael EladThe Computer-Science DepartmentThe Technion

More on these (including these slides and the relevant papers) can be found in http://www.cs.technion.ac.il/~elad

Questions?