
Building high-level features using large-scale unsupervised learning
Anh Nguyen, Bay-yuan Hsu
CS290D – Data Mining (Spring 2014)
University of California, Santa Barbara
Slides adapted from Andrew Ng (Stanford) and Nando de Freitas (UBC)

Agenda

1. Motivation
2. Approach
   1. Sparse Deep Auto-encoder
   2. Local Receptive Field
   3. L2 Pooling
   4. Local Contrast Normalization
   5. Overall Model
3. Parallelism
4. Evaluation
5. Discussion

1. MOTIVATION


Motivation

• Feature learning
• Supervised learning
  • Needs a large amount of labeled data
• Unsupervised learning
  • Example: build a face detector without having any labeled face images
• Building high-level features using unlabeled data

Motivation

• Previous work
  • Auto-encoders
  • Sparse coding
• Result: only low-level features are learned
• Reason: computational constraints
• Approach of this work: scale up the dataset, the model, and the computational resources

2.  APPROACH  


Sparse Deep Auto-encoder

• Auto-encoder
  • A neural network trained to reconstruct its input
  • Unsupervised learning
  • Trained with back-propagation
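As a concrete illustration of the bullets above, here is a minimal numpy sketch of a single-hidden-layer auto-encoder trained with back-propagation to reconstruct unlabeled inputs. It is not the paper's implementation; the layer sizes, learning rate, and random data are made up for the example.

```python
# Minimal auto-encoder sketch: encode with W1, decode with W2, and update both
# by back-propagating the squared reconstruction error (no labels involved).
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((500, 64))                 # 500 unlabeled examples, 64 inputs each

n_in, n_hidden, lr = 64, 16, 0.01
W1 = 0.1 * rng.standard_normal((n_in, n_hidden))   # encoder weights
W2 = 0.1 * rng.standard_normal((n_hidden, n_in))   # decoder weights

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

for epoch in range(200):
    H = sigmoid(X @ W1)                            # hidden code
    X_hat = H @ W2                                 # reconstruction (linear output)
    err = X_hat - X
    grad_W2 = H.T @ err / len(X)                   # back-propagation
    grad_W1 = X.T @ ((err @ W2.T) * H * (1 - H)) / len(X)
    W2 -= lr * grad_W2
    W1 -= lr * grad_W1

print("mean squared reconstruction error:", np.mean(err ** 2))
```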


Sparse Deep Auto-encoder (cnt'd)

• Sparse coding
  • Input: images x(1), x(2), ..., x(m)
  • Learn: bases (features) f1, f2, ..., fk so that each input x can be approximately decomposed as x = Σj aj fj, where the aj are mostly zero ("sparse")
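A toy numpy sketch of this decomposition: given fixed bases, a mostly-zero coefficient vector for one input is found with a few iterations of soft-thresholded gradient descent (ISTA). The sizes, λ, and step size here are made up; in the actual method the bases themselves are also learned.

```python
# Toy sparse coding: find a sparse a such that x ≈ sum_j a_j * f_j.
import numpy as np

rng = np.random.default_rng(0)
k, d = 20, 16
F = rng.standard_normal((k, d))          # bases f1..fk as rows
x = 0.9 * F[3] - 0.5 * F[11]             # an input built from two bases

lam = 0.1                                # sparsity penalty
step = 1.0 / np.linalg.norm(F @ F.T, 2)  # a safe gradient step size
a = np.zeros(k)
for _ in range(200):
    grad = (a @ F - x) @ F.T             # gradient of 0.5 * ||x - a @ F||^2
    a = a - step * grad
    a = np.sign(a) * np.maximum(np.abs(a) - step * lam, 0.0)   # soft threshold

# the largest coefficients should correspond to the bases used to build x
print("largest coefficients at indices:", np.argsort(-np.abs(a))[:2])
```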


Sparse Deep Auto-encoder (cnt'd)

• Sparse coding
  • Sparsity is obtained by adding a regularizer on the coefficients to the reconstruction objective
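The objective itself appeared only as a figure on the original slide. A standard sparse-coding formulation of "reconstruction error plus sparsity regularizer" (not necessarily the exact one shown) is:

```latex
\min_{\{f_j\},\,\{a^{(i)}\}} \;
\sum_{i=1}^{m} \Big\| x^{(i)} - \sum_{j=1}^{k} a_j^{(i)} f_j \Big\|_2^2
\;+\; \lambda \sum_{i=1}^{m} \sum_{j=1}^{k} \big| a_j^{(i)} \big|
```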


Sparse Deep Auto-encoder (cnt'd)

• Sparse deep auto-encoder
  • Stacks multiple hidden layers, each designed to give the learned features a particular characteristic

Local Receptive Field

• Definition: each feature in the auto-encoder connects only to a small region of the layer below
• Goals
  • Learn features efficiently
  • Enable parallelism
• In effect, each feature is trained on small image patches
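A toy sketch of the idea, assuming an 18x18 receptive field on a 200x200 frame (the sizes used later in the overall model); the filter weights are random placeholders:

```python
# Local receptive field: a feature connects only to one small patch of the
# image instead of to every pixel.
import numpy as np

rng = np.random.default_rng(0)
image = rng.standard_normal((200, 200))       # one 200x200 input frame
patch_size = 18

row, col = 40, 60                             # top-left corner of this feature's field
patch = image[row:row + patch_size, col:col + patch_size]

w = rng.standard_normal((patch_size, patch_size))   # weights only for this patch
feature = np.sum(w * patch)                   # the feature ignores the rest of the image
print("feature value:", feature)
```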


L2 Pooling

• Goal: robustness to small local distortions
• Approach: group similar features together to achieve invariance
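A minimal sketch of what an L2 pooling unit computes: the square root of the sum of squares of the simple-unit outputs in its group, which barely changes when the pattern shifts within the group. The numbers below are made up.

```python
# L2 pooling over one group of simple-unit outputs.
import numpy as np

simple_outputs = np.array([0.0, 0.9, 0.1, 0.0])       # one pooling group
shifted        = np.array([0.0, 0.1, 0.9, 0.0])       # same pattern, shifted by one unit

def l2_pool(h):
    return np.sqrt(np.sum(h ** 2))

print(l2_pool(simple_outputs), l2_pool(shifted))      # both ≈ 0.91: invariant to the shift
```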


Local Contrast Normalization

• Goal: robustness to variation in light intensity
• Approach: normalize the local contrast (subtract the local mean, then divide by the local standard deviation)
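A toy numpy sketch of local contrast normalization on a single 5x5 neighbourhood (the neighbourhood size used in the overall model). The paper's version uses a weighted Gaussian neighbourhood, so this only shows the basic idea.

```python
# Local contrast normalization: subtract the local mean, divide by the local
# standard deviation, so the output ignores the overall light level.
import numpy as np

rng = np.random.default_rng(0)
patch = rng.standard_normal((5, 5))
brighter = patch + 10.0                     # same patch under stronger lighting

def lcn(p, eps=1e-6):
    centered = p - p.mean()
    return centered / (p.std() + eps)

print(np.allclose(lcn(patch), lcn(brighter)))   # True: the lighting change is removed
```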


Overall Model

• Three stacked layers, each consisting of three sub-layers:
  • Simple (local filtering): 18x18 px receptive fields, 8 neurons (feature maps) per patch
  • Complex (L2 pooling): 5x5 px
  • LCN: 5x5 px


Overall Model

• Training: each layer is trained to reconstruct the input it receives
• Optimization function: reconstruction error plus a sparsity term on the pooled responses (see below)
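The optimization function was shown as a figure on the original slide. In the referenced paper (Quoc Le et al. [1]), each layer is trained by minimizing, over its encoding weights W_e and decoding weights W_d, an objective along the lines of:

```latex
\min_{W_d,\,W_e} \;
\sum_{i=1}^{m} \left(
  \big\| W_d W_e x^{(i)} - x^{(i)} \big\|_2^2
  \;+\; \lambda \sum_{j=1}^{k} \sqrt{\epsilon + H_j \big( W_e x^{(i)} \big)^2}
\right)
```

The first term is the reconstruction error; the second sums the L2-pooled responses over the k pooling units (H_j selects pooling group j), which encourages sparse, invariant features.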


Overall Model

• Complex model?

3.  PARALLELISM  


Asynchronous SGD

• Two recent lines of research in speeding up large learning problems:
  • Parallel/distributed computing
  • Online (and mini-batch) learning algorithms: stochastic gradient descent, perceptron, MIRA, stepwise EM
• How can we bring together the benefits of parallel computing and online learning?

Asynchronous SGD

• SGD: Stochastic Gradient Descent
  • Choose an initial parameter vector W and a learning rate α
  • Repeat until an approximate minimum is obtained:
    • Randomly shuffle the examples in the training set
    • For each example i, update W := W − α ∇Qi(W), where Qi is the loss on example i
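A minimal sketch of this loop for a least-squares objective (the model, data, and learning rate are made up for illustration):

```python
# Minimal stochastic gradient descent: shuffle the examples each pass and
# update the parameters after every single example.
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((200, 5))
true_w = np.array([1.0, -2.0, 0.5, 0.0, 3.0])
y = X @ true_w + 0.01 * rng.standard_normal(200)

w = np.zeros(5)                       # initial parameter vector W
alpha = 0.05                          # learning rate

for epoch in range(20):
    order = rng.permutation(len(X))   # randomly shuffle the training set
    for i in order:
        grad = (X[i] @ w - y[i]) * X[i]   # gradient of 0.5 * (x_i·w - y_i)^2
        w -= alpha * grad

print("learned parameters:", np.round(w, 2))
```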

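The asynchronous part means many workers run this SGD loop in parallel against a shared set of parameters, reading and writing without waiting for each other. Below is a toy, single-machine sketch of that pattern using threads; it illustrates the idea only and is not the distributed parameter-server system used in the paper.

```python
# Toy asynchronous SGD: several workers update one shared parameter vector
# without synchronizing with each other (a single-process stand-in for the
# distributed setting).
import numpy as np
import threading

rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 5))
true_w = np.array([1.0, -2.0, 0.5, 0.0, 3.0])
y = X @ true_w

w = np.zeros(5)                 # shared parameters, updated by all workers
alpha = 0.02

def worker(seed):
    local_rng = np.random.default_rng(seed)
    for _ in range(2000):
        i = local_rng.integers(len(X))
        grad = (X[i] @ w - y[i]) * X[i]   # gradient w.r.t. the current shared w
        w[:] = w - alpha * grad           # write back without locking

threads = [threading.Thread(target=worker, args=(s,)) for s in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print("learned parameters:", np.round(w, 2))
```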

Model Parallelism

• The weights are partitioned according to the locality of the image and stored on different machines
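A toy sketch of that partitioning: the first-layer weights are split by image region, so each "machine" (here just an entry in a list) holds only the filters for its own block of the image and computes its local features independently.

```python
# Toy model parallelism: split a 200x200 image into four 100x100 blocks and
# give each "machine" only the weights for its own block.
import numpy as np

rng = np.random.default_rng(0)
image = rng.standard_normal((200, 200))
block = 100

machines = []                     # one weight matrix per image region
for r in (0, block):
    for c in (0, block):
        machines.append({"row": r, "col": c,
                         "weights": rng.standard_normal((block, block))})

local_features = [np.sum(m["weights"] *
                         image[m["row"]:m["row"] + block,
                               m["col"]:m["col"] + block])
                  for m in machines]
print("one feature per machine:", np.round(local_features, 2))
```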


4. EVALUATION


Evaluation

• Training data: 10M unlabeled YouTube frames of size 200x200 pixels
• 1B parameters
• 1,000 machines
• 16,000 cores

Experiment on Faces

• Test set
  • 37,000 images in total
  • 13,026 of them are face images
• Evaluation metric: accuracy of the best single neuron, obtained by thresholding its activation (see the sketch below)
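A sketch of how such a "best neuron" score can be computed, assuming the neurons' activations on the test images are already available. The activations and labels below are random placeholders.

```python
# For each neuron, sweep a threshold over its activations and keep the best
# classification accuracy; the "best neuron" is the one with the highest score.
import numpy as np

rng = np.random.default_rng(0)
n_images, n_neurons = 1000, 50
activations = rng.standard_normal((n_images, n_neurons))   # placeholder activations
labels = rng.integers(0, 2, n_images)                      # 1 = face, 0 = not a face

def best_accuracy(act, labels):
    # try each observed activation value as a threshold
    return max(np.mean((act > t) == labels) for t in act)

scores = [best_accuracy(activations[:, j], labels) for j in range(n_neurons)]
print("best neuron:", int(np.argmax(scores)), "accuracy:", round(max(scores), 3))
```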


Experiment on Faces (cnt'd)

• Visualization
  • Top stimuli (images) for the face neuron
  • Optimal stimulus for the face neuron

Experiment on Faces (cnt'd)

• Invariance properties


Experiment on Cat/Human Body

• Test sets
  • Cat: 10,000 positive, 18,409 negative images
  • Human body: 13,026 positive, 23,974 negative images
• Accuracy of the best neuron on each test set

ImageNet Classification

• Task: recognizing images
• Dataset
  • 20,000 categories
  • 14M images
• Accuracy: 15.8% (previous state of the art: 9.3%)

5.  DISCUSSION  


Discussion

• Deep learning
  • Unsupervised feature learning
  • Learning multiple layers of representation
• Accuracy is increased by invariance (pooling) and contrast normalization
• Scalability

6.  REFERENCES  


References

1. Quoc Le et al., "Building High-level Features Using Large Scale Unsupervised Learning"
2. Nando de Freitas, "Deep Learning", URL: https://www.youtube.com/watch?v=g4ZmJJWR34Q
3. Andrew Ng, "Sparse autoencoder", URL: http://www.stanford.edu/class/archive/cs/cs294a/cs294a.1104/sparseAutoencoder.pdf
4. Andrew Ng, "Machine Learning and AI via Brain Simulations", URL: https://forum.stanford.edu/events/2011slides/plenary/2011plenaryNg.pdf
5. Andrew Ng, "Deep Learning", URL: http://www.ipam.ucla.edu/publications/gss2012/gss2012_10595.pdf