
CIS 522: Lecture 5R

Autoencoders (02/13/20)

Feedback / Logistics
● Lecture Notes
● Puppy as option instead of kitten
● HW2 - Due tonight at midnight!
● HW3 - Computer Vision
  ○ Groups of 2
● Final Project Abstract Proposal (Due Tuesday 2/18)

Generative Models

Cognitive science
Imagine a kitten. Make it black. Make it play with a ball.

Like this?

What is a Generative Model?

Discriminative Model: Given a set of classes C and a sample X, what is the probability that X belongs to class c?

Succinctly represented as this quantity: P(c | X) (called a class probability estimate)

To generate a classification: pick the class c that maximizes P(c | X)

Examples:
1. Logistic Regression
2. Decision Trees
3. Support Vector Machines

Generative Model: Given a class c from a set of classes C, and a sample X, what is the probability that X was generated from that class?

Succinctly represented as this distribution: P(X | c)


Now to generate an X, we can simply randomly sample from this distribution!

Examples

● Naive Bayes
● Variational Autoencoder (to be covered in this lecture)
● Generative Adversarial Networks (to be covered in the next lecture!)


Low-dimensional representations

Cute puppy… but takes ~ half a million numbers to represent

Artificial vs. biological representations


Autoencoders

Basic architecture

Training
● "Self-supervised" training: the desired output Y is just the input X
● e.g. MSE loss
● Small bottleneck layer impedes training by limiting how much information can pass through, so the network cannot simply copy the input

[Diagram: input X passes through the encoder, bottleneck, and decoder to produce the reconstruction ŷ]
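A minimal sketch of this self-supervised setup in PyTorch (the layer sizes, the 784-dimensional flattened input, and the training details are illustrative assumptions, not taken from the slides):

    import torch
    import torch.nn as nn

    class Autoencoder(nn.Module):
        def __init__(self, input_dim=784, bottleneck_dim=32):
            super().__init__()
            # Encoder compresses the input down to a small bottleneck code.
            self.encoder = nn.Sequential(
                nn.Linear(input_dim, 128), nn.ReLU(),
                nn.Linear(128, bottleneck_dim),
            )
            # Decoder reconstructs the input from the bottleneck code.
            self.decoder = nn.Sequential(
                nn.Linear(bottleneck_dim, 128), nn.ReLU(),
                nn.Linear(128, input_dim),
            )

        def forward(self, x):
            return self.decoder(self.encoder(x))

    model = Autoencoder()
    criterion = nn.MSELoss()                              # reconstruction loss
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

    x = torch.rand(64, 784)                               # dummy batch of inputs
    loss = criterion(model(x), x)                         # the target is the input itself
    loss.backward()
    optimizer.step()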

What is a linear autoencoder?
● What happens if there is no nonlinearity?
● Linear encoder and decoder
● With L2 regularization on the encoder and decoder, it is exactly PCA (Kunin et al. 2019)

Convolutional Autoencoders
● Decoder has to "undo" the convolution / pooling
● Generally involves upsampling or "deconvolutions"
● Also conv layers reducing the # of features

CAE:

Source: https://stackabuse.com/autoencoders-for-image-reconstruction-in-python-and-keras/
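A sketch of a convolutional autoencoder along these lines, for 28×28 single-channel images; the exact layer sizes are illustrative assumptions, not the architecture from the linked article:

    import torch
    import torch.nn as nn

    class ConvAutoencoder(nn.Module):
        def __init__(self):
            super().__init__()
            # Encoder: convolutions plus pooling shrink the spatial dims and feature count.
            self.encoder = nn.Sequential(
                nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
                nn.MaxPool2d(2),                                   # 28x28 -> 14x14
                nn.Conv2d(16, 4, kernel_size=3, padding=1), nn.ReLU(),
                nn.MaxPool2d(2),                                   # 14x14 -> 7x7
            )
            # Decoder: transposed convolutions upsample back to the input size.
            self.decoder = nn.Sequential(
                nn.ConvTranspose2d(4, 16, kernel_size=2, stride=2), nn.ReLU(),     # 7x7 -> 14x14
                nn.ConvTranspose2d(16, 1, kernel_size=2, stride=2), nn.Sigmoid(),  # 14x14 -> 28x28
            )

        def forward(self, x):
            return self.decoder(self.encoder(x))

    x = torch.rand(8, 1, 28, 28)
    assert ConvAutoencoder()(x).shape == x.shape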

Issue with Convolutional Autoencoders
● Space forms clusters
● Clusters are far apart
● What if we want to interpolate between classes?

Source: https://towardsdatascience.com/intuitively-understanding-variational-autoencoders-1bfe67eb5daf

Constraining the Features
● Currently no constraints on the features.
● What if we constrained the features to the standard normal distribution?

Source: https://towardsdatascience.com/light-on-math-machine-learning-intuitive-guide-to-understanding-kl-divergence-2b382ca2b2a8

KL Divergence Loss
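For a diagonal Gaussian encoder output N(μ, σ²) and a standard normal target, this KL divergence has a closed form (summed over the latent dimensions j); this is the standard expression, written out here for reference:

    D_{\mathrm{KL}}\big(\mathcal{N}(\mu, \sigma^2)\,\|\,\mathcal{N}(0, 1)\big)
        = \frac{1}{2} \sum_j \left( \mu_j^2 + \sigma_j^2 - 1 - \log \sigma_j^2 \right)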

Only using the KL Divergence Loss

… clearly can’t only use a KL Divergence Loss

Source: https://towardsdatascience.com/intuitively-understanding-variational-autoencoders-1bfe67eb5daf

KL Divergence + Reconstruction Loss?

Much better!

Source: https://towardsdatascience.com/intuitively-understanding-variational-autoencoders-1bfe67eb5daf

KL Divergence + Reconstruction Loss?

Loss = regularization term (KL divergence) + reconstruction loss

Variational autoencoders (VAEs) - motivation
● Want the low-dim representation z to have independent components
● Each component should contribute a similar amount
● z distributed according to a unit Gaussian

Example:
● Low-dim representation of faces - eye color, hair length, etc.
● Each feature should be independent
● Scale so each feature has the same variance

VAE Process

Source: http://kvfrans.com/variational-autoencoders-explained/

Why sample a latent vector?

Source: https://towardsdatascience.com/intuitively-understanding-variational-autoencoders-1bfe67eb5daf

VAE Process

Loss = regularization term (KL divergence) + reconstruction loss

Source: http://kvfrans.com/variational-autoencoders-explained/

VAE Computational Graph - Forward Pass

[Diagram: input x passes through the encoder to produce μ and σ, from which z is sampled]

VAE Computational Graph - Backward Pass

[Diagram: on the backward pass, gradients cannot flow through the random sampling step that produces z from μ and σ]

Reparameterization Trick

[Diagram: with the reparameterization trick, z is computed deterministically from μ, σ, and an externally sampled noise ε, so gradients flow back to μ and σ]

z ~ N(μ, σ)
ε ~ N(0, 1)
z = μ + σε
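A sketch of the trick in PyTorch; having the encoder predict log σ² instead of σ is a common convention assumed here, since it keeps the scale positive without constraints:

    import torch

    def reparameterize(mu, logvar):
        # z = mu + sigma * eps with eps ~ N(0, 1) drawn outside the graph,
        # so gradients flow through mu and sigma but not through the sampling itself.
        std = torch.exp(0.5 * logvar)
        eps = torch.randn_like(std)
        return mu + std * eps

    mu = torch.zeros(4, 8, requires_grad=True)
    logvar = torch.zeros(4, 8, requires_grad=True)
    z = reparameterize(mu, logvar)
    z.sum().backward()            # gradients now reach mu and logvar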

Where did this loss function come from?

Loss = regularization term (KL divergence) + reconstruction loss
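The two labeled terms written out as a loss function; binary cross-entropy as the reconstruction term is one common choice (an assumption here, MSE works as well), and the KL term uses the standard closed form for a diagonal Gaussian versus the unit normal:

    import torch
    import torch.nn.functional as F

    def vae_loss(x_hat, x, mu, logvar):
        # Reconstruction term: how well the decoder output matches the input.
        recon = F.binary_cross_entropy(x_hat, x, reduction='sum')
        # Regularization term: KL(q(z|x) || N(0, I)) in closed form.
        kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
        return recon + kl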

Estimating the VAE Lower Bound

VAE Objective Function:

Latent Representation: (Think HMM's)

Can compute individual terms of integral, but not the integral itself!
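In standard VAE notation, the objective the slides refer to is the likelihood of the data under a latent-variable model; a sketch of the quantities involved:

    \log p_\theta(x), \qquad
    p_\theta(x) = \int p_\theta(z)\, p_\theta(x \mid z)\, dz

The prior p(z) and the decoder likelihood p(x | z) can each be evaluated for any particular z, but integrating over every possible z is intractable.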

Estimating the VAE Lower Bound

Encoder Network: q(z | x)        Decoder Network: p(x | z)

Source: http://kvfrans.com/variational-autoencoders-explained/

Estimating the VAE Lower Bound

Equations from Stanford lecture linked here: https://www.youtube.com/watch?v=5WoItGTWV54
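The decomposition behind the labels below, reconstructed in standard VAE notation; the slides' own equations were images, so this follows the standard derivation from the linked Stanford lecture:

    \log p_\theta(x)
      = \underbrace{\mathbb{E}_{z \sim q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big]}_{\text{decoder log-likelihood}}
      - \underbrace{D_{\mathrm{KL}}\big(q_\phi(z \mid x)\,\|\,p(z)\big)}_{\text{constrains } z \text{ to the prior}}
      + \underbrace{D_{\mathrm{KL}}\big(q_\phi(z \mid x)\,\|\,p_\theta(z \mid x)\big)}_{\text{intractable, but } \geq 0}

    \Rightarrow\quad
    \log p_\theta(x) \;\ge\; \mathbb{E}_{z \sim q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big]
      - D_{\mathrm{KL}}\big(q_\phi(z \mid x)\,\|\,p(z)\big)
      \quad \text{(the ELBO, which the VAE maximizes)}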

The three terms of the decomposition:
● Log likelihood from our decoder network
● KL divergence constraining z to the standard Gaussian
● Intractable term


Dropping the intractable term leaves a tractable lower bound (since the KL divergence is always non-negative):
● Log likelihood from our decoder network (evaluated on z sampled from the encoder)
● KL divergence constraining z to the standard Gaussian


Transposed Convolution (“Deconvolution”)

Source: https://github.com/vdumoulin/conv_arithmetic

Issues with VAEs

Current fix is to use a VAE + GAN (you’ll learn about GANs in the next lecture)

Transposed Convolution (“Deconvolution”)

Source: https://github.com/vdumoulin/conv_arithmetic
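A minimal PyTorch example of the upsampling behavior (channel counts and kernel size are arbitrary choices for illustration):

    import torch
    import torch.nn as nn

    # A stride-2 transposed convolution roughly "undoes" a stride-2 downsampling:
    # it maps an 8x8 feature map up to 16x16.
    deconv = nn.ConvTranspose2d(in_channels=4, out_channels=1, kernel_size=2, stride=2)
    x = torch.rand(1, 4, 8, 8)
    print(deconv(x).shape)        # torch.Size([1, 1, 16, 16])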

Checkerboarding Artifact

Source: https://distill.pub/2016/deconv-checkerboard/

Resize Convolutions

Source: https://distill.pub/2016/deconv-checkerboard/
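The resize-convolution alternative described in the Distill article, sketched in PyTorch: upsample with a fixed interpolation first, then apply an ordinary convolution, which sidesteps the uneven kernel overlap that produces checkerboarding:

    import torch
    import torch.nn as nn

    resize_conv = nn.Sequential(
        nn.Upsample(scale_factor=2, mode='nearest'),     # 8x8 -> 16x16 by plain resizing
        nn.Conv2d(4, 1, kernel_size=3, padding=1),       # ordinary conv on the upsampled map
    )
    x = torch.rand(1, 4, 8, 8)
    print(resize_conv(x).shape)   # torch.Size([1, 1, 16, 16])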

Better Weight Initialization

● Train unsupervised with unlabeled inputs

● Throw away decoder network

● Start training with labeled data!

“Explainable” AI

What do these weights mean? What happens when the model makes a mis-prediction?


Vary each of the features in the code vector and see what the output looks like!
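One way to do this latent traversal, sketched against a hypothetical trained VAE exposing encode(x) -> (mu, logvar) and decode(z); these method names are assumptions, not from the slides:

    import torch

    def latent_traversal(model, x, dim, values=(-3.0, -1.0, 0.0, 1.0, 3.0)):
        # Encode the input, then vary one latent feature while holding the rest fixed.
        mu, _ = model.encode(x)
        outputs = []
        for v in values:
            z = mu.clone()
            z[:, dim] = v
            outputs.append(model.decode(z))   # inspect how the reconstruction changes
        return outputs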

VAE Vector Arithmetic

Face x with moustache - Face x without moustache = moustache
Face x' without moustache + moustache = Face x' with moustache
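The same arithmetic in code, again assuming a hypothetical model with encode/decode methods; the attribute direction is simply the difference of latent means:

    def add_attribute(model, x_with, x_without, x_target):
        # e.g. x_with = face x with moustache, x_without = face x without moustache,
        # x_target = face x' without moustache.
        z_with, _ = model.encode(x_with)
        z_without, _ = model.encode(x_without)
        attribute = z_with - z_without             # a "moustache" direction in latent space
        z_target, _ = model.encode(x_target)
        return model.decode(z_target + attribute)  # face x' with moustache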

Style Disentanglement

[Diagram: a VAE whose latent code is split into Zartist, Ztime, and Zstyle components]

Mona Lisa painted by Dalí in 2019?

Cubist “Starry Night” in 1500?


Interpolation of Latent Vectors - Music VAE
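Interpolation is a walk between two latent codes; a sketch with the same hypothetical encode/decode interface (linear interpolation shown for simplicity):

    import torch

    def interpolate(model, x_a, x_b, steps=8):
        z_a, _ = model.encode(x_a)
        z_b, _ = model.encode(x_b)
        # Decode points along the straight line between the two latent codes.
        return [model.decode((1 - t) * z_a + t * z_b) for t in torch.linspace(0, 1, steps)]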