CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt...

80
CS 4803 / 7643: Deep Learning Zsolt Kira Georgia Tech Topics: (Brief) discussion of multi-modal work Fairness, bias – Wrap-up

Transcript of CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt...

Page 1: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

CS 4803 / 7643: Deep Learning

Zsolt KiraGeorgia Tech

Topics: – (Brief) discussion of multi-modal work– Fairness, bias– Wrap-up

Page 2: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Lots of Data Modalities

filename -2

Images

Videos

Speech/Audio

Text

CombinedSensors

How should we combine them?

Page 3: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 3

Page 4: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 4

Page 5: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Cross-Modal Transformation

(C) Dhruv Batra & Zsolt Kira 5

Page 6: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Cross-Modal Transformation• Lots of examples (some that we’ve seen)

– Captioning (image/video to text)– Text to speech (text to audio)– Speech to text (audio to text)– Sign language translation (video to text)

(C) Dhruv Batra & Zsolt Kira 6

Page 7: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Cross-Modal Learning (Unsupervised)

(C) Dhruv Batra & Zsolt Kira 7

Page 8: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 8

Page 9: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 9

Page 10: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 10

Page 11: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 11

Page 12: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 12

Page 13: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Cross-Modal (Transfer)

(C) Dhruv Batra & Zsolt Kira 13SoundNet: Learning Sound Representations from Unlabeled Video

Page 14: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 14

Page 15: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 15

Page 16: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 16

Page 17: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Mult-Modal Fusion• Several approaches:

– Just combine features at some level of representation (which?)– Gating mechanisms– Learn joint feature space

(C) Dhruv Batra & Zsolt Kira 17

Page 18: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Mult-Modal Fusion• Several approaches:

– Just combine features at some level of representation (which?)

– Gating mechanisms– Learn joint feature space

(C) Dhruv Batra & Zsolt Kira 18

Page 19: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Purpose: To develop algorithms simulating ground-level sensors

that will be on quadrotor To be able to publish algorithms (requires comparison to

state of the art on a public dataset) Already ground-truthed!

The KITTI Dataset

Page 20: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Dense 3D from Sparse LIDAR

Premebida, C., Carreira, J., Batista, J., Nunes, U., 2014. Pedestrian detection combining RGB and

dense LIDAR data, in: Intelligent Robots and Systems (IROS 2014),

2014 IEEE/RSJ International Conference on. IEEE, pp. 4112–4117.

From sparse lidar

To a dense depth map

Via spatial filtering

Page 21: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

HHA Representation

- We have modified algorithm/code to extract multiple aspects of 3D

information obtained from velodyne LIDAR

- Currently state of art in indoor RGBD tasks

- We have modified it to make it work for unstructured outdoor

scenes

- These additional channels of information will be incorporated in next redesigned CNN architecture

Gupta, S., Girshick, R., Arbeláez, P., Malik, J., 2014. Learning rich features from RGB-D images for object

detection and segmentation, in: Computer Vision–ECCV 2014. Springer, pp. 345–360.

Page 22: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Object Detection Pipeline

22

RegionProposals

CNN Feature

Extraction

Classification & Box

Regression

Class Label,

Bounding Box

(x,y,w,h)

Page 23: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

• Research question: Given that– CNNs learn a hierarchical feature space– Each channel is representing a different

aspect of the scene

• Where should fusion occur?

Question: Where should fusion occur?

J. Schlosser, C. Chow, Z. Kira, “Fusing LIDAR and Images for Pedestrian Detection using Convolutional Neural Networks”, accepted to the International Conference on

Robotics & Automation (ICRA), 2015

Page 24: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Deep Learning Architectures for Multi-Modal Fusion

Baseline /

Related Work

CNN Baseline

Fusion Conditions

Explore the question: At what level of representation should we fuse different

modalities (image and depth)?

Page 25: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Fusion Architectures

25

Original Architecture

Fusion Architecture (Fusion-4c-conv3)

Page 26: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Parameter Efficiency vs. Performance

Page 27: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Multi-Modal Fusion

(C) Dhruv Batra & Zsolt Kira 27

Page 28: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 28

Page 29: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 29

Page 30: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 30

CentralNet: a Multilayer Approach for Multimodal

Fusion, Valentin Vielzeuf, Alexis Lechervy, Stéphane Pateux, Frédéric Jurie

Page 31: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 31

Robust Deep Multi-modal Learning Based on Gated Information Fusion Network, Kim, et al.

Page 32: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 32

Robust Deep Multi-modal Learning Based on Gated Information Fusion Network, Kim, et al.

Page 33: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Mult-Modal Fusion• Several approaches:

– Just combine features at some level of representation (which?)– Gating mechanisms– Learn joint feature space

(C) Dhruv Batra & Zsolt Kira 33

Page 34: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 34

Page 35: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 35

Page 36: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 36

Page 37: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 37

Page 38: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 38

Page 39: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 39

Page 40: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Slides by Li, et al.

Combining multiple types of data (2D & 3D)(

Page 41: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Image based Shape Retrieval

Slides by Li, et al.

Page 42: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Shape based Image Retrieval

Slides by Li, et al.

Page 43: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

ML and Fairness• AI effects our lives in many ways• Widespread algorithms with many small interactions

– e.g. search, recommendations, social media• Specialized algorithms with fewer but higher-stakes

interactions– e.g. medicine, criminal justice, finance

• At this level of impact, algorithms can have unintended consequences

• Low classification error is not enough, need fairness

(C) Dhruv Batra & Zsolt Kira 43Slide Credit: David Madras

Page 44: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 44

Page 45: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 45

Page 46: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 46

Why is this a problem?

Page 47: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Decision Theory

Equal costs -> Cross Entropy!

(C) Dhruv Batra & Zsolt Kira 47

Page 48: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

ML and Fairness• Fairness is morally and legally motivated• Takes many forms• Criminal justice: recidivism algorithms (COMPAS)

– Predicting if a defendant should receive bail– Unbalanced false positive rates: more likely to wrongly deny

a black person bail

(C) Dhruv Batra & Zsolt Kira 48Slide Credit: David Madras

Page 49: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Example – Word Embeddings• Fairness is morally and legally motivated• Takes many forms• Bias found in word embeddings (Bolukbasi et al.

2016)– Examined word embeddings (word2vec) trained on Google

News– Represent each word with high-dimensional vector– Vector arithmetic: analogies like Paris - France = London -

England– Found also: man - woman = programmer - homemaker =

surgeon - nurse• The good news: word embeddings learn so well!• The bad news: sometimes too well• Our chatbots should be less biased than we are

(C) Dhruv Batra & Zsolt Kira 49Slide Credit: David Madras

Page 50: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Example – Word Embeddings• Algorithmic fairness: how can we ensure that our

algorithms act in ways that are fair?– This definition is vague and somewhat circular– Describes a broad set of problems, not a specic technical

approach– Related to accountability: who is responsible for automated

behaviour? How do we supervise/audit machines which have large impact?

– Also transparency: why does an algorithm behave in a certain way? Can we understand its decisions? Can it explain itself?

– Connections to AI safety and aligned AI: how can we make AI without unintended negative consequences? Aligns with our values?

(C) Dhruv Batra & Zsolt Kira 50Slide Credit: David Madras

Page 51: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Why Fairness is Hard• Suppose we are a bank trying to fairly decide who should get a loan

– i.e. Who is most likely to pay us back?• Suppose we have two groups, A and B (the sensitive attribute)

– This is where discrimination could occur• The simplest approach is to remove the sensitive attribute from the

data, so that our classier doesn't know the sensitive attribute

(C) Dhruv Batra & Zsolt Kira 51Slide Credit: David Madras

Page 52: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Why Fairness is Hard• However, if the sensitive attribute is correlated with the other

attributes, this isn't good enough• It is easy to predict race if you have lots of other information (e.g.

home address, spending patterns)• More advanced approaches are necessary

(C) Dhruv Batra & Zsolt Kira 52Slide Credit: David Madras

Page 53: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Definitions of Fairness – Group Fairness

• So we've built our classier . . . how do we know if we're being fair?• One metric is demographic parity | requiring that the same

percentage of A and B receive loans– What if 80% of A is likely to repay, but only 60% of B is?– Then demographic parity is too strong

• Could require equal false positive/negative rates– When we make an error, the direction of that error is equally likely for both groups

• These are denitions of group fairness• Treat dierent groups equally"

(C) Dhruv Batra & Zsolt Kira 53Slide Credit: David Madras

Page 54: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Definitions of Fairness – Individual Fairness

• Also can talk about individual fairness | \Treat similar examples similarly"

• Learn fair representations– Useful for classification, not for (unfair) discrimination– Related to domain adaptation– Generative modelling/adversarial approaches

(C) Dhruv Batra & Zsolt Kira 54Slide Credit: David Madras

Page 55: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Conclusion

• This is an exciting eld, quickly developing• Central definitions still up in the air• AI moves fast | lots of (currently unchecked) power• Law/policy will one day catch up with technology• Those who work with AI should be ready

(C) Dhruv Batra & Zsolt Kira 55Slide Credit: David Madras

Page 56: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Highlights of Class

(C) Dhruv Batra & Zsolt Kira 56

Page 57: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Error Decomposition

(C) Dhruv Batra and Zsolt Kira 57

Reality

3x3 conv, 384

Pool

5x5 conv, 256

11x11 conv, 96

Input

Pool

3x3 conv, 384

3x3 conv, 256

Pool

FC 4096

FC 4096

Softmax

FC 1000

AlexNet

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 58: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

58

Example with an image with 4 pixels, and 3 classes (cat/dog/ship)

0.2 -0.5 0.1 2.0

1.5 1.3 2.1 0.0

0 0.25 0.2 -0.3

WInput image

56

231

24

2

56 231

24 2

Stretch pixels into column

1.1

3.2

-1.2

+-96.8

437.9

61.95

=Cat score

Dog score

Ship score

b

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 59: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Before: Logistic Regression as Cascade

(C) Dhruv Batra 59

Given a library of simple functions

Compose into a

complicate function

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

Page 60: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

60

x hW1 sW2

3072 100 10

Neural networks: without the brain stuff

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

(Before) Linear score function:

(Now) 2-layer Neural Network

Page 61: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Key Computation: Back-Prop

(C) Dhruv Batra and Zsolt Kira 61Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

Page 62: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra and Zsolt Kira 62

Example: Reverse mode AD

+

sin( )

x1 x2

*

Page 63: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

April 2463

63

Adam (full form)

Kingma and Ba, “Adam: A method for stochastic optimization”, ICLR 2015

Momentum

AdaGrad / RMSProp

Bias correction

Bias correction for the fact that first and second moment estimates start at zero

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 64: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

April 246464

Batch Normalization

Page 65: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

April 246565

Convolution Layers

Page 66: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

ResNet

Page 67: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 68: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Semantic Segmentation Idea: Fully Convolutional

Input:3 x H x W Predictions:

H x W

Design network as a bunch of convolutional layers, with downsampling and upsampling inside the network!

High-res:D1 x H/2 x W/2

High-res:D1 x H/2 x W/2

Med-res:D2 x H/4 x W/4

Med-res:D2 x H/4 x W/4

Low-res:D3 x H/4 x W/4

Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015Noh et al, “Learning Deconvolution Network for Semantic Segmentation”, ICCV 2015

Downsampling:Pooling, strided convolution

Upsampling:???

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 69: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

69

3D CNNs

Page 70: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

April 2470

h0 fW h1 fW h2 fW h3

yT

…x

W

RNN: Computational Graph: One to Many

hT

y3y2y1

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 71: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

LSTMs Intuition: Additive Updates

(C) Dhruv Batra 71Image Credit: Christopher Olah (http://colah.github.io/posts/2015-08-Understanding-LSTMs/)

Backpropagation from ct to ct-1 only elementwise

multiplication by f, no matrix multiply by W

Page 72: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 72

Unsupervised Learning

Page 73: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Training GANs: Two-player game

Generator network: try to fool the discriminator by generating real-looking imagesDiscriminator network: try to distinguish between real and fake images

zRandom noise

Generator Network

Discriminator Network

Fake Images(from generator)

Real Images(from training set)

Real or Fake

Ian Goodfellow et al., “Generative Adversarial Nets”, NIPS 2014

Fake and real images copyright Emily Denton et al. 2015. Reproduced with permission.

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 74: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

74

Variational Autoencoders: Intractability

Since we’re modeling probabilistic generation of data, encoder and decoder networks are probabilistic

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 75: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

:neural network with weights

Q-network Architecture

Current state st: 84x84x4 stack of last 4 frames (after RGB->grayscale conversion, downsampling, and cropping)

16 8x8 conv, stride 4

32 4x4 conv, stride 2

FC-256

FC-4 (Q-values) Last FC layer has 4-d output (if 4 actions), corresponding to Q(st, a1), Q(st, a2), Q(st, a3), Q(st,a4)

Number of actions between 4-18 depending on Atari game

A single feedforward pass to compute Q-values for all actions from the current state => efficient!

[Mnih et al. NIPS Workshop 2013; Nature 2015]

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Page 76: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

(C) Dhruv Batra & Zsolt Kira 76Slide Credit: Thomas Kipf

Page 77: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

We have come a long way• ML background – error decomposition, overfitting, features, etc.• Linear classifiers

– & softmax• Computation Graph• Gradient Descent• Adding layers• Backpropagation, automatic differentiation• Optimization – regularization/normalization (batch norm, dropout), augmentation,

different optimizers (adam, adagrad, etc.)• Convolution and Pooling layers• Modern CNNs - AlexNet, VGG, Inception, ResNet• 3D CNNs• Recurrent Neural Networks and LSTMs

– NLP, word/sentence vectors, attention, etc.• Unsupervised feature learning• Generative models (GANs, VAEs)• Deep Reinforcement Learning• Other applications: Few-Shot Learning, structure

(C) Dhruv Batra & Zsolt Kira 77

Page 78: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Things to Watch out For• Research is cyclical

– SVMs, boosting, probabilistic graphical models & Bayes Nets, Structural Learning, Sparse Coding, Deep Learning

– Deep learning is unique in its depth and breadth, but...– Deep learning may be improved, reinvented, combined, overtaken

• Learn fundamentals for techniques across the field:– Know the span of ML techniques and choose the ones that fit your

problem!– Be responsible in 1) how you use it, 2) promises you make and how

you convey it

• Try to understand landscape of the field– Look out for what is coming up next, not where we are

• Have fun!

Page 79: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

Some current/upcoming topics• Current / Recent Past

– AutoML– Meta-learning– Unsupervised, semi-supervised, domain adaptation, zero/one/few-shot

learning– Memory – Visual question answering, embodied question answering– Adversarial Examples

• More recent– Continual/lifelong learning without forgetting– World modeling, learning intuitive/physics models– Visual dialogue, agents, chatbots– Fixing reinforcement learning

• First you have to admit you have a problem– Simulation frameworks, joint perception, planning, and action

• Navigation, mapping– Deep Learning and logic!– Just scaling everything up and watch the magic!

Page 80: CS 4803 / 7643: Deep Learning - College of Computing...a black person bail (C) Dhruv Batra & Zsolt Kira Slide Credit: David Madras 48. Example –Word Embeddings • Fairness is morally

CIOS!

http://gatech.smartevals.com/

(C) Dhruv Batra & Zsolt Kira 80