Visual Dialog - Stanford University

Post on 25-May-2020


Visual Dialog
Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M.F. Moura, Devi Parikh, Dhruv Batra

Presented by: Alan Luo


Introduction
Natural Language Processing + Computer Vision

● Aiding visually impaired users in understanding their surroundings or social media content

● Interacting with an AI assistant


Related Work
Image/Video Captioning

● Image Captioning

● Video Captioning


Related Work
Visual-Semantic Alignments

● Datasets


Related Work
Visual Q&A

Contributions

1. Propose a new AI task: Visual Dialog

2. Develop a novel two-person chat data-collection protocol and introduce a new dataset

3. Introduce a family of neural encoder-decoder models for Visual Dialog


Technical Details
With Late Fusion Encoder
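The late fusion idea named on this slide can be illustrated with a minimal sketch: the image, the current question, and the dialog history are each encoded separately, and the three vectors are combined only at the end by concatenation followed by a linear layer and a tanh. Everything concrete here is an assumption made for illustration: the feature vectors are random stand-ins for CNN image features and LSTM question/history states, and the dimensions, weights, and the names `linear` and `late_fusion_encode` are hypothetical, not taken from the authors' code.

```python
import math
import random


def linear(weights, bias, x):
    """Compute W x + b for a weight matrix stored as a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def late_fusion_encode(image_feat, question_feat, history_feat, weights, bias):
    """Fuse three independently computed feature vectors into one joint vector."""
    # "Late" fusion: the inputs interact only here, after separate encoding.
    fused = image_feat + question_feat + history_feat  # list concatenation
    return [math.tanh(z) for z in linear(weights, bias, fused)]


# Toy stand-ins (assumption): real inputs would come from a CNN and two LSTMs.
rng = random.Random(0)
image_feat = [rng.uniform(-1, 1) for _ in range(8)]
question_feat = [rng.uniform(-1, 1) for _ in range(4)]
history_feat = [rng.uniform(-1, 1) for _ in range(4)]

joint_dim = 5  # hypothetical size of the joint embedding
in_dim = len(image_feat) + len(question_feat) + len(history_feat)
weights = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(joint_dim)]
bias = [0.0] * joint_dim

joint = late_fusion_encode(image_feat, question_feat, history_feat, weights, bias)
print(len(joint))
```

In a full model the joint vector would be handed to a decoder that produces or ranks answers; that stage is left out of this sketch.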


Qualitative Quantitative


Dataset
VisDial

Results

Qualitative Results


Quantitative Results