Visual Scene Understanding (CS 598)

Derek Hoiem

Course Number: 46411Instructor: Derek HoiemRoom: Siebel Center 1109Class Time: Tuesday and Thursday 11:00am – 12:15pmOffice Hours: Tuesday and Thursday 12:15-1pm; by appointmentContact: dhoiem@uiuc.edu, Siebel 3312

• Introductions

• Overview of logistics

• Overview of class material

Vision: What is it good for?

Biological (Humans)

1.2.3.4.5.6.7.8.9.10.

Technological (Computers)

1.2.3.4.5.6.7.8.9.10.

Note: Unfortunately, these got erased when my computer crashed

Course Logistics

Class Content Overview

• Tutorials and Perspectives

• Paper readingI) Spatial InferenceII) ObjectsIII) ActionsIV) Context and Integration

Visual Scene Understanding

Visual scene understanding is the ability to infer

general principles and current situations from imagery in a way that helps achieve goals.

I. Spatial Inference

Getting Around

Spatial Inference: applications

Household RobotsAutomated Vehicles

Graphics ApplicationsPredict object size/position

Spatial Inference: open questions

• How do we represent space?– Surface orientations, depth maps, voxels?

• How do we infer it from available sensory data (image, stereo, motion, laser range finder)?

II. Objects

Finding Things and Observing Them

Image classification: Are there any dogs?Photo credit: iansand – flickr.com

Object Localization: Where are the dog(s)?

Verification: Is this a dog?

Description: Furry, small, nice, side view

Identification: My friend Sally?

Recognizing Stuff

Object Recognition: applications

Photo SearchSecurity

Robots

Object Recognition: open questions

• How many examples does it take to learn one category well?

• How many examples does it take to learn 100 categories well?

• How do these answers depend on the level of supervision?

• Can recognition be solved with simple methods and massive amounts of data?

• How can we quickly recognize an object?

• How can we scale up to deal with thousands of categories?

III. Actions

Taking Action

[Saxena et al. 2008]

Recognizing Actions

KTH Dataset

Figure from Laptev et al. 2008

Recognizing Actions

Figure from Laptev et al. 2008

Reading Emotions

Photo credit: Comstok

Actions: applications

SecurityVideo Search

Actions: open questions

• How are actions defined?

• Does it make sense to categorize them?– If not, how do we recognize them?

• What are good visual representations for inferring actions?

• How can we recognize activities?

IV. Context and Integration

[Hoiem et al. 2008]

Context and Integration

[Hoiem et al. 2008]

• Objects + scene categories better detection

• Movement + objects action/activity recognition

• Space + objects navigation

Context and Integration: applications

Everything that vision is good for

Context and Integration: open questions

• Should context be explicit (e.g., “cars drive on the road”) or implicit (feature-based)?

• How do we model and learn the interactions between different processes and scene characteristics?

• How do we deal with the growing complexity as more and more pieces are put together?

General Problems in Computer Vision

• Better understanding of limitations and their sources– Need new experimental paradigms

• Improve generalization– Aim to generalize across datasets, categories, and

tasks– Work on knowledge sharing and transfer

• Vision as a way of learning about the world– Integration into AI– Systems that acquire knowledge over time

Successes of Computer Vision• Point matching (e.g. 2d3)

– Tracking– Structure from motion– Stitching

• Product inspection• Multiview 3d reconstruction• Face recognition and modeling• Object recognition on pre-2000 datasets• Interactive segmentation (ongoing)

• Register on bulletin board

• Post comments on Thursdays reading (due tomorrow)

• Look over schedule and decide which days to present (due next Tues)

• Start thinking about projects– Let me know if you want a specific pairing (due Tues)

Questions?

• Make you a better researcher (esp. in vision)– More knowledge– Better critical thinking skills– Improved communication skills– Improved research skills

Grades

• Participation: 25%– Posting– Class discussion

• Presentation: 25%

• Projects: 50%– Proposal, progress report, final paper, and oral

Policies

• Attendance required (see syllabus)

• Give credit where due

• No formal prerequisites

• Everything needs to be on time

Reading

• Read well

• Post comments to bulletin board at least 24 hours before class

Presentations• Presenter

– Everyone does two– Good quality coverage of topic (40 min)– See syllabus for guidelines– Sign up by next Tuesday (at latest)– TBAs are your choice (decide at least 4 weeks in advance)

• Demonstrator– If all days are taken, pair up– One person’s job will be to demonstrate some aspect of the algorithm

(e.g., where it succeeds and fails) by running it on many examples– May require implementation

• Note taker

Projects• Timeline

– Proposal: Feb 12 (3 ½ weeks!)– Progress report: Mar 19– Presentation: paper May 5, oral later

• Progress report• Presentation

– Paper– Oral

• In pairs– Can choose partner or be randomly paired

• Suggestions on web

• Potentially will lead to publication (e.g. NIPS)

• Register on bulletin board

• Post comments on Thursdays reading (due tomorrow)

• Look over schedule and decide which days to present (due next Tues)

• Start thinking about projects– Let me know if you want a specific pairing (due Tues)

Questions?

Visual Scene Understanding (CS 598)

Documents

Transcript of Visual Scene Understanding (CS 598)

O 7, 2014 - Hartford 598 Blue Hills Final Report.pdf · On the Fatal Fire of October 7, 2014 Date Report Issued: August 7, 2015 . 3 ... 14. Walkthrough of the fire scene at 598 Blue

CS 598: Spectral Graph Theory. Lecture 1 - Course Website Directory

PVZ 598-2013

CS 598 EVS: Tensor Computations - Tensor Decomposition

EVOLUTIONARY HMMS BAYESIAN APPROACH TO MULTIPLE ALIGNMENT Siva Theja Maguluri CS 598 SS.

BasketNews 598

CS 294-7: Digital Modulation - Spread Spectrum Scene Online: An

Visual Scene Understanding (CS 598) Derek Hoiem Course Number: 46411 Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday.

CS 598 JGE Fall 2017 One-Dimensional Computational Topologyweb.engr.illinois.edu/~jeffe/teaching/topology17/proposals/all... · One-Dimensional Computational Topology Project Proposals

CS 598 MCC – Advanced Internetworks Future Internet Architecture Locator-/Identifier-Split Quirin Scheitle scheitl2@illinois.edu.

CS 598 Scripting Languages Design and Implementation 10. Interpreters Part I: Lisp.

CS/BIOE 598: Algorithmic Computational Genomics Tandy Warnow Departments of Bioengineering and Computer Science .

CS 294-7: Radio Propagation - Spread Spectrum Scene Online: An

CS 598 MCC – Advanced Internetworks

Classification Derek Hoiem CS 598, Spring 2009 Jan 27, 2009.

CS 598 Scripting Languages Design and Implementation 2. MATLAB 1.

Botnets: Yesterday, Today, and Tomorrow CS 598: Advanced Internet Presented by: Imranul Hoque.

Open Scene Graph Visualization II MSIM 842, CS 795/895

1 High-Assurance Laboratory, CS&E Dept., ASUSourav@asu.edu CSE 434/598 - Computer Networks Network Layer Computer Science & Engineering Department Arizona.

CRIME SCENE PHOTOGRAPHY - AustinTexas.gov · 2019-06-07 · FORENSIC SCIENCE DIVISION CRIME SCENE SECTION TECHNICAL MANUAL . CS Technical Manual Effective Date: January 11, 2016 Approved