Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition

1
I. INTRODUCTION Sparse representation-based classification (SRC) has recently attracted substantial research attention However, the computational complexity of testing makes it challenging to deploy SRC in practice We propose a novel method for human action recognition, leveraging coarse-to-fine sparse representations that have been obtained through dictionary sub-sampling The proposed method reduces the time complexity of testing at no substantial loss in recognition accuracy JongHo Lee a , Hyun-seok Min a , Jeong-jik Seo a , Wesley De Neve a,b , and Yong Man Ro a a Image and Video Systems Lab, KAIST, Republic of Korea b Multimedia Lab, Ghent University-iMinds, Belgium website: http://ivylab.kaist.ac.kr IEEE International Conference on Multimedia & Expo (ICME), July 2014, Chengdu, China SUB-SAMPLED DICTIONARIES FOR COARSE-TO-FINE SPARSE REPRESENTATION-BASED HUMAN ACTION RECOGNITION e-mail: [email protected] II. PROPOSED APPROACH 1. Training Fig. 2. Time complexity of different human action recognition approaches. Fig. 1. Accuracy of different human action recognition approaches. 0 10 20 30 40 50 60 70 150 300 450 600 750 900 1050 1200 1350 1500 Tim e com plexity(s) N um berofatom s(ls) Proposed m ethod w ith ds =48 Proposed m ethod w ith ds =72 Proposed m ethod w ith ds =144 C onventional m ethod 0.76 0.78 0.8 0.82 0.84 0.86 0.88 0.9 150 300 450 600 750 900 1050 1200 1350 1500 R ecognition accuracy N um berofatom s(ls) Proposed m ethod w ith ds =48 Proposed m ethod w ith ds =72 Proposed m ethod w ith ds =144 C onventional m ethod III. EXPERIMENTS 1. Experimental setup Dataset: UCF-50 Feature: Cuboid detector + HOG descriptor Homotopy-based 1 -norm minimization 2. Experimental results Conventional method: classification only uses the FGD IV. CONCLUSIONS We proposed a novel method for human action recognition using coarse-to-fine sparse representations The proposed method achieves efficient human action recognition at no substantial loss in recognition accuracy 2. Testing Y Random projection Feature Extraction Test video clip Class 1 Class 2 Φ , 1 Φ , 2 Φ , 3 Φ , Sparse Coefficients Ranking 1 +1 +4 Candidate Actions Candidate Action Selection Coarse-Grained Dictionary (CGD) O X X O We select candidate actions Feature Extraction Action 1 Action 2 Action 3 Action Training Dataset Action 1 Action 2 Action 3 Action Action 1 Action 2 Action 3 Action Fine-Grained Dictionary (FGD) Coarse-Grained Dictionary (CGD) Φ , 1 Φ , 2 Φ , 3 Φ , Φ , 1 Φ , 2 Φ , 3 Φ , Random projection (for reducing the dimension of the atoms) Random sampling (for reducing the number of atoms) Dictionary Construction Action 1Action 2Action 3 Action Pruned FGD Φ , 1 Φ , 2 Φ , 3 Φ , Action 1Action 2Action 3 Action Candida te Actions O X X O Φ , 1 Action 1 Φ , Action Pruned FGD Classification We can find the sparse representation of with We label with the action that comes with the smallest residual error is a new vector whose only nonzero entries are the entries in associated with the action Φ , 1 Φ , 2 Φ , 3 Φ ,

description

Sub-Sampled Dictionaries for Coarse-to-Fine Sparse Representation-based Human Action Recognition. Poster presented at the main track of ICME 2014.

Transcript of Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition

Page 1: Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition

I. INTRODUCTION Sparse representation-based classification (SRC) has re-

cently attracted substantial research attention However, the computational complexity of testing makes it

challenging to deploy SRC in practice We propose a novel method for human action recognition,

leveraging coarse-to-fine sparse representations that have been obtained through dictionary sub-sampling

The proposed method reduces the time complexity of test-ing at no substantial loss in recognition accuracy

JongHo Leea, Hyun-seok Mina, Jeong-jik Seoa, Wesley De Nevea,b, and Yong Man Roa

aImage and Video Systems Lab, KAIST, Republic of KoreabMultimedia Lab, Ghent University-iMinds, Belgium

website: http://ivylab.kaist.ac.kr

IEEE International Conference on Multimedia & Expo (ICME), July 2014, Chengdu, China

SUB-SAMPLED DICTIONARIES FOR COARSE-TO-FINE SPARSE REPRESENTATION-BASED HUMAN ACTION RECOGNITION

e-mail: [email protected]

II. PROPOSED APPROACH 1. Training

Fig. 2. Time complexity of different human action recognition approaches.

Fig. 1. Accuracy of different human action recognition approaches.

0

10

20

30

40

50

60

70

150 300 450 600 750 900 1050 1200 1350 1500

Tim

e co

mp

lexi

ty(s

)

Number of atoms(ls)

Proposed method with ds =48

Proposed method with ds =72

Proposed method with ds =144

Conventional method

0.76

0.78

0.8

0.82

0.84

0.86

0.88

0.9

150 300 450 600 750 900 1050 1200 1350 1500

Rec

og

nit

ion

acc

ura

cy

Number of atoms(ls)

Proposed method with ds =48Proposed method with ds =72Proposed method with ds =144Conventional method

III. EXPERIMENTS 1. Experimental setupDataset: UCF-50Feature: Cuboid detector + HOG descriptorHomotopy-based 1-norm minimization

2. Experimental resultsConventional method: classification only uses the FGD

IV. CONCLUSIONS We proposed a novel method for human action recognition

using coarse-to-fine sparse representations The proposed method achieves efficient human action

recognition at no substantial loss in recognition accuracy

2. Testing

Y

Random projection

Feature Extraction

Test video clip

…Class 1 Class 2

Φ𝑠 ,1Φ𝑠 ,2Φ𝑠 ,3 Φ𝑠 ,𝐾

Sparse Coefficients

Ranking 1 +1 +4 𝑯Candidate

Actions

Candidate Action Selection

Coarse-Grained Dictionary (CGD)

O X X O

We select candidate actions

Feature Extraction

…Action 1 Action 2 Action 3 Action

Training Dataset

Action 1 Action 2 Action 3 Action

… … … …

Action 1Action 2Action 3 Action

Fine-Grained Dictionary

(FGD)

Coarse-Grained Dictionary (CGD)

Φ𝑠 ,1Φ𝑠 ,2Φ𝑠 ,3 Φ𝑠 ,𝐾

Φ𝑜 ,1 Φ𝑜 , 2 Φ𝑜 , 3 Φ𝑜 ,𝐾

Random projection (for reducing the dimension of the atoms) Random sampling (for reducing the number of atoms)

Dictionary Construction

Action 1 Action 2 Action 3 Action

Pruned FGD

Φ𝑜 ,1 Φ𝑜 , 2 Φ𝑜 , 3 Φ𝑜 ,𝐾

Action 1 Action 2 Action 3 Action

Candidate

Actions

O X X O

Φ𝑝𝑟 ,1

Action 1

Φ𝑝𝑟 ,𝐻

Action

… Pruned FGD𝐃𝑝𝑟

Classification We can find the sparse representation of with

We label with the action that comes with the smallest resid-ual error

is a new vector whose only nonzero entries are the entries in associated with the action

Φ𝑜 ,1 Φ𝑜 , 2 Φ𝑜 , 3 Φ𝑜 ,𝐾