Stereo-based Hand Gesture Tracking and Recognition in Immersive Stereoscopic Displays Habib Abi-Rached Thursday 17 February 2005.



Objective

• Mission: Facilitate communication:
  – Bandwidth.
  – Intuitiveness.
  – Efficiency.

• Means:
  – Visual (Displays, HMD …).
  – Gestural.


Initial Exploration. (Kodak).

• Domes.
• Driving simulators.
• Cave-like environments.

Simulator sickness.


Initial Exploration. (Ford).

• Accuracy of the user’s mental models based on visual displays.

• Usefulness of stereo displays.


Limitation of Current Technology.

• Limited efficiency.
  – Mouse, keyboard…

• No 3D. (Monitors).

• Small FOV. (Monitors).

• Few Degrees of Freedom. (Joysticks, Mice).

• Limited intuitiveness.

• Physical connection.
  – (Gloves, mice, HMD, PHANToM, Polhemus).

• Precision depends on distance.


Hand Gestures

• Human-computer interaction (HCI) has become an increasingly important part of our daily lives.

• Keyboards and mice are the most popular modes of HCI.

• Virtual reality and wearable computing require novel interaction modalities with the following characteristic:
  – interaction in the way that humans communicate with each other.

• Hand gesture is a natural and intuitive communication mode.

• Other applications: Sign Language Recognition, video transmission, and so on.


Introduction

• Vision-based recognition of dynamic hand gestures is a challenging interdisciplinary project.
  – Hand gestures are rich in diversity, multiple meanings, and space-time variation.
  – The human hand is a complex non-rigid object.
  – Computer vision itself is an ill-posed problem.


Our Approach.

• Inexpensive immersive PC-based gesture tracking / recognition System.


Gesture-based Interaction With 3D Displays.

• Intuitive interaction, easy to learn.


Previous Gesture tracking and recognition methods.

• Temporal modeling and recognition: (Kendon-MIT).

• Spatial modeling and recognition:
  – Appearance-based approach:
    • Predefined static image templates. (Freeman).
    • Deformable 2D templates. (Taylor).
  – 3D hand model:
    • Volumetric models.
    • Physical models.
    • Skeletal models.

• Feature detection and recognition.
  – Huang (silhouette).
  – Darrell (whole image).
  – Essa (spatio-temporal motion).


Calibration methods.

• Tsai method.
• Stringa method.
• Faugeras method.
• Caprile method.


Why develop our own calibration.

• Simple, inexpensive calibration tools.

• One iteration.
• Orthographic cameras.
• Vertical cameras.


Why develop our own calibration.

• Faster stereo reconstruction.
  – Orthographic projection.
  – Low complexity.
  – No rectification phase.


Stereo Reconstruction.

• Matching process.
• Triangulation.


Epipolar lines.

• One dimensional search.
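
The one-dimensional search can be illustrated with a minimal sketch. It assumes, purely for illustration, that the epipolar line coincides with an image row and that candidates are scored with a sum of absolute differences (SAD); neither assumption comes from the slides, and the function name and parameters are hypothetical.

```python
def match_along_scanline(left_row, right_row, x, half_window=1, max_disparity=8):
    """Return the disparity d >= 0 that minimises the SAD cost between a
    window centred on left_row[x] and a window centred on right_row[x - d].

    Assumption (not from the slides): the epipolar line is an image row,
    so the search runs over one dimension only.
    """
    best_d, best_cost = None, float("inf")
    for d in range(max_disparity + 1):
        xr = x - d
        # Skip candidates whose window would fall outside the row.
        if xr - half_window < 0 or x + half_window >= len(left_row):
            continue
        cost = sum(
            abs(left_row[x + k] - right_row[xr + k])
            for k in range(-half_window, half_window + 1)
        )
        if cost < best_cost:
            best_d, best_cost = d, cost
    return best_d


# A bright blob centred at column 6 on the left and column 3 on the right.
left = [10, 10, 10, 10, 10, 80, 200, 80, 10, 10]
right = [10, 10, 80, 200, 80, 10, 10, 10, 10, 10]
print(match_along_scanline(left, right, x=6))  # prints 3
```

Restricting the candidates to a single line is what keeps the matching cost linear in the search range rather than quadratic in the image size.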


Rectification phase.

• Straightening, Blending and Shifting.


Camera Calibration Method.

• Intrinsic parameters.

• Extrinsic parameters.
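
To make the roles of the two parameter sets concrete, here is a small sketch based on the standard pinhole camera model. This is an assumption chosen for illustration: the talk itself favours orthographic cameras, and the function name and argument layout are hypothetical.

```python
def project_point(point_world, rotation, translation, focal_px, principal_point):
    """Project a 3-D world point to pixel coordinates with a pinhole model.

    Extrinsic parameters: rotation (3x3 nested list) and translation (3-tuple).
    Intrinsic parameters: focal length in pixels and principal point (cx, cy).
    """
    # Extrinsic step: world frame -> camera frame, X_c = R * X_w + t.
    xc, yc, zc = (
        sum(rotation[i][j] * point_world[j] for j in range(3)) + translation[i]
        for i in range(3)
    )
    if zc <= 0:
        raise ValueError("point is behind the camera")
    # Intrinsic step: perspective division, then scaling and offset in pixels.
    u = focal_px * xc / zc + principal_point[0]
    v = focal_px * yc / zc + principal_point[1]
    return u, v


identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
u, v = project_point((0.1, -0.05, 2.0), identity, (0.0, 0.0, 0.0),
                     focal_px=800.0, principal_point=(320.0, 240.0))
print(round(u, 6), round(v, 6))  # prints 360.0 220.0
```

Calibration amounts to estimating these quantities from images of a known target; the sketch only shows how they are used once known.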


State of the Art of Hand Gesture Recognition

• Hand gesture taxonomy and interaction model
• Hand gesture modeling
• Hand gesture analysis
• Hand gesture recognition techniques


Taxonomy of Gesture for Human-computer Interaction

Hand / Arm Movements
  Gestures
    Manipulative
    Communicative
      Acts
        Mimetic
        Deictic
      Symbols
        Referential
        Modalizing
  Unintentional Movements

Dynamic, Static

Fig.1: A Taxonomy of hand gestures for Human-computer Interaction. Meaningful gestures are differentiated from unintentional movements. Gestures used for manipulation of objects are separated from the gestures which possess inherent communicational character. Symbols are those gestures having a linguistic role. They symbolize some referential action or are used as modalizers, often of speech.


Hand Gesture Modeling

Classification of hand gesture models:

• 3-D hand/arm model based modeling
  – 3-D textured volumetric model
  – 3-D wireframe volumetric model
  – 3-D skeleton model
  – 3-D geometrical model

• Appearance based modeling
  – 2-D deformable template based model
  – Gray image based model
  – Image properties based model
  – Image motion based model


Hand Gesture Modeling


Fig.3: Representing the same hand posture by different hand models. (a) 3-D textured volumetric model; (b) 3-D wireframe volumetric model; (c) 3-D skeletal model; (d) Binary silhouette; (e) Contour model.


Gesture Analysis

1 Gesture detection and feature extraction
- skin color cue based approaches
- motion cue based approaches
- multiple cue based approaches
- features include gray image, binary silhouette, moving region, edge, contour, and so on.


Gesture Analysis

2 Recovering gesture model parameters
- Estimation of 3-D hand/arm model parameters
  - two sets of parameters: angular (joint angles) and linear (palm dimensions)
  - the initial parameter estimation
  - the parameter update as the hand gesture evolves in time.
- Estimation of appearance based model parameters
  - image motion estimation (e.g. optical flow)
  - shape analysis (e.g. computing moments)
  - histogram based feature parameters (e.g. )
  - active contour model.
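
As an illustration of shape analysis by computing moments, the sketch below derives area, centroid and principal-axis orientation from a binary silhouette. The function name and the mask layout (a list of rows of 0/1 values) are assumptions made for the example, not something the slides specify.

```python
import math


def silhouette_moments(mask):
    """Return (area, (cx, cy), orientation_rad) for a binary silhouette.

    mask is a list of rows holding 0/1 values (hypothetical layout).
    Orientation is the angle of the principal axis, computed from the
    second-order central moments.
    """
    pixels = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    area = len(pixels)  # zeroth-order moment
    if area == 0:
        raise ValueError("empty silhouette")
    # First-order moments give the centroid.
    cx = sum(x for x, _ in pixels) / area
    cy = sum(y for _, y in pixels) / area
    # Second-order central moments describe the spread around the centroid.
    mu20 = sum((x - cx) ** 2 for x, _ in pixels) / area
    mu02 = sum((y - cy) ** 2 for _, y in pixels) / area
    mu11 = sum((x - cx) * (y - cy) for x, y in pixels) / area
    theta = 0.5 * math.atan2(2 * mu11, mu20 - mu02)
    return area, (cx, cy), theta


bar = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
area, centroid, theta = silhouette_moments(bar)
print(area, centroid, theta)  # prints 3 (2.0, 1.0) 0.0
```

Features of this kind are cheap to compute per frame, which is one reason appearance based models are attractive for real-time use.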


Gesture Recognition Techniques

Classification of hand gesture recognition techniques:

• Static gesture recognition
  – Classical clustering methods (e.g. K-means)
  – Non-linear clustering methods (e.g. neural networks)

• Dynamic gesture recognition
  – Hidden Markov Model based methods
  – Dynamic Time Warping based methods
  – Time reduced methods
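
Of the dynamic techniques listed, Dynamic Time Warping is the easiest to show in a few lines. The sketch below is a textbook formulation on one-dimensional feature sequences; the function name, the absolute-difference local cost and the toy sequences are assumptions for illustration only.

```python
def dtw_distance(seq_a, seq_b):
    """Classic dynamic-programming DTW between two 1-D sequences.

    cost[i][j] holds the cheapest cumulative cost of aligning the first
    i samples of seq_a with the first j samples of seq_b.
    """
    inf = float("inf")
    n, m = len(seq_a), len(seq_b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(seq_a[i - 1] - seq_b[j - 1])
            # Allowed moves: advance seq_a only, seq_b only, or both.
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


template = [0, 1, 2, 1, 0]
slow_copy = [0, 0, 1, 1, 2, 2, 1, 1, 0, 0]  # same gesture at half speed
other = [2, 2, 2, 2, 2]
print(dtw_distance(template, slow_copy))  # prints 0.0
print(dtw_distance(template, other))      # prints 6.0
```

The half-speed copy aligns with zero cost, which is the property that makes the method tolerant of variation in gesture speed.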


Stereo-Reconstruction.

• Simple matching.

• Fast reconstruction.

• Thresholding.

• 3D reconstruction.


Problems.

• Order constraint, occlusion, merging.


Hand Modeling.

• Dynamic Constraints for all four fingers.

• Static Constraints for all four fingers.

• Kush, Wu.
• Agee 1982.


Dynamic Constraints.

• For separate fingers.


Initial Pose of the Hand Model.


Precision of the Initial Pose.


Tracking the Hand.

• General Diagram:
  – Initial pose,
  – Real time tracking.


Linear Optimization.

• Frame N-1: Feature vector:

• Frame N: Feature vector:

• Minimization of:


Hand Modeling.

• Dynamic Constraints.

• Static Constraints.


Dynamic Constraints.


SVM gesture recognizer.


Gestural phases: Kendon.

• 1- Preparation phase: brings the hand from its idle state into a recognizable form.

• 2- Nucleus phase: has a definite form and is the peak, or stroke, of the gesture.

• 3- Retraction phase: usually returns the hand to the resting position.
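
The three phases lend themselves to a small finite-state sketch. The state names follow the phases above, but the input events (`hand_moves`, `form_recognized`, `form_released`, `hand_at_rest`) are hypothetical placeholders, and the table is not the state machine presented in the talk.

```python
# Transition table keyed by (current state, input event).
# Event names are hypothetical; only the phase names come from the slides.
TRANSITIONS = {
    ("idle", "hand_moves"): "preparation",
    ("preparation", "form_recognized"): "nucleus",
    ("nucleus", "form_released"): "retraction",
    ("retraction", "hand_at_rest"): "idle",
}


def run_phases(events, state="idle"):
    """Feed events to the machine and return the list of visited states.

    Events with no matching transition leave the state unchanged.
    """
    trace = [state]
    for event in events:
        state = TRANSITIONS.get((state, event), state)
        trace.append(state)
    return trace


print(run_phases(["hand_moves", "form_recognized", "form_released", "hand_at_rest"]))
# prints ['idle', 'preparation', 'nucleus', 'retraction', 'idle']
```

Ignoring unmatched events keeps the machine in its current phase when spurious input arrives, which is one simple way to treat unintentional movements.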


Super-State Machine.


Mini-State Machine S1.


States and Input events.


Mini-State Machine S1.


Actions


Mini-State Machine S1.


Parameters.


Mini-State Machine S1.


Functions.


Mini-State Machine S1.


Compensatory. Pursuit.