Real Time Human Posture Detection with Multiple Depth Sensors

Context - Introduction

Doctorate of the Université de Toulouse

Université Toulouse III - Paul Sabatier

Embedded systems and robotics

Real time human posture detection with multiple depth sensors

Paul CHECCHIN, Reviewer

Alberto IZAGUIRRE, Reviewer

Mohamed AKIL, Examiner

Michel DEVY, Examiner

Frédéric LERASLE, Thesis supervisor

Jean-Louis BOIZARD, Thesis supervisor

RAP group - N2IS group

Wassim FILALI, 7 November 2014

Thesis background

Human posture recognition

Data acquisition

Learning

Real time reconstruction

Hardware integration

Evaluations

Multiple depth sensors

Body parts detection

Kinect2Kinect

Mono sensor RGB-D

Multi sensor RGB-D

Mono sensor RGB

Multi sensor RGB

Introduction - History

Depth sensor technology

Diffractive optical element

Active RGB-D camera

Primesense - Patent

Context - Application

Video games

Video surveillance (Health/Office)

Mono/multi-sensor RGB approaches

Human model [Sundaresan et al. 2005]

• Geometrical shapes adjustment

• Full model adjustment

Appearance methods

• Images projection

• Adjusting the posture

3D Reconstruction Methods

• Voxelisation

• 3D Reconstruction

[Sigal et al. 2004]

Deformable surface [Li et al. 2011]

Mono RGB-D approaches: advantages and disadvantages

Resolution

Random error for depth estimation

• Compensated by processing to avoid overfitting

Orientation

• Relative to the sensor

• Has an impact on learning

Self-occlusions

• No solution

Precision

• Limits the field of view

[Shotton et al. 2011a]

[Khoshelham et al. 2012]

2.5D descriptor

Multi-Kinect approaches

[Zhang et al. 2012]

[Berger et al. 2011]

Particle filtering

Model adjustment

• Few examples of multi RGB-D in the literature

• No learning approaches

Multi RGB-D approaches – advantages and disadvantages

Advantages

Disadvantages

Avoid interferences

Temporal multiplexing

Vibration

Correction

[Maimone et al. 2012]

Our work on the algorithmic level

Our contributions

3D descriptor for body parts labeling

Free parameters

Database

Hardware architecture

New descriptor

Investigations on their influence

Learning

Evaluations

Platform

Example

MOCAP at LAAS

Number of Hawk cameras: 4

Hawk resolution: 640 x 480

Number of Eagle cameras: 6

Eagle camera resolution: 2352 x 1728

Frequency: 200

MOCAP system operation

Temporal synchronisation

1) Chessboard for image calibration

2) Active camera

3) MOCAP

4) MOCAP calibration square

Database - Recorded sequences

                     NSC13            IRSS35
Color views          3                3
Depth views          3                3
MOCAP cameras        10               10
MOCAP markers        13               35
Frequency            5 images / s     20 images / s
Nb sequences         5                8
Total nb postures    1 951            21 569

Sequences M2,

T-pose, arm and leg movements, walking, running, jumping, push-ups, break dance, swimming (arms), crouching, backward fall, forward fall, balance, ping-pong, volleyball, weightlifting, tennis

C1, C2, C3, C4, C5, C6, C7, C8, C9

T-pose, arm, leg and knee movements, crouching, rocking, weightlifting, tennis, volleyball, ping-pong, swimming (arms), pétanque, shot put, volleyball, pétanque, walking, running, sitting and standing, sitting on the floor, jumping, balance, stretching, boxing, bowling, dancing, forward fall, backward fall, driving, moving a chair, sitting down, sweeping while seated, moving furniture, moving and filming, playing with balls, karate, warm-up, rope skipping

Evaluation criteria

Recorded sequences - Illustrations

Intermediate body parts

Central body parts (defined by MOCAP)

Centers of body parts

Application

Our approach

Our approach (BPR) vs. [Shotton et al. 2011]

Real dataset MOCAP

Synthetic dataset for learning

Free parameters study

Our 3D descriptor

(x1,y1,z1)

(x2,y2,z2)

(x3,y3,z3)

(x4,y4,z4)

(x5,y5,z5)

(0,0,0,1,1)

(1,0,1,0,1)

1 posture: 70K voxels
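
The slide pairs five 3D vectors with five-bit tuples such as (0,0,0,1,1). One plausible reading, sketched below as an assumption and not as the thesis implementation, is that each bit records whether the voxel reached by adding one offset vector to the current voxel is occupied. The grid, the offsets and the function name `binary_descriptor` are all hypothetical.

```python
import numpy as np

def binary_descriptor(occupancy, voxel, offsets):
    """Return one bit per offset: 1 if the voxel at `voxel + offset` is occupied.

    Assumption: probes falling outside the grid count as empty (bit 0).
    """
    occupancy = np.asarray(occupancy, dtype=bool)
    bits = []
    for off in offsets:
        p = np.asarray(voxel) + np.asarray(off)
        inside = bool(np.all(p >= 0) and np.all(p < occupancy.shape))
        bits.append(int(inside and occupancy[tuple(p)]))
    return tuple(bits)

# Hypothetical 5x5x5 occupancy grid with three full voxels.
grid = np.zeros((5, 5, 5), dtype=bool)
grid[2, 2, 2] = True
grid[3, 2, 2] = True
grid[2, 4, 2] = True

# Hypothetical offset vectors, expressed in voxel units.
offsets = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, 2, 0), (0, 0, 1)]
example = binary_descriptor(grid, (2, 2, 2), offsets)
print(example)
```

With real data the offsets would be metric vectors converted to voxel units, and the descriptor would be evaluated for every full voxel of the posture.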

Traversing the decision tree

T1(x1,y1,z1)

T2(x2,y2,z2)

T3(x3,y3,z3)

T4(x4,y4,z4)

T5(x5,y5,z5)
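
The traversal can be sketched as follows, assuming each internal node Ti tests the occupancy of the voxel displaced by its vector (xi,yi,zi) and each leaf stores a distribution over body part labels. The `Node` class, the label names and the toy tree are hypothetical illustrations, not the thesis data structures.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Set, Tuple

Voxel = Tuple[int, int, int]

@dataclass
class Node:
    offset: Optional[Voxel] = None          # test vector, internal nodes only
    empty: Optional["Node"] = None          # child taken when the probe is empty
    full: Optional["Node"] = None           # child taken when the probe is full
    label_probs: Optional[Dict[str, float]] = None  # set only on leaves

def traverse(node: Node, occupied: Set[Voxel], voxel: Voxel) -> Dict[str, float]:
    """Walk from the root to a leaf, testing one offset per internal node."""
    while node.label_probs is None:
        probe = tuple(v + o for v, o in zip(voxel, node.offset))
        node = node.full if probe in occupied else node.empty
    return node.label_probs

# Hypothetical two-level tree with made-up labels.
t2 = Node(offset=(1, 0, 0),
          empty=Node(label_probs={"head": 1.0}),
          full=Node(label_probs={"arm": 1.0}))
root = Node(offset=(0, 1, 0),
            empty=t2,
            full=Node(label_probs={"torso": 0.9, "arm": 0.1}))

occupied = {(0, 0, 0), (1, 0, 0)}
result = traverse(root, occupied, (0, 0, 0))
print(result)
```

Storing occupied voxels in a set keeps the sketch short; a dense boolean grid would serve equally well.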

Decision Tree generation

T1(x1,y1,z1)

T2(x2,y2,z2)

T3(x3,y3,z3)

T4(x4,y4,z4)

T5(x5,y5,z5)

Φ: set of candidate vectors

75M, 90K

Sampled descriptors

Decision forest

x log(x)

Entropy

Information gain
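
As a reminder of how these quantities are usually computed when growing a tree, the sketch below evaluates the Shannon entropy of a set of labels (the sum of -x log(x) terms) and the information gain of a binary split. The base-2 logarithm, the function names and the toy labels are assumptions made for illustration.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy in bits: -sum(p * log2(p)) over the label frequencies."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(labels, bits):
    """Entropy reduction obtained by splitting `labels` on a binary test."""
    left = [l for l, b in zip(labels, bits) if b == 0]
    right = [l for l, b in zip(labels, bits) if b == 1]
    n = len(labels)
    return (entropy(labels)
            - (len(left) / n) * entropy(left)
            - (len(right) / n) * entropy(right))

# Toy example with made-up labels: the first test separates the classes
# perfectly, the second one carries no information.
labels = ["head", "head", "arm", "arm"]
good_gain = information_gain(labels, [0, 0, 1, 1])
bad_gain = information_gain(labels, [0, 1, 0, 1])
print(good_gain, bad_gain)
```

During tree generation, the candidate vector with the highest gain would be kept as the node test.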

Trees Forest

Weighting
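
One common way to combine trees into a forest is a weighted average of the label distributions returned by each tree. The sketch below assumes that scheme; the slide does not specify the weighting actually used, and the function name and labels are hypothetical.

```python
def combine_trees(tree_probs, weights=None):
    """Weighted average of per-tree label distributions.

    Assumption: uniform weights when none are given.
    """
    if weights is None:
        weights = [1.0] * len(tree_probs)
    total = sum(weights)
    combined = {}
    for probs, w in zip(tree_probs, weights):
        for label, p in probs.items():
            combined[label] = combined.get(label, 0.0) + w * p / total
    return combined

# Two hypothetical trees voting on the same voxel.
votes = [{"arm": 1.0}, {"arm": 0.5, "head": 0.5}]
forest_probs = combine_trees(votes)
predicted = max(forest_probs, key=forest_probs.get)
print(forest_probs, predicted)
```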

Descriptor size

[Figure: size of the descriptor vector window (UniNorm). x-axis: maximum norm of the vectors (m), with ticks 0.1, 0.2, 0.4, 0.7, 1, 1.5, 2; y-axis: classification score. Plotted values: 0.768, 0.800, 0.786, 0.786, 0.777 and 55.1%, 73.1%, 74.2%, 73.3%, 72.1%, 71.3%.]

Number of trees

[Figure: classification score versus number of trees (N), for N = 1, 2, 3, 4, 5, 7, 9, 12, 16, 20.]

Quantitative Evaluations

[Figure: comparison of BPR vs. OpenNI (sequence: IRSS35-C3). x-axis: threshold used to compute the "mean average precision" (m): <0.01, <0.02, <0.03, <0.04, <0.05, <0.10, <0.15, <0.20, <0.30, <0.50.]
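
To illustrate how a distance threshold enters such an evaluation, the sketch below counts the fraction of estimated body part centers that fall within a given distance of the MOCAP ground truth. This is a simplified stand-in, not the exact mean average precision computation used for the figure; the function name and the sample coordinates are hypothetical.

```python
import numpy as np

def fraction_within(pred, truth, threshold_m):
    """Fraction of predicted 3D positions closer than `threshold_m` to the truth."""
    pred = np.asarray(pred, dtype=float)
    truth = np.asarray(truth, dtype=float)
    distances = np.linalg.norm(pred - truth, axis=1)
    return float(np.mean(distances < threshold_m))

# Hypothetical positions in metres: errors of 0 m, 0.04 m and 0.2 m.
truth = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
pred = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.04], [0.2, 0.0, 0.0]]

for threshold in (0.01, 0.05, 0.30):
    print(threshold, fraction_within(pred, truth, threshold))
```

Sweeping the threshold over the values on the x-axis would produce one curve per method.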

Qualitative Evaluations

Our work on the Hardware level

Analysis of requirements

Architectural exploration

Comparative evaluation

Conclusion

Functional analysis

Solutions catalogue

Functional analysis – SysML modelling

640x480x16bit

Box: 500K voxels

100K full voxels

1000 postures

25M voxels

Tree of 700K nodes

Voxelisation
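
A minimal voxelisation sketch is given below, assuming the depth pixels have already been back-projected to 3D points in metres. It marks as full every voxel of a bounding box that contains at least one point. The box origin, voxel size and grid shape are hypothetical values, not the ones used in the thesis.

```python
import numpy as np

def voxelise(points, box_min, voxel_size, grid_shape):
    """Return a boolean occupancy grid; points outside the box are ignored."""
    points = np.asarray(points, dtype=float)
    idx = np.floor((points - np.asarray(box_min, dtype=float)) / voxel_size).astype(int)
    inside = np.all((idx >= 0) & (idx < np.asarray(grid_shape)), axis=1)
    grid = np.zeros(grid_shape, dtype=bool)
    grid[tuple(idx[inside].T)] = True
    return grid

# Hypothetical points: two fall in the same voxel, one in another,
# and one lies outside the box.
points = [[0.01, 0.01, 0.01],
          [0.015, 0.012, 0.0],
          [0.25, 0.0, 0.0],
          [5.0, 5.0, 5.0]]
grid = voxelise(points, box_min=(0.0, 0.0, 0.0), voxel_size=0.1, grid_shape=(4, 4, 4))
print(int(grid.sum()), "full voxels out of", grid.size)
```

The ratio of full voxels to box voxels on the slide (100K out of 500K) is the kind of figure this count would report.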

Hardware solution catalog

GPGPU

Embedded

Processors

Servers

Microcontrollers

Dedicated

PIC12F / 8bits / 30MHz / 2mW / 1$

i7-5960X / 8 Cores / 3.5GHz / 140W / 1000$

Tesla K40 / 2880 Cores / 235W / 5500$

100 x (16 Cores / 104GB) => $140/h

Virtex-7 / 2M LC / 6.8 BT / 20-40W / $17K-$40K

Architecture evaluation on the background detection function - principle

Image Background
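
The principle can be sketched as a per-pixel comparison between the current depth image and a stored background depth image. The version below assumes 16-bit depth values in millimetres, treats 0 as "no measurement" and uses a hypothetical margin; none of these choices is confirmed by the slides.

```python
import numpy as np

def foreground_mask(depth, background, margin):
    """Mark pixels that are valid and closer than the background by more than `margin`."""
    depth = np.asarray(depth)
    background = np.asarray(background)
    valid = depth > 0  # assumption: 0 means the sensor returned no depth
    # Widen to signed integers so the subtraction cannot wrap around on uint16.
    closer = background.astype(np.int32) - depth.astype(np.int32) > margin
    return valid & closer

# Hypothetical 2x2 images, depth in millimetres.
background = np.array([[2000, 2000], [2000, 2000]], dtype=np.uint16)
depth = np.array([[2000, 1500], [0, 1990]], dtype=np.uint16)
mask = foreground_mask(depth, background, margin=50)
print(mask)
```

Because every pixel is processed independently, the same test maps naturally onto a GPU kernel or a streaming FPGA block.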

CPU - Platform

Xtion Pro Live

Server HP Z800

• Display

• Calibration

Capture "multi thread"

Background detection

3D geometry

• Cameras

• Rays

• Voxelisation,…

Decision forest

Capture platform

Benefits

Algorithms evaluation platform

ASIC PS-1080

Performance - 10 to 30 ms

Learning time: 1h to 10h

Prediction time of one full posture 70 ms

GPU – Background detection

Advantages

Relatively quick handling

Parallelisation / Acceleration x30

Disadvantages

High power consumption

CPU dependency

Memory copy Host/GPU

Performance - 1 to 2 ms

FPGA – Components

Demosaicing

Line FIFO

Start of Packet / End of Packet generation

I2C Control

Frame Writer

Counter

Frame Reader

Counter

Memory write Memory read

Pixel Fetcher

Data Out

Data In = @Fifo

Memory read

Reusable components library

Benefits

Distortion correction

Rotation

Image fusion

Homography

FPGA – Background detection

Hardware block for the background detection

Optimised model

Background image

FPGA – Integration in the SOPC

Resource                       Usage      Usage %
Logic elements                 7 619      11%
Total logic registers          5 218      8%
Total LAB                      630        15%
Total internal memory usage    739 840    64%
Total memory block usage       188        75%
PLLs                           2          50%
Global clocks                  16         100%

Performance - 3 ms

Altera Cyclone IV 115K

Architectures comparison

Runtime

• CPU (Xeon, one thread): 10 ms to 30 ms

• GPU (Quadro FX4800): 1 to 2 ms

• FPGA (Altera Cyclone IV): 3 ms

Details

• CPU: depends on the number of pixels to process

• GPU: 4 ms for 4 channels

• FPGA: time to read the image from the memory; can be concatenated with other functions

Advantages

• CPU: flexibility, development platform

• GPU: average learning curve

• FPGA: highly parallel architecture, reduced processing time, reduced consumption

Disadvantages

• CPU: processing time, processing / power

• GPU: high consumption, CPU dependency, bottlenecks

• FPGA: long learning curve, significant development time, limited precision processing (fixed/floating point)

Allocation

Function

Solution

Resource

Console

Kinect – Sensor

Kinect – PS1080

Console – Processor

Console – GPU

Xtion – Sensor

Xtion – PS1080

Processor

External Sensor

Specific Module

Soft-core

Conclusions

Perspectives

Temporal filtering

Multi Kinect: fusion of reconstructions

Synthetic dataset

Enrich the database

Learning algorithm parallelisation

Servers / Cloud / GPU

Learn a bigger database

Enhance labeling quality

Hardware integration

Integrate all functionalities

Compact low-power prototype

Mono Kinect: pixel labeling

Fall detection

Human activities recognition

Human machine interaction

Thanks
