TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE … · 2019-03-29 · 3 TACKLING 3D TOF...
Transcript of TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE … · 2019-03-29 · 3 TACKLING 3D TOF...
Iuri Frosio, GTC 2019 (San Jose, CA)
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
2
THE IMPORTANCE OF NEGATIVE RESULTS
“I shall require that [the] logical form [of thetheory] shall be such that it can be singledout, by means of empirical tests, in anegative sense: it must be possible for anempirical scientific system to be refuted byexperience” (Karl Popper, The Logic ofScientific Discovery, 1959).
In simple words, “negative results arefundamentals for the advancement ofscience”.
3
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
4
TIME OF FLIGHT (TOF) CAMERAS & ARTIFACTSE.g., Kinect 2
Image from https://stackoverflow.com/questions/22921390/how-to-scale-a-kinect-
depth-image-for-applying-lbp-on-it-in-matlab?rq=1
5
TIME OF FLIGHT (TOF) CAMERAS & ARTIFACTSApplications
Image from https://www.physio-
pedia.com/The_emerging_role_of_Microsoft_Kinect_in_physiotherapy_rehabilitation_for_stroke_patients
Image from amazon.com
Image from https://www.eenewsautomotive.com/news/3d-lidar-generates-environmental-model-time-flight-
measurement
6
TIME OF FLIGHT (TOF) CAMERAS & ARTIFACTSArtifact #1: shot noise
Image from https://ptgrey.com
7
TIME OF FLIGHT (TOF) CAMERAS & ARTIFACTSArtifact #2: movement
Image from https://support.xbox.com
8
TIME OF FLIGHT (TIF) CAMERAS & ARTIFACTSArtifact #3: multiple reflections
Image from
https://www.sciencedirect.com/science/article/pii/S0262885613001650
9
TIME OF FLIGHT (TIF) CAMERAS & ARTIFACTSArtifact #3: multiple reflections
10
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
11
NAÏVE MACHINE LEARNING (ML) FOR TOF RECONSTRUCTION
What do we need?
(1) A large dataset of scenes…
(2) … corrupted by:
(1.1) photon noise,(1.2) motion,(1.3) multiple reflections…
(3) … with clean output data…
(4) … And a DNN.
12
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
13
TOF WORKING PRINCIPLESTime of flight is not time of flight ☺
Image from https://www.semanticscholar.org/paper/Interference-mitigation-technique-for-(ToF)-camera-Islam-Hossain/
14
TOF WORKING PRINCIPLESTime of flight is not time of flight ☺
Images from https://www.semanticscholar.org/paper/Interference-mitigation-technique-for-(ToF)-camera-Islam-Hossain/
15
TOF WORKING PRINCIPLESMultiple measurements
Pulse method
Continuous wave method
16
TOF WORKING PRINCIPLESCamera functions and scene response
17
TOF WORKING PRINCIPLEMore on scene response
= න−𝑇2
𝑇2𝑓 𝑡 ⨂𝑔𝑖 𝑡 𝑟 𝑡 𝑑𝑡= න
−𝑇2
𝑇2𝑓 𝑡 ∗ 𝑟 𝑡 𝑔𝑖 𝑡 𝑑𝑡Raw measurement:
Emitted signal: 𝑓 𝑡
Returned signal: ℎ 𝑡
Demodulation signal: 𝑔𝑖 𝑡
Impulse response: 𝑟(𝑡)
Camera Scene
𝑄𝑖(𝑡) = න−𝑇2
𝑇2ℎ 𝑡 𝑔𝑖 𝑡 𝑑𝑡
Depth: 𝑍 =𝑇𝑐
4𝜋arctan
σ𝑖 sin𝜃𝑖 𝑄𝑖 𝑡
σ𝑖 cos𝜃𝑖 𝑄𝑖 𝑡
t
t
tDemodulation: gi(𝑡)
18
TOF WORKING PRINCIPLESMultiple frequencies
• Different max length (combinethem)
• Different resolutions
• Agreement between differentmeasurements
19
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
20
CAMERA CALIBRATIONCamera response functions (flat scene)
Inside coated with black-out material
21
CAMERA CALIBRATIONCamera response functions (flat scene)
• Three “frequencies”
• Three measurements perfrequency
22
CAMERA CALIBRATIONPhoton noise
Other calibration details (pixel delay, vignetting, … in Qi Guo, Iuri Frosio, Orazio Gallo, Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLAT Dataset, ECCV 2018, Munich (Germany), Sept. 2018.
23
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
24
THE FLAT DATASETFlexible, Large, Augmentable, ToF (FLAT)
To disk (FLAT)https://github.com/NVlabs/FLAT
25
THE FLAT DATASETFlexible, Large, Augmentable, ToF (FLAT)
Transient rendering (scene response function) based on Jarabo, A., Marco, J., Muñoz, A.,Buisan, R., Jarosz, W., Gutierrez, D.: A framework for transient rendering. In: ACMTransactions on Graphics (SIGGRAPH ASIA), (2014).
26
THE FLAT DATASETFlexible, Large, Augmentable, ToF (FLAT)
Transient rendering (scene response function) based on Jarabo, A., Marco, J., Muñoz, A.,Buisan, R., Jarosz, W., Gutierrez, D.: A framework for transient rendering. In: ACMTransactions on Graphics (SIGGRAPH ASIA), (2014).
Brightness, travel time
27
Impulse response
28
Impulse response
29
Impulse response
30
Impulse response
31
Impulse response
32
Impulse response
33
Impulse response
34
Impulse response
35
Impulse response
36
Impulse response
37
Impulse response
38
Impulse response
39
Impulse response
40
Impulse response
41
Impulse response
42
Impulse response
43
Impulse response
44
Impulse response
45
Impulse response
46
Impulse response
47
Impulse response
48
Impulse response
49
Impulse response
50
Impulse response
51
Impulse response
52
Impulse response
53
Impulse response
54
Impulse response
55
Impulse response
56
Impulse response
57
Impulse response
58
Impulse response
59
Impulse response
60
Impulse response
61
Impulse response
62
Impulse response
63
Impulse response
64
Impulse response
65
Impulse response
66
Impulse response
67
Impulse response
68
Impulse response
69
Impulse response
70
Impulse response
71
Impulse response
72
Impulse response
73
Impulse response
74
Impulse response
75
Impulse response
76
Impulse response
77
Impulse response
78
Impulse response
79
Impulse response
80
Impulse response
81
Impulse response
82
Impulse response
83
Impulse response
84
Impulse response
85
Impulse response
86
Impulse response
87
Impulse response
88
Impulse response
89
Impulse response
90
Impulse response
91
Impulse response
92
Impulse response
93
Impulse response
94
Impulse response
95
Impulse response
96
Impulse response
97
Impulse response
98
Impulse response
99
Impulse response
100
Impulse response
101
Impulse response
102
Impulse response
103
Impulse response
104
Impulse response
105
Impulse response
106
Impulse response
107
Impulse response
108
Impulse response
109
Impulse response
110
Impulse response
111
Impulse response
112
Impulse response
113
THE FLAT DATASETFlexible, Large, Augmentable, ToF (FLAT)
Different cameras (beyond Kinect 2) can be simulated, after calibration.
114
THE FLAT DATASETFlexible, Large, Augmentable, ToF (FLAT)
Noise can be added…
115
THE FLAT DATASETFlexible, Large, Augmentable, ToF (FLAT)
… As well as motion (approximate model) and texture…
116
THE FLAT DATASETFlexible, Large, Augmentable, ToF (FLAT)
… and multiple reflections.
117
THE FLAT DATASETSamples
118
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
119
NAÏVE MACHINE LEARNING (ML) FOR TOF RECONSTRUCTION
Supervised learning
Take it easy: supervised learning, from raw data to 3D map.
Training input/output pairs from the FLAT dataset.
120
NAÏVE MACHINE LEARNING (ML) FOR TOF RECONSTRUCTION
Supervised learning
121
THE LESSON WE LEARNED*…* To advance science.
That’s a nice negative result!
122
… AND HOW WE IMPROVED
*1 Yes, it’s a fakepicture…
*2 … But the message iscorrect.
*1
*2
123
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
124
BACK TO PHYSICS
Cause: Sequential measurements
Effect: Misaligned moving object
Solution:Warping
And, more generally speaking, any a-priori knowledge.
time
125
BACK TO PHYSICS
Cause: DNN architecture andlearningmocks physics
Effect: Sub-optimal results
Solution:Include physics in the DNNarchitecture / reconstructionpipeline.
And, more generally speaking, any a-priori knowledge.
Naïve machine learning
Physics
126
DNN ARCHITECTURE#1: Motion Correction Module
Trained to warp imagesto the central one
127
DNN ARCHITECTURE#2: Motion Reflection Module
Trained to reducemultiple reflection afterre-alignment.
128
DNN ARCHITECTURE#3: Differential reconstruction pipeline
Non-trainable, butdifferentiable,physics-based reconstructionpipeline.
129
DNN ARCHITECTURE#3: Differential reconstruction pipeline
Can be refined withend-to-end training.
130
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
131NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
RESULTS
Compare against
LF2 [1] [Kinect, non DL]
DToF [3] [DL, Raw to 3D, no motion]
Phasor [2] [High frequencies reduce MPI]
[1] Xiang, et al. libfreenect2: Release 0.2[2] Marco, et al. DeepToF: Off-the-shelf real -time correction of multipath interference in time-of-flight imaging. In: ACM Transactions on Graphics (SIGGRAPH ASIA).[3] Gupta, et al. Phasor imaging: A generalization of correlation-based time-of-flight imaging. ACM Transactions on Graphics.
Competitors & ablation study
Ablation study
MOM [motion only]
MRM [multiple reflection and noise only]
MOM-MRM [motion, multiple reflection and noise ]
132
RESULTSAblation study: none, MRM, MOM+MRM [simulation]
Median [Med] and Inter Quartile Range [IQR] of the error decreased by MRM / MOM-MRM, in cm.
133
RESULTSAblation study: none, MRM, MOM+MRM [simulation]
134
RESULTSCompare against: DTOF, Phasor imaging [simulation]
Smaller error when compared to DToF or Phasor.
135
RESULTSCompare against: DTOF, Phasor imaging on multi-reflection and shot
noise [simulation]
Field of view
Multiple reflection removed through local reflection / a-priori information, no bias.
136
RESULTSCompare against: LF2, on multi-reflection and shot noise [real data]
Multiple reflection removed through local reflection / coherence / a-priori information, no bias.
137
RESULTSCompare against: LF2, on movement [real data]
Realignment of raw data reduce motion artifacts, specular reflections (red box) generate errors.
138
TACKLING 3D TOF ARTIFACTS THROUGH LEARNING AND THE FLAT DATASET
- Time Of Flight (TOF) cameras & artifacts
- Naïve Machine Learning (ML) for TOF reconstruction
- TOF cameras: working principles
- Camera calibration
- The FLAT dataset
- Spoiler: our non-Naïve ML solution works*
- Back to physics
- DNN architecture
- Results
- Conclusion
Agenda
* See Qi Guo, Iuri Frosio, Orazio Gallo,Todd Zickler, Jan Kautz, Tackling 3D ToFArtifacts Through Learning and the FLATDataset, ECCV 2018, Munich (Germany),Sept. 2018.
139
CONCLUSION
1. Naïve ML does not always work…
140
CONCLUSION
1. Naïve ML does not always work…
2. ….But going back to a priori knowledge may help.
141
CONCLUSION
The physics of ToF cameras: acquisition, reconstruction, artifacts
Photon shot noise, motion artifacts, multiple reflection
A large dataset of simulated data
Design the DNN architecture accordingly to a-priori knowledge
Effective reduction of reconstruction artifacts
142
RESOURCES
Tackling 3D ToF Artifacts Through Learning and the FLAT Dataset, Qi Guo, Iuri Frosio, Orazio Gallo, Todd Zickler, Jan Kautz; The European Conference on Computer Vision (ECCV), 2018, pp. 368-383, http://openaccess.thecvf.com/content_ECCV_2018/html/Qi_Guo_Tackling_3D_ToF_ECCV_2018_paper.html
The FLAT dataset (code and data): https://github.com/NVlabs/FLAT
Contact: {ifrosio, ogallo}@nvidia.com, [email protected]
Online