3D Conv Nets for Robotic Manipulation - Liangliang...

3D Conv Nets for Robotic ManipulationChad DeChant, Joaquín Ruales, Jake Varley

Outline- 3D Spatial CNNs

- Extended 2D to 3D- Grasping

- Heatmap Approach- Shape Completion Approach- Comparison

2D Conv Filters

* *

3D Conv Filters

* * *

x filter y filter z filter

3D Filters in Theano- We Implemented the following for the Keras

Theano library- 3DConvolution

- Theano conv3d2d (BZCXY format)- 3DMax pooling

- Created max_pool_3d using max_pool_2d

2D Conv Sum of 1D Convs

3D Spatial CNN Test Data- Random size geometric shapes

sphere axis-aligned cube rotated cube

Conv layers (8 5x5x5, relu)

Conv layers (16 3x3x3, relu)

Cost Function

INPUT (28x28x28)

Target output

Max Pooling

Sphere

Max Pooling

Sphere

Fully connected logistic layer

Trained Layer 0 Filters

1 2 3 4 5 6 7 88 5x5x5 filters

Grasping Heatmaps

Grasping Heatmaps

Palm Finger 1 Finger 2 Finger 3

Grasping HeatmapsPatch: 32x32x32 Label: 1x1x1x nheatmaps

Trained Layer 0 Filters 1 16x16x16

filter

Another Approach...

Shape CompletionComplete the shape, then plan grasps on it

Shape Completion● Is shape completion feasible?● Use MNIST

Convolutional layers (30 3x3x3)


Hidden layer 150 or 1000 units

Reconstruction layer

Cost function

INPUTTarget output

DigitCompletion

Digit Completion

Digit Completion

50 units 150 units

x

x

x

x1000 units

Per pixel test error rate by hidden layer size

% error

# of epochs

3D digit completionMNIST in three dimensions



Hidden layer 500 or 1000 units

Reconstruction layer

Cost function

INPUT

Hidden layer 500 or 1000 unitsTarget output

3D Digit Completion

Model Output Target Output

3D Digit Completion

500 units 1000 units

Per pixel test error rate by size of 2 hidden layers

# of epochs

% error

Synthetic shape completion

Training Data- Household objects, Modelnet, BigBird

Shape Completion Data

Voxelization

3D Shape completion

Model Output Target OutputModel Input

Grasp ComparisonsHeatmap vs Shape Completion Grasps

1) generate grasps 2) compare ground truth grasp qualities

Conclusion- 3D Spatial CNNs

- Extended 2d to 3d- Keras Library

- Grasping- Heatmap Approach- Shape Completion Approach- Comparison

Thank you

3D Conv Nets for Robotic Manipulation - Liangliang...

Documents

Transcript of 3D Conv Nets for Robotic Manipulation - Liangliang...

A Brief Introduction to Reinforcement Learningais.informatik.uni-freiburg.de/.../deep_learning... · Outline • Characteristics of Reinforcement Learning (RL) • The RL Problem

x4 - Bright Bricks | Brick Model Building Company · 1x1x1x 2 4x4x 3 13. 14 4 1x1x 1 1x1x 2 x8. 15 15. Created Date: 20180420121620Z ...

how to innovate - Liangliang Caollcao.net/cu-deeplearning17/lecture/lecture5_llc.pdfStrategy #3: X Do exactly the opposite • From LSTM to word2vec – LSTM: long context and deep

Basic Math for Neural Networks - Liangliang Caollcao.net/cu-deeplearning17/lecture/lecture2_xiaodong.pdf · Deep Learning for Computer Vision, Speech, and Language Outline Feedforward

FastText - Liangliang Caollcao.net/cu-deeplearning17/pp/class7_FastText.pdfFastText FastText is on par with state-of-the-art deep learning classifiers in terms of accuracy But it is

Lego Clock R4 Full Instructions - University of Cambridgehemh1/clock/lego_clock_R4_3_6nov171.pdf · 1x 1x1x1x 1 1x 4 1x 3x 2 1x 1x 3. 2x 1x 3x 1x 2x 8 1x 2x 4x1x 9 22. 1x 1x1x 10

M A CHINE LE A RNING - ITEPbook.itep.ru/depository/deep_learning/ifw_dd_2016_machine_learning... · M A CHINE LE A RNING IN THE ... face recognition are problems for which ... data

2x 1x2x 2x 2x 1x 1x 1x - nerd.dschlumpp.comnerd.dschlumpp.com/research_lab_transporter... · 1x 1x1x1x 13 4. 2x 1x 1x 14 2x 2x 2x 2x 15 5. 2x 1x 16 1x 1x 17 6. 1x 2x 3x 18 1x 2x4x

Deep Learning in Steganography and Steganalysis since 2015chaumont/publications/Deep_Learning... · Figure:Example of embedding with S-UNIWARD algorithm (2013) at 0.4 bpp Marc CHAUMONT

Deep Speech 2 - Liangliang Caollcao.net/cu-deeplearning17/pp/class6_Deep_Speech2.pdf · Deep Speech 2 - Components Model Architectures Combination of convolutional, recurrent and

DEEP LEARNING - meiwa-e.co.jp · Title: DEEP_LEARNING Created Date: 7/11/2018 10:40:21 AM

Neural Turing Machines - Liangliang Caollcao.net/cu-deeplearning15/presentation/NeuralTuringMachines.pdf · IntroducBon’ • Firstapplicaon’of’Machine’Learning’to’logical’

Deep Learning in Object Detection and Recognitionmmlab.ie.cuhk.edu.hk/resources/deep_learning/overview.pdf · Deep Learning in Object Detection, Segmentation, and Recognition ...

Lecture 3: Theano Programming - Liangliang Caollcao.net/cu-deeplearning15/pdf/Lecture 3.pdf · Review of Last Lecture • Deep network • How to learn deep network: • Backprop

2x - BARONSAT...1x1x1x 10 11 1x1x 11 Imaginated in France by Eric Druon 02011 BARONSAT AVERTISSEMENT : Cette notice est seulement destinée à un usage personnel. VOUS n'êtes pas

KapilThadani kapil@cs.columbia - Liangliang Caollcao.net/cu-deeplearning17/lecture/lecture7_kapil.pdf · 2020-03-18 · Lexicalsemantics 5 dog mammal pet hypernymy poodle puppy hyponymy

HORIZON SCAN 2 - Agrifutures Australia...telematics autonomous_vehicles customer_journey_analytics deep_learning digital_business digital_finance digital_mesh future_of_food infomatics

NEURAL NETS FOR VISION - NYU Computer Sciencefergus/tutorials/deep_learning... · - Neural Networks for Supervised Training - Architecture - Loss function - Neural Networks for Vision:

Deep Learning-Automatic Malignancy Detectionivylab.kaist.ac.kr/demo/Deep_Learning-Automatic... · · 2017-06-08Deep Learning-Automatic Malignancy Detection Yong Man Ro ... Automatic

2-7 Triple Draw Poker: With Learning - Liangliang Caollcao.net/cu-deeplearning15/projects/2-7-draw... · 2020. 12. 13. · Overview •Problem: learn strategy to play 2-7 triple draw