SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram...

35
SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    218
  • download

    2

Transcript of SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram...

Page 1: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

SA-1

Body Scheme Learning Through Self-Perception

Jürgen Sturm, Christian Plagemann, Wolfram Burgard

Page 2: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Research question

Can we learn a body scheme for a manipulator?

Page 3: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Outline

Introduction The concept of Body Schemes in

Neurophysiology Approach

Problem formulation Structure learning Forward and inverse models

Demo / Experiments / Evaluation Future work

Page 4: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Introduction

Sensor model Motion model

I.e., for manipulators: Kinematic model Dynamic model

Page 5: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Introduction

Typically, those models are derived analytically in advance fixed up to a number of parameters require (manual) calibration

Page 6: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Introduction

Problems with fixed models: Wear-and-tear (wheel diameter, air

pressure) Recovery from failure

(malfunctioning actuators) Tool use (extending the model) Re-configurable robots (unknown

model structure)

Page 7: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Biological inspiration

Same problems in humans/animals: Changing body properties (growth) Injured body parts Simple tool use (writing, operating a

gripper) Complex tool use (riding a bike)

Page 8: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

The concept of Body Scheme in Neurophysiology

Multi-modal mapping Localize and track sensations Spatially coded Modular Coherent Plasticity Interpersonal

Page 9: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Research question

Can we learn a body scheme for a manipulator?

Elements: Proprioception (joint configurations) Spatial representation Visual perception (body part locations

in space)

Page 10: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Related Work

Neurophysiology: Adaptive body schemes [Maravita and Iriki,

2004] Mirror neurons [Holmes and Spence, 2004]

Robotics: Self-calibration [Roy and Thrun, 1999] Cross-model maps [Yoshikawa et al., 2004] Structure learning [Dearden and Demiris,

2005]

Page 11: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Problem formulation

Proprioception of m actuators (actions):

Spatial representation of n body parts:

Visual self-perception of n body parts:

Unknown correspondences between actuators and body parts!

(observation noise)

(homogeneous transformation matrix,6D position in space)

Page 12: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Mathematical formulation

State vector (unobservable)

Observation vector

Observation history (Evidence)

Assumption: actions are noise-free observable

Page 13: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Mathematical formulation

Body scheme as the probabilistic cross-modal map:

Full mapping

Forward model

Inverse model

Page 14: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Earlier work

Learning the body scheme with function approximation: Nearest neighbor Neural nets Gaussian processes

Page 15: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Earlier work

Learning the full mapping is a high-dimensional problem requires lots of training examples

Idea: Factorize the body scheme (e.g. body parts)

Page 16: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Idea: Body Scheme Factorization

Body scheme represents a kinematic chain:

Bayesian network:

(remember that we previously defined )

Page 17: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Local forward models Define local transform between body part i and j

Define local action subset

Learn local forward models

These local forward modelscan be approximated with GPs!

Page 18: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Local forward models

• Example approximation of

Page 19: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Body Scheme Factorization

Consider ALL local forward models:

..

Total number of local models:

Page 20: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Minimum Spanning TreeForward Model Compose the full body scheme by

concatenating the local models of the minimum spanning tree:

Page 21: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Body Scheme Factorization

Find minimal spanning tree: Translate each local model into nodes and

edges

Nodes: body parts Edges:

Large search space! Heuristic search (from simple to complex

local models)

Page 22: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Model selection

Split the data in two parts:

Training set To train local models

Test set To evaluate data likelihood of each local

model Also possible: prediction accuracy

Page 23: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Inverse model

Given a target pose, find the configuration

Compute Jacobians of forward model Gradient Descent towards target pose

Page 24: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Evaluation

Demo video (real robot, 2-DOF) Experiment 1: Prediction Experiment 2: Control

Demo video (simulated robot, 7-DOF) Experiment 3: Partial observability

Page 25: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Demo video

Real robot 2-DOF manipulator 3 body parts

Page 26: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Experiment 1: Prediction

Real robot 2-DOF manipulator 3 body parts

Page 27: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Experiment 1: Prediction

Real robot Simple models

learn faster than complex models

High accuracy Decomposition

into two 1st-order local models

Page 28: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Experiment 2: Posture Control

Real robot Same body scheme Gradient descent Approach target

position

Page 29: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Demo video

Simulated robot 7-DOF manipulator 10 body parts

Page 30: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Experiment 3: Partial observability

Simulated robot 7-DOF manipulator 10 body parts Hidden body part 2nd-order local

model needed

Page 31: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Experiment 3: Partial observability

Simulated robot 7-DOF manipulator 10 body parts Hidden body part 2nd-order local

model needed

Page 32: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Experiment 3: Partial observability

Simulated robot 7-DOF manipulator 10 body parts Hidden body part 2nd-order local

model needed

Page 33: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Summary

Body scheme learning without prior knowledge Structure learning Model learning

Purely generated from self-perception

Fast convergence Accurate prediction Accurate control

Page 34: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Future work

Track natural visual features Identify geometrical structure (joint

types, rotation axes..) Dynamic adaptation of the body

scheme, e.g., during tool-use Imitation and imitation learning

Page 35: SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

Questions?