
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

Chelsea Finn, Pieter Abbeel, Sergey Levine

Presenter: Siavash Khodadadeh

Overview

● One-shot learning
● Meta-learning
● Model-Agnostic Meta-Learning
● Supervised learning
  ○ Experiments
● Reinforcement learning
  ○ Experiments
● Conclusions

● Approaches
  ○ Transfer learning
  ○ Meta-learning
    ■ Learning to learn
● Humans
  ○ Learn very quickly
  ○ From only a few examples

One Shot Learning

[Figure: one example character per class, labeled 1 and 2; given new instances, a query character (?) must be classified]

Meta Learning Approaches

● One-shot Learning with Memory-Augmented Neural Networks
● Optimization as a Model for Few-Shot Learning
● Model-Agnostic Meta-Learning

Memory-Augmented Neural Networks

● Use recurrent networks
● Add an external memory
● Example
  ○ Character recognition (3 labels)

[Figure: a stream of characters presented sequentially with labels 1, 2, 1, 2, 3]

Optimization as a Model

Model Agnostic Meta Learning

● Intuition
  ○ Some internal representations are transferable among tasks
● Transfer learning
  ○ Start from good parameters trained on lots of data
● Meta-learning
  ○ Find parameters that are sensitive to small changes
  ○ So that a small update yields a large improvement on any task's loss

Problem Definition

● Model $f_\theta$ parameterized by $\theta$
  ○ Maps observations $x$ to outputs $a$
  ○ $p(\mathcal{T})$: distribution over tasks
  ○ Each task: $\mathcal{T} = \{\mathcal{L}(x_1, a_1, \ldots, x_H, a_H),\ q(x_1),\ q(x_{t+1} \mid x_t, a_t),\ H\}$
    ■ Supervised learning: $H = 1$
  ○ K-shot learning: $K$ samples drawn from $q_i$

Model Agnostic Meta Learning

● Method
  ○ For task $\mathcal{T}_i$, the model's parameters become $\theta_i' = \theta - \alpha \nabla_\theta \mathcal{L}_{\mathcal{T}_i}(f_\theta)$
  ○ This extends to multiple inner gradient updates as well (see the sketch below)
● Objective
  ○ $\min_\theta \sum_{\mathcal{T}_i \sim p(\mathcal{T})} \mathcal{L}_{\mathcal{T}_i}(f_{\theta_i'})$, optimized by the meta-update $\theta \leftarrow \theta - \beta \nabla_\theta \sum_{\mathcal{T}_i \sim p(\mathcal{T})} \mathcal{L}_{\mathcal{T}_i}(f_{\theta_i'})$
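To make the nested updates concrete, here is a minimal PyTorch sketch of one meta-step (not from the slides: the toy one-parameter linear model, step sizes, and task batch are illustrative assumptions). Passing `create_graph=True` is what lets the meta-gradient differentiate through the inner update:

```python
import torch

# Toy 1-D linear model f_theta(x) = theta * x, so the full meta-gradient
# (including second-order terms) stays easy to follow.
alpha, beta = 0.01, 0.001          # inner and outer (meta) step sizes
theta = torch.tensor(0.5, requires_grad=True)

def task_loss(theta, x, y):
    return ((theta * x - y) ** 2).mean()

def meta_step(theta, task_batch):
    meta_grad = torch.zeros_like(theta)
    for (x_train, y_train, x_test, y_test) in task_batch:
        # Inner update: theta_i' = theta - alpha * grad L_train(theta).
        # create_graph=True keeps the inner graph differentiable so the
        # outer gradient can flow through it (second-order terms).
        g = torch.autograd.grad(task_loss(theta, x_train, y_train),
                                theta, create_graph=True)[0]
        theta_i = theta - alpha * g
        # Outer gradient: d L_test(theta_i') / d theta
        meta_grad += torch.autograd.grad(task_loss(theta_i, x_test, y_test),
                                         theta)[0]
    # Meta-update: theta <- theta - beta * sum over tasks of the outer gradient
    return (theta - beta * meta_grad).detach().requires_grad_()

# Hypothetical task batch: each task is a different linear target y = c * x.
tasks = []
for c in (1.0, -2.0, 3.0):
    x = torch.randn(10, 1)
    tasks.append((x[:5], c * x[:5], x[5:], c * x[5:]))
theta = meta_step(theta, tasks)
```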

Intuition

[Figure: the meta-learned parameters θ sit where a single gradient step reaches each task's optimal parameters θ1*, θ2*, θ3*]

Model Agnostic Meta Learning for Supervised Learning

● Regression (mean squared error):
  $\mathcal{L}_{\mathcal{T}_i}(f_\theta) = \sum_{x^{(j)}, y^{(j)} \sim \mathcal{T}_i} \big\| f_\theta(x^{(j)}) - y^{(j)} \big\|_2^2$
● Classification (cross-entropy):
  $\mathcal{L}_{\mathcal{T}_i}(f_\theta) = -\sum_{x^{(j)}, y^{(j)} \sim \mathcal{T}_i} \big[\, y^{(j)} \log f_\theta(x^{(j)}) + (1 - y^{(j)}) \log\big(1 - f_\theta(x^{(j)})\big) \,\big]$
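In PyTorch these two losses correspond to the built-in criteria (a small sketch; the tensor shapes are illustrative):

```python
import torch
import torch.nn.functional as F

# Regression: mean squared error between predictions and targets
pred, target = torch.randn(5, 1), torch.randn(5, 1)
regression_loss = F.mse_loss(pred, target)

# Classification: cross-entropy between class logits and integer labels
logits, labels = torch.randn(5, 3), torch.randint(0, 3, (5,))
classification_loss = F.cross_entropy(logits, labels)
```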

Experiments

● Can MAML enable fast learning?
● Can MAML be used in different domains?
  ○ Supervised regression
  ○ Classification
  ○ Reinforcement learning
● Does it keep improving with more data?

● Sine wave experiments
  ○ Meta-training (700,000)
    ■ Amplitude sampled from [0.1, 5.0]
    ■ Phase sampled from [0, π]
    ■ K input points sampled from [−5.0, 5.0]
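A minimal NumPy sketch of this task distribution (the phase convention y = A·sin(x + φ) is an assumption; the slide only gives the sampling ranges):

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_sine_task(rng):
    """Sample one regression task from p(T): a sine wave with random
    amplitude and phase, per the ranges on this slide."""
    amplitude = rng.uniform(0.1, 5.0)
    phase = rng.uniform(0.0, np.pi)
    return lambda x: amplitude * np.sin(x + phase)

def sample_k_shot(task, k, rng):
    """Draw K input points uniformly from [-5, 5] and label them."""
    x = rng.uniform(-5.0, 5.0, size=(k, 1))
    return x, task(x)

f = sample_sine_task(rng)
x_train, y_train = sample_k_shot(f, k=5, rng=rng)
```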

Regression Experiments

○ Network architecture
  ■ 2 fully connected hidden layers (40 neurons each) with ReLU
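As a PyTorch sketch of that regressor (layer sizes from the slide; the rest is standard boilerplate):

```python
import torch.nn as nn

# 1-D input -> two hidden layers of 40 ReLU units -> 1-D output
regressor = nn.Sequential(
    nn.Linear(1, 40), nn.ReLU(),
    nn.Linear(40, 40), nn.ReLU(),
    nn.Linear(40, 1),
)
```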

Regression Experiments

○ Meta-testing
  ■ K samples drawn from a held-out sine wave
○ Evaluation
  ■ Mean squared error over 600 points

Regression Experiments

○ Baselines
  ■ Pretrained model
    ● Trained on samples from all tasks jointly
    ● Fine-tuned on the given sine wave at test time
    ● Evaluated on 600 datapoints
  ■ Oracle (receives the true task parameters, i.e. amplitude and phase, as input)

Regression Experiments

[Figure: few-shot regression fits for K = 5 and K = 10, comparing MAML, the pretrained baseline, and the oracle]


Classification Examples

● N-way classification
  ○ Classify among N classes at test time, given K examples per class (K-shot)
● Network architecture
  ○ 4 convolutional modules, each with (a PyTorch sketch follows below)
    ■ 3 × 3 convolutions with 64 filters
    ■ ReLU nonlinearity
    ■ 2 × 2 max-pooling
  ○ Non-convolutional variant: fully connected layers of 256, 128, 64, and 64 units with ReLU
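A hedged PyTorch sketch of the 4-module convolutional classifier; the batch-normalization layers and 28×28 grayscale input size come from the original paper rather than the slide, and `n_way` is an illustrative variable:

```python
import torch.nn as nn

def conv_module(in_channels, out_channels=64):
    # One module: 3x3 conv with 64 filters, ReLU, 2x2 max-pool
    # (the paper also places batch normalization in each module)
    return nn.Sequential(
        nn.Conv2d(in_channels, out_channels, kernel_size=3, padding=1),
        nn.BatchNorm2d(out_channels),
        nn.ReLU(),
        nn.MaxPool2d(2),
    )

n_way = 5  # number of classes per task (illustrative)
classifier = nn.Sequential(
    conv_module(1),    # 1 input channel for grayscale Omniglot
    conv_module(64),
    conv_module(64),
    conv_module(64),   # 28x28 input -> 1x1x64 after four 2x2 pools
    nn.Flatten(),
    nn.Linear(64, n_way),
)
```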

Classification

● Few-shot learning benchmarks
  ○ Omniglot
    ■ 1623 characters from 50 alphabets
      ● 20 instances of each character, each drawn by a different person
    ■ Training: 1200 characters
    ■ Testing: 423 characters

Classification

● Few-shot learning benchmarks
  ○ MiniImagenet
    ■ 80 training classes
    ■ 20 test classes

First Order Approximation

Update step (requires second derivatives, since $\theta_i'$ depends on $\theta$ through a gradient step):
$\theta \leftarrow \theta - \beta \nabla_\theta \sum_{\mathcal{T}_i \sim p(\mathcal{T})} \mathcal{L}_{\mathcal{T}_i}(f_{\theta_i'})$

First-order approximation: drop the second-derivative terms, i.e. take the gradient of the test loss with respect to the adapted parameters $\theta_i'$ and apply it directly to $\theta$ (sketched below).
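Relative to the full MAML sketch earlier, the first-order variant changes only how the inner step is differentiated (same toy model, still an illustrative sketch):

```python
import torch

alpha, beta = 0.01, 0.001
theta = torch.tensor(0.5, requires_grad=True)

def task_loss(theta, x, y):
    return ((theta * x - y) ** 2).mean()

x_tr, y_tr = torch.randn(5, 1), torch.randn(5, 1)
x_te, y_te = torch.randn(5, 1), torch.randn(5, 1)

# Inner update WITHOUT create_graph: the graph of the inner step is
# discarded, so the second-derivative terms are dropped.
g = torch.autograd.grad(task_loss(theta, x_tr, y_tr), theta)[0]
theta_i = (theta - alpha * g).detach().requires_grad_()

# First-order meta-gradient: grad of the test loss w.r.t. theta_i',
# applied directly to theta.
meta_grad = torch.autograd.grad(task_loss(theta_i, x_te, y_te), theta_i)[0]
theta = (theta - beta * meta_grad).detach().requires_grad_()
```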

Classification Omniglot

Classification MiniImagenet

Reinforcement Learning

Loss function (negative expected reward):
$\mathcal{L}_{\mathcal{T}_i}(f_\theta) = -\,\mathbb{E}_{x_t, a_t \sim f_\theta,\, q_{\mathcal{T}_i}} \Big[ \sum_{t=1}^{H} R_i(x_t, a_t) \Big]$

Reinforcement Learning

● 2D navigation
  ○ A point agent must move to different goal positions
  ○ Goal randomly chosen from within a unit square
  ○ Success: the agent comes within 0.01 of the goal
  ○ Reward: negative distance to the goal
  ○ H = 100 episode horizon limit
  ○ Meta-training: 100 iterations with batches of 20 tasks
  ○ Meta-test batch size: 40
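A minimal Python sketch of one such task (the class name `PointNav2D`, the action clipping, and the per-step motion bound are illustrative assumptions; the goal sampling, reward, success radius, and horizon come from the slide):

```python
import numpy as np

class PointNav2D:
    """Toy 2D navigation task: a point agent moves toward a fixed goal."""
    def __init__(self, rng, horizon=100):
        self.goal = rng.uniform(0.0, 1.0, size=2)  # goal from the unit square
        self.horizon = horizon                     # H = 100 episode limit

    def reset(self):
        self.pos = np.zeros(2)
        self.t = 0
        return self.pos.copy()

    def step(self, action):
        self.pos += np.clip(action, -0.1, 0.1)     # bounded motion (assumption)
        self.t += 1
        dist = np.linalg.norm(self.pos - self.goal)
        reward = -dist                             # negative distance to goal
        done = dist < 0.01 or self.t >= self.horizon
        return self.pos.copy(), reward, done
```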

Reinforcement Learning

Vanilla Policy Gradient

● Randomly initialize the policy network parameters θ
● Perform K rollouts with the current policy
● Update the weights using the collected rewards
● Repeat
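A generic REINFORCE-style sketch of this loop in PyTorch (an illustration of vanilla policy gradient, not the authors' training code; the Gaussian policy, fixed action noise, and learning rate are assumptions; it can be run against the PointNav2D sketch above):

```python
import torch
import torch.nn as nn

# Gaussian policy over 2-D actions: state -> mean action
policy = nn.Sequential(nn.Linear(2, 64), nn.Tanh(), nn.Linear(64, 2))
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)

def rollout(env, policy):
    """Collect one episode; return log-probs of sampled actions and rewards."""
    log_probs, rewards = [], []
    state, done = env.reset(), False
    while not done:
        mean = policy(torch.as_tensor(state, dtype=torch.float32))
        dist = torch.distributions.Normal(mean, 0.1)  # fixed noise (assumption)
        action = dist.sample()
        log_probs.append(dist.log_prob(action).sum())
        state, reward, done = env.step(action.numpy())
        rewards.append(reward)
    return torch.stack(log_probs), rewards

def vpg_update(env, policy, k_rollouts=20):
    """One policy-gradient step: maximize total reward over K rollouts."""
    loss = 0.0
    for _ in range(k_rollouts):
        log_probs, rewards = rollout(env, policy)
        ret = sum(rewards)                 # undiscounted return (assumption)
        loss = loss - log_probs.sum() * ret
    optimizer.zero_grad()
    (loss / k_rollouts).backward()
    optimizer.step()
```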


Reinforcement Learning

● Locomotion
  ○ Two different simulated robots in MuJoCo
    ■ Planar cheetah
    ■ 3D quadruped (ant)
  ○ Tasks: run in a particular direction or at a particular speed

Reinforcement Learning Results


Conclusions

Model Agnostic Meta Learning

● Applicable to diverse models
  ○ Any model that has parameters and a smooth-enough loss function
● Adaptation can be done with any amount of data
● Future research
  ○ Multi-task initialization
    ■ "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments" (ICLR 2018)

Questions

Thank you!