Image Domain Transfer - MIT Deep Learning...

Jan Kautz, VP of Learning and Perception Research

Image Domain Transfer

Image Domain Transfer: enabling machines to have human-like imagination abilities

Input image Domain transferred image

This image is generated by our method.

Example use cases

Low-res to high-res Blurry to sharp

Noisy to clean

LDR to HDR Thermal to color

Image to painting

Day to night Summer to winter

Synthetic to real

Two Approaches

• Example-based
  § Non-parametric model
  § The transfer function F is defined by an example image.

• Learning-based
  § Parametric model
  § The transfer function is learned via fitting a training dataset.

Input image Domain transferred image

$F(\cdot \mid \text{Training Dataset})$

$F(\cdot \mid x_{\text{example}})$

Example-based Image Domain Transfer

$F(\cdot \mid x_{\text{example}})$

Example-based image domain transfer

Stylized content photo, Content photo, Style photo

$x_{\text{output}} = F(x_{\text{input}} \mid x_{\text{example}})$
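As a minimal sketch of a function with this form, the code below gives the input image the per-channel mean and standard deviation of the example image. This is a simplified stand-in for illustration, not one of the methods named in this talk; the function name `match_channel_stats` and the random test images are assumptions.

```python
import numpy as np

def match_channel_stats(x_input, x_example, eps=1e-8):
    """Toy F(x_input | x_example): copy per-channel mean/std from the example.

    Both images are float arrays of shape (H, W, C).
    """
    mu_in = x_input.mean(axis=(0, 1), keepdims=True)
    sd_in = x_input.std(axis=(0, 1), keepdims=True)
    mu_ex = x_example.mean(axis=(0, 1), keepdims=True)
    sd_ex = x_example.std(axis=(0, 1), keepdims=True)
    # Normalize the input statistics, then rescale to the example statistics.
    return (x_input - mu_in) / (sd_in + eps) * sd_ex + mu_ex

rng = np.random.default_rng(0)
x_input = rng.random((8, 8, 3))
x_example = 0.2 + 0.1 * rng.random((6, 6, 3))  # darker, lower-contrast example
x_output = match_channel_stats(x_input, x_example)
```

No parameters are fitted to a dataset here: the example image alone defines the mapping, which is what makes the approach non-parametric.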

Often referred to as Style Transfer

Example-based image domain transfer

• Artistic Style Transfer
  • Content: real photo
  • Style: painting
  • Gatys et al., Johnson et al., Li et al., Huang et al.

• Photo Style Transfer
  • Content: real photo
  • Style: real photo
  • Luan et al., Pitie et al., Reinhard et al.

Style (painting) Content Output

Style Content Output

FastPhotoStyle

• “A Closed-form Solution to Photorealistic Image Stylization” by Yijun Li, Ming-Yu Liu, Xueting Li, Ming-Hsuan Yang, Jan Kautz, ECCV 2018
• Code: https://github.com/NVIDIA/FastPhotoStyle

Fast Photo Style

Content

Stylization step Photo smoothing step

We model photo style transfer as a closed-form function mapping made up of a stylization step followed by a photo smoothing step.

Content image

Style image

Stylization Step

VGG19 up to conv4_1

Assumption: Covariance matrix of deep features encodes the style information.

$H_C H_C^{\top} = E_C \Lambda_C E_C^{\top}$

$H_S H_S^{\top} = E_S \Lambda_S E_S^{\top}$

Stylization Step

Decoder

whitening coloring

Architecture is similar to Li et al., but uses unpooling.
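The whitening and coloring operations named above can be sketched on plain feature matrices. In this sketch the random matrices stand in for VGG19 features, the function name `whiten_color` and the `eps` clamp are assumptions, and the covariance is left unnormalized as $H H^{\top}$; it is an illustration under those assumptions, not the released implementation.

```python
import numpy as np

def whiten_color(h_content, h_style, eps=1e-5):
    """Whitening and coloring transform on feature matrices of shape (C, N)."""
    mu_c = h_content.mean(axis=1, keepdims=True)
    mu_s = h_style.mean(axis=1, keepdims=True)
    hc = h_content - mu_c
    hs = h_style - mu_s

    # Eigen-decompositions H H^T = E Lambda E^T (symmetric, so eigh applies).
    lam_c, e_c = np.linalg.eigh(hc @ hc.T)
    lam_s, e_s = np.linalg.eigh(hs @ hs.T)
    lam_c = np.maximum(lam_c, eps)  # guard against tiny or negative eigenvalues
    lam_s = np.maximum(lam_s, eps)

    # Whitening: remove the content covariance.
    whitened = e_c @ np.diag(lam_c ** -0.5) @ e_c.T @ hc
    # Coloring: impose the style covariance, then restore the style mean.
    colored = e_s @ np.diag(lam_s ** 0.5) @ e_s.T @ whitened
    return colored + mu_s

rng = np.random.default_rng(0)
h_c = rng.normal(size=(4, 50))
h_s = rng.normal(size=(4, 50)) * np.array([[1.0], [2.0], [0.5], [3.0]]) + 1.0
h_cs = whiten_color(h_c, h_s)
```

After the transform, the centered output features have the same $H H^{\top}$ as the style features, which is the sense in which the style statistics are transferred.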

When semantic label maps are available

Content Style

Content Style Output

Photo Smoothing

Assumption: If we can compute a new image whose pixel values resemble those in the intermediate image, but whose similarities between neighboring pixels resemble those in the content image, then we have a photorealistic stylization output.

Content

Stylization step Photo smoothing step

Intermediate image pixel values

Similarity between neighboring pixels(Gaussian/matting affinity)

Closed-form solution:

$R^{*} = (1 - \alpha)\left(I - \alpha D^{-\frac{1}{2}} W D^{-\frac{1}{2}}\right)^{-1} Y$
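A small numerical sketch of this closed-form solution follows. It assumes a grayscale image, a Gaussian affinity over 4-connected neighbors computed from the content image, and illustrative values for `alpha` and `sigma`; the function names are hypothetical, and the dense matrices are only practical for tiny images.

```python
import numpy as np

def gaussian_affinity(content, sigma=0.1):
    """Dense affinity W between 4-connected neighbors of a grayscale image."""
    h, w = content.shape
    W = np.zeros((h * w, h * w))
    for y in range(h):
        for x in range(w):
            i = y * w + x
            for dy, dx in ((0, 1), (1, 0)):  # right and down neighbors
                yy, xx = y + dy, x + dx
                if yy < h and xx < w:
                    j = yy * w + xx
                    diff = content[y, x] - content[yy, xx]
                    W[i, j] = W[j, i] = np.exp(-diff ** 2 / (2 * sigma ** 2))
    return W

def photo_smooth(intermediate, content, alpha=0.8, sigma=0.1):
    """Solve R = (1 - alpha) (I - alpha D^{-1/2} W D^{-1/2})^{-1} Y."""
    W = gaussian_affinity(content, sigma)
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))  # needs at least 2 pixels
    S = d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    Y = intermediate.reshape(-1, 1)
    R = (1 - alpha) * np.linalg.solve(np.eye(Y.shape[0]) - alpha * S, Y)
    return R.reshape(intermediate.shape)

# Content image: two flat regions separated by a vertical edge.
content = np.zeros((4, 4))
content[:, 2:] = 1.0
rng = np.random.default_rng(0)
# Intermediate image: same layout, different values, with some noise.
intermediate = np.where(content > 0.5, 0.7, 0.3) + rng.uniform(-0.05, 0.05, size=(4, 4))
smoothed = photo_smooth(intermediate, content)
```

Because the affinity across the content edge is close to zero, the two regions are smoothed almost independently and the edge from the content image survives in the result.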

Content Style Output

Comparison

Quantitative results

Learning-based Image Domain Transfer

$F(\cdot \mid \text{Training Dataset})$

Generative Adversarial Networks (GANs)

Generator Discriminator

Discriminator

$p(G(z)) \to p(X)$

Supervised Unsupervised

Supervised vs Unsupervised

Unimodal vs Multimodal

F ( ) =

Unimodal: $p(Y \mid X) = \delta(F(X))$

Multimodal: $p(Y \mid X) = F(X, S)$
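The difference between the two cases can be shown with a toy example. The mappings below are arbitrary placeholders rather than trained networks, and the names `unimodal_translate` and `multimodal_translate` are assumptions: a unimodal model returns one fixed output per input, while a multimodal model takes an extra random code S and returns a different output for each sample of S.

```python
import random

def unimodal_translate(x):
    """Unimodal: one deterministic output F(X) per input."""
    return [2.0 * v for v in x]

def multimodal_translate(x, s):
    """Multimodal: the output F(X, S) also depends on a random code S."""
    return [2.0 * v + s for v in x]

x = [0.1, 0.5, 0.9]
rng = random.Random(0)

# The same input always maps to the same unimodal output.
y_a = unimodal_translate(x)
y_b = unimodal_translate(x)

# Sampling S several times gives several plausible outputs for one input.
samples = [multimodal_translate(x, rng.gauss(0.0, 1.0)) for _ in range(3)]
```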

Categorization

Unimodal, supervised: pix2pix, CRN, SRGAN

Unimodal, unsupervised: UNIT, Coupled GAN, DTN, DiscoGAN, CycleGAN, DualGAN, StarGAN

Multimodal, supervised: pix2pixHD, vid2vid, BiCycleGAN


pix2pixHD: Supervised and multimodal image domain transfer

• “High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs” by Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro, CVPR 2018
• Code: https://github.com/NVIDIA/pix2pixHD

pix2pixHD

pix2pix

Generator Discriminator

Discriminator

$p(G(S), S) \to p(X, S)$

pix2pixHD

Generator Discriminator False

Feature Encoder

Low-dimensional feature map

Instance Pooling

Once training is finished, we extract the instance-pooled feature maps from all the training images and fit a mixture-of-Gaussians model to the features belonging to the same semantic class.

During inference, we sample from these mixture-of-Gaussians distributions to achieve multimodal image translation.
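The instance pooling that produces these features can be sketched as an instance-wise average: every pixel's feature vector is replaced by the mean over its object instance. The function name `instance_average_pool` and the tiny arrays are assumptions for illustration; fitting the mixture model and sampling from it are not shown.

```python
import numpy as np

def instance_average_pool(features, instance_map):
    """Replace each pixel's feature by the mean feature of its instance.

    features: (H, W, C) float array; instance_map: (H, W) integer IDs.
    """
    pooled = np.empty_like(features)
    for inst_id in np.unique(instance_map):
        mask = instance_map == inst_id
        # features[mask] has shape (num_pixels, C); average over the pixels.
        pooled[mask] = features[mask].mean(axis=0)
    return pooled

features = np.array([[[1.0], [3.0]],
                     [[5.0], [7.0]]])   # (2, 2, 1) feature map
instance_map = np.array([[1, 1],
                         [2, 2]])       # top row is one object, bottom row another
pooled = instance_average_pool(features, instance_map)
# pooled[..., 0] is [[2., 2.], [6., 6.]]
```

Each instance ends up with a single low-dimensional feature vector, which is what makes it practical to model the features per semantic class and resample them at inference time.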

Multi-scale Discriminators

Robust Objective (GAN + discriminator feature matching loss)
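A discriminator feature matching loss can be sketched as the mean absolute difference between the discriminator's intermediate features for a real image and for a synthesized one, summed over layers. The function name and the toy feature arrays below are assumptions, and how this term is weighted against the GAN term is not specified here.

```python
import numpy as np

def feature_matching_loss(real_feats, fake_feats):
    """Sum over layers of the mean absolute feature difference.

    real_feats / fake_feats: lists of arrays, one per discriminator layer,
    computed from a real image and from a synthesized image respectively.
    """
    return sum(float(np.abs(r - f).mean()) for r, f in zip(real_feats, fake_feats))

# Two hypothetical discriminator layers with different shapes.
real_feats = [np.zeros((2, 2)), np.zeros(3)]
fake_feats = [np.ones((2, 2)), 2.0 * np.ones(3)]
loss = feature_matching_loss(real_feats, fake_feats)  # 1.0 + 2.0 = 3.0
```

With multi-scale discriminators, a loss of this kind would be evaluated for each discriminator and the results added together.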

Coarse-to-fine Generator

[Figure: the coarse-to-fine generator is built from residual blocks; the multi-scale discriminators each compare real and synthesized images]

pix2pixHD multimodal results

pix2pixHD label changes

Comparison

Categorization

Multimodal, supervised: pix2pixHD, vid2vid, BiCycleGAN

vid2vid: Video-to-Video Synthesis

• “Video-to-Video Synthesis” by Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, Jan Kautz, Bryan Catanzaro, NIPS 2018

• Code: https://github.com/NVIDIA/vid2vid

Motivation

Using pix2pixHD

Spatially progressive

Temporally progressive

Spatio-temporally Progressive Training

Residual blocks

Alternating training

Multi-scale Discriminators: Image Discriminator, Video Discriminator

Sequential Generator

[Figure: sequential generator. Panel labels: Semantic, Past im..., Flow map, Warped, Intermediate, Final Image]

Results

AI Rendered Game

Categorization

Unimodal, supervised: pix2pix, CRN, SRGAN, …

Unimodal, unsupervised: UNIT, Coupled GAN, DTN, DiscoGAN, CycleGAN, DualGAN, StarGAN

UNIT: Unsupervised and unimodal image domain transfer

• “Unsupervised Image-to-image Translation Networks” by Ming-Yu Liu, Thomas Breuel, Jan Kautz, NIPS 2017
• Code: https://github.com/mingyuliutw/UNIT

Supervised vs Unsupervised

Generator 1 Discriminator 1

Discriminator 1

Generator 2 Discriminator 2

Discriminator 2

$p(G_1(X_1) \mid X_1) \to p(X_2)$

$p(G_2(X_2) \mid X_2) \to p(X_1)$

But $p(G_1(X_1) \mid X_1) \nrightarrow p(X_2 \mid X_1)$

But $p(G_2(X_2) \mid X_2) \nrightarrow p(X_1 \mid X_2)$

UNIT assumption: Shared Latent Space

Coupling the mapping function via weight-sharing
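The shared latent space idea can be sketched with toy linear layers. Here the last encoder layer and the first decoder layer are shared between the two domains, while the outer layers are domain-specific; that choice of which layers to share, the layer sizes, and all names are assumptions for illustration, and the weights are random rather than trained.

```python
import numpy as np

rng = np.random.default_rng(0)
dim_x, dim_h, dim_z = 6, 4, 3

# Domain-specific first encoder layers, one shared last encoder layer.
enc1_private = rng.normal(size=(dim_h, dim_x))
enc2_private = rng.normal(size=(dim_h, dim_x))
enc_shared = rng.normal(size=(dim_z, dim_h))

# One shared first decoder layer, domain-specific last decoder layers.
dec_shared = rng.normal(size=(dim_h, dim_z))
dec1_private = rng.normal(size=(dim_x, dim_h))
dec2_private = rng.normal(size=(dim_x, dim_h))

def encode(x, private):
    """Map an input from either domain into the shared latent space."""
    return enc_shared @ np.tanh(private @ x)

def decode(z, private):
    """Map a shared latent code into one specific domain."""
    return private @ np.tanh(dec_shared @ z)

def translate_1_to_2(x1):
    # Encode with the domain 1 encoder, decode with the domain 2 decoder.
    return decode(encode(x1, enc1_private), dec2_private)

def translate_2_to_1(x2):
    return decode(encode(x2, enc2_private), dec1_private)

x1 = rng.normal(size=dim_x)
x2_translated = translate_1_to_2(x1)
```

Translation is then just a matter of which decoder is attached to the shared code, which is how the weight-sharing couples the two mappings.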

[Figure: UNIT network. Labels: Encoder 1, Encoder 2, Shared latent space, Decoder 1, Decoder 2, Discriminator 1, Discriminator 2, Domain 2 GAN Discriminator]

Day to Night Translation

Input Translated Input Translated

Resolution: 640x480

Snowy to Summery Translation

Resolution: 640x480

Sunny to Rainy Translation

Resolution: 640x480

Categorization

Multimodal, supervised: pix2pixHD, BiCycleGAN

Multimodal, unsupervised: MUNIT

MUNIT: Unsupervised and multimodal image domain transfer

• “Multimodal Unsupervised Image-to-image Translation” by Xun Huang, Ming-Yu Liu, Serge Belongie, Jan Kautz, ECCV 2018

• Code: https://github.com/NVlabs/MUNIT

MUNIT assumption: Partially Shared Latent Space

[Figure: MUNIT network. Labels: Content Encoder, Style Encoder, Shared content latent space, Decoder 1, Decoder 2, Discriminator 1, Discriminator 2]

MUNIT network

MUNIT results

MUNIT Style Transfer

[Figure labels: Content Encoder, Style Encoder, Decoder 2]
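A toy sketch of how a content code and a style code combine in this setup is given below. The linear layers are random placeholders, the style code is simply concatenated with the content code (a simplification chosen for brevity, not the mechanism of the actual network), and all names and sizes are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
dim_x, dim_c, dim_s = 6, 4, 2

content_enc1 = rng.normal(size=(dim_c, dim_x))   # content encoder, domain 1
style_enc2 = rng.normal(size=(dim_s, dim_x))     # style encoder, domain 2
dec2 = rng.normal(size=(dim_x, dim_c + dim_s))   # decoder, domain 2

def translate_1_to_2(x1, style_code):
    """Decode the domain 1 content code together with a domain 2 style code."""
    content_code = content_enc1 @ x1
    return dec2 @ np.concatenate([content_code, style_code])

x1 = rng.normal(size=dim_x)          # input from domain 1
x2_example = rng.normal(size=dim_x)  # style example from domain 2

# Multimodal translation: sample several style codes for the same content.
sampled = [translate_1_to_2(x1, rng.normal(size=dim_s)) for _ in range(3)]

# Example-guided style transfer: take the style code from an example image.
guided = translate_1_to_2(x1, style_enc2 @ x2_example)
```

Only the content code is shared across domains; the style code is what varies between outputs, whether it is sampled or extracted from an example.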

Conclusion

• Example-based image domain transfer
• Learning-based image domain transfer

[Table: learning-based methods, organized by Unimodal vs Multimodal and by Supervised vs Unsupervised]

100 Best Companies to Work For— Fortune

Employees’ Choice: Highest Rated CEOs— Glassdoor

Most Innovative Companies— Fast Company

World’s Most Admired Companies— Fortune

50 Smartest Companies— MIT Tech Review

JOIN NVIDIA

YOUR LIFE’S WORK STARTS HERE

INTERESTED? Email: aijobs@nvidia.com
