
Tuesday, 9 May 2017

Andrew Edelsten - NVIDIA Developer Technologies

ZOOM, ENHANCE, SYNTHESIZE! MAGIC UPSCALING AND MATERIAL SYNTHESIS USING DEEP LEARNING

2

DEEP LEARNING FOR ART
Active R&D but ready now

▪ Style transfer

▪ Generative networks creating images and voxels

▪ Adversarial networks (DCGAN) – still early but promising

▪ DL & ML based tools from NVIDIA and partners

▪ NVIDIA

▪ Artomatix

▪ Allegorithmic

▪ Autodesk

3

STYLE TRANSFER
Something Fun

[Figure: a content image and a style image]

▪ Doodle a masterpiece!

▪ Uses a CNN to take the “style” from one image and apply it to another (see the sketch after this list)

▪ Sept 2015: A Neural Algorithm of Artistic Style by Gatys et al.

▪ Dec 2015: neural-style (github)

▪ Mar 2016: neural-doodle (github)

▪ Mar 2016: texture-nets (github)

▪ Oct 2016: fast-neural-style (github)

▪ 2 May 2017 (last week!): Deep Image Analogy (arXiv)

▪ Also numerous services: Vinci, Prisma, Artisto, Ostagram
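To make the content/style split concrete, here is a minimal sketch of the Gatys-style loss, assuming torchvision's pretrained VGG-16; the layer indices and the style weight are illustrative choices, not the exact configuration of any of the repos listed above.

```python
# Minimal sketch of a Gatys-style transfer loss.
# (newer torchvision uses vgg16(weights='IMAGENET1K_V1') instead of pretrained=True)
import torch
import torch.nn.functional as F
from torchvision.models import vgg16

vgg = vgg16(pretrained=True).features.eval()

def gram_matrix(feat):
    # (1, C, H, W) -> (C, C) matrix of channel correlations: the "style"
    _, c, h, w = feat.shape
    f = feat.view(c, h * w)
    return f @ f.t() / (c * h * w)

def features(img, layers):
    acts, h = {}, img
    for i, layer in enumerate(vgg):
        h = layer(h)
        if i in layers:
            acts[i] = h
    return acts

def style_transfer_loss(x, content, style,
                        content_layer=20,                 # relu4_2
                        style_layers=(1, 6, 11, 18, 25),  # relu1_1 .. relu5_1
                        style_weight=1e6):
    layers = set(style_layers) | {content_layer}
    ax, ac, ast = (features(t, layers) for t in (x, content, style))
    c_loss = F.mse_loss(ax[content_layer], ac[content_layer])
    s_loss = sum(F.mse_loss(gram_matrix(ax[i]), gram_matrix(ast[i]))
                 for i in style_layers)
    return c_loss + style_weight * s_loss
```

In the original method, x is initialized from the content image (or noise) and optimized directly against this loss with L-BFGS or Adam; the fast variants above instead train a feed-forward network to minimize it.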

4

http://ostagram.ru/static_pages/lenta

5

STYLE TRANSFER
Something Useful

▪ Game remaster & texture enhancement

▪ Try Neural Style and use a real-world photo for the “style”

▪ For stylized or anime up-rez, try https://github.com/nagadomi/waifu2x

▪ Experiment with art styles

▪ Dream or power-up sequences

▪ “Come Swim” by Kristen Stewart - https://arxiv.org/pdf/1701.04928v1.pdf

6

GAMEWORKS: MATERIALS & TEXTURES
Using DL for Game Development & Content Creation

▪ Set of tools targeting the game industry using machine learning and deep learning

▪ Launched at the Game Developers Conference in March; the tools run as a web service

▪ Sign up for the Beta at: https://gwmt.nvidia.com

▪ Tools in this initial release:

▪ Photo to Material: 2shot

▪ Texture Multiplier

▪ Super-Resolution

7

PHOTO TO MATERIAL
The 2Shot Tool

▪ From two photos of a surface, generate a “material”

▪ Based on a SIGGRAPH 2015 paper by NVIDIA Research & Aalto University (Finland)

▪ “Two-Shot SVBRDF Capture for Stationary Materials”

▪ https://mediatech.aalto.fi/publications/graphics/TwoShotSVBRDF/

▪ Input is pixel-aligned “flash” and “guide” photographs

▪ Use a tripod and remote shutter or bracket

▪ Or align later

▪ Use for flat surfaces with repeating patterns

8

MATERIAL SYNTHESIS FROM TWO PHOTOS

[Figure: flash image and guide image in; diffuse albedo, specular, normals, glossiness, and anisotropy maps out]

9

TEXTURE MULTIPLIER
Organic variations of textures

▪ Put simply: texture in, new texture out

▪ Inspired by Gatys, Ecker & Bethge

▪ Texture Synthesis Using Convolutional Neural Networks

▪ https://arxiv.org/pdf/1505.07376.pdf

▪ Artomatix

▪ Similar product “Texture Mutation”

▪ https://artomatix.com/

10

SUPER RESOLUTION

11

SUPER RESOLUTION
Zoom… ENHANCE!

“Zoom in on the license plate.” “OK!”

“Can you enhance that?” “Sure!”

12

SUPER RESOLUTION
The task at hand

Given a low-resolution image of size W × H, construct a high-resolution image of size n·W × n·H: upscale (magic?)

13

UPSCALE: CREATE MORE PIXELS
An ill-posed task?

[Figure: each pixel of the given image corresponds to a grid of unknown “?” pixels in the upscaled image]

14

TRADITIONAL APPROACH

▪ Interpolation (bicubic, Lanczos, etc.)

▪ Interpolation + sharpening (and other filtering); see the sketch below

[Figure: interpolation vs. filter-based sharpening]

▪ A rough estimation of the data’s behavior; too general

▪ Too many possibilities (an 8×8 grayscale patch has 256^(8∗8) ≈ 10^154 pixel combinations!)
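As a baseline, the traditional pipeline is only a few lines; a minimal sketch using Pillow (the radius/percent values are illustrative):

```python
# Traditional upscaling: bicubic interpolation + unsharp-mask sharpening.
from PIL import Image, ImageFilter

def upscale_traditional(path, factor=4):
    img = Image.open(path)
    big = img.resize((img.width * factor, img.height * factor),
                     resample=Image.BICUBIC)
    # Sharpening partially restores the high frequencies interpolation blurs
    return big.filter(ImageFilter.UnsharpMask(radius=2, percent=150,
                                              threshold=3))
```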

15

A NEW APPROACH
First: narrow the possible set

Focus on the domain of “natural images”

[Figure: nested sets: all possible images ⊃ natural images ⊃ photos, textures]

16

A NEW APPROACH
Second: place the image in the domain, then reconstruct

Data from natural images is sparse; it is compressible in some domain.

Then “reconstruct” images (rather than create new ones):

Compress → (+ prior information, + constraints) → Reconstruct

17

PATCH-BASED MAPPING: TRAINING

Training images → (LR, HR) pairs of patches → training → mapping (model params)

[Figure: a low-resolution patch and its corresponding high-resolution patch]

18

PATCH-BASED MAPPING

LR patch $x_L$ → Encode → high-level information about the patch → Decode → HR patch $x_H$

19

PATCH-BASED MAPPING: SPARSE CODING

LR patch $x_L$ → Encode → sparse code (the “features”: high-level information about the patch) → Decode → HR patch $x_H$

20

PATCH FEATURES & RECONSTRUCTION

An image patch can be reconstructed as a sparse linear combination of features, which are learned from the dataset over time:

$x = Dz = d_1 z_1 + \dots + d_K z_K$

e.g. $x = 0.8\,d_{36} + 0.3\,d_{42} + 0.5\,d_{63}$

where $x$ is the patch, $z$ is the sparse code, and $D$ is the dictionary.
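A minimal sketch of this reconstruction, using scikit-learn's orthogonal matching pursuit to find the sparse code; the random dictionary here is a stand-in for one learned from training patches (e.g. via K-SVD):

```python
import numpy as np
from sklearn.linear_model import orthogonal_mp

K, P = 256, 64                  # number of atoms, flattened 8x8 patch size
D = np.random.randn(P, K)       # stand-in for a learned dictionary
D /= np.linalg.norm(D, axis=0)  # OMP expects unit-norm atoms
x = np.random.randn(P)          # stand-in for an image patch

z = orthogonal_mp(D, x, n_nonzero_coefs=3)  # sparse code: ~3 active atoms
x_rec = D @ z                               # reconstruction x ≈ Dz
```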

21

GENERALIZED PATCH-BASED MAPPING

LR patch → Encode → high-level representation of the LR patch (the “features”) → mapping in feature space → high-level representation of the HR patch → Decode → HR patch

22

GENERALIZED PATCH-BASED MAPPING

The same pipeline with trainable parameters: $W_1$ (encode), $W_2$ (mapping in feature space), $W_3$ (decode)

23

MAPPING OF THE WHOLE IMAGE
Using Convolutions

LR image → HR image: the encode, feature-space mapping, and decode stages become convolutional operators applied to the whole image (see the sketch below)
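A minimal sketch of this three-stage convolutional mapping, following the layer sizes of Dong et al.'s SRCNN; this is illustrative, not the network behind the GameWorks tool:

```python
import torch.nn as nn

class SRCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 64, kernel_size=9, padding=4),  # encode: patch features (W1)
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 32, kernel_size=1),            # mapping in feature space (W2)
            nn.ReLU(inplace=True),
            nn.Conv2d(32, 1, kernel_size=5, padding=2),  # decode: HR pixels (W3)
        )

    def forward(self, x):  # x: a bicubic-upscaled LR image, shape (N, 1, H, W)
        return self.net(x)
```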

24

AUTO-ENCODERS

input → output ≈ input

25

AUTO-ENCODER

input → Encode → features → Decode → output ≈ input

26

AUTO-ENCODER

$y = F_W(x)$, with trainable parameters $W$

Training: $W = \arg\min_W \sum_i \mathrm{Dist}(x_i, F_W(x_i))$, where $\{x_i\}$ is the training set

Inference: $y = F_W(x)$
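A training-loop sketch of this objective with Dist = MSE; `model` is any encoder/decoder $F_W$, and `loader` is assumed to yield batches of $x_i$:

```python
import torch
import torch.nn.functional as F

def train_autoencoder(model, loader, epochs=10, lr=1e-3):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for x in loader:
            y = model(x)              # y = F_W(x)
            loss = F.mse_loss(y, x)   # Dist(x_i, F_W(x_i))
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model
```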

27

AUTO-ENCODER

input → Encode → information loss

▪ Our encoder is LOSSY by definition

28

SUPER-RESOLUTION AUTO-ENCODER

$y = F_W(x)$, with trainable parameters $W$

Training: $W = \arg\min_W \sum_i \mathrm{Dist}(x_i, F_W(x_i))$, where $\{x_i\}$ is the training set

Inference: $y = F_W(x)$

29

SUPER RESOLUTION AE: TRAINING

$W = \arg\min_W \sum_i \mathrm{Dist}(x_i, F_W(D(x_i)))$, where $\{x_i\}$ is the training set

Ground-truth HR image $x$ → downscaling $D$ → LR image $\hat{x}$ → SR AE $F_W$ → reconstructed HR image $y$ (see the training sketch below)
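A sketch of this training loop; only HR images are needed, since the LR input is produced on the fly by a fixed downscaling operator $D$ (bicubic here). The bicubic re-upscale before the model matches an SRCNN-style network that expects HR-sized input; these choices are illustrative:

```python
import torch
import torch.nn.functional as F

def downscale(x, n=4):  # the operator D
    return F.interpolate(x, scale_factor=1.0 / n, mode='bicubic',
                         align_corners=False)

def train_sr(model, hr_loader, epochs=10, lr=1e-3, n=4):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for x in hr_loader:              # x: ground-truth HR batch
            x_hat = downscale(x, n)      # the LR image D(x)
            up = F.interpolate(x_hat, scale_factor=n, mode='bicubic',
                               align_corners=False)
            y = model(up)                # y = F_W(D(x)), reconstructed HR
            loss = F.mse_loss(y, x)      # Dist(x, F_W(D(x)))
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model
```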

30

SUPER RESOLUTION AE: INFERENCE

Given LR image $\hat{x}$ → SR AE $F_W$ → constructed HR image $y = F_W(\hat{x})$

31

SUPER-RESOLUTION: ILL-POSED TASK?

32

THE LOSS FUNCTION

33

THE LOSS FUNCTION
Measuring the “distance” from a good result

$W = \arg\min_W \sum_i D(x_i, F_W(x_i))$

The distance function is a key element in obtaining good results; the choice of loss function is an important decision.

34

LOSS FUNCTION

MSE (Mean Squared Error): $\frac{1}{N}\lVert x - F(x)\rVert^2$

35

LOSS FUNCTION: PSNR

MSE (Mean Squared Error): $\frac{1}{N}\lVert x - F(x)\rVert^2$

PSNR (Peak Signal-to-Noise Ratio): $10 \log_{10}\frac{MAX^2}{MSE}$
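Both metrics in a few lines; MAX is the peak pixel value (1.0 for images in [0, 1], 255 for 8-bit):

```python
import numpy as np

def mse(x, y):
    return np.mean((x - y) ** 2)

def psnr(x, y, max_val=1.0):
    return 10 * np.log10(max_val ** 2 / mse(x, y))
```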

36

LOSS FUNCTION: HFEN

MSE (Mean Squared Error): $\frac{1}{N}\lVert x - F(x)\rVert^2$

PSNR (Peak Signal-to-Noise Ratio): $10 \log_{10}\frac{MAX^2}{MSE}$

HFEN (High Frequency Error Norm), a perceptual loss (see Ref A): $\lVert HP(x - F(x))\rVert^2$, where $HP$ is a high-pass filter

Ref A: http://ieeexplore.ieee.org/document/5617283/
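A sketch of HFEN; Ref A uses a Laplacian-of-Gaussian as the high-pass filter, which scipy provides directly (the sigma value is illustrative):

```python
import numpy as np
from scipy.ndimage import gaussian_laplace

def hfen(x, y, sigma=1.5):
    hp = gaussian_laplace(x - y, sigma)  # HP(x - F(x))
    return np.sum(hp ** 2)               # squared norm of the high-freq error
```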

37

REGULAR LOSS

[Figure: two 4× upscaling results]

38

REGULAR LOSS + PERCEPTUAL LOSS

[Figure: two 4× upscaling results]

39

WARNING… THIS IS EXPERIMENTAL!

40

SUPER-RESOLUTION: GAN-BASED LOSS

Total loss = regular (MSE + PSNR + HFEN) loss + GAN loss

GAN loss $= -\ln D(F(x))$

[Figure: generator produces $F(x)$ from $x$; discriminator scores $D(y)$, classifying its input $y$ as real or fake]
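A sketch of the generator's side of this objective; `disc` is assumed to output the probability that its input is a real HR image, and the GAN weight is illustrative:

```python
import torch
import torch.nn.functional as F

def generator_loss(y, x_hr, disc, gan_weight=1e-3):
    regular = F.mse_loss(y, x_hr)            # stands in for the MSE+PSNR+HFEN terms
    gan = -torch.log(disc(y) + 1e-8).mean()  # -ln D(F(x))
    return regular + gan_weight * gan
```

The discriminator is trained in alternation to tell real HR images from the generator's outputs.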

Extended presentation from Game Developers Conference 2017:

https://developer.nvidia.com/deep-learning-games

GameWorks: Materials & Textures

https://gwmt.nvidia.com

QUESTIONS?