Deep learning for image super resolution

DEEP LEARNING FOR IMAGE SUPER-RESOLUTIONCHAO DONG, CHEN CHANGE LOY, KAIMING HE, XIAOOU TANG

Presented By Prudhvi Raj DachapallyD. Prudhvi Raj

AbstractUsing Deep Convolutional Networks,

the machine can learn end-to-end mapping between the low/high-resolution images. Unlike traditional methods, this method jointly optimizes all the layers of the image. A light-weight CNN structure is used, which is simple to implement and provides formidable trade-off from the existential methods.

What is Deep Learning? A branch of Artificial Neural Networks and

Machine Learning that deals with more convolutional and realistic brain structures.

In the words of Dr. Andrew Ng, researcher at Stanford, Founder & CEO of Coursera, “Increased computing power has allowed us to map and process much larger neural networks than ever before.”

Appealing Properties of the Proposed Model The name given for this model is Super – Resolution

Convolutional Neural Network (or) SRCNN. Structure is simple, but provides superior accuracy

compared to state-of-the-art methods. Since it is a fully feed-forward network, it is

unnecessary to solve the optimization problem. Restoration quality can be further improved with more

diverse data and/or more deeper network without changing the core structure of the network.

SRCNN model can also cope with channels of color images simultaneously with ease, which in turn can improve performance.

Preliminaries Color Channel used – YCbCr

Y – Luminance Cb – Blue – difference Cr – Red – difference Cb and Cr are Chrominance components

First, we upscale the image to a desired size using bicubic interpolation method. This is just a pre-processing step.

Structure of the Network

Components in the Network Patch Extraction and Representation

Densely extracts patches and then represents them as a set of filters. This layer is expressed as a function F1, where

F 1(Y) = max(0, W1 * Y + B1)

This layer extracts a n1 –dimensional feature for each patch. Non – Linear Mapping

Maps each of the n1-dimensional vectors into an n2-dimensional one. This layer is expressed as a function F2, where

F 2(Y) = max(0, W2 * F1(Y) + B2)

It is possible to add more convolutional layers to this structure, but in perspective, increases the training time.

Reconstruction The predicted overlapping high-resolution patches are often averaged to

produce the final full image. This convolutional layer is defined as

F (Y) = W3 * F 2(Y) + B3

Terms Used in the Formulations W1 = Corresponds to the n1 filters of size c

* f1 * f1, Where c is number of channels and f1 is the spatial size

of the filter. B1 = An n1-dimensional vector, whose each

element is associated with a filter. W2 = n2 filters of size n1 * f2 * f2. B2 = n2 dimensional vector. W3 = Corresponds to c filters of size n2 * f3 * f3 B3 = c- dimensional vector.

Learning Process Estimation of network parameters can be

achieved through minimizing the loss between reconstructed images and the corresponding original high-resolution images. This is done by taking the Mean Squared Error (MSE).

Using MSE as a loss function, favors high PSNR( Peak Signal to Noise Ratio).

The loss is minimized by using stochastic gradient descent with regular back-propagation algorithm.

Experiments Training Data

Very Large Data Set of 395, 909 images from 2013 ImageNet Competition.

Test Data A BSD200 Data Set with 200 images.

Basic Network Settings These are f1 = 9, f2 = 1, f3 = 5, n1 = 64 and

n2 = 32.

Results

Comparison Against the State-of-the-art Methods

Real Time Results

Expansion Scope Using Large Filters

Increasing the filter size can increase the PSNR value, but also increases the training time.

Using Deeper Networks This can sometimes be a contradiction to the

rule “More the layers, so is the accuracy.”

Conclusion This approach, SRCNN, learns an end-to-end

mapping between low- and high-resolution images, with little extra pre/post-processing beyond the optimization. With a lightweight structure, the SRCNN achieves a superior performance than the state-of-the-art methods. Additional improvement in performance can be gained further by exploring more filters and different training strategies.

ReferencesImages, tables and some of the text used in this presentation as taken from Chao Dong et.al. “Image Super-Resolution Using Deep Convolutional Networks”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 38, February 2016.

Thank You

Deep learning for image super resolution

Data & Analytics

Transcript of Deep learning for image super resolution

and Super-Resolution arXiv:1603.08155v1 [cs.CV] 27 Mar 2016olga/Courses/Fall2016/CS9840/... · Keywords: Style transfer, super-resolution, deep learning 1 Introduction Many classic

SUPER RESOLUTION WITH DEEP CONVOLUTIONAL ...Published as a conference paper at ICLR 2016 SUPER-RESOLUTION WITH DEEP CONVOLUTIONAL SUFFICIENT STATISTICS Joan Bruna Department of Statistics

3D Appearance Super-Resolution with Deep Learningtimofter/publications/Li-CVPR...3D Appearance Super-Resolution with Deep Learning Yawei Li 1, Vagia Tsiminaki2, Radu Timofte , Marc

Super-resolution Using Constrained Deep Texture Synthesiscs.brown.edu/~lbsun/deep_texture_sr.pdf · Super-resolution Using Constrained Deep Texture Synthesis ... consider the combination

Image Super-Resolution via Deep Recursive Residual Networkcvlab.cse.msu.edu/pdfs/Tai_Yang_Liu_CVPR2017.pdf · Image Super-Resolution via Deep Recursive Residual Network ... pose two

Depth Map Super-Resolution by Deep Multi-Scale …personal.ie.cuhk.edu.hk/~ccloy/files/eccv_2016_depth.pdfDepth Map Super Resolution by Deep Multi-Scale Guidance 3 In this paper, we

Deep Learned Super Resolution for Feature Film Production · Training a super resolution model requires pairs of high-resolution and low-resolution frames. Most prior research relies

1 Super-Resolution via Deep Learning - arXiv · Super-Resolution via Deep Learning ... the pioneering work on the role of deep learning is as fresh ... super-resolution work not using

Video Super-Resolution via Deep Draft-Ensemble Learningopenaccess.thecvf.com/content_iccv_2015/papers/Liao_Video_Super... · 1. Introduction Video super-resolution (VideoSR), a.k.a.

Deep Network Cascade for Image Super-resolution · 2018-12-21 · Deep Network Cascade for Image Super-resolution Zhen Cui1;2, Hong Chang1, Shiguang Shan1, Bineng Zhong2, and Xilin

Robust Super-Resolution via Deep Learning and TV Priorsjmota.eps.hw.ac.uk/documents/Vella19-RobustSuper... · Robust Super-Resolution via Deep Learning and TV Priors Marija Vella⋆

“Zero-Shot” Super-Resolution using Deep Internal Learning

Detail-revealing Deep Video Super-resolution - arXiv Deep Video Super-resolution ... age detail fusion from multiple frames is the key to the suc-cess of video SR. ... take arbitrary-size

Multiscale brain MRI super-resolution using deep 3D ...

Supplementary File: Image Super-Resolution Using Very Deep ...yulunzhang.com/papers/ECCV-2018-RCAN_supp.pdf · Supplementary File: Image Super-Resolution Using Very Deep Residual

Deep learning achieves super-resolution in fluorescence ... · Network (GAN)8 model to transform an acquired low-resolution image into a high-resolution one. Once the deep network

Single Image Super-Resolution Using a Deep Encoder-Decoder ... · Single Image Super-Resolution Using a Deep Encoder-Decoder Symmetrical Network with Iterative Back Projection Heng

Super Resolution on 3D Point Clouds using Deep Learning€¦ · Master in Computer Vision ... Javier Ruiz Béatrice Pesquet Super Resolution on 3D Point Clouds using Deep Learning

Perceptual Deep Depth Super-Resolutionopenaccess.thecvf.com/content_ICCV_2019/papers/Voynov...Perceptual Deep Depth Super-Resolution Oleg Voynov1, Alexey Artemov1, Vage Egiazarian1,

Deep Networks for Image Super-Resolution with Sparse Priorjyang29/papers/ICCV15-rnnsr.pdf · Deep Networks for Image Super-Resolution with Sparse Prior ... several models based on