Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard...

18
Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P

Transcript of Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard...

Page 1: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Colorful Image ColorizationRichard Zhang, Phillip Isola, Alexei (Alyosha) Efros

Presented by,Adityaraj P

Page 2: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Objective

● Given a grayscale photograph as input, this paper tries to to find a plausible color version of the

photograph

● The objective of this paper is not to find the actual ground truth colors of the image but rather to

produce plausible colorization that can fool any human observer.

● Previous approaches have either relied on significant user interaction or resulted in desaturated

colorizations.

● The system is implemented as a feed-forward pass in a CNN and is trained on over a million color

images.

Page 3: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Inherent Ambiguity

Ground Truth Prediction

Page 4: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Color space

● To train the network the author started with the ImageNet dataset and converted all images from

the RGB color space to the Lab color space.

● Lab encodes color information as follows:○ L channel encodes lightness intensity only○ a channel encodes green-red.○ b channel encodes blue-yellow

Page 5: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective
Page 6: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Continuous Discrete (313 buckets)

Page 7: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective
Page 8: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Problem with loss function

● This loss is not robust to the inherent ambiguity and multimodal nature of the colorization

problem.

● If an object can take on a set of distinct ab values, the optimal solution to the Euclidean loss will be

the mean of the set. In color prediction, this averaging affect favors grayish, desaturated results.

● Additionally the solution will in fact be out of the set, giving implausible results.

Page 9: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Solution

● To appropriately model the multimodal nature of the problem, the authors predict a distribution of

possible colors for each pixel.

● Furthermore, the author reweight the loss at training time to emphasize rare colors.

● This encourages the model to exploit the full diversity of the large-scale data on which it is trained.

● Lastly, we produce final colorization by taking the annealed-mean of the distribution.

● The end result is colorizations that are more vibrant and perceptually realistic than those of

previous approaches.

Page 10: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Our final system F is the composition of CNN G, which produces a predicted distribution over all pixels, and the annealed-mean operation H, which produces final prediction.

Page 11: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Loss function

● To compare predicted value Z-hat against ground truth, we define a function Z = H-1(Y) which

converts ground truth color Y to vector Z.

● The loss function is then given by the equation

Page 12: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

● Define H, which maps the predicted distribution Z hat to point estimate Y hat in ab space.

● One choice is to take the mode of the predicted distribution for each pixel. This provides a vibrant but

sometimes spatially inconsistent result. For eg:- The red splotches on the bus.

● On the other hand, taking the mean of the predicted distribution produces spatially consistent but

desaturated results (left-most column).

● To try to get the best of both worlds, we interpolate by re-adjusting the temperature T of the softmax

distribution, and taking the mean of the result. This operation is referred to as Annealed Mean.

Page 13: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Evaluating Colorization Quality

1. Perceptual Realism (AMT)2. Semantic interpretability3. Raw Accuracy

Page 14: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Perceptual Realism

● In Perceptual Realism we check how compelling the color look to a human observer.

● To test this the author ran a real vs. fake two-alternative forced choice experiment on Amazon

Mechanical Turk.

● Participants in the experiment were shown a series of pairs of images. Each pair consisted of a real

color photo next to a re-colorized version produced by the algorithm. Participants were asked to

click on the photo they believed contained fake colors

● Each experimental session consisted of 10 practice trials followed by 40 test pairs.

Page 15: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Semantic interpretability

● In Semantic interpretability a VGG classification network is used to determine whether the model

is able to produce realistic enough colorization.

● The author tested this by feeding the fake colorized images to a VGG network that was trained to

predict ImageNet classes from real color photos.

● If the classifier performs well, that means the colorizations are accurate enough to be informative

about object class

Page 16: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective

Raw accuracy (AuC)

● Raw accuracy measure the percentage of predicted pixel colors within a thresholded L2 distance of

the ground truth in ab color space.

● Then we sweep across thresholds from 0 to 150 to produce a cumulative mass function, integrate

the area under the curve (AuC), and normalize.

Page 17: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective
Page 18: Colorful Image Colorizationkosecka/cs747/Colorization.pdf · Colorful Image Colorization Richard Zhang, Phillip Isola, Alexei (Alyosha) Efros Presented by, Adityaraj P. Objective