Deep Learning with TensorFlow: Understanding Tensors, Computation Graphs, Images, and Text
Deep Learning with TensorFlow: Understanding Tensors, Computation Graphs, Images, Text
Viswanath (Vish) Puttagunta
Technical Program Manager, Linaro
Statistics/Signal Processing in Images, Audio, RF
Data Analysis and Machine Learning on ARM SoCs.
Leading collaboration in the ARM™ ecosystem
Enterprise, Mobile, Home, Networking, IoT
Built on Open Source
Goals
Basics behind Neural Networks
Picturing Neural Networks. Understand the operations.
Representing Neural Networks using Tensors and Computation Graphs: Vision (MNIST), Text (Word2Vec)
TensorFlow
Gives you good tools to build a Neural Network Model.
Some basic models… but Google's secret sauce is still inside Google :)
The Basics
Linear Regression
Weight / Bias
Cost Function
Gradient Descent to optimize a cost function
Logistic Regression
Neuron Activation
Linear Regression
Objective: Fit a line/curve to minimize a cost function. The line has a "Weight / Slope" and a "Bias / y-intercept".
Cost Function, Gradient Descent
Sources:
https://www.youtube.com/watch?v=SqA6TujbmWw&list=PLE6Wd9FR--Ecf_5nCbnSQMHqORpiChfJf&index=16
https://youtu.be/WnqQrPNYz5Q?list=PLaXDtXvwY-oDvedS3f4HW0b4KxqpJ_imw&t=284
● Cost Function, J(𝞡)
○ Weight / Slope: 𝞡1
○ Bias / y-intercept: 𝞡2
● Gradient Descent
○ Go down to the lowest point in the cost function in small steps
BIG IDEA
● Start with some Weights & Biases
● Define a Cost Function
● Optimize the Cost Function by doing some sort of Gradient Descent
● Lots of help from TensorFlow (see the sketch below)
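A minimal sketch of that loop in TensorFlow (assuming the TF 1.x-style Python API; the toy data and learning rate are made up for illustration):

import numpy as np
import tensorflow as tf

# Toy data scattered around y = 3x + 1
x_train = np.linspace(0, 1, 100).astype(np.float32)
y_train = 3.0 * x_train + 1.0 + np.random.normal(0, 0.1, 100).astype(np.float32)

# Start with some Weight & Bias
W = tf.Variable(0.0)  # weight / slope
b = tf.Variable(0.0)  # bias / y-intercept
x = tf.placeholder(tf.float32)
y = tf.placeholder(tf.float32)

# Define a Cost Function: mean squared error
cost = tf.reduce_mean(tf.square(W * x + b - y))

# Optimize the Cost Function with Gradient Descent (small steps)
train_step = tf.train.GradientDescentOptimizer(0.5).minimize(cost)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for _ in range(200):
        sess.run(train_step, feed_dict={x: x_train, y: y_train})
    print(sess.run([W, b]))  # should approach [3.0, 1.0]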
Weight, Bias in 0,1 Dimensions
Neuron and Activation Function
BIG IDEAS
● Only one of the neurons will fire at a time (value shoots greater than 0.5)
● Which neuron fires depends on "x", Weights, Biases
● The rest of the neurons will not fire (value much less than 0.5)
● Outcomes are between 0 and 1 → probabilities (all outputs add to one)
○ → Good for classification (see the sketch below)
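A quick way to see these properties is to run a softmax over a handful of "evidence" values (a plain NumPy sketch; the numbers are made up):

import numpy as np

def softmax(z):
    # Subtract the max for numerical stability; does not change the result
    e = np.exp(z - np.max(z))
    return e / e.sum()

# Evidence for 3 neurons given some input x, Weights, Biases
evidence = np.array([2.0, 0.3, -1.0])
probs = softmax(evidence)
print(probs)        # e.g. ~[0.82, 0.15, 0.03] -> one neuron "fires" (> 0.5)
print(probs.sum())  # 1.0 -> outputs behave like probabilities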
Same picture: Less mess, more classes
Simple Neural Network for MNIST
Source: https://www.tensorflow.org/versions/r0.7/tutorials/mnist/beginners/index.html
Simple Neural Network for MNIST
Weights actually mean something!
Weights/Bias → Evidence → Probability
BIG IDEAS
● After proper Gradient Descent, Weights generalize well.
● The correlation operation corresponds to "Evidence".
● Then do softmax(evidence) and you get Probability(x ∈ specific class).
Source: https://www.tensorflow.org/versions/r0.7/tutorials/mnist/beginners/index.html
MNIST Simple: Operations / Dimensions
● Input image (x)
○ 28x28 matrix → 1x784
○ each element type: 0 or 1
● Weights (W0, W1, …, W9)
○ 28x28 matrix → 1x784
○ each element: float (-1, +1)
● Eg: W1 Ⓧ x (Correlation)
○ Multiply Accumulate!
■ matmul(W, xT)
○ single float (-1, +1)
● Biases (B0, B1, …, B9)
○ single floating pt (-1, +1)
● Evidence: Eg: W1 Ⓧ x + B1
○ single floating pt (-784.0, +784.0)
● Probability
○ single float (0, 1)
● Real output (Y), Predicted o/p (Ŷ)
○ 1x10 matrix
○ each element is 0 or 1
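These dimensions can be sanity-checked with a small NumPy sketch (random stand-ins for the trained weights and biases, with W0…W9 stacked as rows):

import numpy as np

x = np.random.randint(0, 2, (1, 784)).astype(np.float32)    # flattened 28x28 image, elements 0 or 1
W = np.random.uniform(-1, 1, (10, 784)).astype(np.float32)  # W0..W9 stacked, one row per digit
b = np.random.uniform(-1, 1, (10, 1)).astype(np.float32)    # B0..B9

evidence = np.matmul(W, x.T) + b  # multiply-accumulate: (10,784) x (784,1) -> (10,1)

# Softmax (shifted by the max for numerical stability): each in (0,1), all sum to 1
e = np.exp(evidence - evidence.max())
probs = e / e.sum()
print(evidence.shape, probs.shape, probs.sum())  # (10, 1) (10, 1) 1.0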
MNIST Simple: In TensorFlow
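A minimal sketch of the model graph (TF 1.x-style API, following the r0.7 beginners tutorial cited above):

import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 784])  # a batch of flattened 28x28 images
W = tf.Variable(tf.zeros([784, 10]))         # weights: one column per digit class
b = tf.Variable(tf.zeros([10]))              # biases: one per digit class

# evidence = matmul(x, W) + b; softmax turns evidence into class probabilities
y_hat = tf.nn.softmax(tf.matmul(x, W) + b)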
MNIST Simple: Back-Propagation
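Back-propagation here amounts to defining a cross-entropy cost over those probabilities and letting a gradient-descent optimizer minimize it. A self-contained sketch in the same TF 1.x style (the tutorial's input_data helper is assumed available):

import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data

mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)

# Model (same as the previous sketch)
x = tf.placeholder(tf.float32, [None, 784])
W = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))
y_hat = tf.nn.softmax(tf.matmul(x, W) + b)

# Cross-entropy cost against the one-hot true labels
y_true = tf.placeholder(tf.float32, [None, 10])
cross_entropy = tf.reduce_mean(-tf.reduce_sum(y_true * tf.log(y_hat), axis=1))

# Gradient descent: TensorFlow derives the gradients (back-propagation) for us
train_step = tf.train.GradientDescentOptimizer(0.5).minimize(cross_entropy)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for _ in range(1000):
        batch_xs, batch_ys = mnist.train.next_batch(100)
        sess.run(train_step, feed_dict={x: batch_xs, y_true: batch_ys})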
MNIST using Convolution Neural Network
Convolution: Signal/Image processing
Weight, Bias in 0,1 Dimensions
MNIST: Using CNN
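One convolution + pooling stage might look like this (a sketch in the same TF 1.x style; the 5x5 filters and 32 features follow the r0.7 tutorial's choices, not necessarily this deck's exact slides):

import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 784])
x_image = tf.reshape(x, [-1, 28, 28, 1])  # back to 2-D: [batch, height, width, channels]

# 32 filters of 5x5; the same small weight patch is slid across the whole image
W_conv1 = tf.Variable(tf.truncated_normal([5, 5, 1, 32], stddev=0.1))
b_conv1 = tf.Variable(tf.constant(0.1, shape=[32]))

# Convolution: a sliding multiply-accumulate (same idea as the correlation above)
h_conv1 = tf.nn.relu(tf.nn.conv2d(x_image, W_conv1,
                                  strides=[1, 1, 1, 1], padding='SAME') + b_conv1)

# 2x2 max-pooling halves the resolution: 28x28 -> 14x14
h_pool1 = tf.nn.max_pool(h_conv1, ksize=[1, 2, 2, 1],
                         strides=[1, 2, 2, 1], padding='SAME')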
Text Analytics
Word2Vec: Thinking of words as vectors
Sequence-to-Sequence models: Recurrent Neural Networks (LSTM)
Understanding sequence of words
"Don't like" is closer to "hate"
Words as vectors (Embeddings)
word list [human, cow] → int list [0, 1] → embeddings [(2,2), (4,0)]
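That pipeline, as a sketch (the 2-D embedding values are the made-up ones from the example above):

import tensorflow as tf

vocab = {"human": 0, "cow": 1}          # word list -> int list
embeddings = tf.constant([[2.0, 2.0],   # vector for "human"
                          [4.0, 0.0]])  # vector for "cow"

ids = tf.constant([vocab["human"], vocab["cow"]])
vectors = tf.nn.embedding_lookup(embeddings, ids)  # pick rows by integer id

with tf.Session() as sess:
    print(sess.run(vectors))  # [[2. 2.] [4. 0.]]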
Sequence of words as vectors!
Projections/Dimensionality Reduction (PCA/TSNE)
source: https://www.youtube.com/watch?v=VINCQghQRuM
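A common recipe for such 2-D projections (assuming scikit-learn and matplotlib, which are not part of TensorFlow):

import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

# Stand-in embeddings: 100 "words" in a 128-dimensional space
embeddings = np.random.randn(100, 128)

# t-SNE squeezes them into 2-D while trying to keep neighbors close
points = TSNE(n_components=2).fit_transform(embeddings)

plt.scatter(points[:, 0], points[:, 1])
plt.show()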
word2vec: using Neural Network
Note: Only trying to understand the relationship between words and the best way to vectorize them!
Not really concerned with the sequence of words. (Later: RNNs)
word2vec: using NN (softmax classifier)
Realize w_v could easily be w_50000 (50,000 words in the vocabulary!)
word2vec: using NN (softmax classifier pitfall)
Too many computations for every set of inputs (e.g., classifying across 50,000 classes if your vocabulary size is 50,000)
word2vec: using Noise Classifier
Big Idea:
● Instead of trying to move every word (vector) by a little bit in each step
● Move a few random words (~16) at a time
● If you have lots of sentences, you should be good (see the sketch below)
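TensorFlow packages this idea as a noise-contrastive estimation loss, tf.nn.nce_loss (a sketch patterned after the word2vec_basic.py walkthrough linked below; the sizes are illustrative and the keyword names follow the TF 1.x signature):

import tensorflow as tf

vocabulary_size = 50000
embedding_size = 128
batch_size = 128
num_sampled = 16  # the ~16 random "noise" words moved per step

train_inputs = tf.placeholder(tf.int32, [batch_size])
train_labels = tf.placeholder(tf.int32, [batch_size, 1])

embeddings = tf.Variable(
    tf.random_uniform([vocabulary_size, embedding_size], -1.0, 1.0))
embed = tf.nn.embedding_lookup(embeddings, train_inputs)

nce_weights = tf.Variable(
    tf.truncated_normal([vocabulary_size, embedding_size], stddev=0.05))
nce_biases = tf.Variable(tf.zeros([vocabulary_size]))

# Each step touches only num_sampled negative classes,
# instead of all 50,000 softmax classes
loss = tf.reduce_mean(tf.nn.nce_loss(weights=nce_weights,
                                     biases=nce_biases,
                                     labels=train_labels,
                                     inputs=embed,
                                     num_sampled=num_sampled,
                                     num_classes=vocabulary_size))
train_step = tf.train.GradientDescentOptimizer(1.0).minimize(loss)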
word2vec: Compute Graph
word2vec Code Walkthrough
https://github.com/tensorflow/tensorflow/blob/master/tensorflow/examples/tutorials/word2vec/word2vec_basic.py
Summary
TensorFlow was simple to install (Ubuntu 14.04):
sudo apt-get install python-pip python-dev
sudo pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-0.8.0rc0-cp27-none-linux_x86_64.whl
Tutorials are 'relatively' easy to follow
Good pointers to papers and blogs
Would like to hear Google's perspective on:
How good an open-source citizen they will be, compared to Theano, Caffe, Torch, AOSP…
References
● Convolution Neural Network: https://youtu.be/n6hpQwq7Inw
● TensorFlow Tutorials: https://www.tensorflow.org/versions/r0.7/tutorials/index.html
● Khan Academy (Linear Algebra): https://www.khanacademy.org/
● LSTM, Text Analytics (Alec Radford, indico): https://www.youtube.com/watch?v=VINCQghQRuM
● Current Trends in Deep Learning (Andrew Ng, Baidu): https://www.youtube.com/watch?v=O0VN0pGgBZM