(BDT311) Deep Learning: Going Beyond Machine Learning
-
Upload
amazon-web-services -
Category
Technology
-
view
3.288 -
download
0
Transcript of (BDT311) Deep Learning: Going Beyond Machine Learning
© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Chida Chidambaram
Vishal Deshpande
BDT311
Deep Learning
Going Beyond Machine Learning
October 2015
What to Expect from the Session
Data analytics options on AWS
Machine learning (ML) – high level
Amazon ML from AWS
ML sample use case
Deep learning (DL) – high level
DL sample use cases
AWS GPU/HPCC server family
Q&A
Data Analytics Options on AWS
Amazon EMR
AnalyzeStoreIngest
Amazon
Kinesis DynamoDBAmazon Redshift
RDSS3 Amazon Kinesis
ConsumerMachine Learning
Amazon Kinesis
Producer
Traditional Server Mobile Clients
EC2 Machines
Machine Learning
Machine Learning
How can a machine identify Bruce Willis vs Jason
Statham?
Bruce Willis ???
Machine Learning
Machine Learning
Artificial Intelligence
Optimization & Control
Neuroscience and Neural Networks
Statistical Modeling
Information Theory
Machine Learning
Bear
Eagle
People
Sunset
Machine Learning
• Using machines to discover trends and patterns and compute
mathematical predictive models based on factual past data
• ML models provide insights into likely outcomes based on the past –
machine learning helps uncover the probability of an outcome in the
future rather than merely state what has already happened in the past
• Past data and statistical modeling is used to make predictions based
on probability
Where traditional business analytics aims at answering questions about
past events, machine learning aims at answering questions about the
possibilities of future events
Machine Learning
Supervised learning
Human intervention and validation required
Photo classification and tagging
Unsupervised learning
No human intervention required
Auto-classification of documents based on context
Machine Learning
Collect
Validation data Test dataTraining data
Model training Model validation Final predictions
Machine Learning – Process
• Input feature selection – what are my predictions going
to be based on
• Target – what you want to predict
• Prediction function – regression, classification,
dimensionality reduction
Xn -> F(xn) -> T(x)
Machine Learning – Process
X1 X2 X3 X4 X5 Y
0.3 0.25 0.4 0.34 0.2 1
0.14 0.17 0.2 0.3 0.2 0
0.24 0.21 0.19 0.15 0.35 1
0.3 0.25 0.35 0.4 0.45 1
𝜒𝑛𝜖𝐹(𝑥𝑛) ; Target: y
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
0.5
0 1 2 3 4 5
X1 X2 X3 X4 X5
Machine Learning – Process
How can a machine identify Bruce Willis vs Jason
Statham?
Image analysis –
Input feature set for image 1 -> bald, black suit
Bruce Willis ???
Machine Learning – Process• Start with data for which the answer is already known
• Identify the target – what you want to predict from the data
• Pick the variables/features that can be used to identify the patterns
to predict the target
• Train the ML model with the dataset for which you already know the
target answer
• Use the trained model to predict the target on the data for which the
answer is not known
• Evaluate the model for accuracy
• Improve the model accuracy as needed
Machine Learning – When to Use It
You need ML if
• Simple classification rules are inadequate
• Scalability is an issue with large number of datasets
You do not need ML if
• You can predict the answers by using simple rules and computations
• You can program predetermined steps without needing any data
driven learning
Machine Learning from AWS
Amazon Machine Learning is a service that makes it easy
for developers of all skill levels to use machine learning
technology.
Machine Learning from AWS
Machine Learning from AWS
Machine Learning from AWS
• AWS ecosystem integration
• Pre-built ML algorithms
• Batch and real-time prediction
• Faster models to predictions
• Data visualizations and exploration
• Data transformations
• Fully-managed
• Pay as you go
Machine Learning – Uses
Predictive analytics
• What is the likelihood that a customer visiting my e-commerce site will buy my product
• What is the probability of a congressional bill being passed
Classification / grouping
• Auto classification and tagging of images
• Video classification
• Auto categorization of raw text data based on predefined ontologies
Machine Learning Use Cases
• Personalization – present personalized e-commerce
experience
• Document classification – auto classify documents
based on the context
• Fraud detection – discover anomalies to regular
behavior to identify and flag fraudulent transactions
• Recommendation engines
• Customer churn prediction
Deep Learning – Advanced ML
Deep Learning – Going Beyond ML
ML algorithms that are either supervised or unsupervised
and
• Use many layers of nonlinear processing units for
feature extraction and transformation
• Are based on learning multiple levels of features or
representation in each layer, with the layers forming a
hierarchy of low-level to high-level features
Where traditional machine learning focuses on feature
engineering, deep learning focuses on end-to-end
learning based on raw features
Deep Learning
Bear
Eagle
People
Sunset
Object: Bear
Location: Yellowstone Park
Action: Looking for food
Object: Eagle
Location: Wakula Springs, FL
Action: Resting
Object: Multiple – people, ball
Location: Montana
Action: Playing
Object: ?
Location: Montana
Action: ?
Deep Learning – Neural Networks
A collection of simple, trainable mathematical units that
collectively learn complex functions
Output
Neural network
Input
Hidden layers
Deep Learning – Train
X Bear
Grizzly Bear
Polar Bear
Dog
Fox
Feedback
Neural network
Deep Learning – Deploy
Grizzly Bear
Neural network
Deep Learning – Flow
Train
DeployModel
Classification
Detection
Segmentation
Feedback
Training dataset
Solver
Neural network
Train
Solver
Network
Dashboard
Deep Learning – Data Representation
Hierarchy of representations
• Image – vectors of pixel, motif, part, contour, edge, etc.
• Videos – Image frames, pixels per frame, deltas per
frame, etc.
• Text – characters, words, clauses, sentences, etc.
• Speech – audio, band, frequency, wavelengths,
modulations, phonetics, etc.
Deep Learning – Advantages
• Features automatically deduced and optimally tuned for
the desired outcome
• Robustness to variations automatically learned
• Reusability – same neural network approach can be
used for many applications and data types
• Massively parallel computations through use of GPUs –
scalable for large volumes of data
Deep Learning – Traction
• Cloud and big data eco-system – cost reduction in
computation and storage capacity for huge volumes of
data
• New advancements in deep learning toolkit with better
GPU computation tools and libraries
• Advancements in GPU acceleration and availability of
GPU clusters through the cloud infrastructure
What is driving deep learning…
Deep Learning on AWS - GPU Servers
• Family of servers for DL/HPCC
• C4 instances – for high performance computing
• G2 instances – for additional CUDA processing used in
deep learning
• Four NVIDIA GRID GPUs, each with 1,536 CUDA cores and
4 GB of video
• 32 vCPUs
• 60 GB of memory
• 240 GB (2 x 120) of SSD storage
Application Code
CPUGPU
Compute
Intensive
Code
Rest of
Code
AWS GPU Servers
Deep Learning – GPU Acceleration
Batch size Training time
(CPU)
Training time
(GPU)
64 images 64s 7.5s
128 images 124s 14.5s
256 images 257s 28.5s
Training a deep neural network for image processing
CPU : Dual 10-core Ivy Bridge CPUs
GPU : 1 Tesla K40 GPU
Implemented with Caffe
* nVidia
Deep Learning – Software Tools and Libraries
• Theano (Python)
• Blocks (Python/Theano)
• Lasange (Python/Theano)
• Pylearn2 (Python)
• Torch (Lua)
• Deeplearning4J (Java)
• Caffe
• CUDA-convent
Deep Learning – Uses
• Automatic speech recognition
• Image recognition
• Natural language processing
• Drug discovery and toxicology
• CRM and e-commerce
• Human behavior analysis
• Driverless cars
• Search and advertising
Deep Learning – Research
And more…
Image Recognition /
Computer Vision
DL – Image Recognition / Computer Vision
• Visual searches for retail
Industries
• Self-driving cars
• Home security
• Wearables
Natural Language Processing and
Speech Recognition
DL - Natural Language Processing and Speech
Recognition
• Understanding the meaning
• Similar or dissimilar words
• Contextual meaning
• Language modeling
• Language neural network
Restaurants near me
ML to DL – From Siri/Cortana to J.A.R.V.I.S
Restaurants near me
Good morning, sir. Would you like a cup
of coffee or a shot of vodka? Probably
the vodka would be a better choice for
you today.
DL Implementation –
Driverless Cars
Driverless Cars
• Google, Baidu, Mercedes Benz , Audi,
Tesla
• Deep neural network (DNN) models
• Real-time pedestrian detection
algorithms
• Processes TBs of data in real-time
• Keep the car moving!
• In addition to basic functions
Eurocars.com
Demo
Demo
Useful Resources
• Bring Your Own Data (BYOData) campaign from Day1
http://day1solutions.com/byo-data
• Amazon Machine Learning
http://aws.amazon.com/machine-learning
• Deep-Learning lab and courses
https://developer.nvidia.com/deep-learning-courses
• Deep-Learning resources
http://deeplearning.net
• Public data sets for Deep-Learning research
http://deeplearning.net/datasets/
Remember to complete
your evaluations!
Thank you!