Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics
-
Upload
kevin-mader -
Category
Technology
-
view
341 -
download
0
Transcript of Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics
PRACTICAL INTRODUCTION TO
ARTIFICIAL INTELLIGENCEDEEP LEARNINGLARGE-SCALE IMAGE ANALYTICSKEVIN MADER / FLAVIO TROLESE4QUANT | BIG IMAGE ANALYTICS
PANTALKTUESDAY, MARCH 19 2016 / IMPACTHUB GARAGE ZURICH
4Quant | BIG IMAGE ANALYTICS
Die Länder, die Österreich umgeben.
↓
Was sind Schweiz, Italien, Slowenien, Ungarn, Tschechische Republik, Deutschland, Slovakei?
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
http://4quant.com/javascript-breakout/
4Quant | BIG IMAGE ANALYTICS
HOW?
4Quant | BIG IMAGE ANALYTICS
STANDARD MACHINE LEARNING
4Quant | BIG IMAGE ANALYTICS
CORE IDEASWhat is an image?
What a human sees What a machine sees
4Quant | BIG IMAGE ANALYTICS
CORE IDEASFeature Generation → Making the computer see more
4Quant | BIG IMAGE ANALYTICS
CORE IDEASTraining / Validation
With all machine learning techniques it is critical to divide data into training and validation sets.
The algorithm can then be tested (validated) on data it has never seen before to ensure it generalizes
4Quant | BIG IMAGE ANALYTICS
OUTPUT / LOSS FUNCTION
4Quant | BIG IMAGE ANALYTICS
The in order for machine learning to work there has to be a single output for the system which quantifies how well it is working
- the number of correctly identified structures (true-positives)
- the number of correct letters in a sentences
- the score of a game
CORE IDEASLearning from tagged data (supervised)
4Quant | BIG IMAGE ANALYTICS
What is this?
PROBLEMSFeatures can be very difficult to ‘engineer’.
What makes a person a person?
More data doesn’t always lead to better results.
4Quant | BIG IMAGE ANALYTICS
DEEP LEARNING
4Quant | BIG IMAGE ANALYTICS
One of these can recognize without any programming by just experiencing and getting feedback.
THE IDEA
4Quant | BIG IMAGE ANALYTICS
https://flic.kr/p/2eryEj
The human brain is a large, layered, connected network of neurons.
THE IDEA
4Quant | BIG IMAGE ANALYTICS
https://flic.kr/p/5J4uci
We understand how some of these layers work and can make computationally fast models for simulating their behavior
THE IDEA
4Quant | BIG IMAGE ANALYTICS
DEEP LEARNING
Deep learning is a set of algorithms in machine learning that attempt to learn in multiple levels, corresponding to different levels of abstraction.
4Quant | BIG IMAGE ANALYTICS
THE IDEA
4Quant | BIG IMAGE ANALYTICS
A machine learning system with millions of inputs
And 1 output
THE IDEA
4Quant | BIG IMAGE ANALYTICS
The networks can get very large (hence the deep)
Here is the Inception Network from Google
TYPES OF ARTIFICIAL NEURONS
4Quant | BIG IMAGE ANALYTICS
Fully-connected → everything connected to everything
Convolutional (CNN) → mix things together
Recurring (RNN) → remember parts of sequences
Recurring Networks
4Quant | BIG IMAGE ANALYTICS
http://karpathy.github.io/2015/05/21/rnn-effectiveness/
Recurring Networks
4Quant | BIG IMAGE ANALYTICS
http://karpathy.github.io/2015/05/21/rnn-effectiveness/
Given the starting letter h
Predict the rest of the letters
SCHWIIZERDÜTSCH
4Quant | BIG IMAGE ANALYTICS
Goes through hundreds of pages of text character by character and trains neurons to predict the correct output
The text shows the algorithm learning to complete the sentence.
The curve shows how confident it is in each guess
SCHWIIZERDÜTSCH
4Quant | BIG IMAGE ANALYTICS
100 gu sisxt n eigeiua a esSWctaicobemhat,E out?s v t t uew
10K Uhe uf Hountigm don d’Bomura fürsyn al jerisim Sbeour Rucch
65K Übschamt wiänä wo und ebs haGscham, üblart uls zä flusch, zänsert. De Unner sindämzalagsel
100K Totatwärt. Dischtä Tittä vo dä ues und erwiä Gsacht agä schtüswongeilä. Beterischtiongehärne vordä em Verbichunt. Diä Mieräng ader h d Zientlichnig vu CHF
4Quant | BIG IMAGE ANALYTICS
SCHWIIZERDÜTSCH
Spell/Grammar Check (for a language with ‘no rules’) Dialect Detector
Autocomplete
APPLICATIONS
4Quant | BIG IMAGE ANALYTICS
Automatic C code
Wikipedia Text
BEYOND SCHWIIZERDÜTSCH
4Quant | BIG IMAGE ANALYTICS
CONVOLUTIONAL NETWORKS
4Quant | BIG IMAGE ANALYTICS
Pixels Edges Object parts Object models
→ → →
CONVOLUTIONAL NETWORKS
4Quant | BIG IMAGE ANALYTICS
CONVOLUTIONAL NETWORKS
4Quant | BIG IMAGE ANALYTICS
4Quant | BIG IMAGE ANALYTICS
CONVOLUTIONAL NETWORKS
Street, Trees, Fence, Bicycle
UNDERSTANDING COMPLEX SCENES
4Quant | BIG IMAGE ANALYTICS
Self-driving cars need to be able to identify walkways automatically
All point geo-referenced
IDENTIFY WALKWAYS
4Quant | BIG IMAGE ANALYTICS
Understanding what is happening inside of these complex networks
DREAMING
4Quant | BIG IMAGE ANALYTICS
Applying parts of trained networks to other types of images.
TRANSFER
4Quant | BIG IMAGE ANALYTICS
A challenging field
- noisy
- highly variable
- many tissues / diseases look the same
MEDICAL IMAGES
4Quant | BIG IMAGE ANALYTICS
Red are lungs
Yellow are bones
Blue are the other organs
MEDICAL IMAGES
4Quant | BIG IMAGE ANALYTICS
FINDING CANCER
4Quant | BIG IMAGE ANALYTICS
MEDICAL IMAGES
4Quant | BIG IMAGE ANALYTICS
Convolutional neurons act on the image and learn to extract the relevant information
MEDICAL IMAGES
4Quant | BIG IMAGE ANALYTICS
These representations can then be used to automatically find organs like the heart and measure blood flow
→ →
MEDICAL IMAGES
4Quant | BIG IMAGE ANALYTICS
These representations can then be used to automatically find organs like the heart and measure blood flow
→ →
Open Challenges
4Quant | BIG IMAGE ANALYTICS
You need a lot of data to identify (1K-100M)
Some networks learn well, others do not
Parameters can make a huge difference
Intermediate layers can be difficult to interpret
RESOURCESGoogle Cloud Vision APIIBM Watson Vision ServiceMicrosoft Project Oxford
TensorFlowCaffeMoodstocks
4Quant
PRACTICAL INTRODUCTION TO
ARTIFICIAL INTELLIGENCEDEEP LEARNINGLARGE-SCALE IMAGE ANALYTICSTHANK YOUKEVIN MADER / FLAVIO TROLESE4Quant Ltd.
PANTALKTHURSDAY MARCH 19 2016 / IMPACTHUB GARAGE ZURICH
4Quant| BIG IMAGE ANALYTICS