Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics

Post on 19-Jan-2017

341 views 0 download

Transcript of Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics

PRACTICAL INTRODUCTION TO

ARTIFICIAL INTELLIGENCEDEEP LEARNINGLARGE-SCALE IMAGE ANALYTICSKEVIN MADER / FLAVIO TROLESE4QUANT | BIG IMAGE ANALYTICS

PANTALKTUESDAY, MARCH 19 2016 / IMPACTHUB GARAGE ZURICH

Flavio Trolese
+kevin.mader@4quant.com können wir hier eine kurze demo zu video to text machen?

4Quant | BIG IMAGE ANALYTICS

Die Länder, die Österreich umgeben.

Was sind Schweiz, Italien, Slowenien, Ungarn, Tschechische Republik, Deutschland, Slovakei?

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

http://4quant.com/javascript-breakout/

4Quant | BIG IMAGE ANALYTICS

HOW?

4Quant | BIG IMAGE ANALYTICS

STANDARD MACHINE LEARNING

4Quant | BIG IMAGE ANALYTICS

CORE IDEASWhat is an image?

What a human sees What a machine sees

4Quant | BIG IMAGE ANALYTICS

CORE IDEASFeature Generation → Making the computer see more

4Quant | BIG IMAGE ANALYTICS

CORE IDEASTraining / Validation

With all machine learning techniques it is critical to divide data into training and validation sets.

The algorithm can then be tested (validated) on data it has never seen before to ensure it generalizes

4Quant | BIG IMAGE ANALYTICS

OUTPUT / LOSS FUNCTION

4Quant | BIG IMAGE ANALYTICS

The in order for machine learning to work there has to be a single output for the system which quantifies how well it is working

- the number of correctly identified structures (true-positives)

- the number of correct letters in a sentences

- the score of a game

CORE IDEASLearning from tagged data (supervised)

4Quant | BIG IMAGE ANALYTICS

What is this?

PROBLEMSFeatures can be very difficult to ‘engineer’.

What makes a person a person?

More data doesn’t always lead to better results.

4Quant | BIG IMAGE ANALYTICS

DEEP LEARNING

4Quant | BIG IMAGE ANALYTICS

One of these can recognize without any programming by just experiencing and getting feedback.

THE IDEA

4Quant | BIG IMAGE ANALYTICS

https://flic.kr/p/2eryEj

The human brain is a large, layered, connected network of neurons.

THE IDEA

4Quant | BIG IMAGE ANALYTICS

https://flic.kr/p/5J4uci

We understand how some of these layers work and can make computationally fast models for simulating their behavior

THE IDEA

4Quant | BIG IMAGE ANALYTICS

DEEP LEARNING

Deep learning is a set of algorithms in machine learning that attempt to learn in multiple levels, corresponding to different levels of abstraction.

4Quant | BIG IMAGE ANALYTICS

THE IDEA

4Quant | BIG IMAGE ANALYTICS

A machine learning system with millions of inputs

And 1 output

THE IDEA

4Quant | BIG IMAGE ANALYTICS

The networks can get very large (hence the deep)

Here is the Inception Network from Google

TYPES OF ARTIFICIAL NEURONS

4Quant | BIG IMAGE ANALYTICS

Fully-connected → everything connected to everything

Convolutional (CNN) → mix things together

Recurring (RNN) → remember parts of sequences

Recurring Networks

4Quant | BIG IMAGE ANALYTICS

http://karpathy.github.io/2015/05/21/rnn-effectiveness/

Recurring Networks

4Quant | BIG IMAGE ANALYTICS

http://karpathy.github.io/2015/05/21/rnn-effectiveness/

Given the starting letter h

Predict the rest of the letters

SCHWIIZERDÜTSCH

4Quant | BIG IMAGE ANALYTICS

Goes through hundreds of pages of text character by character and trains neurons to predict the correct output

The text shows the algorithm learning to complete the sentence.

The curve shows how confident it is in each guess

SCHWIIZERDÜTSCH

4Quant | BIG IMAGE ANALYTICS

100 gu sisxt n eigeiua a esSWctaicobemhat,E out?s v t t uew

10K Uhe uf Hountigm don d’Bomura fürsyn al jerisim Sbeour Rucch

65K Übschamt wiänä wo und ebs haGscham, üblart uls zä flusch, zänsert. De Unner sindämzalagsel

100K Totatwärt. Dischtä Tittä vo dä ues und erwiä Gsacht agä schtüswongeilä. Beterischtiongehärne vordä em Verbichunt. Diä Mieräng ader h d Zientlichnig vu CHF

4Quant | BIG IMAGE ANALYTICS

SCHWIIZERDÜTSCH

Spell/Grammar Check (for a language with ‘no rules’) Dialect Detector

Autocomplete

APPLICATIONS

4Quant | BIG IMAGE ANALYTICS

Automatic C code

Wikipedia Text

BEYOND SCHWIIZERDÜTSCH

4Quant | BIG IMAGE ANALYTICS

CONVOLUTIONAL NETWORKS

4Quant | BIG IMAGE ANALYTICS

Pixels Edges Object parts Object models

→ → →

CONVOLUTIONAL NETWORKS

4Quant | BIG IMAGE ANALYTICS

CONVOLUTIONAL NETWORKS

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

CONVOLUTIONAL NETWORKS

Street, Trees, Fence, Bicycle

UNDERSTANDING COMPLEX SCENES

4Quant | BIG IMAGE ANALYTICS

Self-driving cars need to be able to identify walkways automatically

All point geo-referenced

IDENTIFY WALKWAYS

4Quant | BIG IMAGE ANALYTICS

Understanding what is happening inside of these complex networks

DREAMING

4Quant | BIG IMAGE ANALYTICS

4Quant | BIG IMAGE ANALYTICS

DREAMING

Applying parts of trained networks to other types of images.

TRANSFER

4Quant | BIG IMAGE ANALYTICS

A challenging field

- noisy

- highly variable

- many tissues / diseases look the same

MEDICAL IMAGES

4Quant | BIG IMAGE ANALYTICS

Red are lungs

Yellow are bones

Blue are the other organs

MEDICAL IMAGES

4Quant | BIG IMAGE ANALYTICS

FINDING CANCER

4Quant | BIG IMAGE ANALYTICS

MEDICAL IMAGES

4Quant | BIG IMAGE ANALYTICS

Convolutional neurons act on the image and learn to extract the relevant information

MEDICAL IMAGES

4Quant | BIG IMAGE ANALYTICS

These representations can then be used to automatically find organs like the heart and measure blood flow

→ →

MEDICAL IMAGES

4Quant | BIG IMAGE ANALYTICS

These representations can then be used to automatically find organs like the heart and measure blood flow

→ →

Open Challenges

4Quant | BIG IMAGE ANALYTICS

You need a lot of data to identify (1K-100M)

Some networks learn well, others do not

Parameters can make a huge difference

Intermediate layers can be difficult to interpret

PRACTICAL INTRODUCTION TO

ARTIFICIAL INTELLIGENCEDEEP LEARNINGLARGE-SCALE IMAGE ANALYTICSTHANK YOUKEVIN MADER / FLAVIO TROLESE4Quant Ltd.

PANTALKTHURSDAY MARCH 19 2016 / IMPACTHUB GARAGE ZURICH

4Quant| BIG IMAGE ANALYTICS