Deep Neural Networks for Safety-Critical Applications ...

Deep Neural Networks for Safety-Critical Applications: Vision and Open Problems

Daniel Casini, Alessandro Biondi, Giorgio Buttazzo

ReTiS Lab, Scuola Superiore Sant’Anna, Pisa, Italy

2Daniel Casini

Motivations

Currently, many car manufacturers are tackling the race towards

autonomous cars

Huge improvements in DNN

accuracy for many tasks (e.g.

image classification)

1

2

3Daniel Casini

Motivations

Currently, many car manufacturers are tackling the race towards

autonomous cars

Huge improvements in DNN

accuracy for many tasks (e.g.

image classification)

1

2

DNNs may be useful for achieving an autonomous car!

But there are many issues…

4Daniel Casini

Motivations

But not only autonomous driving…

• DNNs can be also adopted for other types ofautonomous systems (e.g., robotics, industrialcontrol)

5Daniel Casini

Neural Networks

• Used to solve problems that are difficult toformalize by a set of rules.

• Neural Networks imitatethe way our brain works

Problems that depend on too many detailsare learned by direct experience

6Daniel Casini

Neural Networks

• Neural Networks consist of aset of neurons, often organizedinto layers

• Neurons are connected toeach other by synapticweights

• They are used for manydifferent purposes, as speechrecognition, image processing,wheather forecast, etc.

7Daniel Casini

Supervised Learning

• They can learn “by examples” to associateinput-output pairs

Red Semaphore

• Example: Before training

Input Desired Output

Car

Actual Output

Incorrect Classification!

8Daniel Casini

Supervised Learning• The training algorithm regulates the internal

parameters (i.e., weights) of the network forproducing the expected output

Red Semaphore

• Example: After training

Input Desired Output

Red Semaphore

Actual Output

Correct!

9Daniel Casini

• The ILSVRC Challenge is a competition held from

2010 in which networks compete in classifying

objects from images to labels, with 1000 possible

categories

Training set: 1.2 million images (1,000 categories)

Test set: 150,000 images

Image Recognition

10Daniel Casini

The winning network of 2017 (SENet), achievedan accuracy of 97.74%

Source: http://blog.paralleldots.com/data-science/must-read-path-breaking-papers-about-image-classification/

Are DNNs good enough?

11Daniel Casini

1. Certification Issues

Motivation

Deep Neural Networks in Safety Critical Scenarios:

12Daniel Casini

Certification Issues

Huge problem for certification!

• Deep Neural Networks do not have awell-defined behavior

• Their results are difficult to be replicated (e.g.,changing few pixels of an image may lead todifferent results)

13Daniel Casini

Certification Issues

Huge problem for certification!

• Deep Neural Networks do not have awell-defined behavior

• Their results are difficult to be replicated (e.g.,changing few pixels of an image may lead todifferent results)

How to support DNNs in a safety-critical context

to build a system that can be certified?

14Daniel Casini

Hint of SolutionIDEA: Match each DNN with a corresponding algorithmbased on hard computing (e.g., a convolution filter) tomonitor their behavior and redirect the actuation tosafe actions in case of detected misbehavior

Deep Neural Network

Hard Computing Algorithm

Decision Algorithm

IMA

GE

Actuation

15Daniel Casini

Example of safe action

DNN Fault!

IDEA: Match each DNN with a corresponding algorithmbased on hard computing (e.g., a convolution filter) tomonitor their behavior and redirect the actuation tosafe actions in case of detected misbehavior

16Daniel Casini

Example of safe action

Safe action: stop the car in an empty space

IDEA: Match each DNN with a corresponding algorithmbased on hard computing (e.g., a convolution filter) tomonitor their behavior and redirect the actuation tosafe actions in case of detected misbehavior

17Daniel Casini

Redundant Neural NetworksThe average-case behavior can be improved byinserting redundant neural networks, based on differentmodels or trained with a different algorithm.


Decision AlgorithmIM

AG

E

Actuation

DNN-1

DNN-2

DNN-3

VOTING

18Daniel Casini

Redundant Neural NetworksThe average-case behavior can be improved byinserting redundant neural networks, based on differentmodels or trained with a different algorithm.


Decision AlgorithmIM

AG

E

Actuation

DNN-1

DNN-2

DNN-3

VOTING

Which type of supervision algorithms are

needed to detect wrong outputs and redirect

actuation to safe actions?

19Daniel Casini

2. Security and Isolation

Motivation


20Daniel Casini

Security and Isolation

• A DNN is a complex software, exposed tosecurity threats

• What if an attacker exploits the weakness of aDNN to take control of the steering system?

21Daniel Casini

Security and Isolation

• A DNN is a complex software, exposed tosecurity threats

• What if an attacker exploits the weakness of aDNN to take control of the steering system?

How to avoid that the complexity of DNNs may

lead to security threats for a safety critical

system running on top of a shared platforms?

22Daniel Casini

A hypervisor-based solution

IDEA: Divide a multicore heterogeneousplatform in two domains

Multicore Heterogeneous Platform

Safety-critical

Hypervisor

Prone to attacks and malfunctioning

Deep Neural Networks

23Daniel Casini

A hypervisor-based solution

IDEA: Divide a multicore heterogeneousplatform in two domains

Multicore Heterogeneous Platform

Safety-critical

Hypervisor

Prone to attacks and malfunctioning

Deep Neural NetworksWhich mechanisms have to be provided to allow

them interacting while running on different OSes?

24Daniel Casini

3. Predictability

Motivation


25Daniel Casini

Predictability

KEY ISSUE: Guaranteeing that a real-timeworkload composed of DNNs is schedulable

• Focus on the inference phase only

• A DNN is composed of a pipeline of layers,where each one implements an operation

Layer Layer Layer

IMA

GE Data Data Data

…

26Daniel Casini

Predictability

• Many inference frameworks furtherlyparallelize each layer

• The resulting computational activity can berepresented by a Direct Acyclic Graph (DAG)

𝑣3

𝑣4𝑣2

𝑣7𝑣1

𝑣5

𝑣6

• A properly definedtask model shouldalso account fortensors (i.e., memory)exchanged amongnodes

27Daniel Casini

Predictability

• Many inference frameworks furtherlyparallelize each layer

• The resulting computational activity can berepresented by a Direct Acyclic Graph (DAG)

𝑣3

𝑣4𝑣2

𝑣7𝑣1

𝑣5

𝑣6

• A properly definedtask model shouldalso account fortensors (i.e., memory)exchanged amongnodes

Which is a suitable task model to describe and

analyze the temporal behavior of DNNs?

28Daniel Casini

Heterogeneous Platforms

• Timing analysis should also account for theheterogeneity of the underlying hardwareplatform

• DNN execution on FPGA is not yet fullysupported by inference engines

Dynamic partial reconfiguration can be exploited foraccelerating complex layers

Heterogeneous Platform

Multicore CPU GPUReconfigurable

FPGA

29Daniel Casini


• Recently, ad hoc application specificintegrated circuits have been recentlyproduced (e.g., the Tensor Processing Unit byGoogle)

• They can be included in commercialheterogeneous platforms soon

30Daniel Casini


• Recently, ad hoc application specificintegrated circuits have been recentlyproduced (e.g., the Tensor Processing Unit byGoogle)

• They can be included in commercialheterogeneous platforms soon

How to account for novel (highly heterogenous)

computing platforms?

31Daniel Casini

Inference Engines

• DNNs are typically executed by means ofinference engines• Inference engines can affect the execution of

DNNs

Google

32Daniel Casini

Inference Engines

• DNNs are typically executed by means ofinference engines• Inference engines can affect the execution of

DNNs

GoogleHow to account for inference engines that affect

the DNN’s execution?

33Daniel Casini

How Tensorflow works on CPUs?

C

TensorFlow Runtime Environment

• TensorFlow assigns ready nodes to threads of a threadpool (of globally-scheduled threads)

CPU 0

Core 0 Core 1 Core 2 Core 3

TensorFlow Dispatcher

OS Scheduler

Thread Pool

Thread 0 Thread 1 Thread 2 Thread 3

.

.

.

RunQueue...

.

.

.

.

.

.

Work Stealing

Nodes

34Daniel Casini

How Tensorflow works on CPUs?

C

TensorFlow Runtime Environment

• TensorFlow assigns ready nodes to threads of a threadpool (of globally-scheduled threads)

CPU 0

Core 0 Core 1 Core 2 Core 3

TensorFlow Dispatcher

OS Scheduler

Thread Pool

Thread 0 Thread 1 Thread 2 Thread 3

.

.

.

RunQueue...

.

.

.

.

.

.

Work Stealing

Nodes

Best effort, conceived to maximize average-case performance and not to be predictable!

35Daniel Casini

Adding a layer for predictability in TensorFlow

• Extracting a computation model from DNNs, derivingprecedence constraints, computation time of eachnode, memory exchanged, etc.

• Providing a predictable scheduling layer of nodesalso aware of memory accesses

• Designing a partitioning scheme that considersproducer-consumer relationships among nodes, forimproving cache coherency

• Development of analysis techniques to assessschedulability of neural networks

What we are doing?

36Daniel Casini

• Deep Neural Networks represent a promisingtechnique for enacting autonomous driving,but…

• their adoption in safety-critical scenarios presentsmany issues

• We focused on:• Certificability

• Security and Isolation

• Predictability

Summary and conclusions

37Daniel Casini

• Deep Neural Networks represent a promisingtechnique for enacting autonomous driving,but…

• their adoption in safety-critical scenarios presentsmany issues

• We focused on:• Certificability

• Security and Isolation

• Predictability

Summary and conclusions

There is still a lot of work to do…

Let’s start!

Thank you!Daniel Casini

[email protected]

Deep Neural Networks for Safety-Critical Applications ...

Documents

Transcript of Deep Neural Networks for Safety-Critical Applications ...