The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired –...

33
1 | ©2020 Storage Networking Association. All Rights Reserved. The Impact of Artificial Intelligence on Storage and IT A SNIA EMEA Webcast

Transcript of The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired –...

Page 1: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

1 | ©2020 Storage Networking Association. All Rights Reserved.

The Impact of Artificial Intelligence on Storage and ITA SNIA EMEA Webcast

Page 2: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

2 | ©2020 Storage Networking Association. All Rights Reserved.

Today’s Presenters

Alex McDonaldChair, SNIA EMEA

NetApp

Glyn BowdenChief Architect, AI & Data Science

PracticeHPE

Paul TalbutGeneral Manager, SNIA EMEA

Page 3: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

3 | ©2020 Storage Networking Association. All Rights Reserved.

SNIA Legal Notice

§ The material contained in this presentation is copyrighted by the SNIA unless otherwise noted. § Member companies and individual members may use this material in presentations and

literature under the following conditions:§ Any slide or slides used must be reproduced in their entirety without modification§ The SNIA must be acknowledged as the source of any material used in the body of any document containing material from

these presentations.

§ This presentation is a project of the SNIA.§ Neither the author nor the presenter is an attorney and nothing in this presentation is intended to

be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney.

§ The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information.

NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK.

Page 4: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

4 | ©2020 Storage Networking Association. All Rights Reserved.

SNIA-At-A-Glance

Page 5: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

5 | ©2020 Storage Networking Association. All Rights Reserved.

Agenda

§What is Intelligence, Artificial Intelligence and Machine Learning?§ The anatomy of an AI / Analytics Solution§Building the AI Stack

Page 6: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

6 | ©2020 Storage Networking Association. All Rights Reserved.

Pursuit to teach computers to develop

similar mental capabilities

What is intelligence?

§ Intelligence is a person’s mental capability to perceive, reason, act, learn quickly, and solve problems (among other things)

KNOWLEDGE

INTERACTING

LEARNING

REASONING

PERCEPTION

PROBLEM SOLVING

MANIPULATE, SENSE, AND MOVE OBJECTS

Page 7: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

7 | ©2020 Storage Networking Association. All Rights Reserved.

What is Artificial Intelligence?

§Definition§ The ability of computer systems to perform tasks that normally require

human intelligence, such as visual perception, speech recognition, decision-making, and translation between languages

– Example– Turing Test: Inability to distinguish computer

responses from human responses

Page 8: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

8 | ©2020 Storage Networking Association. All Rights Reserved.

History of AI§ AI has been around for more than 50 years§ The term AI was introduced in 1956 by John McCarthy, an American computer scientist§ The growth and adoption of Machine Learning and Deep Learning have made AI real

Evolution of Artificial Intelligence

Page 9: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

9 | ©2020 Storage Networking Association. All Rights Reserved.

AISupervised Learning

Reinforcement Learning

Unsupervised Learning

Semi-Supervised

Learning

Many Approaches to Analytics & AINo One size fits all…

Machinelearning

Deep learning

RegressionClassification

ClusteringDecision Trees

Data Generation

Image ProcessingSpeech Processing

Natural Language ProcessingRecommender Systems

Adversarial Networks

Page 10: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

10 | ©2020 Storage Networking Association. All Rights Reserved.

Why is Everyone Talking about AI Now?

Actionable data

Enormous amount of data of all types

Critical for machines to learn

Algorithms

Availability of New Algorithms, Neural

Network Research and Software

Compute power

GPU based computing with super compute power

Relatively low cost

Page 11: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

11 | ©2020 Storage Networking Association. All Rights Reserved.

What should be done (Prescriptive)

What will happen (Predictive)

What is happening (Monitoring)

Why did it happen (Diagnostic)

From Descriptive to Prescriptive

What happened (Descriptive)

Value & Benefits

Proactive

Competitive Advantage

Reactive

Page 12: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

12 | ©2020 Storage Networking Association. All Rights Reserved.

Event Streams Simplify Data Pipelines

DATA PIPELINE OPTIMIZATION

Capture Your Data At or Near the SourceFilter for Specific Event Information

Take any immediate actions necessary

Aggregate By Time IntervalJoin with Other Data Sources

Analyze with AI/ML, and Analytics Tools

Analytics & AI/ML

Capture & Filter

Aggregate by Time Interval

Joinother

Sources

Page 13: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

13 | ©2020 Storage Networking Association. All Rights Reserved.

Unify End-to-End Data Pipeline for AI

IoTEdge processing

of data in motion

Data is– Acquired – Cached and stored locally– Applied with rules and

analytic models– Queued, routed and

orchestrated

Fast dataCore processing

of data in motion

Data is– Ingested – Restructured, enriched– Persisted for real-time

usage and offline analytics– Applied with rules and

analytic models

Big dataAnalysis of data at rest

Data is– Hosted in data lakes– Transformed and

restructured – Aggregated– Structured by rules/models – Prepared for DL

AIDeep learning/

machine learning

Data– Trains and builds analytic

models– Creates test models

Business systems

Analytic models

Primarily Core Data CenterPrimarily

Edge / Distributed

Page 14: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

14 | ©2020 Storage Networking Association. All Rights Reserved.

Training Model

The Anatomy of AI Solutions

Page 15: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

15 | ©2020 Storage Networking Association. All Rights Reserved.

Training Model

Training Data

Test Data

Raw Archive Data

Transform / Clean Data Processing

Data Science Workbench / Tooling

Primarily Core Data Center

The Anatomy of AI Solutions

Business Systems

Page 16: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

16 | ©2020 Storage Networking Association. All Rights Reserved.

Training Model

Training Data

Test Data

Live Data

Raw Archive Data

Transform / Clean Data Processing

Data Transport Transform / Clean Data Processing

Inference Model

Activity Function

Visualization Function

Data Science Workbench / Tooling

Model Packaging / OptimisationModel Distribution

Data Pipeline Config Management

Primarily Core Data Center

Primarily Edge / Distributed

The Anatomy of AI Solutions

Business Systems

Business Systems

Page 17: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

17 | ©2020 Storage Networking Association. All Rights Reserved.

Training Model

Training Data

Test Data

Live Data

Raw Archive Data

Transform / Clean Data Processing

Data Transport Transform / Clean Data Processing

Inference Model

Activity Function

Visualization Function

Data Science Workbench / Tooling

Model Packaging / OptimisationModel Distribution

Data Pipeline Config Management

Primarily Core Data Center

Primarily Edge / Distributed

The Anatomy of AI Solutions

Business Systems

Business Systems

Traditional File Archive

Streaming Data

Sources

TradNew/NoSQLBiz Process Shared Central Data Lake

SDS or HCI eg. HDFS, CEPH, NFS, Object, etc.

Caching or Streaming layer SSD

eg. Cache, Redis, Kafka, NiFi etc.

Caching or Streaming layer… SSD eg. Cache, Redis, Kafka, NiFi etc.

Local Data Laketypically

Hyperconverged (HCI) or Converged

TradNew/NoSQLBiz Process

Page 18: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

18 | ©2020 Storage Networking Association. All Rights Reserved.

Building the AI StackCompute Hardware§ CPUs

§ Traditional source of raw compute§ Used for both training & inference§ Hybrids with other technologies

§ GPUs§ High speed floating point hardware§ De facto for AI training

§ ASICs & TPUs§ Application Specific Integrated Circuit§ Reduced precision FP (for training) or integer (for inference) operations

§ FPGAs § Field Programmable Gated Arrays§ Effective for inference§ Reprogrammable

Page 19: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

19 | ©2020 Storage Networking Association. All Rights Reserved.

Software Frameworks

19

Page 20: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

20 | ©2020 Storage Networking Association. All Rights Reserved.

AI Stack

GPUs, TPUs, FPGAs§ Optimized hardware to provide tremendous

speed-up for training, sometimes inference§ More easily available on cloud for rentModern Compute

Page 21: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

21 | ©2020 Storage Networking Association. All Rights Reserved.

AI Stack

Tensorflow, PyTorch, Caffe2, MxNet, CNTK, Keras, Gluon§ Library that implements algorithms, provides

execution engine and programming APIs§ Used to train and build sophisticated models,

and to do predictions based on the trained model for new data

Software

Modern Compute

Page 22: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

22 | ©2020 Storage Networking Association. All Rights Reserved.

AI Stack

Laptop, Cloud compute instances, H2O Deep Water, Spark DL pipelines, HPE Container Platform & MLOps§ Hardware accelerated platforms, supporting

common software frameworks, to run the training and/or inference of deep neural networks

§ Typically optimized for a preferred software framework

§ Can be hosted on-premises or cloud § Also offered as fully-managed service (PaaS)

by cloud vendors like Amazon SageMaker, Google Cloud ML, Azure ML

Platform

Software

Modern Compute

Page 23: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

23 | ©2020 Storage Networking Association. All Rights Reserved.

AI Stack

Amazon Rekognition, Lex & Polly; Google Cloud API; Microsoft Cognitive Services;§ Allows query based service access to

generalizable state of art AI models for common tasks

§ Ex: send an image and get object tags as result, send mp3 and get converted text as result and so on

§ No dataset, no training of model required by user§ Per-call cost model§ Integrated with cloud storage and/or bundled into

end-to-end solutions and AI consultancy offerings like IBM Services Watson AI, ML & Cognitive consulting, Amazon's ML Solutions Lab, Google's Advanced Solutions Lab

API-based service

Platform

Software

Modern Compute

Page 24: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

24 | ©2020 Storage Networking Association. All Rights Reserved.

Dataset Transform – ImageNet Example

§ Raw data vs TFRecords§ Raw data is converted into packed binary format for training called TFRecord (One

time step)§ 1.2 M image files are converted into 1024 TFRecords with each TFRecord 100s of MB in size

0100020003000400050006000700080009000

10000

1 8 68 142

216

290

364

438

512

586

660

734

808

882

956

1,63

8

FREQ

UEN

CY

SIZE (IN KB)

0

20

40

60

80

100

120

126 128 130 132 134 136 138 140 142 144 146 148 150 152 154 158

FREQ

UEN

CY

SIZE (IN MB)

Raw ImageNet Data

TFRecord ImageNet Data

Page 25: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

25 | ©2020 Storage Networking Association. All Rights Reserved.

TensorFlow Data Pipeline

1. IO: Read data from persistent storage

2. Prepare: Use CPU cores to parse and preprocess data§ Preprocessing includes Shuffling, data

transformations, batching etc.

3. Train: Load the transformed data onto the accelerator devices (GPUs, TPUs) and execute the DL model

Read IO Prepare Train

Storage

Network CPU/RAM

GPU

PCIe/NVLink GPU

PCIe/NV

Link

TFRecords

Host

Page 26: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

26 | ©2020 Storage Networking Association. All Rights Reserved.

Compute Pipelining

§Without pipelining

§With Pipelining (using prefetch API)

Image source: https://www.tensorflow.org/guide/

Page 27: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

27 | ©2020 Storage Networking Association. All Rights Reserved.

Parallelize IO and Prepare Phase

§ Parallelize IO

27

§ Parallelize prepare

Image source: https://www.tensorflow.org/guide/

Page 28: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

28 | ©2020 Storage Networking Association. All Rights Reserved.

Research Directions in AI

Academic:§ Using DL to replace heuristics-based

decision within systems software, or even data structures

§ Systems and platforms for DL§ Practical engineering optimizations to

improve DL process/lifecycle/performance§ Workload and benchmarking§ Other areas like security, privacy, power

etc.Industry:

§ Google Brain: hardware, AutoML§ FAIR: vision, video, AR§ Apple: speech & vision on-device

Page 29: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

29 | ©2020 Storage Networking Association. All Rights Reserved.

New Use Cases & Workloads Are Changing “Storage”

Scientific Apps

Machine Data

Analytics + Log

Image/video Processing Build

TestContainers

Global Repository

ERPCRM

Cold Data BackupHome

Dir

OLTP

VMs

Perf

Capacity

Delivered in a cloud-scale architecture with commodity economics across public and private clouds.

Workloads

▪ Legacy Operational apps▪ Media & Entertainment▪ Image Processing▪ Analytics & Log Processing▪ Container based apps

Capabilities

▪ Global Namespace▪ Multi-temperature store▪ Extreme Scale & Reliability▪ Guaranteed Performance▪ Agile Platform for

heterogeneous data types▪ Integrated Analytics

Page 30: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

30 | ©2020 Storage Networking Association. All Rights Reserved.

Summary

§AI is more than just training models!§Data is no longer single purpose, but purpose+§We need to simplify and unify data treatment§Move to Shared Data Storage Architecture§Unify Data Life Cycle §Unify End-to-End Data Pipeline §Pipelines are the new SAN

Page 31: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

31 | ©2020 Storage Networking Association. All Rights Reserved.

Additional Resources

§Presentation: Customer Support through Natural Language Processing and Machine Learning

https://youtu.be/u1iRvWzMioM§Presentation: Introducing the AI/ML and Genomics Workloads from

the SPEC Storage Subcommitteehttps://youtu.be/47pmqFXYi-4

§White Paper: Is Your Storage Ready for AI?§https://www.intel.com/content/www/us/en/products/docs/storage/are-you-ready-for-ai-tech-brief.html

§Article: Want optimized AI? Rethink your storage infrastructure and data pipeline§https://venturebeat.com/2020/01/16/want-optimized-ai-rethink-your-

storage-infrastructure-and-data-pipeline/

Page 32: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

32 | ©2020 Storage Networking Association. All Rights Reserved.

After This Webcast

§Please rate this webcast and provide us with feedback§ This webcast and a PDF of the slides will be posted to the SNIA

Cloud Storage Technologies Initiative website and available on-demand at https://www.snia.org/forum/csti/knowledge/webcasts

§A Q&A from this webcast will be posted to the SNIA Cloud blog: www.sniacloud.com/

§ Follow us on Twitter @SNIACloud

Page 33: The Impact of Artificial Intelligence on Storage and IT · 2020-05-11 · Data is – Acquired – Cached and stored locally – Applied with rules and analytic models – Queued,

33 | ©2020 Storage Networking Association. All Rights Reserved.

Thank You!