NIST Big Data for and Beyond · 2017-06-02 · 2nd NBD‐PWG Workshop, Wo Chang, NIST/ITL...

NIST Big Data Reference Architecture for Analytics and Beyond Wo Chang Digital Data Advisor [email protected] June 2, 2017




2nd NBD‐PWG Workshop, Wo Chang, NIST/ITL

Agenda

• Proposition: Assume NBD-RA is Established, then What?

• Revisit Enterprise Computing

• What’s the Computing Infrastructure Trend?

• What are the Big Data Architecture/Infrastructure Challenges?

• Goals for Big Data Analytics and Beyond

• Complement to IEEE Big Data Initiative & Direction


Proposition: Assume NBD-RA Established, then What?

Data Sources: Sensors, Simulations, Modeling, etc.

Data Consumers: End users, Repositories, Systems, etc.

Data Scientist

BDRA Interface

Resource Management/Monitoring, Analytics Libraries, etc.

BDRA Ecosystem Components

Computing Resources

Analytics Resources

Distributed File System Services

Infrastructure Services

Database Services

Data Types Services

Support Infrastructure

Value-added Content Services

Security and Privacy Services

Visualization & BI Services

Analytics Services

Analytics Application


Revisit Enterprise Computing

Enterprise computing is sometimes sold to business users as an entire platform that can be applied broadly across an organization and then further customized by users within each area. This means the analytics, reporting, database management and other applications are standard across the system, while the application packages being used and the data being accessed in each area will be different. In this sense, enterprise computing is a departure from finding single software solutions to specific business problems, such as inventory or accounting software. Instead, enterprise computing is intended to offer integrated solutions to these problems.

Source: https://www.techopedia.com/definition/27854/enterprise-computing


What’s the Computing Infrastructure Trend?

• Distributed Computing: Remote Procedure Calls (RPC), Common Object Request Broker Architecture (CORBA), Grid Computing, Others…

• Virtualization: Partial Virtualization, Full Virtualization, Paravirtualization, Others…

• Cloud Computing: Software as a Service, Platform as a Service, Infrastructure as a Service, Others…

• Containerization: Microservices, Orchestrations, Others…


Big Data Architecture/Infrastructure – Challenges (Computing Stack)


Big Data Architecture/Infrastructure – Challenges (Analytics Stack)

Source: http://1.bp.blogspot.com/-PKiTQa0mrn4/T_mGb6AI3yI/AAAAAAAAA8Q/TtH7xyjQ3FA/s640/analytics+tools+landscape.bmp


Big Data Architecture/Infrastructure – Challenges (Applications Stack)

Source: http://1.bp.blogspot.com/-PKiTQa0mrn4/T_mGb6AI3yI/AAAAAAAAA8Q/TtH7xyjQ3FA/s640/analytics+tools+landscape.bmp

Analytics Engine

• Internet of Things

• Electronic Health Records

• Life Science

• Social Media

• Others…


Big Data Architecture/Infrastructure – Challenges (Integration)

Source: https://www.interana.com/wp-content/uploads/2017/03/Big-Data-Landscape-2016-v18-FINAL.jpg


Goals for Big Data Analytics and Beyond

Enable Big Data analytics tools for interoperability, portability, reusability, and extensibility.

Practical Aspect: Analytics tools can be reusable, deployable, and operational (max. use of resources) [an approach similar to HTML, the PCI bus, etc.]

Laptop, Desktop/Server, Data Center, Cloud, …

Many CPUs/Cores/GPUs/FPGAs/ASICs/accelerators
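One way to picture the portability goal is an analytics tool written once and run unchanged on different execution environments. The sketch below is only an illustration under stated assumptions: the `Backend` protocol, the two backend classes, and `count_tokens` are hypothetical names invented here, not part of any NIST or IEEE specification.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, Protocol


class Backend(Protocol):
    """Hypothetical execution backend (an assumption, not a standard API)."""

    def map(self, fn: Callable[[str], int], items: Iterable[str]) -> List[int]: ...


class SerialBackend:
    """Stands in for a laptop: processes one item at a time."""

    def map(self, fn, items):
        return [fn(item) for item in items]


class ThreadedBackend:
    """Stands in for a multi-core server: items go to a worker pool."""

    def __init__(self, workers: int = 4) -> None:
        self.workers = workers

    def map(self, fn, items):
        with ThreadPoolExecutor(max_workers=self.workers) as pool:
            return list(pool.map(fn, items))


def count_tokens(line: str) -> int:
    """The analytics tool itself; it knows nothing about where it runs."""
    return len(line.split())


def total_tokens(backend: Backend, lines: Iterable[str]) -> int:
    """Run the same tool on whichever backend is supplied."""
    return sum(backend.map(count_tokens, lines))


lines = ["big data reference architecture", "analytics and beyond"]
serial_total = total_tokens(SerialBackend(), lines)
threaded_total = total_tokens(ThreadedBackend(), lines)
```

Both calls return the same total because the tool depends only on the small `map` contract, which is the kind of separation a standard interface would aim to provide.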


Complement to IEEE Big Data Initiative & Direction

Enable data scientists, engineers, researchers, etc. to increase productivity and enhance quality in data science through standard, modularized Big Data Analytics tools.

[Diagram labels]

Data Scientists

Data Provider

Big Data Application Provider: Collection, Preparation, Analytics, Visualization, Access

Big Data Framework Provider:

• Processing: Computing and Analytic (Batch, Streaming, Interactive)

• Platforms: Data Organization and Distribution (Indexed Storage, File Systems)

• Infrastructures: Networking, Computing, Storage (Virtual Resources, Physical Resources)

• Messaging/Communications

• Resource Management
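The five Big Data Application Provider activities named on this slide (Collection, Preparation, Analytics, Visualization, Access) can be read as modular stages that hand data to one another. The following is a minimal sketch under that reading, not an implementation from the deck: the function bodies, the sample readings, and the text-only "visualization" are all assumptions chosen to keep the example small.

```python
from statistics import mean
from typing import Dict, List

# Hypothetical sensor readings; the values are made up for illustration.
RAW = ["21.5", "22.0", "bad", "24.0"]


def collection() -> List[str]:
    """Collection: obtain raw records from a Data Provider."""
    return list(RAW)


def preparation(records: List[str]) -> List[float]:
    """Preparation: drop records that do not parse as numbers."""
    cleaned = []
    for record in records:
        try:
            cleaned.append(float(record))
        except ValueError:
            continue
    return cleaned


def analytics(values: List[float]) -> Dict[str, float]:
    """Analytics: compute simple summary statistics."""
    return {"count": len(values), "mean": mean(values), "max": max(values)}


def visualization(summary: Dict[str, float]) -> str:
    """Visualization: render the summary as a line of text."""
    return ", ".join(f"{key}={value:g}" for key, value in summary.items())


def access(rendered: str) -> str:
    """Access: hand the rendered result to a consumer."""
    return rendered


report = access(visualization(analytics(preparation(collection()))))
```

Because each stage only depends on the output type of the previous one, any single stage could be swapped for a different module without touching the others.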