BIG DATA MODELING - Buckeye DAMA


BIG DATA MODELING


AGENDA

1. Big Data

2. Data Modeling

3. Big Data Modeling


Session Objectives

• Big Data Fundamentals
  – Components of Big Data
  – Structure & Schemas
  – Tools & Architecture

• Data Modeling
  – Integration & History
  – Data Warehousing & BI
  – Conceptual to Physical

• Big Data Modeling
  – Focus on Meaning

• Ensemble Modeling
  – The Blended Architecture


BIG DATA


Big Data

“Huge” Data Volumes

n-Structured & Very Complex

Streaming & Shape-Shifting

[Figure: Typical Data compared with Big Data, with callouts A, B, C]


Big Data

• Volume: Huge Volumes of Data

• Velocity: Drinking from a Fire Hose

• Variety: n-Structured Data

• Veracity: Quality, Accuracy, Reliability, Trustworthiness

• Value: Business Value and Value Potential


Modeling and Understanding

• Schema on Write

• Schema on Read

• Dismantled Schema on Write

• Schema on Focus

• Schema on Leverage
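The first two items in the list above can be illustrated with a minimal Python sketch. It assumes the common reading of the terms: schema on write shapes and validates records at load time, while schema on read stores the raw payload and applies structure only when the data is queried. The `SCHEMA` mapping, the function names, and the sample records are hypothetical and not taken from the slides; the remaining three variants are not sketched because the slide does not define them.

```python
import json

# Hypothetical schema for illustration: field name -> required Python type.
SCHEMA = {"customer_id": int, "region": str}


def load_schema_on_write(raw_records, schema=SCHEMA):
    """Schema on write: validate and shape every record before it is stored."""
    store = []
    for raw in raw_records:
        record = json.loads(raw)
        row = {}
        for field_name, field_type in schema.items():
            if field_name not in record:
                # Records that do not fit the model are rejected at load time.
                raise ValueError(f"missing field: {field_name}")
            row[field_name] = field_type(record[field_name])
        store.append(row)
    return store


def load_schema_on_read(raw_records):
    """Schema on read: store raw payloads as-is; nothing is modeled at load time."""
    return list(raw_records)


def read_with_schema(store, schema=SCHEMA):
    """Apply the schema when reading; fields absent from a record are skipped."""
    for raw in store:
        record = json.loads(raw)
        yield {name: typ(record[name]) for name, typ in schema.items() if name in record}


# Hypothetical sample data: the second record does not match the schema.
raw = [
    '{"customer_id": "7", "region": "OH"}',
    '{"customer_id": 8, "clicks": [1, 2]}',
]
```

With this sample, `load_schema_on_write(raw)` raises on the second record, whereas `load_schema_on_read(raw)` accepts both and leaves the interpretation to `read_with_schema`.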

[Figure: LOAD, MODEL, APPLY, EXPLORE]


Modeling and Understanding

• Big Data Possibilities

[Figure: LOAD, MODEL, APPLY, EXPLORE]


Inconvenient Truth about BIG DATA

http://community.embarcadero.com/blogs/entry/the-hidden-elephant-in-big-data-modeling


DATA MODELING


Data Modeling

Man's Search for Meaning…

• Conceptual Modeling

• Logical Modeling

• Information Modeling

• Physical Data Modeling


Three Paths for Modeling

Structured / Known

• CBC

• NBR

• Attribution

• Columns

Results in a backbone model with attributes in defined columns

N-Structured / NVP

• CBC

• NBR

• Attribution

Results in a backbone model with known/expected attribute names/tags

N-Structured / KVP

• CBC

• NBR

Results in a backbone model with capacity to capture unknown attribution either named/tagged or not

CBC: Core Business Concepts

NBR: Natural Business Relationships
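A rough Python sketch of the three paths, using a single core business concept (Customer) as the backbone. It assumes NVP stands for name-value pair and KVP for key-value pair, which the slide does not spell out. All class names, field names, and the `EXPECTED_TAGS` set are hypothetical, and natural business relationships are omitted for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class StructuredCustomer:
    """Path 1 (Structured / Known): attributes sit in defined columns."""
    customer_key: str  # backbone: identifies the core business concept
    name: str
    region: str


# Hypothetical set of attribute names that the model knows to expect.
EXPECTED_TAGS = {"name", "region"}


@dataclass
class NvpCustomer:
    """Path 2 (N-Structured / NVP): attributes kept as pairs with known/expected names."""
    customer_key: str
    attributes: dict = field(default_factory=dict)

    def set_attribute(self, name, value):
        if name not in EXPECTED_TAGS:
            raise KeyError(f"unexpected attribute name: {name}")
        self.attributes[name] = value


@dataclass
class KvpCustomer:
    """Path 3 (N-Structured / KVP): backbone plus capacity for unknown attribution."""
    customer_key: str
    payload: list = field(default_factory=list)

    def capture(self, value, tag=None):
        # The tag is optional: unknown attribution may arrive named/tagged or not.
        self.payload.append((tag, value))


structured = StructuredCustomer("C-1", "Acme", "OH")

nvp = NvpCustomer("C-1")
nvp.set_attribute("region", "OH")

kvp = KvpCustomer("C-1")
kvp.capture({"clicks": [1, 2]})            # untagged, shape unknown in advance
kvp.capture("gold", tag="loyalty_tier")    # tagged, but not modeled beforehand
```

In all three classes the backbone (`customer_key`) stays the same; only the way attribution hangs off it changes.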


APPLYING THE ENSEMBLE

Integration across Platforms


Logical business model

• Leveraged for all logical model needs including the data warehouse, big data lake, master data management (MDM) and operational integration initiatives

• Closely aligned to DV physical model

Ensemble Logical Form (ELF)

[Figure: three Ensemble Logical Form diagrams, each showing the entities Customer, Region, Store, Sale, Vendor, Product, Sale LI, Employee]


Summary

Ensemble in the Big Data World

• Conceptual Modeling

• Logical Modeling

• Information Modeling

• Physical Data Modeling

• Integration Platform



About Data Vault Ensemble

Estimated 1,400+ Data Vault-based data warehouses around the world