Zen of metadata 09212010

The Zen of Metadata Scott Northrup Data Architect September 21, 2010

Transcript of Zen of metadata 09212010

Page 1: Zen of metadata 09212010

The Zen of Metadata

Scott Northrup

Data Architect

September 21, 2010

Page 2: Zen of metadata 09212010

PAGE 1

biography

I work for CCSF DPW

(City and County of San Francisco

Department of Public Works)

I’m a DA

(that’s Data Architect)

with no particular credentials

except 30+ years experience in database design

Page 3: Zen of metadata 09212010

PAGE 2

HomERwin

Page 4: Zen of metadata 09212010

PAGE 3

biography continued

• Prior roles

– Analyst – business & operational

– Programmer – at least 6 languages

– Network tech

– Project manager – SDLC

– Data modeler – logical & physical

• Current roles

– Data Architect for Data Governance and MDM

– Metadata manager – data dictionary

– Data standards admin

Page 5: Zen of metadata 09212010

PAGE 4

agenda

– Intro and environment

– DATA

– DATA and METADATA

– Business Requirements

– Metadata types and philosophies

– CIA’s compilation and usage of Metadata

– Some examples and caveats

© 2010 Wells Fargo Bank, N.A. All rights reserved.

Page 6: Zen of metadata 09212010

PAGE 5

ZEN…

…emphasizes experiential wisdom, discernment, or understanding, particularly as realized in the form of meditation for the attainment of enlightenment…

??? Inmon or Kimball ???

who cares…

! BEST PRACTICES !

Page 7: Zen of metadata 09212010

PAGE 6

application environment

• Teradata

• ERwin

• E/R Studio

• Ab Initio

• Microstrategy

• SQL Server

Page 8: Zen of metadata 09212010

PAGE 7

data environment

• Data Warehouse

– 70 databases from 62 source systems

– DSL – Data Source Layer

• 62 databases

– DIL – Data Integration Layer (Data Mart)

• 3 databases

– DAL – Data Application Layer

• 5 databases

– Aggregates & Composites

Page 9: Zen of metadata 09212010

PAGE 8

classic rock

A little bit of data is better than no data

Even the bad data is better than no data

And any kind of data is better than no data at all

Inspired by BJ Thomas, “No Love At All”

Page 10: Zen of metadata 09212010

PAGE 9

data is…

DATA

Page 11: Zen of metadata 09212010

PAGE 10

KING… of OUR world !

DATA

Page 12: Zen of metadata 09212010

PAGE 11

data “analysis”

What does a BYTE of data cost in your organization (and does anyone know or care)?

WHAT is the ROI of your data usage? (Alton Brown)

HOW can you leverage your data if you don’t really know what it is?

WHO cares most about any given piece of data?

Page 13: Zen of metadata 09212010

PAGE 12

data management

Data Governance

Metadata Quality

Analysis Design Load

Data Acquisition

Page 14: Zen of metadata 09212010

PAGE 13

key components

• MDM

– Master Data Management (-OR- Master Data Mess)

• Metadata management

– robust metadata with acceptable content & presentation

• Data quality and governance

– EVERY data element needs a steward

Page 15: Zen of metadata 09212010

PAGE 14

so… what is?

DATA about DATA

Page 16: Zen of metadata 09212010

PAGE 15

co-existence of data and metadata

Page 17: Zen of metadata 09212010

PAGE 16

co-existence of data and metadata

Business Card Request

Joe Camel

4321 Puff St.

El Nicotine, CA

415–555–1212

Mascot

[email protected]

Page 18: Zen of metadata 09212010

PAGE 17

co-existence of data and metadata

Business Card Request

Name Joe Camel

Location 4321 Puff St.

El Nicotine, CA

Phone 415–555–1212

Title Mascot

Email [email protected]
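
The contrast between the two cards can be sketched in a few lines of Python. This is only an illustration: the names `values`, `labels`, and `card` are assumptions, and the email field is left out because the address is not legible in this transcript.

```python
# Raw values as they appear on the unlabeled card (data only).
values = ["Joe Camel", "4321 Puff St., El Nicotine, CA", "415-555-1212", "Mascot"]

# Field labels from the labeled card (metadata: data about the data).
labels = ["Name", "Location", "Phone", "Title"]

# Pairing each label with its value turns an ambiguous list into a record.
card = dict(zip(labels, values))

for label, value in card.items():
    print(f"{label:<9}{value}")
```

Without the labels, nothing in the list says whether "Mascot" is a title, a nickname, or a company name; the labels carry that meaning.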

Page 19: Zen of metadata 09212010

PAGE 18

validating business requirements

Defining and Planning a Project

• Objectives – General statements of what we intend to accomplish

• Problems – Issues that may arise that are under our control

• Opportunities – Factors that may enhance our chances of success

• Goals – Timed and measurable milestones

• Activities – Manual and automated processes and procedures

• Metrics – The performance measures we use to assess our progress

• Operational Data – Data used and produced by manual and automated activities

• Business Requirements – Information to be included in data models

• Constraints – Issues that may arise that are beyond our influence

• Purpose and Scope – What the project is intended to accomplish; what’s “in” and “out”

• Risks – Uncertainties that might unfavorably impact our effort

Diagram labels: Process, Data, Business, Systems, Technology, Organization, Physical

Page 20: Zen of metadata 09212010

PAGE 19

INTRODUCTION

As CIA is responsible for maintaining a Data Warehouse, we do not create new sources of data, per se. We extract data from other source systems and load it into the warehouse. We may manipulate some of the source data, which could result in “new” warehouse data.

To that end, designing a database in the warehouse requires a somewhat different approach than designing a database from scratch to support an application.

To aid in our design, we should understand things from these perspectives:

• What the data functionality and make-up of the source system is

– What is the functional data flow, content and meaning

– What is the layout/design

• Why we want to have this data included in the warehouse

– What business need will this support (high-level)

– What business question(s) will this answer (detail)

• How the data will be added to our warehouse or how it may replace or change existing data in our warehouse

– Will this be a new source

– Can it be integrated into an existing database

– How does it differ from other similar data sources

– Are there any downstream effects

• How we can best make this data available to the widest possible audience

– How does it relate to the overall architecture

– What standards will be applied

– What is the level of granularity - are composites or aggregates needed

• How the data might be integrated with our datamart layer

– Is it “Partyable”

– Can it incorporate, or be incorporated into, any existing objects (reference tables, etc.)

JOINT REQUIREMENTS DISCOVERY (JRD)

I. DEFINING THE PROJECT (and Scope)

A. Project Purpose

B. Project Scope

D. Risk Assessment

E. Roles and Responsibilities

II. DESCRIBING THE CURRENT SITUATION - Source System

A. Systems and Procedural Analysis

B. Data Analysis

C. Technical and Environmental Analysis

III. GAP and/or IMPACT ANALYSIS - Data Warehouse

A. Current Systems and Procedures

B. Data Impacts

C. Technical and Environmental Impacts

IV. SPECIFYING BUSINESS REQUIREMENTS

A. Business Process Analysis

1. Process Analysis

a) How will we leverage existing documentation or diagrams

2. Process Analysis Deliverables

B. Business Data Analysis

1. Data Analysis

a) What user output is needed (Views, Reports, Forms)

2. Data Analysis Deliverables

V. PLANNING the PROJECT

A. Resource allocation

joint requirements discovery

Page 21: Zen of metadata 09212010

PAGE 20

metadata management (80/20 principle)

Business Policies | Information Policies | Business Rules

Information Usage | Information Quality | Access Controls

Entity Models | Relational Tables | Master Objects

Critical Elements | Data Formats | Alias & Synonyms

Conceptual & Value Domains | Reference Tables | Mapping & Lineage

Business Terms & Concepts | Definitions | Semantics

Layer labels: Reference Metadata, Business Definitions, Business Metadata, Data Governance, Information Architecture, Data Elements

Page 22: Zen of metadata 09212010

PAGE 21

metadata types

• Bretherton & Singley

– Structural metadata is used to describe the structure of computer systems such as tables, columns and indexes

– Guide metadata is used to help humans find specific items and is usually expressed as a set of keywords in a natural language

• Kimball

– Business metadata describes data content (data dictionary, maps, rules)

– Technical metadata describes objects and processes (sources, systems)

– Process metadata describes results of operations (audits, outputs)

• NISO

– Descriptive metadata is the information used to search and locate an object such as title, author, subjects, keywords, publisher

– Structural metadata gives a description of how the components of the object are organised

– Administrative metadata refers to the technical information including file type. Two sub-types of Administrative metadata are rights management metadata and preservation metadata

Page 23: Zen of metadata 09212010

PAGE 22

Kimball - DW architecture metadata worksheet

TABLE metadata

Metadata Item | Short Description | Possible Values

Description | A short description of the table | Free text

Is Audit Subsystem | Is this table part of the audit subsystem? | Y or N

Display Name | The name displayed on the screen | Free text

Table Type | The type of table | Free text

Used In Schemas | The schemas that the table is used in | Free text

View Name | The name of the single table view | Free text

COLUMN metadata

Metadata Item | Short Description | Possible Values

Description | Short description of the column | Free text

Display Name | The name of the column as displayed on the screen | Free text

Example Values | Example values stored in the column | Free text

Is Audit Subsystem | Is this column part of the audit subsystem? | Y or N

Is Key | What type (if any) of key this column is | PK (Primary Key), SK (Surrogate Key), or FK (Foreign Key)
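
One way to hold the COLUMN worksheet in code is a small record type that checks the listed possible values. The sketch below is an assumption-laden illustration, not part of Kimball's worksheet: the class name `ColumnMetadata`, its attribute names, and the `problems` method are all hypothetical, as are the display names and example values in the usage lines.

```python
from dataclasses import dataclass
from typing import List, Optional

VALID_KEY_TYPES = {"PK", "SK", "FK"}  # possible values listed on the worksheet


@dataclass
class ColumnMetadata:
    """One row of the COLUMN worksheet; attribute names are illustrative."""
    description: str
    display_name: str
    example_values: str
    is_audit_subsystem: str          # "Y" or "N"
    is_key: Optional[str] = None     # "PK", "SK", "FK", or None when not a key

    def problems(self) -> List[str]:
        """Return the worksheet rules this entry violates (empty if none)."""
        found = []
        if self.is_audit_subsystem not in {"Y", "N"}:
            found.append("Is Audit Subsystem must be Y or N")
        if self.is_key is not None and self.is_key not in VALID_KEY_TYPES:
            found.append("Is Key must be PK, SK, or FK")
        if not self.description.strip():
            found.append("Description is empty")
        return found


# Hypothetical entries: one that follows the worksheet, one that does not.
good = ColumnMetadata("The system generated identifier for an account",
                      "Account ID", "1001, 1002", "N", "PK")
bad = ColumnMetadata("", "Status", "A, C", "maybe", "UK")

print(good.problems())
print(bad.problems())
```

Keeping the allowed values next to the record makes a free-text worksheet checkable before it is loaded into a repository.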

Page 24: Zen of metadata 09212010

PAGE 23

metadata benefits

– Information can be viewed in an understandable context

– Better usage of the organization’s data assets

– Preservation of corporate knowledge and business rules

– Improved impact and gap analysis (new or existing systems)

– Facilitation of standards and reusability

– Decrease in data redundancy and storage

– Easier transitions or interactions with new platforms

– Assists in answering security and regulatory questions

– Employees/contractors can get up to speed more quickly

– Potential reduction in costs for equipment and manpower

– Data becomes USEFUL INFORMATION

– Data becomes a TRUE ASSET

Page 25: Zen of metadata 09212010

PAGE 24

metadata matrix

PLANVIEW NUMBER: ___________________________________ 0

DELIVERABLE (column headings)

1. Collect/update business definitions

2. Create and publish source-to-target mapping

3. Review / verify business definitions

4. Update ERwin Model

5. Update and publish database map

6. Update MDS using ERwin ==> MIMB ==> MDS process

7. Confirm Successful Delivery of Metadata

8. Send internal notification to developers

9. Publish data model on CIA website

10. Publish database map on CIA website

11. Send User Communication regarding Production Status

Metadata items (rows)

Entity names

Entity definitions

Attribute names

Attribute definitions

Relationships

Table names

Source-to-target file/table mapping

Column names

Column Transformations

Source-to-target element/column mapping

Column data types

Column null options

Column valid values, decoded (validation rules)

Column derivation rules (embedded in definitions)

Primary Key and Primary Indexes

Relationships

Groupings of tables to define a subject area

Cardinality between tables

Join strategies

Row groups: Physical Model, Logical Model, DB Map

Signature of person responsible for project: Date:

Title:

Data Modeler: Date:

Page 26: Zen of metadata 09212010

PAGE 25

ERwin DM – data warehouse source

Page 27: Zen of metadata 09212010

PAGE 26

ERwin DM – UDP’s

Page 28: Zen of metadata 09212010

PAGE 27

source to target using MS Excel

Page 29: Zen of metadata 09212010

PAGE 28

ERwin DM – validation rules

Page 30: Zen of metadata 09212010

PAGE 29

domain of values

Page 31: Zen of metadata 09212010

PAGE 30

ERwin DM – data model

Page 32: Zen of metadata 09212010

PAGE 31

ERwin DM – import/export bridge

Page 33: Zen of metadata 09212010

PAGE 32

ERwin DM – column definitions via RTB

Table Name Column Name Column Datatype Attribute Definition

T_XYZ_ACCT XYZ_ACCT_ID DECIMAL(18) The system generated identifier for an existing user's Account

T_XYZ_ACCT XYZ_ACCT_NUM VARCHAR(17) The unique identifier used for an account within a given account type.

T_XYZ_ACCT XYZ_CO_ID VARCHAR(5) This identifier denotes the original (legacy) bank that booked or owned the associated account.

T_XYZ_ACCT XYZ_CRTE_DT_TM TIMESTAMP(0) The date and time a particular Account was added to the system.

T_XYZ_ACCT XYZ_STAT_CD INTEGER Status of the account

T_XYZ_ACCT XYZ_STAT_DT_TM TIMESTAMP(0) The STATUS is as-of this time, per the Server's local time.

T_XYZ_ACCT XYZ_IS_BUS_IND CHAR(1) This is a Business Account.

T_XYZ_ACCT XYZ_LST_UPDT_DT_TM TIMESTAMP(0) Server local time.

T_XYZ_ACCT XYZ_MASK_ACCT_NUM VARCHAR(64) This is a masked version of or surrogate key for a real Account number,

T_XYZ_ACCT XYZ_PROD_CD VARCHAR(6) The BOS product code.

T_XYZ_ACCT XYZ_PROD_SUBCODE VARCHAR(6) a.k.a. Sub-Product Code.

T_XYZ_ACCT XYZ_SOR_ACCT_NUM VARCHAR(17) The Account Number, per the SOR. Generally a padded version of ACCOUNT_NO.

T_XYZ_ACCT XYZ_SOR_PROD_CD CHAR(3) The SOR doesn't use the BOS code as its Product Code, so this is what it wants.

T_XYZ_ACCT XYZ_SOR_TYP INTEGER System of Record Type

T_XYZ_EMAIL_EV XYZ_EMAIL_ID DECIMAL(18) This identifier is the key randomly assigned to an email item when one is created for a given user.

T_XYZ_EMAIL_EV XYZ_CNTNT_TYP_CD VARCHAR(40) This is the MIME type of the email message (this could be "text/plain" or "text/html" ).

T_XYZ_EMAIL_EV XYZ_CRTD_DT_TM TIMESTAMP(0) Date and time this email item was created.

T_XYZ_EMAIL_EV XYZ_EDS_PROF_ID VARCHAR(30) The EDS profile used, in conjunction with a template Identifier, for sending the message.

T_XYZ_EMAIL_EV XYZ_EMAIL_TYP DECIMAL(18) Used for sub-type identification by Hibernate.

T_XYZ_EMAIL_EV XYZ_EVNT_ID DECIMAL(18) This identifier is the key randomly assigned to an inbox item when one is created for a given user.

T_XYZ_EMAIL_EV XYZ_INBOX_ID DECIMAL(18)

T_XYZ_EMAIL_EV XYZ_LST_UPDT_DT_TM TIMESTAMP(0) Date and time any attributes of this email item were last updated.

T_XYZ_EMAIL_EV XYZ_NNM VARCHAR(120) The nickname from the ContactPoint (at the time the Email was created), used for the "friendly name" of the recipient.

T_XYZ_EMAIL_EV XYZ_PRI_CD INTEGER Identifies the priority queue used to sequence handling. (loosely related to Urgent Indicator)

T_XYZ_EMAIL_EV XYZ_PROC_STAT_CD INTEGER Status of the email processing.

T_XYZ_EMAIL_EV XYZ_SNT_DT_TM TIMESTAMP(0) The date and time the email message was sent.

T_XYZ_EMAIL_EV XYZ_SMTP_FAILURE_CD VARCHAR(60) This is the failure code returned with failed email messages, per RFC821.

T_XYZ_EMAIL_EV XYZ_TMPLT_ID VARCHAR(100) The identifier of the template used to create this email item, if one was used.

T_XYZ_EMAIL_EV XYZ_TO_ADDR_LN VARCHAR(200) The destination email address.

T_XYZ_EMAIL_EV XYZ_URGENT_IND INTEGER This is the "importance" flag on a typical email item. It is carried along with the email message to indicate that the sender considers it important.

T_XYZ_EMAIL_EV XYZ_USR_ID DECIMAL(18) This identifier is the key randomly assigned to a user when he/she completes registration within the system.
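
A report like this can be screened for thin or missing definitions before it is published. The sketch below is illustrative only: the rows are copied from the report above, but the function name `weak_definitions` and the four-word threshold `MIN_WORDS` are assumptions, not a standard from the presentation.

```python
# (table, column, definition) rows copied from the report above.
rows = [
    ("T_XYZ_ACCT", "XYZ_STAT_CD", "Status of the account"),
    ("T_XYZ_ACCT", "XYZ_LST_UPDT_DT_TM", "Server local time."),
    ("T_XYZ_EMAIL_EV", "XYZ_INBOX_ID", ""),
    ("T_XYZ_EMAIL_EV", "XYZ_SNT_DT_TM", "The date and time the email message was sent."),
]

MIN_WORDS = 4  # hypothetical threshold for "too short to be useful"


def weak_definitions(rows, min_words=MIN_WORDS):
    """Return (table, column, reason) for missing or very short definitions."""
    flagged = []
    for table, column, definition in rows:
        words = definition.split()
        if not words:
            flagged.append((table, column, "missing"))
        elif len(words) < min_words:
            flagged.append((table, column, "too short"))
    return flagged


for table, column, reason in weak_definitions(rows):
    print(f"{table}.{column}: definition {reason}")
```

A word count is a crude proxy for quality; it catches empty cells and placeholders, not definitions that are long but wrong.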

Page 34: Zen of metadata 09212010

PAGE 33

SYS_MDS database using MS Access

Database Column Attribute Attribute Def

DB_DAD ABC_CN AbC_CN AbC cleansed version of the CN: Customer Number assigned to the customer.

DB_HIGH ABC_CN AbC customer number AbC customer number found in T_ABC_MBR_CN table.

DB_EVX ABC_CN ABC customer number A unique number assigned by Hogan for identifying a bank customer. This attribute stores the cleansed version of the raw input attribute "customer number".

DB_CGG ABC_CN ABC customer number AbC customer number found in T_ABC_MBR_CN table.

DB_RS ABC_CN ABC_CN See corresponding column in DB_AbC.

DB_OPS ABC_CN ABC_CN The Customer Number of the person associated with this account in AbC format.

DB_ABC ABC_CN AbC_CN @@

DB_BMT ABC_CN AbC customer number AbC datamart

DB_EDS ABC_CN AbC CN see AbC definition

DB_VSS ABC_CN ABC customer number Refer AbC

DB_RS ABC_CN AbC_CN See corresponding column in DB_AbC.

DB_OPS ABC_CN AbC_CN The Customer Number of the person associated with this account in AbC format.

DB_APK ABC_CN AbC customer number AbC customer number found in T_ABC_MBR_CN table.

DB_XYZ ABC_CN customer number AbC

DB_EDS ABC_CN AbC CN see AbC definition

DB_CM ABC_CN customer number undefined OR unable to load definition

DB_PE ABC_CN CN wells fargo CN of the visitor

DB_BMT ABC_CN ABC_CN AbC datamart

DB_EDS ABC_CN ABC_CN see AbC definition

DB_AA ABC_CN AbC customer number AbC customer number located in T_ABC_MBR_CN table.

DB_STS ABC_CN customer number See AbC Business Definitions

DB_APK ABC_CN AbC customer number AbC customer number found in T_ABC_MBR_CN table.

DB_OPS ABC_CN AbC_CN The Customer Number of the person associated with this account in AbC format.
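
The listing above shows one column, ABC_CN, carrying many different definitions across databases. A minimal sketch of how such drift could be detected follows; the rows are a subset copied from the listing, while the function names `definitions_by_column` and `inconsistent_columns` are assumptions made for illustration.

```python
from collections import defaultdict

# (database, column, definition) rows taken from the listing above.
rows = [
    ("DB_HIGH", "ABC_CN", "AbC customer number found in T_ABC_MBR_CN table."),
    ("DB_CGG", "ABC_CN", "AbC customer number found in T_ABC_MBR_CN table."),
    ("DB_RS", "ABC_CN", "See corresponding column in DB_AbC."),
    ("DB_ABC", "ABC_CN", "@@"),
    ("DB_EDS", "ABC_CN", "see AbC definition"),
]


def definitions_by_column(rows):
    """Map each column name to {definition: [databases using it]}."""
    grouped = defaultdict(lambda: defaultdict(list))
    for database, column, definition in rows:
        grouped[column][definition.strip()].append(database)
    return grouped


def inconsistent_columns(rows):
    """Return column names that carry more than one distinct definition."""
    grouped = definitions_by_column(rows)
    return [column for column, defs in grouped.items() if len(defs) > 1]


for column in inconsistent_columns(rows):
    for definition, databases in definitions_by_column(rows)[column].items():
        print(f"{column}: {definition!r} used by {', '.join(databases)}")
```

Once the variants are grouped, a steward can pick one definition and apply it everywhere.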

Page 35: Zen of metadata 09212010

PAGE 34

SYS_MDS database continued

Database Column Attribute Attribute Def

DB_DAD ABC_CN AbC_CN A unique number assigned for identifying a bank customer associated with this account.

DB_HIGH ABC_CN AbC customer number A unique number assigned for identifying a bank customer associated with this account.

DB_EVX ABC_CN ABC customer number A unique number assigned for identifying a bank customer associated with this account.

DB_CGG ABC_CN ABC customer number A unique number assigned for identifying a bank customer associated with this account.

DB_RS ABC_CN ABC_CN A unique number assigned for identifying a bank customer associated with this account.

DB_OPS ABC_CN ABC_CN A unique number assigned for identifying a bank customer associated with this account.

DB_ABC ABC_CN AbC_CN A unique number assigned for identifying a bank customer associated with this account.

DB_BMT ABC_CN AbC customer number A unique number assigned for identifying a bank customer associated with this account.

DB_EDS ABC_CN AbC CN A unique number assigned for identifying a bank customer associated with this account.

DB_VSS ABC_CN ABC customer number A unique number assigned for identifying a bank customer associated with this account.

DB_RS ABC_CN AbC_CN A unique number assigned for identifying a bank customer associated with this account.

DB_OPS ABC_CN AbC_CN A unique number assigned for identifying a bank customer associated with this account.

DB_APK ABC_CN AbC customer number A unique number assigned for identifying a bank customer associated with this account.

DB_XYZ ABC_CN customer number A unique number assigned for identifying a bank customer associated with this account.

DB_EDS ABC_CN AbC CN A unique number assigned for identifying a bank customer associated with this account.

DB_CM ABC_CN customer number A unique number assigned for identifying a bank customer associated with this account.

DB_PE ABC_CN CN A unique number assigned for identifying a bank customer associated with this account.

DB_BMT ABC_CN ABC_CN A unique number assigned for identifying a bank customer associated with this account.

DB_EDS ABC_CN ABC_CN A unique number assigned for identifying a bank customer associated with this account.

DB_AA ABC_CN AbC customer number A unique number assigned for identifying a bank customer associated with this account.

DB_STS ABC_CN customer number A unique number assigned for identifying a bank customer associated with this account.

DB_APK ABC_CN AbC customer number A unique number assigned for identifying a bank customer associated with this account.

DB_OPS ABC_CN AbC_CN A unique number assigned for identifying a bank customer associated with this account.

Page 36: Zen of metadata 09212010

PAGE 35

MetaSurf – Meta Data Services

Page 37: Zen of metadata 09212010

PAGE 36

MetaSurf search

Page 38: Zen of metadata 09212010

PAGE 37

Metadata Lite

T_XYZ_ACCT This represents an account.

T_XYZ_CNTCT A Contact is a channel for notifying a user. An Email address is the primary type

T_XYZ_EMAIL_MSG_TYP table level has NO definition

T_XYZ_IN_EV Table of In Items

T_XYZ_SUBSCR 555 AccountBasedSubscript 666 PasswordLockoutSubscript 777 CheckClearSubscript 888 CheckReorderReminderSubscript 999 DepositAccountUpdateSubscript

DATABASE TABLE COLUMN DESCRIPTION

DB_XYZ T_XYZ_CNTCT_PT XYZ_CNTCT_PT_ID The system generated identifier for each Contact Point entered by a user.

DB_XYZ T_XYZ_CNTCT_PT XYZ_CNTCT_PT_TYP This is the type of method or device used as a contact point by the user.

DB_XYZ T_XYZ_CNTCT_PT XYZ_CRTD_DT_TM The date and time a particular Contact Point was set up (created) by the system.

DB_XYZ T_XYZ_CNTCT_PT XYZ_EMAIL_ADDR If the contact point type is an email then this is the address.

DB_XYZ T_XYZ_CNTCT_PT XYZ_LST_UPDT_DT_TM Date and time any attributes of this contact point item were last updated.

DB_XYZ T_XYZ_CNTCT_PT XYZ_PURP_CD This code is set by the user to rank the priority or order in which a contact point is used.

DB_XYZ T_XYZ_CNTCT_PT XYZ_USR_ID This identifier is the key randomly assigned to a user when he/she completes registration.

Page 39: Zen of metadata 09212010

PAGE 38

enterprise metadata repository – EMR

Page 40: Zen of metadata 09212010

PAGE 39

EMR lineage diagram

Page 41: Zen of metadata 09212010

PAGE 40

source systems’ (metadata) websites

Page 42: Zen of metadata 09212010

PAGE 41

Data Modeling Set of Standards – DMSOS

Page 43: Zen of metadata 09212010

PAGE 42

MODEL Submission Checklist (v5)

This Submission Checklist is used to ensure that all models meet CIA modeling standards. Check mark each entry to indicate that it has been successfully completed and reviewed.

If an item is not checked, explain in the comments section.

Model Name (in MM):

Date:

Data Modeler: other

Data Base Administrator: other

Check each requirement below if complete X

Logical model Entity

All lower case names

Singular nouns used (no plural or past tense)

English business name

Abbreviations not used

- AND/OR -

Standard acronyms used (when necessary)

Special characters not used

Naming standards followed for References & Composites

Definition entered

Attribute

All lower case names

Singular nouns used (no plural or past tense)

English business name (consisting of descriptors, followed by a classword)

Classwords taken from template of logical domains

Abbreviations not used in descriptors

- AND/OR -

Standard acronyms used as needed

Special characters not used

Naming standards followed for:

Keys, Indicators, Codes, Identifiers, Dates or Timestamps

Definition entered

Logical Datatype

Assigned to every attribute

Datatypes assigned properly for:

Indicators, Codes, Dates or Timestamps

Keys & Key Groups

Primary key has been assigned for every entity

All attributes which participate in a key are either part

Physical model

Table

“T” plus acronym used in name prefix

All UPPER case names

Standard abbreviations applied

Column

All UPPER case names

Acronym used in name prefix

Standard abbreviations applied

Misc.

Template logical domains set to Physical Only, as needed

Indexes have been identified

Complete Compare done with previous Production model

Both (L/P)

Model

Model considers usage of third normal form in design

If colors used, there is a legend for color scheme

Subject Areas used for functional or project categorization

Text Boxes and comments use a standard format layout

Successful “export” and “import” of model via XML

Metadata Matrix form has been reviewed

Subject Area

Template Subject Areas have NOT been modified or deleted

Subject Areas added and/or used appropriately for functional or project categorization

Website (to be completed by modeler and website Admin)

(See the document ‘Data Model Implementation.doc’ for further detailed information)

PDF file of model diagram (w/o date) moved to staging

S2T (source to target) file for model moved to staging

New or Existing Source: Existing

Supplemental Documents: No

Source Description (required):

Source Summary:

PDF file of model diagram uploaded to website

S2T (source to target) file uploaded to website
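
Several of the physical-model checks above (upper case names, "T" plus acronym prefix on tables, acronym prefix on columns) are mechanical and could be automated. The sketch below assumes underscore-separated names in the style of T_XYZ_ACCT and XYZ_ACCT_ID seen elsewhere in the deck; the regular expressions and the function `check_physical` are illustrative assumptions, not the actual standard.

```python
import re

# Physical names: all UPPER case, underscore-separated; tables start with "T".
TABLE_NAME = re.compile(r"^T_[A-Z0-9]+(_[A-Z0-9]+)*$")
COLUMN_NAME = re.compile(r"^[A-Z0-9]+(_[A-Z0-9]+)+$")
# Logical names: all lower case words, no special characters.
LOGICAL_NAME = re.compile(r"^[a-z]+( [a-z]+)*$")


def check_physical(table, columns, acronym):
    """Return a list of naming problems for one table and its columns."""
    problems = []
    if not TABLE_NAME.match(table) or not table.startswith(f"T_{acronym}_"):
        problems.append(f"table {table}: expected T_{acronym}_... in upper case")
    for column in columns:
        if not COLUMN_NAME.match(column) or not column.startswith(f"{acronym}_"):
            problems.append(f"column {column}: expected {acronym}_... in upper case")
    return problems


print(check_physical("T_XYZ_ACCT", ["XYZ_ACCT_ID", "XYZ_STAT_CD"], "XYZ"))
print(check_physical("t_xyz_acct", ["acct_id"], "XYZ"))
```

Checks that need judgment, such as "English business name" or "Definition entered" being meaningful, still need a reviewer.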

model submission checklist

Page 44: Zen of metadata 09212010

PAGE 43

ERwin DM – database maps

Page 45: Zen of metadata 09212010

PAGE 44

corporate GLOSSARY of acronyms

Page 46: Zen of metadata 09212010

PAGE 45

intranet web site using Sharepoint

Page 47: Zen of metadata 09212010

PAGE 46

‘Data Sources’ page for a given Source System

Page 48: Zen of metadata 09212010

PAGE 47

integrating technical documents

• Business Requirements Document

• Data Modeling Checklist

• Database Design Spec

• ETL Design Spec

• Service Delivery Runbook

Page 49: Zen of metadata 09212010

PAGE 48

global objects

• Model Mart Reports

• Data Dictionary Search

• Data mining tools

Looking for common element names and/or datatypes and/or data values

Create reference or cross-reference tables
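
Looking for common element names and datatypes can start from a plain data dictionary extract. The sketch below uses rows taken from the column report earlier in the deck; the function name `shared_elements` is an assumption, and a real search would run against the full dictionary and not a hand-typed list.

```python
from collections import defaultdict

# (table, column, datatype) rows taken from the column report earlier in the deck.
columns = [
    ("T_XYZ_ACCT", "XYZ_LST_UPDT_DT_TM", "TIMESTAMP(0)"),
    ("T_XYZ_ACCT", "XYZ_STAT_CD", "INTEGER"),
    ("T_XYZ_EMAIL_EV", "XYZ_LST_UPDT_DT_TM", "TIMESTAMP(0)"),
    ("T_XYZ_EMAIL_EV", "XYZ_USR_ID", "DECIMAL(18)"),
    ("T_XYZ_EMAIL_EV", "XYZ_PROC_STAT_CD", "INTEGER"),
]


def shared_elements(columns):
    """Return {(column, datatype): sorted tables} for elements used in 2+ tables."""
    usage = defaultdict(set)
    for table, column, datatype in columns:
        usage[(column, datatype)].add(table)
    return {key: sorted(tables) for key, tables in usage.items() if len(tables) > 1}


for (column, datatype), tables in shared_elements(columns).items():
    print(f"{column} {datatype}: {', '.join(tables)}")
```

Elements that recur with the same name and datatype are candidates for a shared definition or a common reference table.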

Page 50: Zen of metadata 09212010

PAGE 49

enticement !

Page 51: Zen of metadata 09212010

PAGE 50

user support and support of users

• Brown Bag presentations – business unit functionality

• Training Sessions - new projects & ongoing courses

– in person

– video / audio

– online

• CIA Communications

– What’s happening?

– How am I impacted?

– Questions – call…

• User meetings & group discussions

Page 52: Zen of metadata 09212010

PAGE 51

examples

• Expanding acronyms

– CIA =

– MOB =

• Abbreviations

– st_cd CHAR(2)

– sta_cd CHAR(2)

• Data element value/content

– 10 digit ID =

Page 53: Zen of metadata 09212010

PAGE 52

examples

• Expanding acronyms

– CIA = Certified Internal Auditor

– MOB = Mobile Browser

• Abbreviations

– st_cd status code CHAR(2)

– sta_cd state code CHAR(2)

• Data element value/content

– 10 digit ID = SSN + check digit (potential security violation)
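
The st_cd / sta_cd example shows why a standard abbreviation list matters: two near-identical names mean different things. A minimal sketch of expanding physical names from such a list follows. Only the three entries in `ABBREVIATIONS` come from the slide; the function `expand` and its behavior for unknown tokens are assumptions.

```python
# Standard abbreviation list; only these three entries come from the slide.
ABBREVIATIONS = {"st": "status", "sta": "state", "cd": "code"}


def expand(column_name, abbreviations=ABBREVIATIONS):
    """Expand an underscore-separated physical name into a logical name.

    Tokens missing from the list are kept as-is and reported, not guessed.
    """
    tokens = column_name.lower().split("_")
    unknown = [token for token in tokens if token not in abbreviations]
    logical = " ".join(abbreviations.get(token, token) for token in tokens)
    return logical, unknown


for name in ("st_cd", "sta_cd", "stat_cd"):
    logical, unknown = expand(name)
    note = f" (not in standard list: {', '.join(unknown)})" if unknown else ""
    print(f"{name} -> {logical}{note}")
```

Reporting unknown tokens, and not guessing at them, keeps a reader from assuming that "stat" means the same thing as "st" or "sta".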

Page 54: Zen of metadata 09212010

PAGE 53

caveats

• Metadata

– Data Element Definition – be detailed in your descriptors and be consistent in their use

– It’s better to be consistently wrong than inconsistently right

– A good example is worth a thousand words… of definition

• Models

– Normalize the physical data structure and de-normalize the view of that data or create separate (aggregate/composite) physical structures

– Create Reference and Cross Reference information that can be shared across models/databases
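
The modeling caveat above (normalize the physical structure, de-normalize the view) can be illustrated with Python's built-in sqlite3 module. This is a sketch under stated assumptions: the table names `ref_status` and `acct`, the view `v_acct`, and the status values are all hypothetical, and the production environment described in this deck is Teradata, not SQLite.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Normalized physical structures: codes live in a shared reference table.
    CREATE TABLE ref_status (stat_cd INTEGER PRIMARY KEY, stat_desc TEXT NOT NULL);
    CREATE TABLE acct (
        acct_id INTEGER PRIMARY KEY,
        stat_cd INTEGER NOT NULL REFERENCES ref_status (stat_cd)
    );
    -- De-normalized presentation: the view resolves the code for users.
    CREATE VIEW v_acct AS
        SELECT a.acct_id, a.stat_cd, r.stat_desc
        FROM acct AS a JOIN ref_status AS r ON r.stat_cd = a.stat_cd;

    INSERT INTO ref_status VALUES (1, 'Open'), (2, 'Closed');
    INSERT INTO acct VALUES (100, 1), (101, 2);
""")

rows = conn.execute(
    "SELECT acct_id, stat_desc FROM v_acct ORDER BY acct_id"
).fetchall()
print(rows)
```

The description is stored once in the reference table, so changing it updates every row the view returns, and the same reference table can be shared by other models.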

Page 55: Zen of metadata 09212010

PAGE 54

in conclusion

• Data and Metadata

• Standards and Roadmap

• Tools and Usage

• Communication

Page 56: Zen of metadata 09212010

PAGE 55

Vendor environment / products

• Computer Associates – ERwin DM

• Embarcadero Technologies – E/R Studio

• Axis Software Designs – Data Model Set of Standards

• Teradata – MetaSurf and Metadata Services

• ASG – Rochade / Enterprise Metadata Repository

• Microsoft – MS Excel, MS Word, MS Access, MOSS

Page 57: Zen of metadata 09212010

PAGE 56

Q&A

Page 58: Zen of metadata 09212010

PAGE 57

thank you