Automating Data Governance and Stewardship to Build Data Trust
Automating Data Governance and Stewardship
To Build Data Trust
Pieter De Leenheer, PhD
Founder & VP, Research and Education
June 2016
2©Collibra 2016 Collibra Data Citizens Conference | #collibracitizens
Misconceptions of Data Governance
• A published repository of common definitions
• Concern of - hence managed by - IT
• Just Data quality management or MDM
• Siloed Islands
• No ownership, no process hence no trust in data
• Lack of data citizen participation
The Problem
Who approved this?
I wish these guys spoke our language
I can’t understand this report!
I’ve never seen this product code! Who introduced this?
Are we sure this definition of ‘customer’ is correct?
This data quality rule is differently implemented in our department!
Are we allowed to share this customer data with analysts?
Data Governance is a holistic lens on your ever-expanding data universe
Through a Data Collaboration Platform
Understand & Explain
• Commonalities and differences in definitions for reports, terms, policies, etc.
• Business Traceability
• Business Data Lineage
• Technical Data Lineage
Monitor & Predict
• Onboarding and approval of CDEs
• Report Certification and Watermarking
• Helpdesk and Issue Management
• Data Access and Usage Agreements
• …
Customers in Higher Education
Data Governance Framework
• Three Tiers
  – DG Operating Model
  – Stewardship Applications
  – Integrations
• 1 single platform
• N steward applications
• Education and Certification
university.collibra.com
https://compass.collibra.com/display/COOK/Collibra+Body+of+Knowledge
Data Governance Platform Demo
Search and Filter Reports
Report Definition, Attributes and Relations
Report Ownership
Traceability vs Lineage
Workflows, Statuses and Roles
The Rise of the CDO, Business Data Authority
Data governance & stewardship provide the right level of control and trust in data
Data Consumers (Business)
  LEADERSHIP: CEO, CFO, VP, Marketing
  ROLES: Data Scientist, Business Analyst
  TECHNOLOGY: Visualization, Self-service BI
NEED: Data Authority
Data Infrastructure (IT)
  LEADERSHIP: CIO
  ROLES: Information Manager, Data Architect, Data Modeler
  TECHNOLOGY: Hadoop, Databases, Data Integration
Data Authority
  LEADERSHIP: Chief Data Officer
  ROLES: Data Governance Manager, Data Steward
  TECHNOLOGY: Data Stewardship Platform
CDO Roles
• Collaboration: inwards / outwards
• Data Space: traditional data / big data
• Value Impact: service / strategy
• MIT Sloan & Collibra: http://www.iscdo.org/
Full Text: http://www.mitcdoiq.org/wp-content/uploads/2014/01/Lee-et-al.-A-Cubic-Framework-for-the-CDO-MISQE-Forthcoming-2014-copy.pdf
Stanford University Data Stewardship (SUDS)
• All materials available at dg.stanford.edu
• Establish foundation for Institutional Research
• Data Quality
  – How many faculty do we have?
• Context and Meaning
  – What does faculty mean in which context?
  – How is faculty data structured and where is it stored?
• Data Usage Request
  – Am I allowed to use faculty or student name and age for external reporting?
SUDS: Approach
• Decentralized
  – 1 DG coordinator (also show vacancy)
  – Project staff
  – Cross-functional working groups: natural scope and resources
  – Focus on BI reporting, with input from above projects
  – Sign-off by DG coordinator and end user through usage (full cycle)
• Step by step; success by success
SUDS: First Success in OBIEE reporting
REST / JSON / CSV / Excel
What attribute- and relation-types do we want to capture?
• https://stanford.app.box.com/CollibraQuickReference
• https://stanford.box.com/UsingCollibraFields
How to execute and monitor? From Best Practice to Auto-Validation Rules
http://web.stanford.edu/dept/pres-provost/cgi-bin/dg/wordpress/?p=577
(generic example – not from SUDS)
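As a further generic illustration (again not from SUDS, and not Collibra's API), the step from a written best practice to an auto-validation rule might be sketched as below. The asset type, attribute names, relation names and the `validate` helper are all hypothetical; the sketch only shows the idea of declaring which attribute- and relation-types an asset must carry and checking each asset against that declaration.

```python
from dataclasses import dataclass, field

# Hypothetical best-practice rules: the attribute- and relation-types each
# asset type is expected to carry. Names are illustrative assumptions only.
REQUIRED_ATTRIBUTES = {"Report": ["Definition", "Purpose"]}
REQUIRED_RELATIONS = {"Report": ["uses Business Term"]}


@dataclass
class Asset:
    name: str
    asset_type: str
    attributes: dict = field(default_factory=dict)
    relations: dict = field(default_factory=dict)


def validate(asset: Asset) -> list:
    """Return a list of readable violations; an empty list means valid."""
    violations = []
    for attr in REQUIRED_ATTRIBUTES.get(asset.asset_type, []):
        # A blank or whitespace-only value counts as missing.
        if not str(asset.attributes.get(attr, "")).strip():
            violations.append(f"{asset.name}: missing attribute '{attr}'")
    for rel in REQUIRED_RELATIONS.get(asset.asset_type, []):
        # A relation must point to at least one target asset.
        if not asset.relations.get(rel):
            violations.append(f"{asset.name}: missing relation '{rel}'")
    return violations


# Made-up report asset with a definition but no purpose and no linked terms.
report = Asset(
    name="Faculty Headcount",
    asset_type="Report",
    attributes={"Definition": "Count of active faculty per term"},
)
for problem in validate(report):
    print(problem)
```

Running such a check on every change, rather than relying on stewards to remember the guideline, is what turns the best practice into something that can be executed and monitored.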
George Washington University
• GW is the largest institution of higher education in the District of Columbia.
• More than 20,000 students, studying a rich range of disciplines: from forensic science and creative writing to international affairs and computer engineering, as well as medicine, public health, law and public policy.
• The university is currently ranked in the top 100 universities in the country.
GWU Data Governance & Stewardship Vision
People + Process / Policy + Technology = Data Governance Center
Ensuring the highest quality data is delivered throughout the university providing valuable information serving individual and organizational needs
Data governance at GW focuses on improving data quality, protecting access to data, establishing business definitions, maintaining metadata, documenting data policies and setting the foundation for analytics and reporting.
• Policy – The What
• Process – The How
Everyone has a seat at the table
Academics, Advancement, Finance, Research, Human Resources, Services & Resources
The Data Governance Committee meets once a month to review data quality issues, discuss proposed business terms, review policies and discuss other institutional data-related topics. The committee is composed of functional data stewards from across all functions and departments of the university.
GWU Data Governance Vision
GWU - Technology – The game changer
Technology is helping GWU to achieve their vision of commonly understood, consistent, trusted and high-quality data throughout GW.
• Making data transparent
  – Serves as single source of truth for all our data governance and stewardship activities
  – Makes business terms visible and searchable by all
  – Commonly agreed-upon business terms and data assets
  – Provides traceability between business and technical assets, policies and rules
• Data Quality
  – Allows us to assess the integrity of data and resolve data quality issues
• Analytics and Reporting
  – Enables portfolios to define reports and visualizations
  – Provides workflow to share data
  – Provides workflow to certify reports and visualizations
• Bonus - Provides metrics and KPIs to track progress and maturity
How can we make YOU visible on this thriving new competency market?
Collibra University, COMPASS and our certification program
Introducing Collibra University
• Free guided self-learning
• Delivers the knowledge you need to become a high-value data governance professional
• Best place to learn Collibra’s thought-leading technology and how to apply it to implement data governance
• Choose your own level: Steward, Community Manager, Developer, Ranger, etc.
• Sample courses:
  – Report Cert., DHD, Good Definitions
https://university.collibra.com/shared/start/key:ZLBDNHRK
Introducing the Collibra Certification
• https://compass.collibra.com/display/COOK/Collibra+University+Certifications
• Respond to the Challenge
• Objective standard
  – for skills and competence in practical data governance applying and integrating Collibra technologies
  – against which to measure quality of implementations
Prove your value today: [email protected]
[Sample certificate: Collibra University Certified Ranger]
This is to certify that Jane Smith has been formally evaluated for demonstrated experience, knowledge and performance of the Collibra Data Governance Software and is hereby bestowed the global credential COLLIBRA RANGER.
Certificate number: CUR.2015.007
Original grant date: August 21st, 2015
In testimony whereof, we have subscribed our signatures: Ram Naresh Pratti, Director of Professional Services; Dr. Pieter De Leenheer, Co-founder & Collibra University Dean
http://university.collibra.com
Collibra NV, Oorlogskruisenlaan 116, 1120 Brussels, Belgium
Collibra Inc, 25 Broadway, NY, 10004 New York, United States
Customer Success supported by communities
Self-paced learning platform for all customers and partners. Ranger certifications awarded upon completion.
university.collibra.com
Knowledge repository with BOK, documentation, questions and answers, use-cases, integrations, and more.
compass.collibra.com
CUSTOMER COMPASS
Thank You