Data Management Data, Databases and Warehousing ...

51
AMTI Data Management: Data, Databases and Warehousing Databases and Warehousing Antonius Rachmat C

Transcript of Data Management Data, Databases and Warehousing ...

Page 1: Data Management Data, Databases and Warehousing ...

AMTIData Management: Data,

Databases and WarehousingDatabases and Warehousing

Antonius Rachmat C

Page 2: Data Management Data, Databases and Warehousing ...

Data Life Cycle Process

Page 3: Data Management Data, Databases and Warehousing ...

Forms for Modeling Data (ERD)

• Entities : suatu kesatuan data yang unik yang memisahkannya dari entitas lainnya

• Attributes : ciri khas yang menempel pada entitas, biasa disebut field/column

Chapter 3 3

• Relations : relasi yang terjadi antar entitas yang dihubungkan oleh atribut masing-masing entitas

– Biasanya dihubungkan dengan key

Page 4: Data Management Data, Databases and Warehousing ...

E-R Diagrams

• Rectangles represent entity sets.

• Diamonds represent relationship sets.

• Lines link attributes to entity sets and entity sets to relationship sets.

• Ellipses represent attributes

• Double ellipses represent multivalued attributes.

• Dashed ellipses denote derived attributes.

• Underline indicates primary key attributes

Page 5: Data Management Data, Databases and Warehousing ...

E-R Diagram With Composite, Multivalued, and Derived Attributes

Page 6: Data Management Data, Databases and Warehousing ...

Relationship Sets with Attributes

Page 7: Data Management Data, Databases and Warehousing ...

Example of primary and foreign keys.

3-7

Page 8: Data Management Data, Databases and Warehousing ...

Hierarchy of Data

Chapter 3 8

Page 9: Data Management Data, Databases and Warehousing ...

Hierarchy of Data (cont’d)

Chapter 3 9

Page 10: Data Management Data, Databases and Warehousing ...

Indexing

• Indexing sering dilupakan• Indexing pada tabel biasanya dilakukan

otomatis pada primary key– Tidak demikian dengan field lainnya– Tidak demikian dengan field lainnya

• Field yang sering digunakan untuk query select klause where SEBAIKNYA diindex!– Index berdasarkan ASCII / Numerik– Index berdasarkan binary

Page 11: Data Management Data, Databases and Warehousing ...

Indexed sequential access method(ISAM)

3-11

Page 12: Data Management Data, Databases and Warehousing ...

MySQL -> MyISAM vs InnoDB

Page 13: Data Management Data, Databases and Warehousing ...

Computer based system data IS

3-13

Computer-based files of this type cause problems such as redundancy,inconsistency, and data isolation.

Page 14: Data Management Data, Databases and Warehousing ...

3-14

(a) Centralized database. (b) Distributed database with complete or partial copies of the central database in more than one location.

Page 15: Data Management Data, Databases and Warehousing ...

Data Problems

Copyright 2010 John Wiley & Sons, Inc.

3-15

Page 16: Data Management Data, Databases and Warehousing ...

Adv of DBMS

3-16

Page 17: Data Management Data, Databases and Warehousing ...

DBMS

Ex: MySQL, FoxPro, SQLServer, Postgres, Access, SQLite, Oracle

3-17

Database management system provides access to all data in the database.

Page 18: Data Management Data, Databases and Warehousing ...

DBMS connector

• Dibutuhkan juga DBMS connector yang berguna untuk menghubungkan antara DBMS dengan bahasa pemrograman yang sifatnya generalyang sifatnya general

• Connector sangat ditentukan oleh bahasa pemrogramannya

• Contoh: JDBC, ODBC, Connector for .NET (ADO.NET)

Page 19: Data Management Data, Databases and Warehousing ...

JDBC

Page 20: Data Management Data, Databases and Warehousing ...

ADO.NET

Page 21: Data Management Data, Databases and Warehousing ...

Data Warehouse

• Kumpulan bermacam-macam data yang bersifat subject oriented, integrated, time variant, dan persistent dalam mendukung proses pembuatan keputusan.

• Sumber data berasal dari berbagai macam • Sumber data berasal dari berbagai macam format, software, platform dan jaringan yang beda.– Biasanya berasal dari berbagai departemen

• Data pada datawarehouse harus tersimpan dalam sebuah format yang baku (standar).

Page 22: Data Management Data, Databases and Warehousing ...

Karakteristik DW

• Data terintegrasi dari berbagai sumber yang berasal dari proses transaksional (OLTP)

• Data harus dibuat konsisten• Merupakan aggregate data/kesimpulan data, bukan

data yang detail• Data tersimpan dalam format yang standar sehinngga • Data tersimpan dalam format yang standar sehinngga

proses query dan analisa dapat dilakukan dengan cepat

• Data bersifat read only

Page 23: Data Management Data, Databases and Warehousing ...

Data Mart

• Data yang terdapat dalam data warehouse dapat dibagi perbagian sesuai dengan kebutuhan dalam informasi -> Data Mart

• Data mart memiliki karakteristik yang sama dengan data warehouse, perbedaannya hanya dengan data warehouse, perbedaannya hanya terdapat pada jumlah data yang dimiliki.

• Dalam data mart, data yang ada hanya berasal dari satu bagian atau satu departemen saja, sedangkan pada data warehouse, data yang ada berasal dari seluruh bagian dalam perusahaan tersebut.

Page 24: Data Management Data, Databases and Warehousing ...

Data Mart

Page 25: Data Management Data, Databases and Warehousing ...

Data Warehouse

• Data di dalam data warehouse mempunyaikarakter khusus berupa time-series dalambentuk data historical

• Data terdiri dari suatu seri dari “pemotretan”keadaan yang ditandai dengan waktu dankeadaan yang ditandai dengan waktu dannilai data pada saat “pemotretan” tersebut

• Hal ini dapat digunakan untuk melakukan trend analysis dari data tersebut

• Data warehouse tidak diupdate secara terus menerus, tapi secara periodik

Page 26: Data Management Data, Databases and Warehousing ...

DW = OLAP

• On-Line Analytical Processing• Umumnya digunakan oleh pembuat

kebijakan seperti analis dan manajer• Data yang ada di dalam sebuah data• Data yang ada di dalam sebuah data

warehouse umumnya berasal dari bermacam-macam OLTP (On-Line Transaction Processing) systems

• Dikarakterisasikan oleh data reading(pelaporan, queries) bukan data entry

Page 27: Data Management Data, Databases and Warehousing ...

The use of DW

Copyright 2010 John Wiley & Sons, Inc.

3-27

Page 28: Data Management Data, Databases and Warehousing ...

Why DW Fails

Copyright 2010 John Wiley & Sons, Inc.

3-28

Page 29: Data Management Data, Databases and Warehousing ...

DW vs DM

• DW: A generic term for a system of storing, retrieving and managing large amounts of data– Software often includes sophisticated compression

and hashing techniques for fast searching andfilteringfiltering

• DM: type of data warehouse designed mainly to address a specific function or department’s needs– Often uses aggregation or summarization of the

data to enhance query performance– Important, however, to maintain the ability to access

the underlying base data to enable drilldownanalysis as necessary

Page 30: Data Management Data, Databases and Warehousing ...

The Data Warehouse & Data Management

Chapter 3 30

Page 31: Data Management Data, Databases and Warehousing ...

Model of an enterprise data warehouse.

ETL : Extraction, Transformation, and Loading

3-31(Source: From Syncsort, synchsort.com. Used with permission.)

Page 32: Data Management Data, Databases and Warehousing ...

• Lots of data is being collected and warehoused – Web data, e-commerce– purchases at department/

grocery stores– Bank/Credit Card

transactions

Why Mine Data? - Commercial Viewpoint

transactions• Data collected and stored at

enormous speeds (GB/hour)

– remote sensors on a satellite

– telescopes scanning the skies

– scientific simulations generating terabytes of data

Page 33: Data Management Data, Databases and Warehousing ...

What is Data Mining?• Exploration & analysis, by automatic or

semi-automatic means, of large quantities of data in order to discover meaningful patterns

Page 34: Data Management Data, Databases and Warehousing ...

the extraction of predictive information from large databases.

• Data trend, connection and behavior pattern analysis

Data Mining

pattern analysis• Data quality -> data cleaning• Data mining tools -> ex: RapidMiner• Use for predictive and description• Statistical techniques and algorithms

Page 35: Data Management Data, Databases and Warehousing ...

Data Mining Tasks...

• Classification [Predictive]

• Clustering [Descriptive]

• Association Rule Discovery [Descriptive]

• Sequential Pattern Discovery • Sequential Pattern Discovery [Descriptive]

• Regression [Predictive]

• Deviation/Anomaly Detection [Predictive]

Page 36: Data Management Data, Databases and Warehousing ...

• Visualisasi adalah konversi data ke dalam format visual atau tabel sehingga karakteristik dari data dan relasi diantara item data atau atribut dapat di analisis atau dilaporkan

Data Visualization

Suhu permukaan laut pada bulan Juli tahun 1982

Page 37: Data Management Data, Databases and Warehousing ...

Teknik Visualisasi

• Histogram

Page 38: Data Management Data, Databases and Warehousing ...

Teknik Visualisasi Data

• Blok Plot

Page 39: Data Management Data, Databases and Warehousing ...

Teknik Visualisasi data

• Scatter Plot

Page 40: Data Management Data, Databases and Warehousing ...

Teknik Visualisasi data

• Countour Plot

Page 41: Data Management Data, Databases and Warehousing ...

Apa itu Business Intelligence?

• Business Intelligence is the processes, technologies, and tools that help us change data into information, information to knowledge, and knowledge into plans that guide organization.

• Technologies for gathering, storing, analysing dan providing access to data to help enterprise

• Technologies for gathering, storing, analysing dan providing access to data to help enterprise users make better business decisions.

• Business Intelligence menjelaskan tentang suatu konsep dan metode bagiamana untuk meningkatkan kualitas pengambilan keputusan bisnis berdasarkan sistem yang berbasiskan data.

Page 42: Data Management Data, Databases and Warehousing ...

Apa itu Business Intelligence?...

Page 43: Data Management Data, Databases and Warehousing ...

Tiga Aspek Penting Business Intelligence

1. Converting Data Into Information2. Making Better Decisions Faster2. Making Better Decisions Faster

3. Rational Approach to Management

Page 44: Data Management Data, Databases and Warehousing ...

Arsitektur BI

Page 45: Data Management Data, Databases and Warehousing ...

BI Pyramid

Page 46: Data Management Data, Databases and Warehousing ...

• Database systems and database integration• Data warehousing, data stores and data

marts• Enterprise resource planning (ERP) systems• Query and report technologies

Technologies Supporting BI

• Query and report technologies• Data mining and analytics tools• Decision support systems• Customer relation management software• Product lifecycle and supply chain

management systems

Page 47: Data Management Data, Databases and Warehousing ...

BI careers cross over all industries:• BI solution architects and integration

specialists• Business and BI analysts

Examples of BI Careers

• Business and BI analysts• BI application developers and testers• Data warehouse specialists• Database analysts, developers and testers• Database support specialists

Page 48: Data Management Data, Databases and Warehousing ...

Bagaimana BI bekerja?

Page 49: Data Management Data, Databases and Warehousing ...

BI Component

• Data– Data store collection by individual applications

• Production systems– Gathering of all data stores

• Data warehousing• Data warehousing

• Presentation– Scorecards & Dashboards– Enterprise Reporting– OLAP Analysis– Advanced & Predictive Analysis– Alerts & Proactive Notification

Page 50: Data Management Data, Databases and Warehousing ...

Kategori Analisis Bisnis

Page 51: Data Management Data, Databases and Warehousing ...

NEXT

• Networking: Discovery, Communication, Collaboration