Data Organization: Data files, Databases, Data streams

Date posted: 19-Dec-2015

Page 1: Data Organization

  • Data files
  • Databases
  • Data streams

Page 2: The organization of data

  • Data/information must be organized
    • Computer/network accessible
    • Stored or streaming

Page 3: Stored information

  • Spreadsheet
    • Worksheet oriented
    • Cells, columns, rows
    • Can be set up to look like a table
  • Database
    • Table oriented
    • Column headings describe a row
    • Rows are records
    • Tables can be related (relational database)

Page 4: Spreadsheets in Excel

  • Limited in size to 64K (65,536) rows, 256 columns in Excel 2003 and earlier; Excel 2007 and later allow 1,048,576 rows and 16,384 columns
  • Slow to compute changes
  • Accessed via GUI or Visual Basic
  • Can be exported to a relational database table
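
The export step can be sketched in Python, assuming the worksheet has first been saved as CSV. The column names, sample values, and the `samples` table are hypothetical, and SQLite (via Python's standard `sqlite3` module) stands in for whichever relational database is actually used.

```python
import csv
import io
import sqlite3

# Hypothetical worksheet contents, as they might look after saving as CSV.
CSV_TEXT = """sample_id,site,temperature_c
1,north,12.5
2,south,15.1
3,north,11.8
"""


def load_csv_into_table(conn, csv_text):
    """Create a 'samples' table from CSV text; return the number of rows inserted."""
    reader = csv.DictReader(io.StringIO(csv_text))
    conn.execute(
        "CREATE TABLE samples ("
        "sample_id INTEGER PRIMARY KEY, site TEXT, temperature_c REAL)"
    )
    # Each worksheet row becomes one record; column headings become field names.
    rows = [
        (int(r["sample_id"]), r["site"], float(r["temperature_c"]))
        for r in reader
    ]
    conn.executemany("INSERT INTO samples VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")  # in-memory database, nothing written to disk
inserted = load_csv_into_table(conn, CSV_TEXT)
north_count = conn.execute(
    "SELECT COUNT(*) FROM samples WHERE site = 'north'"
).fetchone()[0]
print(inserted, north_count)
```

Once the rows are in a table, questions that would need filters or formulas in the worksheet become single SQL queries.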

Page 5: Excel Worksheet

Page 6: Relational Databases

  • Licensed databases
    • MS Access, MS SQL Server, Oracle
  • Free databases
    • MySQL, PostgreSQL, MaxDB, Firebird, Ingres (http://www.geocities.com/mailsoftware42/db/)
  • Interfaces: GUI, SQL, programming languages (Java, Perl, PHP, C++, VB)
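
As an illustration of the programming-language interface, here is a minimal sketch of issuing SQL from code. It uses Python and SQLite, neither of which is on the slide's list; they are chosen only because the example then runs without a database server. The `patients` table and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE patients (id INTEGER PRIMARY KEY, name TEXT, age INTEGER)"
)
conn.executemany(
    "INSERT INTO patients (name, age) VALUES (?, ?)",
    [("Ada", 36), ("Grace", 45), ("Alan", 41)],
)


def names_older_than(conn, min_age):
    """Return names of patients older than min_age, sorted alphabetically."""
    # The '?' placeholder lets the driver bind the value safely,
    # instead of building the SQL string by hand.
    cur = conn.execute(
        "SELECT name FROM patients WHERE age > ? ORDER BY name", (min_age,)
    )
    return [row[0] for row in cur.fetchall()]


result = names_older_than(conn, 40)
print(result)
```

The same pattern (connect, execute SQL, fetch rows) applies in the languages the slide names, each through its own database driver.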

Page 7: Relational Table

Page 8: Relational Table Elements

Page 9: Relating Tables

Page 10: Relating Tables

Page 11: Relating Tables

Page 12: One to One Relationship
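
The slide gives only the title, so the following is a sketch under assumptions: a hypothetical `person` table and `passport` table in SQLite, where each person may have at most one passport. One common way to express this is a foreign key that is also declared UNIQUE.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.execute("CREATE TABLE person (person_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE passport ("
    "passport_id INTEGER PRIMARY KEY, "
    # UNIQUE on the foreign key is what makes the relationship one to one.
    "person_id INTEGER NOT NULL UNIQUE REFERENCES person(person_id), "
    "number TEXT)"
)
conn.execute("INSERT INTO person VALUES (1, 'Ada')")
conn.execute("INSERT INTO passport VALUES (10, 1, 'X123')")

# A second passport for the same person violates the UNIQUE constraint.
try:
    conn.execute("INSERT INTO passport VALUES (11, 1, 'Y456')")
    second_allowed = True
except sqlite3.IntegrityError:
    second_allowed = False

print(second_allowed)
```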

Page 13: One to Many Relationship
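
A one-to-many relationship can be sketched with a plain (non-unique) foreign key: many rows in the child table point at one row in the parent table. The `customer` and `orders` tables below are hypothetical examples in SQLite, not taken from the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.execute("CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders ("
    "order_id INTEGER PRIMARY KEY, "
    # Several orders may carry the same customer_id.
    "customer_id INTEGER NOT NULL REFERENCES customer(customer_id), "
    "item TEXT)"
)
conn.executemany("INSERT INTO customer VALUES (?, ?)", [(1, "Ada"), (2, "Grace")])
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(100, 1, "disk"), (101, 1, "cable"), (102, 2, "monitor")],
)

# Relate the two tables with a JOIN on the shared key and count orders per customer.
counts = conn.execute(
    "SELECT c.name, COUNT(o.order_id) "
    "FROM customer c JOIN orders o ON o.customer_id = c.customer_id "
    "GROUP BY c.customer_id, c.name ORDER BY c.name"
).fetchall()
print(counts)
```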

Page 14: Many to Many Relationship
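
A many-to-many relationship is usually modeled with a third, linking table that holds one row per pairing. The sketch below assumes hypothetical `student`, `course`, and `enrollment` tables in SQLite; the names and data are illustrative only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (student_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE course (course_id INTEGER PRIMARY KEY, title TEXT)")
conn.execute(
    "CREATE TABLE enrollment ("
    "student_id INTEGER REFERENCES student(student_id), "
    "course_id INTEGER REFERENCES course(course_id), "
    # The composite key allows each (student, course) pair only once.
    "PRIMARY KEY (student_id, course_id))"
)
conn.executemany("INSERT INTO student VALUES (?, ?)", [(1, "Ada"), (2, "Grace")])
conn.executemany(
    "INSERT INTO course VALUES (?, ?)", [(10, "Databases"), (20, "Data Mining")]
)
conn.executemany("INSERT INTO enrollment VALUES (?, ?)", [(1, 10), (1, 20), (2, 10)])


def students_in(conn, title):
    """Names of students enrolled in the course with the given title."""
    cur = conn.execute(
        "SELECT s.name FROM student s "
        "JOIN enrollment e ON e.student_id = s.student_id "
        "JOIN course c ON c.course_id = e.course_id "
        "WHERE c.title = ? ORDER BY s.name",
        (title,),
    )
    return [row[0] for row in cur.fetchall()]


def courses_of(conn, name):
    """Titles of courses taken by the student with the given name."""
    cur = conn.execute(
        "SELECT c.title FROM course c "
        "JOIN enrollment e ON e.course_id = c.course_id "
        "JOIN student s ON s.student_id = e.student_id "
        "WHERE s.name = ? ORDER BY c.title",
        (name,),
    )
    return [row[0] for row in cur.fetchall()]


print(students_in(conn, "Databases"), courses_of(conn, "Ada"))
```

The linking table can be read from either side: one course has many students, and one student has many courses.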

Page 15: Data Streams

  • Not stored in a file
  • Serial data
  • Real-time
  • Underlying model to discover changes over time
  • Drift in the data mining/analysis tools needed
  • Incremental adaptation of the learning algorithm
  • 2006 inaugural conference: http://www.lsi.us.es/~aguilar/ds/
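
To make incremental adaptation concrete, here is a minimal sketch, not a method from the slides: an exponentially weighted running mean that is updated one reading at a time, with a simple threshold check standing in for drift detection. The synthetic `stream()` source, the `alpha` value, and the threshold of 5.0 are all assumptions chosen for illustration.

```python
def stream():
    """Hypothetical serial source: 50 readings at 10.0, then 50 at 20.0."""
    for i in range(100):
        yield 10.0 if i < 50 else 20.0


class RunningMean:
    """Exponentially weighted mean, updated one value at a time.

    No past readings are stored, which suits data that arrives serially
    and is never written to a file.
    """

    def __init__(self, alpha=0.2):
        self.alpha = alpha  # weight given to each new reading
        self.value = None   # current estimate; None until the first reading

    def update(self, x):
        if self.value is None:
            self.value = x
        else:
            # Move the estimate a fraction alpha of the way toward x.
            self.value += self.alpha * (x - self.value)
        return self.value


model = RunningMean(alpha=0.2)
drift_points = []
for i, x in enumerate(stream()):
    previous = model.value
    # Flag possible drift when a reading is far from the current estimate.
    if previous is not None and abs(x - previous) > 5.0:
        drift_points.append(i)
    model.update(x)

print(drift_points[0], round(model.value, 2))
```

The estimate follows the stream to its new level instead of being retrained from scratch, which is the sense of "incremental" here; real stream-mining tools use more careful drift tests than a fixed threshold.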

Page 16: Data Stream Examples

  • Network packets
  • AP news wire
  • Live audio / video
  • Control system data
  • Customer click streams
  • Telephone records
  • Large sets of web pages
  • Retail transactions

Page 17: Summary

  • Data is stored or streaming
  • Data is organized
  • Data is accessible programmatically
  • Need to know
    • How data is organized
    • How to access the data
    • How to apply DI tools to the data