Foundations of Data Analytics

Post on 22-Jun-2015

201 views 0 download

Tags:

description

Foundations of Data Analytics

Transcript of Foundations of Data Analytics

Data Analysis Fundamentals/Eduphoria Tutorial

Dr. Michael G. RichardsDirector, Data & AccountabilityDistrict Testing Coordinator

What is a Relational Database?• The invention of this database system has standardized the

way that data is stored and processed.• A relational database consists of a collection of tables that

store particular sets of data.• The concept of a relational database derives from the

principles of relational algebra, developed by E. F. Codd in the 1970s.

• Relational databases are built-in to software like EDUPHORIA, so the database is generally invisible to the user.

What Does One Do With a Relational Database?

1. Upload data2. Query

Querying

1. Write code (SQL, R, SAS)2. Employ a Graphic User Interface (GUI)

Smart Querying• Know your research question well before pushing any buttons!• Think in terms of ROWS & COLUMNS

column

row

Just for fun…SQL term Relational database term Description

Row Tuple or record A data set representing a single item

Column Attribute or fieldA labeled element of a tuple, e.g. "Address" or "Date of birth"

Table Relation or Base relvarA set of tuples sharing the same attributes; a set of columns and rows

View or result set Derived relvarAny set of tuples; a data report from the RDBMS in response to a query

Eduphoria borrows its terminology from SQL

Shopping Analogy

Choosing your mall is equivalent to choosing ROWS

Filling your shopping cart is equivalent to choosing COLUMNS

ROW/COLUMN Quiz

Q1. There is a teacher of 32 8th-grade math students and the teacher has just given a DA and wants to debrief with her students. She needs a quick read on how they did on each item.

A1. ROWS: students; COLUMNS: test items

Bonus: What will appear in the table cells?

answer choice and disposition

Let’s Complicate Things (Slightly)

• The BY function• It is hidden by the GUI in Eduphoria• Often used to create additional levels in either ROWS or

COLUMNS• Ex: Scores: SE% correct BY ethnicity (extra level of COLUMNS)• Student scores BY teacher (could be extra level of ROWS)

ROW/COLUMN Quiz

Q2. I want to know if the fifth graders did better this year on STAAR math than they did last year.

A2. ROWS: students; COLUMNS: scores by test

Choosing ROWS through the Wizard

• Not as apparent as choosing COLUMNS• You begin choosing ROWS as soon as you begin querying the

database by narrowing down the data (driving to the mall)• “Select a Test”

• “How do you want to view the results?”

• “Select a timeframe”

Note: almost always choose “Where students are now”

Choosing COLUMNS Through the Wizard

Next

COLUMNS

NextMy preference:

Let’s Run the 2 Quiz Questions in Eduphoria

1. Using (on not using) the Wizard2. Color coding/heat mapping3. Data pivot4. Summary view options5. Graphing6. Editing7. Exporting

The 2 Questions• Q1. There is a teacher of 32 8th-grade math students and the

teacher has just given a DA and wants to debrief with her students. She needs a quick read on how they did on each item.

• Q2. I want to know if all 5th-graders did better this year on STAAR math than they did last year.

Exporting Data from Eduphoria

• For distribution analysis• For inferential analysis

Descriptive Stats12 13

Inferential Stats: Significance

SignalNoise

Significant difference or relationship?

α = 0.05

Difference in the 5th Grade Math Performance?

We are only 74% sure

FERPA• Do not send emails with text or datasets (attached or

otherwise) that contain information identifying students• All data analysis that leaves the campus or the district offices

must be vetted through C&I

Q&A