Guidelines for Assessment and Instruction in Statistics Education A Curriculum Framework for...

Post on 30-Dec-2015

216 views 1 download

Tags:

Transcript of Guidelines for Assessment and Instruction in Statistics Education A Curriculum Framework for...

Guidelines for Assessment and Instruction in Statistics Education

A Curriculum Framework for Pre-K-12 Statistics Education

The GAISE Report (2007) The American Statistical Association

http://www.amstat.org/education/gaise/

Christine Franklin & Henry Kranendonk

NCSM Conference

March 19, 2007

Outline of Presentation

• Overview of the GAISE Report

• The Evolution of a Statistical Concept- The Mean as Fair Share/Variation from Fair Share- The Mean as the Balance Point/Variation from the Mean- The Sampling Distribution of the Mean/Variation in Sample

Means

• Summary

Benchmarks in Statistical Education in the United States (1980-2007)

• The Quantitative Literacy Project (ASA/NCTM Joint Committee, Early 1980’s)

• Curriculum and Evaluation Standards for School Mathematics (NCTM, 1989)

• Principles and Standards for School Mathematics (NCTM, 2000)

• Mathematics and Statistics College Board Standards for College Success (2006)

• The GAISE Report (2005, 2007)

GOALS of the GAISE Report

• Promote and develop statistical literacy

• Provide links with the NCTM Standards

• Discuss differences between Mathematics and Statistics*

• Clarify the role of probability in statistics*

• Illustrate concepts associated with the data analysis process*

• Present the statistics curriculum for grades Pre-K-12 as a cohesive and coherent curriculum strand*

• Provide developmental sequences of learning experiences*

Stakeholders

• Writers of state standards

• Writers of assessment items

• Curriculum directors

• Pre K-12 teachers

• Educators at teacher preparation programs 

STATISTICAL THINKING versus

MATHEMATICAL THINKING  

• The Focus of Statistics on Variation in Data

• The Importance of Context in Statistics

 

PROBABILITY Randomization

• Sampling -- "select at random from a population"

 • Experiments -- "assign at random to a

treatment"   

THE FRAMEWORKUnderlying Principles

PROBLEM SOLVING PROCESS

Formulate Questions•         clarify the problem at hand •         formulate one (or more) questions that can be answered with data  

Collect Data•         design a plan to collect appropriate data•         employ the plan to collect the data  

Analyze Data•         select appropriate graphical or numerical methods•         use these methods to analyze the data  

Interpret Results• interpret the analysis taking into account the scope of inference based on the

data collection design•       relate the interpretation to the original question

 

Developmental Levels

• The GAISE Report proposes three developmental levels for evolvingstatistical concepts.

Levels A, B, and C

The Framework Model

A Two-Dimensional Model

• One dimension is the four components of the statistical problem-solving process, along with the nature of and the focus on variability

• The second dimension is comprised of three developmental levels (A, B, and C)

THE FRAMEWORK MODEL 

ProcessComponent

Level A Level B Level C

FormulateQuestion

Beginning awareness of the statistics question distinction 

Increased awareness of the statistics question distinction 

Students can make the statistics question distinction 

CollectData

Do not yet design for differences 

Awareness of design for differences 

Students make designs for differences 

AnalyzeData

Use particular properties of distributions in context of specific example

Learn to use particular properties of distributions as tools of analysis 

Understand and use distributions in analysis as a global concept

InterpretResults

Do not look beyond the data

Acknowledge that looking beyond the data is feasible 

Able to look beyond the data in some contexts

THE FRAMEWORK MODEL 

Nature ofVariability   

Focus onVariability

Measurement variabilityNatural variabilityInduced variability  Variability within a group   

Sampling variability    

Variability within a group and variability between groups Co-variability

Chance variability    

Variability in model fitting

Activity Based Learning

• The GAISE Report promotes active learning of statistical content and concepts

Two Types of Learning Activities

• Problem Solving Activities• Concept Activities

The STN article illustrates a Problem Solving Activity across the three developmental levels.

The evolution of a statistical concept --

• What is the mean?

• Quantifying variation in data from the mean

Level A Activity

The Family Size Problem

A Conceptual Activity for:

• Developing an Understanding of the Mean as the “Fair Share” value

• Developing a Measure of Variation from “Fair Share”

A Question

How large are families today?

• Nine children were asked how many

people are in your family.

• Each child represented her/his family size with a collection snap cubes.

Snap Cube Representation for Nine Family Sizes

How might we examine the data on the family sizes for these nine children?

Ordered Snap Cube & Numerical Representations of Nine Family Sizes

2 3 3 4 4 5 6 7 9

Notice that the family sizes vary.

What if we used all our family members and tried to make all families the same size, in which case there is no variability.

How many people would be in each family?

How can we go about creating these new families?

We might start by separating all the family members into one large group.

All 43 Family Members

Step 1

Have each child select a snap cube to represent her/him-self.

These cubes are indicated in red.

Create Nine “New” Families/Step1

Step 2

Next have each child select one family member from the remaining group.

These new family members are shown in red.

Create Nine “New” Families/Step2

Continue this process untilthere are not enough family members for each child to select from.

Create Nine “New” Families/Step4

Discuss results

• The fair share value

Note that this is developing the division algorithm and eventually, the algorithm for finding the mean.

A New Problem

What if the fair share value for nine children is 6?

What are some different snap cube representations that might produce a fair share value of 6?

Snap Cube Representation of Nine Families, Each of Size 6

Have Groups of Children Create New Snap Cube Representations

For example, following are two different collections of data with a fair share value of 6.

Two Examples with Fair Share Value of 6.

Which group is “closer” to being “fair?”

How might we measure “how close” a group of numeric data is to being fair?

Which group is “closer” to being “fair?”

The upper group in blue is closer to fair since it requires only one “step” to make it fair. The lower group requires two “steps.”

How do we define a “step?”

• One step occurs when a snap cube is removed from a stack higher than the fair share value and placed on a stack lower than the fair share value .

• A measure of the degree of fairness in a snap cube distribution is the “number of steps” required to make it fair.

Note -- Fewer steps indicates closer to fair

Number of Steps to Make Fair: 8

Number of Steps to Make Fair: 9

Students completing Level A understand:

• the notion of “fair share” for a set of numeric data

• the fair share value is also called the mean value

• the algorithm for finding the mean

• the notion of “number of steps” to make fair as a measure of variability about the mean

• the fair share/mean value provides a basis for comparison between two groups of numerical data with different sizes (thus can’t use total)

Level B Activity

The Family Size Problem

• How large are families today?

A Conceptual Activity for:

• Developing an Understanding of the Mean as the “Balance Point” of a Distribution

• Developing Measures of Variation about the Mean

Level B Activity

How many people are in your family?

Nine children were asked this question. The following dot plot is one possible result for the nine children:

-+--+--+--+--+--+--+--+--+- 2 3 4 5 6 7 8 9 10

Have groups of students create different dot plot representations of nine families with a mean of 6.

-+--+--+--+--+--+--+--+--+- 2 3 4 5 6 7 8 9 10

-+--+--+--+--+--+--+--+--+- 2 3 4 5 6 7 8 9 10

In which group do the data (family sizes) vary (differ) more from the mean value of 6?

-+--+--+--+--+--+--+--+--+- 2 3 4 5 6 7 8 9 10

-+--+--+--+--+--+--+--+--+- 2 3 4 5 6 7 8 9 10

4 2 1

1

0 1 2

2

3

4 3 2 0

0

0

2 3 4

In Distribution 1, the Total Distance from the Mean is 16.

In Distribution 2, the Total Distance from the Mean is 18.

Consequently, the data in Distribution 2 differ more from the mean than the data in Distribution 1.

-+--+--+--+--+--+--+--+--+- 2 3 4 5 6 7 8 9 10

4 2 1

1

0 1 2

2

3

Note that the total distance for the values below the mean of 6 is 8, the same as the total distance for the values above the mean. For this reason, the distribution will “balance” at 6 (the mean)

The SAD is defined to be:

The Sum of the Absolute Deviations

Note the relationship between SAD and Number of Steps to Fair from Level A:

SAD = 2xNumber of Steps

Number of Steps to Make Fair: 8

Number of Steps to Make Fair: 9

An Illustration where the SAD doesn’t work!

-+--+--+--+--+--+--+--+--+- 2 3 4 5 6 7 8 9 10

-+--+--+--+--+--+--+--+--+- 2 3 4 5 6 7 8 9 10

4 4

1

1

1 1

1

1

11

The SAD is 8 for each distribution, but in the first distribution the data vary more from the mean.

Why doesn’t the SAD work?

Adjusting the SAD for group sizes yields the:

MAD = Mean Absolute Deviation

Measuring Variation about the Mean

• SAD = Sum of Absolute Deviations

• MAD = Mean of Absolute Deviations

• Variance = Mean of Squared Deviations

• Standard Deviation = Square Root of Variance

Summary of Level B and Transitions to Level C

• Mean as the balance point of a distribution

• Mean as a “central” point

• Various measures of variation about the mean.

The Mean at Level C

• At Level C, the notion of the “Sampling Distribution of the Sample Mean” is Developed.

• This development connects probability and statistics and provides the link between the descriptive statistics students have learned at Levels A and B and concepts of inferential statistics they will learn at Level C.

Eighty Circles/What is the Mean Diameter?

Activity

• Students select samples of 10 circles they considered to be representative of the 80 circles. The mean for each sample isdetermined.

• Students select simple random samples of 10 circles. The mean for each sample is determined.

How do the results from self-selection compare with random sampling?

• Following are results from two introductory-level statistics classes (50 students).

Sample Means2.22.01.81.61.41.21.0

Random Selection

Self Selection

Dotplot of Random Selection versus Self Selection

Population Mean = 1.25

Sampling Distributions provide the link to two important concepts in statistical inference.

• Margin of Error

• Statistical Significance

The STN article in your packet provides an illustration of how a sampling distribution is used to develop these statistical concepts.

SUMMARY: GOALS of GAISE Report• Promote and develop statistical literacy

• Provide links with the NCTM Standards

• Discuss differences between Mathematics and Statistics

• Clarify the role of probability in statistics

• Illustrate concepts associated with the data analysis process

• Present the statistics curriculum for grades Pre-K-12 as a cohesive and coherent curriculum strand

• Provide developmental sequences of learning experiences