EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.
-
Upload
hannah-freeman -
Category
Documents
-
view
216 -
download
0
Transcript of EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.
![Page 1: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/1.jpg)
EDUC 200CSection 1– Describing
DataMelissa Kemmerle
September 28, 2012
![Page 2: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/2.jpg)
First things…
• Hi, I’m Melissa– 3rd year CTE student, math education
• Goal of section this quarter– Keep material as painless as possible– Present some new material as necessary– Review and answer questions about class
concepts and problem sets
Questions, comments, or concerns?
![Page 3: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/3.jpg)
Today’s Goals
• Discuss mean, variance, standard deviation
• Look at Hands data
• Introduce z-scores
• Briefly introduce correlation
• Answer any homework questions
![Page 4: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/4.jpg)
How do we describe data?
• Measures of “central tendency” and measures of “spread”
![Page 5: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/5.jpg)
Measures of Central Tendency
Mode, Median, Mean…
![Page 6: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/6.jpg)
The mode
The mode is the score with the highest frequency of occurrences.
It is the easiest score to spot in a distribution.
It is the only way to express the central tendency of a nominal level variable.
![Page 7: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/7.jpg)
The median
The median is the middle-ranked score (50th percentile).
If there is an even number of scores, it is the arithmetic average of the two middle scores.
The median is unchanged by outliers. Even if Bill Gateswere deleted from the U.S. economy, the median asset of U.S. citizens would remain (more or less) the same.
![Page 8: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/8.jpg)
The Mean
• We’ll most commonly use the mean
![Page 9: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/9.jpg)
Visualizing the Mean
![Page 10: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/10.jpg)
Measures of Spread
• Variance, standard deviation• Why do we care about spread?
![Page 11: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/11.jpg)
Deviation score
• Measure the distance of each point from the mean
![Page 12: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/12.jpg)
How do we summarize this?
• Could use “mean deviation”
– But the sum of deviation scores will always be 0 (why?), thus mean deviation will always be 0
• What about mean absolute value of the deviation?
– This will guarantee a positive sum of deviation scores, but has undesirable properties for more advanced statistics
![Page 13: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/13.jpg)
Variance and Standard Deviation
• The answer is to take the average of the squared deviation scores
• This is called the variance– Hard to interpret—still in “squared deviation” units
• Standard deviation is the square root of the variance
– Gives a measure of deviation in the units of the original observations– Note the N-1 is used to correct bias in estimates of sample standard deviation
and variance
![Page 14: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/14.jpg)
Calculating Mean and SD
• It’s probably a good idea to do it by hand once or twice.
• After that, you can use Excel.
• Let’s look at our hands data.
• Calculate mean and SD for each cohort’s hands data. Which cohort is best at estimating hand size? How can we tell?
![Page 15: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/15.jpg)
Z-scores
• We can transform data about different variables to the same scale by creating z-scores
• This makes it easier to compare variables
• Z-scores always have a mean of 0 and standard deviation of 1
![Page 16: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/16.jpg)
Correlation
• Correlation is used to describe how two variables vary with each other
• What are some examples of variables that might have positive or negative or zero correlation?
![Page 17: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/17.jpg)
Z-scores don’t change correlation
![Page 18: EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.](https://reader036.fdocuments.us/reader036/viewer/2022071710/56649db05503460f94a9db69/html5/thumbnails/18.jpg)
Questions?