MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement...
-
Upload
barbara-nash -
Category
Documents
-
view
213 -
download
1
Transcript of MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement...
MAT 1000
Mathematics in Today's World
Last Time
We looked at the standard deviation, a measurement of the spread of a distribution.
We introduced a special type of distribution, the normal distribution. These highly symmetric distributions are very common.
We saw how, using only the mean and standard deviation, we can find the first and third quartiles of a normal distribution.
Today
Using the mean and standard deviation, we can find out much more about a normal distribution.
In particular, we will be able to easily find any of the percentiles of the distribution. To do so, we need to find standard scores, or z-scores.
First, we address the question of why normal distributions are so common.
Today
Note: Today’s material is not in the textbook.
Example of normal distributions
• Physical characteristics like height or weight.
• The annual returns on the S&P 500 over the last 50 years.
• Cars in the parking lot of a mall.
• How long it takes a kernel of popcorn to pop in the microwave.
Why are normal distributions so common?
Normal distributions are “bell” shaped, so most of the data is close to the center (the mean), and only rarely are there numbers far from the mean.
We expect this distribution whenever there are many conflicting forces that tend to cancel each other out.
This is the case in lots of situations.
Why are normal distributions so common?
ExampleWhat forces can affect stock returns during a year?
Usually lots of little things: new products, bad publicity, changing government regulations, even the weather.
If we combine the returns of 500 companies, then all of these small factors tend to cancel out. This means the S&P 500 return will usually be close to the average.
Why are normal distributions so common?
ExampleWhat forces determine a person’s height?
Lots of reasons. There are genetic factors, but things like childhood nutrition or illness also play a role.
With lots of small forces that tend to conflict, it’s no surprise that most people tend to be close to average height.
Why are normal distributions so common?
Isn’t it true that in any data set most of the data will be close to the mean?Absolutely not!
Suppose 10 people take a test. Five score 0, and five score 100. The mean is 50, but nobody is close to that.
Why do people believe that most of the data in a distribution should be close to the mean? Precisely because normal distributions are so common.
PercentilesThe median and the first and third quartiles are examples of what are called percentiles.
For any number P between 0 and 100, we can find the Pth percentile of a distribution.
By definition, P percent of the data is less than the Pth percentile
For example, Q1 could also be called the 25th percentile—25% of the data is less than Q1
PercentilesExampleThe heights of adult men in the US are normally distributed with mean 69.3 in. () and standard deviation 2.9 in.
We will see that the 90th percentile of this distribution is: 73.1 in ()
This tells us that a man who is is taller than 90% of the men in the US
Percentiles
Example
On the other hand, a man who is 66 in. tall () has a height equal to about the 14th percentile.
So 14% of the adult men in the US are less than tall.
PercentilesPercentiles tell us what percent of the data is below a number. What if we want to know what percent is above that number?
ExampleIf 14% of adult men in the US are shorter than , what percent are taller than ?
The percent of men shorter than plus the percent of men taller adds up to 100%
PercentilesExampleWhy? Think of it this way: any man is either taller than or he is not. (With an accurate enough ruler we can assume no one is exactly )
(% of men shorter than ) + (% of men taller than )
(% of men taller than )
% of men taller than
% of men taller than
Standard scoresFor a data value in a normally distributed data set, we can find its percentile by first finding its standard score.
Let’s call the data value . With our usual notation for the mean and for the standard deviation, the standard score (also called the z score) is:
Standard scoresExampleAs I said earlier, the heights of adult men in the US are normally distributed with mean 69.3 in. () and standard deviation 2.9 in.
What is the standard score for a man who is 73.1 in () tall?
Finding percentilesExampleThe standard score for a man who is 73.1 in () tall is .
Using the standard score we can consult a table to tell us the percentile.
The table uses standard scores rounded to the nearest tenth, so we need to look up the percentile corresponding to
Table of percentilesStandard Standard StandardScore Percentile Score Percentile Score Percentile–3.4 0.03 –1.1 13.57 1.2 88.49–3.3 0.05 –1.0 15.87 1.3 90.32–3.2 0.07 –0.9 18.41 1.4 91.92–3.1 0.10 –0.8 21.19 1.5 93.32–3.0 0.13 –0.7 24.20 1.6 94.52–2.9 0.19 –0.6 27.42 1.7 95.54–2.8 0.26 –0.5 30.85 1.8 96.41–2.7 0.35 –0.4 34.46 1.9 97.13–2.6 0.47 –0.3 38.21 2.0 97.73–2.5 0.62 –0.2 42.07 2.1 98.21–2.4 0.82 –0.1 46.02 2.2 98.61–2.3 1.07 0.0 50.00 2.3 98.93–2.2 1.39 0.1 53.98 2.4 99.18–2.1 1.79 0.2 57.93 2.5 99.38–2.0 2.27 0.3 61.79 2.6 99.53–1.9 2.87 0.4 65.54 2.7 99.65–1.8 3.59 0.5 69.15 2.8 99.74–1.7 4.46 0.6 72.58 2.9 99.81–1.6 5.48 0.7 75.80 3.0 99.87–1.5 6.68 0.8 78.81 3.1 99.90–1.4 8.08 0.9 81.59 3.2 99.93–1.3 9.68 1.0 84.13 3.3 99.95–1.2 11.51 1.1 86.43 3.4 99.97
Finding percentilesExampleThe percentile is 90, so that means a height of 73.1 in () tall is the 90th percentile of all heights of American men.
In other words, a man who is is taller than 90% of the men in the US.
NoteUse the same table for standard scores from any data set.
Finding percentilesExampleScores on the SAT math exam are normally distributed with a mean of 500 points and a standard deviation of 100 points. What percent of test takers score below 450? What percent are below 600?
We need to find the percentiles. Start with the standard scores:
Now find the percentile from the table:
Table of percentilesStandard Standard StandardScore Percentile Score Percentile Score Percentile–3.4 0.03 –1.1 13.57 1.2 88.49–3.3 0.05 –1.0 15.87 1.3 90.32–3.2 0.07 –0.9 18.41 1.4 91.92–3.1 0.10 –0.8 21.19 1.5 93.32–3.0 0.13 –0.7 24.20 1.6 94.52–2.9 0.19 –0.6 27.42 1.7 95.54–2.8 0.26 –0.5 30.85 1.8 96.41–2.7 0.35 –0.4 34.46 1.9 97.13–2.6 0.47 –0.3 38.21 2.0 97.73–2.5 0.62 –0.2 42.07 2.1 98.21–2.4 0.82 –0.1 46.02 2.2 98.61–2.3 1.07 0.0 50.00 2.3 98.93–2.2 1.39 0.1 53.98 2.4 99.18–2.1 1.79 0.2 57.93 2.5 99.38–2.0 2.27 0.3 61.79 2.6 99.53–1.9 2.87 0.4 65.54 2.7 99.65–1.8 3.59 0.5 69.15 2.8 99.74–1.7 4.46 0.6 72.58 2.9 99.81–1.6 5.48 0.7 75.80 3.0 99.87–1.5 6.68 0.8 78.81 3.1 99.90–1.4 8.08 0.9 81.59 3.2 99.93–1.3 9.68 1.0 84.13 3.3 99.95–1.2 11.51 1.1 86.43 3.4 99.97
Finding percentilesExampleThe percentile corresponding to a standard score of is 30.85, and the percentile corresponding to a standard score of is 84.13
This means that (roughly) 31% of test takers score below 450 on the SAT math exam, and 84% are below 600.
What percent of test takers score between 450 and 600?
Finding percentilesExampleWe can find the percent who score between 450 and 600 using the fact that 31% of test takers are below 450 and 84% are below 600.
Take the number of people who score below 600 and subtract the number who scored below 450. The result is the number who scored between 450 and 600.
The same is true for percentages:(% below 600)-(% below 450)= (% between 450 and 600)
So the percent who score between 450 and 600 is:
Comparing percentilesUsing the mean and standard deviation of a normal distribution, we can find the percentile of any data value from that distribution.
Percentiles are also very useful for comparing data values from different distributions.
Is a 600 on the SAT math test better or worse than a 26 on the ACT math test?
We can’t compare the numbers—the SAT is out of 800 and the ACT is out of 36.
Comparing percentilesInstead of comparing the numbers, we compare these test scores using percentiles.
Scores on the SAT are normally distributed with a mean of 500 and standard deviation of 100. Scores on the ACT are normally distributed with a mean of 18 and standard deviation of 6.
Find the standard scores:
Comparing percentilesFrom the table, a standard score of 1 is the 84th percentile, while a standard score of 1.3 is the 90th percentile. So a 26 on the ACT is better than a 600 on the SAT.
In what sense is it a better score?
Percentiles describe these scores relative to all the other test takers.
Scoring higher than 90% of the people who took a test is better than scoring higher than 84%.
Another normal distributionOne of the most important examples of a normal distribution is the sampling distribution of statistics
In a sample survey, we choose a sample and compute a statistic.
A different sample would have given a different statistic.
If we consider every possible sample, we would have a distribution of statistics (which numbers occur, and how often they occur).
Another normal distributionIt turns out that if our sample size is large enough, the distribution of statistics will be normal.
What is a large enough sample?
A general rule of thumb is a sample size of 30.
Another normal distributionExampleIn 2012 Barack Obama won the presidential election with 51.1% of the vote.
In the run up to the election, there were many polls of likely voters.
These polls were producing statistics to estimate a parameter: the proportion of all voters who were going to vote for Obama. Now, we know the value of this parameter to be 51.1%
Another normal distributionExampleSuppose a polling company sampled 100 voters before the election. It turns out that the distribution of statistics for a sample size of 100 is normal with mean 51.1% and standard deviation 5%.
(We’ll see the formulas for these later in the course.)
As decimals these are 0.511 and 0.05.
Another normal distributionExampleIn what percent of samples of 100 would more than 50% of the sample support Obama?
This is a question about percentiles.
As a decimal 50% is 0.5
Find the standard score:
Another normal distributionExampleFrom the table, the corresponding percentile is 42.07.
This means that in 42.07% of samples of 100 voters, the proportion who supported Obama would have been less than 50%.
To answer our question we must subtract the percentile from 100:
Another normal distributionExampleSo, in about 58% of the possible samples of 100 voters, we would have seen more than 50% of the sample supporting Obama.
Another normal distribution
This graphic illustrates the idea of a sampling distribution of statistics (denoted ). Imagine taking many samples of size 100 from a population with parameter 0.511.