MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement...

34
MAT 1000 Mathematics in Today's World

Transcript of MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement...

Page 1: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

MAT 1000

Mathematics in Today's World

Page 2: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Last Time

We looked at the standard deviation, a measurement of the spread of a distribution.

We introduced a special type of distribution, the normal distribution. These highly symmetric distributions are very common.

We saw how, using only the mean and standard deviation, we can find the first and third quartiles of a normal distribution.

Page 3: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Today

Using the mean and standard deviation, we can find out much more about a normal distribution.

In particular, we will be able to easily find any of the percentiles of the distribution. To do so, we need to find standard scores, or z-scores.

First, we address the question of why normal distributions are so common.

Page 4: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Today

Note: Today’s material is not in the textbook.

Page 5: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Example of normal distributions

• Physical characteristics like height or weight.

• The annual returns on the S&P 500 over the last 50 years.

• Cars in the parking lot of a mall.

• How long it takes a kernel of popcorn to pop in the microwave.

Page 6: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Why are normal distributions so common?

Normal distributions are “bell” shaped, so most of the data is close to the center (the mean), and only rarely are there numbers far from the mean.

We expect this distribution whenever there are many conflicting forces that tend to cancel each other out.

This is the case in lots of situations.

Page 7: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Why are normal distributions so common?

ExampleWhat forces can affect stock returns during a year?

Usually lots of little things: new products, bad publicity, changing government regulations, even the weather.

If we combine the returns of 500 companies, then all of these small factors tend to cancel out. This means the S&P 500 return will usually be close to the average.

Page 8: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Why are normal distributions so common?

ExampleWhat forces determine a person’s height?

Lots of reasons. There are genetic factors, but things like childhood nutrition or illness also play a role.

With lots of small forces that tend to conflict, it’s no surprise that most people tend to be close to average height.

Page 9: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Why are normal distributions so common?

Isn’t it true that in any data set most of the data will be close to the mean?Absolutely not!

Suppose 10 people take a test. Five score 0, and five score 100. The mean is 50, but nobody is close to that.

Why do people believe that most of the data in a distribution should be close to the mean? Precisely because normal distributions are so common.

Page 10: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

PercentilesThe median and the first and third quartiles are examples of what are called percentiles.

For any number P between 0 and 100, we can find the Pth percentile of a distribution.

By definition, P percent of the data is less than the Pth percentile

For example, Q1 could also be called the 25th percentile—25% of the data is less than Q1

Page 11: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

PercentilesExampleThe heights of adult men in the US are normally distributed with mean 69.3 in. () and standard deviation 2.9 in.

We will see that the 90th percentile of this distribution is: 73.1 in ()

This tells us that a man who is is taller than 90% of the men in the US

Page 12: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Percentiles

Example

On the other hand, a man who is 66 in. tall () has a height equal to about the 14th percentile.

So 14% of the adult men in the US are less than tall.

Page 13: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

PercentilesPercentiles tell us what percent of the data is below a number. What if we want to know what percent is above that number?

ExampleIf 14% of adult men in the US are shorter than , what percent are taller than ?

The percent of men shorter than plus the percent of men taller adds up to 100%

Page 14: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

PercentilesExampleWhy? Think of it this way: any man is either taller than or he is not. (With an accurate enough ruler we can assume no one is exactly )

(% of men shorter than ) + (% of men taller than )

(% of men taller than )

% of men taller than

% of men taller than

Page 15: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Standard scoresFor a data value in a normally distributed data set, we can find its percentile by first finding its standard score.

Let’s call the data value . With our usual notation for the mean and for the standard deviation, the standard score (also called the z score) is:

Page 16: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Standard scoresExampleAs I said earlier, the heights of adult men in the US are normally distributed with mean 69.3 in. () and standard deviation 2.9 in.

What is the standard score for a man who is 73.1 in () tall?

Page 17: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Finding percentilesExampleThe standard score for a man who is 73.1 in () tall is .

Using the standard score we can consult a table to tell us the percentile.

The table uses standard scores rounded to the nearest tenth, so we need to look up the percentile corresponding to

Page 18: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Table of percentilesStandard Standard StandardScore Percentile Score Percentile Score Percentile–3.4 0.03 –1.1 13.57 1.2 88.49–3.3 0.05 –1.0 15.87 1.3 90.32–3.2 0.07 –0.9 18.41 1.4 91.92–3.1 0.10 –0.8 21.19 1.5 93.32–3.0 0.13 –0.7 24.20 1.6 94.52–2.9 0.19 –0.6 27.42 1.7 95.54–2.8 0.26 –0.5 30.85 1.8 96.41–2.7 0.35 –0.4 34.46 1.9 97.13–2.6 0.47 –0.3 38.21 2.0 97.73–2.5 0.62 –0.2 42.07 2.1 98.21–2.4 0.82 –0.1 46.02 2.2 98.61–2.3 1.07 0.0 50.00 2.3 98.93–2.2 1.39 0.1 53.98 2.4 99.18–2.1 1.79 0.2 57.93 2.5 99.38–2.0 2.27 0.3 61.79 2.6 99.53–1.9 2.87 0.4 65.54 2.7 99.65–1.8 3.59 0.5 69.15 2.8 99.74–1.7 4.46 0.6 72.58 2.9 99.81–1.6 5.48 0.7 75.80 3.0 99.87–1.5 6.68 0.8 78.81 3.1 99.90–1.4 8.08 0.9 81.59 3.2 99.93–1.3 9.68 1.0 84.13 3.3 99.95–1.2 11.51 1.1 86.43 3.4 99.97

Page 19: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Finding percentilesExampleThe percentile is 90, so that means a height of 73.1 in () tall is the 90th percentile of all heights of American men.

In other words, a man who is is taller than 90% of the men in the US.

NoteUse the same table for standard scores from any data set.

Page 20: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Finding percentilesExampleScores on the SAT math exam are normally distributed with a mean of 500 points and a standard deviation of 100 points. What percent of test takers score below 450? What percent are below 600?

We need to find the percentiles. Start with the standard scores:

Now find the percentile from the table:

Page 21: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Table of percentilesStandard Standard StandardScore Percentile Score Percentile Score Percentile–3.4 0.03 –1.1 13.57 1.2 88.49–3.3 0.05 –1.0 15.87 1.3 90.32–3.2 0.07 –0.9 18.41 1.4 91.92–3.1 0.10 –0.8 21.19 1.5 93.32–3.0 0.13 –0.7 24.20 1.6 94.52–2.9 0.19 –0.6 27.42 1.7 95.54–2.8 0.26 –0.5 30.85 1.8 96.41–2.7 0.35 –0.4 34.46 1.9 97.13–2.6 0.47 –0.3 38.21 2.0 97.73–2.5 0.62 –0.2 42.07 2.1 98.21–2.4 0.82 –0.1 46.02 2.2 98.61–2.3 1.07 0.0 50.00 2.3 98.93–2.2 1.39 0.1 53.98 2.4 99.18–2.1 1.79 0.2 57.93 2.5 99.38–2.0 2.27 0.3 61.79 2.6 99.53–1.9 2.87 0.4 65.54 2.7 99.65–1.8 3.59 0.5 69.15 2.8 99.74–1.7 4.46 0.6 72.58 2.9 99.81–1.6 5.48 0.7 75.80 3.0 99.87–1.5 6.68 0.8 78.81 3.1 99.90–1.4 8.08 0.9 81.59 3.2 99.93–1.3 9.68 1.0 84.13 3.3 99.95–1.2 11.51 1.1 86.43 3.4 99.97

Page 22: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Finding percentilesExampleThe percentile corresponding to a standard score of is 30.85, and the percentile corresponding to a standard score of is 84.13

This means that (roughly) 31% of test takers score below 450 on the SAT math exam, and 84% are below 600.

What percent of test takers score between 450 and 600?

Page 23: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Finding percentilesExampleWe can find the percent who score between 450 and 600 using the fact that 31% of test takers are below 450 and 84% are below 600.

Take the number of people who score below 600 and subtract the number who scored below 450. The result is the number who scored between 450 and 600.

The same is true for percentages:(% below 600)-(% below 450)= (% between 450 and 600)

So the percent who score between 450 and 600 is:

Page 24: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Comparing percentilesUsing the mean and standard deviation of a normal distribution, we can find the percentile of any data value from that distribution.

Percentiles are also very useful for comparing data values from different distributions.

Is a 600 on the SAT math test better or worse than a 26 on the ACT math test?

We can’t compare the numbers—the SAT is out of 800 and the ACT is out of 36.

Page 25: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Comparing percentilesInstead of comparing the numbers, we compare these test scores using percentiles.

Scores on the SAT are normally distributed with a mean of 500 and standard deviation of 100. Scores on the ACT are normally distributed with a mean of 18 and standard deviation of 6.

Find the standard scores:

Page 26: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Comparing percentilesFrom the table, a standard score of 1 is the 84th percentile, while a standard score of 1.3 is the 90th percentile. So a 26 on the ACT is better than a 600 on the SAT.

In what sense is it a better score?

Percentiles describe these scores relative to all the other test takers.

Scoring higher than 90% of the people who took a test is better than scoring higher than 84%.

Page 27: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Another normal distributionOne of the most important examples of a normal distribution is the sampling distribution of statistics

In a sample survey, we choose a sample and compute a statistic.

A different sample would have given a different statistic.

If we consider every possible sample, we would have a distribution of statistics (which numbers occur, and how often they occur).

Page 28: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Another normal distributionIt turns out that if our sample size is large enough, the distribution of statistics will be normal.

What is a large enough sample?

A general rule of thumb is a sample size of 30.

Page 29: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Another normal distributionExampleIn 2012 Barack Obama won the presidential election with 51.1% of the vote.

In the run up to the election, there were many polls of likely voters.

These polls were producing statistics to estimate a parameter: the proportion of all voters who were going to vote for Obama. Now, we know the value of this parameter to be 51.1%

Page 30: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Another normal distributionExampleSuppose a polling company sampled 100 voters before the election. It turns out that the distribution of statistics for a sample size of 100 is normal with mean 51.1% and standard deviation 5%.

(We’ll see the formulas for these later in the course.)

As decimals these are 0.511 and 0.05.

Page 31: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Another normal distributionExampleIn what percent of samples of 100 would more than 50% of the sample support Obama?

This is a question about percentiles.

As a decimal 50% is 0.5

Find the standard score:

Page 32: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Another normal distributionExampleFrom the table, the corresponding percentile is 42.07.

This means that in 42.07% of samples of 100 voters, the proportion who supported Obama would have been less than 50%.

To answer our question we must subtract the percentile from 100:

Page 33: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Another normal distributionExampleSo, in about 58% of the possible samples of 100 voters, we would have seen more than 50% of the sample supporting Obama.

Page 34: MAT 1000 Mathematics in Today's World. Last Time We looked at the standard deviation, a measurement of the spread of a distribution. We introduced a special.

Another normal distribution

This graphic illustrates the idea of a sampling distribution of statistics (denoted ). Imagine taking many samples of size 100 from a population with parameter 0.511.