Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

38
Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule

Transcript of Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Page 1: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Last lecture summary• Bias, Bessel's correction• MAD• Normal distribution• Empirical rule

Page 2: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Standard deviation – empirical rule

Page 3: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Standard deviation – empirical rule

Page 4: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Standard deviation – empirical rule

Page 5: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

The nature of the normal distribution• In laboratory experiments, results vary due to several

factors: imprecise weighting of reagents, imprecise pipetting, nonhomogeneous suspensions of cells or membarnes ...

• Similarly, variation in a clinical value might be caused by many genetic and environmental factors.

• These random factors are independent and they tend to offset each other.

• Variation among values will approximate a Gaussian distribution• when there are many independent sources of variation• when individual sources add up to get the final result

Page 6: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

New stuff

Page 7: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

STANDARD NORMAL DISTRIBUTION

Page 8: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Who is more popular?

Page 9: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Who is more popular

s.d. = 36

s.d. = 60

Z = -3.53

Z = -2.57

Page 10: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Standardizing

Page 11: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Formula

Page 12: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz• What does a negative Z-score mean?

1. The original value is negative.

2. The original value is less than mean.

3. The original value is less than 0.

4. The original value minus the mean is negative.

Page 13: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz II• If we standardize a distribution by converting every value

to a Z-score, what will be the new mean of this standardized distribution?

• If we standardize a distribution by converting every value to a Z-score, what will be the new standard deviation of this standardized distribution?

Page 14: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Standard normal distribution

N(,)

N(,)

Page 15: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Z

Z – number of standard deviations away from the mean

If the Z-value is 1, how many percent are less than that value?

cca 84 %

0 +1 +2 +3-1-2-3

Page 16: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Proportion of human heights

Page 17: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

+1-1-2 +20

Page 18: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz• Approximately what proportion of people is smaller than

168 cm?

173 178 183168163

16%

Page 19: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz• Approximately what proportion of people is higher than

183 cm?

173 178 183168163

2.5%

Page 20: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz• Approximately what proportion of people is between 163

cm and 178 cm high?

173 178 183168163

81.5%

Page 21: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz• Approximately what proportion of people is smaller than

180 cm?

173 178 183168163

ca 91.5%

Page 22: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz• What is the probability of randomly selecting a height in

the sample that is >5 standard deviations above the mean?1. 0.01

2. 0.3

3. 0.8

4. 0.99

Page 23: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz• What is the probability of randomly selecting a height in

the sample that is <5 standard deviations below the mean?1. 0.01

2. 0.3

3. 0.8

4. 0.99

Page 24: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz• What proportion of the data is either below 2 standard

deviations or above 2 standard deviations from the mean for a normal distribution?

95%

2.5% 2.5%

Page 25: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Z-tableWhat is the proportion less than the point with the Z-score -2,75?

Page 26: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Use Z-table

What proportion of people is smaller than 180 cm?

Z-value of 1.4 corresponds to 91.92%.

Page 27: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz – height data

• What proportion of people is smaller than you?• see Z-table

• What proportion of people is taller than you?• see Z-table• Table gives a value “less than”.• Note, that “greater than x” is the same as “less than -x”.

Page 28: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Quiz – height data

• What proportion of people lie between you and you?

• How tall should you be to be in the top 5% of the highest people?

Page 29: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

An intriguing fact

Page 30: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

DISTRIBUTION, DISTRIBUTION, ARE YOU NORMAL?

Page 31: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Life expectancy data – histogram

life expectancy

fre

qu

en

cy

Page 32: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Making conclusions from a histogram• What can you tell about life expectancy data?

• how many modes?• where is the mode?• symmetric, left skewed or right skewed?• outliers – yes or no?

life expectancy

fre

qu

en

cy

Page 33: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Making conclusions from a histogram• Where is the mode, the median, the mean?

life expectancy

fre

qu

en

cy

Page 34: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Min. Q1 Median Q3 Max. 47.79 64.67 73.24 76.65 83.39

Five numbers summary

8.5>3.5

25.4>10.2

mean=69.9

Page 35: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Lognormal distribution• Frazier et al. measured the ability of a drug isoprenaline to

relax the bladder muscle.• The results are expressed as the EC50, which is the

concentration required to relax the bladder halfway between its minimum and maximum possible relaxation.

Page 36: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Lognormal distribution

Page 37: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

Geometric mean

Geometric mean – transform all values to their logarithms, calculate the mean of the logarithms, transform this mean back to the units of original data (antilog)

𝑥=1333𝑛𝑀 𝑥=2.71𝑥=102.71=513nM

Page 38: Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.

The nature of the lognormal distribution

• Lognormal distributions arise when multiple random factors are multiplied together to determine the value.• A typical example: cancer (cell division is multiplicative)

• Lognormal distributions are very common in many scientific fields.• Drug potency is lognormal

• To analyse lognormal data, do not use methods that assume the Gaussian distribution. You will get misleding results (e.g.,non-existing outliers).

• Better way is to convert data to logarithm and analyse the converted values.