Problem :

62
Problem : Assume that among diabetics the fasting blood level of glucose is approximately normally distributed with a mean of 105mg per 100ml and an SD of 9 mg per 100 ml. What proportion of diabetics having fasting blood glucose levels between 90 and 125 mg per 100 ml ?

description

Problem : Assume that among diabetics the fasting blood level of glucose is approximately normally distributed with a mean of 105mg per 100ml and an SD of 9 mg per 100 ml. What proportion of diabetics having fasting blood glucose levels between 90 and 125 mg per 100 ml ?. - PowerPoint PPT Presentation

Transcript of Problem :

Page 1: Problem :

Problem:Assume that among diabetics the fasting blood level of glucose is approximately normally distributed with a mean of 105mg per 100ml and an SD of 9 mg per 100 ml. What proportion of diabetics having fasting blood glucose levels between 90 and 125 mg per 100 ml ?

Page 2: Problem :

NORMAL DISTRIBUTION AND ITS APPL ICATION

Page 3: Problem :

INTRODUCTION

Statistically, a population is the set of all possible values of a variable.

Random selection of objects of the population makes the variable a random variable ( it involves chance mechanism)

Example: Let ‘x’ be the weight of a newly born baby. ‘x’ is a random variable representing the weight of the baby.

The weight of a particular baby is not known until he/she is born.

Page 4: Problem :

Discrete random variable: If a random variable can only take

values that are whole numbers, it is called a discrete random variable.

Example: No. of daily admissions No. of boys in a family of 5 No. of smokers in a group of

100 persons.Continuous random variable:If a random variable can take any value,

it is called a continuous random variable.

Example: Weight, Height, Age & BP.

Page 5: Problem :

Continuous Probability Distributions

Continuous distribution has an infinite number of values between any two values assumed by the continuous variableAs with other probability distributions, the total area under the curve equals 1Relative frequency (probability) of occurrence of values between any two points on the x-axis is equal to the total area bounded by the curve, the x-axis, and perpendicular lines erected at the two points on the x-axis

Page 6: Problem :

Histogram

Figure 1 Histogram of ages of 60 subjects

11.5 21.5 31.5 41.5 51.5 61.5 71.5

0

10

20

Age

Freq

uen

cy

Page 7: Problem :

The Normal or Gaussian distribution is the most important continuous probability distribution in statistics.

The term “Gaussian” refers to ‘Carl Freidrich Gauss’ who develop this distribution.

The word ‘normal’ here does not mean ‘ordinary’ or ‘common’ nor does it mean ‘disease-free’.

It simply means that the distribution conforms to a certain formula and shape.

Page 8: Problem :

Histograms

A kind of bar or line chart Values on the x-axis (horizontal) Numbers on the y-axis (vertical)

Normal distribution is defined by a particular shape Symmetrical Bell-shaped

Page 9: Problem :

A Perfect Normal Distribution

Page 10: Problem :

Gaussian Distribution

Many biologic variables follow this pattern

Hemoglobin, Cholesterol, Serum Electrolytes, Blood pressures, age, weight, height

One can use this information to define what is normal and what is extremeIn clinical medicine 95% or 2 Standard deviations around the mean is normal Clinically, 5% of “normal” individuals

are labeled as extreme/abnormal We just accept this and move on.

Page 11: Problem :
Page 12: Problem :

The Normal Distribution

Page 13: Problem :

Characteristics of Normal DistributionSymmetrical about mean, Mean, median, and mode are equalTotal area under the curve above the x-axis is one square unit1 standard deviation on both sides of the mean includes approximately 68% of the total area 2 standard deviations includes

approximately 95% 3 standard deviations includes

approximately 99%

Page 14: Problem :

Characteristics of the Normal Distribution

Normal distribution is completely determined by the parameters and Different values of shift the

distribution along the x-axis Different values of determine

degree of flatness or peakedness of the graph

Page 15: Problem :
Page 16: Problem :

Applications of Normal DistributionFrequently, data are normally distributed Essential for some statistical procedures If not, possible to transform to a more

normal form

Approximations for other distributionsBecause of the frequent occurrence of the normal distribution in nature, much statistical theory has been developed for it.

Page 17: Problem :

What’s so Great about the Normal Distribution?

If you know two things, you know everything about the distribution Mean Standard deviation

You know the probability of any value arising

Page 18: Problem :

Standardised ScoresMy diastolic blood pressure is 100 So what ?

Normal is 90 (for my age and sex) Mine is high

But how much high?

Express it in standardised scores How many SDs above the mean is

that?

Page 19: Problem :

Mean = 90, SD = 4 (my age and sex)

This is a standardised score, or z-scoreCan consult tables (or computer) See how often this high (or higher)

score occur 99.38% of people have lower scores

My Score - Mean Score 100-90

2.5SD 4

Page 20: Problem :

Standard Scores

The Z score makes it possible, under some circumstances, to compare scores that originally had different units of measurement.

Page 21: Problem :

Z score

Allows you to describe a particular score in terms of where it fits into the overall group of scores. Whether it is above or below the average and

how much it is above or below the average.

A standard score that states the position of a score in relation to the mean of the distribution, using the standard deviation as the unit of measurement. The number of standard deviations a score is

above or below a mean.

Page 22: Problem :

Z Score

Suppose you scored a 60 on a numerical test and a 30 on a verbal test. On which test did you perform better? First, we need to know how other people did on

the same tests. Suppose that the mean score on the numerical test

was 50 and the mean score on the verbal test was 20. You scored 10 points above the mean on each test. Can you conclude that you did equally well on both

tests? You do not know, because you do not know if 10

points on the numerical test is the same as 10 points on the verbal test.

Page 23: Problem :

Z Score

Suppose you scored a 60 on a numerical test and a 30 on a verbal test. On which test did you perform better? Suppose also that the standard deviation on

the numerical test was 15 and the standard deviation on the verbal test was 5. Now can you determine on which test you did

better?

Page 24: Problem :

Z Score

Page 25: Problem :

Z Score

Page 26: Problem :

Z score

To find out how many standard deviations away from the mean a particular score is, use the Z formula:Population: Sample:

X

ZS

XXZ

Page 27: Problem :

Z Score

667.15

5060

Z

25

2030

Z

In relation to the rest of the people who took the tests, you did better on the verbal test than the numerical test.

X

Z

Page 28: Problem :

Properties of Z scores

The standard deviation of any distribution expressed in Z scores is always one. In calculating Z scores, the standard

deviation of the raw scores is the unit of measurement.

Page 29: Problem :

Properties of Z scores

Transforming raw scores to Z scores changes the mean to 0 and the standard deviation to 1, but it does not change the shape of the distribution. For each raw score we first subtract a constant

(the mean) and then divide by a constant (the standard deviation).

The proportional relation that exists among the distances between the scores remains the same.

If the shape of the distribution was not normal before it was transformed, it will not be normal afterward.

Page 30: Problem :

The Standard Normal Curve

Page 31: Problem :

The Standard Normal Table

Using the standard normal table, you can find the area under the curve that corresponds with certain scores.The area under the curve is proportional to the frequency of scores.The area under the curve gives the probability of that score occurring.

Page 32: Problem :

Standard Normal Table

Page 33: Problem :

Reading the Z Table

Finding the proportion of observations between the mean and a score when Z = 1.80

Page 34: Problem :

Reading the Z Table

Finding the proportion of observations above a score when Z = 1.80

Page 35: Problem :

Reading the Z Table

Finding the proportion of observations between a score and the mean when Z = -2.10

Page 36: Problem :

Reading the Z Table

Finding the proportion of observations below a score when Z = -2.10

Page 37: Problem :

Z scores and the Normal Distribution

Can answer a wide variety of questions about any normal distribution with a known mean and standard deviation.Will address how to solve two main types of normal curve problems: Finding a proportion given a score. Finding a score given a proportion.

Page 38: Problem :

Finding Proportions

Page 39: Problem :

Example: Finding a Proportion Below the Mean

2. Use C’ Column1.

110

3020

10,30,20

Z

X

XZ

3.

Page 40: Problem :

Example: Finding a Proportion Below the Mean

2. Use C’ Column

1.

3. 15.87 % of customers

4.

Page 41: Problem :

Example: Finding a Proportion Between the Mean and a Score

1.

2. Use B Column

116

100116

16,100,116

Z

X

XZ

3.

Page 42: Problem :

Example: Finding a Proportion Between the Mean and a Score

1.

2. Use B Column

3.

4.

34.13% of the population

Page 43: Problem :

Example: Finding a Proportion Below a Score

3.

Given that IQ is normally distributed with a mean of 100 and a standard deviation of 16, what proportion of the population has an IQ below 124?

1.

2. 0.50 + B Column

5.116

100124

16,100,124

Z

X

XZ

Page 44: Problem :

Example: Finding a Proportion Below a Score

3.

Given that IQ is normally distributed with a mean of 100 and a standard deviation of 16, what proportion of the population has an IQ below 124?

1.

2. 0.50 + B Column

.50 + .4332 = .9332

.9332 of the population

4.

Page 45: Problem :

Example: Finding a Proportion Between Two Scores

1.

2. B’ + B

75.24

89100

25.24

8980

4,89,100,80

2

1

21

x

x

Z

Z

XX

XZ

3.

Page 46: Problem :

Example: Finding a Proportion Between Two Scores

1.

2. B’ + B

3.

.4878+.4970=.9848

98.48%

4.

Page 47: Problem :

Example: Finding a Proportion Between Two Scores

Given that scores on the Social Adjustment Scale are normally distributed with a mean of 50 and a standard deviation of 10, what proportion of scores are between 30 and 40?

1.

2. Larger C’-Smaller C’

210

5030

110

5040

10,50,30,40

2

1

21

x

x

Z

Z

XX

XZ

3.

4030

Page 48: Problem :

Example: Finding a Proportion Between Two Scores

Given that scores on the Social Adjustment Scale are normally distributed with a mean of 50 and a standard deviation of 10, what proportion of scores are between 30 and 40?

1.

2. Larger C’-Smaller C’

3.

.1587-.0228 =.13594.

30 40

Page 49: Problem :

Finding Scores

ZX

XZ

XZ

XZ

Page 50: Problem :

Example: Finding a Score from a Proportion (or Percentile Rank)

Given that scores on the Social Adjustment Scale are normally distributed with a mean of 50 and a standard deviation of 10, what two scores correspond to the middle 95%?

3.

1.

2. 0.025 in C and C’

6.694.30

)10(96.150

10,50,96.1

andX

X

andZ

zX

4.

Page 51: Problem :

Example: Finding a Score from a Proportion (or Percentile Rank)

Given that IQ scores are normally distributed with a mean of 100 and a standard deviation of 16, what score corresponds to the 95th percentile?

1.

2. 0.50 + 0.45 (B Column)

3.

40.126

)16(65.1100

16,100,65.1

X

X

andz

ZX

4.

Page 52: Problem :
Page 53: Problem :
Page 54: Problem :
Page 55: Problem :
Page 56: Problem :

Normal Distributions Go Wrong

Wrong shape Non-symmetrical

Skew Too fat or too narrow

Kurtosis

Aberrant values Outliers

Page 57: Problem :

Effects of Non-Normality

Skew Bias parameter estimates

E.g. mean

Kurtosis Doesn’t effect parameter estimates

Does effect standard errors

Outliers Depends

Page 58: Problem :

Distributions

Bell-Shaped (also known as symmetric” or “normal”)

Skewed: positively (skewed to

the right) – it tails off toward larger values

negatively (skewed to the left) – it tails off toward smaller values

Page 59: Problem :

0

0.001

0.002

0.003

0.004

0.005

0.006

0.007

0.008

Positive Kurtosis

Negative Kurtosis

Normal

Kurtosis

Page 60: Problem :

Outliers

Value

Fre

qu

en

cy

20

10

0

Page 61: Problem :

Dealing with Outliers

Error Data entry error Correct it

Real value Difficult Delete it

Page 62: Problem :

ANY

QUESTIONS