Essential Question: How do scientists use statistical analyses to draw meaningful conclusions from...

53
STATISTICAL ANALYSIS

Transcript of Essential Question: How do scientists use statistical analyses to draw meaningful conclusions from...

Page 1: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

STATISTICAL ANALYSIS

Page 2: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Essential Question:

How do scientists use statistical analyses to draw meaningful conclusions from experimental results?

Design, conduct and ANALYZE an experimental study

Standard

Page 3: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Purpose of Statistical Analysis

Determines whether the results found in an experiment are meaningful.

Answers the question:Does the data gathered in an

experiment support the research hypothesis or cause us to reject it?

Page 4: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

2 types of Statistics

Descriptive statistics is the use of statistical and graphical techniques to present information about the data set being studied.

Inferential statistics is the use of drawing conclusions about a population using a sample considered to be representative of that population.

Page 5: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Example StudyA researcher performs a clinical trial (randomized, controlled, double-blind experimental study) to determine whether a new medication is effective in lowering LDL cholesterol levels in patients with hypercholesterolemia.

○ The researcher believes that the medication will be more effective in lowering LDL cholesterol levels in patients with hypercholesterolemia than a placebo.

Page 6: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Step 1

Where does a researcher begin analysis of the data collected?

Create a Hypothesis

Page 7: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

First: Identify the Hypothesis When statistically analyzing experiments, it is

necessary to set up two hypotheses.

The first hypothesis is called the null hypothesis.

The second hypothesis is called the alternative hypothesis.

○ Note: The alternative hypothesis should always be set up before beginning the experiment.

Page 8: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Null Hypothesis Null Hypothesis (H0): The starting point in scientific

research where the experimenter assumes there is no effect of the treatment or no relationship between two variables.

Example: The mean of group one’s data set is equal to the mean of group two’s data set.

LDL experiment Null Hypothesis: There is no difference in LDL cholesterol levels in patients

given the medication than in patients given the placebo.

Page 9: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Alternative Hypothesis

Alternative Hypothesis (Ha): (Also called research hypothesis) What the experimenter thinks may be true (what will happen) before beginning the experiment.

Two types of alternative hypotheses: Directional Nondirectional

Page 10: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Alternative Hypothesis Directional Alternative Hypothesis: Hypothesis

predicting that the mean ( ) of one group will be more or less than the mean of the other group. Example:

Non-directional Alternative Hypothesis: Hypothesis predicting that the mean of one group is not equal to the mean of the other group(without specifying whether it will be more or less).

○ Example:

Page 11: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Alternative Hypotheses for the LDL Medication Experiment

Directional Alternative Hypothesis: The LDL cholesterol levels will be lower in the patients given

the medication than the patients given the placebo.

Non-Directional Alternative Hypothesis: The LDL cholesterol levels for the patients receiving the

medication will not equal the LDL cholesterol levels for the patients receiving the placebo.

Page 12: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Researcher Gathers Data

Patient LDL levels

A 159 g/dL

B 169 g/dL

C 133 g/dL

D 126 g/dL

Patient LDL levels

A 174 g/dL

B 151 g/dL

C 148 g/dL

D 156 g/dL

Patients given the medication Patients given the placebo

What does the researcher do now? Does this data support/reject the alternative

hypothesis? How can he/she be sure?

Page 13: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Descriptive Statistics

Begin with the descriptive Measuring mean, median and mode

○ Know when to use which oneMeasure range, standard deviation and

variance ○ Understand degrees of freedom

Page 14: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Determine Typical Data Values

Determine what values are typical, or normal, for the data collected (for the sample).

○Calculate the mean, variance, and standard deviation.

Page 15: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Mean

Mean ( ): The arithmetic average.

Page 16: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Example Calculation for the LDL Medication Experiment

Page 17: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Next Step?

Now that the mean has been calculated, it is important to determine how spread out the data is from the mean.Two measures of spread are variance and

standard deviation.

Page 18: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

VarianceVariance: The measure of the spread of the data about the mean. Variance is referred to as the average deviation of the data points from the mean. The more spread out the data points are, the larger the variance will be.

○ Step One: Calculate the deviation (or difference of the data point from the mean) for each data point.

○ Step Two: Square these deviations to ensure that all values are positive.

○ Step Three: Calculate the sum of all of these squared deviations.

○ Step Four: Divide by the number of data points in the data set minus one.

Page 19: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Calculating the Variation

Sample size (number of data points) = n

Page 20: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Variance Calculation for the LDL Medication Experiment

Page 21: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Standard DeviationStandard Deviation: The measure of the spread of

the data about the mean

When calculating the variance, you must square each difference in order to obtain all positive numbers. This results in large numbers.

The standard deviation is the square root of the variance, resulting in a number representative of the data set because it is in the same scale and same unit of measure as the original data points. This provides a more accurate measure of the spread than the variance.

Step One: Calculate the variance for the data set.Step Two: Take the square root of the variance.

Page 22: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Calculating the Standard Deviation

Page 23: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Example Calculation for the LDL Medication Experiment

Page 24: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Importance of Calculating the Standard Deviation

The standard deviation summarizes how close the data points are to the mean.

Determines if data set is normally distributed.

We can say that the data set is normally distributed if:○ Approximately 68% of the data falls within one standard

deviation of the mean;○ Approximately 95% of the data falls within two standard

deviations of the mean; and○ Over 99% of the data falls within three standard

deviations of the mean.

Page 25: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.
Page 26: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.
Page 27: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

If normally distributed:

Can confidently represent the data set with the mean

Must use either the median or the mode to represent the data set due to outliers and/or a confounding variable

If NOT normally distributed:

Page 28: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Determining Statistical Significance

The results show that the mean of the LDL levels of the subjects given the experimental medication were lower than the LDL levels of the subjects given the placebo.

How can the researcher determine whether the difference between the experimental group and control group in lowering blood LDL levels was actually due to the medication or due to chance?i.e., Were the results statistically significant?

Page 29: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Statistical Significance

The measure of the probability of getting a test statistic rare enough that the null hypothesis can be comfortably rejected.

0.05 (5%) is the widely accepted statistical significance level in biology

○ Meaning: there is a probability of 5% or less that the test statistic is calculated by chance, given the sample data.

Page 30: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

How do we determine significant differences between the means of two sets of data?

t-test

Page 31: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

t-tests

t-test: Type of statistical calculation used to determine whether the differences between the means of two samples are statistically significant.Two main types of t-tests are commonly

used to analyze biomedical data:○ Student’s t-test (i.e., independent t-test)○ Paired t-test (i.e., dependent t-test)

Page 32: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Student’s t-test

Student’s t-test: Used to determine whether the difference between the means of two independent groups (both which are being tested for the same dependent variable) is statistically significant.

Example: Study set up where one group is given the experimental treatment and another group is given the placebo.

Page 33: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

There are three variations of the same formula for the student’s t-test:

The first variation should be used when the sample sizes are unequal AND either one or both samples are small (n<30).

The second variation should be used when the sample sizes are equal (regardless of size).

The third variation should be used when sample sizes are unequal AND both sample sizes are large (n>30).

Page 34: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Paired (dependent) t-test

Paired t-test: Used to determine whether the difference between the means of two groups (each containing the same participants and being tested at two different points) is statistically significant.

Example: Study set up where the same group of participants is followed before and after an experimental treatment.

Page 35: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Formula for Paired t-test

Page 36: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Now Back to the LDL Medication Experiment Determine which type of t-test (student’s

t-test or paired t-test) is most appropriate for our LDL levels experiment.Because the two groups being tested are

independent of each other (participants in the experimental group and the control group are different), the student’s t-test is the appropriate test to use.

Page 37: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Which variation of the formula should we use?

Both the experimental group and control group contain four participants. Because the sample sizes are equal, the second variation should be used.

Page 38: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Calculations:

Page 39: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Calculations:

Page 40: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

You’re Not Done Yet

What you just calculated is called the t value.

Next you will use a t-table in order to determine whether your t value is statistically significant.

Page 41: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

T-Table

Page 42: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Step 1Calculate the degrees of freedom for the

experiment.

How to Use a T-Table

Page 43: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Degrees of Freedom

Degrees of Freedom: A measure of certainty that the sample populations are representative of the population being studied.To calculate the degrees of freedom for a

sample:

Total # data points in sample(s) - # populations being sampled

(Typically n-1)

Page 44: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Formulas for DF:

Page 45: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Calculating the Degrees of Freedom for the LDL Medication Experiment

Page 46: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

How to Use a T-Table

Step 2

Find the row corresponding with the appropriate degrees of freedom for your experiment.

Page 47: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

How to Use a T-TableStep 3 - Determine whether the t value exceeds any of the critical values in the corresponding row.If the t value exceeds any of the critical values in

the row, the alternative hypothesis can be accepted. This means that the results ARE STATISTICALLY SIGNIFICANT.

If the t value is smaller than all of the critical values in the row, the alternative hypothesis can be rejected and the null hypothesis can be accepted. This means that the results are NOT STATISTICALLY SIGNIFICANT.

Page 48: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

How to Use a T-Table Step 4If the t value exceeds any of the critical values

in the row, you need to:1) Follow the corresponding row over to the right until

you locate the column with the critical value that is just slightly smaller than the t value.

2) Follow this column to the top of the table and determine its corresponding p value.

3) Determine which p value to use, the p value corresponding with a one-tailed test for significance versus the p value corresponding with a two-tailed test for significance.

Page 49: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

One-Tailed vs. Two-Tailed Test for Significance In order to determine whether you

completed a one-tailed or two-tailed test for significance, look back to your alternative hypothesis.If the alternative hypothesis was directional,

you have completed a one-tailed test for significance.

If the alternative hypothesis was nondirectional, you have completed a two-tailed test for significance.

Page 50: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

One-Tailed or Two-Tailed Test for Significance?

The Alternative Hypothesis was as follows:The researcher hypothesizes that the

medication will be more effective in lowering LDL cholesterol levels in patients with hypercholesterolemia than the placebo.• Since this is a directional alternative

hypothesis, you have completed a one-tailed test for significance.

Page 51: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

What Does the p Value Mean?

The p values indicate the probability that the difference between the means of the two samples are due only to chance.If the p value ≤ 0.01, the results are VERY

SIGNIFICANT.○ The probability that the difference is due to chance

is less than or equal to1%.If the p value ≤ 0.05, the results are SIGNIFICANT.

○ The probability that the difference is due to chance is less than or equal to 5%.

If the p value > 0.05, the results are NOT SIGNIFICANT. ○ The probability that the difference is due to chance

is greater than 5%.

Page 52: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Using the T-Table for the LDL Medication Experiment

Page 53: Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?  Design, conduct and ANALYZE.

Putting It All Together The group given the new medication had a mean

of 146.75 and a standard deviation of 20.53. The group given the placebo had a mean of

157.25 and a standard deviation of 11.64. The t value for this study was 0.89, with a p value

greater than 0.05. This means that the results are NOT statistically significant. This means that the researcher can reject the

alternative hypothesis and accept the null hypothesis that the new medication did NOT lower patients’ LDL levels more than the placebo.