Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers...

Section 4.2: How to Look for Differences

Cross-Tabulations

College student binge drinkers experienced many personal and social problems, the researchers said. For instance, 90% of those who binged more than once a week said they experienced a hangover and 63% said they did something they regretted. --- Newsday, 12/94

Making Sense of the Data

Were binge drinkers more likely to do things they regretted than students who did not binge drink?

Defn: Binge drinking – consuming 5 or more dinks in a row for men or four or more in a row for women.

Wechsler Study on American College Students

Nonbinge drinkers (n=6894)

Infrequent binge drinkers (n=4090)

Frequent binge drinkers (n=3291)

Did something you regret

14% 37% 63%

Missed a class

8% 30% 61%

Got behind in schoolwork

6% 21% 46%

Cross-Tab Tables

Categories of one measurement are listed on the top and the categories of the other measurement are listed on the side. Also, list the number of people that appear in each cell.

Wechsler Cross Tab

Nonbinge Infrequent Binge

Frequent Binge

Kept up with schoolwork

6,480 3,231 1,777

Got behind with schoolwork

414 859 1,514

Marginal Totals and Proportions

Cross Tabs will often show the totals (proportions) of each column in the bottom margin under the column and the total (proportions) of each row in the margin to the right of the row.

Cross-Tab with Column Marginals

Nonbinge Infrequent Binge

Frequent Binge

Kept up with schoolwork

6,480 3,231 1,777

Got behind with schoolwork

414 859 1,514

Column Count

6,894 4,090 3,291

Column % 48% 29% 23%

Correlation

A cross-tab is a useful way of checking for correlation between two categorical variables.

Two variables are correlated if knowing an observation’s value for one of the measurements helps you to better estimate the likelihoods of that observation’s value on the other measurement.

Correlation – Another View

If the counts you would expect if there was no correlation are the counts you observed, then there is no correlation.

Computing Expected Counts

Expected counts can be thought of as the count that would be if the row proportion applied equally to all the data in it; that is, if there were no correlation between the observations in the cross-tab.

Correlation in the Sample vs. Population Even though the expected counts may

show a correlation in the sample data, how does one show there is correlation in the population?Perform a hypothesis test.

Step 1: State the Null Hypothesis

H0: No correlation in the population data.

Remember…. H0 is always the hypothesis of “no difference”. In this instance “no difference” means no difference between expected and observed counts – or no correlation.

Step 2: Collect Sample Data

Use the Wechsler data.

Is the sample data statistically significant? That is, assuming the null hypothesis is correct (i.e. there is no correlation between the two variables), how likely is it to get the sample data that we observed?

Step 3: Compute the Appropriate Test Statistic In testing a claim about one sample

proportion, the test statistic is the z-test statistic.

Since we have several sample proportions we’ll need a new test statistic – the chi-squared test statistic.

To compute the test statistic:

Calculate the expected counts for each cell.

We’ll actually calculate the chi-squared test statistic tomorrow.

Step 4: Find the p-value

A p-value means just what it did earlier…. Assuming the null hypothesis is true (that is, there is no correlation between the two variables), it is the probability of getting sample data as extreme or more extreme than we observed.

Or more simply….

If there is no correlation (association) between the two variables, could the differences in the observed and expected cell counts be attributed to chance?

With this test we’ll use the TI-83 to find the p-value.

Step 4: Reject or retain the null hypothesis

Step 5: State Your Conclusions

There is statistical evidence to suggest that there is (is not) correlation between the two variables…..

Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers...

Documents

Transcript of Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers...