SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

42
SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

description

SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic. Learning Objectives. Review previous statistical tests and how they relate to the levels of measurement for a variable Describe how a Chi-Square test uses cross tabulation - PowerPoint PPT Presentation

Transcript of SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Page 1: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

SPSS Session 5:

Association between Nominal Variables Using Chi-Square Statistic

Page 2: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Learning Objectives

• Review previous statistical tests and how they relate to the levels of measurement for a variable

• Describe how a Chi-Square test uses cross tabulation

• Using a table, calculate a Chi-Square statistic• Conduct a Chi-Square analysis and SPSS and

interpret the findings

Page 3: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Statistical Tests Review• t-tests

– Mean differences between two groups – Uses a t-test statistic– One nominal variable and one interval/ratio variable

• ANOVA (Analysis of Variance)– Mean differences between three or more groups– Uses a F-test statistic– One nominal variable and one interval/ratio variable

• Correlation– Test of association between two interval/ratio variables– Uses Pearson’s r-values as a correlation coefficient– Tells you the magnitude, direction, and statistical significance of the relationship

• Regression– Test of prediction between two interval/ratio variables– Uses a F-test statistic– Helps create a useful equation for prediction of the value in one variable from

another

Page 4: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Test

• The Chi-Square (χ2) test offers a test of association between two nominal/ordinal variables

• Social work examples:– Gender and whether or not an individual reports

having elevated distress scores or not– Gender and either normal, borderline, or abnormal

scores on a measure– Previous involvement in child protection services and

whether or not an individual reports having elevated distress scores or not

Page 5: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Cross Tabulation Example 1

• The Chi-Square (χ2) test is best shown through the use of a cross tabulation table

• Take for example a table that displays the relationship between gender of the parent/carer and whether or not they reported having elevated distress scores on the GHQ measure

• This will be a 2x2 table with each variable having two categories

Page 6: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

We can see the following from this table:• 6 men reported normal levels of distress (subclinical) on the GHQ measure• 8 men reported clinically elevated levels of distress on the GHQ measure• 47 women reported normal levels of distress (subclinical) on the GHQ measure• 34 women reported clinically elevated levels of distress on the GHQ measure• There were 14 men and 81 women in the sample• 53 individuals reported normal levels of distress (subclinical) on the GHQ

measure• 42 individuals reported clinically elevated levels of distress on the GHQ measure• 95 individuals were in the sample

Page 7: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Cross Tabulation Example 2

• A second example is the relationship between the degree of previous involvement in child protections services and whether or not they reported having elevated distress scores on the GHQ measure

• This will be a 3x2 table with degree of previous involvement having three categories

Page 8: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Cross Tabulation Example 2

Page 9: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

The Chi-Square (χ2) Test Process• The χ2 Test is designed to test for differences

between the what was observed and what you would expect if there is no association between the variables

• Observed numbers in each category in your table • Expected numbers is the arrangement of the

cases in the table if all of the cases were spread evenly across all the categories in the table

Page 10: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Expected vs. Observed Values

• Expected values are those in each category if there was no association between your two variables.

• Expected values are your null hypothesis, that is no association between the variables

• Let’s look at an example

Page 11: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square Example 1

• In our child protection study, we later collected information about if cases were referred for child protection services within a year of their closure.

• Cases were viewed as successfully closed if there was no later re-referral for additional child protection services.

• This information in the “Case_Referral” variable has two categories and is a nominal variable.

Page 12: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square Example 1

• We wanted to know if there was an association between whether a parent/carer reported elevated levels of psychological distress and later re-referral for additional child protection services.

• We would expect that these variables would be associated (research hypothesis).

• Our null hypothesis would be that no significant association exists.

Page 13: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square Example 1: Observed Values

• Here is the cross tabulation table from this analysis.

• These are our observed values

Page 14: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square Example 1: Observed Values

• If the null hypothesis was true and no association between these variables existed, what would the numbers in the table be?

Page 15: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square Example 1: Expected Values

• If the null hypothesis was true and no association between these variables existed, these would be the expected values:

27 2622 20

Page 16: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square Example 1: Expected Values• For each category, the numbers are evenly

spread across the table as though the null hypothesis was true and no association between the variables exists.

27 2622 20

Page 17: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Expected Values in Chi-Square • Expected values for each category are

calculated by this formula:

E = Expected FrequencyR = Marginal Row total

C = Marginal Column totalN = Total number of cases

E R CN

( )( )

Page 18: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Marginal Totals

27.3 25.721.7 20.3

Page 19: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

27.3 25.721.7 20.3

Marginal Totals

Expected valuesSubclinical scores and no referral = (53 x 49)/95 = 27.3Subclinical scores and referral = (53 x 46)/95 = 25.7Clinical score and no referral = (42 x 49)/95 = 21.7Clinical score and referral = (42 x 46)/95 = 20.3

Calculating Expected Values

Page 20: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square Statistic

• The Chi-Square (χ2) assesses the size of the differences between the observed and expected values in a cross tabulation table

22

( )O E

E

Page 21: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Observed values (the actual data)

Expected values (if null hypothesis is true)

27.3 25.721.7 20.3

Page 22: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square Statistic Calculated

(33-27.3)2 + (20-25.7)2 + (26-20.3)2 + (16-21.7)2

27.3 25.7 20.3 21.7

32.49 + 32.49 + 32.49 + 32.4927.3 25.7 20.3 21.7

1.19 + 1.264 + 1.6 + 1.497

X 2 = 5.55

22

( )O E

E

Page 23: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1• We wanted to know if there was an association

between whether a parent/carer reported elevated levels of psychological distress and later re-referral for additional child protection services.

• We would expect that these variables would be associated (research hypothesis).

• Our null hypothesis would be that no significant association exists.

• Our χ2= 5.55 is an approximate calculation.• To determine if it is statistically significant and not

due to chance, let’s turn to SPSS

Page 24: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS• From the “Analyze” menu, select “Descriptive

Statistics”, and finally “Crosstabs”

Page 25: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS

• Find “GHQ_Cutoff_4” variable which is our variable indicating whether a parent or carer reported clinically elevated GHQ scores

• Place this variable in the “Row(s)” list

• Find “Case_Referral” variable which indicates whether a case was referred later for additional child protection services

• Place this variable in the “Column(s)” list

Page 26: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS

Page 27: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS

• Within the “Statistics” menu, select “Chi-Square”• Press “Continue”

Page 28: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS

• Within the “Cells” menu, there are plenty of options:

Page 29: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS

• Within the “Cells” menu, there are plenty of options:– Counts: will provide the observed and expected values

for the formula we have seen previously– Percentages: will provide all of the percentages per

category, marginal totals, or grand total in the analysis which can be useful

• For our analysis, just select “Observed” and “Expected” counts for now.

• Press “Continue” and then “OK” to conduct analysis

Page 30: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS

• From the new output, we see the new cross tabulation table with the observed and expected values

Page 31: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS

• Finally, we look to the top of the last table for our significance value and the SPSS calculated χ2 value

• The χ2 value is 5.480, and the significance level is p<.05. It was calculated to be lower than our hand calculation due to rounding error.

• There is a statistically significant association between these two nominal/ordinal variables.

• We would reject the null hypothesis.

Page 32: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 1: SPSS

• Whether a parent/carer reported elevated GHQ scores was associated with future referrals to additional child protection services (χ2 =5.480, df=1, p<.05).

• These variables are associated but interpreting the Chi-Square test requires a visual inspection to know where exactly our observed values were different than our expected values.

Page 33: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

• Look the cross tabulation table, and compare observed and expected values per category

• New referrals appeared to be less common in the group of parents/carers reporting lower GHQ scores.

• New referrals appeared much more in group with elevated GHQ scores.

• With these findings and a statistically significant χ2 value, we can assume parent/carer psychological distress was associated with future need for additional child protection services!

Page 34: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 2

• Now let’s consider the Chi-Square analysis example from earlier.

• We wanted to know if the gender of the respondent (parent or carer) was associated with their reporting of elevated GHQ scores or not.

Page 35: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 2: SPSS

• Suppose for a moment that we have no reason to suspect that gender is associated with psychological distress.

• Our research hypothesis then would actually be the null hypothesis!

• We would hypothesize that gender and elevated psychological distress scores are unrelated.

• To demonstrate our research hypothesis, we would hope to fail to reject the null hypothesis.

Page 36: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 2: SPSS• From the “Analyze” menu, select “Descriptive

Statistics”, and finally “Crosstabs”

Page 37: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 2: SPSS

• Find “Gender_Respondent” is the gender of the parent or carer who responded to the family questionnaire

• Place this variable in the “Row(s)” list

• Find “GHQ_Cutoff_4” variable which is our variable indicating whether a parent or carer reported clinically elevated GHQ scores

• Place this variable in the “Column(s)” list

Page 38: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 2: SPSS

Page 39: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 2: SPSS

• Within the “Statistics” menu, select “Chi-Square”• Press “Continue”

Page 40: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

• Within the “Cells” menu, select “Observed” and “Expected”, and then “Continue”.

• Press “Continue” and then “OK” to conduct analysis

Page 41: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

• Below is the cross tabulation between the two variables.

• Note that the observed values are rather close to the expected values.

• This may indicate that there is not much of an association between the variables.

Page 42: SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic

Chi-Square (χ2) Example 2: SPSS

• The last table confirms our research hypothesis which was the null hypothesis on this occasion.

• Gender and reporting of elevated GHQ scores do not have a statistically significant association in our study (χ2= 1.113, df=1, p>.05). The obtained p-value in this analysis was .291, well above our standard of α=.05.