Canonical analysis

24
CANONICAL ANALYSIS Wei-Jiun, Shen Ph. D.

Transcript of Canonical analysis

Page 1: Canonical analysis

CANONICAL ANALYSISWei-Jiun, Shen Ph. D.

Page 2: Canonical analysis

Purpose

To analyze the relationship between 2 sets of variables

Multiple IVs Multiple DVs

Page 3: Canonical analysis

Kinds of research questions

It is considered a descriptive technique or a screening procedure rather than hypothesis-testing procedure Number of canonical variate pairs interpretation of canonical variates Importance of canonical variates Canonical variate scores

Page 4: Canonical analysis

Limitations to factor analysis

Theoretical issues Interpretability Linear relationship Sensitivity Causality

Practical issues Ratio of cases to IVs 10:1 Normality, linearity and homoscedasticity (not required) Missing data Absence of outliers Absence of multicollinearity and singularity

Page 5: Canonical analysis

Fundamental equation for canonical analysis

Multiple regression

When Y is more than oneโ€ฆ

ipipiii xxxy 2211

Rpiiii xxxy 21

piiipiii xxxyyy 2121

Page 6: Canonical analysis

Fundamental equation for canonical analysis

Step 1: division of RR=R๐‘ฆ๐‘ฆ

โˆ’ 1R ๐‘ฆ๐‘ฅR๐‘ฅ๐‘ฅโˆ’1 R๐‘ฅ๐‘ฆ

Id TS TC BS BC1 1.0 1.0 1.0 1.02 7.0 1.0 7.0 1.03 4.6 5.6 7.0 7.04 1.0 6.6 1.0 5.95 7.0 4.9 7.0 2.96 7.0 7.0 6.4 3.87 7.0 1.0 7.0 1.08 7.0 1.0 2.4 1.0

TS TC BS BCTS 1.00

0-.16

1.758 -.34

1TC -.16

11.00

0.110 .857

BS .758 .110 1.000

.051

BC -.341

.857 .051 1.000

R๐‘ฅ ๐‘ฅ

R๐‘ฆ ๐‘ฅ

R๐‘ฅ๐‘ฆ

R๐‘ฆ๐‘ฆ

Page 7: Canonical analysis

Fundamental equation for canonical analysis

Step 2: eigenvalue and eigenvector

R=R๐‘ฆ๐‘ฆโˆ’ 1R ๐‘ฆ๐‘ฅR๐‘ฅ๐‘ฅ

โˆ’1 R๐‘ฅ๐‘ฆ

(Rโˆ’ ฮป I )K=0

(R ๐‘ฆ๐‘ฆโˆ’1 R๐‘ฆ๐‘ฅ R๐‘ฅ๐‘ฅ

โˆ’ 1R๐‘ฅ๐‘ฆโˆ’๐‘Ÿ ๐‘๐‘–2 I )K๐‘ž=0

๐‘’๐‘–๐‘”๐‘’๐‘›๐‘ฃ๐‘Ž๐‘™๐‘ข๐‘’=ฮ›=[๐‘Ÿ ๐‘12 โ‹ฏ โ‹ฏโ‹ฎ โ‹ฑ โ‹ฎโ‹ฎ โ‹ฏ ๐‘Ÿ๐‘ ๐‘–2 ]

๐‘’๐‘–๐‘”๐‘’๐‘›๐‘ฃ๐‘’๐‘๐‘ก๐‘œ๐‘Ÿ=K=[๐‘˜1 โ‹ฏ ๐‘˜๐‘ž ]

Do you smell something?

Page 8: Canonical analysis

Fundamental equation for canonical analysis

Step 1: division of R1

1

N

P

N*P

X

11

N

Q

N*Q

Y

11

N

Q

N*P

X Y

P1

1

Q

Q

(P+Q)*(P+Q)

P

PR๐‘ฅ๐‘ฅ

R๐‘ฆ ๐‘ฅ

R๐‘ฅ๐‘ฆ

R๐‘ฆ๐‘ฆ

Page 9: Canonical analysis

Fundamental equation for canonical analysis

Step 2: eigenvalue and eigenvector

1

NN*n

Y

1 2 3 nโ€ฆ1

NN*m

X

1 2 3 mโ€ฆ

๐‘’๐‘–๐‘”๐‘’๐‘›๐‘ฃ๐‘Ž๐‘™๐‘ข๐‘’=ฮ›

โ€ฆ โ€ฆ

๐‘’๐‘–๐‘”๐‘’๐‘›๐‘ฃ๐‘’๐‘๐‘ก๐‘œ๐‘Ÿ=K

Page 10: Canonical analysis

Fundamental equation for canonical analysis

ฯ‡1

ฯ‡2

ฯ‡3

ฯ‡4

X1

X2

X3

X4

X5

ฮท1

ฮท2

ฮท3

ฮท4

Y1

Y2

Y3

Y4

๐‘Ÿ๐‘ 1โ‘

๐‘Ÿ๐‘ 2โ‘

๐‘Ÿ๐‘ 3โ‘

๐‘Ÿ๐‘ 4โ‘

0 0

Canonical variate ฯ‡

Canonical variate ฮท

Canonical correlation

Page 11: Canonical analysis

Number of set of canonical correlation

๐œ’2=โˆ’[๐‘โˆ’1โˆ’(๐‘˜๐‘ฅ+๐‘˜๐‘ฆ+12 )] ln ฮ›๐‘š

ฮ›๐‘š=โˆ1

๐‘š

(1โˆ’ฮป ๐‘– )

F-test Wilkโ€™s lambda Pillaiโ€™s trace Hotellingโ€™s trace Royโ€™s gcr

Page 12: Canonical analysis

Canonical weight

Beta in regression Partialed out due to multicollinearity Instability

ฯ‡n

X1X2X3X4X5

ฮทn

Y1

Y2

Y3

Y4

๐‘Ÿ๐‘๐‘›โ‘ฮป๐‘ค๐‘ฅ๐‘›1

โ‘

ฮป๐‘ค๐‘ฅ๐‘›2โ‘

ฮป๐‘ค๐‘ฅ๐‘›3โ‘

ฮป๐‘ค๐‘ฅ๐‘›4โ‘

ฮป๐‘ค๐‘ฅ๐‘›5โ‘

ฮป๐‘ค๐‘ฆ๐‘›1โ‘

ฮป๐‘ค๐‘ฆ๐‘›2โ‘

ฮป๐‘ค๐‘ฆ๐‘›3โ‘

ฮป๐‘ค๐‘ฆ๐‘›4โ‘

ฯ‡ ๐‘›=โˆ‘1

๐‘–

๐‘‹๐‘–ร— ฮป๐‘ค๐‘ฅ๐‘›๐‘– ฮท๐‘›=โˆ‘1

๐‘–

๐‘Œ ๐‘–ร— ฮป๐‘ค๐‘ฆ๐‘›๐‘–

Page 13: Canonical analysis

Canonical loading

Structure factor loading in FA Criterion: >.3

ฯ‡n

X1X2X3X4X5

ฮทn

Y1

Y2

Y3

Y4

๐‘Ÿ๐‘๐‘›โ‘ฮป๐‘ฅ๐‘›1

โ‘

ฮป๐‘ฅ๐‘› 2โ‘

ฮป๐‘ฅ๐‘›3โ‘

ฮป๐‘ฅ๐‘› 4โ‘

ฮป๐‘ฅ๐‘› 5โ‘

ฮป ๐‘ฆ๐‘›1โ‘

ฮป ๐‘ฆ๐‘›2โ‘

ฮป ๐‘ฆ๐‘›3โ‘

ฮป ๐‘ฆ๐‘›4โ‘

Page 14: Canonical analysis

Canonical cross-loading

Correlations of each variable and other canonical variate

ฮป๐‘ฅ๐‘›๐‘– : ๐‘ฆโ‘ =๐‘Ÿ ๐‘๐‘›ร— ฮป๐‘ฅ๐‘›๐‘–

โ‘

ฮป ๐‘ฆ๐‘›๐‘–: ๐‘ฅโ‘ =๐‘Ÿ๐‘๐‘›ร—ฮป ๐‘ฆ๐‘›๐‘–โ‘

ฯ‡n

X1X2X3X4X5

ฮทn

Y1

Y2

Y3

Y4

๐‘Ÿ๐‘๐‘›โ‘ฮป๐‘ฅ๐‘› 1

โ‘

ฮป๐‘ฅ๐‘› 2โ‘

ฮป๐‘ฅ๐‘›3โ‘

ฮป๐‘ฅ๐‘› 4โ‘

ฮป๐‘ฅ๐‘›5โ‘

ฮป ๐‘ฆ๐‘›1โ‘

ฮป ๐‘ฆ๐‘›2โ‘

ฮป ๐‘ฆ๐‘›3โ‘

ฮป ๐‘ฆ๐‘› 4โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป๐‘ฅ๐‘› 1โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป๐‘ฅ๐‘› 2โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป๐‘ฅ๐‘› 3โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป๐‘ฅ๐‘›4โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป๐‘ฅ๐‘› 5โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป ๐‘ฆ๐‘›1โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป ๐‘ฆ๐‘›2โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป ๐‘ฆ๐‘›3โ‘

๐‘Ÿ๐‘๐‘›ร—ฮป ๐‘ฆ๐‘› 4โ‘

Page 15: Canonical analysis

Which interpretation approach to use

Priority (Hair et al., 2010)1. Canonical cross-loading2. Canonical loading3. Canonical weight

Page 16: Canonical analysis

Redundancy (index)

Variance the canonical variates from the IVs and extract from the DVs, and vice versa

๐‘๐‘ฃ ๐‘ฅ๐‘=โˆ‘1

๐‘– ฮป๐‘ฅ๐‘›๐‘–2

๐‘–

๐‘๐‘ฃ ๐‘ฆ๐‘=โˆ‘1

๐‘– ฮป๐‘ฆ๐‘›๐‘–2

๐‘–

๐‘Ÿ๐‘‘=๐‘๐‘ฃร—๐‘Ÿ๐‘๐‘›2

Adequacy coefficientRedundan

cy index

Page 17: Canonical analysis

Redundancy (index)

Variance the canonical variates from the IVs and extract from the DVs, and vice versa

ฯ‡n

X1X2X3X4X5

ฮทn

Y1

Y2

Y3

Y4

๐‘Ÿ๐‘๐‘›โ‘ฮป๐‘ฅ๐‘› 1

โ‘

ฮป๐‘ฅ๐‘› 2โ‘

ฮป๐‘ฅ๐‘›3โ‘

ฮป๐‘ฅ๐‘› 4โ‘

ฮป๐‘ฅ๐‘› 5โ‘

ฮป ๐‘ฆ๐‘›1โ‘

ฮป ๐‘ฆ๐‘›2โ‘

ฮป ๐‘ฆ๐‘›3โ‘

ฮป ๐‘ฆ๐‘› 4โ‘

๐‘๐‘ฃ ๐‘ฅ๐‘ ๐‘๐‘ฃ ๐‘ฆ๐‘

๐‘Ÿ ๐‘‘ฮท๐‘›โ†’ X=๐‘๐‘ฃ ๐‘ฅ๐‘ร—๐‘Ÿ๐‘๐‘›2 ๐‘Ÿ ๐‘‘ฯ‡๐‘›โ†’Y=๐‘๐‘ฃ ๐‘ฆ๐‘ร—๐‘Ÿ๐‘๐‘›2

Page 18: Canonical analysis

Some important issue

Importance of canonical variates Test for the significance Canonical correlation >.3 Variate and its own variables Redundancy

Interpretation of canonical variates Mathematical resolution of combining variables Loading >.3

Page 19: Canonical analysis

Procedure

1. Research question2. Designing a canonical analysis3. Check the assumptions4. Derive canonical analysis and assess overall

fit5. Interpret the canonical variate6. Validation and diagnosis

Page 20: Canonical analysis

PRACTICE

Page 21: Canonical analysis

้ŽๅŽปๅญธๆฅญ่กจ็พ่ˆ‡็พๅœจๅญธๆฅญ่กจ็พ็ ”็ฉถ็”Ÿ็„ฆ่‚ฒๅธƒๆƒณ็žญ่งฃ้ŽๅŽปๅญธๆฅญ่กจ็พ่ˆ‡็พๅœจๅญธๆฅญ่กจ็พไน‹้–“็š„้—œไฟ‚ใ€‚ไป–็š„็ ”็ฉถๅ•้กŒๆ˜ฏ๏ผŒๅคงๅญธ็”Ÿๅœจ้ซ˜ไธญๆ™‚ๆœŸ็š„ๅญธๆฅญ่กจ็พๆ˜ฏๅฆ่ˆ‡็พ้šŽๆฎต็š„ๅญธๆฅญ่กจ็พๆœ‰้—œ๏ผŸๅ…ถไธญ๏ผŒ้ซ˜ไธญๅญธๆฅญ่กจ็พๅŒ…ๅซๅœ‹ๆ–‡ใ€่‹ฑๆ–‡ใ€ไธ‰่ง’ๅ‡ฝๆ•ธ่ˆ‡็ทšๆ€งไปฃๆ•ธ็ญ‰ๅ››ๅ€‹็ง‘็›ฎ็š„่ฉ•้‡ๅˆ†ๆ•ธ๏ผŒๅคงๅญธ้šŽๆฎต็š„ๅญธๆฅญ่กจ็พๆŒ‡ๆจ™ๅ‰‡ๅŒ…ๅซๅœ‹ๆ–‡ใ€ๅค–่ชžใ€ๅพฎ็ฉๅˆ†่ˆ‡็ตฑ่จˆ็š„่ฉ•้‡ๅˆ†ๆ•ธใ€‚่ซ‹ไปฅๅ…ธๅž‹็›ธ้—œๅˆ†ๆž่งฃ็ญ”ๆญคๅ•้กŒใ€‚

Page 22: Canonical analysis

Canonical correlation

ฯ‡1

HS_LAN

HS_ENG

HS_TRI

HS_LIA

ฮท1

=.994-.99

-.99

-.61

-.30

CO_LAN

CO_ENG

CO_CAL

CO_STA

-.94

-.98

-.13

.15

ฯ‡1

HS_LAN

HS_ENG

HS_TRI

HS_LIA

ฮท1

=.965-.01

-.06

.75

.65

CO_LAN

CO_ENG

CO_CAL

CO_STA

-.27

-.17

.73

.77

๐‘Ÿ ๐‘‘ฯ‡๐‘›โ†’Y=.58

๐‘Ÿ ๐‘‘ฯ‡๐‘›โ†’Y=.29

๐‘Ÿ ๐‘‘ฮท๐‘›โ†’ X=.60

๐‘Ÿ ๐‘‘ฮท๐‘›โ†’ X=.23

Page 23: Canonical analysis

่บซ้ซ”ๆดปๅ‹•่ˆ‡ๆ™บ่ƒฝ็ ”็ฉถ็”Ÿๆธธๅฟ—็นชไพๆƒณ็žญ่งฃ่บซ้ซ”ๆดปๅ‹•ๅž‹ๆ…‹ๅฐๆ–ผๆ™บๅŠ›็š„ๅฝฑ้Ÿฟใ€‚ไป–็š„็ ”็ฉถๅ•้กŒๆ˜ฏ๏ผŒ้’ๅฐ‘ๅนด็š„่บซ้ซ”ๆดปๅ‹•่ˆ‡ๆ™บๅŠ›ไน‹้–“ๆ˜ฏๅฆๆœ‰้—œ๏ผŸๅ…ถไธญ๏ผŒ่บซ้ซ”ๆดปๅ‹•ๅŒ…ๅซๅๅผ็”Ÿๆดปใ€ๅฅ่ตฐใ€ไธญ็ญ‰ๅผทๅบฆไปฅๅŠ้ซ˜็ญ‰ๅผทๅบฆๆดปๅ‹•้‡็ญ‰ๅ››้ …ๆŒ‡ๆจ™๏ผŒๆ™บๅŠ›็š„ๆŒ‡ๆจ™ๅ‰‡ๅŒ…ๅซ่ชžๆ–‡ใ€ๆ•ธๅญธ้‚่ผฏใ€็ฉบ้–“ใ€้Ÿณๆจ‚ใ€่‚ข้ซ”ๅ‹•่ฆบใ€ๅ…ง็œใ€ไบบ้š›่ˆ‡่‡ช็„ถ่ง€ๅฏŸ็š„ๆธฌ้ฉ—่กจ็พใ€‚่ซ‹ไปฅๅ…ธๅž‹็›ธ้—œๅˆ†ๆž่งฃ็ญ”ๆญคๅ•้กŒใ€‚

Page 24: Canonical analysis

Canonical correlation

ฯ‡1

Strenuous

moderate

Walk

Sedentary

ฮท1

Language

Math

Space

Music=.351

-.98

-.74

-.13

.15

Kinesthesis

Introspection

Interpersonal

Nature science

-.43

-.06

.05

-.01

-.70

-.22

-.44

.01