METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student...

19
METHODS OF TRANSFORMING NON- POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS

Transcript of METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student...

Page 1: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

METHODS OF TRANSFORMING NON-POSITIVE DEFINITE

CORRELATION MATRICES

Katarzyna Wojtaszek

student number 1118676

CROSS

Page 2: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

I will try to answer questions:  How can I estimate correlation matrix when I have

data?

  What can I do if matrices are non-PD?

Shrinking method

Eigenvalues method

Vines method

 How can we calculate distances between original and transformed matrices?

Which method is the best?

comparing

conclusions

Page 3: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

  How can I estimate correlation matrix if I have data?

I can estimate the correlation matrices from data as follows:

1. I can estimate each off-diagonal element separately

n

ii

n

ii

n

iii

yyxx

yyxx

1

2_

1

2_

1

__

)()(

))((

Page 4: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

2. I can also estimate whole data together:

with

i=1,…,s ; j=1,…,n

__

1_

_

1

......)( n

n

T

xx

x

x

s

XXxC

s

xx

s

kik

i

1

_

][ ijxX

Page 5: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

What can I do when matrices are non- PD?

 We can use some methods for transforming these matrices to PD correlation matrices using:

Shrinking method

Eigenvalues method

Vines method

Page 6: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

How can we calculate distances between original and transformed matrices?  

There are many methods which we can use to calculate the distance between matrices .

In my project I used formula:

n

i

n

iijij rrRRd

1 1

2~~

2 )() ,(

Page 7: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

1. SHRINKING METHOD 

linear shrinking

Assumptions:

• Rnxn is given non-PD pseudo correlation matrix

• is arbitrary correlation matrix

Define: ( [0,1]) =R+ (R* - R) is a pseudo correlation matrix.

*nxnR

~

R

Page 8: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

Idea:

find the smallest such that matrix will be PD. Since R is non-PD then the smallest eigenvalue of R is negative , so we have to choose such that will be positive. Hence:

~

~

*)1()*)1((minmin1

~

1

xRxRxxxRx TT

xx

T

xx tt

And 0 if - / (*-).

So we find matrix which is PD matrix given

non-PD matrix R.

~

~

R

~

R

Page 9: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

non-linear shrinking

Assumption:

Rnxn is given non-PD pseudo correlation matrix

Procedure:

)())((

)(0

)())((

)(

11

1

11

frifrff

frif

frifrff

rg

ijij

ij

ijij

ij

where f is strictly increasing odd function with f(0)=0 and >0.

Page 10: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

I considered the following four functions:

)(tanh)(1 xxf

)(tanh)( 12 xxf

)arctan(2

)(3 xxf

2tan)(4

xxf

Page 11: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

In

Rnxn

nxnR~

nxnR~

SET OF PD-MATRICES

Linear shrinking

Non-linear shrinking

Comparison of the linear and non-linear shrinking methods

Page 12: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

 2.THE EIGENVALUE METHOD.

 Assumptions:

• Rnxn non-PD pseudo correlation matrix

•P -orthogonal matrix such that R=PDPT

•D matrix which the eigenvalues of R on the diagonal

is some constant 0

Page 13: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

Idea:

Replaced negative values in matrix D by .

We obtain:

R*=PD*PT

= where is a diagonal matrix

with diagonal elements equal

for i=1,2,…,n.

~

RT

DRD~

*~ ~

D

*

1

ijr

Page 14: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

3.VINES METHOD. Assumptions:

•Rnxn pseudo correlation matrix

Idea:

First we have to check if our matrix is PD

21,...,3;2

21,...,3;1

1,...,3;21,..,3;11,...,3;12,...,4,3;12

11

nnnn

nnnnnn

Page 15: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

If some (-1,1) we change the value

V( ) (-1,1)) and recalculate partial correlation using:

V( ) =V( )

+

We obtain new matrix , witch we have check again.

n,...,3;12

n,...,3;12

1,...,3;12 n n,...,4,3;12 21,...,3;2

21,...,3;1 11 nnnn

1,...,3;21,..,3;1 nnnn

Page 16: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

Example

Let say that we have matrix R4x4

44434241

34333231

24232221

14131211

Very useful is making graphical model

1

2

3

4

1;23

1;24

1413

12;34

12

Page 17: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

Which method is the best?Comparing. Using Matlab I chose randomly 500 non-PD matrices, transformed them and calculated the average distances between non-PD and PD matrices. This table shows us my results.

n 3 4 5 6 7 8 9 10

Lin. shrinking 2.7868 4.371 6.7233 9.8977 14.0027 18.4047 23.7102 29.6013

Shrinking f1 0.1388 0.4028 1.1251 2.5161 4.3623 6.76 9.8484 13.8416

Shrinking f2 0.2756 0.9696 2.382 4.6464 8.1327 11.4816 16.3835 20.5501

Shrinking f3 0.1441 0.4589 1.1432 2.5153 4.4483 6.9127 10.176 13.7543

Shrinking f4 0.4091 1.4379 3.3365 5.7357 8.6839 11.7034 15.686 18.9959

Eigenvalues 0.0861 0.2039 0.451 0.913 1.5799 2.3263 3.3845 4.7033

Vines 0.2285 1.2999 3.3251 6.6395 11.3295 17.813 24.7021 34.4963

Page 18: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

ILUSTATION: average distance

2 3 4 5 6 7 8 9 10 11

0

5

10

15

20

25

30

35

SIZE OF MATRICES

ME

AN

DIS

TA

NC

E

MEAN DISTANCE BETWEEN ORIGINAL AND TRANSFORMED MATRICES

linear shrinkink shrinking f1 shrinking f2 shrinking f3 shrinking f4 eigenvalue vines

Page 19: METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS.

Conclusions:

1. The reason that the linear shrinking is very bad method is that we shrink all elements by the same relative amount

2. The eigenvalues method performes fast and gives very good results regardless matrices dimensions

3. For the non-linear shrinking method the best choice of the projection function are and )(tanh)(1 xxf )arctan(

2)(3 xxf