Determinants of Correlation Matrices with...

71
Determinants of Correlation Matrices with Applications Tiefeng Jiang School of Statistics, University of Minnesota July 14, 2016

Transcript of Determinants of Correlation Matrices with...

Page 1: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Determinants of Correlation Matrices withApplications

Tiefeng Jiang

School of Statistics, University of Minnesota

July 14, 2016

Page 2: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Outline

Correlation matrix

What we know about it?

Why study determinant?

Our result

Application

Proof

Page 3: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Outline

Correlation matrix

What we know about it?

Why study determinant?

Our result

Application

Proof

Page 4: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Outline

Correlation matrix

What we know about it?

Why study determinant?

Our result

Application

Proof

Page 5: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Outline

Correlation matrix

What we know about it?

Why study determinant?

Our result

Application

Proof

Page 6: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Outline

Correlation matrix

What we know about it?

Why study determinant?

Our result

Application

Proof

Page 7: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Outline

Correlation matrix

What we know about it?

Why study determinant?

Our result

Application

Proof

Page 8: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Correlation Matrix

Pearson’s correlation coefficient:

a = (a1, · · · , an); b = (b1, · · · , bn)

a = 1n

∑ni=1 ai; b = 1

n

∑ni=1 bi

ra,b =

∑(ai − a)(bi − b)√∑

(ai − a)2√∑

(bi − b)2

ra,b is cosine of angle between(a1 − a, · · · , an − a) and (b1 − b, · · · , bn − b)

Page 9: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Correlation Matrix

Pearson’s correlation coefficient:

a = (a1, · · · , an); b = (b1, · · · , bn)

a = 1n

∑ni=1 ai; b = 1

n

∑ni=1 bi

ra,b =

∑(ai − a)(bi − b)√∑

(ai − a)2√∑

(bi − b)2

ra,b is cosine of angle between(a1 − a, · · · , an − a) and (b1 − b, · · · , bn − b)

Page 10: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Correlation Matrix

Pearson’s correlation coefficient:

a = (a1, · · · , an); b = (b1, · · · , bn)

a = 1n

∑ni=1 ai; b = 1

n

∑ni=1 bi

ra,b =

∑(ai − a)(bi − b)√∑

(ai − a)2√∑

(bi − b)2

ra,b is cosine of angle between(a1 − a, · · · , an − a) and (b1 − b, · · · , bn − b)

Page 11: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Correlation Matrix

Pearson’s correlation coefficient:

a = (a1, · · · , an); b = (b1, · · · , bn)

a = 1n

∑ni=1 ai; b = 1

n

∑ni=1 bi

ra,b =

∑(ai − a)(bi − b)√∑

(ai − a)2√∑

(bi − b)2

ra,b is cosine of angle between(a1 − a, · · · , an − a) and (b1 − b, · · · , bn − b)

Page 12: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

x1, · · · , xn i.i.d from Np(µ,Σ), Σ = (σij)p×p.

• Correlation matrix Rn = (rij)p×p, where

rii = 1 and rij =σij√σiiσjj

• Sample correlation matrix Rn := (rij)p×p, where

rij is Pearson’s correlation coefficient of ith row and jth row of(x1, · · · , xn)p×n

Remember difference between Rn and Rn

Page 13: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

x1, · · · , xn i.i.d from Np(µ,Σ), Σ = (σij)p×p.

• Correlation matrix Rn = (rij)p×p, where

rii = 1 and rij =σij√σiiσjj

• Sample correlation matrix Rn := (rij)p×p, where

rij is Pearson’s correlation coefficient of ith row and jth row of(x1, · · · , xn)p×n

Remember difference between Rn and Rn

Page 14: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

x1, · · · , xn i.i.d from Np(µ,Σ), Σ = (σij)p×p.

• Correlation matrix Rn = (rij)p×p, where

rii = 1 and rij =σij√σiiσjj

• Sample correlation matrix Rn := (rij)p×p, where

rij is Pearson’s correlation coefficient of ith row and jth row of(x1, · · · , xn)p×n

Remember difference between Rn and Rn

Page 15: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

x1, · · · , xn i.i.d from Np(µ,Σ), Σ = (σij)p×p.

• Correlation matrix Rn = (rij)p×p, where

rii = 1 and rij =σij√σiiσjj

• Sample correlation matrix Rn := (rij)p×p, where

rij is Pearson’s correlation coefficient of ith row and jth row of(x1, · · · , xn)p×n

Remember difference between Rn and Rn

Page 16: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

x1, · · · , xn i.i.d from Np(µ,Σ), Σ = (σij)p×p.

• Correlation matrix Rn = (rij)p×p, where

rii = 1 and rij =σij√σiiσjj

• Sample correlation matrix Rn := (rij)p×p, where

rij is Pearson’s correlation coefficient of ith row and jth row of(x1, · · · , xn)p×n

Remember difference between Rn and Rn

Page 17: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Connections

• Largest entry of Rn test independence of entries of N(µ,Σ)Cai and J. (2011)

• Largest entry of Rn is a statistic in Compressed Sensing; MIPCai and J. (2011)

• Random packing on sphere Sp−1

Cai, Fan and J. (2013).

Page 18: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Connections

• Largest entry of Rn test independence of entries of N(µ,Σ)Cai and J. (2011)

• Largest entry of Rn is a statistic in Compressed Sensing; MIPCai and J. (2011)

• Random packing on sphere Sp−1

Cai, Fan and J. (2013).

Page 19: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Connections

• Largest entry of Rn test independence of entries of N(µ,Σ)Cai and J. (2011)

• Largest entry of Rn is a statistic in Compressed Sensing; MIPCai and J. (2011)

• Random packing on sphere Sp−1

Cai, Fan and J. (2013).

Page 20: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

What We Know about Sample Correlation Matrix?

Assume Σ = I.

• Joint pdf of eigenvalues unknown, no invariance

• Empirical dist. of eigenvalues of Rn is asymptoticallyMarchenko-Pastur law. J. (2004)

• Largest eigenvalue of Rn → Tracy-Widom lawBao, Pan & Zhou. (2012)

• log |Rn| satisfies CLTJiang and Yang (2013) and Jiang and Qi (2015)

Page 21: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

What We Know about Sample Correlation Matrix?

Assume Σ = I.

• Joint pdf of eigenvalues unknown, no invariance

• Empirical dist. of eigenvalues of Rn is asymptoticallyMarchenko-Pastur law. J. (2004)

• Largest eigenvalue of Rn → Tracy-Widom lawBao, Pan & Zhou. (2012)

• log |Rn| satisfies CLTJiang and Yang (2013) and Jiang and Qi (2015)

Page 22: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

What We Know about Sample Correlation Matrix?

Assume Σ = I.

• Joint pdf of eigenvalues unknown, no invariance

• Empirical dist. of eigenvalues of Rn is asymptoticallyMarchenko-Pastur law. J. (2004)

• Largest eigenvalue of Rn → Tracy-Widom lawBao, Pan & Zhou. (2012)

• log |Rn| satisfies CLTJiang and Yang (2013) and Jiang and Qi (2015)

Page 23: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

What We Know about Sample Correlation Matrix?

Assume Σ = I.

• Joint pdf of eigenvalues unknown, no invariance

• Empirical dist. of eigenvalues of Rn is asymptoticallyMarchenko-Pastur law. J. (2004)

• Largest eigenvalue of Rn → Tracy-Widom lawBao, Pan & Zhou. (2012)

• log |Rn| satisfies CLTJiang and Yang (2013) and Jiang and Qi (2015)

Page 24: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

In particular, Jiang, Yang & Qi proved

Theorem

Assume p := pn satisfy that n > p + 4 and p→∞. Set

µn =(p− n +

32)

log(

1− pn− 1

)− n− 2

n− 1p ;

σ2n = −2

[ pn− 1

+ log(

1− pn− 1

)].

Then, (log |Rn| − µn)/σn → N(0, 1).

Page 25: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Little is known when Rn 6= I.

Page 26: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Why study |Rn| under Rn 6= I?

• New understanding of sample correlation matrix |Rn|

• Recently, Tao and Vu (2012); Nguyen & Vu (2014): CLT fordeterminant of Wigner matrix

• Cai, Liang, Zhou (2015) study CLT for determinant of Wishartmatrix

•We have a problem from high-dimensional statistics on |Rn|

High-dimensional statistics + Machine Learning = Big Data

Page 27: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Why study |Rn| under Rn 6= I?

• New understanding of sample correlation matrix |Rn|

• Recently, Tao and Vu (2012); Nguyen & Vu (2014): CLT fordeterminant of Wigner matrix

• Cai, Liang, Zhou (2015) study CLT for determinant of Wishartmatrix

•We have a problem from high-dimensional statistics on |Rn|

High-dimensional statistics + Machine Learning = Big Data

Page 28: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Why study |Rn| under Rn 6= I?

• New understanding of sample correlation matrix |Rn|

• Recently, Tao and Vu (2012); Nguyen & Vu (2014): CLT fordeterminant of Wigner matrix

• Cai, Liang, Zhou (2015) study CLT for determinant of Wishartmatrix

•We have a problem from high-dimensional statistics on |Rn|

High-dimensional statistics + Machine Learning = Big Data

Page 29: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Why study |Rn| under Rn 6= I?

• New understanding of sample correlation matrix |Rn|

• Recently, Tao and Vu (2012); Nguyen & Vu (2014): CLT fordeterminant of Wigner matrix

• Cai, Liang, Zhou (2015) study CLT for determinant of Wishartmatrix

•We have a problem from high-dimensional statistics on |Rn|

High-dimensional statistics + Machine Learning = Big Data

Page 30: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

What kind of Rn are interesting?

• Compound symmetry structure1 a a · · · aa 1 a · · · aa a 1 · · · a...

......

...a a a · · · 1

• Autoregressive process of order 1

1 ρ ρ2 · · · ρp−1

ρ 1 ρ · · · ρp−2

ρ2 ρ 1 · · · ρp−3

......

......

ρp−1 ρp−2 ρp−3 · · · 1

Page 31: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

What kind of Rn are interesting?

• Compound symmetry structure1 a a · · · aa 1 a · · · aa a 1 · · · a...

......

...a a a · · · 1

• Autoregressive process of order 11 ρ ρ2 · · · ρp−1

ρ 1 ρ · · · ρp−2

ρ2 ρ 1 · · · ρp−3

......

......

ρp−1 ρp−2 ρp−3 · · · 1

Page 32: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

What kind of Rn are interesting?

• Compound symmetry structure1 a a · · · aa 1 a · · · aa a 1 · · · a...

......

...a a a · · · 1

• Autoregressive process of order 1

1 ρ ρ2 · · · ρp−1

ρ 1 ρ · · · ρp−2

ρ2 ρ 1 · · · ρp−3

......

......

ρp−1 ρp−2 ρp−3 · · · 1

Page 33: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Banded matrix

1 a12 0 · · · 0 0a21 1 a23 · · · 0 00 a32 1 · · · 0 0...

......

......

...0 0 0 · · · 1 ap−1 p

0 0 0 · · · ap p−1 1

Other important matrices:Toeplitz, Hankel, symmetric circulant matricesBrockwell and Davis (2002)

Page 34: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Banded matrix

1 a12 0 · · · 0 0a21 1 a23 · · · 0 00 a32 1 · · · 0 0...

......

......

...0 0 0 · · · 1 ap−1 p

0 0 0 · · · ap p−1 1

Other important matrices:Toeplitz, Hankel, symmetric circulant matricesBrockwell and Davis (2002)

Page 35: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Our Result

x1, · · · , xn i.i.d from Np(µ,Σ), Σ = (σij)p×p.

• Correlation matrix Rn = (rij)p×p, where

rii = 1 and rij =σij√σiiσjj

• Sample correlation matrix Rn := (rij)p×p, where

rij is Pearson’s correlation coefficient of ith row and jth row of(x1, · · · , xn)p×n

Page 36: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Theorem

Assume p := pn:n > p + 4→∞ and infn≥6 λmin(Rn) > 1

2 . Set

µn =(p− n +

32)

log(

1− pn− 1

)− n− 2

n− 1p + log |Rn| ;

σ2n = −2

[ pn− 1

+ log(

1− pn− 1

)]+

2n− 1

tr [(Rn − I)2].

Then, (log |Rn| − µn)/σn → N(0, 1) ifpn → c > 0 or supn≥6

pn‖Rn−I‖∞‖Rn−I‖2

<∞

00 := 1

Page 37: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Why we need infn≥6 λmin(Rn) > 12 ?

• Essentially, it is from 1√2π

e−12 x2

• Technically, it will be clear latter

• If λmin(Rn) < 12 , a different limit dist. is possible

Page 38: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Corollaries

• Compound symmetry structure

Rn =

1 a a · · · aa 1 a · · · aa a 1 · · · a...

......

...a a a · · · 1

Smallest eigenvalue is 1− a.

Page 39: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

‖Rn − I‖∞ = a and ‖Rn − I‖2 = a√

p(p− 1). Then

p‖Rn − I‖∞‖Rn − I‖2

=

√p

p− 1< 2

Corollary

Assume p := pn satisfy n > p + 4→∞. Let a ∈ (0, 1/2). Then,(log |Rn| − µn)/σn → N(0, 1), where

|Rn| = (1 + a(p− 1))(1− a)p−1 and tr [(Rn − I)2] = p(p− 1)a2

Page 40: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

‖Rn − I‖∞ = a and ‖Rn − I‖2 = a√

p(p− 1). Then

p‖Rn − I‖∞‖Rn − I‖2

=

√p

p− 1< 2

Corollary

Assume p := pn satisfy n > p + 4→∞. Let a ∈ (0, 1/2). Then,(log |Rn| − µn)/σn → N(0, 1), where

|Rn| = (1 + a(p− 1))(1− a)p−1 and tr [(Rn − I)2] = p(p− 1)a2

Page 41: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

‖Rn − I‖∞ = a and ‖Rn − I‖2 = a√

p(p− 1). Then

p‖Rn − I‖∞‖Rn − I‖2

=

√p

p− 1< 2

Corollary

Assume p := pn satisfy n > p + 4→∞. Let a ∈ (0, 1/2). Then,(log |Rn| − µn)/σn → N(0, 1), where

|Rn| = (1 + a(p− 1))(1− a)p−1 and tr [(Rn − I)2] = p(p− 1)a2

Page 42: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

• Autoregressive process of order 1• Banded matrix

Corollary

Assume pn → c > 0. Then

(i) If Rn = (rij)p×p with rij = ρ|i−j| and |ρ| < 15 then CLT holds

(ii) Let Rn = (rij)p×p satisfy rij = 0 for |j− i| > k andmaxi6=j |rij| < 1

4k . Then, CLT holds

Page 43: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

• Autoregressive process of order 1• Banded matrix

Corollary

Assume pn → c > 0. Then

(i) If Rn = (rij)p×p with rij = ρ|i−j| and |ρ| < 15 then CLT holds

(ii) Let Rn = (rij)p×p satisfy rij = 0 for |j− i| > k andmaxi6=j |rij| < 1

4k . Then, CLT holds

Page 44: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

• Autoregressive process of order 1• Banded matrix

Corollary

Assume pn → c > 0. Then

(i) If Rn = (rij)p×p with rij = ρ|i−j| and |ρ| < 15 then CLT holds

(ii) Let Rn = (rij)p×p satisfy rij = 0 for |j− i| > k andmaxi6=j |rij| < 1

4k . Then, CLT holds

Page 45: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Application

x1, . . . , xn are i.i.d. from Np(µ,Σ) with large p. Consider

H0 : R = Ip vs Ha : R 6= Ip.

Entries of x1 are independent.

Likelihood Ratio Test: rejection region {|Rn| ≤ cα},where cα = µn,0 + σn,0Φ−1(α) and

µn,0 = (p− n +32

) log(1− pn− 1

)− n− 2n− 1

p ;

σ2n,0 = −2

[ pn− 1

+ log(

1− pn− 1

)].

Jiang, Qi and Yang (2013, 2015).

Page 46: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Application

x1, . . . , xn are i.i.d. from Np(µ,Σ) with large p. Consider

H0 : R = Ip vs Ha : R 6= Ip.

Entries of x1 are independent.

Likelihood Ratio Test: rejection region {|Rn| ≤ cα},where cα = µn,0 + σn,0Φ−1(α) and

µn,0 = (p− n +32

) log(1− pn− 1

)− n− 2n− 1

p ;

σ2n,0 = −2

[ pn− 1

+ log(

1− pn− 1

)].

Jiang, Qi and Yang (2013, 2015).

Page 47: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

To evaluate test, need to compute power:P(reject | null hypothesis is wrong). That is,

β(R) = P(log |Rn| ≤ cα|R)

∼ Φ(cα − µn

σn

)where

µn =(p− n +

32)

log(

1− pn− 1

)− n− 2

n− 1p + log |Rn| ;

σ2n = −2

[ pn− 1

+ log(

1− pn− 1

)]+

2n− 1

tr [(Rn − I)2].

Page 48: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

To evaluate test, need to compute power:P(reject | null hypothesis is wrong). That is,

β(R) = P(log |Rn| ≤ cα|R)

∼ Φ(cα − µn

σn

)where

µn =(p− n +

32)

log(

1− pn− 1

)− n− 2

n− 1p + log |Rn| ;

σ2n = −2

[ pn− 1

+ log(

1− pn− 1

)]+

2n− 1

tr [(Rn − I)2].

Page 49: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

To evaluate test, need to compute power:P(reject | null hypothesis is wrong). That is,

β(R) = P(log |Rn| ≤ cα|R)

∼ Φ(cα − µn

σn

)where

µn =(p− n +

32)

log(

1− pn− 1

)− n− 2

n− 1p + log |Rn| ;

σ2n = −2

[ pn− 1

+ log(

1− pn− 1

)]+

2n− 1

tr [(Rn − I)2].

Page 50: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Sketch of Proof

• Zn → N(0, 1) if EetZn → et2/2 for |t| ≤ 1

The above may fail if EetZn → et2/2 for 0 < t ≤ 1 only.

• So to prove (log |Rn|−µn)/σn → N(0, 1), need to evaluate E(|Rn|s)

• Generalized Gamma function:

Γp(z) := πp(p−1)/4p∏

i=1

Γ(

z− 12

(i− 1))

for z with Re(z) > 12(p− 1).

Page 51: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Sketch of Proof

• Zn → N(0, 1) if EetZn → et2/2 for |t| ≤ 1

The above may fail if EetZn → et2/2 for 0 < t ≤ 1 only.

• So to prove (log |Rn|−µn)/σn → N(0, 1), need to evaluate E(|Rn|s)

• Generalized Gamma function:

Γp(z) := πp(p−1)/4p∏

i=1

Γ(

z− 12

(i− 1))

for z with Re(z) > 12(p− 1).

Page 52: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Sketch of Proof

• Zn → N(0, 1) if EetZn → et2/2 for |t| ≤ 1

The above may fail if EetZn → et2/2 for 0 < t ≤ 1 only.

• So to prove (log |Rn|−µn)/σn → N(0, 1), need to evaluate E(|Rn|s)

• Generalized Gamma function:

Γp(z) := πp(p−1)/4p∏

i=1

Γ(

z− 12

(i− 1))

for z with Re(z) > 12(p− 1).

Page 53: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Proposition

x1, · · · , xn: i.i.d. with Np(µ,Σ) and n = m + 1 > p.Set ∆n = Rn − I. Then

E[|Rn|t] =( Γ(m

2 )

Γ(m2 + t)

)p·

Γp(m2 + t)

Γp(m2 )

·|Rn|t · E[|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t]

for t > 0, where V1, · · · ,Vp: i.i.d. Beta(t, m2 )-dist.

• Drawback: Beta(t, m2 )-dist forces t > 0. Traditional method not

work; Moments not explicit

• Chance: i.i.d.!

Page 54: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Proposition

x1, · · · , xn: i.i.d. with Np(µ,Σ) and n = m + 1 > p.Set ∆n = Rn − I. Then

E[|Rn|t] =( Γ(m

2 )

Γ(m2 + t)

)p·

Γp(m2 + t)

Γp(m2 )

·|Rn|t · E[|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t]

for t > 0, where V1, · · · ,Vp: i.i.d. Beta(t, m2 )-dist.

• Drawback: Beta(t, m2 )-dist forces t > 0. Traditional method not

work; Moments not explicit

• Chance: i.i.d.!

Page 55: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Proposition

x1, · · · , xn: i.i.d. with Np(µ,Σ) and n = m + 1 > p.Set ∆n = Rn − I. Then

E[|Rn|t] =( Γ(m

2 )

Γ(m2 + t)

)p·

Γp(m2 + t)

Γp(m2 )

·|Rn|t · E[|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t]

for t > 0, where V1, · · · ,Vp: i.i.d. Beta(t, m2 )-dist.

• Drawback: Beta(t, m2 )-dist forces t > 0. Traditional method not

work; Moments not explicit

• Chance: i.i.d.!

Page 56: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

If Rn = (rij) with rij = a for i 6= j. Then

|I + ∆n · diag(V1, · · · ,Vp)|

=[ p∏

i=1

(1− aVi)]·(

1 +

p∑i=1

aVi

1− aVi

)where V1, · · · ,Vp: i.i.d. Beta(t, m

2 )-dist.

Page 57: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Proposition

{Zn; n ≥ 1}: supn≥0 E(|Zn|p) <∞ for p ≥ 1 andlimn→∞ EetZn = EetZ0 for t ∈ [0, δ].

If dist. of Z0 can be determined uniquely by moments{E(Zp

0); p = 1, 2, · · · }, then Zn → Z0

Page 58: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Proposition

{Zn; n ≥ 1}: supn≥0 E(|Zn|p) <∞ for p ≥ 1 andlimn→∞ EetZn = EetZ0 for t ∈ [0, δ].

If dist. of Z0 can be determined uniquely by moments{E(Zp

0); p = 1, 2, · · · }, then Zn → Z0

Page 59: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

• Step 1

|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t ∼ e−t2m · tr (∆2

n)

in probability as n→∞

• Step 2 (most efforts){LHSRHS

; n ≥ 6}

is uniformly integrable

Then

E[|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t] ∼ e−

t2m · tr (∆2

n)

Page 60: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

• Step 1

|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t ∼ e−t2m · tr (∆2

n)

in probability as n→∞

• Step 2 (most efforts){LHSRHS

; n ≥ 6}

is uniformly integrable

Then

E[|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t] ∼ e−

t2m · tr (∆2

n)

Page 61: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

• Step 1

|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t ∼ e−t2m · tr (∆2

n)

in probability as n→∞

• Step 2 (most efforts){LHSRHS

; n ≥ 6}

is uniformly integrable

Then

E[|I + ∆n · diag(V1, · · · ,Vp)|−(m/2)−t] ∼ e−

t2m · tr (∆2

n)

Page 62: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Write

E[|Rn|t]

= E[|Rn,0|t] · |Rn|t · E[|I + ∆n · diag(V1, · · · ,Vp)|−

n−12 −t]

where E[|Rn,0|t] is the case when Rn = I. By earlier result, we knowbehavior of E[|Rn,0|t].

Combine them to have

E exp( log |Rn| − µn

σns)→ es2/2

for 0 ≤ s ≤ δ.

Page 63: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Write

E[|Rn|t]

= E[|Rn,0|t] · |Rn|t · E[|I + ∆n · diag(V1, · · · ,Vp)|−

n−12 −t]

where E[|Rn,0|t] is the case when Rn = I. By earlier result, we knowbehavior of E[|Rn,0|t].

Combine them to have

E exp( log |Rn| − µn

σns)→ es2/2

for 0 ≤ s ≤ δ.

Page 64: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Write

E[|Rn|t]

= E[|Rn,0|t] · |Rn|t · E[|I + ∆n · diag(V1, · · · ,Vp)|−

n−12 −t]

where E[|Rn,0|t] is the case when Rn = I. By earlier result, we knowbehavior of E[|Rn,0|t].

Combine them to have

E exp( log |Rn| − µn

σns)→ es2/2

for 0 ≤ s ≤ δ.

Page 65: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

• Step 3

supn≥6

E[( log |Rn| − µn

σn

)2k]<∞

for k = 1, 2, · · ·

• N(0, 1) is uniquely determined by its moments.

The proof is complete.

Page 66: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

• Step 3

supn≥6

E[( log |Rn| − µn

σn

)2k]<∞

for k = 1, 2, · · ·

• N(0, 1) is uniquely determined by its moments.

The proof is complete.

Page 67: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Two Concentration Inequalities

Lemma

Given s > 0, define t = tn = sσn

.V1, · · · ,Vp: i.i.d. Beta(t, m

2 )-dist.Then, for ρ ∈ (0, 1

2), there exist M > 0 and n0 ≥ 1 s.t.

P( p∑

i=1

Vi > y)≤ e−ρmy

as y ≥ M ptm and n ≥ n0

Not tandard Chernoff bound

Page 68: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Lemma

Given s > 0, define t = sσn

. Recall Rn = (rij).V1, · · · ,Vp: i.i.d. Beta(t, m

2 )-dist.Assume infn≥6

pnn > 0. Then, there exists δ > 0 s.t.

P(∑

i 6=j

r2ijViVj ≥ y

)≤ exp

(− 1

256· m2y

pt + m√

y

)for all y > 1

m , s ∈ (0, δ] and n ≥ 6

The proof is based on Yurinskii’s ineq. + matrix tricks

Hanson-Wright ineq. is not enough (Rudelson, Vershynin, 2013)

Page 69: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Let Qn = |I + ∆n · diag(V1, · · · ,Vp)|−m2−t. Then

Qn � exp(

m1− λmin

2λmin

p∑i=1

Vi

).

λmin = λmin(Rn). To bound EQn, use formula

EeγH ≤ c +

∫ ∞0

eγxP(H > x) dx

where H is r.v. and γ > 0 is const.

By earlier concentration ineq., this forces λmin >12 .

Page 70: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

Let Qn = |I + ∆n · diag(V1, · · · ,Vp)|−m2−t. Then

Qn � exp(

m1− λmin

2λmin

p∑i=1

Vi

).

λmin = λmin(Rn). To bound EQn, use formula

EeγH ≤ c +

∫ ∞0

eγxP(H > x) dx

where H is r.v. and γ > 0 is const.

By earlier concentration ineq., this forces λmin >12 .

Page 71: Determinants of Correlation Matrices with Applicationsmath0.bnu.edu.cn/probab/Workshop2016/Talks/JIangTF.pdf · Determinants of Correlation Matrices with Applications Tiefeng Jiang

The End!