Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s...

43
Introduction Graphical models with symmetry Symmetry restrictions on concentrations Symmetry restrictions on partial correlations Permutation symmetry Stability of mean spaces Group generated partitions References Estimation of Means in Graphical Gaussian Models with Symmetries Steffen Lauritzen 1 University of Oxford Fields Institute April 16, 2012 1 Representing joint work with Helene Gehrmann Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symme

Transcript of Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s...

Page 1: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Estimation of Means in Graphical GaussianModels with Symmetries

Steffen Lauritzen1

University of Oxford

Fields Institute

April 16, 2012

1Representing joint work with Helene GehrmannSteffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 2: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Graphical Gaussian modelsAdding meansBehrens–Fisher problemKruskal’s theorem

ConsiderY = (Yα)α∈V ∼ N|V |(0,Σ)

and let let K = Σ−1 be the concentration matrix.The partial correlation between Yα and Yβ given all other variablesis

ραβ |V \α,β = −kαβ/√

kααkββ . (1)

Thuskαβ = 0 ⇐⇒ Yα⊥⊥Yβ |YV \α,β.

A graphical Gaussian model is represented by an undirected graphG = (V ,E ) with Y as above and K ∈ S+(G), the set of(symmetric) positive definite matrices with

α 6∼ β ⇒ kαβ = 0.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 3: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Graphical Gaussian modelsAdding meansBehrens–Fisher problemKruskal’s theorem

We shall be interested in also adding means so that Y ∼ N (µ,Σ)with µ ∈ Ω, where Ω is a linear subspace of RV .

Based on observations Y 1, . . . ,Y n the likelihood function is

L(µ,K ) ∝ det Kn/2 exp−∑

1≤i≤n(y i−µ)TK(y i−µ)/2 . (2)

If µ is unrestricted so that µ ∈ Ω = RV , L is maximised over µ forfixed K by µ = µ∗ = y and inference about K can be based on

L(µ,K ; y) ∝ det Kn/2 exp− tr(KW )/2, (3)

where W =∑n

i=1(y i − µ∗)(y i − µ∗)T is the matrix of sums ofsquares and products of the residuals.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 4: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Graphical Gaussian modelsAdding meansBehrens–Fisher problemKruskal’s theorem

In general the situation is more complex. Consider the graph

Y1 Y2

representing two independent Gaussian variables with unknownvariances σ2

1 and σ22. The Behrens–Fisher problem (Scheffe, 1944)

occurs when estimating µ = (µ1, µ2) under the restriction µ1 = µ2.

The least squares estimator (LSE) µ∗ = (y1, y2) is then not theMLE, the likelihood function (2) under the hypothesis µ1 = µ2

may have multiple modes (Drton, 2008), and there there is nosimilar test for the hypothesis.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 5: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Graphical Gaussian modelsAdding meansBehrens–Fisher problemKruskal’s theorem

Kruskal (1968) found the following necessary and sufficientcondition for the LSE µ∗ and MLE µ to agree for a fixed Σ:

Theorem (Kruskal)

Let Y ∼ N (µ,Σ) with unknown mean µ ∈ Ω and known Σ. Thenthe estimators µ∗ and µ coincide if and only if Ω is invariant underK = Σ−1, i.e. if and only if

K Ω ⊆ Ω. (4)

As K Ω ⊆ Ω if and only if ΣΩ ⊆ Ω this can equivalently beexpressed in terms of Σ.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 6: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Graphical Gaussian modelsAdding meansBehrens–Fisher problemKruskal’s theorem

Consequently, if K ∈ Θ is unknown and K Ω ⊆ Ω for all K ∈ Θ wealso have µ∗ = µ and inference on K can be based on the profilelikelihood function (3)

L(µ,K ) ∝ det Kn/2 exp− tr(KW )/2.

The Behrens–Fisher problem is then resolved if we also restrict thevariances σ2

1 = σ22 = σ2 since(

σ2 0

0 σ2

)(α

α

)=

(σ2α

σ2α

)=

β

),

so the mean space is stable under Σ.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 7: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Graphical Gaussian modelsAdding meansBehrens–Fisher problemKruskal’s theorem

The additional symmetry in the concentration matrix induced bythe restriction σ2

1 = σ22 is represented by a coloured graph

Y1 Y2

where nodes of same colour have identical elements in theirconcentration matrix.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 8: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

Three types of symmetry restrictions:

I RCON restricts concentration matrix;

I RCOR restricts partial correlations;

I RCOP has restrictions generated by permutation symmetry.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 9: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

Three types of symmetry restrictions:

I RCON restricts concentration matrix;

I RCOR restricts partial correlations;

I RCOP has restrictions generated by permutation symmetry.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 10: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

Three types of symmetry restrictions:

I RCON restricts concentration matrix;

I RCOR restricts partial correlations;

I RCOP has restrictions generated by permutation symmetry.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 11: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

Three types of symmetry restrictions:

I RCON restricts concentration matrix;

I RCOR restricts partial correlations;

I RCOP has restrictions generated by permutation symmetry.

In principle one could/should also study RCOV-models given bysymmetry restrictions in the covariance matrix. These are ingeneral different from any of the above but have as far as I knownot been investigated at this point in time.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 12: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

Graph colouring

Undirected graph G = (V ,E ).

Colouring vertices of G with different colours induces partitioningof V into vertex colour classes.

Colouring edges E partitions E into disjoint edge colour classes

V = V1 ∪ · · · ∪ Vp, E = E1 ∪ · · · ∪ Eq.

V = V1, . . . ,Vp is a vertex colouring,

E = E1, . . . ,Eq is an edge colouring,

G = (V, E) is a coloured graph.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 13: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

I Models with symmetry in covariance are classical and admitunified theory (Wilks, 1946; Votaw, 1948; Olkin and Press,1969; Andersson, 1975; Andersson et al., 1983);

I Stationary autoregressions (circular) (Anderson, 1942; Leipnik,1947);

I Spatial Markov models (Whittle, 1954; Besag, 1974; Besagand Moran, 1975);

I General combinations with conditional independence are morerecent (Hylleberg et al., 1993; Andersson and Madsen, 1998;Madsen, 2000).

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 14: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

I Models with symmetry in covariance are classical and admitunified theory (Wilks, 1946; Votaw, 1948; Olkin and Press,1969; Andersson, 1975; Andersson et al., 1983);

I Stationary autoregressions (circular) (Anderson, 1942; Leipnik,1947);

I Spatial Markov models (Whittle, 1954; Besag, 1974; Besagand Moran, 1975);

I General combinations with conditional independence are morerecent (Hylleberg et al., 1993; Andersson and Madsen, 1998;Madsen, 2000).

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 15: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

I Models with symmetry in covariance are classical and admitunified theory (Wilks, 1946; Votaw, 1948; Olkin and Press,1969; Andersson, 1975; Andersson et al., 1983);

I Stationary autoregressions (circular) (Anderson, 1942; Leipnik,1947);

I Spatial Markov models (Whittle, 1954; Besag, 1974; Besagand Moran, 1975);

I General combinations with conditional independence are morerecent (Hylleberg et al., 1993; Andersson and Madsen, 1998;Madsen, 2000).

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 16: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Basic definitionsSome classical examples

I Models with symmetry in covariance are classical and admitunified theory (Wilks, 1946; Votaw, 1948; Olkin and Press,1969; Andersson, 1975; Andersson et al., 1983);

I Stationary autoregressions (circular) (Anderson, 1942; Leipnik,1947);

I Spatial Markov models (Whittle, 1954; Besag, 1974; Besagand Moran, 1975);

I General combinations with conditional independence are morerecent (Hylleberg et al., 1993; Andersson and Madsen, 1998;Madsen, 2000).

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 17: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Mathematics marksModel specification

Empirical concentration matrix (inverse covariance) of examinationmarks of 88 students in 5 mathematical subjects.

Mechanics Vectors Algebra Analysis Statistics

Mechanics 5.24 −2.44 −2.74 0.01 −0.14

Vectors −2.44 10.43 −4.71 −0.79 −0.17

Algebra −2.74 −4.71 26.95 −7.05 −4.70

Analysis 0.01 −0.79 −7.05 9.88 −2.02

Statistics −0.14 −0.17 −4.70 −2.02 6.45

Data reported in Mardia et al. (1979)

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 18: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Mathematics marksModel specification

RCON model

Data support model with symmetry restrictions as in figure:

Mechanics

Vectors

Algebra

Analysis

Statistics

PPPPPP

PPPPPPss

css

Elements of concentration matrix corresponding to same coloursare identical.Black or white neutral and corresponding parameters vary freely.RCON model since restrictions apply to concentration matrix

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 19: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Mathematics marksModel specification

RCON model

1. Diagonal elements K corresponding to vertices in the samevertex colour class must be identical.

2. Off–diagonal entries of K corresponding to edges in the sameedge colour class must be identical.

Diagonal of K thus specified by Tp-dimensional vector η andoff-diagonal elements by a q dimensional vector δ so K = K (η, δ).The set of positive definite matrices which satisfy these restrictionsis denoted S+(V, E).

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 20: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Mathematics marksModel specification

u

u u

uY4

Y1

Y3

Y2

Corresponding RCON model will have concentration matrix

K =

k11 k12 0 k14

k21 k22 k23 0

0 k32 k33 k34

k41 0 k43 k44

=

η1 δ1 0 δ2

δ1 η2 δ1 0

0 δ1 η1 δ2

δ2 0 δ2 η2

.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 21: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Anxiety and angerModel specificationLikelihood equations

Cox and Wermuth (1993) report data on personality characteristicson 684 students:Table below shows empirical concentrations (×100) (on and abovediagonal), partial correlations (below diagonal), and standarddeviations for personality characteristics of 684 students.

SX SN TX TN

SX (State anxiety) 0.58 −0.30 −0.23 0.02

SN (State anger) 0.45 0.79 −0.02 −0.15

TX (Trait anxiety) 0.47 0.03 0.41 −0.11

TN (Trait anger) −0.04 0.33 0.32 0.27

Standard deviations 6.10 6.70 5.68 6.57

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 22: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Anxiety and angerModel specificationLikelihood equations

RCOR model

Data strongly support conditional independence model displayedbelow with partial correlations strikingly similar in pairs:

e

e e

eTX

SX

TN

SN

Scales for individual variables may not be compatible. Partialcorrelations invariant under changes of scale, and more meaningful.Such symmetry models are denoted RCOR models.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 23: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Anxiety and angerModel specificationLikelihood equations

RCOR models

1. Diagonal elements of K corresponding to vertices in samevertex colour class must be identical.

2. partial correlations along edges in the same edge colour classmust be identical.

The set of positive definite matrices which satisfy the restrictionsof an RCOR(V, E) model is denoted R+(V, E).

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 24: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Anxiety and angerModel specificationLikelihood equations

Define A as diagonal matrix with

aα =√

kαα = ηu, α ∈ u ∈ V

We can uniquely represent K ∈ R+(V, E) as

K = ACA = A(η)C (δ)A(η),

where C has all diagonal entries equal to one and off-diagonalentries are negative partial correlations

cαβ = −ραβ |V \α,β = kαβ/√

kααkββ = kαβ/(aαaβ).

Vertex colour classes restrict A, whereas edge colour classesrestrict C .

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 25: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Anxiety and angerModel specificationLikelihood equations

Although restrictions linear in each of A and C , they are in generalnot linear in K .

For unrestricted mean, or mean zero RCOR models are curvedexponential families.

Letting λu = log ηu the profile likelihood function becomes

log L =f

2log detC (δ)+ f

∑u∈V

λu tr(Ku)− 1

2trC (δ)A(λ)WA(λ)

log L concave in λ for fixed δ and vice versa, but not in generaljointly.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 26: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Frets’ headsModel specificationIdentifying the graph colouring

RCOP model

Data from Frets (1921). Length and breadth of the heads of 25pairs of first and second sons. Data support the model

u

u u

uB1

L1

B2

L2

Assume distribution unchanged if sons are switched. RCOP modelas determined by permutation of labels.

Both RCON and RCOR because all aspects of the jointdistribution are unaltered when labels are switched.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 27: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Frets’ headsModel specificationIdentifying the graph colouring

Let G be permutation matrix for elements of V . If Y ∼ N|V |(0,Σ)

then GY ∼ N|V |(0,G ΣG>).

Let Γ ⊆ S(V ) be a subgroup of such permutations.

Distribution of Y invariant under the action of Γ if and only if

G ΣG> = Σ for all G ∈ Γ. (5)

Since G satisfies G−1 = G>, (5) is equivalent to

G Σ = ΣG for all G ∈ Γ, (6)

i.e. that G commutes with Σ or, equivalently, that G commuteswith K :

GK = KG for all G ∈ Γ.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 28: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Frets’ headsModel specificationIdentifying the graph colouring

We must insist that zero elements of K are preserved, i.e. G isautomorphism of the graph, mapping edges to edges:

G (α) ∼ G (β) ⇐⇒ α ∼ β for all G ∈ Γ,

An RCOP model RCOP(G, Γ) generated by Γ ⊆ Aut(G) is given byassuming

K ∈ S+(G, Γ) = S+(G) ∩ S+(Γ)

where S+(Γ) is the set of positive definite matrices satisfying

GK = KG for all G ∈ Γ.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 29: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Frets’ headsModel specificationIdentifying the graph colouring

An RCOP model can also be represented by a graph colouring:

If V denotes the vertex orbits of Γ, i.e. the equivalence classes of

α ≡Γ β ⇐⇒ β = G (α) for some G ∈ Γ,

and similarly E the edge orbits, i.e. the equivalence classes of

α, γ ≡Γ β, δ ⇐⇒ β, δ = G (α),G (γ) for some G ∈ Γ,

then we have

S+(G, Γ) = S+(V, E) = R+(V, E).

Hence an RCOP model can also be represented as an RCON or anRCOR model with vertex orbits as vertex colour classes and edgeorbits as edge colour classes.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 30: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

A technical lemmaMean partitions

For each vertex colour class v ∈ V let T v be the |V | × |V |diagonal matrix with entries T v

αα = 1 if α ∈ u and 0 otherwise,i.e.T v is the indicator for v .

Similarly, for each edge colour class e ∈ E let T e have entriesT eαβ = 1 if α, β ∈ e and 0 otherwise, i.e. T e is the adjacency

matrix for e.

Now any K ∈ S+(V, E) can in a unique way be written as

K =∑

u∈V∪EθuT u

and any K ∈ R+(V, E) as ACA with

A =∑u∈V

ηuT u, C = I +∑u∈E

δuT u.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 31: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

A technical lemmaMean partitions

Gehrmann and Lauritzen (2012) now show that for a given coloredgraph G = (V, E) we have:

LemmaThe following are equivalent

K Ω ⊆ Ω for all K ∈ S+(V, E);

K Ω ⊆ Ω for all K ∈ R+(V, E);

T uΩ ⊆ Ω for all u ∈ V ∪ E .

Thus, by Kruskal’s theorem, we can check stability of mean spacesin both RCON and RCON models by checking stability under theaction of the model generators T u, u ∈ V ∪ E .

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 32: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

A technical lemmaMean partitions

We shall be particularly interested in mean spaces generated by apartition M = m of the vertex set V , so that for

Ω = Ω(M) = µ : µα = µβ whenever α, β ∈ m..

It is straightforward to show (Gehrmann and Lauritzen, 2012) that

Proposition (Vertex stability)

The space Ω(M) is stable under T v , v ∈ V if and only if thepartition M is finer than V.

The Behrens–Fisher problem represents a case where this conditionis violated unless variances are assumed identical.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 33: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

A technical lemmaMean partitions

To discuss stability under edge colour classes we require the notionof an equitable partition.

A partition M of V is equitable w.r.t. a graph G = (V ,E ) if forany α, β ∈ n ∈M it holds that

| neE (α) ∩m| = | neE (β) ∩m| for all m ∈M.

In words, any two vertices in the same partition set have the samenumber of neighbours in any other partition set. So in particular,all subgraphs induced by partition sets are regular graphs.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 34: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

A technical lemmaMean partitions

Vertex regular graphs

We say that a coloured graph G = (V, E) is vertex regular if V isan equitable partition of the subgraph G e = (V , e) induced by theedge colour class e for all e ∈ E .

It now follows easily and also from a result of Chan and Godsil(1997) that

Proposition (Edge stability)

The space Ω(M) is stable under T e , e ∈ V if and only if thecoloured graph (M, E) is vertex regular.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 35: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

A technical lemmaMean partitions

Combining the propositions with the lemma and Kruskal’s theoremwe find that for both RCON and RCOP models we have

TheoremThe LSE and MLE for µ under the assumption that µ ∈ Ω(M) areidentical if and only if both of the following hold:

(i) M is finer than V;

(ii) The coloured graph (M, E) is vertex regular.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 36: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Examples

If both the mean symmetry and concentration symmetry isdetermined by group invariance, we have M = V. Gehrmann andLauritzen (2012) show that if G = (V, E) represents and RCOPmodel, G is necessarily vertex regular.

Hence for RCOP models with natural mean restrictions we haveµ∗ = µ.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 37: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Examples

Frets’ heads revisited

u

u u

uB1

L1

B2

L2

For the mean partition to be finer than the concentration partitionwe can either have different mean lengths, or different meanbreadths, or both, or none of these.

For the mean partition to be vertex regular we need to have eitherboth means identical or all means different. Thus there are twobenign possibilities.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 38: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Examples

Anxiety and anger revisited

e

e e

eTX

SX

TN

SN

Here there are no benign mean hypotheses as the individualconcentrations are all different.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 39: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Anderson, R. L. (1942). Distribution of the serial correlationcoefficient. The Annals of Mathematical Statistics, 13:1–13.

Andersson, S. A. (1975). Invariant normal models. The Annals ofStatistics, 3:132–154.

Andersson, S. A., Brøns, H., and Jensen, S. T. (1983).Distribution of eigenvalues in multivariate statistical analysis.The Annals of Statistics, 11:392–415.

Andersson, S. A. and Madsen, J. (1998). Symmetry and latticeconditional independence in a multivariate normal distribution.The Annals of Statistics, 26:525–572.

Besag, J. (1974). Spatial interaction and the statistical analysis oflattice systems (with discussion). Journal of the Royal StatisticalSociety, Series B, 36:302–309.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 40: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Besag, J. E. and Moran, P. A. P. (1975). On the estimation andtesting of spatial interaction in Gaussian lattice processes.Biometrika, 62:555–562.

Chan, A. and Godsil, C. (1997). Symmetry and eigenvectors. InHahn, G. and Sabidussi, G., editors, Graph Symmetry. AlgebraicMethods and Applications, volume 497 of NATO ASI Series C:Mathematical and Physical Sciences, pages 75–106. KluwerAcademic Publishers.

Cox, D. R. and Wermuth, N. (1993). Linear dependenciesrepresented by chain graphs (with discussion). StatisticalScience, 8:204–218; 247–277.

Drton, M. (2008). Multiple solutions to the likelihood equations inthe Behrens–Fisher problem. Statist. Probab. Lett.,78:3288–3293.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 41: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Frets, G. P. (1921). Heredity of head form in man. Genetica,3:193–400.

Gehrmann, H. and Lauritzen, S. (2012). Estimation of means ingraphical Gaussian models with symmetries. The Annals ofStatistics, 40:To appear.

Hylleberg, B., Jensen, M., and Ørnbøl, E. (1993). Graphicalsymmetry models. Master’s thesis, Aalborg University, Aalborg,Denmark.

Kruskal, W. (1968). When are Gauss–Markov and least squaresestimators identical? A coordinate-free approach. Ann. Math.Statist., 39:70–75.

Leipnik, R. B. (1947). Distribution of the serial correlationcoefficent in a circularly correlated universe. The Annals ofMathematical Statistics, 18:80–87.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 42: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Madsen, J. (2000). Invariant normal models with recursivegraphical Markov structure. The Annals of Statistics,28(4):1150–1178.

Mardia, K. V., Kent, J. T., and Bibby, J. M. (1979). MultivariateAnalysis. Academic Press.

Olkin, I. and Press, S. J. (1969). Testing and estimation for acircular stationary model. The Annals of MathematicalStatistics, 40:1358–1373.

Scheffe, H. (1944). A note on the Behrens–Fisher problem. Ann.Math. Statist., 15:430–432.

Votaw, D. F. (1948). Testing compound symmetry in a normalmultivariate distribution. The Annals of Mathematical Statistics,19:447–473.

Whittle, P. (1954). On stationary processes in the plane.Biometrika, 41:439–449.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries

Page 43: Estimation of Means in Graphical Gaussian Models with Symmetries · 2012-04-16 · Kruskal’s theorem Consider Y = (Y ) j2V ˘N V (0;) and let let K = 1 be the concentration matrix.

IntroductionGraphical models with symmetry

Symmetry restrictions on concentrationsSymmetry restrictions on partial correlations

Permutation symmetryStability of mean spaces

Group generated partitionsReferences

Wilks, S. S. (1946). Sample criteria for testing equality of means,equality of variances, and equality of covariances in a normalmultivariate distribution. The Annals of Mathematical Statistics,17:257–281.

Steffen Lauritzen University of Oxford Estimation of Means in Graphical Gaussian Models with Symmetries