Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6...

14
Univariate Split-Plot Analysis 2003 LPGA Data

Transcript of Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6...

Page 1: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Univariate Split-Plot Analysis

2003 LPGA Data

Page 2: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Background Information

• 6 Golfers (Treated as only 6 of interest Fixed)

• 8 Tournaments (Treated as random sample of all possible tournaments)

• 4 Rounds per tournament (fixed factor)

Daniel Kung Ochoa PakPark Webb

Page 3: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Data Description and Model

• Tournaments act as blocks. They are each associated with a particular golf course, region and weather pattern (they may differ significantly in terms of difficulty)

• Tournaments are made up of Rounds (these tournaments are all 4 rounds). It is impossible to break up rounds within blocks, thus they are the whole plot factor (in an experiment, the treatments would be randomly assigned to whole plots)

• Golfers all play rounds on the same day (all play round 1, then 2, etc), thus they are the subplot factor (in an experiment, their positions would be assigned at random within whole plots)

Page 4: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Data Description and Model

• Let factor A be whole plot factor (round) with a=4 levels and subscript i be associated with it

• Let factor B be block factor (tournament) with b=8 levels and subscript j be associated with it

• Let factor C be subplot factor (golfer) with c=6 levels and subscript k be associated with it

• Interaction between round and tournament allows for climate effects to vary across courses (WP error term)

• Interaction between golfer and round allows golfer skill to vary across rounds (e.g. pressure effects)

• Model assumes no tournament by golfer interaction (can be tested) or 3-way interaction (SP error term)

Page 5: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Data Description and Model

2ijk

1ik

1ik

1

2

2

1

,0~ error term random

0)()(n interactio /golfer round )(

0golfer ofeffect

,0~)b(n interactio /tourney round )(

,0~t tournamenofeffect

0 round ofeffect

)()(

N

ki

k

Najiab

Nbjb

i

abbY

ijk

c

k

a

iik

c

kkk

abijij

bjj

a

iii

ijkikkijjiijk

Page 6: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Observed Means

Daniel 70.4375 Tourney1 68.2917 Round1 70.9792Kung 72.0938 Tourney2 70.2083 Round2 70.5417Park 69.5398 Tourney3 70.9583 Round3 69.875Webb 70.5000 Tourney4 70.9583 Round4 70.9167Ochoa 71.3438 Tourney5 69.9167Pak 69.5000 Tourney6 69.8750

Tourney7 70.9583Tourney8 73.4583 Overall 70.5781

Means of Golfer/Courses and Golfer/Rounds and Courses/Rounds are on separate EXCEL spread sheet

Page 7: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Analysis of Variance

Source of Variation df SS MS F P-valueRound (WPT) 3 37.0156 12.3385 1.3041 0.2994Tournament (Block) 7 360.6198 51.5171 5.4449RxT (Error1) 21 198.6927 9.4616Golfer (SPT) 5 161.2969 32.2594 4.8690 0.0004RxG 15 83.6406 5.5760 0.8416 0.6302TxG + RxTxG (Error2) 140 927.5625 6.6254Total 191 1768.8281

There are significant differences among golfers, none among rounds, nor a golfer by round interaction

Page 8: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Post-hoc Comparisons Among Golfers

)8(4

2140)),15(2/(05.'....

ERRORkk

MStYY

Golfer Comparisons Golfer i Mean Golfer j Mean Mean i-j SE(Mean i-j) t(.05/(2*15) t*SE CI Lower CI UpperDaniel vs Kung (1v2) 70.4375 72.0938 -1.6563 0.6435 3.2062 2.0632 -3.7195 0.4070Daniel vs Park (1v3) 70.4375 69.5938 0.8438 0.6435 3.2062 2.0632 -1.2195 2.9070Daniel v Webb (1v4) 70.4375 70.5000 -0.0625 0.6435 3.2062 2.0632 -2.1257 2.0007Daniel v Ochoa (1v5) 70.4375 71.3438 -0.9063 0.6435 3.2062 2.0632 -2.9695 1.1570

Daniel v Pak (1v6) 70.4375 69.5000 0.9375 0.6435 3.2062 2.0632 -1.1257 3.0007Kung v Park (2v3) 72.0938 69.5938 2.5000 0.6435 3.2062 2.0632 0.4368 4.5632 ***

Kung v Webb (2v4) 72.0938 70.5000 1.5938 0.6435 3.2062 2.0632 -0.4695 3.6570Kung v Ochoa (2v5) 72.0938 71.3438 0.7500 0.6435 3.2062 2.0632 -1.3132 2.8132

Kung v Pak (2v6) 72.0938 69.5000 2.5938 0.6435 3.2062 2.0632 0.5305 4.6570 ***Park v Webb (3v4) 69.5938 70.5000 -0.9063 0.6435 3.2062 2.0632 -2.9695 1.1570Park v Ochoa (3v5) 69.5938 71.3438 -1.7500 0.6435 3.2062 2.0632 -3.8132 0.3132

Park v Pak (3v6) 69.5938 69.5000 0.0938 0.6435 3.2062 2.0632 -1.9695 2.1570Webb v Ochoa (4v5) 70.5000 71.3438 -0.8438 0.6435 3.2062 2.0632 -2.9070 1.2195

Webb v Pak (4v6) 70.5000 69.5000 1.0000 0.6435 3.2062 2.0632 -1.0632 3.0632Ochoa v Pak (5v6) 71.3438 69.5000 1.8438 0.6435 3.2062 2.0632 -0.2195 3.9070

Page 9: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Post-Hoc Comparisons

Daniel (70.44) Kung (72.09)Ochoa (71.34)Pak (69.50) Park (69.59) Webb (70.50)

Page 10: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Another Possibility - Mixed Model

• In reality, there are hundreds of golfers that are “certified” members of LPGA

• Re-analyze the data as a mixed model (rounds are still fixed)

• ANOVA hasn’t changed, but error terms have.

• The golfer effects are now random variables that we assume to be normal with variance c

2

Page 11: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Expected Mean Squares (Fixed WP/Random SP)

2 2 2

2

2 2 2 1

2 2

2 2

Unrestricted Model wrt GolferxCourse Interaction

1

11

( 1)( 1)

1

BLOCKS b ab BLOCKS

a

ii

WP ab ac WP

BLK WP ab BLK WP

SP c SP

WP SP

E MS ac c df b

E MS c b bc df aa

E MS c df a b

E MS ab df c

E MS b

2 2

2

( 1)( 1)

( 1)( 1)

ac WP SP

ERROR ERROR

df a c

E MS df a b c

Page 12: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Testing for WP (Round) Fixed Effects

301.37.2

68.3534.6

20.226

)16)(14()58.5(

)18)(14()46.9(

)58.546.9(

)1)(1()1)(1(

72.657.53

86.359

)16)(18(4)63.6(

14)64.12(

)63.634.12(

)1)(1(1

:Freedom of Degrees eApproximat

26.104.15

97.18

58.546.9

63.634.12 :StatisticTest

1 :Note

0 allNot :0:

35,6,05.68.35,34.6,05.

22

2

22

2

2

22

2

22

2

1

2

410

PFF

ca

MS

ba

MS

MSMS

cbaMS

a

MS

MSMS

MSMS

MSMSF

abcMSEMSEMSEMSE

HH

SPWPWPBLK

SPWPWPBLK

ERRWP

ERRWP

SPWPWPBLK

ERRWPWP

iSPWPWPBLKERRWP

iA

Page 13: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

Testing for SP Effects and WP/SP Interaction

6302.

14015 :Freedom of Degrees

84.063.6

58.5 :StatisticTest

0:0: :nInteractio SPWP

0004.

1405 :Freedom of Degrees

87.463.6

26.32 :StatisticTest

0:0: :EffectsSubplot

21

220

21

220

P

MS

MSF

HH

P

MS

MSF

HH

ERR

SPWPSPWP

acAac

ERR

SPSP

cAc

Page 14: Univariate Split-Plot Analysis 2003 LPGA Data. Background Information 6 Golfers (Treated as only 6 of interest  Fixed) 8 Tournaments (Treated as random.

SAS Program (Fixed Effects Model)

options nodate nonumber ps=54 ls=76;

data one;infile ‘C:\lpgasplt.dat';input golfer $ 1-24 tourney round score;run;

proc glm;class golfer tourney round;model score = round tourney round*tourney golfer golfer*round;test h=round e=round*tourney;means golfer / bon;run;

proc mixed;class golfer tourney round;model score = round golfer golfer*round;random tourney round*tourney;run;

quit;