Casement Schedule - Keough-Naughton Institute for Irish Studies
Quinn & Keough Stats
-
Upload
ajaxtelamonio -
Category
Documents
-
view
247 -
download
0
Transcript of Quinn & Keough Stats
-
7/28/2019 Quinn & Keough Stats
1/12
EXPERIMENTALDE SI GN AN D DATA
AN AL Y SI S FORBIOLOGISTS
G. P. QUINNM onash U niversi ty
MICHAEL J. K EOUGHU niversi ty Of M el bourne
April 2001
TO BE PUBLI SHED BY CAMBRI DGEUN I VERSI TY PRESS, 2001
This work is G. Quinn & M. Keough 2001 and Cambridge University Press. Please donot distribute printed or electronic copies without the authors permission.
-
7/28/2019 Quinn & Keough Stats
2/12
i
PREFACE
Statistical analysis is at the core of most modern biology, and many biological hypotheses,even deceptively simple ones, are matched by complex statistical models. Prior to thedevelopment of modern desktop computers, determining whether the data fit these complexmodels was the province of professional statisticians. Many biologists instead opted for simplermodels whose structure had been simplified quite arbitrarily. Now, with immensely powerfulstatistical software available to most of us, these complex models can be fitted, creating a new setof demands & problems for biologists.
We need to:
Know the pitfalls and assumptions of particular statistical models
Be able to identify the type of model appropriate for the sampling design and kind of data that we plan to collect
Be able to interpret the output of analyses using these models
Be able to design experiments and sampling programs optimally, i.e. with the bestpossible use of our limited time and resources.
The analysis may be done by professional statisticians, rather than statistically trainedbiologists, especially in large research groups or multidisciplinary teams. In these situations, weneed to be able to speak a common language
Frame our questions in such a way as to get a sensible answer
Be aware of biological considerations that may cause statistical problems; we can notexpect a statistician to be aware of the biological idiosyncrasies or our particular study,but if he or she lacks that information, we may get misleading or incorrect advice
Understand the advice or analyses that we receive, and be able to translate that back intobiology
This book aims to place biologists in a better position to do these things. It arose from ourinvolvment in designing and analyzing our own data, but also providing advice to students andcolleagues, and teaching classes in design and analysis. As part of these activities, we becameaware, first of our limitations, prompting us to read more widely in the primary statisticalliterature, and second, and more importantly, of the complexity of the statistical models
underlying much biological research. In particular, we continually encountered experimentaldesigns that were not described comprehensively in many of our favourite texts. This bookdescribes many of the common designs used in biological research, and we present the statisticalmodels underlying those designs, with enough information to highlight their benefits and pitfalls.
Our emphasis here is on dealing with biological data how to design sampling programs thatrepresent the best use of our resources, how to avoid mistakes that make analyzing our datadifficult, and how to analyse the data when they are collected. We emphasise the problemsassociated with real world biological situations.
-
7/28/2019 Quinn & Keough Stats
3/12
ii
IN TH IS BOOK
Our approach is to encourage readers to understand the models underlying the mostcommon experimental designs. We describe the models that are appropriate for various kinds of biological data continuous and categorical response variables, continuous and categoricalpredictor or independent variables. Our emphasis is on general linear models, and we begin with
the simplest situations - single, continuous variables - describing those models in detail. We usethese models as building blocks to understanding a wide range of other kinds of data all of thecommon statistical analyses, rather then being distinctly different kinds of analyses, are variationson a common theme of statistical modeling constructing a model for the data and thendetermining whether observed data fit this particular model. Our aim is to show how a broadunderstanding of the models allow us to deal with a wide range of more complex situations.
We have illustrated this approach of fitting models primarily with parametric statistics. Mostbiological data is still analyzed with linear models that assume underlying normal distributions.However, we introduce readers to a range of more general approaches, and stress that, once youunderstand the general modeling approach for normally-distributed data, you can use thatinformation to begin modeling data with nonlinear relationships, variables that follow other
statistical distributions, etc.LEARNING BY EXAMPLE
One of our strongest beliefs is that we understand statistical principles much better when wesee how they are applied to situations in our own discipline. Examples let us make the linkbetween statistical models and formal statistical terms (blocks, plots, etc.) or papers written inother disciplines, and the biological situations that we are dealing with. For example, how is ouranalysis and interpretation of an experiment repeated on several times helped by reading aliterature about blocks of agricultural land? How does literature developed for psychologicalresearch let us deal with measuring changes in physiological responses of plants?
Throughout this book, we illustrate all of the statistical techniques with examples from thecurrent biological literature. We describe why (we think) the authors chose to do an experimentin a particular way, and how to analyse the data, including assessing assumptions and interpretingstatistical output. These examples appear as boxes through each chapter, and we are delightedthat authors of most of these studies have made their raw data available to us. We provide thoseraw data files on a web site, allowing readers to run these analyses using their particular softwarepackage.
The other value of published examples is that we can see how particular analyses can bedescribed and reported. When fitting complex statistical models, it is easy to allow the biology tobe submerged by a mass of statistical output. We hope that the examples, together with our ownthoughts on this subject, presented in the final chapter, will help prevent this happening.
TH IS BOOK IS A BRIDGE
It is not possible to produce a book that introduces a reader to biological statistics and takesthem far enough to understand complex models, at least while having a book that is smallenough to transport. We therefore assume that readers are familiar with basic statistical concepts,such as would result from a one or two semester introductory course, or have read one of theexcellent basic texts (e.g. Sokal & Rohlf 1995). We take the reader from theses texts into morecomplex areas, explaining the principles, assumptions, and pitfalls, and encourage a reader to readthe excellent detailed treatments (e.g, for analysis of variance, Winer et al. 1991 or Underwood1997).
Biological data are often messy, and many readers will find that their research questions
require more complex models than we describe here. Ways of dealing with messy data or
-
7/28/2019 Quinn & Keough Stats
4/12
iii
solutions to complex problems are often provided in the primary statistical literature. We try topoint the way to key pieces of that statistical literature, providing the reader with the basic toolsto be able to deal with that literature, or to be able to seek professional (statistical) help whenthings become too complex.
We must always remember that for biologists, statistics is a tool that we use to illuminate andclarify biological problems. Our aim is to be able to use these tools efficiently, without losingsight of the biology that is the motivation for most of us entering this field.
SOME ACKNOWLE DGMENT S
Our biggest debt is to the range of colleagues who have read, commented upon, andcorrected various versions of these chapters. Many of these colleagues have their own researchgroups, who they enlisted in this exercise. These altruistic and diligent souls include(alphabetically), Andrew Constable, Barb Downes, Peter Fairweather, Ivor Grownf, JacquiBrooks, Ralph Mac Nally, Richard Marchant, Pete Raimondi, Wayne Robinson, SuvaluckSatumanatpan, Sabine Schreiber. Perhaps the most innocent victims were the graduate studentswho have been part of our research groups over the period we produced this book. We greatly
appreciate their willingness to trade the chance of some illumination for reading and highlightingour obfuscations.
We also wish to thank the various researchers whose data we used as examples throughout.Most of them willingly gave of their raw data, trusting that we would neither criticize or findflaws in their published work (we didnt!), or were public-spirited enough to have published theirraw data.
-
7/28/2019 Quinn & Keough Stats
5/12
Table of Contents
1. Introduction .................................................................................11.1. Scientific method .............................................................................................1
1.1.1. Pattern description........................................................... ................................ 21.1.2. Models.......................................................... ................................................... 21.1.3. Hypotheses and tests ....................................................... ................................ 31.1.4. Alternatives to falsification ....................................................... ...................... 41.1.5. Role of statistical analysis ......................................................... ...................... 5
1.2. Experiments and other tests ............................................................................51.3. Data, observations & variables.......................................................................7 1.4. Probability.......................................................................................................7 1.5. Probability distributions .................................................................................9
1.5.1. Distributions for variables ......................................................... .................... 101.5.2. Distributions for statistics......... ........................................................... .......... 12
2. Estimation ..................................................................................152.1. Samples and populations...............................................................................152.2. Common Parameters & Statistics .................................................................16
2.2.1. Centre (location) of distribution .......................................................... .......... 162.2.2. Spread or variability ........................................................ .............................. 17
2.3. Standard errors and confidence intervals for the mean................................17 2.3.1. Normal distributions and the Central Limit Theorem ................................... 172.3.2. Standard error of the sample mean...................................................... .......... 182.3.3. Confidence intervals for population mean..................................................... 192.3.4. Interpretation of confidence intervals for population mean........................... 202.3.5. Standard errors for other statistics ....................................................... .......... 20
2.4. Methods for estimating parameters...............................................................212.4.1. Maximum likelihood ....................................................... .............................. 212.4.2. Ordinary least squares (OLS) .................................................... .................... 222.4.3. ML vs OLS estimation. ................................................... .............................. 22
2.5. Resampling Methods for Estimation .............................................................232.5.1. Bootstrap ...................................................... ................................................. 232.5.2. Jackknife................................................................. ....................................... 24
2.6. Bayesian inference - estimation ....................................................................242.6.1. Bayesian estimation...................................... ................................................. 252.6.2. Prior knowledge and probability ......................................................... .......... 252.6.3. Likelihood function ......................................................... .............................. 262.6.4. Posterior probability ........................................................ .............................. 262.6.5. Examples ...................................................... ................................................. 272.6.6. Other comments ..................................................... ....................................... 27
3. Hypothesis testing......................................................................373.1. Statistical hypothesis testing .........................................................................37
3.1.1. Classical statistical hypothesis testing........................................................... 373.1.2. Associated probability and Type I error ........................................................ 393.1.3. Hypothesis tests for a single population ........................................................ 403.1.4. One and two tailed tests.......................................... ....................................... 413.1.5. Hypotheses for two populations .......................................................... .......... 423.1.6. Parametric tests and their assumptions .......................................................... 44
3.2. Decision errors..............................................................................................453.2.1. Type I and II errors....................................... ................................................. 453.2.2. Asymmetry and Scalable Decision Criteria............................... .................... 46
3.3. Other testing methods....................................................................................47 3.3.1. Robust parametric tests ................................................... .............................. 473.3.2. Randomization (permutation) tests......................... ....................................... 47
http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk02.pdf/http://qk01.pdf/ -
7/28/2019 Quinn & Keough Stats
6/12
3.3.3. Rank-based nonparametric tests .......................................................... .......... 483.4. Multiple testing..............................................................................................50
3.4.1. The problem ........................................................... ....................................... 503.4.2. Adjusting significance levels and/or P values ............................................... 51
3.5. Combining results from statistical tests ........................................................523.5.1. Combining P values......................................................... .............................. 52
3.5.2. Meta-analysis ......................................................... ....................................... 533.6. Critique of statistical hypothesis testing .......................................................53
3.6.1. Dependence on sample size and stopping rules............................................. 543.6.2. Sample space relevance of data not observed............................................. 543.6.3. P values as measure of evidence ......................................................... .......... 543.6.4. Null hypothesis always false ..................................................... .................... 543.6.5. Arbitrary significance levels...................................................... .................... 553.6.6. Alternatives to statistical hypothesis testing.................... .............................. 55
3.7. Bayesian hypothesis testing...........................................................................56
4. Graphical exploration of data..................................................664.1. Exploratory data analysis .......................................................... .................... 66
4.1.1. Exploring samples ........................................................... .............................. 664.2. Analysis with graphs .....................................................................................694.2.1. Assumptions of parametric linear models ..................................................... 69
4.3. Transforming Data........................................................................................704.3.1. Transformations and distributional assumptions ........................................... 724.3.2. Transformations and linearity.................................................... .................... 734.3.3. Transformations and additivity............................................................ .......... 73
4.4. Standardizations............................................................................................734.5. Outliers..........................................................................................................744.6. Censored and missing data ...........................................................................74
4.6.1. Missing data ........................................................... ....................................... 744.6.2. Censored (truncated) data............................. ................................................. 75
4.7. General issues and hints for analysis............................................................77 4.7.1. General Issues ........................................................ ....................................... 774.7.2. Captions to Figures............................. ........................................................... 79
5. Correlation and regression.......................................................805.1. Correlation analysis......................................................................................80
5.1.1. Parametric correlation model .................................................... .................... 805.1.2. Robust correlation ........................................................... .............................. 825.1.3. Parametric and nonparametric confidence regions........................................ 82
5.2. Linear models................................................................................................835.3. Linear regression analysis ............................................................................84
5.3.1. Simple (bivariate) linear regression................................. .............................. 845.3.2. Linear model for regression....................................................... .................... 855.3.3. Estimating the model parameters ........................................................ .......... 865.3.4. Analysis of variance ........................................................ .............................. 885.3.5. Null hypotheses in regression................................. ....................................... 895.3.6. Comparing regression models ................................................... .................... 905.3.7. Variance explained .......................................................... .............................. 915.3.8. Assumptions of regression analysis................................. .............................. 925.3.9. Regression diagnostics .................................................... .............................. 945.3.10. Diagnostic graphics ......................................................... .............................. 965.3.11. Transformations ........................................................................ .................... 975.3.12. Regression through the origin ....................................................................... 985.3.13. Weighted least squares .................................................... .............................. 995.3.14. X random (Model II regression) .......................................................... .......... 995.3.15. Robust Regression........................................................... ............................ 102
5.4. Relationship between regression and correlation.......................................1035.5. Smoothing....................................................................................................104
http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk03.pdf/http://qk05.pdf/http://qk04.pdf/ -
7/28/2019 Quinn & Keough Stats
7/12
5.5.1. Running means....................................................... ..................................... 1055.5.2. LO(W)ESS ................................................... ............................................... 1055.5.3. Splines .......................................................... ............................................... 1055.5.4. Kernels ......................................................... ............................................... 1065.5.5. Other issues ............................................................ ..................................... 106
5.6. Power of tests in correlation and regression ..............................................107
5.7. General issues and hints for analysis..........................................................107
6. Multiple regression and correlation ......................................1246.1. Multiple linear regression analysis.............................................................124
6.1.1. Multiple linear regression model........ ......................................................... 1246.1.2. Estimating model parameters .................................................... .................. 1266.1.3. Analysis of variance ........................................................ ............................ 1266.1.4. Null hypotheses and model comparisons .................................................... 1276.1.5. Variance explained .......................................................... ............................ 1286.1.6. Which predictors are important? ......................................................... ........ 1286.1.7. Assumptions of multiple regression .................................................... ........ 1306.1.8. Regression diagnostics .................................................... ............................ 131
6.1.9. Diagnostic graphics ......................................................... ............................ 1316.1.10. Transformations ........................................................................ .................. 1326.1.11. Collinearity..................... ........................................................... .................. 1326.1.12. Interactions in multiple regression ............................................ .................. 1356.1.13. Polynomial regression ..................................................... ............................ 1376.1.14. Indicator (dummy) variables ....................................................................... 1386.1.15. Finding the best regression model ......................................... .................. 1406.1.16. Hierarchical partitioning...................................................................... ........ 1426.1.17. Other issues in multiple linear regression............................................ ........ 144
6.2. Regression trees ..........................................................................................1456.3. Path analysis and structural equation modelling........................................147 6.4. Nonlinear models ........................................................................................1496.5. Smoothing and response surfaces ...............................................................1506.6. General issues and hints for analysis..........................................................151
Figure legends ........................................................................................................174
7. Design and power analysis......................................................1757.1. Sampling......................................................................................................175
7.1.1. Sampling designs................................ ......................................................... 1757.1.2. Size of sample ........................................................ ..................................... 177
7.2. Experimental design....................................................................................177 7.2.1. Replication ................................................... ............................................... 1787.2.3. Controls ........................................................ ............................................... 1807.2.4. Randomization ....................................................... ..................................... 1817.2.5. Independence................................................ ............................................... 183
7.2.6. Reducing unexplained variance................................................. .................. 1837.3. Power analysis ............................................................................................183
7.3.1. Using power to plan experiments (a priori power analysis) ........................ 1847.3.2. Post-hoc power calculation.................................................................. ........ 1867.3.3. The effect size ........................................................ ..................................... 1877.3.4. Using the power analyses .......................................................... .................. 189
7.4. General issues and hints for analysis..........................................................190 List of Figures.........................................................................................................194
8. Comparing groups or treatments - analysis of variance .....1958.1. Single factor (one way) analysis of variance ..............................................195
8.1.1. Types of predictor variables (factors)..................... ..................................... 196
8.1.2. Linear model for single factor ANOVA.......................... ............................ 1978.1.3. Analysis of variance ........................................................ ............................ 1988.1.4. Null hypotheses ...................................................... ..................................... 199
http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk05.pdf/http://qk08.pdf/http://qk07.pdf/http://qk06.pdf/ -
7/28/2019 Quinn & Keough Stats
8/12
8.1.5. Comparing ANOVA models ..................................................... .................. 2008.1.6. Unequal sample sizes (unbalanced designs). ............................................... 200
8.2. Factor effects...............................................................................................2018.2.1. Random effects: variance components ........................................................ 2018.2.2. Fixed effects ........................................................... ..................................... 203
8.3. Assumptions.................................................................................................2048.3.1. Normality ..................................................... ............................................... 2048.3.2. Variance homogeneity............................................ ..................................... 2048.3.3. Independence................................................ ............................................... 205
8.4. ANOVA diagnostics.....................................................................................206 8.5. Robust ANOVA............................................................................................206
8.5.1. Tests with heterogeneous variances .................................................... ........ 2068.5.2. Rank-based (nonparametric) tests .................................................... ........ 2078.5.3. Randomization tests ........................................................ ............................ 207
8.6. Specific Comparisons Of Means .................................................................2088.6.1. Planned comparisons or contrasts ....................................................... ........ 2088.6.2. Unplanned pairwise comparisons........................................................ ........ 2108.6.3. Specific contrasts versus unplanned pairwise comparisons ........................ 213
8.7. Tests for trends............................................................................................2138.8. Testing equality of group variances............................................................2148.9. Power of single factor ANOVA ...................................................................2158.10. General issues and hints for analysis..........................................................216
9. Multifactor analysis of variance.............................................2389.1. Nested (hierarchical) designs......................................................................238
9.1.1. Linear models for nested analyses.......................... ..................................... 2399.1.2. Analysis of variance ........................................................ ............................ 2409.1.3. Null hypotheses ...................................................... ..................................... 2419.1.4. Unequal sample sizes (unbalanced designs). ............................................... 2429.1.5. Comparing ANOVA models ..................................................... .................. 2429.1.6. Factor effects in nested models ........................................................... ........ 2439.1.7. Assumptions for nested models.................... ............................................... 2449.1.8. Specific comparisons for nested designs ..................................................... 2449.1.9. More complex designs.................................. ............................................... 2449.1.10. Design and power............................................................ ............................ 245
9.2. Factorial designs.........................................................................................247 9.2.1. Linear models for factorial designs ..................................................... ........ 2499.2.2. Analysis of variance ........................................................ ............................ 2519.2.3. Null hypotheses ...................................................... ..................................... 2529.2.4. What are main effects and interactions really measuring? .......................... 2549.2.5. Comparing ANOVA models ..................................................... .................. 2579.2.6. Unbalanced designs ......................................................... ............................ 2589.2.7. Factor effects .......................................................... ..................................... 262
9.2.8. Assumptions ........................................................... ..................................... 2639.2.9. Robust factorial ANOVAs ........................................................ .................. 2649.2.10. Specific comparisons on main effects ......................................................... 2659.2.11. Interpreting interactions ...................................................................... ........ 2659.2.12. More complex designs......................................................................... ........ 2689.2.13. Power and design in factorial ANOVA.................. ..................................... 271
9.3. Pooling in multifactor designs ....................................................................2719.4. Relationship between factorial and nested designs.....................................2729.5. General issues and hints for analysis..........................................................272
9.5.1. General Issues ........................................................ ..................................... 2729.5.2. Hints for analysis...................... ........................................................... ........ 273
10. Randomized blocks and simple repeated measures:unreplicated two-factor designs .............................................314
10.1. Unreplicated two factor experimental designs............................................314
http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk08.pdf/http://qk10.pdf/http://qk09.pdf/ -
7/28/2019 Quinn & Keough Stats
9/12
10.1.1. Randomized complete block (RCB) designs ............................................... 31410.1.2. Repeated measures (RM) designs ............................................................... 315
10.2. Analyzing RCB and RM designs..................................................................316 10.2.1. Linear models for RCB and RM analyses ................................................... 31610.2.2. Analysis of variance ........................................................ ............................ 31810.2.3. Null hypotheses ...................................................... ..................................... 318
10.2.4. Comparing ANOVA models ..................................................... .................. 31910.3. Interactions in RCB and RM models...........................................................320
10.3.1. Importance of treatment by block interactions ............................................ 32010.3.2. Checks for interaction in unreplicated designs................ ............................ 321
10.4. Assumptions.................................................................................................32210.4.1. Normality, independence of errors ...................................................... ........ 32210.4.2. Variances and covariances - sphericity ............................................... ........ 32310.4.3. Recommended strategy ............................................................................... 325
10.5. Robust RCB and RM analyses.....................................................................32510.6. Specific comparisons...................................................................................326 10.7. Efficiency of blocking (to block or not to block?) .......................................326 10.8. Time as a blocking factor ............................................................................327
10.9. Analysis of unbalanced RCB designs ..........................................................327 10.10. Power of RCB or simple RM designs ..........................................................32910.11. More complex block designs .......................................................................329
10.11.1. Factorial randomized block designs............ ..................................... 32910.11.2. Incomplete block designs............................ ..................................... 33010.11.3. Latin square designs.......................... ............................................... 33010.11.4. Crossover designs .......................................................... .................. 333
10.12. Generalized randomized block designs.......................................................33410.13. RCB and RM designs and statistical software ............................................33410.14. General issues and hints for analysis..........................................................335
11. Split plot and repeated measures designs: Partly nested
anovas ...................................................................................36111.1. Partly nested designs...................................................................................361
11.1.1. Split-plot designs .................................................... ..................................... 36111.1.2. Repeated measures designs ....................................................... .................. 36311.1.3. Reasons for using these designs .......................................................... ........ 364
11.2. Analyzing partly nested designs ..................................................................36511.2.1. Linear models for partly nested analyses ............... ..................................... 36511.2.2. Analysis of variance ........................................................ ............................ 36711.2.3. Null hypotheses ...................................................... ..................................... 36711.2.4. Comparing ANOVA models ..................................................... .................. 369
11.3. Assumptions.................................................................................................36911.3.1. Between plots/subjects .................................................... ............................ 37011.3.2. Within plots/subjects and multisample sphericity ....................................... 370
11.4. Robust partly nested analyses .....................................................................37111.5. Specific comparisons...................................................................................372
11.5.1. Main effects............................................................ ..................................... 37211.5.2. Interactions ................................................... ............................................... 37211.5.3. Profile (i.e. trend) analysis......................................................... .................. 373
11.6. Analysis of unbalanced partly nested designs.............................................37411.7. Power for partly nested designs ..................................................................37511.8. More complex designs.................................................................................375
11.8.1. Additional between plots/subjects factors ................................................... 37611.8.2. Additional within plots/subjects factors ...................................................... 37811.8.3. Additional between plots/subjects and within plots/subjects factors ........... 37911.8.4. General comments about complex designs.................................................. 379
11.9. Partly nested designs and statistical software ............................................380General Issues and Hints for analysis ....................................................................381
http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk10.pdf/http://qk11.pdf/ -
7/28/2019 Quinn & Keough Stats
10/12
12. Analyses of covariance ............................................................40912.1. Single factor analysis of covariance (ANCOVA) ........................................409
12.1.1. Linear models for analysis of covariance .................................................... 41012.1.2. Analysis of (co)variance........... ........................................................... ........ 41212.1.3. Null hypotheses ...................................................... ..................................... 412
12.1.4. Comparing ANCOVA models .................................................................... 41312.2. Assumptions of ANCOVA............................................................................41312.2.1. Linearity ....................................................... ............................................... 41312.2.2. Covariate values similar across groups ........................... ............................ 41412.2.3. Fixed covariate (X).................................................................... .................. 414
12.3. Homogeneous slopes...................................................................................41412.3.1. Testing for homogeneous within group regression slopes........................... 41412.3.2. Dealing with heterogeneous within group regression slopes..... .................. 41512.3.3. Comparing regression lines ....................................................... .................. 416
12.4. Robust ANCOVA .........................................................................................416 12.5. Unequal sample sizes (unbalanced designs)...............................................416 12.6. Specific comparisons of adjusted means.....................................................416
12.6.1. Planned contrasts.................................................... ..................................... 41612.6.2. Unplanned comparisons .................................................. ............................ 417
12.7. More complex designs.................................................................................417 12.7.1. Designs with two or more covariates .............................. ............................ 41712.7.2. Factorial designs... ........................................................... ............................ 41712.7.3. Nested designs with one covariate............................................. .................. 41812.7.4. Partly nested models with one covariate ..................................................... 419
12.8. General Issues and hints for analysis..........................................................420General issues .......................................................... ............................................... 420Hints for analysis ..................................................... ............................................... 421
13 Generalized linear models and logistic regression ...............43513.1 Generalized linear models ..........................................................................43513.2 Logistic regression......................................................................................436
13.2.1 Simple logistic regression ......................................................... .................. 43613.2.2 Multiple logistic regression ....................................................... .................. 44013.2.3 Categorical predictors...................................................... ............................ 44113.2.4 Assumptions of logistic regression............... ............................................... 44113.2.5 Goodness-of-fit and residuals... ........................................................... ........ 44213.2.6 Model diagnostics................................................... ..................................... 44313.2.7 Model selection ...................................................... ..................................... 44413.2.8 Software for logistic regression........................................................... ........ 444
13.3 Poisson regression ......................................................................................44513.4 Generalised additive models .......................................................................446 13.5 Models for correlated data..........................................................................448
13.5.1 Multi-level (random effects) models ................................................... ........ 44813.5.2 Generalized estimating equations........................................................ ........ 44913.6 General Issues and hints for analysis..........................................................451
14. Analyzing frequencies .............................................................45814.1. Single variable goodness-of-fit tests............................................................45814.2. Contingency tables ......................................................................................459
14.2.1. Two way tables.......................................................................... .................. 45914.2.2. Three way tables........................................................................ .................. 463
14.3. Loglinear models.........................................................................................466 14.3.1. Two way tables.......................................................................... .................. 46614.3.2. Log-linear models for three way tables ....................................................... 467
14.3.3. More complex tables ....................................................... ............................ 47214.4. General Issues and hints for analysis..........................................................472General issues .......................................................... ............................................... 472
http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk13.pdf/http://qk12.pdf/ -
7/28/2019 Quinn & Keough Stats
11/12
Hints for analysis ..................................................... ............................................... 473
15. Introduction to multivariate analyses ...................................48615.1. Multivariate data.........................................................................................486 15.2. Distributions and associations....................................................................487 15.3. Linear combinations, eigenvectors and eigenvalues...................................488
15.3.1. Linear combinations of variables ........................................................ ........ 48815.3.2. Eigenvalues ................................................................................................. 48815.3.3. Eigenvectors ........................................................... ..................................... 48915.3.4. Derivation of components ........................................................................... 489
15.4. Multivariate distance and dissimilarity measures.......................................48915.4.1. Dissimilarity measures for continuous variables....................... .................. 49015.4.2. Dissimilarity measures for dichotomous (binary) variables ........................ 49115.4.3. General dissimilarity measures for mixed variables............................ ........ 49115.4.4. Comparison of dissimilarity measures ........................................................ 492
15.5. Comparing distance and/or dissimilarity matrices.....................................49215.6. Data standardization...................................................................................49315.7. Standardization, association and dissimilarity ...........................................495
15.8. Multivariate graphics..................................................................................49515.9. Screening multivariate data sets .................................................................496
15.9.1. Multivariate outliers ........................................................ ............................ 49615.9.2. Missing observations.................................... ............................................... 496
15.10. General issues and hints for analysis..........................................................499General Issues.................. ........................................................... ............................ 499Hints for analysis ..................................................... ............................................... 499
Captions to Figures ................................................................................................515
16. Multivariate analysis of variance and discriminantanalysis ...................................................................................516
16.1. Multivariate analysis of variance (MANOVA)............................................516 16.1.1. Single factor MANOVA ............................................................................. 51716.1.2. Specific comparisons................................................................. .................. 51916.1.3. Relative importance of each response variable ........................................... 51916.1.4. Assumptions of MANOVA....................................................... .................. 52016.1.5. Robust MANOVA..................................................................... .................. 52116.1.6. More complex designs......................................................................... ........ 521
16.2. Discriminant function analysis....................................................................52216.2.1. Description and hypothesis testing...................................................... ........ 52216.2.2. Classification and prediction ..................................................... .................. 52316.2.3. Assumptions of discriminant function analysis......................... .................. 52416.2.4. More complex designs......................................................................... ........ 525
16.3. MANOVA vs Discriminant function analysis ..............................................525
16.4. General issues and hints for analysis..........................................................525General issues .......................................................... ............................................... 525Hints for analysis ..................................................... ............................................... 525
Captions to Figures ................................................................................................540
17. Principal components and correspondence analysis............54117.1. Principal Components Analysis ..................................................................541
17.1.1. Deriving components .................................................................................. 54217.1.2. Which association matrix to use? ........................................................ ........ 54317.1.3. Interpreting the components ...................................................... .................. 54417.1.4. Rotation of components............................................................................... 54417.1.5. How many components to retain? ....................................................... ........ 54517.1.6. Assumptions ........................................................... ..................................... 54617.1.7. Robust PCA............................................................ ..................................... 54717.1.8. Graphical representations .......................................................... .................. 547
http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk16.pdf/http://qk15.pdf/ -
7/28/2019 Quinn & Keough Stats
12/12
17.1.9. Other uses of components ........................................................................... 54917.2. Factor analysis............................................................................................55017.3. Correspondence analysis ............................................................................551
17.3.1. Mechanics............................................................... ..................................... 55217.3.2. Scaling and joint plots ..................................................... ............................ 55217.3.3. Reciprocal averaging....................................................... ............................ 553
17.3.4. Use of CA with ecological data....................................... ............................ 55417.3.5. Detrending.. ............................................................ ..................................... 554
17.4. Canonical correlation analysis ...................................................................55517.5. Redundancy analysis ...................................................................................556 17.6. Canonical correspondence analysis............................................................557 17.7. Constrained and partial ordination........................................................55817.8. General issues and hints for analysis..........................................................559
General issues .......................................................... ............................................... 559Hints for analysis ..................................................... ............................................... 560
18. Multidimensional scaling and cluster analysis .....................57418.1. Multidimensional scaling ............................................................................574
18.1.1. Classical scaling Principal Coordinates Analysis (PCoA) ............... ........ 57518.1.2. Enhanced multidimensional scaling .................................................... ........ 57618.1.3. Dissimilarities and testing hypotheses about groups of objects................... 57918.1.4. Relating MDS to original variables ..................................................... ........ 58418.1.5. Relating MDS to covariates......................................................................... 585
18.2. Classification...............................................................................................58518.2.1. Cluster Analysis .................................................................................. ........ 586
18.3. Scaling (ordination) and clustering for biological data..............................58818.4. General issues and hints for analysis..........................................................589
General issues .......................................................... ............................................... 589Hints for analysis ..................................................... ............................................... 589
19 Presentation of results.............................................................59819.1 Presentation of analyses..............................................................................598
19.1.1 Linear models ......................................................... ..................................... 59819.1.2 Other analyses ........................................................ ..................................... 600
19.2 Layout of tables ...........................................................................................60019.3 Displaying summaries of the data...............................................................601
19.3.1 Bar graph...................................................... ............................................... 60219.3.2 Line graph (category plot) ......................................................... .................. 60319.3.3 Scatterplots ................................................... ............................................... 60319.3.4 Pie charts ...................................................... ............................................... 604
19.4 Error bars....................................................................................................60419.4.1 Alternative approaches .................................................... ............................ 605
19.5 Oral presentations.......................................................................................606 19.5.1 Slides, computers, or overheads? ........................................................ ........ 60619.5.2 Graphics packages........................................................... ............................ 60819.5.3 Working with colour ....................................................... ............................ 60819.5.4 Scanned images ...................................................... ..................................... 60819.5.5 Information content ......................................................... ............................ 609
19.6 General issues and hints .............................................................................609
20. References ................................................................................615
http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk20.pdf/http://qk17.pdf/http://qk20.pdf/http://qk19.pdf/http://qk18.pdf/