Quinn & Keough Stats

download Quinn & Keough Stats

of 12

Transcript of Quinn & Keough Stats

  • 7/28/2019 Quinn & Keough Stats

    1/12

    EXPERIMENTALDE SI GN AN D DATA

    AN AL Y SI S FORBIOLOGISTS

    G. P. QUINNM onash U niversi ty

    MICHAEL J. K EOUGHU niversi ty Of M el bourne

    April 2001

    TO BE PUBLI SHED BY CAMBRI DGEUN I VERSI TY PRESS, 2001

    This work is G. Quinn & M. Keough 2001 and Cambridge University Press. Please donot distribute printed or electronic copies without the authors permission.

  • 7/28/2019 Quinn & Keough Stats

    2/12

    i

    PREFACE

    Statistical analysis is at the core of most modern biology, and many biological hypotheses,even deceptively simple ones, are matched by complex statistical models. Prior to thedevelopment of modern desktop computers, determining whether the data fit these complexmodels was the province of professional statisticians. Many biologists instead opted for simplermodels whose structure had been simplified quite arbitrarily. Now, with immensely powerfulstatistical software available to most of us, these complex models can be fitted, creating a new setof demands & problems for biologists.

    We need to:

    Know the pitfalls and assumptions of particular statistical models

    Be able to identify the type of model appropriate for the sampling design and kind of data that we plan to collect

    Be able to interpret the output of analyses using these models

    Be able to design experiments and sampling programs optimally, i.e. with the bestpossible use of our limited time and resources.

    The analysis may be done by professional statisticians, rather than statistically trainedbiologists, especially in large research groups or multidisciplinary teams. In these situations, weneed to be able to speak a common language

    Frame our questions in such a way as to get a sensible answer

    Be aware of biological considerations that may cause statistical problems; we can notexpect a statistician to be aware of the biological idiosyncrasies or our particular study,but if he or she lacks that information, we may get misleading or incorrect advice

    Understand the advice or analyses that we receive, and be able to translate that back intobiology

    This book aims to place biologists in a better position to do these things. It arose from ourinvolvment in designing and analyzing our own data, but also providing advice to students andcolleagues, and teaching classes in design and analysis. As part of these activities, we becameaware, first of our limitations, prompting us to read more widely in the primary statisticalliterature, and second, and more importantly, of the complexity of the statistical models

    underlying much biological research. In particular, we continually encountered experimentaldesigns that were not described comprehensively in many of our favourite texts. This bookdescribes many of the common designs used in biological research, and we present the statisticalmodels underlying those designs, with enough information to highlight their benefits and pitfalls.

    Our emphasis here is on dealing with biological data how to design sampling programs thatrepresent the best use of our resources, how to avoid mistakes that make analyzing our datadifficult, and how to analyse the data when they are collected. We emphasise the problemsassociated with real world biological situations.

  • 7/28/2019 Quinn & Keough Stats

    3/12

    ii

    IN TH IS BOOK

    Our approach is to encourage readers to understand the models underlying the mostcommon experimental designs. We describe the models that are appropriate for various kinds of biological data continuous and categorical response variables, continuous and categoricalpredictor or independent variables. Our emphasis is on general linear models, and we begin with

    the simplest situations - single, continuous variables - describing those models in detail. We usethese models as building blocks to understanding a wide range of other kinds of data all of thecommon statistical analyses, rather then being distinctly different kinds of analyses, are variationson a common theme of statistical modeling constructing a model for the data and thendetermining whether observed data fit this particular model. Our aim is to show how a broadunderstanding of the models allow us to deal with a wide range of more complex situations.

    We have illustrated this approach of fitting models primarily with parametric statistics. Mostbiological data is still analyzed with linear models that assume underlying normal distributions.However, we introduce readers to a range of more general approaches, and stress that, once youunderstand the general modeling approach for normally-distributed data, you can use thatinformation to begin modeling data with nonlinear relationships, variables that follow other

    statistical distributions, etc.LEARNING BY EXAMPLE

    One of our strongest beliefs is that we understand statistical principles much better when wesee how they are applied to situations in our own discipline. Examples let us make the linkbetween statistical models and formal statistical terms (blocks, plots, etc.) or papers written inother disciplines, and the biological situations that we are dealing with. For example, how is ouranalysis and interpretation of an experiment repeated on several times helped by reading aliterature about blocks of agricultural land? How does literature developed for psychologicalresearch let us deal with measuring changes in physiological responses of plants?

    Throughout this book, we illustrate all of the statistical techniques with examples from thecurrent biological literature. We describe why (we think) the authors chose to do an experimentin a particular way, and how to analyse the data, including assessing assumptions and interpretingstatistical output. These examples appear as boxes through each chapter, and we are delightedthat authors of most of these studies have made their raw data available to us. We provide thoseraw data files on a web site, allowing readers to run these analyses using their particular softwarepackage.

    The other value of published examples is that we can see how particular analyses can bedescribed and reported. When fitting complex statistical models, it is easy to allow the biology tobe submerged by a mass of statistical output. We hope that the examples, together with our ownthoughts on this subject, presented in the final chapter, will help prevent this happening.

    TH IS BOOK IS A BRIDGE

    It is not possible to produce a book that introduces a reader to biological statistics and takesthem far enough to understand complex models, at least while having a book that is smallenough to transport. We therefore assume that readers are familiar with basic statistical concepts,such as would result from a one or two semester introductory course, or have read one of theexcellent basic texts (e.g. Sokal & Rohlf 1995). We take the reader from theses texts into morecomplex areas, explaining the principles, assumptions, and pitfalls, and encourage a reader to readthe excellent detailed treatments (e.g, for analysis of variance, Winer et al. 1991 or Underwood1997).

    Biological data are often messy, and many readers will find that their research questions

    require more complex models than we describe here. Ways of dealing with messy data or

  • 7/28/2019 Quinn & Keough Stats

    4/12

    iii

    solutions to complex problems are often provided in the primary statistical literature. We try topoint the way to key pieces of that statistical literature, providing the reader with the basic toolsto be able to deal with that literature, or to be able to seek professional (statistical) help whenthings become too complex.

    We must always remember that for biologists, statistics is a tool that we use to illuminate andclarify biological problems. Our aim is to be able to use these tools efficiently, without losingsight of the biology that is the motivation for most of us entering this field.

    SOME ACKNOWLE DGMENT S

    Our biggest debt is to the range of colleagues who have read, commented upon, andcorrected various versions of these chapters. Many of these colleagues have their own researchgroups, who they enlisted in this exercise. These altruistic and diligent souls include(alphabetically), Andrew Constable, Barb Downes, Peter Fairweather, Ivor Grownf, JacquiBrooks, Ralph Mac Nally, Richard Marchant, Pete Raimondi, Wayne Robinson, SuvaluckSatumanatpan, Sabine Schreiber. Perhaps the most innocent victims were the graduate studentswho have been part of our research groups over the period we produced this book. We greatly

    appreciate their willingness to trade the chance of some illumination for reading and highlightingour obfuscations.

    We also wish to thank the various researchers whose data we used as examples throughout.Most of them willingly gave of their raw data, trusting that we would neither criticize or findflaws in their published work (we didnt!), or were public-spirited enough to have published theirraw data.

  • 7/28/2019 Quinn & Keough Stats

    5/12

    Table of Contents

    1. Introduction .................................................................................11.1. Scientific method .............................................................................................1

    1.1.1. Pattern description........................................................... ................................ 21.1.2. Models.......................................................... ................................................... 21.1.3. Hypotheses and tests ....................................................... ................................ 31.1.4. Alternatives to falsification ....................................................... ...................... 41.1.5. Role of statistical analysis ......................................................... ...................... 5

    1.2. Experiments and other tests ............................................................................51.3. Data, observations & variables.......................................................................7 1.4. Probability.......................................................................................................7 1.5. Probability distributions .................................................................................9

    1.5.1. Distributions for variables ......................................................... .................... 101.5.2. Distributions for statistics......... ........................................................... .......... 12

    2. Estimation ..................................................................................152.1. Samples and populations...............................................................................152.2. Common Parameters & Statistics .................................................................16

    2.2.1. Centre (location) of distribution .......................................................... .......... 162.2.2. Spread or variability ........................................................ .............................. 17

    2.3. Standard errors and confidence intervals for the mean................................17 2.3.1. Normal distributions and the Central Limit Theorem ................................... 172.3.2. Standard error of the sample mean...................................................... .......... 182.3.3. Confidence intervals for population mean..................................................... 192.3.4. Interpretation of confidence intervals for population mean........................... 202.3.5. Standard errors for other statistics ....................................................... .......... 20

    2.4. Methods for estimating parameters...............................................................212.4.1. Maximum likelihood ....................................................... .............................. 212.4.2. Ordinary least squares (OLS) .................................................... .................... 222.4.3. ML vs OLS estimation. ................................................... .............................. 22

    2.5. Resampling Methods for Estimation .............................................................232.5.1. Bootstrap ...................................................... ................................................. 232.5.2. Jackknife................................................................. ....................................... 24

    2.6. Bayesian inference - estimation ....................................................................242.6.1. Bayesian estimation...................................... ................................................. 252.6.2. Prior knowledge and probability ......................................................... .......... 252.6.3. Likelihood function ......................................................... .............................. 262.6.4. Posterior probability ........................................................ .............................. 262.6.5. Examples ...................................................... ................................................. 272.6.6. Other comments ..................................................... ....................................... 27

    3. Hypothesis testing......................................................................373.1. Statistical hypothesis testing .........................................................................37

    3.1.1. Classical statistical hypothesis testing........................................................... 373.1.2. Associated probability and Type I error ........................................................ 393.1.3. Hypothesis tests for a single population ........................................................ 403.1.4. One and two tailed tests.......................................... ....................................... 413.1.5. Hypotheses for two populations .......................................................... .......... 423.1.6. Parametric tests and their assumptions .......................................................... 44

    3.2. Decision errors..............................................................................................453.2.1. Type I and II errors....................................... ................................................. 453.2.2. Asymmetry and Scalable Decision Criteria............................... .................... 46

    3.3. Other testing methods....................................................................................47 3.3.1. Robust parametric tests ................................................... .............................. 473.3.2. Randomization (permutation) tests......................... ....................................... 47

    http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk01.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk02.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk02.pdf/http://qk01.pdf/
  • 7/28/2019 Quinn & Keough Stats

    6/12

    3.3.3. Rank-based nonparametric tests .......................................................... .......... 483.4. Multiple testing..............................................................................................50

    3.4.1. The problem ........................................................... ....................................... 503.4.2. Adjusting significance levels and/or P values ............................................... 51

    3.5. Combining results from statistical tests ........................................................523.5.1. Combining P values......................................................... .............................. 52

    3.5.2. Meta-analysis ......................................................... ....................................... 533.6. Critique of statistical hypothesis testing .......................................................53

    3.6.1. Dependence on sample size and stopping rules............................................. 543.6.2. Sample space relevance of data not observed............................................. 543.6.3. P values as measure of evidence ......................................................... .......... 543.6.4. Null hypothesis always false ..................................................... .................... 543.6.5. Arbitrary significance levels...................................................... .................... 553.6.6. Alternatives to statistical hypothesis testing.................... .............................. 55

    3.7. Bayesian hypothesis testing...........................................................................56

    4. Graphical exploration of data..................................................664.1. Exploratory data analysis .......................................................... .................... 66

    4.1.1. Exploring samples ........................................................... .............................. 664.2. Analysis with graphs .....................................................................................694.2.1. Assumptions of parametric linear models ..................................................... 69

    4.3. Transforming Data........................................................................................704.3.1. Transformations and distributional assumptions ........................................... 724.3.2. Transformations and linearity.................................................... .................... 734.3.3. Transformations and additivity............................................................ .......... 73

    4.4. Standardizations............................................................................................734.5. Outliers..........................................................................................................744.6. Censored and missing data ...........................................................................74

    4.6.1. Missing data ........................................................... ....................................... 744.6.2. Censored (truncated) data............................. ................................................. 75

    4.7. General issues and hints for analysis............................................................77 4.7.1. General Issues ........................................................ ....................................... 774.7.2. Captions to Figures............................. ........................................................... 79

    5. Correlation and regression.......................................................805.1. Correlation analysis......................................................................................80

    5.1.1. Parametric correlation model .................................................... .................... 805.1.2. Robust correlation ........................................................... .............................. 825.1.3. Parametric and nonparametric confidence regions........................................ 82

    5.2. Linear models................................................................................................835.3. Linear regression analysis ............................................................................84

    5.3.1. Simple (bivariate) linear regression................................. .............................. 845.3.2. Linear model for regression....................................................... .................... 855.3.3. Estimating the model parameters ........................................................ .......... 865.3.4. Analysis of variance ........................................................ .............................. 885.3.5. Null hypotheses in regression................................. ....................................... 895.3.6. Comparing regression models ................................................... .................... 905.3.7. Variance explained .......................................................... .............................. 915.3.8. Assumptions of regression analysis................................. .............................. 925.3.9. Regression diagnostics .................................................... .............................. 945.3.10. Diagnostic graphics ......................................................... .............................. 965.3.11. Transformations ........................................................................ .................... 975.3.12. Regression through the origin ....................................................................... 985.3.13. Weighted least squares .................................................... .............................. 995.3.14. X random (Model II regression) .......................................................... .......... 995.3.15. Robust Regression........................................................... ............................ 102

    5.4. Relationship between regression and correlation.......................................1035.5. Smoothing....................................................................................................104

    http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk03.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk04.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk03.pdf/http://qk05.pdf/http://qk04.pdf/
  • 7/28/2019 Quinn & Keough Stats

    7/12

    5.5.1. Running means....................................................... ..................................... 1055.5.2. LO(W)ESS ................................................... ............................................... 1055.5.3. Splines .......................................................... ............................................... 1055.5.4. Kernels ......................................................... ............................................... 1065.5.5. Other issues ............................................................ ..................................... 106

    5.6. Power of tests in correlation and regression ..............................................107

    5.7. General issues and hints for analysis..........................................................107

    6. Multiple regression and correlation ......................................1246.1. Multiple linear regression analysis.............................................................124

    6.1.1. Multiple linear regression model........ ......................................................... 1246.1.2. Estimating model parameters .................................................... .................. 1266.1.3. Analysis of variance ........................................................ ............................ 1266.1.4. Null hypotheses and model comparisons .................................................... 1276.1.5. Variance explained .......................................................... ............................ 1286.1.6. Which predictors are important? ......................................................... ........ 1286.1.7. Assumptions of multiple regression .................................................... ........ 1306.1.8. Regression diagnostics .................................................... ............................ 131

    6.1.9. Diagnostic graphics ......................................................... ............................ 1316.1.10. Transformations ........................................................................ .................. 1326.1.11. Collinearity..................... ........................................................... .................. 1326.1.12. Interactions in multiple regression ............................................ .................. 1356.1.13. Polynomial regression ..................................................... ............................ 1376.1.14. Indicator (dummy) variables ....................................................................... 1386.1.15. Finding the best regression model ......................................... .................. 1406.1.16. Hierarchical partitioning...................................................................... ........ 1426.1.17. Other issues in multiple linear regression............................................ ........ 144

    6.2. Regression trees ..........................................................................................1456.3. Path analysis and structural equation modelling........................................147 6.4. Nonlinear models ........................................................................................1496.5. Smoothing and response surfaces ...............................................................1506.6. General issues and hints for analysis..........................................................151

    Figure legends ........................................................................................................174

    7. Design and power analysis......................................................1757.1. Sampling......................................................................................................175

    7.1.1. Sampling designs................................ ......................................................... 1757.1.2. Size of sample ........................................................ ..................................... 177

    7.2. Experimental design....................................................................................177 7.2.1. Replication ................................................... ............................................... 1787.2.3. Controls ........................................................ ............................................... 1807.2.4. Randomization ....................................................... ..................................... 1817.2.5. Independence................................................ ............................................... 183

    7.2.6. Reducing unexplained variance................................................. .................. 1837.3. Power analysis ............................................................................................183

    7.3.1. Using power to plan experiments (a priori power analysis) ........................ 1847.3.2. Post-hoc power calculation.................................................................. ........ 1867.3.3. The effect size ........................................................ ..................................... 1877.3.4. Using the power analyses .......................................................... .................. 189

    7.4. General issues and hints for analysis..........................................................190 List of Figures.........................................................................................................194

    8. Comparing groups or treatments - analysis of variance .....1958.1. Single factor (one way) analysis of variance ..............................................195

    8.1.1. Types of predictor variables (factors)..................... ..................................... 196

    8.1.2. Linear model for single factor ANOVA.......................... ............................ 1978.1.3. Analysis of variance ........................................................ ............................ 1988.1.4. Null hypotheses ...................................................... ..................................... 199

    http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk05.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk06.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk07.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk05.pdf/http://qk08.pdf/http://qk07.pdf/http://qk06.pdf/
  • 7/28/2019 Quinn & Keough Stats

    8/12

    8.1.5. Comparing ANOVA models ..................................................... .................. 2008.1.6. Unequal sample sizes (unbalanced designs). ............................................... 200

    8.2. Factor effects...............................................................................................2018.2.1. Random effects: variance components ........................................................ 2018.2.2. Fixed effects ........................................................... ..................................... 203

    8.3. Assumptions.................................................................................................2048.3.1. Normality ..................................................... ............................................... 2048.3.2. Variance homogeneity............................................ ..................................... 2048.3.3. Independence................................................ ............................................... 205

    8.4. ANOVA diagnostics.....................................................................................206 8.5. Robust ANOVA............................................................................................206

    8.5.1. Tests with heterogeneous variances .................................................... ........ 2068.5.2. Rank-based (nonparametric) tests .................................................... ........ 2078.5.3. Randomization tests ........................................................ ............................ 207

    8.6. Specific Comparisons Of Means .................................................................2088.6.1. Planned comparisons or contrasts ....................................................... ........ 2088.6.2. Unplanned pairwise comparisons........................................................ ........ 2108.6.3. Specific contrasts versus unplanned pairwise comparisons ........................ 213

    8.7. Tests for trends............................................................................................2138.8. Testing equality of group variances............................................................2148.9. Power of single factor ANOVA ...................................................................2158.10. General issues and hints for analysis..........................................................216

    9. Multifactor analysis of variance.............................................2389.1. Nested (hierarchical) designs......................................................................238

    9.1.1. Linear models for nested analyses.......................... ..................................... 2399.1.2. Analysis of variance ........................................................ ............................ 2409.1.3. Null hypotheses ...................................................... ..................................... 2419.1.4. Unequal sample sizes (unbalanced designs). ............................................... 2429.1.5. Comparing ANOVA models ..................................................... .................. 2429.1.6. Factor effects in nested models ........................................................... ........ 2439.1.7. Assumptions for nested models.................... ............................................... 2449.1.8. Specific comparisons for nested designs ..................................................... 2449.1.9. More complex designs.................................. ............................................... 2449.1.10. Design and power............................................................ ............................ 245

    9.2. Factorial designs.........................................................................................247 9.2.1. Linear models for factorial designs ..................................................... ........ 2499.2.2. Analysis of variance ........................................................ ............................ 2519.2.3. Null hypotheses ...................................................... ..................................... 2529.2.4. What are main effects and interactions really measuring? .......................... 2549.2.5. Comparing ANOVA models ..................................................... .................. 2579.2.6. Unbalanced designs ......................................................... ............................ 2589.2.7. Factor effects .......................................................... ..................................... 262

    9.2.8. Assumptions ........................................................... ..................................... 2639.2.9. Robust factorial ANOVAs ........................................................ .................. 2649.2.10. Specific comparisons on main effects ......................................................... 2659.2.11. Interpreting interactions ...................................................................... ........ 2659.2.12. More complex designs......................................................................... ........ 2689.2.13. Power and design in factorial ANOVA.................. ..................................... 271

    9.3. Pooling in multifactor designs ....................................................................2719.4. Relationship between factorial and nested designs.....................................2729.5. General issues and hints for analysis..........................................................272

    9.5.1. General Issues ........................................................ ..................................... 2729.5.2. Hints for analysis...................... ........................................................... ........ 273

    10. Randomized blocks and simple repeated measures:unreplicated two-factor designs .............................................314

    10.1. Unreplicated two factor experimental designs............................................314

    http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk08.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk09.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk08.pdf/http://qk10.pdf/http://qk09.pdf/
  • 7/28/2019 Quinn & Keough Stats

    9/12

    10.1.1. Randomized complete block (RCB) designs ............................................... 31410.1.2. Repeated measures (RM) designs ............................................................... 315

    10.2. Analyzing RCB and RM designs..................................................................316 10.2.1. Linear models for RCB and RM analyses ................................................... 31610.2.2. Analysis of variance ........................................................ ............................ 31810.2.3. Null hypotheses ...................................................... ..................................... 318

    10.2.4. Comparing ANOVA models ..................................................... .................. 31910.3. Interactions in RCB and RM models...........................................................320

    10.3.1. Importance of treatment by block interactions ............................................ 32010.3.2. Checks for interaction in unreplicated designs................ ............................ 321

    10.4. Assumptions.................................................................................................32210.4.1. Normality, independence of errors ...................................................... ........ 32210.4.2. Variances and covariances - sphericity ............................................... ........ 32310.4.3. Recommended strategy ............................................................................... 325

    10.5. Robust RCB and RM analyses.....................................................................32510.6. Specific comparisons...................................................................................326 10.7. Efficiency of blocking (to block or not to block?) .......................................326 10.8. Time as a blocking factor ............................................................................327

    10.9. Analysis of unbalanced RCB designs ..........................................................327 10.10. Power of RCB or simple RM designs ..........................................................32910.11. More complex block designs .......................................................................329

    10.11.1. Factorial randomized block designs............ ..................................... 32910.11.2. Incomplete block designs............................ ..................................... 33010.11.3. Latin square designs.......................... ............................................... 33010.11.4. Crossover designs .......................................................... .................. 333

    10.12. Generalized randomized block designs.......................................................33410.13. RCB and RM designs and statistical software ............................................33410.14. General issues and hints for analysis..........................................................335

    11. Split plot and repeated measures designs: Partly nested

    anovas ...................................................................................36111.1. Partly nested designs...................................................................................361

    11.1.1. Split-plot designs .................................................... ..................................... 36111.1.2. Repeated measures designs ....................................................... .................. 36311.1.3. Reasons for using these designs .......................................................... ........ 364

    11.2. Analyzing partly nested designs ..................................................................36511.2.1. Linear models for partly nested analyses ............... ..................................... 36511.2.2. Analysis of variance ........................................................ ............................ 36711.2.3. Null hypotheses ...................................................... ..................................... 36711.2.4. Comparing ANOVA models ..................................................... .................. 369

    11.3. Assumptions.................................................................................................36911.3.1. Between plots/subjects .................................................... ............................ 37011.3.2. Within plots/subjects and multisample sphericity ....................................... 370

    11.4. Robust partly nested analyses .....................................................................37111.5. Specific comparisons...................................................................................372

    11.5.1. Main effects............................................................ ..................................... 37211.5.2. Interactions ................................................... ............................................... 37211.5.3. Profile (i.e. trend) analysis......................................................... .................. 373

    11.6. Analysis of unbalanced partly nested designs.............................................37411.7. Power for partly nested designs ..................................................................37511.8. More complex designs.................................................................................375

    11.8.1. Additional between plots/subjects factors ................................................... 37611.8.2. Additional within plots/subjects factors ...................................................... 37811.8.3. Additional between plots/subjects and within plots/subjects factors ........... 37911.8.4. General comments about complex designs.................................................. 379

    11.9. Partly nested designs and statistical software ............................................380General Issues and Hints for analysis ....................................................................381

    http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk10.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk11.pdf/http://qk10.pdf/http://qk11.pdf/
  • 7/28/2019 Quinn & Keough Stats

    10/12

    12. Analyses of covariance ............................................................40912.1. Single factor analysis of covariance (ANCOVA) ........................................409

    12.1.1. Linear models for analysis of covariance .................................................... 41012.1.2. Analysis of (co)variance........... ........................................................... ........ 41212.1.3. Null hypotheses ...................................................... ..................................... 412

    12.1.4. Comparing ANCOVA models .................................................................... 41312.2. Assumptions of ANCOVA............................................................................41312.2.1. Linearity ....................................................... ............................................... 41312.2.2. Covariate values similar across groups ........................... ............................ 41412.2.3. Fixed covariate (X).................................................................... .................. 414

    12.3. Homogeneous slopes...................................................................................41412.3.1. Testing for homogeneous within group regression slopes........................... 41412.3.2. Dealing with heterogeneous within group regression slopes..... .................. 41512.3.3. Comparing regression lines ....................................................... .................. 416

    12.4. Robust ANCOVA .........................................................................................416 12.5. Unequal sample sizes (unbalanced designs)...............................................416 12.6. Specific comparisons of adjusted means.....................................................416

    12.6.1. Planned contrasts.................................................... ..................................... 41612.6.2. Unplanned comparisons .................................................. ............................ 417

    12.7. More complex designs.................................................................................417 12.7.1. Designs with two or more covariates .............................. ............................ 41712.7.2. Factorial designs... ........................................................... ............................ 41712.7.3. Nested designs with one covariate............................................. .................. 41812.7.4. Partly nested models with one covariate ..................................................... 419

    12.8. General Issues and hints for analysis..........................................................420General issues .......................................................... ............................................... 420Hints for analysis ..................................................... ............................................... 421

    13 Generalized linear models and logistic regression ...............43513.1 Generalized linear models ..........................................................................43513.2 Logistic regression......................................................................................436

    13.2.1 Simple logistic regression ......................................................... .................. 43613.2.2 Multiple logistic regression ....................................................... .................. 44013.2.3 Categorical predictors...................................................... ............................ 44113.2.4 Assumptions of logistic regression............... ............................................... 44113.2.5 Goodness-of-fit and residuals... ........................................................... ........ 44213.2.6 Model diagnostics................................................... ..................................... 44313.2.7 Model selection ...................................................... ..................................... 44413.2.8 Software for logistic regression........................................................... ........ 444

    13.3 Poisson regression ......................................................................................44513.4 Generalised additive models .......................................................................446 13.5 Models for correlated data..........................................................................448

    13.5.1 Multi-level (random effects) models ................................................... ........ 44813.5.2 Generalized estimating equations........................................................ ........ 44913.6 General Issues and hints for analysis..........................................................451

    14. Analyzing frequencies .............................................................45814.1. Single variable goodness-of-fit tests............................................................45814.2. Contingency tables ......................................................................................459

    14.2.1. Two way tables.......................................................................... .................. 45914.2.2. Three way tables........................................................................ .................. 463

    14.3. Loglinear models.........................................................................................466 14.3.1. Two way tables.......................................................................... .................. 46614.3.2. Log-linear models for three way tables ....................................................... 467

    14.3.3. More complex tables ....................................................... ............................ 47214.4. General Issues and hints for analysis..........................................................472General issues .......................................................... ............................................... 472

    http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk12.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk13.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk14.pdf/http://qk13.pdf/http://qk12.pdf/
  • 7/28/2019 Quinn & Keough Stats

    11/12

    Hints for analysis ..................................................... ............................................... 473

    15. Introduction to multivariate analyses ...................................48615.1. Multivariate data.........................................................................................486 15.2. Distributions and associations....................................................................487 15.3. Linear combinations, eigenvectors and eigenvalues...................................488

    15.3.1. Linear combinations of variables ........................................................ ........ 48815.3.2. Eigenvalues ................................................................................................. 48815.3.3. Eigenvectors ........................................................... ..................................... 48915.3.4. Derivation of components ........................................................................... 489

    15.4. Multivariate distance and dissimilarity measures.......................................48915.4.1. Dissimilarity measures for continuous variables....................... .................. 49015.4.2. Dissimilarity measures for dichotomous (binary) variables ........................ 49115.4.3. General dissimilarity measures for mixed variables............................ ........ 49115.4.4. Comparison of dissimilarity measures ........................................................ 492

    15.5. Comparing distance and/or dissimilarity matrices.....................................49215.6. Data standardization...................................................................................49315.7. Standardization, association and dissimilarity ...........................................495

    15.8. Multivariate graphics..................................................................................49515.9. Screening multivariate data sets .................................................................496

    15.9.1. Multivariate outliers ........................................................ ............................ 49615.9.2. Missing observations.................................... ............................................... 496

    15.10. General issues and hints for analysis..........................................................499General Issues.................. ........................................................... ............................ 499Hints for analysis ..................................................... ............................................... 499

    Captions to Figures ................................................................................................515

    16. Multivariate analysis of variance and discriminantanalysis ...................................................................................516

    16.1. Multivariate analysis of variance (MANOVA)............................................516 16.1.1. Single factor MANOVA ............................................................................. 51716.1.2. Specific comparisons................................................................. .................. 51916.1.3. Relative importance of each response variable ........................................... 51916.1.4. Assumptions of MANOVA....................................................... .................. 52016.1.5. Robust MANOVA..................................................................... .................. 52116.1.6. More complex designs......................................................................... ........ 521

    16.2. Discriminant function analysis....................................................................52216.2.1. Description and hypothesis testing...................................................... ........ 52216.2.2. Classification and prediction ..................................................... .................. 52316.2.3. Assumptions of discriminant function analysis......................... .................. 52416.2.4. More complex designs......................................................................... ........ 525

    16.3. MANOVA vs Discriminant function analysis ..............................................525

    16.4. General issues and hints for analysis..........................................................525General issues .......................................................... ............................................... 525Hints for analysis ..................................................... ............................................... 525

    Captions to Figures ................................................................................................540

    17. Principal components and correspondence analysis............54117.1. Principal Components Analysis ..................................................................541

    17.1.1. Deriving components .................................................................................. 54217.1.2. Which association matrix to use? ........................................................ ........ 54317.1.3. Interpreting the components ...................................................... .................. 54417.1.4. Rotation of components............................................................................... 54417.1.5. How many components to retain? ....................................................... ........ 54517.1.6. Assumptions ........................................................... ..................................... 54617.1.7. Robust PCA............................................................ ..................................... 54717.1.8. Graphical representations .......................................................... .................. 547

    http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk15.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk16.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk16.pdf/http://qk15.pdf/
  • 7/28/2019 Quinn & Keough Stats

    12/12

    17.1.9. Other uses of components ........................................................................... 54917.2. Factor analysis............................................................................................55017.3. Correspondence analysis ............................................................................551

    17.3.1. Mechanics............................................................... ..................................... 55217.3.2. Scaling and joint plots ..................................................... ............................ 55217.3.3. Reciprocal averaging....................................................... ............................ 553

    17.3.4. Use of CA with ecological data....................................... ............................ 55417.3.5. Detrending.. ............................................................ ..................................... 554

    17.4. Canonical correlation analysis ...................................................................55517.5. Redundancy analysis ...................................................................................556 17.6. Canonical correspondence analysis............................................................557 17.7. Constrained and partial ordination........................................................55817.8. General issues and hints for analysis..........................................................559

    General issues .......................................................... ............................................... 559Hints for analysis ..................................................... ............................................... 560

    18. Multidimensional scaling and cluster analysis .....................57418.1. Multidimensional scaling ............................................................................574

    18.1.1. Classical scaling Principal Coordinates Analysis (PCoA) ............... ........ 57518.1.2. Enhanced multidimensional scaling .................................................... ........ 57618.1.3. Dissimilarities and testing hypotheses about groups of objects................... 57918.1.4. Relating MDS to original variables ..................................................... ........ 58418.1.5. Relating MDS to covariates......................................................................... 585

    18.2. Classification...............................................................................................58518.2.1. Cluster Analysis .................................................................................. ........ 586

    18.3. Scaling (ordination) and clustering for biological data..............................58818.4. General issues and hints for analysis..........................................................589

    General issues .......................................................... ............................................... 589Hints for analysis ..................................................... ............................................... 589

    19 Presentation of results.............................................................59819.1 Presentation of analyses..............................................................................598

    19.1.1 Linear models ......................................................... ..................................... 59819.1.2 Other analyses ........................................................ ..................................... 600

    19.2 Layout of tables ...........................................................................................60019.3 Displaying summaries of the data...............................................................601

    19.3.1 Bar graph...................................................... ............................................... 60219.3.2 Line graph (category plot) ......................................................... .................. 60319.3.3 Scatterplots ................................................... ............................................... 60319.3.4 Pie charts ...................................................... ............................................... 604

    19.4 Error bars....................................................................................................60419.4.1 Alternative approaches .................................................... ............................ 605

    19.5 Oral presentations.......................................................................................606 19.5.1 Slides, computers, or overheads? ........................................................ ........ 60619.5.2 Graphics packages........................................................... ............................ 60819.5.3 Working with colour ....................................................... ............................ 60819.5.4 Scanned images ...................................................... ..................................... 60819.5.5 Information content ......................................................... ............................ 609

    19.6 General issues and hints .............................................................................609

    20. References ................................................................................615

    http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk17.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk18.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk19.pdf/http://qk20.pdf/http://qk17.pdf/http://qk20.pdf/http://qk19.pdf/http://qk18.pdf/