Power and Sample Size

Shaun Purcell & Danielle Posthuma

Twin Workshop March 2002

Aims of Session

Introduce concept of power and errors in inference

Practical 1 : Using probability distribution

functions to calculate power

Power in the classical ACE twin study

Practical 2 : using Mx to calculate power

Practical 3 : Monte-Carlo simulation

Power primer

Statistics (e.g. chi-squared, z-score) are continuous

measures of support for a certain hypothesis

Test statistic

Inevitably leads to two types of mistake : false positive (YES instead of NO) (Type I)false negative (NO instead of YES) (Type II)

YES OR NO decision-making : significance testing

Hypothesis testing

Null hypothesis : no effect

A ‘significant’ result means that we can reject the

null hypothesis

A ‘nonsignificant’ result means that we cannot

reject the null hypothesis

Statistical significance

The ‘p-value’

The probability of a false positive error if the null

were in fact true

Typically, we are willing to incorrectly reject the null

5% or 1% of the time (Type I error)

Misunderstandings

p - VALUES

that the p value is the probability of the null

hypothesis being true

that very low p values mean large and important

effects

NULL HYPOTHESIS

that nonrejection of the null implies its truth

Limitations

IF A RESULT IS SIGNIFICANT

leads to the conclusion that the null is false

BUT, this may be trivial

IF A RESULT IS NONSIGNIFICANT

leads only to the conclusion that it cannot be

concluded that the null is false

Alternate hypothesis

Neyman & Pearson (1928)

ALTERNATE HYPOTHESIS

specifies a precise, non-null state of affairs with

associated risk of error

Critical value

Sampling distribution if HA were true

Sampling distribution if H0 were true

Rejection of H0 Nonrejection of H0

H0 true

HA true

POWER =(1- )

Nonsignificant resultType I error at rate

Type II error at rate

Significant result

The probability of rejection of a false null-

hypothesis

depends on - the significance crtierion ()- the sample size (N) - the effect size (NCP)

“The probability of detecting a given effect size in a population from a sample of size N, using significance criterion ”

Impact of alpha

Critical value

Impact of effect size, N

Critical value

Applications

POWER SURVEYS / META-ANALYSES- low power undermines the confidence that can be

placed in statistically significant results

INTERPRETING NONSIGIFICANT RESULTS- nonsignficant results only meaningful if power is high

EXPERIMENTAL DESIGN- avoiding false positives vs. dealing with false negatives

MAGNITUDE VS. SIGNIFICANCE- highly significant very important

Practical Exercise 1

Calculation of power for simple case-control study.

DATA : frequency of risk factor in 30 cases and 30

controls

TEST : 2-by-2 contingency table : chi-squared

(1 degree of freedom)

Step 1 : determine expected chi-squared

Hypothetical risk factor frequencies

Case Control

Risk present 20 10

Risk absent 10 20

Chi-squared statistic = 6.666

EO 22 )(

Critical value

Step 2. Determine the critical value for a given type I error rate,

- inverse central chi-squared distribution

Critical value

Step 3. Determine the power for a given critical valueand non-centrality parameter

- non-central chi-squared distribution

Calculating Power

1. Calculate critical value (Inverse central 2)

Alpha 0 (under the null)

2. Calculate power (Non-central 2)

Crit. value Expected NCP

http://workshop.colorado.edu/~pshaun/

df = 1 , NCP = 0

3.84146

6.63489

10.82754

Determining power

df = 1 , NCP = 6.666

X Power

0.05 3.84146

0.01 6.6349

0.001 10.827

Exercise 1

Calculate power (for the 3 levels of alpha) if sample

size were two times larger (assume proportions

remain constant) ?

Hint: the NCP is a linear function of sample size, and will also

be two times larger

Answers

df = 1 , NCP = 13.333

X Power

0.05 3.84146

0.01 6.6349

0.001 10.827

nb. Stata : di 1-nchi(df,NCP,invchi(df,))

Twin 1

a’ 0 e’

Estimating power for twin models

The power to detect, e.g., common environment

Expected covariance matrices arecalculated under the alternate model :

Fit model to data with value of interest fixed to null value, e.g. c = 0

NCP = -2LLSUB

0.51 0.28

0.41 0.20

Model A C E

1 30% 20% 50%

2 0% 20% 80%

(350 MZ pairs, 350 DZ pairs)

Model Power to detect C

Alpha 0.05 0.01

Using power.mx script

Qu. You observe MZ and DZ correlations of 0.8

and 0.5 respectively, in 100 MZ and 100 DZ twin

pairs. What is the power to detect an additive

genetic effect, with a type I error rate of 1 in

Absolute ACE effects

Power to detect :

A C E A C

0.1 0.1 0.8 0.02 0.02

0.2 0.2 0.6 0.06 0.09

0.3 0.3 0.4 0.29 0.32

0.4 0.4 0.2 0.95 0.79

150 MZ twins, 150 DZ twins, = 0.01

Relative ACE effects

Power to detect :

A C E A C

0.2 0.2 0.6 0.06 0.09

0.2 0.0 0.8 0.57

0.0 0.2 0.8 0.82

150 MZ twins, 150 DZ twins, = 0.01

Sample Size

NMZ NDZ A C

150 150 0.83 0.53

250 250 0.98 0.86

350 350 1.00 0.96

500 500 1.00 0.99

A:C:E = 2:2:1, = 0.001

NCP and power

0 5 10 15 20

Relative MZ and DZ sample N

NMZ NDZ A C

150 150 0.83 0.53

500 500 1.00 0.99

500 150 0.99 0.56

150 500 0.95 0.99

A:C:E = 2:2:1, = 0.001

Increasing power

Increase sample size

Increase

Multivariate analysis

Adding other family members

Adding other siblings

Power compared to twins only design

(keeping total # individuals constant)

Power to detect

+ 1 sibling + ++ ++

+ 2 siblings - ++ ++

Monte-Carlo simulation

Instead of calculating expected NCP under

population parameter values, simulate multiple

randomly-sampled datasets

Perform test on each dataset

Due to random sampling variation, the effect will

not always be detectable

The proportion of significant results Power

Expected NCP

Critical value

More importantly...

Meike says …

“people are going skiing Saturday and all are

welcome to join”

Power and Sample Size

Documents

Transcript of Power and Sample Size

Sample size and power calculations made simple

[PSS] Power and Sample Size - merit.unu.edu

Power and sample size. Objectives Explain why sample size is important Explain what makes up a sample size calculation Demonstrate sample size calculations.

Power and Sample Size Calculations -

Hypothesis Testing, Power, Sample Size and Confidence ...biostat.mc.vanderbilt.edu/wiki/pub/Main/AnesShortCourse/Hypothesis... · Hypothesis Testing, Power, Sample Size and Con dence

Sample Size and Statistical Power

BIOSTATISTICS Lecture 15 Power and Sample Size ' The power of

Power and Sample Size + Principles of Simulation

[PSS] Power and Sample Size

Power, Sample Size, Effect Size: Considerations for Research · Power, Sample Size, Effect Size: Considerations for Research Carol B ... •Eta2 % of variance based on group diffs

Error, Power And Sample Size - brighamresearcheducation.org · Fayers and A Pinol. ... adequate sample size is still a relevant concern. ... Advantage: This reduces the sample size

Power and Sample Size Lecture

Sample Size and Power Analysis - Vanderbilt Universitybiostat.mc.vanderbilt.edu/wiki/pub/Main/...Sample Size and Power Analysis 8 15 Free Software for Sample Size and Power – PS

5. Sample Size, Power & Thresholdspages.stat.wisc.edu/~yandell/statgen/course/notes/course5.pdf · • determine power or sample size under alternative – power = chance to reject

Sample Size and Power

Sample Size Power 2009

Power and Sample Size

Power and Sample Size Determination for Linear Models · Power and Sample Size Determination for Linear Models ... design, (2) posit a scenario ... terms like power analysis, sample

Introduction to sample size and power calculations

Lecture 10: Power and Sample Size, Design Considerations ...faculty.washington.edu/tathornt/SISG2019/lectures/SISG2019session… · Lecture 10: Power and Sample Size, Design Considerations,