Ch 6 the 2 k Factorial Design

download Ch 6 the 2 k Factorial Design

of 56

Transcript of Ch 6 the 2 k Factorial Design

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    1/56

    Design and

    Analysis ofExperiments

    Chapter 6: The 2

    k

    FactorialDesign

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    2/56

    Introduction

    Special case of the general factorial design;k

    factors, allat two levels

    A complete replicate of such a design requires 2 2

    2 = 2kobservations and is called a2k Factorial Design

    The two levels are usually called low and high (they couldbe either quantitative or qualitative)

    Very widely used in industrial experimentation (factor

    screening experiments)

    we assume that (1) the factors are fixed,

    (2) the designs are completely randomized,

    (3) the usual normality assumptions are satisfied.

    (4) the response is approximately linear

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    3/56

    The Simplest Case: The 22

    - and + denote the low and

    high levels of a factor,respectively

    Low and high are arbitrary

    terms

    Geometrically, the four runs

    form the corners of a

    square

    Factors can be quantitative

    or qualitative, although their

    treatment in the final model

    will be different

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    4/56

    Analysis Procedure for a

    Factorial Design

    Estimate factoreffects

    FormulatemodelWith replication, use full model

    With an unreplicated design, use normal probabilityplots

    Statisticaltesting(ANOVA)

    Refinethe model Analyzeresiduals(graphical)

    Interpretresults

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    5/56

    Estimation of Factor Effects

    a:A at the high level and B

    at the low level

    b :A at the low level and B

    at the high level,

    abrepresents both factors

    at the high level

    (1) both factors at the low

    level.

    1

    2

    1

    2

    1

    2

    (1)2 2

    [ (1)]

    (1)

    2 2

    [ (1)]

    (1)

    2 2

    [ (1) ]

    A A

    n

    B B

    n

    n

    A y y

    ab a bn n

    ab a b

    B y y

    ab b a

    n n

    ab b a

    ab a b

    AB n n

    ab a b

    The effect estimates are: A

    = 8.33, B = -5.00, AB = 1.67

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    6/56

    Statistical Testing - ANOVA

    Examine the magnitude and direction of thefactor effects

    ANOVA can generally be used to confirm this

    interpretation. Total effects of A, B and AB are estimated in the

    form of contrasts for example:

    ContrastA = ab + a - b - (1)

    In the standard order:

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    7/56

    Based on the P-values, we conclude that the main effects

    are statistically significant and that there is no interaction

    between these factors.

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    8/56

    The Regression Model It is easy to express the results of the experiment in terms

    of a regression model.

    Where X1 and X2 are coded variables for reactant

    concentration and amount of catalyst respectively.

    The relationship between the natural variables and coded

    variables:

    The fitted regression model is:

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    9/56

    The Response Surface

    The regression model can be used to generateresponse surface plots.

    Substituting the coded variables with the normal

    variables we have:

    we use a fitted surface such as this to find a

    direction of potential improvement for a process.

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    10/56

    Because the model is first-order (that is, it containsonly the main effects), the fitted response surfaceis a plane.

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    11/56

    Residuals and Model Adequacy

    The residuals are the differences between the

    observed and fitted values of y. For example,

    when the reactant concentration is at the low

    level (X1

    = -1) and the catalyst is at the low level(X2 = -1), the predicted yield is:

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    12/56

    Residuals and Model Adequacy

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    13/56

    The 23 Factorial Design

    three factors, A, B, andC, each at two levelsand the eight treatmentcombinations can bedisplayed geometrically

    as a cube.

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    14/56

    Effects in The 23 Factorial Design

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    15/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    16/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    17/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    18/56

    Properties of the Table

    Except for columnI, every column has an equal number of+ and signs

    The sum of the product of signs in any two columns is zero

    Multiplying any column by Ileaves that column unchanged(identity element)

    The product of any two columns yields a column in thetable:

    Orthogonal design Orthogonality is an important property shared by all

    factorial designs

    2

    A B AB

    AB BC AB C AC

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    19/56

    Statistical analysis (ANOVA)

    Sums of squares for the effects are easily computed. Inthe 23 design with n replicates, the sum of squares for

    any effect is:

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    20/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    21/56

    The Regression Model and Response Surface

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    22/56

    Unreplicated 2kFactorial Designs

    2k

    factorial designs with one observation foreach experimental run

    An unreplicated 2kfactorial design is also

    sometimes called a single replicate

    These designs are very widely used

    Lack of replication causes potential problems instatistical testing

    Replication admits an estimate of pure error

    With no replication, fitting the full model results in zero

    degrees of freedom for error (Modeling the noise)

    chance of unusual response observations

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    23/56

    Spacing of Factor Levels in the

    Unreplicated 2kFactorial Designs

    If the factors are spaced too closely, it increases the chances

    that the noise will overwhelm the signal in the data

    More aggressive spacing is usually best

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    24/56

    solutions to this problem

    Sparsity of effects principle: Pooling high-orderinteractions to estimate error.

    Normal probability plottingof effects (Daniels, 1959)

    The effects that are negligible are normally distributed, with mean

    zero and variance and will tend to fall along a straight line on this

    plot

    significant effects will have nonzero means and will not lie along

    the straight line. Thus the preliminary model will be

    Half-normal plot

    This is a plot of the absolute value of the effect estimates against

    their cumulative normal probabilities.

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    25/56

    A Single Replicate of the 24 Design

    Investigation the effects of four factors on thefiltration rate of a resin

    The factors are A = temperature, B= pressure, C= mole

    ratio, D= stirring rate

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    26/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    27/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    28/56

    Estimates of the Effects

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    29/56

    The Half-Normal Probability Plot of

    Effects

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    30/56

    Design Projection: ANOVA Summary for the

    Model as a 23 in Factors A, C, and D

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    31/56

    The Regression Model

    The coded variables x1, x3, x4 take on values between -

    1 and + 1. The predicted filtration rate at run (1) is

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    32/56

    Model Residuals are Satisfactory

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    33/56

    Model Interpretation Main Effects and

    Interactions

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    34/56

    Model Interpretation Response

    Surface Plots

    With concentration at either the low or high level, high

    temperature and high stirring rate results in high filtration rates

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    35/56

    Other Methods for Analyzing

    Unreplicated Factorials

    Lenth's method:The basis of Lenth's method is

    to estimate the variance of a contrast from the

    smallest (in absolute value) contrast estimates.

    "pseudo standard error"

    margin of error

    simultaneous margin of error

    For Filtration Example

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    36/56

    Adjusted multipliers for Lenths method

    the original method makes too many type I errors,especially for small designs (few contrasts)

    To address the problem adjusted multipliers have been

    suggested:

    Lenths method is a nice supplement to the normal

    probability plot of effects

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    37/56

    Conditional inference chart

    The purpose of the graph is to help the experimenter injudging significant effects.

    In unreplicated designs, there is no internal estimate of

    variance

    conditional inference chart is designed to help theexperimenter evaluate effect magnitude for a range of

    standard deviation values.

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    38/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    39/56

    The Drilling Experiment:Data

    Transformation in a Factorial Design

    A = drill load,B = flow, C= speed,D = type of mud,

    y = advance rate of the drill

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    40/56

    Normal Probability Plot of Effects

    The Drilling Experiment

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    41/56

    Residual Plots

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    42/56

    The residual plots indicate that there are problems withtheequality of varianceassumption

    The usual approach to this problem is to employ atransformationon the response

    Power familytransformations are widely used

    Transformations are typically performed to

    Stabilize variance Induce at least approximate normality

    Simplify the model

    *y y

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    43/56

    Effect Estimates Following the

    Log Transformation

    Three main effects arelarge

    No indication of large

    interaction effects

    What happened to theinteractions?expressing the data in the

    correct metric has

    simplified its structure to

    the point that the twointeractions are no longer

    required in the

    explanatory model.

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    44/56

    Residuals

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    45/56

    Location and Dispersion Effects in an

    Unreplicated Factorial

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    46/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    47/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    48/56

    Duplicate Measurements on the

    Response The four design factors are temperature (A), time(B), pressure (C), and gas flow (D).

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    49/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    50/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    51/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    52/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    53/56

    Addition of Center Points

    to a 2kDesigns

    Runs at the center provide an estimate of

    error and allow the experimenter to

    distinguish between two possible models:

    0

    1 1

    20

    1 1 1

    First-order model (interaction)

    Second-order model

    k k k

    i i ij i j

    i i j i

    k k k k

    i i ij i j ii i

    i i j i i

    y x x x

    y x x x x

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    54/56

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    55/56

    no "curvature"F Cy y

    The hypotheses are:

    0

    1

    1

    1

    : 0

    : 0

    k

    ii

    i

    k

    ii

    i

    H

    H

    2

    Pure Quad

    ( )F C F C

    F C

    n n y ySS

    n n

    This sum of squares has a

    single degree of freedom

  • 7/29/2019 Ch 6 the 2 k Factorial Design

    56/56

    Central Composite Design