Ch 6 the 2 k Factorial Design
Transcript of Ch 6 the 2 k Factorial Design
-
7/29/2019 Ch 6 the 2 k Factorial Design
1/56
Design and
Analysis ofExperiments
Chapter 6: The 2
k
FactorialDesign
-
7/29/2019 Ch 6 the 2 k Factorial Design
2/56
Introduction
Special case of the general factorial design;k
factors, allat two levels
A complete replicate of such a design requires 2 2
2 = 2kobservations and is called a2k Factorial Design
The two levels are usually called low and high (they couldbe either quantitative or qualitative)
Very widely used in industrial experimentation (factor
screening experiments)
we assume that (1) the factors are fixed,
(2) the designs are completely randomized,
(3) the usual normality assumptions are satisfied.
(4) the response is approximately linear
-
7/29/2019 Ch 6 the 2 k Factorial Design
3/56
The Simplest Case: The 22
- and + denote the low and
high levels of a factor,respectively
Low and high are arbitrary
terms
Geometrically, the four runs
form the corners of a
square
Factors can be quantitative
or qualitative, although their
treatment in the final model
will be different
-
7/29/2019 Ch 6 the 2 k Factorial Design
4/56
Analysis Procedure for a
Factorial Design
Estimate factoreffects
FormulatemodelWith replication, use full model
With an unreplicated design, use normal probabilityplots
Statisticaltesting(ANOVA)
Refinethe model Analyzeresiduals(graphical)
Interpretresults
-
7/29/2019 Ch 6 the 2 k Factorial Design
5/56
Estimation of Factor Effects
a:A at the high level and B
at the low level
b :A at the low level and B
at the high level,
abrepresents both factors
at the high level
(1) both factors at the low
level.
1
2
1
2
1
2
(1)2 2
[ (1)]
(1)
2 2
[ (1)]
(1)
2 2
[ (1) ]
A A
n
B B
n
n
A y y
ab a bn n
ab a b
B y y
ab b a
n n
ab b a
ab a b
AB n n
ab a b
The effect estimates are: A
= 8.33, B = -5.00, AB = 1.67
-
7/29/2019 Ch 6 the 2 k Factorial Design
6/56
Statistical Testing - ANOVA
Examine the magnitude and direction of thefactor effects
ANOVA can generally be used to confirm this
interpretation. Total effects of A, B and AB are estimated in the
form of contrasts for example:
ContrastA = ab + a - b - (1)
In the standard order:
-
7/29/2019 Ch 6 the 2 k Factorial Design
7/56
Based on the P-values, we conclude that the main effects
are statistically significant and that there is no interaction
between these factors.
-
7/29/2019 Ch 6 the 2 k Factorial Design
8/56
The Regression Model It is easy to express the results of the experiment in terms
of a regression model.
Where X1 and X2 are coded variables for reactant
concentration and amount of catalyst respectively.
The relationship between the natural variables and coded
variables:
The fitted regression model is:
-
7/29/2019 Ch 6 the 2 k Factorial Design
9/56
The Response Surface
The regression model can be used to generateresponse surface plots.
Substituting the coded variables with the normal
variables we have:
we use a fitted surface such as this to find a
direction of potential improvement for a process.
-
7/29/2019 Ch 6 the 2 k Factorial Design
10/56
Because the model is first-order (that is, it containsonly the main effects), the fitted response surfaceis a plane.
-
7/29/2019 Ch 6 the 2 k Factorial Design
11/56
Residuals and Model Adequacy
The residuals are the differences between the
observed and fitted values of y. For example,
when the reactant concentration is at the low
level (X1
= -1) and the catalyst is at the low level(X2 = -1), the predicted yield is:
-
7/29/2019 Ch 6 the 2 k Factorial Design
12/56
Residuals and Model Adequacy
-
7/29/2019 Ch 6 the 2 k Factorial Design
13/56
The 23 Factorial Design
three factors, A, B, andC, each at two levelsand the eight treatmentcombinations can bedisplayed geometrically
as a cube.
-
7/29/2019 Ch 6 the 2 k Factorial Design
14/56
Effects in The 23 Factorial Design
-
7/29/2019 Ch 6 the 2 k Factorial Design
15/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
16/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
17/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
18/56
Properties of the Table
Except for columnI, every column has an equal number of+ and signs
The sum of the product of signs in any two columns is zero
Multiplying any column by Ileaves that column unchanged(identity element)
The product of any two columns yields a column in thetable:
Orthogonal design Orthogonality is an important property shared by all
factorial designs
2
A B AB
AB BC AB C AC
-
7/29/2019 Ch 6 the 2 k Factorial Design
19/56
Statistical analysis (ANOVA)
Sums of squares for the effects are easily computed. Inthe 23 design with n replicates, the sum of squares for
any effect is:
-
7/29/2019 Ch 6 the 2 k Factorial Design
20/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
21/56
The Regression Model and Response Surface
-
7/29/2019 Ch 6 the 2 k Factorial Design
22/56
Unreplicated 2kFactorial Designs
2k
factorial designs with one observation foreach experimental run
An unreplicated 2kfactorial design is also
sometimes called a single replicate
These designs are very widely used
Lack of replication causes potential problems instatistical testing
Replication admits an estimate of pure error
With no replication, fitting the full model results in zero
degrees of freedom for error (Modeling the noise)
chance of unusual response observations
-
7/29/2019 Ch 6 the 2 k Factorial Design
23/56
Spacing of Factor Levels in the
Unreplicated 2kFactorial Designs
If the factors are spaced too closely, it increases the chances
that the noise will overwhelm the signal in the data
More aggressive spacing is usually best
-
7/29/2019 Ch 6 the 2 k Factorial Design
24/56
solutions to this problem
Sparsity of effects principle: Pooling high-orderinteractions to estimate error.
Normal probability plottingof effects (Daniels, 1959)
The effects that are negligible are normally distributed, with mean
zero and variance and will tend to fall along a straight line on this
plot
significant effects will have nonzero means and will not lie along
the straight line. Thus the preliminary model will be
Half-normal plot
This is a plot of the absolute value of the effect estimates against
their cumulative normal probabilities.
-
7/29/2019 Ch 6 the 2 k Factorial Design
25/56
A Single Replicate of the 24 Design
Investigation the effects of four factors on thefiltration rate of a resin
The factors are A = temperature, B= pressure, C= mole
ratio, D= stirring rate
-
7/29/2019 Ch 6 the 2 k Factorial Design
26/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
27/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
28/56
Estimates of the Effects
-
7/29/2019 Ch 6 the 2 k Factorial Design
29/56
The Half-Normal Probability Plot of
Effects
-
7/29/2019 Ch 6 the 2 k Factorial Design
30/56
Design Projection: ANOVA Summary for the
Model as a 23 in Factors A, C, and D
-
7/29/2019 Ch 6 the 2 k Factorial Design
31/56
The Regression Model
The coded variables x1, x3, x4 take on values between -
1 and + 1. The predicted filtration rate at run (1) is
-
7/29/2019 Ch 6 the 2 k Factorial Design
32/56
Model Residuals are Satisfactory
-
7/29/2019 Ch 6 the 2 k Factorial Design
33/56
Model Interpretation Main Effects and
Interactions
-
7/29/2019 Ch 6 the 2 k Factorial Design
34/56
Model Interpretation Response
Surface Plots
With concentration at either the low or high level, high
temperature and high stirring rate results in high filtration rates
-
7/29/2019 Ch 6 the 2 k Factorial Design
35/56
Other Methods for Analyzing
Unreplicated Factorials
Lenth's method:The basis of Lenth's method is
to estimate the variance of a contrast from the
smallest (in absolute value) contrast estimates.
"pseudo standard error"
margin of error
simultaneous margin of error
For Filtration Example
-
7/29/2019 Ch 6 the 2 k Factorial Design
36/56
Adjusted multipliers for Lenths method
the original method makes too many type I errors,especially for small designs (few contrasts)
To address the problem adjusted multipliers have been
suggested:
Lenths method is a nice supplement to the normal
probability plot of effects
-
7/29/2019 Ch 6 the 2 k Factorial Design
37/56
Conditional inference chart
The purpose of the graph is to help the experimenter injudging significant effects.
In unreplicated designs, there is no internal estimate of
variance
conditional inference chart is designed to help theexperimenter evaluate effect magnitude for a range of
standard deviation values.
-
7/29/2019 Ch 6 the 2 k Factorial Design
38/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
39/56
The Drilling Experiment:Data
Transformation in a Factorial Design
A = drill load,B = flow, C= speed,D = type of mud,
y = advance rate of the drill
-
7/29/2019 Ch 6 the 2 k Factorial Design
40/56
Normal Probability Plot of Effects
The Drilling Experiment
-
7/29/2019 Ch 6 the 2 k Factorial Design
41/56
Residual Plots
-
7/29/2019 Ch 6 the 2 k Factorial Design
42/56
The residual plots indicate that there are problems withtheequality of varianceassumption
The usual approach to this problem is to employ atransformationon the response
Power familytransformations are widely used
Transformations are typically performed to
Stabilize variance Induce at least approximate normality
Simplify the model
*y y
-
7/29/2019 Ch 6 the 2 k Factorial Design
43/56
Effect Estimates Following the
Log Transformation
Three main effects arelarge
No indication of large
interaction effects
What happened to theinteractions?expressing the data in the
correct metric has
simplified its structure to
the point that the twointeractions are no longer
required in the
explanatory model.
-
7/29/2019 Ch 6 the 2 k Factorial Design
44/56
Residuals
-
7/29/2019 Ch 6 the 2 k Factorial Design
45/56
Location and Dispersion Effects in an
Unreplicated Factorial
-
7/29/2019 Ch 6 the 2 k Factorial Design
46/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
47/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
48/56
Duplicate Measurements on the
Response The four design factors are temperature (A), time(B), pressure (C), and gas flow (D).
-
7/29/2019 Ch 6 the 2 k Factorial Design
49/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
50/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
51/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
52/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
53/56
Addition of Center Points
to a 2kDesigns
Runs at the center provide an estimate of
error and allow the experimenter to
distinguish between two possible models:
0
1 1
20
1 1 1
First-order model (interaction)
Second-order model
k k k
i i ij i j
i i j i
k k k k
i i ij i j ii i
i i j i i
y x x x
y x x x x
-
7/29/2019 Ch 6 the 2 k Factorial Design
54/56
-
7/29/2019 Ch 6 the 2 k Factorial Design
55/56
no "curvature"F Cy y
The hypotheses are:
0
1
1
1
: 0
: 0
k
ii
i
k
ii
i
H
H
2
Pure Quad
( )F C F C
F C
n n y ySS
n n
This sum of squares has a
single degree of freedom
-
7/29/2019 Ch 6 the 2 k Factorial Design
56/56
Central Composite Design