Diagrams Representing Relationships between … · Web viewCorrelations or covariances between...
Transcript of Diagrams Representing Relationships between … · Web viewCorrelations or covariances between...
Path Diagrams, EFA, CFA, and Amos(You should have read the EFA chapter in T&F and should read the CFA chapter.)Observed Variable
A variable whose values are observable.
Examples: IQ Test scores (Scores are directly observable), GREV, GREQ, GREA, UGPA, Minnesota Job Satisfaction Scale, Affective Commitment Scale, Gender, Questionnaire items.
Latent Variable
A variable, i.e., characteristic, presumed to exist, but whose values are NOT observable. A Factor in Factor Analysis literature. A characteristic of people that is not directly observable.
Intelligence, Depression, Job Satisfaction, Affective Commitment, Tendency to display affective state
No direct observation of values of latent variables is possible. Brain states? Brain chemistry?
Indicator
An observed variable whose values are assumed to be related to the values of a latent variable.
Reflective Indicator
An observed variable whose values are partially determined by, i.e., are influenced by or reflect, the values of a latent variable. For example, responses to Conscientiousness items are assumed to reflect a person’s Conscientiousness.
Formative Indicator
An observed variable whose values partially determine, i.e., cause or form, the values of a latent variable.
Exogenous Variable (Ex = Out)
A variable whose values originate from / are caused by influences outside the model, i.e., are not explained within the theory with which we’re working. That is, a variable whose variation we don’t attempt to explain or predict by whatever theory we’re working with. Causes of exogenous variable originate outside the model. Exogenous variables can be observed or latent.
Endogenous variable (En ~~ In)
A variable whose values are explained within the theory with which we’re working. We account for all variation in the values of endogenous variables using the constructs of whatever theory we’re working with. Causes of endogenous variables originate within the model.
CFA, Amos - 1 Printed on 5/8/2023
Basic EFA, CFA, SEM Path Analytic NotationObserved variables are symbolized by squares or rectangles.
Latent Variables are symbolized by Circles or ellipses.
Correlations or covariances between variables are represented by double-headed arrows.
"Causal" or "Predictive" or “Regression” relationships between variables are represented by single-headed arrows
CFA, Amos - 2 Printed on 5/8/2023
ObservedVariable
103 84121 76 . . . 97 81
106 78115 80. . . 93 83
101 90128 72 . . . 93 80
103 84121 76 . . . 97 81
ObservedVariable A
"Cor / Cov"Arrow
ObservedVariable B
LatentVariable A
LatentVariable B
"Cor / Cov"Arrow
106 78115 80. . . 93 83
104 79114 79. . . 92 81
"Causal"Arrow
ObservedVariable
LatentVariable
"Causal"Arrow
ObservedVariable
LatentVariable
LatentVariable
"Causal"Arrow
LatentVariable
ObservedVariable
ObservedVariable
"Causal"Arrow
Latent Variable
Values of individuals on latent variables are not observable, hence the dimmed text.
Exogenous Observed Variables
Exogenous variable connect to other variables in the model through either a “causal” arrow or a correlation
Exogenous Latent Variables
Exogenous latent variables also connect to other variables in the model through either a “causal” arrow or a correlation
Endogenous Observed Variables - Endogenous Latent Variable
Endogenous variables connect to other variables in the model by being on the “receiving” end of one or more “causal” arrows. Specifically, endogenous variables are typically represented as being “caused” by 1) other variables in the theory and 2) random error. Thus, 100% of the variation in every endogenous variable is accounted for by either other variables in the model or random error. This means that random error is an exogenous latent variable in SEM diagrams. Random error is a catch-all concept representing all “other” things that are affecting the endogenous variable.
Summary statistics associated with symbols
Our SEM program, Amos, prints means and variances above and to the right. Typically the mean and variance of latent variables are fixed at 0 and 1 respectively, although there are exceptions to this in advanced applications.
CFA, Amos - 3 Printed on 5/8/2023
"Causal"Arrow
ObservedVariable
ObservedVariable
"Causal"Arrow
LatentVariable Latent
Variable
ObservedVariable
"Causal"Arrow "Causal"
ArrowLatent
Variable
Randomerror
"Correlation"Arrow
"Correlation"Arrow
ObservedVariable
Mean, Variance
LatentVariable
Mean, Variance
B or
"Causal"Arrowr or Covariance
"Correlation"Arrow
Randomerror
Path Diagrams of Analyses We’ve Done Previously
Following is how some of the analyses we’ve performed previously would be represented using path diagrams.
1. Simple correlation between two observed variables.
2. Simple correlations between three observed variables.
3. Simple regression of an observed dependent variable onto one observed independent variable.
4. Multiple Regression of an observed dependent variable onto three observed independent variables.
CFA, Amos - 4 Printed on 5/8/2023
B or eP511GGRE-Q
rVA
rQArVQ
GRE-AGRE-QGRE-V
rVQ
GRE-QGRE-V
GRE-Q P511G eBQ or Q
GRE-V
UPGA
BV or V
BU or U
Note that the endogenous variable is caused in part by catch-all influences.
Note that the endogenous variable is caused in part by catch-all influences.
Path Analyses of ANOVAs
Since ANOVA is simply regression analysis, the representation of ANOVA in SEM is merely as a regression analysis. The key is to represent the differences between groups with group coding variables, just as we did in 513 and in the beginning of 595 . . .
1) Independent Groups t-testThe two groups are represented by a single, dichotomous observed group-coding variable. It is the independent variable in the regression analysis.
2) One Way ANOVAThe K groups are represented by K-1 group-coding variables created using one of the coding schemes (although I recommend contrast coding). They are the independent variables in the regression analysis. If contrast codes are used, the correlations between all the group coding variables are 0, so no arrows between them need be shown.
3) Factorial ANOVA.Each factor is represented by G-1 group-coding variables created using one of the coding schemes. The interaction(s) is/are represented by products of the group-coding variables representing the factors. Again, no correlations between coding variables need be shown if contrast codes are used.
CFA, Amos - 5 Printed on 5/8/2023
eDependentVariable
Dichotomous variable representing the two groups
eDependentVariable
. . . . .(K-1)th Group-coding contrast code variable.
2nd Group-coding contrast code variable.
1st Group-coding contrast code variable
eDependentVariable
Interaction
Interaction
Interaction
Interaction
2st Factor
2st Factor
1st Factor
1st Factor
Note: Contrast codes were used so Group-coding variables are uncorrelated.
Note: Contrast codes should be used to make sure the group-coding variables uncorrelated (assuming equal sample sizes.)
Path Diagrams representing Exploratory Factor Analysis1) Exploratory Factor Analysis solution with one factor.
The factor is represented by a latent variable with three or more observed indicators. (Three is the generally recommended minimum no. of indicators for a factor.)
Note that factors are exogenous. Indicators are endogenous. It’s a rule that all of the variance of endogenous variables must be accounted for in the diagram. Thus, each indicator must have an error or sometimes called residual latent variable to account for the variance in it not accounted for by the factor.
2) Exploratory Factor Analysis solution with two orthogonal factors.
For exploratory factor analysis, each variable is required to load on all factors. Of course, the hope is that the loadings will be substantial on only some of the factors and will be close to 0 on the others, but the loadings on all factors are estimated, even if they’re close to 0.
The loadings of items are conceptualized in two group. First , are the primary loadings - those connecting a factor to the indicators we believe it should be connected to.
The second group is secondary or cross loadings. These are the loadings that connect a factor to all the “other” variables – the variables that should not be connected to the factor even though they’re required to be estimated.
Let’s assume that Obs 1, 2, and 3 are thought to be primary indicators of F1 and 4,5,6 are thought to be the primary indicators of F2.
In the figure below, the primary loadings of each factors are in black with the cross loadings in red.
The lack of an arrow between the two factors means that they will be estimated as being orthogonal, i.e., independent
CFA, Amos - 6 Printed on 5/8/2023
F2
F1
e6
e5
e4
e3
e2
e1
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1
Obs 1
Obs 2
Obs 3
e1
e2
e3
F
Orthogonal factors represent uncorrelated aspects of behavior.
Note what is assumed in the above figure: There are two independent characteristics of people – F1 and F2. Each one influences responses to all six items, although it is hoped that F1 influences primarily the first 3 items and that F2 influences primarily the last 3 items.
If Obs 1 thru Obs 3 are one class of behavior and Obs 4 thru Obs 6 are a second class, then if the loadings “fit” the expected pattern – loadings of 1,2, and 3 on F1 are large, while those of 4, 5, and6 on F2 are large, with all cross loadings small,, this would be evidence for the existence of two independent dispositions – that represented by F1 and that represented by F2.
3) Exploratory Factor Analysis solution with two oblique factors.
Each factor is represented by a latent variable with three or more indicators. The obliqueness of the factors is represented by the fact that there IS an arrow connecting the factors.
Again, in exploratory factor analysis, all indicators load on all factors, even if the loadings are close to zero.
This solution is potentially as important as the orthogonal solution, although in general, I think that researchers are more interested in independent dispositions than they are in correlated dispositions.
But discovering why two dispositions are separate but still correlated is an important and potentially rewarding task.
CFA, Amos - 7 Printed on 5/8/2023
F2
F1
e9
e8
e7
e6
e5
e4
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1
Diagram of EFA model of NEO-FFI Big Five 60 item questionnaire.
(From Biderman, M. (2014). Against all odds: Bifactors in EFAs of Big Five Data. Part of symposium: S. McAbee & M. Biderman, Chairs. Theoretical and Practical Advances in Latent Variable Models of Personality. Conducted at the 29th annual conference of The Society for Industrial and Organizational Psychology; Honolulu, Hawaii, 2014.
Crossloadings are in red.
CFA, Amos - 8 Printed on 5/8/2023
Cross loadings are red’d.
It took me about 2d hours to draw this figure.
Path Diagrams vs the Table of Loadings.(Cross loadings are in red in both representations.)
Whew – there are tons of cross loadings, most of them near 0. Can’t they just be assumed to be zero? This kind of thinking leads to Confirmatory Factor Analysis Models.
CFA, Amos - 9 Printed on 5/8/2023
Pattern Matrixa
Factor1 2 3 4 5
ne1 -.025 -.001 -.056 -.067 .678ne2 .086 .002 .021 -.026 .268ne3 .126 -.023 .133 .229 .260ne4 .054 .014 .112 .184 .492ne5 -.115 -.045 -.069 -.269 .624ne6 .042 .206 -.200 .085 .563ne7 .194 -.168 .046 -.203 .166ne8 .342 -.127 .087 .299 .432ne9 .270 .046 .092 .419 .223ne10 -.017 -.138 -.105 .135 .327ne11 .209 -.270 .009 .102 .278ne12 .140 -.170 -.103 -.070 .483na1 -.100 -.193 .081 .518 .249na2 .347 -.153 -.076 .421 -.139na3 .091 -.123 -.050 .560 .125na4 -.139 .034 -.011 .504 -.099na5 .298 .073 .033 .335 .188na6 .335 .157 -.070 .353 .031na7 .019 -.177 .067 .231 .346na8 .053 .029 -.114 .543 .292na9 .163 .115 -.120 .319 -.211na10 -.057 -.123 .167 .594 .271na11 .028 .012 .055 .471 -.227na12 .027 -.210 -.075 .534 -.022nc1 .092 -.429 -.090 .183 .008nc2 .019 -.580 -.049 .055 -.086nc3 -.030 -.376 -.037 .011 -.140nc4 -.093 -.406 .052 .156 .028nc5 .026 -.716 -.025 -.156 .057nc6 .146 -.476 .100 .241 -.052nc7 -.154 -.694 -.070 -.121 .109nc8 .092 -.528 .019 -.017 .110nc9 .040 -.573 .044 -.050 .094nc10 .021 -.720 .016 -.103 .044nc11 .035 -.551 -.067 .148 .005nc12 .065 -.628 .035 -.011 .018ns1 .501 .072 .145 -.231 -.031ns2 .544 -.119 .132 -.044 .027ns3 .653 .037 -.026 -.069 -.118ns4 .664 .025 -.141 .006 .189ns5 .660 .082 -.031 .200 .047ns6 .677 -.053 -.104 .011 .046ns7 .658 .035 .027 -.069 .011ns8 .552 .068 -.078 .265 .012ns9 .724 -.093 .134 -.021 .027ns10 .662 -.041 -.068 -.009 .220ns11 .563 -.266 .035 .030 -.132ns12 .611 -.082 -.177 .075 -.003no1 -.146 .334 .267 -.003 -.103no2 .030 .366 .136 .061 .087no3 -.014 .044 .661 -.040 .046no4 .116 .037 .142 -.119 -.008no5 -.010 -.090 .731 .102 -.166no6 .029 .168 .217 .124 .207no7 -.064 .025 .207 .082 -.082no8 .024 .233 .240 -.101 -.197no9 -.008 -.012 .822 .049 -.123no10 .052 .135 .536 .026 -.033no11 -.026 -.111 .560 -.041 .110no12 .021 .131 .615 -.174 .090Extraction Method: Maximum Likelihood. Rotation Method: Oblimin with Kaiser Normalization.a. Rotation converged in 14 iterations.
Confirmatory vs Exploratory Factor AnalysisIn Exploratory Factor Analysis, as discussed above, the loading of every item on every factor is estimated. The analyst hopes that some of those loadings will be large and some will be small. An EFA two-orthogonal-factor model is represented by the following diagram.
As mentioned above there are arrows (loadings) connecting each variable to each factor. We have no hypotheses about the loading values – we’re exploring – so we estimate all loadings and let them lead us. Generally, EFA programs do not allow you to specify or fix loadings to pre-determined values.
In contrast to the exploration implicit in EFA, a factor analysis in which some loadings are fixed at specific values is called a Confirmatory Factor Analysis. The analysis is confirming one or more hypotheses about loadings, hypotheses representing by our fixing them at specific (usually 0) values.
Below is shown one common confirmatory analysis model – one in which the primary loadings are represented but the secondary, aka, cross loadings are set to 0. In the diagram, a loading or correlatin that has been set to 0 is simply left out of the diagram.
CFA, Amos - 10 Printed on 5/8/2023
F2
F1
e6
e5
e4
e3
e2
e1
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1
Exploratory
Confirmatory
E6
E5
E4
E3
E2
E1
F1
F2
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1
The Identification Problem Skip in 2017Consider the simple regression model . . .
Quantities which can be computed from the data . .
Mean of the X variable Variance of the X variableMean of the Y variable. Variance of the Y variable.
Correlation of Y with X
Quantities in the model .
Remember that in path diagrams, all the variance in every endogenous variable must be accounted for. For that reason, the path diagram includes a latent “Other factors” or “Error of measurement” or “Residual” variable, labeled “E” in the above diagram..
Mean of X Mean of EVariance of X Variance of E
Intercept of X->Y regressionSlope of X->Y regressionCorrelation of E with Y
Whoops! There are 5 quantities in the data but 7 in the model.
There are too few quantities in the data.
The model is underidentified. – not identified enough - there aren't enough quantities from the data to identify each model value.
CFA, Amos - 11 Printed on 5/8/2023
Mean, Variance
Mean, Variance
E
YXY = a + b*X
Corr(E,Y)
Note: Mean and variance of Y are not separately identified in the model because they are assumed to be completely determined by Y’s relationship to X and to E.
Dealing with underidentification . . . Skip in 2017
Solution 10) The mean of E is always assumed to be 0.1) Fix the variance of E to be 1.
So in this regression model, the path diagram will be
In this case, there are 5 quantities in the model that must be estimated – mean of X, variance of X, intercept of equation, slope of equation, and correlation of E with Y. There are also 5 quantities that can be estimated from the observed data.
The model is said to be “just identified” or “completely identified”. This means that every estimable quantity in the model corresponds in some way to one quantity obtained from the data.
Or,
Solution 20) The mean of E is always assumed to be 0.1) Fix covariance of E with Y at 1.
Underidentified models: Cannot be estimated.
Just identified models: Every model quantity is a function of some data quantity. But no parsimony.
Overidentified models: There are more data quantities than model quantities. It is said that you then have “degrees of freedom” in your model. This is good. Relationships are being explained by fewer model quantities than there are data quantities. This is parsimonious – what science is all about.
CFA, Amos - 12 Printed on 5/8/2023
Y = a + b*X
Y=a+b*X
X Y
E
Mean, Variance
0, 1
1
rEY
0, Variance
Mean, Variance
E
YX
Technical Issue: Identification in CFA models –
For mathematical reasons, we have to make a few assumptions about which values are to be estimated in CFA models. Here’s a typical CFA two-factor model.
Assumption 1. Insuring that the “Residuals” part of the model – that involving the “E”s – is identified.
We always assume all “E” means = 0.
Solution 1. Fix all “E” variances to 1.
or
Solution 2. Fix all E O loadings to 1 and estimate variances of “E”s.
CFA, Amos - 13 Printed on 5/8/2023
0, Variance
0, Variance
0, Variance
0, Variance
0, Variance
0, Variance
0,1
0,1
0,1
0,1
0,1
0,1E6
E5
E4
E3
E2
E1
F1
F2
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1
E6
E5
E4
E3
E2
E1
F1
F2
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1
1
1
1
1
1
1
I recommend
this.
Assumption B. Insuring that the Factors part of the CFA is identified
Solution 1. Fix one of the loadings for each factor at 1 and estimate all factor variances
Or
Solution 2. Fix the variance of each factor at 1 and estimate all factor loadings.
The main point for us – what the factor analysis gives us.
The above models tell us that the variation in 6 observed variables – Obs1, Obs2, Obs3, Obs4, Obs5, Obs5 - is due to variation in just two internal characteristics – F1 and F2.
So we have explained why there is variation in the observed variables – because of variation in F1 and F2.
We have also explained why the variation in Obs1 to Obs3 is unrelated to the variation in Obs4 to Obs6 – because F1 and F2 are uncorrelated.
CFA, Amos - 14 Printed on 5/8/2023
1
1
E6
E5
E4
E3
E2
E1
F1
F2
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1 I recommend this, although some
examples below will use the above
method.
1
1
E6
E5
E4
E3
E2
E1
F1
F2
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1The item whose loading is fixed is called the reference item.
Examples
1. Fixing all variances.
2. Fixing residual loadings but Factor variances
3. Fixing residual loadings and factor loadings.
CFA, Amos - 15 Printed on 5/8/2023
1
1
1
1
11
1
1
1
1
1
1
1
1
E6
E5
E4
E3
E2
E1
F1
F2
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1
1
1
1
1
1
1
1
1
E6
E5
E4
E3
E2
E1
F1
F2
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 1
E6
E5
E4
E3
E2
E1
F1
F2
Obs 6
Obs 5
Obs 4
Obs 3
Obs 2
Obs 11
1
My favorite.
Factors vs Summated Scales
Recall that X = T + E as was discussed briefly in PSY 5130
. An observed score is comprised of both the True score and error of measurement.
This means that X – the variable we’re measuring – is partly error. There will be some variation in X that has nothing to do with our theory - it’s just random variation, clouding our results.
The basic argument for using latent variables is that the relationships between latent variables are closer to the “true score” relationships than can be found in any existing analysis. They eliminate the noise.
If we compute the average of 10 Conscientiousness items to form a scale score, for example, that scale score includes the errors associated with each of the items averaged.
Here’s a path diagram representing a scale score . . .Assume the items are Conscientiousness items.
CFA, Amos - 16 Printed on 5/8/2023
Item X7e7
e8
Item X3
Item X4
Item X5
Item X6
e3
e4
e5
e6
e1
e2
Scale score C+Junk
ee
ee2
e4
Item X8
Item X9
Item X10
e9
Item X1
Item X2
e10
A scale score contains everything – pure content (the Ts) plus error of measurement.
So any correlation involving a scale score is contaminated by the error of measurment contained in the scale score.e ee3
e1q
But if we create a C factor, the factor represents only the C present across all the items, and none of the error that also contaminates the item.
The errors affecting the items are treated separately, rather than being lumped into the scale score.
The result is that the latent variable, C, in the diagram below is a purer estimate of conscientiousness than would be a scale score. Its correlations with other variables will not be contaminated by errors of measurement.
From Schmidt, F. (2011). A theory of sex differences in technical aptitude and some supporting evidence. Perspectives on Psychological Science, 6, 560-573.“Prediction 3 was examined at both the observed and the construct levels. That is, both observed score and true score regressions were examined. However, from the point of view of theory testing, the true score regressions provide a better test of the theoretical predictions, because they depict processes operating at the level of the actual constructs of interest, independent of the distortions created by measurement error.”
CFA, Amos - 17 Printed on 5/8/2023
C
Item 7
Item 8
Item 9
Item 10
e
e
e
e
Item 3
Item 4
Item 5
Item 6
e
e
e
e
Item 1
Item 2
e
e
GPA
So, what do we get from factor analyses?
1. Reduction in dimensionality
As discussed in the first lecture on factor analysis – the first purpose is to reduce the complexity of the problem, allowing us to focus on a few factors rather than multiple observed variables.
2. Uncontaminated measurement
Factors estimated from appropriate models give us estimates of characteristics will be unaffected by measurement error.
The way to access a factor’s values for individual persons is is through factor scores. Right now, only Mplus makes that easy to do. I expect more programs to provide such capabilities soon.
3. Purer relationships
As stated above, relationships among latent variables are free from the “noise” of errors of measurement, so if two characteristics are related, their factor correlations will be farther from 0 than will be their scale correlations.
That is – assess the relationship with scale scores. Assess the same relationship using a latent variable model. The r from the latent variable analysis will usually be larger than the r from the analysis involving scale scores.
4. Sanity and advancement of the science
The sometimes random-seeming mish-mash of correlations will be a little less random-seeming when the effects of errors of measurement on relationships and on measurement of characteristics are taken out.
CFA, Amos - 18 Printed on 5/8/2023
Example illustrating the differences between scale score correlations and latent variable correlations.
From Biderman, M. D., McAbee, S. T., Chen, Z., & Nguyen, N. T. (2015). Assessing the Evaluative Content of Personality Questionnaires Using Bifactor Models. MS submitted for publication.
The correlations are those between three different measures of affect. For each measure, the scale was represented by a single summated score on the left and by a factor on the right.
Table 3Correlations of Measures of Affect
Summated Scales Latent Variables
RSE PANAS Dep. RSE PANAS Dep.Measured along with the NEO-FFI-31
NEO-FFI-3
RSE
PANAS .81 1.00
Depression - -.76 -.79 -.84 -1.00
Measured along with HEXACO-PI-R2
HEXACO-PI-R
RSE .
PANAS . .74 . .80
Depression -.78 -.73 -.87 -.82
Note. 1N=317, 2N=788. Dep. = depression. Values on diagonals are reliabilities of Scales and factor determinacies of latent variables.
As an aside, note that even the scale scores were very highly correlated, raising the issue of whether the three scales might all be measuring the same affective characteristic.
CFA, Amos - 19 Printed on 5/8/2023
Programming with diagrams: Introduction to Amos
Amos is an add-on program to SPSS that performs confirmatory factor analysis and structural equation modeling.
It is designed to emphasize a visual interface and has been written so that virtually all analyses can be performed by drawing path diagrams.
It also contains a text-based programming language for those who wish to write programs in the command language.
The Amos drawing toolkit with functions of the most frequently used tools.
CFA, Amos - 20 Printed on 5/8/2023
Observed variable tool
Tool to draw latent variables
Tool to draw correlation arrows
Tool to draw regression arrows
Tool to select a single object
Tool to put text on the diagram
Tool to select all objects in diagram
Tool to deselect all objects in diagram
Tool to copy an object Tool to erase an object
Tool to move an object
Tool to tell Amos to run the diagram.
Other programsLISRELEQSAmosMplus – the best
LAVAAN
Creating an Amos analysis
1. Open Amos Graphics.1b. File -> New2. File -> Data Files . . . (Because you have to connect the path diagram to a data file.)
3. Specify the name of the file that contains the raw or summary data.a. Click on the [File Name] button.b. Navigate to the file and double-click on it.c. Click on the [OK] button.
In this example, I opened a file called IncentiveData080707.sav
4. Draw the desired path diagram using the appropriate drawing tools.
The example below is a simple correlation analysis.
6. Save the model. File -> Save As...7. To run Amos, click on the
button.
8. Click on to see output.
CFA, Amos - 21 Printed on 5/8/2023
5. Name the variables by right-clicking on each object. And choosing “Object Properties . . .”
Amos Details
For most of the analyses you’ll perform using Amos, you should get in the habit of doing the following . . .
View -> Analysis Properties -> Estimation
Check “Estimate means and intercepts”
View -> Analysis Properties -> Output
Check “Standardized estimates”Check “Squared multiple correlations”
CFA, Amos - 22 Printed on 5/8/2023
Doing old things in a new way: Analyses we’ve done before, now performed using Amos Skip in 2017The data used for this example are the valdatnm data in the htlm2\5510\datafiles folder. We’ll simply look at the output here. Later, we’ll focus on the menu sequences needed to get this output.
a. SPSS analysis of the correlation of FORMULA with P511G
Correlations
b. Amos Input Path Diagram - Input Parameter Values
(Note, I told Amos to estimate means for this analysis.)
c. Amos Output Path Diagram - Unstandardized (Raw) coefficients
c. Amos Path Diagram - Standardized coefficients
CFA, Amos - 23 Printed on 5/8/2023
All variables are exogenous.
The mean and variance of Formula.
The mean and variance of p511g.
The covariance of p511g and Formula.
The correlation of p511g and Formula.
Means and variances of standardized variables are not displayed, since they are 0 and 1 respectively.
Simple Regression Analysis: SPSS and Amos 2017 skipThe data used here are the VALDAT data.
a. SPSS Version 10 output
GET FILE='E:\MdbT\P595\Amos\valdatnm.sav'..REGRESSION /MISSING LISTWISE /STATISTICS COEFF OUTS R ANOVA /CRITERIA=PIN(.05) POUT(.10) /NOORIGIN /DEPENDENT p511g /METHOD=ENTER formula .
RegressionVariables Entered/Removedb
FORMULAa . EnterModel1
VariablesEntered
VariablesRemoved Method
All requested variables entered.a.
Dependent Variable: P511Gb.
Model Summary
.480a .230 .220 4.725E-02Model1
R R SquareAdjus tedR Square
Std. Error ofthe Es timate
Predic tors : (Cons tant), FORMULAa.
ANOVAb
5. 005E-02 1 5. 005E-02 22. 420 . 000a
. 167 75 2. 233E-03
. 217 76
RegressionResidualTot al
Model1
Sum ofSquares df Mean Square F Sig.
Predict ors: (Const ant ) , FORMULAa.
Dependent Var iable: P511Gb.
Coeffi ci entsa
. 496 .078 6. 361 .0003. 004E-04 .000 .480 4. 735 .000
(Const ant )FORMULA
Model1
B St d. Error
Unst andardizedCoef f icients
Beta
St andardized
Coef f icients
t Sig.
Dependent Variable: P511Ga.
CFA, Amos - 24 Printed on 5/8/2023
b. Amos Input Path Diagram - Input parameter values
c. Amos Output Path Diagram - Unstandardized (Raw) coefficients; Means not estimated
CFA, Amos - 25 Printed on 5/8/2023
The model is underidentified unless you fix the value of one parameter. Fix either the variance of the latent error variable to 1 or the regression weight to 1. Here, the variance has been fixed.
Note that the fixed parameter values were not changed.
For what it's worth, the estimated unstandardized (raw score) relationship of p511g to the “other factors” latent variable.
The estimated unstandardized (raw score) relationship of p511g .to Formula - the slope, to 2 decimal places.
Variance of formula.
d. Amos Output Path Diagram - Standardized coefficients
(View/Set -> Analysis Properties -> Output to get Amos to print Standardized estimates what a pain!!)
Note that .482 + .882 = 1. All of variance of p511g has been accounted for.
We say that the formula and the error partition the total variance of p511g.
CFA, Amos - 26 Printed on 5/8/2023
Correlation of p511g with latent “other factors”..= sqrt(1-r2)=sqrt(1-.482) = sqrt(1-.23) = sqrt(.77)=.88
Squared multiple correlation of dependent variable (p511g) with predictor (only formula in this example).
Correlation of p511g with formula.
Two IV Regression Example - SPSS and Amos 2017 skipThe data here are the VALDATnm data. UGPA and GREQ are predictors of P511G.
a. SPSS output.GET FILE='G:\MdbT\P595\P595AL09-Amos\valdatnm.sav'.DATASET NAME DataSet1 WINDOW=FRONT.REGRESSION /DESCRIPTIVES MEAN STDDEV CORR SIG N /MISSING LISTWISE /STATISTICS COEFF OUTS R ANOVA /CRITERIA=PIN(.05) POUT(.10) /NOORIGIN /DEPENDENT p511g /METHOD=ENTER ugpa greq .
Regression[DataSet1] G:\MdbT\P595\P595AL09-Amos\valdatnm.sav
Correlations
1.000 .225 .322
.225 1.000 -.262
.322 -.262 1.000
. .025 .002
.025 . .011
.002 .011 .
77 77 77
77 77 77
77 77 77
p511g
ugpa
greq
p511g
ugpa
greq
p511g
ugpa
greq
PearsonCorrelation
Sig. (1-tailed)
N
p511g ugpa greq
Variables Entered/Removed b
greq, ugpa a . EnterModel1
Variables EnteredVariablesRemoved Method
All requested variables entered.a.
Dependent Variable: p511gb.
Model Summary
.455a .207 .185 .04828Model1
R R Square Adjusted R SquareStd. Error ofthe Estimate
Predictors: (Constant), greq, ugpaa.
Coefficients a
.571 .069 8.228 .000
.048 .016 .332 3.098 .003
.000 .000 .410 3.817 .000
(Constant)
ugpa
greq
Model1
B Std. Error
Unstandardized Coefficients
Beta
StandardizedCoefficients
t Sig.
Dependent Variable: p511ga.
CFA, Amos - 27 Printed on 5/8/2023
b. Amos Input Path Diagram - Input parameters.
c. Amos Output Path Diagram - Unstandardized (Raw) coefficientsI forgot to check “Estimate means and intercepts, so no means are printed.)
CFA, Amos - 28 Printed on 5/8/2023
Raw Regression coefficient relating p511g to residual effects.
Raw partial regression coefficient relating p511g to ugpa
The variance of the (unobserved) error latent variable must be specified at 1.
Note that if the IVs are correlated, you must specify that they are correlated. Otherwise, Amos will perform the analysis assuming they're uncorrelated.
Covariance of ugpa and greq.
Variance of ugpa
Raw partial regression coefficient relating p511g to GREQ to 2 decimal places.
d. Amos Output Path Diagram - Standardized coefficients.
Note that .332 + .412 + .892 = 1.07 > 1.0. This is because r2s partition variance only when variables are uncorrelated.e. Amos Text Output - Details of input and minimization (Early version of Amos without p values)Chi-square = 0.000Degrees of freedom = 0Probability level cannot be computedMaximum Likelihood Estimates----------------------------Regression Weights: Estimate S.E. C.R. Label ------------------- -------- ------- ------- -------
p511g <----- ugpa 0.048 0.015 3.140 p511g <---- error 0.047 0.004 12.329 p511g <----- greq 0.000 0.000 3.869
Standardized Regression Weights: Estimate-------------------------------- --------
p511g <----- ugpa 0.332 p511g <---- error 0.891 p511g <----- greq 0.410
Covariances: Estimate S.E. C.R. Label ------------ -------- ------- ------- -------
ugpa <-----> greq -8.537 3.861 -2.211
Correlations: Estimate------------- --------
ugpa <-----> greq -0.262
Variances: Estimate S.E. C.R. Label ---------- -------- ------- ------- -------
error 1.000 ugpa 0.134 0.022 6.164 greq 7897.622 1281.163 6.164
Squared Multiple Correlations: Estimate------------------------------ --------
p511g 0.207
CFA, Amos - 29 Printed on 5/8/2023
Standardized partial regression coefficients , sometimes called betas..
Correlation of ugpa and greq.
Multiple R2.
SQRT(1-R2)=sqrt(1-.21) = sqrt(.79)=.89
Note – No overall test of significance of R2.This test is available in the ANOVA box in SPSS.
Oneway Analysis of Variance Example - SPSS and Amos 2017 skip
The data for this example follow. They're used to introduce the 595 students to contrast coding. The dependent variable is Job Satisfaction (JS). The research factor is Job, with three levels. It is contrast coded by CC1 and CC2.
The data for this example are in ‘MdbT\P595\Amos\ OnewayegData.sav’
ID JS JOB CC1 CC2
1 6 1 .667 .000 2 7 1 .667 .000 3 8 1 .667 .000 4 11 1 .667 .000 5 9 1 .667 .000 6 7 1 .667 .000 7 7 1 .667 .000 8 5 2 -.333 .500 9 7 2 -.333 .500 10 8 2 -.333 .500 11 9 2 -.333 .500 12 10 2 -.333 .500 13 8 2 -.333 .500 14 9 2 -.333 .500 15 4 3 -.333 -.500 16 3 3 -.333 -.500 17 6 3 -.333 -.500 18 5 3 -.333 -.500 19 7 3 -.333 -.500 20 8 3 -.333 -.500 21 2 3 -.333 -.500
a. SPSS Oneway output.
OnewayANO VA
JS
40. 095 2 20. 048 5. 930 . 011
60. 857 18 3. 381
100. 952 20
Bet weenG r oups
Wit hin G r oups
Tot al
Sum ofSquar es df M ean Squar e F Sig.
CFA, Amos - 30 Printed on 5/8/2023
The rule for forming a contrast variable between two sets of groups is
1st Value = No. of groups in 2nd set / Total no. of groups.
2nd Value= - No. of groups in 1st set / Total no. of groups.
3rd Value = 0 for all groups to be excluded.
So, 1st Value of CC1 = 2 / 3 = .667.
2nd Value of CC1 = - 1 / 3
1st Value of CC2 = 1 / 2 = .5
2nd Value of CC2 = -1 / 2 = -..5
3rd Value of CC2 = 0 to exclude Job 1.
b. SPSS Regression Output.
regression variables = js cc1 cc2 /dependent = js /enter.
RegressionVariables Entered/Removed b
CC2, CC1 a . EnterModel1
VariablesEntered
VariablesRemoved Method
All reques ted variables entered.a.
Dependent Variable: JSb.
Mo d e l Su m m a ry
.6 3 0 a .3 9 7 .3 3 0 1 .8 3 8 7Mo d e l1
R R Sq u a reAd j u s te d R
Sq u a reStd . Erro r o fth e Es ti ma te
Pre d i c to rs : (Co n s ta n t ), CC2 , CC1a .
ANO VA b
40. 095 2 20. 048 5. 930 . 011a
60. 857 18 3. 381
100. 952 20
Regr ession
Residual
Tot al
M odel1
Sum ofSquar es df M ean Squar e F Sig.
Pr edict or s: ( Const ant ) , CC2, CC1a.
Dependent Var iable: JSb.
Coef f i ci ent sa
6. 952 . 401 17. 326 . 000
1. 357 . 851 . 292 1. 594 . 128
3. 000 . 983 . 559 3. 052 . 007
( Const ant )
CC1
CC2
M odel1
B St d. Er r or
Unst andar dized Coef f icient s
Bet a
St andar dizedCoef f icient s
t Sig.
Dependent Var iable: JSa.
CFA, Amos - 31 Printed on 5/8/2023
c. Amos Input Path Diagram.
d. Amos Output Path Diagram - Unstandardized (Raw) Coefficients
e. Amos Output Path Diagram - Standardized Coefficients
CFA, Amos - 32 Printed on 5/8/2023
This was prepared using Amos 3.6. I chose the "Estimate means" option. This was not required, but it caused means to be displayed.
Mean and variance.
Note that .292 + .562 + .782 = 1.01 ~~ 1.r2s partition variance since the variables are all independent.
Intercept
Multiple R2
Note that the correlation between group coding variables must be estimated. It's zero here because they're contrast codes, but estimate it anyway.
f. Amos Text Output – ResultsView -> Text Output
CFA, Amos - 33 Printed on 5/8/2023
Results continued . . .
CFA, Amos - 34 Printed on 5/8/2023
Note that AMOS does not provide a test of the null hypothesis that in the population, the multiple R = 0. This test is provided in the ANOVA box in SPSS.