FACTOR A TEST ANDcollectionscanada.gc.ca/obj/s4/f2/dsk2/ftp01/MQ28585.pdf · 2005. 2. 12. ·...

SHIFTED FACTOR ANALYSIS: A TEST OF MODELS AND ALGORlTHMS

Sungjin Hong Department of Psychology

Submitted in partial filfiliment of the requirements for the degree of

Master of Arts

Faculty of Graduate Studies The University of Western Ontario

London, Ontario Septernber 1997

O Sungjin Hong 1997

National Library 1*1 of Canada Bibliothèque nationale du Canada

Acquisitions and Acquisitions et Bibliographie Services services bibliographiques

395 Wellington Street 395. me Wellington OttawaON K1AON4 OttawaON K 1 A W Canada Canada

Yourlm V o n e r e Y ~

Our fi& Notre retereme

The author has granted a non- L'auteur a accordé une licence non exclusive licence allowing the exclusive permettant à la National Library of Canada to Bibliothèque nationale du Canada de reproduce, loan, distribute or sell reproduire, prêter, distribuer ou copies of this thesis in microform, vendre des copies de cette thèse sous paper or electronic formats. la forme de rnicrofiche/film, de

reproduction sur papier ou sur format électronique.

The author retains ownership of the L'auteur conserve la propriété du copyright in ths thesis. Neither the droit d'auteur qui protège cette thèse. thesis nor substantial extracts fkom it Ni la thèse ni des extraits substantiels may be printed or otherwise de celle-ci ne doivent être imprimés reproduced without the author's ou autrement reproduits sans son permission. autorisation.

ABSTRACT

The position shifi of the sequential facton in some sequential data can violate the

standard factor model's requirement of proportionality of the effect of factor weights.

Harshrnan has proposed a Shifted Factor model and analysis procedure to resolve this

problern. It would unshift the latent factors to recover the proportionality of factor effects.

He has also conjectured that the shift adjustment of the factors would provide an

additional advantage: the factor solution given by fitting the Shifted Factor model may

not be rotationally indeterminate. This thesis tests the feasibility of the "unshifiing" idea

by developing and testing cornputer programs that estimate shifi values in addition to

standard factor loadings and then applies these shifls to factors when predicting data.

Two types of programs are tested: Quasi Alternating Least Squares (QALS) and

nonlinear optimization (OPT). Both types of programs are implemented for two-way and

three-way sequentially organized data. Error-free and fallible synthetic data are created

and then analyzed to test the ability of the programs to recover latent factors and shifis,

and to test the hypothesized uniqueness property of the two-way Shified Factor model.

Both QALS and OPT have perfectly (fi-orn the error-free synthetic data) and successfully

( from the fallible synthetic data) recovered factor loadings and the shifts. The

hypothesized uniqueness of the Shifted Factor solution has been numerically proved by

showing many perfect solutions from the error-fiee data and many matching solutions

From the fallible data, starting fiom a few different random positions. Brain evoked

potential time series data are analyzed as a "real data" application of the model. The real

data analysis was not successful perhaps partly because not only shifts but shape changes

of factors seemed to be present. Provision for the shape changes (e.g., width variation in

latent peaks) is a possible fûture extension of this method. In the meantime, other kinds

of reai data (EEG spectra, chernical spectra, chromatographie series, etc) should be

explained by the Shifted Factor mode1 because they are likely to have shifts without

widthhhape changes.

Keywords: Factor analysis, Principal component analysis, P W A C , Uniqueness, Time

series, Lag.

ACKOWLEDGEMENTS

I would like to thank my advisor, Dr. Richard Harshman, for his deep insight and

cntical comments throughout this study. Without his thoughtful guide and w a m help,

this thesis would have not been finished. 1 would also thank Dr. Robert Gardner, Dr. Sam

Paunonen, and Dr. Stan Leung for their patience in understanding many arnbiguous

expressions in the cirafi of this thesis, and their critical suggestions. ln particular, it was

gratefully appreciated that Dr. Stan Leung kindly provided his data for this study and

taught me many basics necessary for analyzing the data.

TABLE OF CONTENTS

Page

CERTITICATE OF EXAMINATION ................................................ ABSTRACT .............................................................................. ACKNOWLEDGEMENT ............................................................. TABLE OF CONTENTS ............................................................... LIST OF TABLES ..................................................................... LIST OF FIGURES ..................................................................... LIST OF APPENDICES ...................................... .. .......................

1 . INTRODUCTION

. 2 STATEMENT OF PROBLEM ................................................... ........................... 2.1. Proportionality of the Standard Factor Model ........................... 2.2. An Example of Sequentially Organized Data

............ 2.3. Violation of the Proportionality in Some Sequential Data

3 . SHIFTED FACTOR MODELS ...................................................... 3.1. Two-way Shified Factor Mode1 ............................................. 3.2. Three-way Shified Factor Models ........................................... 3.3. Uniqueness in the Two-way Shifted Factor Mode1 .....................

4 . ALGORITHM TESTTNG AND DEVELOPMENT ............................... 4.1. Quasi-ALS .....................................................................

............ 4.1.1. Altemating Estimation for the Shifted Factor Models .................................... 4.1.2. Brute Force Estimation of Shifis

............ 4.1.3. Polishing the Sequential Mode Estimation Procedure 4.1.4. Fractional Line Search for the Shift Estimation .................. 4.1 .5 . "Super Iteration" of Recurrent BF and FLS .....................

.................................... 4.1.6. Alternating Estimation for TSF1 4.2. Initial Test of Nonlinear Optimization: Simultaneous

............................................. Optimization of Al1 Parameters

............................................. . 5 ANALYSIS OF SYNTHETIC DATA 5.1. Quasi-ALS ...................................................................

................................. 5.1.1. Two-way Synthetic Data Analyses .............................. 5.1.2. Three-way Synthetic Data Analyses

...................................................... 5.2. Nonlinear Optimization .............................. 5.2.1. Two-way Synthetic Data Analyses .............................. 5 -2.2. Three-way S ynthetic Data Analyses

6 . AN APPLICATION ..................................................................

7 . DISCUSSION

REFERENCES APPENDIX A APPENDK B APPENDIX C APPENDIX D APPENDIX E APPENDIX F APPENDIX G VITA .......................................................................................

vii

Table

LIST OF TABLES

Description Page

Nurnber of Perfect Solutions and Averaged Model Fit and Recovenes of Non-perfect Solutions .................................... Model Fits (R2) and the Recoveries of the Two-way

................................................... Super QALS Analyses

Model Fits (R~) and the Recoverks of the Three-way ................................................... Super QALS Analyses

Model Fits (R') and the Recovenes of the Two-way Nonlinear Optimization Analyses ..................... .. ............-.

Model Fits (R') and the Recoveries of the Three-way Nonlinear Optirnization Analyses .......................................

Factor Loadings (B) on the Measuring Conditions and ......................................................... Shift Estimates (S)

An Example of Mean Offset of Shifts and ................................................... Their Mode B Loadings

LIST OF FIGURES

Figure Description

Basic Data Relation Matrix (BDRM) ....................................

Flash Evoked Potentids Recorded at Different Scalp Locations ...

....................................... Three Shapes of Synthetic Curves

The 25 Evoked Potential Data C w e s ................................. Recovered Sequential Factors by the Two-way Shifted

..................... Factor Mode1 and by the Standard Two-way PCA

The Data Curves in Each Condition o f the Paired Pulses ............

Page

LIST OF APPENDICES

Appendix Description Page

A Fractional Line Search (FLS) for the Shifi Estimation .................. 68

B Oscillating Mode1 Fits in FLS and Super QALS ........................ 71

.......................................... C AIternating Estimation for TSF 1 74

.................. D Singular Value Decomposition for the A Estimation 77

E Nonlinear Optimization Procedures ....................................... 79

F MATLAB PrograrnsofQALS ........................................... 81

G Mean Shift Offset and Shift Outliers ....................................... 92

INTRODUCTION

Cattell (1 946) showed various ways of getting syrnmeû-ical data to factor analyze,

fiom his BDRM (Basic Data Relation Matrix): R- and Q-, P- and O-, and S- and T-

technique. He suggested P-technique as a method to define personality facton unique1 to

one person. P-technique factor analyzes the correlations (or covariances) among variables

but across occasions, while the conventional factor analysis extracts factors fiom the

correlations (or covariances) among variables but across subjects. He called the

traditional factor analysis R-technique. Different techniques defined by Cattell are based

on the various ways to get a symmetrical cross product type data matrix. In other words,

they are different in terms of how to capture the covariation of measures due to the

underlying factors in the data. Figure 1 shows how the different versions of correlations

(or covariances) can capture facton fiom the different facets.

Considenng that time senes variables are likely to be lagged dong the time axis

but differentially according to levels of the variable mode, Cattell (1963) suggested time-

corrected (or lead-and-lag) P-technique in order to correct the mismatch of time senes

variables before factoring them by using the standard factor analysis. But the potential

lagging was supposed to exist in the relation among variables, i.e., in correlations. Time-

corrected P-technique analyzes a corrected version of across-time correlations, maximum

cross correlations. To get a maximum cross correlation for two time senes variables,

' The term "unique factor'' to a person or a group (when each occasion measure is averaged across perçons in a gmup) should not be confused with "specific factor" to a variable. "Unique" is to restrict the factor solution to the sample (a person or a group), while "specific" is to indicate the variance accounted for by a variable but none of the other variables.

People

Variables

F i w e l . Basic Data Relation Matrix (BDRM): adopted from Cattell (1946). Each pair of two parallel lines represents how to capture the covariation from one of six facets of BDRM.

one of the two tirne senes variables is first lagged by various amounts oflag and then

correlated with the other variable. Among the cross correlations, the maximum cross

correlation is then used for factor analysis. Tirne-corrected P-technique, however, does

not take care of dzfferential lagging of the underlying time senes factors because the

varying amounts of lagging are applied to the observed time series variables, not to the

underlying time senes factors. In other words, it assumes that the underlying time senes

factors are simultaneously lagged.

Molenaar ( 1985) generalized Cattell 's time corrected P-technique by allowing

varying arnounts of lagging of the underlying time series factors and called his model

dynarnic factor analysis. Dynamic factor analysis assumes that an observed time series is

a weighted surn of underlying time senes factors that persist during several subsequent

times. In other words, the contribution of the latent time series factor to the data at time r

is assumed to take several consecutive times t, t+ 1, . . . to W e a r itself out. Dynarnic factor

analysis successfully resolved the limitation of the time corrected P-technique: the latent

time series factors are simultaneously lagged (or shifted). However, dynamic factor

analysis also cannot uniquely detetmine the model parameters, as is the case in the

solution of principal component analysis (PCA), because an arbitrary rotation of the

solution of dynarnic factor analysis keeps the data recovered by the model invariant.

More detailed discussion will be provided in the Shifted Factor model section.

STATEMENT OF PROBLEM

Pro~ortionaliîy of the Standard Factor Model

A fundamental feature of the standard factor model is the "proportionality" of the

effects of factor weights, i.e., of factor Ioadings and factor scores. The loading of factor r

on variable j is a weight applied multiplicatively to each subject7s factor score on factor r.

across al1 subjects; it represents the relative contribution of the rth factor to the jth

variable, and the proportional contribution to the data for al1 subjects at the jth level. In

the same way, the rth factor score of the ith subject holds for al1 variables as the relative

and proportional contribution of the rth factor to the ith subject at al1 variable levels.

Thus, the product of the jth factor loading and ith factor score for rth factor determines

the contribution of that factor to the observed score of the ith subject at jth variable. This

proportionality of the factor model c m be considered as a generalization fiom multiple

regression in the sense that both factor analysis and multiple regression search for

coefficients (or weights) that define some linear combination(s) of a set of given

variables so as to maximally account for the target variable(s). Factor analysis differs

fiom multiple regression in that it has to estimate two sets of unknown coefficients (i.e..

factor loadings and factor scores), but multiple regression estimates only one set of

coefficients (Le., regression coefficients).

The proportional relationship between the two sets of factor weights c m be

descnbed as a "bilinear combination," and written in scalar form as

where a represents a factor loading, b represents a factor score, and e represents a

residual not fit by the model. The subscnpts, i, j, and r, respectively, represent the ith

subject, the jth variable, and the rth factor. The italic upper case letter R is used to

represent the maximum value of r, Le., the nurnber of factors. By this convention, this

paper also uses the itdic upper case letter I to stand for the maximum value of i (e-g., the

nurnber of subjects), and J to represent the maximum value o f j (e-g., the nurnber of

variables). Note that Equation 1 is not an expression for the typical correlation or

covariance data as is usually the case in the traditional factor analysis, but instead is an

expression for the original "raw score" or profile data (as in Horst, 1965). Kniskal(1978)

differentiated two types of data fitting methods in factor analysis: direct and indirect

fitting. Indirect fitting represents the conventional way of performing factor analysis in

which a correlation or covariance matrix is factor analyzed, whereas direct fitting means

to fit a factor model directly to the original profile or "raw" score data without converting

them to correlations or covariances. Equation 1 represents the factor rnodel for direct

fitting in his taxonomy. The appropnateness of direct and indirect fitting for some

sequentially organized data will be discussed in the following section.

An Example of Sequentiallv Or~anized Data

Some psychological measures can be obtained at a series of closely spaced successive

points across an interval of tirne, fiequency, or distance. For exarnple, consider an

electrical signal evoked in the brain by a stimulus, such as a flash of light; the data consist

of a set of voltage measures obtained at I successive time points. Then suppose we have a

collection of J senes of such sequential measures frorn different sources, for example, J

6

different electrode locations on the scalp, but al1 obtained during the same set of l times.

We can then compose an I by J data matrix X, by adjoining the J column vectors of

length I side by side. These sequential data can be M e r assumed to consist of J linear

combinations of a few latent sequential variables called factors. An example of evoked

potential data is presented in Figure 2, where 8 time series measures are drawn against a

set of the sarne time points after a flash of light. Figure 2 is drawn based on the profiles

provided in Field and Graupe (1 991). The sequential mode could be frequencies or

distances instead of time points. For instance, EEG power spectral data in physiological

psychology or fluorescence spectral data in chemistry may be examples of typical

frequency data. III environmental science, the levels of the sequential mode could be

many geographical locations that are apart from a source-location by varying distances,

which is supposed to create a few pollutants.

Violation of the Proportionality in Some Sesuential Data

Some kinds of sequential measures rnay not be appropnate to fit by the standard

factor model. A problem arises when the underlying sequential factors are not simply

added together to produce the observed columns of X, but are first subject to shitts in

position of varying amounts along the sequential axis before being surnmed to produce

the observed data in any column. The fact that row position shifts (or position offsets) can

be different for each of the factors underlying such sequential measures will cause the

effects of the sequentiai factors to Molate the proportionality across cuves (e.g., across

electrode locations) required by the standard factor model. For example, the effects of a

sequential factor could be shifted down by 3 rows (relative, Say, to the average shifi

Time after flash

Figure 2. Flash Evoked Potentials Recorded at Di fferent Scalp Locations (C3, C,, C4, P3, Pz, P4, 01, Oz). From Field and Graupe (1 99 1 ).

8

position of that factor) when measured at an electrode located in the temporal lobe, while

it could be shifted up by 2 rows when measured at an electrode located in the occipital

lobe. Likewise, the shift amounts may Vary by factors at the same level in the non-

sequential mode. For example, a sequential factor could be shifted down by 3 rows when

measured at an electrode located in the temporal lobe but another sequential factor at the

same location might be shifted by a different nurnber of rows.

Furthemore, the standard indirect fitting factor analysis is also not feasible if

shifiing factors are present. This is because it is not appropriate to compute cross

products, covariances, or correlations among variables by matching ith scores meaçured

at different variables (e.g., different electrode locations) without adjusting for the row

position shi fis, again because underlying sequential factors will be di fferentially shi fted

From one variable to the next. If the sequential factors are differentially shifled across

variables, the "true" association between two sets of sequential measures would not be

known without adjusting the row position offset before computing one of the cross

product type measures. In addition, if the sequential factors are differentially shifted

across variables, it is not possible to know the 'bue" association between variables

without revealing the nature of the latent shifted factors. This is because each sequential

variable is a sum of a few sets of bi-products (&,ajr) and because the latent sequential

factors are differentially shifted before producing the bi-products. Thus, raw profile data

(1 by J for the above example) would be preferred to the symrnetrical data of cross

products ( Jby J for the above exarnple) when decomposing the sequential data to reveal

the latent shified factors.

SHIFTED FACTOR MODELS

Two-way Shifted Factor Mode1

Harshrnan (1 997) has proposed a "Shifted Factor" type of model to incorporate

the row position shifis in the description of the latent factors underlying the sequential

data when the proportionality is violated. He derived the Shifted Factor model from the

standard bilinear factor model shown in Equation 1 as follows,

(2) xii = x a,. 1+sjr 1 t- b . ~r +eV -

This differs from Equation 1 only in that (i+sjr) replaces i (the row subscnpt of the

sequential mode factor loading a is parenthesized for clarity). There is a shift value SIr,

added to the standard row subscnpt i, which is an additional mode1 parameter that is not

part of the standard factor mode1 such as the standard hvo-way factor analysis presented

in Equation 1. Note that the shifi value s,, is added to the row subscnpt i of the factor

loading for the sequential mode, air, so as to adjust the row position of factor r at colurnn

j. In the case of the above example, sjr is the shiR parameter defining the amount of the

row position shifting of the rth latent time senes factor at jth electrode location. The shifi

pararneters sjr can be collected in a J by R parameter matrix S; these pararneters form a

two-way matrix because shift values are assumed to Vary by levels of the non-sequential

modeL (Mode B in the above equation) as well as by factors. Therefore, the element in the

' Mode will be used to represent each axis in the data matrix (or array) but only for the raw profile data, not for cross product type symrnetrical data. Thus, each mode will be a distinct entit-y, e.g., subjects, variables, measuring conditions, occasions, etc.

10

jth row and the rth column of S represents the size of the row position shift that is applied

to al1 elements of the jth time series, but only for the rth factor.

The two-way Shifted Factor mode1 is different f?om both Cattell's time corrected

P-technique and Molenaar's dynamic factor anaiysis in the sense that it directly fits the

model to the raw profile data rather than to a cross product type of symmetncal data. The

direct fitting of the Shifted Factor model then gives a "direct" solution of factor scores

without any fùrther estimation. Dynamic factor analysis has been reported not to be able

to determine uniquely the solution of factor scores aven a fixed solution of factor

loadings (Molenaar, 1985). The two-way Shif€ed Factor model is the same as dynamic

factor analysis in the sense that both of them adjust for the row position shifi in the latent

factor level, but not in the observed variable level as time corrected P-technique does.

However, the two-way Shifted Factor model is different From dynamic factor analysis

because the former c m give a uniquely determined solution of factor scores and loadings,

but the latter cannot. The discussion about the indeterminacy in dynamic factor analysis

will be followed in more detail later.

Three-wav Shifted Factor Models

Like the two-way Shifted Factor model, the three-way Shified Factor models can

be derived fiom a three-way proportional factor mode1 (Le., trilinear factor model) such

as PARAFAC (ParaIlel Factor analysis) . The PARAFAC model for direct fitting of 'raw'

profile data can be presented in scalar form as,

I I

Equation 3 shows that each data point in a three-way array3 X is approximated by a sum

of R triple-products except, of course, the residual e p which is not accounted for by the

model. The proportionality of the effect of the three sets of factor loadings4 in PARAFAC

holds in the same way as it does for the two sets of factor loadings in the two-way

standard factor analysis s h o w in Equation 1. To facilitate describing three-way Shified

Factor models, the above hvo-way example of sequential data can be extended to a three-

way sequential data array by introducing another non-sequential mode. The third mode

could be various subjects from whorn the evoked potentials are to be measured or

different types of stimuli presented to a single subject. Thus, the evoked potential

rneasures are obtained from K different stimuli but at the sarne 1 time points after each

stimulus and at the sarne J electrode locations, forming an I by J by K three-way

sequential data array X.

The size of the shifi parameter matrix S (or of three-way array S) in three-way

Shified Factor models will Vary depending on how one defines the relation of shifi

parameters between the two non-sequential modes. The simpler model assumes that the

shifi, sjr holds for all levels of the third mode. For exarnple, the shifl parameter of the jth

electrode location for the rth factor is a constant across ail K stimuli. With this

assumption, a three-way Shifted Factor model (TSF1 , which stands for Three-way

Shifted Factor model 1) can be derived as,

One convention is to use a bold and underlined upper case letter for three-way arrays, whereas a bold upper case Ietter represents a two-way array. ' "Loading" will be used to stand for factor weights in any mode in a more general sense since each mode does not have any special meaning in the rnodel level.

TSF1 differs fiorn Equation 3 in the same way that Equation 1 differs fiom Equation 2.

narnely that shift parameter sjr is added to the row subscript i of the sequential mode

parameter a. The shift parameter has two subscripts just as in the two-way Shifted Factor

model shown in Equation 2. In other words, the rth factor underlying the jth 'slab' of the

three-way sequential data is shified by a constant amount s, for al1 levels of the third

mode. Because the shif? value for the rth factor at the jth level of Mode B (e.g., electrode

location mode) is common to al1 levels of Mode C (e.g., stimuli mode), shift parameters

can be collected in a J by R mahix as in the two-way Shifted Factor model.

A bit more complicated but general three-way Shifted Factor mode1 can be

defined by allowing a different amount of shift at sach specific level of Mode C for each

level of Mode B. In the above three-way example, when producing the time series of

evoked potential responses obtained at the jth electrode location but only fiom the kth

stimulus, the rth sequential factor is assumed to be subject to a distinct arnount of shifi.

Thus. the shifi parameters are assumed to Vary across levels of Mode C as well as across

levels of Mode B. Consequently, one needs to collect the shifi parameters in a J by K by

R three-way array, 3. This three-way Shifted Factor model (TSFZ) can be written in

scalar form as follows,

Note that the shifi parameter added to the row subscript i of the sequential mode ioading

a has additional subscnpt k, which is introduced to represent the varying shifl amounts

across levels of Mode C (e.g., across K stimuli). In TSF2, the rth sequential factor

underlying the jth level of Mode B and the kth level of Mode C is assurned to be shifted

by a distinct amount, s,b. In other words, shift pararneters are independent across each

combination of levels f?om both non-sequential modes as well as across factors. TSF:! is

more general than TSF1 ; TSF1 can be considered as a special case where the K J-by-R

slabs in the "three-way" shid parameter array S are constrained to be identical, thus,

TSF2 subsumes TSF1 . However, TSF1 is simpler and since it has fewer mode1

pararneters, the ratio of data degrees of fkeedom to mode1 parameters is higher and hence

better in TSF 1 than inTSF2.

The third three-way Shifted Factor mode1 (TSF3) c m be defined by açsuming that

the shifis result from two independent effects: the fint due to one non-sequential mode

(e-g., the electrode location mode), and the second due to the other non-sequential mode

(e.g., the stimuli mode). Shid values are allowed to vary across levels of both non-

sequential modes but the shifi value for the rth factor at the jth level of Mode B and the

kth level of Mode C is a sum of two shift components from different sources, sjr and sb.

As a result, shift pararneters can be collected in two 2-way shifi matrices: one, SfB, for

Mode B and the other, Stcj for Mode C. TSF3 can be written in scalar form as,

R

14

where si, is added to the row subscnpt i of the sequential mode factor loading a to adjust

the row position shift by an arnount due to the jth level of Mode B. and sb is to adjust the

row position shift by an arnount due to the kth level of Mode C. The parameter s, is the

element in row j and colurnn r of SfB,, and s b is the element in row k and column r of S,o.

Which, if any, of these three-way Shifted Factor models is appropriate for a

dataset will depend on the specific properties of the factors underlying those data. Thus,

it is an empirical question, to be investigated by comparing the results from fitting those

models to a given three-way sequential dataset.

Uniqueness in the Two-wav Shifted Factor Mode1

The "rotation problem", that is, the indeterminacy in fixing the orientation of

factor axes, is one of the most fundamental and potentially controversial issues in the

two-way standard factor model. Harshman (1 994b) has conjectured that the additional

parameters collected in S will help determine the unique orientation of factors underlying

the sequential data, even in the two-way case. Although the shift parameten are

introduced to provide for offsets of factors along the sequential axis, they c m provide

additional information useful in fixing the factor orientation for two-way sequential data.

In this sense, they can be considered an analogue of the third mode factor loadings in

PARAFAC. Thus, the recovered factor loadings of the two-way Shifted Factor mode1

will not be rotationally indeterminate when factors are differentially shifted at each level

of the non-sequential mode, enough to fix the factor orientation.

To show the uniqueness of the two-way Shifted Factor model, it is useful to first

think of the indeterminacy of factor orientation in two-way factor analysis or principal

15

component analysis (PCA). The standard two-way model for direct fitting analysis of raw

score data cm be written in matrix notation as,

where F, A, and E are, respectively, the factor score matrix, the factor loading matrix,

and the error (or unique) matrix not fit by the model. Note that equation 7 is for raw

profile data rather than for the cross product type of symrnetrical data. This is to facilitate

cornparison with the uniqueness property of the huo-way Shifted Factor mode1 which

also involves direct fitting of raw profile data. The rotational indeterminacy of the factor

score and the factor loading matrices can be explained by showing that the sarne X is

recovered no matter what rotation matrix has been applied. Let T be some nonsingular

'rotation' or transformation matrix. We can insert I (or TT-') in Equation 7 to obtain the

same X as follows,

where F and A represent, respectively, the rotated version of F and A. Thus, any

transformation of F and the cornpensating rotation of A leaves the fitted part of the data

X invariant, even though an infinite number of rotated factor solutions are possible.

However, this rotational invariance of the recovered X does not hold in the two-

way Shifted Factor rnodel. Because J differently shified versions of A are required to

produce X, there is no longer only one pair of rotation matrices, i.e., a rotation matrix T

and its inverse T-' as the compensating rotation matrix, to produce always the sarne

16

recovered X as in Equation 8 for the standard two-way model. The rotated version of the

two-way Shifted Factor model can be derived as follows,

where [s,(A)] and 8; represent, respectively, the shifted and then rotated version of A

and the rotated b,. Note that the columns of A in Equation 9 must be shifted first by the

amounts defined in the j th row of S before being rotated. Because the shifted version of A

[ s,(A)] holds for only the jth colurnn of data X and the jth row of B, it is not possible in

the two-way Shifted Factor model to keep the fitted part of data X invariant by applying a

pair of rotation matrices, Le., T and T-'. Because the shifted versions of A are not the

same across levels of Mode B, the idea of rotation (by only one pair of rotation matrices)

does not make sense in the two-way Shifted Factor model. in other words, it is not

possible to rotate a single A (not multiple versions of A each of which is for the

corresponding row of B) and the whole B (not b,) in the two-way Shifted Factor model.

The differential shifting of A makes it impossible to rotate the factor loading and score

matrices while keeping the fitted part of X invariant, and consequently gives a unique

solution. Thus, the two-way Shified Factor model has an important additional property,

"miqueness" of the orientation of factor axes in addition that it adjusts the row position

ofiset of sequential factors.

Note that Molenaar's dynamic factor analysis is also able to provide for the

position offsets of factors along the sequential axis by assurning that the contribution of

the latent time series factor to the data X at time t penists during several subsequent

17

times r, r+I, . . . to Wear itself out. As a result, the factor contribution to the data at time r

is a sum of a few previous factor contribution at t, t-1, . . . Because factor loadings are

supposed to vary depending on different amounts of lags, the recovered data by dynamic

factor analysis can be considered a surn of the factor contribution across several

subsequent times. The rotational invarince of the recovered data X by dynarnic factor

analysis can also be written in the sarne way as Equation 8, as follows,

O

where S is the maximum nurnber of lags (or shifts). F, and As represent, respectively, the

rotated version of Fs and A, (respectively, factor scores and factor loadings at time s).

Thus, any transformation of F,'s and the cornpensating rotation of As's will leave the

fitted part of the data X invariant as is the case in the standard two-way factor analysis.

Likewise, the provision for and estimation of such row position shifts will be able

to help resolve the rotation problem in some three-way sequential data particularly when

the sequential factors are differentially shified along the sequential mode and when the

systernatic proportional factor variation in the third mode is too weak to determine

uniquely the factor orientation. However, note that the shifi patterns must be distinct from

one factor to the next.

ALGORITHM TESTTNG AND DEVELOPMENT

Clearly, when the underlying structure of a dataset involves shif'ting latent factors.

standard factor analysis will not be suitable. New factor estimation methods are needed;

ones that incorporate d l of the parameters of the Shified Factor model. Two different

types of estimation methods have been tested here: QALS (Quasi AItemating Least

Squares) and nonlinear optimization (OPT). ALS is prefixed with "Quasi" because the

row position shifi of the underlying sequential factors makes the parameter estimation for

the sequential mode an approximate but not an exact least squares solution, conditional

on the other set of parameters B. This issue will be discussed in the following sections in

detail. To arrive at the current version of the QALS program, a series of algorithms have

been developed in a cumulative way; thus, the final version of the QALS prograrn

includes al1 the developed feahires. In addition, initial versions of the OPT programs have

been developed to implement the Shifted Factor models by another estimation rnethod. A

versatile matrix language program MATLAB (e.g., see MathWorks, 1996a and 1996b)

has been used for both the QALS and the OPT programs.

The altemating least squares (ALS) method is a popular and straightforward

technique to estimate multiple sets of unlaiown parameters. That is, one part of the

unhown pararneter set is estimated at a time by using regular regression estimation,

given the curent estimates of the other set(s) of variables5. Each set of unknown

variables is altemately estimated, updating the estirnate of some part of the unknowns at a

time. This altemating estimation keeps iterating until the iterative solution reaches a

prescribed stopping criterion. One of the most important properties of the ALS estimation

is that the estimate of each set of unlmowns is always a least squares (i.e., multiple

regression) solution given the data and the cwen t estimates of the other unknowns.

For the standard two-way factor analysis, ALS altematively estimates factor

scores and loadings. The alternating idea of the estimation of loadings on each mode

holds for the Shifted Factor models. However, the position of sequential factor loadings

along the sequential axis must be adjusted before using them as the fixed parameters in

the regression estimation. in sirnilar fashion, the columns of the data must be shified

before the data are used to estimate the sequential factors.

Altemating Estimation for the Shifled Factor Models

Harshman (1 997) suggested the following three features of the estimation

procedure for the two-way Shifted Factor mode1 in terms of the alternating least squares

method. These features are what have to be adapted to the ordinary ALS factor analysis

program;

1. The loadings for the sequential mode (Mode A) are subject to position shifts

along the sequential axis when it is used in the estimation of rows of non-

ALS is used for multilinear models, with each subset of parameters chosen so that the estimation problem is Iinear for those parameters when the othen are held fixed (Carroll & Pruzansky, 1984)

sequential mode (Mode B);

2. The data X is subject to position shifts along the sequential mis when used in

the estimation of colurnns of A;

3. There is a new step in the major iteration that estimates elements in S, the size

of the shifis for the latent factors of Mode A at each level of Mode B.

When estimating the R factor loadings for a specific level of the non-sequential

mode (e-g., b,,, . . ., bjR, giving the relative importance of the factors at a specific electrode

location), a differentially shifted version of A should be used as the predictors, Le., the

fixed (or independent) variables in the regression equation. Consequently, each row of

loadings for the non-sequential mode (i.e., each row of B) must be estimated individually

by using a version of A that is shifted for that row so that it will give the least squares

solution for that row of B and the corresponding colurnn of X. Conversely, for the A

estimation, colurnns of the data X must be differentially "mshifted" to be aligned in the

same row position as a particular colurnn in A, where the proportionality of the effect of

factor weights is restored. Now, each colurnn of A must be estimated at a time because

the amount of shifting is not the same across factors. These two altemating estimations

iterate until the QALS solution reaches a prescribed stopping criterion.

Brute-Force Estimation of Shifis

As mentioned in Harshman (1 997), the estimation of the size of shifts for a

specific row of S requires a different type of estimation procedure from the ordinary

regression method. As a crude exhaustive search type of estimation, a "brute-force" (BF)

method has been developed to estimate the size of shifls. The BF estimation is exhaustive

3 1

and simultaneous with B estimation in the sense that al1 possible combinations of shifts in

a given range are tried when estirnating each row of B. For example, when the

predetermined range of allowed shifi candidates is From -5 to 5 ( 1 1 integers) and the

number of factors to extract is 2, the number of al1 possible combinations of shifis to try

for the estimation of one row of B becomes 121 (= 1 1'). The corresponding row of S is

then updated with the best combination of shift values, namely the one that gives the least

sum of squared residuals for x,, the corresponding colurnn of X. The corresponding row

of B, bj is also updated with the resulting regression weights given the best combination

of shi fis, the fixed A, and x,.

Poiishing the Sequential Mode Estimation Procedure

When estimating the rth column of A, unshifting the columns of X by the amount

defined in the rth column of S is not enough to estimate precisely a colurnn of A. This is

because only the contribution of the rth factor is subject to the position shifts by the

amounts given in the rih colurnn of S; the contributions of other factors are shified by

different patterns of shifts, as given (approximately) by the values in different columns of

S. Thus, it is necessary to partial out the contribution of the other factors before

unshifling the data to restore each column to the "true" row position as in A. The

contribution of the rth factor X, is then estimated by subtracting the other factors'

contribution fkom X.

However, the estimate of the rth column of A, a,, is still not an exact least squares

solution. During the intermediate iterations, the estimate of Xr (a "polished" version of X

for the rth factor's contribution) includes some systematic variance due to the other

37 -- factors as well as the error variance. This is because inaccurate estimate of shifis during

these iterations c m cause the estimate of the rth factor contribution to be confounded

with the contribution of other factors. Likewise, some of the systematic variance due to

the rth factor can also be subtracted fkom X. This problem in the A estimation of Q M S

prograrn has not yet been overcome. Nevertheless, it does not prevent a perfect solution

fiom error-kee data. The inaccurate estimate of A violates the property of the iterative

ALS estimation: monotonie improvement of mode1 fit (reduction in sum of squared

residuals). This is also the reaçon why "quasi" has to be attached in fiont of ALS for the

name of the ALS type of Shifted Factor programs.

Fractional Line Search for the shift estimation

Another estimation procedure for shifts, called Fractional Line Search (FLS), has

been developed to increase further the mode1 fit. The purpose of FLS is to allow the

estimate of shifis to be fractional values rather than only integer values as assurned in BF.

where they are the numbers of rows to shift along the sequential axis. Considering that

sequential factors usually refiect processes that are continuous along the sequential axis

in nature or in the population, the interval (or unit) of shifi and the nurnber of levels of

the sequential mode can be considered as an artificial choice among many possibilities;

selected for the convenience of measurement, by convention, or due to a limitation of the

measuring tools. We c m then consider the sequential values as a sample fiom the

continuous population values, at least in theory. Thus, it makes sense to interpolate the

factor loadings in the sequential mode to obtain estimates for fractional shift values,

23

hopefully, in order to obtain the more flexible shifl estimation and thus B estimation. See

Appendix A for the details about the FLS estimation procedure.

A single shifi value is estimated at a time in FLS rather than a combination of

shifis for al1 factors at the jth column of the data because the combinations of fiactional

shifls are infinite. Necessarily, there is an order effect of factors in ternis of the fractional

shifl estimation and the resulting Mode B estimation. The order effect can be thought as a

cost of the more flexible estimation of the shift values

"Super Iteration" Using: Recurrent BF and FLS

Fit values during the iterations in FLS sometimes oscillated by the every nth

iteration, where n varied case by case. For example, the fit values bounced back and forth

between the sarne two nurnbers dunng the iterations in the analyses of synthetic data

when the oscillating symptom was strong. This symptom seems to be due to both the

order effect of factors in the shift and the B estimations in FLS and the "quasi" least

squares solution of A due to the inaccurate estimate of the rth factor contribution to the

data. Refer to Appendix B for the details about the oscillating symptom and an

interpretation of it.

As a provision for the oscillation problem, a "super" iterative method has been

developed. The program went through higher level iterations, altemating between BF and

FLS fittings. The oscillating or non-converging solutions obtained after a prescribed

maximum number of iterations in FLS are again fed into BF as a starting position. This

"super" level iteration of recurrent BF and FLS resolved most oscillating problems and

sometimes non-converging solutions of error-fiee data in a given number of iterations

24

(e.g., 1000). The super iterative method seems to take a distinct advantage kom both BF

and FLS. BF in the super iteration can estimate shifts without the order effect of factors

because of the exhaustive method of estimation. The solution by FLS is less accurate than

is the solution by BF because of the order effect. Thus. the ' k o n g " or less accurate FLS

solution could be considered as a kind of different initial starting position in the recurrent

BF that gets the BF solution in another path to the final solution. Thus, the less accurate

FLS solution c m give BF another chance to try a better path and then to avoid reaching a

local minimum solution. If the FLS solution is "bad" enough to give BF a chance to find

an efficient path to the grand minimum, the recurrent fitting by BF but with a small

maximum number of iterations will be better than a single BF fitting with a fairly large

maximum number of iterations. The recurrent fitting by BF in the super iteration could be

equivalent to trying a number of different starting positions and then picking the best

arnong them. However, the less accurate FLS solution must not be considered as a totally

different starting position because it is still partly a least squares solution.

Three-wav Alternatinn Estimation for TSF 1

A three-way QALS program has also been developed to implement the sirnplest

three-way Shified Factor rnodel, TSFl. The idea of the altemating estimation works for

the three-way Shifted Factor models in the sarne way as for the two-way Shifted Factor

model. For the ordinary ALS estimation method, see Appendix C. For the estimation of

each row of B, columns of the current estimate of the fixed pararneter set (A and C) must

first be shifted by the amount given in the corresponding row of S. This is because TSFl

assumes that a specific shift si, holds across al1 levels of Mode C but at the jth level of

Mode B and for the rth factor. Thus, B is estimated one row at a time by using the

properly shifted version of the fixed part. However, when estimating the factor loadings

of the other non-sequential mode C, the shifting procedure is not the sarne as that for the

Mode B estimation. The columns in the fixed parameter set of A and B must be

differentially shified according to the levels of Mode B because T SFl assumes the

sequential factors to be differentially shified across levels of Mode B. Another difference

between the C estimation and the B estimation is that al1 elements in C are

simultaneously estimated rather than by individual rows because the amount of shi Ring is

common across al1 levels of Mode C. A is estimated in the same way as in the two-way

QALS. That is, each colurnn of A is estimated separately by using the row-position

corrected version of X that is unshifted appropriately to line up the latent contributions of

factor r. This is done by applying the values defined in the rth column of S to restore the

"true" row position in the data.

Lnitial Test of Nonlinear &timization: Simultaneous Estimation of Al1 Parameters

The QALS Shifted Factor program usually requires a long time to converge. Most

computation time spent in the QALS programs has been for the BF estimation of shifts

because al1 possible combinations of shifts (at each level) must be tried. 'fhe computation

time of the QALS programs is approximately a function of the numbér of the allowed

shift estimates in BF and the number of factors. For example, when the shifts are allowed

to range fiom -5 to 5 (1 1 integen) and the nurnber of factors is set to 3, 133 1 1-by-3 row

vecton of integer shifts must be tried in estimating each row of S and the resulting Mode

B loadings. The three-way QALS program requires a huge amount of computation time

26

in general, particularly, in inverting (by using generalized inverse) the "big" matrix of the

fixed pararneter set of A and C for Mode B estimation in BF. For example, for b,, the IxK

by R fixed pararneter rnatrix (factonvise bi-products of a, and ch) must be inverted and

then premultiplied by x,, hr' times (where N is the number of possible integer shifts) to

get the best least squares solution of b,.

To develop, hopehilly, a much faster program and, more importantly, as another

method to implement the Shifted Factor models, a couple of nonlinear optirnization

programs have been tentatively tested. In addition, one important advantage of using

nonlinear optimization for the Shifted Factor models is that one c m easily impose

constraints on the Shified Factor models that might be appropriate to some sequential

data. For instance, non-negative factors cm be assumed for some chernical or

physiological data based on a relevant theory or in nature. Consequently, the non-

negativity constraint on the model would be suitable to those types of data. Of course, the

constraints on the Shifted Factor models must be carefully applied based on a sound

extemal source of information. If the constraints imposed on the model are correct, they

will make the model more stable in the presence of excess noise or systematic error. They

will also be able to Save some of computation time. More irnportantly, the correct

constraints will also be able to make the estimation more accurate in the sense that a

correct constraint reduces the degrees of keedom of the model to estimate. For exarnple.

if the '?rue" underlying factors are al1 positive, the expected value of the sum of squared

residuals of random positive factor estimates from the positive '?me" factor loadings will

be smaller than that of the unconstrained random factor Ioadings. This is because the

27

unconstrained factor estimates could be either positive or negative so that the expected

distance in the factor space between the unconsaained factor estimate and the tnie

loading is longer than that between the constrained estimate and the true. The expected

distance is equivalent to the square root of the mean square error of factor loadings.

Most optimization programs tested so far simultaneously estirnate al1 parameters,

A, B, (and C in the three-way case) and S by nonlinearly optimizing the parameters to

minimize an objective fûnction, the sum of squared residuals of the whole data matrix X

( ~ r X in the three-way case) rather than a part of parameters as is the case in the QALS

program. The A L S estimation is not simultaneous because the estimation is for one mode

at a time. The estimation is not simultaneous even within Mode A because each column

of A must be estimated at a time by using a correctly polished and then unshifted version

of X. The updated columns of A are then used in the estimation of the other columns of

A, inevitably having an effect on the estimation of the remaining columns of A.

The objective function (Le., sum of squared residuals of a column of the data or

the whole data) is minimized by a numencal 'hi11 climbing' procedure, simultaneous

optimizing al1 parameters of the Shifted Factor model. Details are presented in Appendix

E.

One common requirement in using these rninirnization programs is that the

objective fünction must be continuous. This continuity is a requirement to evaluate the

gradient of the objective function to minimize. It could be appropnate to assume that the

variation of factor loading parameten (A and B) results in continuous function values

because it is always possible to define a value between two factor loadings. Thus, the

28

continuity of the objective function will be guaranteed because the factor weight is

infinitely continuous in theory. However, the continuity does not hold for the Shifted

Factor rnodels as long as the size of shifts are defined to be a number of rows in a given

dataset. However, the linear interpolation of A or X by using the fiactional shifis has

resolved this discontinuity problem. If one c m define a fairly small amount of fiactional

shifl, the resulting surn of squared residuals will be continuous. This is because, given a,,

and bjr for xo, the sum of squared residual of x, is a continuous function of continuos the

fractional shifts, which can be any real number.

ANALYSIS OF SYNTHETIC DATA

Sequentially organized synthetic error-fiee data have been used to determine if

the cornputer programs work, and if the Shified Factor solution is unique. These tests

have been performed with both two-way and three-way error-free synthetic data. The

progressive improvements in algorithmic methods for the QALS technique have each

been tested with the synthetic data, and the changes have been accurnulated to complete

the final version of QALS prograrn. That is, the finally completed QALS program (Super

Iterative QALS with the recurrent brute force and the fractional line search estimations)

utilizes al1 the methods mentioned earlier in the algorithm section of Quasi-ALS. The

completed QALS prograrn also has been used to analyze synthetic fallible data to see

how well the prograrn can recover the parameters when some amount of error is present.

or in other words, to see how robust the Shifled Factor models are against various

amounts of error. The nonlinear optimization programs also have been used to analyze

the sarne error-fiee and fallible synthetic datasets to compare the two types of estimation

methods in tems of both the ability to recover the parameters and the efficiency of the

programs. In addition, three different types of c w e s were used in producing the error-

fiee sequential datasets to see if the shape of sequential factors has a differential effect on

the ease of the recovery of parameters. Because both QALS and OPT are likely to reach

local minima, a few different starting positions have been tried for each analysis.

The standard two-way factor analysis solutions of the synthetic two-way datasets.

and the P W A C solutions of the synthetic three-way datasets have been used as a

reference or baseline for cornparison when evaluating the solutions given by the Shifted

30

Factor models. The reference solutions make it possible to see how much additional fit is

attained, and more importantly, how much the recovery of parameters used to generate

the data is improved by the Shified Factor model over and above that given by the

standard two-way factor analysis or PARAFAC. Single-scored overall fit values such as

R', mean square error, or STRESS (so called '8adness of fit") give valuable information

,\ about how well X recovered by a model resembles the data, but they do not give more

detailed information on how well the pararneters recovered or estimated by the analysis

resemble those used to generate the (synthetic) data. In particular, because of the

rotational problem in the standard two-way factor model, it is often possible to get a

fairly strong agreement between the data and the recovered X, even though the estimate

of factors is certainly far fiom the true parameters which have produced the data. For this

reason, the results bom analyzing the synthetic datasets have been investigated in terms

of not only the rnodel fits but also the agreement between the true pararneters and the

estimates of them.

Two-way error-flee datasets were generated based on the Shifted Factor model. In

other words, differential row position shifts of factors were used in generating the

synthetic datasets. The same error-fiee synthetic data were used to test the successively

developed algorithmic features. The final version of the QALS program (the super

iterative QALS program) both for the two-way Shifted Factor model and for TSF1 was

then used to analyze the synthetic fallible data with varying amounts of error.

Two-way Synthetic Data Analvsis

Data Three different shapes of curves were used when constructing the factor loadings of -

the sequential mode (see Figure 3). The fint set of c w e s , called "Handrnade";

subsequently shortened to "Hand", was generated by continuously mouse clicking 100

times in a two-dimensional geornetric plane ranging from -1 to 1 in both axes. The

resulting coordinates in each axis formed a 100x 1 sequential vector. Equally spaced sine

values were setected to generate the second and the third sets of curves, called "Hump"

and "Sine" respectively. Each hump in the "Hurnp" curves consists of sine values fiom O

to xbut the fint c w e (i.e., the first factor that is named F1 in Figure 3) was multiplied

by 0.5 to give a different magnitude of the factor contribution in the sequential mode. In

addition, the humps in the two factors are located in different levels in the sequential a i s

to make the "Hump" curves clearly different from each other. Non-zero values in the first

"Sine" curve are sine values ranging fiom O to Zxand those in the second from O to x.

Again, the first factor was multiplied by 0.5 for the sarne reason. Even though both the

"Hump" curves and "Sine" curves consist of evenly spaced sine values, they are quite

different fiom each other with respect to the slope of curves. The "Sine" c w e s are

smoother (Le., have more slowly changing factor loadings along the sequential axis) than

are the "Hurnp" curves. The resulting three types of synthetic data have been then used to

explore some possible effects of the shape of the sequential factors on the ease of

recovery of parameters. Of course, al1 the other characteristics in organizing the synthetic

data were fixed to avoid potential confounding of the effect of different shapes with those

"Handmade" curves

"Hump" curves

"Sine" curves

Fimire 3. Three Shapes of Synthetic Curves

by the other conditions, such as number of factors, numben of levels and loadings for

each mode, tnie shift values, and so on.

The number of factors was set to 2 so as to Save the computation time in the BF

estimation. The number of levels (Le., time points or successive values) of the "Hand"

curves was set to 100 and that of the "Hurnp" and the "Sine" curves to 60. The true

loadings of the non-sequential mode, B and the true shifts, S were set to be the same for

the three different types of datasets. The number of levels of Mode B was set to 20. The

true factor loadings of Mode B were sampled from uniformly distributed random

nurnbers ranging between O and 1. The true shift values for the 20 levels of Mode B were

also sampled from the evenly distributed integen ranging from -3 to 3.

The different sets of synthetic sequential data have been further tested at three

different error levels (5%, IO%, and 25%) by adding different amount of error to the

error-fiee rank6 2 synthetic datasets as described above. Normally distributed random

numbers were used as the error part in generating fallible synthetic data. The size of both

the mie and the error parts of the synthetic data was then set to unit variance by dividing

each part by its corresponding "grand" standard deviation before muitiplying relative size

weights. "Grand" is appended to represent the standard deviation of the whole rnatrix of

the mie or the error part. The scale free true and error parts (Le., unit-varianced tnie and

error parts) were then multiplied by the square root of their desired proportions in the

total variance.

6 The rank of a matrix is defined as the maximum number of linearly independent rows (or c01um.n~) of the matrix (Green & Carroll, 1976: p 167).

34

Results. The error-f?ee synthetic data were analyzed first to see if the programs could

recover the true factor loadings and shifi values perfectly without providing any

constraint such as the orthogonality among factors. The QALS program could recover the

true factor loadings and shift values perfectly after applying the polishing procedure for

the A estimation. Even though the BF procedure could sometimes alrnost recover the true

factor loadings and the true shifis perfectly without the polishing procedure (recovery

correlation of more than 0.9), it was only after applying the polishing procedure that the

perfect solutions were obtained. The nurnber of perfect solutions out of 4 and the

averaged recovery correlations of parameters are summarized in Table 1. The standard

two-way factor analysis solutions are also given as a reference to compare with the

QALS solutions.

The recovery correlations of the non-perfect solutions have been averaged across

factors as well as across solutions. The uniqueness property of the two-way Shified

Factor mode1 has been proved by showing that the two-way QALS programs can recover

the parameters (A, B, and S) perfectly as well as the data from error-free data without any

extemal constraint such as the orthogonality among factors. The successive fitting of BF

and then FLS but only once for each (BF-FLS) could not get additional perfect solutions

that had not been perfect when fitting BF once. However, when BF and FLS were used

recurrently (i.e., the Super iterative QALS: Super QALS), the number of perfect solutions

was much increased as shown in Table 1. Thus, the hypothesized distinct advantages of

BF and FLS in Super QALS have been numerically confimed. That is, FLS could give

the following BF a chance to try another "starting position" by aggravating the better

Table 1

Number of Perfect Solutions and Averaged Mode1 Fit and Recoveries of Non-perfect

Solutions

Non-perfect Solutions No. Perfect

Fit ( R ~ ) A B S

PCA

Super QALS

"Hand" "Hurnp" "Sine"

"Hand" "Hump" "Sine"

"Hand" "Hump" "Sine"

Note: BF and Super QALS represent, respectively, fitting only by BF vs. the recurrent fittings by BF and FLS. PCA is for the model fits and the averaged recoveries across factors of the standard PCA. The number of perfect solutions is out of 4 different starting positions. The decirnal point is dropped fiorn the model fits (R') and the recovery correlations for A, B, and S. The model fits and the recovery correlations of the non- perfect solutions are averaged across factors as well as across the non-perfect solutions.

solution kom the previous BF fitting in Super QALS. However, FLS. when used just

once after fitting BF (i.e., in BF-FLS), fit a little additional variance without further

improving the recovery of the parameters. This is the reason why the nurnbers of perfect

solutions by the two QALS fitting methods (i-e., BF and BF-FLS) are compared with

each other in Table 1.

A specific amount of shifi cm have a totally different effect on the factor

contribution when it is applied to two different curves. The more fluctuating the

successive loadings in the sequential mode or the steeper the dope of curves, the larger

the effect of an arnount of shift when applied to the sequential factor. Because the "Sine"

curves are smoother than the "Hump" curves and are Iess fluctuating than the "Hand"

curves, it seems to be harder to find the correct arnount of shifts even though the tme

shifts are the same for al1 types of data.

Based on the results sumrnarized in Table 1, Super iterative QALS was selected

for further testing. Ten different starting positions were tried for each different error level

and different type of data, in order to estimate better how many times Super QALS can

reach the unique optimum solution and to see how much closer the optimum solutions

were to the true parameters. Table 2 surnmarizes the mode1 fits ( R ~ ) and the recovery

correlations of parameters, A, B, and S obtained in these 10 m s . The recovery

correlations are the correlations between the true parameters and their estimates. A lot of

results have been sumrnarized in Table 2 to facilitate more direct and simultaneous

cornparisons across different e m r levels, across the three different types of data, and

with the standard PCA solutions. The column heading "Uniq." represents the number of

Table 2

Mode1 Fits I R ) and the Recoveries of the Two-way Super QALS Analyses

Fit Uniq. A B S Error (%) R~ (Bad) F1 F2 (Bad) F1 F2 (Bad) F1 F2 (Bad)

"Hand" O PCA SF

5 PCA SF

10 PCA SF

25 PCA SF

"Hump" O PCA 879 SF P (902)

5 PCA 845 SF 961 (953)

10 PCA 811 SF 923(916)

25 PCA 711 SF 816(802)

"Sine" O PCA SF

5 PCA SF

10 PCA SF

25 PCA SF

Note: Error (%) means the proportion of error in the total variance. PCA and SF stand for, respectively, the PCA solution and the two-way super iterative QALS solution. The column heading Uniq. represents the nurnber of perfect solutions out of 10. The averaged mode1 fits and recovery correlations of the non-perfect solutions are given in the parentheses next to those of their perfect or best solution. The decimal points are dropped From the mode1 fits and the recovery correlations. The capital letter P represents the perfect fit or recovery.

38

the perfect solutions for the error-fiee data (0%) and the nurnber of the best and

presumably unique irnperfect solutions for the fallible data (5%, IO%, and 25%). The

counts are out of the 10 solutions starting fiom different initial values for the parameters.

The non-perfect (in the error-fkee data case) and the bad non-matching solutions with the

best solutions (in the fallible data case) are averaged across different solutions as well as

across both factors, and then reported in the parentheses. Solutions have been decided to

be identical when they were correlated with each other by more than 0.99. The capital

letter P in the columns for the recoveries represents the perfect correlation.

First of all, the true parameters could, again, be recovered perfectly fiom al1 error-free

data generated by using different types of cuves: always for "Hand" data, 6 times for

''Hump" data, and 4 times for "Sine" data out of the 10 solutions. Al1 mode1 fits by Super

QALS are better than those by the standard two-way PCA. Particularly in the "Hand" and

the "Hump" data cases, most recovery correlations were also much better than those of

the PCA solutions. Thus, the analyses of the fallible data have shown that the two-way

Shifted Factor model can recover factor loadings and scores much better than the

standard two-way Factor analysis does. The recovered factor loadings of the fallible

"Hand" (except for 25% error level case) and the fallible "Hump" data were al1 very

close to the true loadings (at l e s t 0.9 in the recovery correlations). Considering that the

improvement of model fits was not so big, the much better improvement in the recovery

of parameters shows that the super iterative QALS helped the program more in

recovering the parameters than in increasing the model fit. In al1 analyses reported in

Table 2 (across different error levels and across different types of data), the additional

39

fitted variance by Super QALS over and above that by the standard two-way PCA was

6% at best. However, the recovery of factor loadings of the fallible "Hand" (5% and

10%) and "Hump" (5%, IO%, and 25%) data were dramatically improved. This

differential improvement clearly shows that the two-way Shifted Factor mode1 is much

better in recovenng the parameters than in fitting the additional variance from the error in

the data. This is presumably because of the uniqueness property of the Shified Factor

model.

The recovery correlations for the shifts were always much lower than were the

recoveries for the factor loadings. This was at first puzzling but an explanation was found

recently. The lower recovery of shifts have been found to be due to a factorwise (Le.,

within a factor) "mean" offset in the shift estimates. When the recovered shift estimates

are compared with their tme shift values, one finds a few outliers that are quite far from

the their true values while the other shift estimates deviated from the tnie values by a

small constant amount. The mean offsets in the shift estimates seem to have happened in

order to compensate for the outliers, because of an anchoring procedure of the shifi

estimates that is applied in both BF and FLS. This procedure is designed to anchor the

mean of the shifi estimates within a given factor on a constant value. The factorwise

anchoring of the shift values does make sense in the sense that the mean difference of the

shift estimates arnong factors will not change the recovered data as far as the constant

amount of the mean shift is compensated for in the estimate of the sequential factor

loadings (A). This is because the constant shift offset is comrnon to al1 levels of the non-

sequential mode (Mode B) o n that factor. The anchoring procedure takes out the rounded

40

mean of the shift estirnates within a factor rather than the rnean of them because al1 shift

estimates must be integers in BF. Thus, after being anchored, the rounded mean of each

column of the shift matrix S must be zero. The mean offset made the A recovery

correlation looked worse than it was. The anchoring procedure could fix the factonirise

constant offset of shifts, and consequently, correct the underestimated A recovery

correlation without changing the mode1 fit and the recovery correlation for B. This is

discussed M e r in Appendix C.

Note that there were not matching best solutions for the fallible "Hump" and the

fallible "Sine" data: the 25% error level for the "Hump" data and al1 error ievels for the

"Sine" data. This means that the non-matching best solutions are likely to be a local

optimum solution rather than the best global optimum solution, because it is not possible

to Say that a single best solution is unique, which does not match with any of the other

solutions. Considenng that in the results of the fallible "Sine" data, there was only one

non-matching best solution, and that the recovery correlations were smaller in the fallible

"Sine" data than in the fallible "Hand" and the "Hump" data. the difficulty of the "Sine"

c w e s has been again confirmed.

The recovery correlations for factor loadings by Super QALS were smaller in one

or both factors than were the corresponding recoveries by the two-way PCA in the

analyses of the "Hand" 25% error data and of al1 fallible "Sine" data. These worse

recoveries seem to have been partly because of the difficult shape of the "Sine" curves

and partly because of the effect of the enor variance. The low recoveries of the "Hand"

25% and the "Hump" 25% data show the effect of the error variance that causes Super

QALS to have difficulty recovering the parameters.

The general pattern of the number of the perfect (or best) solutions and the

recovery correlations shows that the "Hand data are easier to recover the parameters

than are the "Hump" data, and that the "Hump" data are easier than are the "Sine" data

Of course, the smaller the error variance, the better the solution.

Three-way S-ynthetic Data Analvses

Data. As mentioned in the algorithm section, the three-way QALS prograrn requires a

fairly large amount of computation time in inverting the rearranged "long" fixed part of

bi-products (e.g., aiKk for the Mode B estimation) so many times in the brute force

estimation. Thus, the number of levels of the two non-sequential modes (Le., Mode B and

Mode C) was set quite small in this study: 5 for Mode B and 4 for Mode C. Except for

the true loadings of Mode B and Mode C, and the tme shifi values (because the number

of rows must be the same for B and S in TSFI), ail conditions were fixed in generating

three types of three-way synthetic data based on the simplest three-way Shified Factor

model, TSF 1. The m e factor loadings for Mode B and for Mode C were selected from

uniformly distributed random numbers and the true shifi values fiom uniformly

distributed integers ranging £iom -3 to 3. Unlike the two-way synthetic data analysis,

only one error level(25%) of fallible data was used for the three-way QALS analysis,

again because of the huge computation time of the three-way QALS prograrn. Normally

distributed random numbers were again used for the error part in the fallible three-way

data. The same three types of curves given in Figure 3 were used to generate the three

42

different types of resulting datasets. Thus, six different three-way synthetic datasets were

analyzed: two error levels (error-free and 25%) by three different shapes (the "Hand", the

"Hump", and the "Sine" curves).

Results. The six synthetic three-way datasets were analyzed by using the three-way super

iterative QALS program. Unlike the two-way synthetic data analysis, only four different

starting positions were used for each of the six datasets. Their results are surnmarized in

Table 3 in the same way as in the results of the two-way synthetic data analysis. The

solutions of the proportional three-way factor model, PARAFAC have been provided as a

reference to compare with the three-way Shifted Factor model (TSF 1) solutions. Again,

reported under the colurnn heading Uniq. are the nurnbers of perfect solution for the

error-fiee datasets and the nurnbers of matching best fitting solutions for the fallible

datasets. These nurnbers are out of four solutions that started fiorn different randorn

numben as the initial values for the estimate of pararneters. The remaining non-perfect or

non-matching sohtions have been averaged and reported in the parentheses below the

perfect or best solutions.

The three-way Super QALS program could recover the pararneters perfectly From

the error-free synthetic data: always for the "Hand" and the "Hump" data, and twice for

the "Sine" data out of the four solutions. These perfect solutions show that the shifting

idea also works for three-way data. Even in the f ' l ible case (the 25% error level), the

recovered pararneters are very close to their true values. In particular, al1 the four

solutions of the fallible "Hand" data exactly match the others, giving very strong

Table 3

Mode1 Fits (R') and the Recoveries of the Three-wav Super OALS Analyses

Fit Uniq. A B C S

"Hand" O

25

"Hurnp" O

25

"Sine" O

25

PAR 896 TSFl P PAR 688 TSFl 789

PAR 846 TSFl P PAR 672 TSFl 800

(754)

PAU 963 TSFl P

(918) PAR 746 TSFl 801

(807)

968 865 Always P

955 838 Always 951 981

966 869 Always P

889 871 1 930 970

(834)

Note: Error (%) means the proportion of error in the total variance. PAR and TSF1 stand for, respectively, the PARAFAC solution and the solution of three-way super iterative QALS for TSFl. The heading Uniq. represents the nurnber of perfect (or best) solutions out of 4. The averaged model fits and recovery correlations of the remaining non-perfect (or non-matching) solutions are given in the parentheses below those of their perfect (or best) solution. The decimal points are dropped fiom the model fits and the recovery correlations. The capital let-ter P represents the perfect recovery.

44

evidence that the factors underlying some three-way sequential data can be uniquely and

very successfully recovered by the three-way Shifted Factor mode1 even against a

considerable amount of error. However, note that the P M A C solutions are also quite

close to the tnie values. Considering that PARAFAC has not any provision for the shifts,

the three-way datasets already have a lot of information to detemine the orientation of

factors. This information must be provided by the differential factor weights across Ievels

of the third mode that do not exist in the standard two-way factor analysis. In other

words, the true parts in the two-way and the three-way fallible datasets are not the sarne

in texms of the arnount of information to fix the factor orientation, even though their

proportion in the total variance was fixed to be the same in the two-way and the three-

way fallible data. This is because the PARAFAC solution of the error-free three-way data

(Le., the mie part in the three-way fallible data) is always unique, whereas the two-way

PCA solution would not be unique without the extemal constraint, the orthogonality

arnong factors. Of course, it was the three-way QALS program but not the PARAFAC

program that could recover the parameters perfectly.

The recovery correlations for the shift estimates are again poorer than those for

the factor loadings (A, B, and C). These poorer recovenes must have been partly due to

the mean shift offset as mentioned earlier. The mean shift offset is iikely to exist in the

fallible "Hand" data case, because the solutions are al1 identical and because the

recoveries are perfect for Mode C and aimost perfect (0.999) for Mode B on the first

factor. Thus, the relatively low recovery (0.95 1) for Mode A and the rnuch lower

45

recovery (0.846) for the shifls on the same factor, strongly suggest that a mean shiA

ofEset has made the recovery correlations for Mode A and shifts underestirnated.

The difficulty of the "Sine" data in recovenng the parameters seems to be partly

relieved by an additional source of information to fix the factor orientation. The three-

way QALS program rnay have used to fix the factor orientation the differential factor

loadings across levels of the third mode (Mode C) as well as the differential amounts of

shifting across levels of Mode B. However, the "Sine" curves were still more difficult to

recover than were the other types of curves. The error-free "Sine" data could be

recovered perfectly twice out of the four solutions, whereas the error-fkee "Hand" and

"Hump" data were always perfectly recovered. For the fallible datasets, the best solution

of the "Hurnp" and the "Sine" data is likely to be a local optimum solution because there

was no matching solutions. even though their recovery correlations are very high. To see

if the three-way fallible "Hump" and "Sine" data can reach a unique solution, more

starting positions need to be tried.

The efficiency of the three-way Shifted Factor mode1 (TSFI) appeared in another

way. PARAFAC sometimes needed much more iterations (e-g., more than 500) to reach a

converged solution for the same data than did the three-way QALS program for TSFl

(e.g., within 30). This fact suggests that some sequential data, where differential shifls are

present, make it hard for PARAFAC to find the unique orientation of factors. The very

slowly converged PARAFAC solutions were more likely to reach a local minimum than

were the three-way QALS solutions. Most local minimum P W A C solutions could not

differentiate factors fiom each other, even though their mode1 fits were a s good as those

46

of the global minimum solutions by the three-way QALS program. The PARAFAC must

have required much more iterations than did the three-way QALS program because the

smooth "Sine" curves take less information to fix the factor orientation from the sarne

amount of the shift values than did the "Hand" and the "Hump" curves.

One practical problem in the BF estimation of shifis is that the trial of dl possible

combinations of shifts requires a huge arnount of computation time. in particular, the

three-way QALS program for TSF 1 will require a couple of hours to converge on a fast

(e.g., 200 MHz) persona1 computer when the two non-sequential modes have a relatively

large nurnber of levels (approximately more than 10) and there are more than 2 factors to

fit. inverting the long tentatively fixed matnx of bi-products to estimate a row of

B needed more than 95 percent of the computation t h e spent in the analysis of the three-

way synthetic data.

Nonlinear Optimization Analvsis

The same synthetic two-way and three-way datasets have been used to test the

nonlinear optimization programs for the Shifted Factor models. For the unconstrained

analysis of the synthetic data, one of the nonlinear minimizers built in MATLAB.

"hinu" has been tested. Because basic estimation method is always the same for any

parameters (Le., no matter what the parameter is: a factor loading or a shift), the

computation time was faster in the nonlinear optimization analysis than in the QALS

analysis. The nonlinear optimization program could recover the parameters perfectly

from the error-fkee two-way "Hurnp" data and from both of the error-fiee three-way data.

Thus, the uniqueness property of the two-way and the three-way Shifted Factor models

47

could again be confumed by another estimation method: the nonlinear and simultaneous

optimization of parameters. The parameters were more often recovered perfectly fiom the

tluee-way data than from the two-way data. This is, as mentioned earlier, because the

three-way error-fkee data (and the true part in the three-way failible data) have another

source of information to fix the factor orientation: differential factor loadings across

levels of the third mode.

The results of the nonlinear optimization analyses were obtained by setting a more

lenient stopping criterion of iterations than in the QALS analyses. The reason why the

stopping criterion must be made lenient was that the minimizer "hinu" started to give a

degenerate solution at some point when the stopping criterion was as stringent as that in

the QALS analyses. Consequently, the soliitions of the nonlinear optimization analyses

are less correct than are the solutions of the QALS analyses because a lenient criterion

can allow an insufficiently converged solution. To get more correct (or sufficiently

converged) solutions, a M e r shidy will be required. One potentially better nonlinear

minimizer could be "leastsq", because it has been reported to be more appropriate to

apply to the least squares type of problems than are the other optimizen. In a few very

tentative tests, "leastsq" behaved better than did "hinu".

Two-wav Svnthetic Data Analvsis

Two types of synthetic data (the "Hand" and the "Hump" data) were used for the

two-way nonlinear optimization analyses. Only one level of error (25%) was used as the

fallible data. The results of the two-way nonlinear optirnization analysis are sumrnarized

in Table 4.

Table 4

Mode1 Fits (R') and the Recoveries of the Two-wav Nonlinear Ootirnization Analyses

Fit Uniq. A B S Error (%) It2 (Bad) F1 F2 (Bad) FI F2 (Bad) FI F2 (Bad)

"Hand" O PCA 936 893 591 884 342 0PT2 984 (982) 1 992 873 (927) 786 978 (850) 437 835 (480)

25 PCA 768 888 483 905 289 OPT2 804 (797) 1 838 770 (616) 721 737 (536) 344 525 (442)

"Hump" O PCA 879 340 883 574 976 0PT2 999 (999) 3 P 999 (998) P (999) 940 975 (831)

25 PCA 711 331 884 495 963 0PT2 804 (782) 1 869 940 (672) 9 19 963 (636) 689 9 15 (45 1 )

Note: Error (%) means the proportion of error in the total variance. PCA and OPT2 stand for, respectively, the PCA solution and the two-way nonlinear optirnization solution. The colurnn heading Uniq. represents the nurnber of perfect or best solutions out o f 4 . The averaged model fits and recovery correlations of the rernaining non-perfect solutions are given in the parentheses next to those of their best solutions. The decimal points are dropped fiom the model fits ( R ~ ) and the recovery correlations. The capital letter P represents the perfect recovery.

49

Conservatively speaking, the non-linear optimization program could recover the

parameters perfectly from none of the N o types of error-fiee data. However, three

solutions of the enor-&e b'Hump" data matched out of four soiutions. Furthemore, the

recovery correlations of factor loadings (A and B) were almost perfect: at least 0.999.

Thus, in the three matching solutions, only the estimates of shifts seem to have made the

solutions not perfect. Thus, the nonlinear optimization analyses could confirm "almost"

the hypothesized uniqueness property of the two-way Shified Factor model. Note that

even in the fallible data case, the pararneten were recovered quite successfuIly from the

"Hump" data although there was no matching solutions. The "Hurnp" curves seem to

have behaved better than do the "Hand" curves, in terms of the recovery of parameters

when they were analyzed by the nonlinear optimization program.

Three-way S-ynthetic Data AnaIvsis

As in the above two-way nonlinear optimization analysis, the two types of

synthetic data (the "Hand" and the "Hump" data) were used for the three-way nonlinear

optimization analysis. Only one level of error (25%) was again used as the fallible data.

The results of the three-way nonlinear optimization analysis are summarized in Table 5.

The pararneten could be recovered perfectly from the error-fiee "Hand" data and

almost perfectly (at least 0.988 in the recovery correlations) fiom the error-free "Hump"

data. In the result of the error-fiee bbHump" data, the recovery correlation of the second

factor (0.993) seems to have made the corresponding recovery correlation in Mode A not

perfect. The results of the error-fiee data show that the parameters of three-way

sequential data can be recovered perfectly by the nonlinear optimization method.

Table 5

Mode1 Fits ( ~ ~ 1 and the Recoveries of the Three-wav Nonlinear ODtimization Analyses

Fit Uniq. A B C S

"Hand" O PAR 896 0PT3 P

(972) 25 PAR 688

0PT3 722 (709)

"Hump" O PAR 846 0PT3 999

(998) 25 PAR 746

0PT3 656 (629)

Note: Error (%) means the proportion of error in the total variance. PAR and 0PT3 stand for, respectively, the PARAFAC solution and the solution of the three-way nonlinear optimization analysis. The column heading Uniq. represents the nurnber of perfect (or best) solutions out of 3. The averaged mode1 fits and recovery correlations of the remaining non-perfect (or non-rnatching bad) solutions are given in the parentheses below those of their perfect (or best) solution. The decimal points are dropped fkom the mode1 fits (R') and the recovery correlations. The capital letter P represents the perfect mode1 fit or recovery.

The uniqueness property of the three-way Shifted Factor mode1 (TSFI) has been

confirmed again by the nonlinear optimization analysis.

Note that the recovery correlations of the three-way fallible data are worse than

those given by PARAFAC. They are not even unique because there is no matching

solutions. This undesirable result suggests that the current nonlinear optirnization

program requires fùrther study to be used properly. Considering that the QALS solutions

of the same fallible three-way data were always better than the correspondhg PARAFAC

solutions. the nonlinear optimization programs seem not to be used correctly. Othewise,

to optimize nonlinearïy and simultaneously the parameters of the Shifted Factor rnodels

may not a proper estimation method.

52

AN APPLICATION

Field and Graupe (1991) successhilly showed the shift problem of sequential

factors along the tirne axis in their evoked potential data analysis. Their data would have

been a nice exarnple which would be appropriate to fit the Shifted Factor model.

Unfortunately, their evoked potential data were not available for this study. Instead. a set

of evoked response measures7 in the rat brain was used for this study.

The evoked potential measures were obtained every 0.1 millisecond after an

elecûic pulse during 20 milliseconds, consisting of 200 levels in the time mis. The

evoked potential measures were M e r obtained in various different conditions (e.g.,

different paired-pulse intervals, different voltage levels of the paired pulses, and a dmg

administration), but at the same 200 time points. Al1 the different measuring conditions

formed 25 levels of Mode B. Only the measures at the first 80 time points were used in

this analysis, because the Mode B weights of the 25 raw data curves looked not

multiplicative in the later levels of the time mode, resulting in an 80 by 25 two-way

sequential dataset. The 25 data curves are plotted in Figure 4, where the vertical and

horizontal axes represent, respectively, voltage levels (pl) and the 80 time points.

Factor loadings in the measuring condition mode (Mode B) were constrained to

be non-negative because it would be reasonable to assume that the underlying relative

physiological influence (Le., the factor weights) on the measuring conditions is not

negative. Ln other words, a relative physiological influence under a specific measuring

7 It is gratefully appreciated that Dr. Stan Leung provided the data without any hesitation for this study.

Figure 3 . The 25 Evoked I>oteiiiial Data Curves The vertical axis represeiits t lie voltase @A) of the evoked pot eiitials. l'lie tiorizoiital axis represenis the successive t ime poiii~s (0.1 ins) aiter ilie electric pulse.

condition can be interpreted as a magnitude of a physiological event. Thris, it will be

reasonable to think that the magnitude of the latent event underlying the evoked

potentials varies fiom zero to positive infinity. As mentioned in the algorithm section,

one of the MATLAB built-in optimimrs "nnls" was used in order to constrain the Mode

B loadings not to be negative. As in the synthetic data analyses, the standard two-way

PCA solution of the evoked potential data has been used as a reference to compare with

the constrained QALS solution. Two factorss were suggested to underlie the electric

potential data evoked in the rat hippocarnpus in an informal discussion. To see if the two-

factor solution is most reasonable and interpretable, a few three-factor analyses have been

tried. However, the third factor in the three-factor solutions was highly correlated with

the first factor. The third factor seemed to share a considerable amount of its variance

with the first factor. Thus, two factors were decided to be the most reasonable nurnber of

factors to extract fiom the evoked potential data. The following result is based on the best

interpretable two-factor solution out of a few two-factor solutions that started fiom

different random positions.

The mode1 fits ( R ~ ) were 0.9740 and 0.9893, respectively for the PCA solution

and for the constrained QALS solution. Even though the additional fitting by the two-way

Shifted Factor model is not big (0.01 53 in R ~ ) , the recovered curves (factors in Mode A)

by the Shifted Factor model look quite different from those by PCA. The recovered

curves are reported in Figure Sa (the constrained QALS solution) and Figure 5b (the two-

8 A related discussion about the theoretical components and the nurnber of the components underlying the evoked potentials in the rat brain c m be found in Leung (1 978).

a. Recovered A by tlie two-way constrained QALS

b. Recovered .A by the two-way PCA and onhogonally rotated A'S, by 6, 10. and 30 degrees clockwise

Fisure 5 Recovered Sequeiitial Factors (Mode A) by the Two- way Shified Factor hlodel and by tlie Standard Two-way PCA. "F 1 -O" and "F3-0" in panel b stand for the unrotated PCA solut ion.

56

way PCA solution). The unit of the factor loadings is the sarne as that of the data curves

because the scale of the data is reflected in the time mode (Mode A). In the unrotated

PCA solution (FI -0 and F2-0 in Figure 5b), the orthogonal fint factor seems to account

for the general pattern of the 25 data c w e s . In other words, most covariation of the data

seems to be captured by the first component. The dominance of the first factor has been

also confirmed by the relative size of the factor contribution (Le., the variance accounted

for by each factor) in those two solutions. In the unrotated PCA solution, the variance

accounted for by the first component was approximately 11 times larger than that by the

second factor. However, the first factor accounted for an approximately 5 times larger

variance than did the second factor, in the constrained QALS solution. In Figure Sa, it is

easy to see that the first factor accounts for the slow and overall event, whereas the

second factor accounts for most of the fast and early event. Thus, the two-way Shifted

Factor model could better differentiate the two factors than could PCA. The PCA solution

has been orthogonally rotated by applying various degrees to see if the constrained QALS

solution is simply a rotated version of the PCA solution. However, any orthogonal

rotation could not have the PCA solution resemble the constrained QALS solution. In

Figure 5b, three rotated PCA solutions are plotted with the unrotated PCA solution;

clockwise rotations by 6, 10, and 30 degrees (respectively, represented by -6, -10, and -

30). None of thern resembles the recovered c w e s by the Shifted Factor model, showing

that the Shifted Factor solution is qualitatively different fiom the PCA solution.

The factor loadings of the measuring condition mode and the shift estimates are

provided in Table 6. The 25 measuring conditions are combinations of three aspects:

Table 6

Factor Loadings (BI on the Measuring Conditions and Shift Estimates (S)

Intensity Interval Drug W) (ms) WN) F1 F2 FI F2

Note: n i e column heading "Intensity (@)" represents the voltage of the electrical stimuli, which evoked potentials in the rat brain. The second heading "Interval (ms)" represents the time interval between the paired pulses. The next column heading " Drug (Y/N)" represents whether or not the dmg (propofol) is adrninistered. S and B stand for, respectively, the shift estimates and the factor loadings in the measuring condition mode. Bold characten are used to distinguish the measuring conditions after the second electric pulse of the paired pulses from those after the first pulse. The factor loadings of the second factor (syrnbolized F2) measured after the dmg administration are underlined to distinguish from the loadings for the other conditions.

58

intensity of the electric pulse, time interval of the paired pulses. and a drug administration

(propofol). The second pulse curve d e r 300 ,UA with 30 rns between paired pulses was

excluded fkom this study because its shape looked abnorrnal compared to the other

curves. That is why the condition of 300 @ with 30 ms interpulse interval has only one

level in Table 6 (row 19). The factor loadings in the measuring condition mode must not

be compared between the two factors (F1 and F2 in Table 6). This is because the relative

importance of each factor is reflected in the time mode, resulting in the scale-free factor

loadings in the measuring condition mode. Thus, the factor loadings given in Table 6

must be used only in comparing the relative importance of the 25 measuring conditions

on each factor. The pattern of the shift esthates shows that there is a systematic

difference between the fint and the second pulse conditions. Bold characten are used to

the second pulse conditions from the first pulse conditions. Most shift estimates are

positively shifted when measured after the second of the paired pulses, whereas they are

negatively shifled after the fint electric pulse. This systematic difference of the shifiing

direction is the sarne for both factors. Thus, both factors can be interpreted to be delayed

when the data were measured after the second electric pulse. They arose earlier when

measured afier the first electric pulse. With respect to the factor loadings in the

measuring condition mode, the second factor (the fast and early event) more strongly

appeared when the h g was administered than did it without the dmg. The loadings of

the second factor for the post-dmg conditions are underlined to distinguish from the other

Ioadings. On both factors, factor loadings are larger at the second pulse conditions than at

the first pulse conditions.

First Pulse Conditions 800 1 1 1 1 1 I I

600 -

400 -

-200 1 I I 1 I 1 !

10 20 30 40 50 60 70 80

Second Pulse Conditions 800 1 1 1 1 I 1 T I 1

Figure 6. The Data Curves ir i Each Condition of t l ie Paired Pulses

'ïhese systematic patterns of the shift estimates and the factor Ioadings in the

measuring condition mode have been reassured by comparing the data c w e s between

the paired pulses. Data c w e s are plotted in each of the pulse conditions in Figure 6. It is

not difficult to see the different positions of the two events along the time a i s .

Furthemore, the data curves in the second pulse condition look higher than in the first

pulse condition. This size difference between two pulse conditions confirms the

systematic overall pattern of both factor loadings in the measuring condition mode.

Unfominately, the two-way Shifted Factor model could not recover successfully

the suggested theoretical factors. In particular, the onset time of the two factors should be

in opposite direction to that of the two factors recovered by the two-way Shifted Factor

model. The f a t and early event has been supposed to start later than the second factor

recovered by the constrained QALS (F2 in Figure 5a). The slow and overall event has

been supposed to start from the very first level in the time mode, unlike the first factor

recovered by the constrained QALS (FI in Figure Sa). The failure of recovering the

theoretical factors might be partly due to "shape change" problem of the sequential

factors across levels of the non-sequential mode. That is, the sequential factors

underiying the evoked potential measures seem to change their shapes as well as to be

differentially shifted across levels of the non-sequential mode. The "shape change"

problem seems to be quite common to sequential data where the sequential factors are

supposed to be differentially shifted. In other sequential data (spectral data fiom

chemistry), the "shape change" problern of the sequential factors appeared more clearly.

In order to take care of this problem, a time warping method has been suggested. By

6 1

applying the time warping method to the Shified Factor models, the sequential factors

will be able to change flexibly their shape across levels of the non-sequential mode.

Further study is required to adopt the time warping method into the Shifted Factor

models.

DISCUSSION

The shifting of factors does not irretrievably destroy or degrade information. With

the proper algonthm, it is possible to recover the original factors and the shifts fiom the

shifted mixtures that make up the surface data. This has been successfully tested by

analyzing the error-free synthetic datasets by both the QALS and the nonlinear

optimization prograrns. The two-way and the simplest three-way Shifted Factor mode1

(TSF1 ) cm perfectly recover the model parameters from most random starting positions.

Furthemore, the cornparison of the Shifted Factor solution with the PCA or the

PARAFAC solution clearly shows that the Shifted Factor models provide more

improvement in recovering the parameters rather than in increasing the model fit when

there is a significant amount of error. This ability of the Shifted Factor models was

clearer in the two-way analysis than in the three-way analysis. This is because

PARAFAC already has another source of information to recover the parameters that the

two-way PCA does not have: the distinct factor variation across 1eveIs of the third mode.

Thus, it can be said that the differential shifting of the sequential factors that provides

additional information allows the two-way Shifted Factor mode1 to recover the

parameters.

The hypothesized uniqueness property of the Shifted Factor models has been

numerically demonstrated by analyzing synthetic data. The perfectly recovered Shifted

Factor solutions starting from a few different random starting positions demonstrate that

the Shifted Factor solution is unique and that the prograrns implementing the Shifted

Factor models work. Furthemore, the Shifted Factor solution c m be considered

63

"intrinsically" unique, because the uniqueness of the Shifted Factor solutions is provided

by the differential shifts imbedded in the sequential data rather than by an extemal

constraint such as the orthogonaiity of factors as is the case in the two-way PCA. With

fallible data, the likelihood of a unique global optimum is suggested by number of

matching solutions. Even though the number of best fitting and matching solutions varies

depending on the amount of error and the shape of sequential factors, many recovered

parameten from fallible data matched with others. This fact shows that the best fitting

and matching solutions are local optima rather than arbitrary alternative solutions From an

infinite set of alternatives. For the two-way PCA, this would not happen even with

fallible data. Taking account of the mean shift offset, the matching solutions would be

more numerous than reported in the results.

To avoid underestimating how much the estimates of sequential factors resemble

their true values. the estimates of factor loadings in Mode A can be adjusted by taking the

mean offset out. The recovery correlation for shifts c m also be corrected by weighting

each shifi estimate and its tnie value by the Mode B weight at that row. This is because

the effect of a shift on the recovered data is a fùnction of factor loadings on the non-

sequential mode. In other words, the contribution of an amount of shift varies according

to the factor weight on the non-sequential mode at that row.

When interpreting the estimate of shifis, one should note that shifi values are not

the same in nature as the multiplicative factor weights (Le., factor scores and loadings) in

the standard factor analysis models. They must be interpreted in connection to the

specific loading pattern of the corresponding sequential factors (particularly with respect

64

to the shape of the recovered sequential facton) and the corresponding factor loadings in

the non-sequential mode. The contribution of a specific amount of shifting to the fit or

residuals is a function of two variables: the degree of change of the adjacent factor

loadings in the sequential mode (Le., the shape of the sequential factors; in particular, the

average absolute steepness of dope of the curve) and the size of the factor loading at that

level in the non-sequential mode (i.e., the corresponding Mode B weight). The shift

parameter is also different f?om a standard factor loading in the sense that it does not

have the multiplicative (or proportional) relation with other Ioadings in reproducing the

data. Because of these distinct properties of the shift parameter, the interpretation of the

Shified Factor solution must not be made in the same way as for the solution of the

standard factor analysis. For the same reason, it must be guaranteed in sequential data

before fitting the Shifted Factor models that the adjacent loadings along the sequential

axis are somehow related to each other. Without this assumption, a recovered curve

(sequential factor) by the Shifted Factor mode1 will be rneaningless.

As shown in the results of the synthetic data analyses, the shape of sequential

facton has an important effect on the recovery of parameters. The more rapidly changing

(i.e., steeper slope of the sequential factor) and/or the more fluctuating the successive

factor loadings are along the sequential axis, the stronger the contribution of shift

parameters will be to differentiating the facton. However, the shape effect must be

interpreted together with the factor loading in the non-sequential mode (Mode B).

In the evoked potential data analysis, the sequential factors were found to change

their shape, as well as their position along the sequential mis, across levels of the non-

65

sequential mode. This shape change problem was more clearly found in a chemical

spectral dataset. If the Shifted Factor mode1 could be modified to account for the

differential shape change of the sequentiai factors, it might have another source of

information to fix the factor orientation as well as become a more general model that can

resolve the shape change "problem". A method cailed "time warping" has been suggested

to allow the sequential factors to differentially change their shape across levels of the

non-sequential mode. The time warping method must be applied in the latent factor level

but not in the observed variable level, as must the position adjustment. The idea of shape

changes in sequential factors must be M e r studied to see if it can be implemented in a

generalized Shifted Factor model.

With respect to the Super QALS program, the maximum number of iterations

must be carefully determined in the subprograms (BF and FLS). If it is too small, BF will

be unable to iterate enough to reach a minimum. However, if it is too large, FLS will be

unable to give the following BF a chance to try another starting position. Thus, a few

different maximum nurnbers of iterations rnust be tried for a specific datasei before

deciding the optimal level of the maximum number of iterations for each subprogram. As

mentioned earlier, to constrain the model could be very valuable in the analysis of some

sequential data. Further study must be devoted to the nonlinear optirnization method of

fitting the Shifted Factor model(s) so that one c m flexibly constrain the model

parameters.

REFERENCES

Carroll, I. D. & Pnizansky, S. (1984). The CANDECOMP-CANDELMC farnily of

models and methods for multidimensional data analysis. In H. G. Law, C. W.

Snyder, Jr., J. Hattie, and R. P. McDonald (Ed.), Research Methods for

Multirnode Data Analvsis (pp. 372-402). New York: Praeger, 1984.

Carroll, J. D. & Arabie, P. (1980). Multidimensional scaling. Annual Review of

Psvcholorzv. 3 1, 607-649.

Cattell, R. B. (1946). The Description and Measurement of Personalitv. New York:

Harcourt, Brace & World.

Cattell, R. B. (1 963). The stnicturing of change by Ftechnique and incremental R-

technique. In C. W. Harris (Ed.), Problems in measurïnn chance (pp. 167- 198).

Madison, Wisconsin: University of Wisconsin Press.

Field, A. S. & Graupe, D. (1991). Topographic cornponent (paralle1 factor) analysis of

multichannel evoked potentials: Practical issues in trilinear spatiotemporal

decomposition. Brain To~oeraphv. 3,407-423.

Green, P. E. & Carroll D. J. ( 1976). Mathematical Tools for Applied Multivariate

Analysis (p 167). New York: Academic Press.

Harshman, R. A. (1 994a). PARAFAC: Parallel factor analysis. Com~utational Statistic &

Data Analysis. 18, 39-72.

Harshman, R. A. (1 994b). Substituting statistical for physical decomposition: Are there

applications for parallel factor analysis (PARAFAC) in non-destructive

evaluation? In X. P. V. Malague (Eds.), Advances in Signal Processine - for

Nondestructive Evaluation of Matenals @p. 469-483). The Netherlands: Kluwer

Academic Publishers.

H a n h a n , R A. (1 997). A Shifted-Factor mode1 for analvsis of seauentially oreanized

data Research Bulletin, Department of Psychology. London, Canada: University - of Western Ontario.

Horst, P. (1965). Factor AnaIvsis of Data Matrices. New York: Holt, Rinehart, and

Winston.

Kniskal, J. B. (1978). Factor analysis and principal components: Bilinear methods. In W.

H. Kruskal and J. M. Tanur (Eds.), International Encyclopedia of Statistics (pp.

307-330). New York: Free Press.

Leung, L. S. (1978). Dynarnic mode1 of neuronal response in the rat hippocampus.

Biological Cvbernetics. 3 1 ,2 19-230.

MathWorks. (1996a). Using MATLAB. Natick: The MathWorks, Lnc.

MathWorks. (1996b). MATLAB: Chtimization Toolbox. Natick: The MathWorks, inc.

Molenaar, P. C. M. (1985). A dynamic factor mode1 for the analysis of multivariate tirne

series. Psvchometrika. 50, 1 8 1 -202.

Press, W. H., Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. (1 992). Numerical

Recines in FORTRAN: The Art of Scientific Comoutinq (2nd Ed.), (Chapter 10).

New York: Cambridge University Press.

APPENDIX A. Fractional Line Search (FLS) for the Shift Estimation

The purpose of FLS is to allow the estimate of shifts to be fractional values rather

than only integer values as assumed in BF, where they are the nurnbers of rows to shifi

along the sequential mis. FLS interpolates the factor loadings in the sequential mode

(Mode A) in obtaining estimates for fractional shifl values in order to increase further the

mode1 fit.

Once the iterative solution of BF has become sufficiently stable (changing less

than some critical level fiom one iteration to the next), the converged final solution is fed

as a starting position into the FLS estimation. FLS has a more stringent stopping criterion

than BF does. FLS is to find the least squares solution for each row of B that gives a

M e r minimized sum of squared residuals of the corresponding column of X. Unlike

BF, FLS estirnates only one element in a row of S and B (e.g., sjr and bjr) at a time given

the current values of the other shifls and the Mode B loadings in that row (e-g., the jth

row). This is obviously necessary because al1 possible combinations of fractional shifts

for a row of S and B are infinite.

The FLS estimation proceeds as follows. When the shift estimate at iteration i in

FLS is s j r , for the jth level of Mode B and the rdi factor, three shift values (s-0.5, s,

s+0.5) are tried in the next iteration t+l to find the best shift that will give a further

rninimized surn of squared residuals for x, but given that the other (R-1) shift values and

Mode B loadings in the same jth row are fixed. If one of the boundary shift values arnong

the three candidates (i.e., s-0.5 or sM.5) gives the smallest sum of squared residuals of x,

than that given by the middle value s, the shifi and the Mode B loading, respectively,

69

s,rtr+,, and bjrff+,, are updated with that boundary shifi value and the resulting estimate of

Mode B weight. When sjrff+,, is updated, the interval of the three shift candidates to ny in

iteration r+2 gets reduced to half. The decreasing interval is to restrict the possible range

of the final fractional shifi estimate in FLS from s-l to s+l where s is the initial integer

shift fed in FLS. Of course, the middle candidate d u e in iteration r+2 must move to the

updated shift in iteration t + l (e-g., s-0.5 or s+0.5). For instance, the three candidates in

iteration r+2 become s-0.75, s-0.5, s-0.25 when the updated shift in iteration r+l is s-0.5.

However, if none of the two boundary values gives the smallest sum of squared residuals

of 4 in iteration r+ 1, the interval of shift candidates does not change in iteration r +2 and

both the shifi and the corresponding Mode B Ioading remain the same. T echnically

speaking, the resulting three sums of squared residuals of x, are successively "bracketed"

to search the best shift by trying the evenly spaced three shifi candidates. For more details

about the bracketing procedure, see Press et al. (pp. 390-393, 1992). Af'ter the shifi and

the B estimation, the fiactional part of the fiactional shift values is used in linearly

interpolating the values in columns of A (or X when unshifting the data matrix) that are

already shi Red or unshifted by the integer part of the fractional shifts.

The allowed range of shifl estimates during the whole iterations in FLS is

constrained not to go beyond the integer boundaries (s-1, s+ 1 ), where s is the initial shift

value given from BF. This restriction is to keep the best shifi estimates in the integer

level, which have been estimated by the exhaustive trial-and-error in BF. Note that a

single shift sjr is estimated at a time in FLS rather than a combination of shifts for al1

factors at the jth column of the data, sj as it is the case in the BF. Necessarily, there is an

70

order effect o f factors in terms of the hctional shifi estimation and the resulting Mode B

estimation. The order effect c m be thought as a cost to the more flexible estimation of the

shifi values so as to fit the data over and above the fitting in BF.

APPENDK B. Oscillating Mode1 Fits in FLS and Super QALS

Fit values during the iterations in FLS sometimes oscillated by the every nth

iteration, where n varied case by case. For example, the fit values bounced back and fonh

between the same two nurnbers during the iterations in the analyses of synthetic data

when the oscillating syrnptom was strong. This syrnptom seems to be due to both the

order effect of factors in the shift and the B estimations in FLS and the "quasi" l es t

squares solution of A due to the incorrect estimate of the rth factor contribution to the

data.

Let suppose we have a simple example with 2 factors to extract. In iteration r in

FLS, each elernents in the jth row of S and B is estimated given A fiom iteration t - 1.

However, the second element in the jth row of S, s,? and the resulting Mode B loading 6,'

are also a function of the first elernent of shifis s,, and its Mode B loading b,, because of

the order effect of factors in FLS. Then in the same iteration, Mode A estimates are partly

a function of the corresponding shifts as well as Mode B loadings because of the

polishing (or subtracting) procedure for each factor contribution before estimating each

column of A. Of course, the order effect of factors arises not only in the shift estimation

but also in the size loading estimation (i.e., Mode B and Mode A loadings). If the

improvement of factor Ioadings is very small across iterations as is the case after a few

hundred iterations in the ordinary least squares analysis, synthetic data analyses, the

bouncing iterative fits might be mainly due to the order effect in the shifi estimation and

the polishing procedure in the Mode A estimation. Note that the Shified Factor mode1

always needs to shifi colurnns of A whenever producing or estimating a column of

sequential data, and the estimation of each factor contribution is also not an exception.

One possible interpretation o f the oscillating iterative fits and agreement with the

true parameters is as follows. In iteration t , the first shift in the jth row sjr, will have an

effect not only on the estimation of the second shift s,2(, but also on polishing X for the

second factor estimation of A. nie updated second factor of A, a ~ ~ ~ ) will influence the

estimation of the second shift in iteration t+ 1. This second shifi in iteration r+ 1

will then influence the estimation of the fint shift in iteration t+2, sj,/,+~,. Although the

shift estimation is always to minimize surn of squared residuals for a specific column of

data it is given the fixed shift for the other factor. Thus, to minimize the s u m of squared

residuals of a column of data is not necessarily compatible with the rninimization in the

least squares sense. The minimization in FLS always depends on the fixed shifi for the

other factor@). ïh is type of weak minimization in FLS can result in a pair of bouncing fit

values and agreement values with the rnie loadings, which might happen through the

influence path described above. This kind of oscillating or circulating solution c m be

considered as a new type of local minima in the sense thar it will not change across an

infinite nurnber of iterations although it is obviously not a minimum.

As a provision for the oscillation problern, a "super" iterative method has been

developed. The oscillating or non-converging solutions up to a prescnbed maximum

number of iterations in FLS are again fed into BF as a starting position. This "super"

level iteration of recurrent BF and FLS resolved most oscillating problems and

sometimes non-converging solutions of error-fiee data in a given nurnber of iterations

73

(e.g., 1000). The super iterative method seems to take a distinct advantage from both BF

and FLS. BF in the super iteration can estimate shifts without the order effect of factors

because of the exhaustive way of estimation. The solution by FLS is l e s accurate than is

the solution by BF because of the order effect and is "wrong" in a sense. Thus, the

"wrong" or less accurate FLS solution could be considered as a kind of different initial

starting position in the recurrent BF that gets the BF solution in another path to the final

solution. Thus, the less accurate FLS solution can give BF another chance to try a better

path and then to avoid reaching a local minimum. If the FLS solution is bad enough to

give BF a chance to find an efficient path to the grand minimum, the recurrent fitting by

BF but with a small maximum number of iterations will be better than a single BF fitting

with a fairly large maximum number of iterations. The recurrent fittiiig by BF in the

super iteration could be equivalent to tryng a number of different starting positions and

then picking the best arnong them. However, the less accurate FLS solution must not be

considered as a totally different starting position because it is still partly a least squares

solution.

APPENDIX C. Alternathg Estimation for TSF1

The idea of alternating estimation for each mode works for one of the three-way

Shifted Factor models, TSFl. Before descnbing the TSFl program, it will be usefùl to

take a bnef look at how the ALS program works for the ordinary three-way factor mode1

such as PARAFAC. When estimaiing one of the three loading matrices, the three-way

data array must be appropriately reshaped into a two-way big "strung out9' matrix. The

given predictors, which are sets of bi-products (e.g., aiçb for the Mode B estimation) of

the eiements in the two fixed matrices, also have to be arranged so that the row (or

column) position agrees with that of the adjoined data matrix. For instance, the I by J by

K three-way data array c m be reshaped into an IxK by J matrix for the Mode B

estimation by stringing K slices in the three-way data array out one below the other.

Then, the elements in the tentatively fixed part in the regression estimation (an I x K by R

matrix combined by the given but tentative A and C in this case) must be arranged so that

the row position of the fixed part of the parameters matches to that of the strung out

'btwo-way" data matrix. The way of reshaping of the three-way data array and the

required arrangement of the tentatively fixed bi-products hoids for the three-way Shifted

Factor models. Three-way Shi fied Factor models are. though, di fferent from P ARAF AC

in that A must be properly shifted for Mode B and Mode C estimations before combining

the tentatively fixed bi-products and that the strung out two-way data matrix for Mode A

estimation must be properly unshifted before using it in the regression estimation.

Harshrnan (1 994a) described more details about the rearrangements of both the data and

the fixed parameters.

75

For the estimation of each row of B, the columns of the tentatively fixed part of

the parameten ( I d by R rnatrix combined by using A and C) must fint be shifted by the

amount given in the corresponding row of S because TSF1 assumes that a specific shifi

s,, holds across al1 levels of Mode C but at the jth level of Mode B and for the rth factor.

Thus, B is estimated one row at a time by using the properly shified version of the fixed

part. However, when estimating the factor loadings of the other non-sequential mode C,

the shifting procedure is not the same as that for the Mode B estimation. The three-way

data c m be, for example, reshaped into an I d by K matrix. Consequently, the fixed bi-

products (ai&,,, colurnnwise or factonvise combination of A and B) must form an IxJ by

R matrix so that the row positions of the two rearranged matrices match each other.

Because TSF1 assumes that the shift amount varies across levels of Mode B, the bi-

products in the fixed part ai&, must be differentially shifted according to the row

subscript of loadings of Mode B. It will be easier to visualize the differential shifiing of

colurnns of bi-products if one collects hem in an I by J b y R three-way array. The shape

of the top surface of the three-way array of the fixed bi-products will be the same as that

of the shift rnatrix S. Thus each colurnn of the rearranged three-way fixed part can be

shifted by the arnount defined in S. Of course, the shifted version of the three-way array

fixed part must be reshaped into an IxJ by R two-way matrix before using it in the Mode

C estimation. Another difference between the C estimation and the B estimation is that

al1 elements in C are simultaneously estimated rather than by individual rows because the

size of shif€ is cornmon across al1 levels of Mode C.

76

A is estirnated in the sarne way as in the two-way QALS. That is, each column of

A (e.g., a,) is estimated separately by using the row-position corrected version of X that

is unshifted appropnately to line up the latent contributions of factor r. This is done by

applying the values defined in the rth column of S to restore the '?me" row position in

the data. Of course, the unshifted version of X for the rth factor must be reshaped into a

proper two-way maûix, for example, an I by JxK matrix to use for the regression

estimation of A.

77

APPENDIX D. Singular Value Decomposition for the A Estimation

As discussed in the section of the polishing procedure, the estimate of A is not an

exact least squares solution because of the inaccurate estimate of the rth factor

contribution in the data. If the factors are orthogonal to the others in both modes. the

contribution of the other factors does not have an effect on the estimation of the rth

column of A. Consequently, the polishing procedure in the A estimation will not be

required in the Shified Factor models. However, the orthogonal factors are not guaranteed

in the Shifted Factor models. This is because the Shifted Factor models do not need to

constrain the factors to be orthogonal in order to fix the factor orientation, as is the case

in the standard two-way factor analysis.

The ordinary regression has been used to estirnate a column of A given the

polished and then unshifted version of X and the rth column of B. Considenng that the

estimate of the rth factor contribution to the data is confounded with some of the other

factors' contribution, it may not be safe to use a column of B as the fixed parameters in

regression to estirnate the corresponding column of A. Because each row of B has been

estimated given the differentially shifted version of A that is not quite correct in the least

squares sense, the estimate of B may also be a little incorrect by the effect of incorrect

solution of A in the previous iteration. If the effect of the incorrect A solution is

sornehow accurnulated across iterations through the subsequent incorrect solution of B.

the solution will diverge from the correct solution more and more during iterations. To

relieve this conjectured "carry-over effect" of the incorect solution of A, singular value

decomposition (SVD) has been adopted for the A estimation. SVD is a similar type of

78

decornposition method to PCA except its different scaling convention. That is, the SVD

solution is the sarne as the unrotated PCA solution except their scaling convention: SVD

collects the relative scale of factors in a diagonal matrix whereas the unrotated PCA

solution reflects the scale of factors in the loading matrix. Thus, both the PCA and the

SVD solution are orthogonal among factors.

The orthogonality of SVD has been supposed to cut the chain, through which the

effect of the incorrect A is canied over, because the SVD solution for A is less related

with the B in the sense that its does not use the column of B (b,) when estimating the

corresponding column of A (a,). However, the SVD estimation for A has not always been

better than the regular regression: SVD was better in the three-way synthetic data

analysis but regression was better in the two-way synthetic data analysis. It, however, has

not been thoroughly studied what the advantage, if any, of SVD for the Mode A

estimation is, and why it happens.

APPENDIX E. Nonlinear Optimization Procedures

In the nonlinear optimization programS. the objective Function (Le., sum of

squared residuals of a column of the data or the whole data) is minimized by a numencal

'hill climbing' procedure, simultaneous optimizing al1 parameters of the Shifted Factor

model. MATLAB built-in optimization programs "hinu" and "leastsq" have been tested

for the unconstrained Shifted Factor analysis of synthetic datasets; and "constr" and

"nnls" for the constrained Shified Factor analysis of a real dataset. The default estimation

methods for these minimizers are Quasi-Newton, Levenberg-Marguardt, Quadratic

Prograrnming, and Sequential Quadratic Prograrnming for "hinu", "leastsq", "nnls", and

"constr", respectively (MathWorks, 1996b). "Leastsq" is reportedly more efficient for the

least squares problem, whereas "hinu" is more general for unconstrained multivariable

functions.

One of the constrained optimization programs, "nnls" estimates one set of

pararneters given the other set of parameters and the data but with the non-negativity

constraint. That is, it gives a non-negative least squares solution. Thus, "mls" is not for

the simultaneous estimation of al1 pararneters but for one mode estimation given the

factor loadings in the other mode(s) a s in QALS. It has thus been used inside the

constrained QALS for the real data analysis. That is, the constrained QALS uses "nnls",

rather than the regular regression, for the estimation of each row of B given a column of

data and the correctly shifted version of A. "Constr", however, is more appropnate to

apply to the simultaneous optimization problem than is "MIS" because it does not need

the known set of parameters (such as the fixed part in the regression equation) as "mls"

80

does. With "constr", it is possible to impose various constraints on the mode1 parameters,

e.g., non-negativity on factor loadings A and B andlor upper and lower bounds on the

shifi parameters S to restrict the range of shift estimates. Thus, "constr" seems to be an

appropriate program to constrained nonlinear and simultaneous optimization problems,

particularIy for some real data.

One common requirement in using these minimization programs is that the

objective function must be continuous. This continuity is a requirement to evaluate the

gradient of the objective fict ion to minimize. It could be appropriate to assume that the

variation of factor loading parameten (A and B) results in continuous fùnction values

because it is always possible to define a value between two factor loadings. Considenng

one element in a two-way data matrix X, .rd, the contribution of factor r, is a bi-

product of the rth factor score at the ith row of A and the rth factor loading at the jth row

of B. Given one of the two sets of R factor weights (e.g., a,), the sum of squared

residuals of.? will be a linear function of the estimate of the other set (e.g., bjr). Thus the

continuity of the objective function will be guaranteed because the factor weight is

infinitely continuous in theory. However, the continuity does not hold for the Shifted

Factor models as long as the size of shifts are defined to be a number of rows in a given

dataset. This linear interpolation of A or X by using the fractional shifis has resolved this

discontinuity problern. If one can define a fairly small amount of fiactional shift, the

resulting sum of squared residuals will be continuous. This is because, given air and b,,

for .TV, the sum of squared residual of xv is a continuous function of continuos the

fiactional shifts, which cm be any real nurnber.

APPENDIX F. MATLAB Prograrns of QALS

The two-way and the three-way subprograms in the super iterative QALS: BF and

FLS are provided, which were written in the MATLAB language (MATLAB ver. 5).

Each pmgram will require a few complementary programs to be run. They must not be

considered completed for a ninning.

iBF2WSIM is the f i r s t subroutine of the 2-way modified ALS shifted + factor program for simulated data. The true parameters Aoriq, 5 3oriq, and Loriq as weil as X should be defined as a global t varianle. It calls for the fractionai ! i n e search sabroutine, * II ; ,s2wsirn.m".

i 5 May 28, 1597, Sunqjin Hong 1.

[nas,nbs] = s i z e ( X I ; ~olBf=.OS;rnaxinum=50;aChangeMax=l;bChangeMax=l; inum=l; L=zeros (nbs, nfs) ; lagmin=-5; lagmax=5; c= lagmin: lagmax] ' ; i f nfs==2 3Generate al1 possi~le combinations of shifts f o r a column of

3x. Lpop=[kron(c,ones(length(c) ,1) 1 ,kron(ones(lenqth(c) , l ) , c ) J ;

else i f nfs==3 Lpop=[kronic,one~(length(c1~2,l)) , k r o n ( k r o n ~ o n e s ~ l e n g ~ h ~ c ~ ,l) ,c) ,. .. ones(length(c),l) 1 ,kron(one~(lengthic)~2,1) ,cl 1;

elseif n f s = = 4 Lpop=[kron(c,ones(length(c) *3,1) 1 , . . .

k r o n ( o n e s ( l e f i g t h ( ~ ) , l ) ~ k r o n ( ~ ~ ~ n e ~ ( i e n g t h ( ~ ) ~ Z , l ) 1 I I . . . kron(one~(length(c~~2,l),kron(c,ones~lenqth~c~,l! 1 1, . . . kroniones(length(c) - 3 , 1) ,cl 1 ;

elseif n f s = = 5 Lpop=[kron(c,ones(length(c) "4,111 ,.. .

kron(ones(length(c),l),kron(~,0ne~(~ength(c1~3,~) 1 1 , . . . kron(ones(length(c)^2,?i,kron(c,on~~,ones~1enqth~c~~2, 1) } 1 , . . . k r o n ( o n e s ( l e n g t h ( c ) A 3 I l ) , k r o n ~ ~ I ~ n e s ~ 1 e n g t h ~ c ~ , l ~ 11, .. . kron(ones(length(c) ̂4,l),c) 1 ;

end outsavefilename=[outfiIenarne, 'b2',start,int2strrunsn , .mat1]; disp ( [ 'Starc of 2-way Brute Force ' , int2str (runsnum) ] ;

while (aChangeMax>tolBFIbChangeMax>tolBF) &inum<-maxinum

if inum>maxinum disp('BF has reached its maximum number of iterations')

end

3 Estimation of S and L f3r j=l:nbs ssqresXj=10n6; EAssigned initial value of sum of squared

iresiduals of xj. r a z i=l: length (lpop)

Alag=laq (A, Lpop (i, : 1 1 ; SShFft A DY usinq a combinat ion of %shifts picked from the matrix of %al1 possible combination of shifts.

bj=X(:, j ) '*pinv(Alag) '; xjHat=Alag'bjl; ~sqresXjO=sum(fX(:,jl-xjHat).~2); %Cornpute sum of squared

$residuals of xj given by applying ith Jcombination of 3shifts.

if ssqresXjO<ssqresXj %If the ith combination gives smaller asum of squared residuals, update the %minimum ssqresx j, j th row of B, and %jth row of shift matrix L.

ssqresX j=ssqresXjO; Bhat(j,:)=bj; Lhat(j,:)=Lpop(i,:);

end end

end

$ PLnchor each colurrn of shift values so chat ics aean is close r c z e r c . laqoffset=rocnd (mean iLhati i ; Lhat=Lhat-ones inbsI 1 ; -l&qoffse:; A=laq (A, laqcffse~) ;Ahac=A;

i Estimation of A BinvPrFm=pinv (Bhat ' ) ; for r=l:nfs

for j=l:nbs Alag=lag (Ahat, Lhat ( j , : 1 ) ; JShift A to get the estimate fo j th

%colurnn of X. Alag ( : , r) =zeros (nas, i) ; %Assign zeros so as partial out the

3rth factor contribution. xjrhat=X ( :, j 1 -AlaqCBhat ( j , : 1 ' ; %Subtracte the contribution of

Sthe other factors from X . s=-Lhat (j, r) ; SAssigr! "-" to al1 shifts for che inverse

$shiftinq of X or undoing the shifting of X . if s<O 5Unshifc X for rth column of A .

XrUnlag(:, j)=[xjrhat(l-s:nasl ;zeros(-s,l)]; else

XrUnlag(:, j)=[zeros(s,l);xjrhat(I:nas-s) j ; end

end Ahac i : , r) =XrUnlaqCBinvPrim( :, r ) ;

end

3Scale A and B so tnat the scale of data goes to A. Bscaleinv=diag (msq (Bhat) . Y-. 5) ) ; Bhat=3hatCBscaleinv;Ahat=Ahat*inv (Bscaleinv) ;

% Fit values Ares=A-Ahat; Bres=S-Bhat;msA=msq (A) ;msB=msq (BI ; AperChange=abs (Ares) 'diag(msA. A ( - . 5 ) ) *lOO; BperChange=abs (Bres) *dia9 (msB. ( - . 5) 1 *lOO; aChange~ax=max(max(AperChange));bChangeMax~ax(max[BperCnange~ 1 ; ssqresL=sum(sum( (L-Lhat) . " 2 ) ) ;

Two-wav FLS

3fS2WSïX F s t h e second subrcucine of the 2-way sh i fced f a c t o r prcqr=T, w h i 5 rs 3 ne s tod Ln rhe f i r s t subroucize D f Z w s i m . 5 3 Ya:l 2 9 , 1 9 5 7 , SungjLs Bcng 4 3 - - . , , , rS=. - 31; ;rnaxinum=5O;aChanqeMax=L;i:ChangeMax=l; Fnum=L; 3=0nes ( * s , 2:s: * . 5; outsavefil2name= [oucfilename, ' f 2 ' , star:, inc2scz ( n n s n u n ) , ' ..mt ' 1 ; d i sp ( [ ' S:ar= of 2-way Fracciocal Li?-e S e a r c h ' , i n t 2 s t r ! rncsnum) 1) ;

i 5 Lnum>rnaxi~un d i s p ( ' F l S has reached ics naximum number 3f iterations . ' )

3 Zst i ,nat ion of 3 f o r j=i :nbs

for r = l : n f s sh i f t vec=Lha t ( j , : ) ; S S h i f t s for j t h column of d a t a . s - sh i f zvec (r) ; % S h i f t for r t h f a c t o r in the jth s h i f t vecEot. d=D ( j , r) ; %?ic!c OUL the correspondinq i n t e r r a l f o r s h i f t

3 for f a c t o r r and column j. s v e c = [ s - d , s , s r d ] ; %Three possible s h i f t s o u t of which one w i l l be

3picked ouc. f o r v=1:3

s h i f t v e c ( r i = s v e c ( v ) ; Alap- lag in ter (A, s h i f t - r e c ) ; % S h i f t A buc i n t e r ? o l a c i n g with the

% f r a c t i o n a l p a r t of s h i f t va lces .

. SI=:<, . :, 1' ' -?iriv ! X a c ' ; x i i a r=Alag*3: '; ssqrosX:uoc ,-Y) =su..? ( (X : , : 1 -x:nat) . ̂ 2 ) ; 3 S m cf square r e s r d u a l s f o r

%=Se t h r e e snif: a l t e z x t i - r e s fcr 3the !ch and r th s h l f z .

FZ min ;ssqrosXjvec: ==ssqresX jvec (1 ) 3 i , ! 5 ; %If rninLpum sum of squarec! r e s i d u a l s i s ;iTzer. by t h e

3 E i r s t s n i f r , reauce the Fncervai t o h a l f =a preven: S it f rom gettzng d ive rge .

Lha= ; 3 , z : =s-d; 3-d chen, update ~ h e snift . e l s e i f e n (ssqresX jvrec) ==ssqzesX jvec :3j

a ( ; ,=;=.5*-; :ha= l f , r : = s i c ! ;

end enc X l a ç - l a s i z t s r ,A, :bat (:, : ! 1 ; 3 h a t : f , : 1 = X : :, : ' *;ix-r ( .Uzg) ' ;

end

3scaiei~v=diag(msq:3hac) - - ( - . 5 ) 1; aha t=ahac-Bsca le inv ; Mat=Ahac'inv(Bscaleinv); %Sca le Ahat and 3 h t .

9 Fit -ralues Ares=A-Ahat; S r e s = M h a t ; L r e s = L - L h a t ; v q (BI ; AperChange=ans (Ares) *diag (msA. A ( - . 5 ) 1 '100; BcerChanqe=abs (Bres) *dFag ( m s B - A ( - .5 1 ) W O ; aChangeMax=rnax (max (AperChange ) ) ; bChangeMax=max ssqresL=sum(sun( (L-Lhatl . ̂ 2) ) ; ssqresLorig=surn (sm( (Lorig-Lhat) . -2 1 1 ; corrAqax ( a b s ( c o r r 2 s e t (Aor ig ,Ahat ! 1 l ;corzB=max f o r f =1: nbs

s h i f t v o c = l h a t (j, : ) ;

33F3SIY-SVD runs the 3-way s u p e r iteritive quasi-ALS shifzed f a c z ~ r 3 ?raqran wlrh simufated data of v a r m g e t r c r Levels :ha= 3 oscinates calumns of A by using svd. j

3 i 9rLqinal: Ju=e 3, 1997, S u n g j F ~ 3cnç 3 Revis~cc: A l q u s t 1 8 , 97. % 3 [nas,zSs,ncs]=size (X!;

+Set i n i r i a i values tolB~=.05;maxiter=50;aChangeMax=I;bChangeMax=~;cChangeMax=l; iternum=l; L=zeros (Anbs, n f s ) ; L h a t = L ; AC3d=zeros (nas, ncs , nfs 1 ;AC3dShif ted=AC3d; .=3d=zeros (nas, ribs,nfs) ;-3dShifted~AB3d; XrVnskifted-zeros (nas, zBsI ncs) ; iagmi~=-5;lagmax=5;~=[1agmin:Lagmax] ';

bRearrange X for Mode 9 and C estimation. XforB=reshape (permuta (X, [1,3,2] 1 ,nas*ncs,nbs) ; XforC=reshape ( X , nas*nbs, ncs) ;

if izermmxnoxi:ez ciisp c ' 3-way 3" has zeached its max F z e r a ï i o ~ rmiber' !

end

3 Zsc.aa-;co - -. L A W A . of 3 arrd 5 f o r k=l:~cs iCsn3ine A and C :O De a nas*ncs*nfs 3 - w a y array as t2.e

3fixed ?ar t . AC?d::, k, : ) =A*diaq:C(k,:)) ;

sr.d f 3 f j=I:n8s

ssqrosXj=1Oa6; f o r s=l:langth!L?op)

f o r r=l:zfs snif==Lpop i s, r) ; i f s,'iift>=c)

AC3dSkif=od(:, :, r)=(zercs(snif:,ncs);AC3d!l:nas-sii1f:, : ) ; ; olse

AC3dShifred(:, : ,r) =[AC3d(1-si:ift:nas, :, z ) ;zeros (-shif=,~cs) j ; a n c

end ADkset=reshape !AC3dShiftedt nas*ncs, nfs) ; bj3atTeap=Xfor3( : , j 1 '*pinv(ADksetl ) ; Xj HatTemp=ADkset*b jHatTempv ; ssqresXjTernp=sm( ( X f o r S ( : , j 1 -XjHatTempl . * 2 ) ; if ssqresXjTemp<ssqresXj %If the sth combination gives a

3smaller ssqres, Lhat(j, :)=Lpop(s, : ) ; %Update jth row of Lhat, ahat ( j , : 1 =b jHatTemp; %Update j th row of Bhac, and ssqresXj=ss~resXjTemp; 3Reset ssqres for Xj with the

%temporal minimur,. end

end

end

3 F i t * ra lues nsA-msq (4 ) ;ms3=msq (3) ;rnsC-msq (Cl ; AperChanqe=a=s (X - .Ua t ) *diag(msA." ( - - 5 ) 'LOO;aChangeMax=mx(mx:A~erChançe) ! ; 3perChznge=aCs (3-3haz) -diâg (=B. A ( - . 5 1 1 L O G ; bChangeMaxtmax (max (3pezChanqe 1 1 ;

C?erCSange=abs ( C-Char! *dia9 (rnsC. ( - - 5 ) '100; cCSanqeMax=max (max (C?erCharqe) :I ;

nsqresL=sun(sum( ( L-5,iat) . ̂2) i / (nbs'nfs) ; a~qresLorig=sum(sum((Lorig-Lhat).~2) ) / ( n b s * n f s ) ; carrA=n~ax!abs (corr2set (Ahat ,Aor ig) ) 1 ; corrBqax (ans (corr2sec (Bhat,Borig) l l ; corrc-max (abs (corr2set (Chat,Corig) ) ) ; f o r j=l:ibs

for r=l:nfs s h i f r = L h a t ( j , r) ; if shif=>=O

A s n i f t e d ( : , r ) = [ t e r o s (shift,l) ;Ahat{I:nas-shift, r) 1; e l s e

Ashiftedi:, r)=IA9at(l-shift:nas,r);zer~s (-shift, 1) 1;

end

3 3 3 3 i S % ? i % b 3 3 3 3 3 3 3 3 % i 3 3 % 0 3 3 3 3 3 END OF 3C3SI:d SVD 3 3 % 3 % t 3 3 4 3 5 3 % 3 3 i % ! ; 3 j ' f 7 1 + 3 3 % i i -

Three-wav FLS

5FS3ST!! ?.KI is the ç e c m d sub rouc ine ~f zhe 3-way X S s h i f t e d f a c t z r - 3 z r z g r m , whic? estimates columns of A by ~ s i n g svd. 3 3 Original: J w e 3, 1997, Sungjin Hong 3 Xe-ris~an: 4ugrst 19, 9 7 . 's 3 - ,,,?S=. - 7 01;maxiter=50; i=er?.um=I; 3 ~ V s e ~ = t e r o s ( 3 , n f s ) ;.ADkrVset=teros ( n a s ' n s , n fs ) *. 5 ; ou~saveEilename=[outfL1enme, ' f3svddr , int2str(nfsj, 'd- ' , . . -

s t a r t T ~ e c ( s t a r ~ ) , Fnt2str (runsnum) , ' .ma t ' ] ; dis? i [ ' S t a r t of 3-way FLS-SVD ' , e r r o r l e v e l , ' % ' , star tvec ( s t a r t l , . . .

'- ' , i nc2sc r (runsnum) 1 ) ; disp i ' No MSC RSQ A chng B chng C chng MSQ L MSQ LO A cor=

a c o r r C c o r r ' ) ;

if i~ernum>rnaxi ïer d isp( '3-way FLS has reached its i t e r a t i o n m a x ' i ;

end

3 Ancnor each z o i ~ n n o f shifts so t h a t i z s m e a n Fs CLOSE ta zer3 . laçoffse~=round(mearl i L h a t 1 1 ; Lhat=Lhat-cnes (nbs , 1) * l a g o f f s e ~ ; A=lag (4, lagof fsec) ;Snat=A;

3 E s t i m a t i o n of C for j=l:nbs

A B 3 d ( : , j, :l=Aediag(Bhatij, : ) 1 : end f o r j=l:&s

for r-l :nfs shFf~=Lhat: j, z ) ;shiftint=fIoor(shift} ;shifcrem=shift-shiftint: if s h i f t i n c > = O

aj rShiftod=[=eros (sniftint, 1) ;squeeze (AB3d (1:nas-

3 TL: v a i x e s asA=sm!A. "21 /nas ;mB=sum(3 . -21 /nbs;nsC=suniC. - 2 : / x s ; X p r C h a n ç e = a ~ s (A-.XiatI *diag (msA."( - .5 i 1 * 190; aChangeKax=ma:i [nax ;A?ezChariqe) ; ; 3perChange-a~s ( 8 - m a t ' d i a g ( m s B . (- . 5) -100; bC!Ia~geMax=rnax (max (3pezChafiqe) 1 ; CperChange=aCs ( C-Chat) *diag (msC - ( - . 5) 1 'x (max ( Q e r C h a ~ q e 1 1 ; msqresL=sum~swn( (2-Lhatl . & 2 ) / {nbs-nfs) ; nsqres lor ig=sum(sum( (Lorig-Lhat) . -21 1 / (nbsWnfs) ; corrA-max (abs ( c o r r 2 s e t (Ahatf Aor ig ) ) 1 ; corrB-max (abs ( c o r r 2 s e t (Bhat , Bo r ig ) 1 ) ; corrC=max ( a b s ( c o r r 2 s e t (Cha t f Corig) 1 1 ; for j=l:nbs

shiftvec=Lhat ( j , : 1 ; s h i f t i n c v e c = f l o o r (shiftvec) ; shiftremvec=shiftvec-shiftintvec; for r=l:nfs

s h i f t = s h i f t i n t v e c ( r ) ; i f s h i f r > = O

A s h i f t e d ( : , r ) = [ z e r o s ( sh i f t ,L ) ;Aha t ! I : na s - sh i f t , r l 1 ; e l s e

MPENDIX G. Mean Shift Offset and Shifi Outliers

An underestimation has been suspected to look the recovery correlations of the

shift estimates much smailer than those of the standard factor loadings (A, B. or C). [f the

Mode B loadings corresponding to the shifi outliers are not small, the deviation of most

shifts by a constant amount must be considered meaningful rather than an artificial offset.

This is because the contribution of a particular factor to the fit and residuals for a column

of data is also a function of the corresponding B weight. To see whether or not the shifi

deviation by a constant amount was an artificial offset, the solutions of the "Hurnp" data

with 5% e m r were investigated. Eight out of wenty columns of shift estimates were

involved in the mean offset. As an example, one case is provided in Table 7 to show how

the mean offsets compensated for the outliers to make the rounded rnean of shifts zero,

without a big change in the model fit. In Table 7, the a11 shift estimates other than those in

row 9.10, 13, 15, and 16, are deviated from their true values by 1. In the five rows where

the shift deviates are not 1. the estimates of Mode B loadings are al1 small as expected,

except row 16. Even though the loadings at row 16 are not small, their effect on the

model fit and residuals will be relatively small because the amount of shift deviation at

row 16 is small (-1). This example well shows that the recovery correlation for A on that

factor has been underestimated. When the mean offset of shifts was corrected, the

recovery correlation for A increased fkom 0.9461 to 0.9893. 'The eight columns of A out

of 20, where the mean shift offset is present, were then corrected by their mean shift

offset. M e r the correction, the number of the best and matching solutions has increased

i?6$'T 6 Z P 9 . 1 OOPO'T LZTZ'T ZLEL 'O 08TT-O OLL9 ' 0 9ETO'O - C ::6?": C C C . C I - 4 - € ' 5 tP60 ' 0 0190 'O ? g t - z

94

f?om 3 to 6 (out of 10 solutions) and the recovery correlations of A from -9388 and -9982

to .9877 and .9970, respectively for each factor. Note that the recovery correlations are

only about the best fining "and" matching soIutions. The reason why the second recovery

correlation goes down is that the second factor in the initial 3 unique solutions were not

involved in the mean shft offset.

A few shift outlien seern to have made the recovery correlations of shifts

underestimated by an considerable amount, even though their coaesponding Mode B

loadings m u t have been relatively srnaIl. The number of unique solutions reported in

Table 2 rnay also have been underestimated for the same reason. The mean offset of

shifts must be the major reason why the recovery correlations for the shifts are much

smaller than the recovery correlations for A and B. When the recovery correlations are

low both for factor loadings (A and B) and for shifts (S), the low recovery correlations

for shifis might be partly because of the artificial rnean shift offset and parrly because of

the di fficult shape of sequential factors such as the "Sine" curves. This might be the case

in the results of the "Sine" data because the recovery correlations for factor loadings are

lower than those of the "Hand" and the "Hump" data. To veriS, this will need further

investigation.

r I V I M U C C V H L U H I I V I Y

TEST TARGET (QA-3)

APPLIED - IMAGE. lnc = 1653 East Main Street - -. . , Rochester. NY 14609 USA -- -- , - Phone: 716/482-0300 -- -- - - Fax: 71 6/200-5989

O 1993. Appiied Image. Inc.. Ail Rights Reserved

FACTOR A TEST ANDcollectionscanada.gc.ca/obj/s4/f2/dsk2/ftp01/MQ28585.pdf · 2005. 2. 12. ·...

Documents

Transcript of FACTOR A TEST ANDcollectionscanada.gc.ca/obj/s4/f2/dsk2/ftp01/MQ28585.pdf · 2005. 2. 12. ·...