pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On...
Transcript of pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On...
![Page 1: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/1.jpg)
NONPARAMETRIC METHODOWGY FORINCORPORATION OF SURROGATE
IN CLINICAL TRIALS
byHabib Elias EI-Moalem
Department of BiostatisticsUniversity of North Carolina
Institute of StatisticsMimeo Series No. 2149
July 1995
![Page 2: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/2.jpg)
NONPARAMETRIC METHODOLOGY FOR
INCORPORATION OF SURROGATES
IN CLINICAL TRIALS
by
Habib Elias EI-Moalem
A dissertation submitted to the faculty of the University of North Carolina at Chapel
Hill in partial fulfillment of the requirements for the degree of Doctor of Philosophy
in the Department of Biostatistics.
Chapel Hill
1995
![Page 3: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/3.jpg)
~ABIB ELIAS EL-MOALEM. Nonparametric methodology for incorporation of
surrogates in clinical trials. (Under the direction of Dr. Pranab Kumar Sen.)
ABSTRACT
In clinical trials, as well as in medical research, it is often the case that the
variables of interest, known as true endpoints, are either hard to measure or are too
costly, or simply require considerable time for completion. Hence we search for end
points that are correlated with the true endpoints and that are easier and less costly
to measure or perhaps can be measured at an earlier time. These endpoints are called
surrogate endpoints. There is a controversy over the definition of a surrogate endpoint
due to the lack of formal methodology for making inference regarding a true endpoint,
Y, when a surrogate,S, is used intstead.
Prentice (1989) defined a valid surrogate as an outcome variable that would
"yield a valid test of the null hypothesis of no association between treatment and
true response". Thus, the surrogate should be informative about the primary end
point, and fully capture the effect of treatment on true endp?int.
A surrogate is used in this work to mean a substitute for a true response. It
differs from a surrogate as a substitute for a true covariate dealt with in measurement
error or in Latent-class models Sen (1992) considers design and analysis of a general
class of incomplete multiresponse designs (imd) that take into account use of sur
rogate variables. He formulates appropriate rank based procedures to test the null
hypothesis of no treatment effect in a robust manner using only one true endpoint
and one surrogate.
The purpose of this study is to generalize Sen's approach into a multi-response,
multi-surrogate setup. A vector of primary response variates is partitioned into vari
ous subsets on which measurements are obtained for different number of experimental
units in each subset. A vector of surrogate response variates is partitioned simillary.
The set of the validation samples consists of measurements on a subset of primary
11
![Page 4: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/4.jpg)
~ well as surrogate responses taken along with concomitant variates. On the other
hand, the set of surrogate samples consists of measurements on a subset of the sur
rogate variates along with covariates. Hence, we have an incomplete multiresponse
design scenario. A rank-based test is developed to detect differential treatment effects
on primary variates while incorporating information from surrogates also. The test is
developed in two design settings: the randomized block design, and the balanced in
complete block design. In the latter recovery of ineterblock information is considered
in detail.
III
![Page 5: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/5.jpg)
Acknowledgement
I thank God for giving me this wonderful oportunity to study at the hands of masters
of statistics. I thank my advisor Dr. Pranab Kumar Sen for inspiring me and for his
constant encouragement. I thank all my committe members Drs. Q~ade, Kupper,
Helms and Vine for their helpful comments and syggestions. Also, I am indebted
to Dr. Bahjat Qaqish for all the training he gave me during my employment as a
graduate research assistant at the Linberger Cancer Center. The experience I got
by working on consulting projects is invaluable. Many thanks go to the staff in the
Department of Biostatistics for their technical support; in particular I thank Betty
Pounders and Betty Owens for the administrative help and Bruce Walter and Corey
McEntyre for their computing support. I also thank my fellow students, particularly
my past and present office mates: Ralph Demasi, Joseph Galanko, Antonio Pedroso,
and all the other wonderful fellow students; they have been a constant source of en
couragement and given me positive feedback throughout my stay here. I am thankful
to my brother Basim and my sister Rose and my brother in law Basil for helping me
financially and emotionally.
IV
![Page 6: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/6.jpg)
Contents
1 General Introduction 1
1.1 Introduction. . . 1
1.2 Literature Review . 3
1.2.1 Surrogate Endpoints: Uses and Abuses 3
1.2.2 Prentice Criteria for Surrogacy .... 5
1.2.3 Designs of Clinical Trials with Surrogate Endpoints 7
1.2.4 Parametric and Semi-parametric Approach ..... 10
Advantages and Disadvantages of the Parametric and Semi-
Parametric Approaches 14
1.2.5 Nonparametric Approach. 15
Hypothesis Testing 16
Estimation. . . . . 24
1.3 Synopsis of The Work Done 28
2 Methodology In Randomized Block Design (RBD) 33
2.1 Introduction . . . . . . . . 33
2.2 Randomized Block Design 34
2.2.1 RBD For The Surrogate Set 1 34
2.2.2 RBD For The Validation Set 1* . 43
2.2.3 RBD For The Set 1° . . . . . . . 45
2.2.4 Construction of the Test Statistics 46
VI
![Page 7: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/7.jpg)
2.2.5 Asymptotic Non-null Distribution of LO· 47
3 Nonparametric Intra-Block Inference 51
3.1 Balanced Incomplete Block Designs 51
3.1.1 BIBD for Surrogate Set 1 . 51
3.1.2 BIBD For The Validation Set 1* 63
3.1.3 BIBD For The Set 1° . . 64
3.2 Construction of the Test Statistic 65
3.3 Asymptotic Non-null Distribution of LO· 67
4 Recovery of Inter-Block Information (RIBI) 71
4.1 RIBI for Surrogate Set 1 72
4.1.1 RIBI For The Validation Set 1* . 81
4.1.2 Construction of The Test Statistics 82
4.1.3 Asymptotic Non-null Distribution of LO· 83
5 General Case of a Vector of Primary Variates 87
5.1 Intra And Inter-Block inference . . 88
5.2 Construction of The Test Statistic . 89
5.3 Asymptotic Non-Null Distribution of LO· 90
6 Properties of The Test Statistics And an Illustration 92
6.1 Introduction............. 92
6.2 ARE for the Complete Block Case. 92
6.3 Example: The Effect of Zidovudine on Survival in Patients with AIDS 97
6.4 Further Research . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 102
VII
![Page 8: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/8.jpg)
Chapter 1
General Introduction
1.1 Introduction
In clinical trials, as well as in medical research, it is often the case that the vari
ables of interest, known as true endpoints, are either hard to measure, are too costly,
or simply require considerable time for completion. Hence a search for endpoints that
are correlated with the true endpoints and that are easier and less costly to measure,
or perhaps can be measured at an earlier time, results in what are called surrogate
endpoints. There is controversy over the definition of a surrogate endpoint due to the
lack of formal methodology for making inferences regarding a true endpoint, Y, when
a surrogate,S, is used instead.
Prentice (1989) has defined surrogacy and established some operational criteria
with which to choose a surrogate outcome variate in the context of clinical trials.
Basically Prentice (1989) defined a valid surrogate as an outcome variable that would
"yield a valid test of the null hypothesis of no association between treatment and
true response". This simply means that for a surrogate to be a valid substitute for
a true endpoint, the treatments being studied should affect the true endpoint only
through the surrogate. In other words "the surrogate endpoint must have precisely
the same relationship to the true endpoint under each of the treatment strategies
1
![Page 9: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/9.jpg)
b.eing compared".
Pepe (1992) considered settings where, for a random subsample of the subjects
being studied, some validation data are available to study the association between
one true endpoint and one surrogate variable. She considers a completely randomized
design (treatment wise). Let P{3(Y I Z) denote the regression model relating the true
endpoint Y to the covariate Z. Pepe showed that the maximum likelihood estimate
of (3 is nonrobust to misspecification of the conditional distribution of the surrogate
X giventhe unobserved true outcome Y and the covariate Z, P(X IY, Z). Moreover,
she considers as an alternative a semi-parametric model where she lays no structure
on P(X I Y, Z). This partial likelihood approach leads to an estimate of f3 which
behaves much better than the fully parametric estimate, yet remains sensitive to any
departure from the structure imposed on P{3(Y I Z) which would affect the validity
and efficiency of the statistical analysis.
Sen (1994) considers design and analysis of a general class of incomplete multire
sponse designs (IMD) that take into account the use of surrogate variables. The Pepe
(1992) setup is a special case of an IMD with one primary response variate and one
surrogate. The model that Sen considers here is a nonparametric model that avoids
the stringent conditions needed in a parametric or semi-parametric model. He formu
lates appropriate rank based procedures to test the null hypothesis of no treatment
effect in a robust manner using only one true endpoint and one surrogate. More
over, he tackles the problem of estimating the conditional regression quantiles of the
conditonal distributions (given concomitant variates) of the primary and surrogate
variables.
The purpose of this study is to generalize Sen's approach into a multiresponse,
multisurrogate setup and to study the properties of the proposed nonparametric pro
cedures in both finite and infinite samples. Although the focus is on use of surrogate
variables for primary response variables, covariates may also need surrogates. There
has been much work published recently on the so called measurement error models
2
![Page 10: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/10.jpg)
for continuous as well as discrete covariates. For instance, see Kupper (1984), Fuller
(1987), Carroll (1989), Carroll and Ruppert (1988), Pepe and Fleming (1991), Satten
arid Kupper (1993) among others.
1.2 Literature Review
1.2.1 Surrogate Endpoints: Uses and Abuses
The term surrogate comes from the Latin 'surrgare' meaning 'to substitute'.
Thus a surrogate endpoint simply means a substitute measure for some other variable.
Ellenberg and Hamilton (1989) suggest that "investigators use a surrogate when the
endpoint of interest is too difficult and/or expensive to measure routinely and when
they can define some other, more readily measurable endpoint that is well correlated
with the first to justify its use as a substitute". They present some surrogate endpoints
for survival time in cancer studies such as tumor response, time to progression, or
time to reappearance of disease. However, Piedbois, et al. (1992) warn that tumor
response should not be considered a valid surrogate endpoint for survival in patients
with advanced colorectal cancer.
Wittes, Lakatos and Probstfield (1989) define a surrogate endpoint in cardiovas
cular clinical trials as an "endpoint measured in lieu of some other so-called 'true'
endpoint." Ejection fraction is given as an example of a surrogate for total mortal
ity. Hillis and Seigel (1989) define a surrogate as "an observed variable that relates
in some way to the variable of primary interest, which we cannot conveniently ob
serve directly." In clinical trials that study ophthalmologic disorders, they examined
the use of the "status of one eye as a surrogate for the (unobservable) status of the
opposite eye in the same individual."
Machado, Gail and Ellenberg (1990) mention the use of CD4+ lymphocyte levels
as surrogates for the development of Acquired Immune Deficiency Syndrome (AIDS)
or death. However, Choi, Lagakos, Schooley and Volberding (1993) conclude that
3
![Page 11: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/11.jpg)
CD4+ lymphocytes are an incomplete surrogate marker for progression to AIDS in
asymptomatic HIV infected persons taking Zidovudine. Gruttola, Wulfsohn, Fischl
and Tsiatis (1993) observe that although CD4 lymphocyte counts are associated with
improved survival in patients with AIDS and AIDS-Related Complex, "they account
for only a small portion of the survival benefit of Zidovudine". Lagakos and Hoth
(1992) echo the same conclusion. Baccheti, Moss, Andrews and Jacobson (1992)
found that CD8 counts and changes in hemoglobin and WBC during therapy could
add independent predictive power to the CD4 count and thus would produce more
valid and useful surrogates than the CD4 count alone.
The urgency for developing cures for terminal illnesses, such as cancer and AIDS,
coupled with the long period of patient follow up necessary to obtain valid data for
estimating survival time has led scientists to consider surrogate endpoints that could
be useful for several reasons. Clinical trials that use a surrogate endpoint require
a much shorter follow up time than a true endpoint. Surrogates often require less
invasive techniques of measurement than true endpoints, and hence are easier to
measure. Sometimes a disease might be rare, which makes study of the true endpoint
very difficult.
These benefits should not overshadow the possible pitfalls that abound in using
surrogate variables. Fleming (1992) writes that "one rarely can establish that sur
rogate endpoints are valid". DeMets, in his commentary on Fleming (1992), refers
to the reliance on surrogate markers as "the most disturbing, even threatening, issue
today in clinical trials". Prentice (1989), after generalizing his operational criteria,
mentions that "in spite of the hope for such extension I am somewhat pessimistic
concerning the potential of the surrogate endpoint concept, as it is interpreted in this
paper." Ellenberg (1991) emphasizes that there should be a "strong biological ratio
nale" for a valid use of surrogate markers in AIDS trials, added to a strong predictive
ability of survival time at any given point in time.
The fact that treatment might affect the 'true' clinical outcome through path-
4
,
..
..
![Page 12: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/12.jpg)
'Yays or biological processes other than the surrogate marker a raises lot of doubt
as to the markers' validity in predicting the true outcome. In such a situation, false
positive conclusions are misleading. A recent example often cited in the literature is
provided by the Cardiac Arrythmia Suppression Trial (CAST, 1989). Arrhythmias
were accepted as a surrogate for sudden death since it was believed that suppression
of arrhythmias would reduce sudden deaths. The FDA approved the antiarrhythmic
drugs, ancainide and flecainide, and these were widely used. Later on the CAST trial
established that these drugs nearly tripled the death rate relative to placebo.
False negative conclusions might also arise when treatment has no effect on
the surrogate marker but is beneficial with respect to the true clinical endpoint.
Thus reliance on the surrogate marker in this case would deprive patients of effective
treatments. Fleming (1992) cites the Chronic Granulomatous Disease (CGD) clinical
trial. Gamma interferon led to noticable reduction in the rate of serious recurrent
infections in CaD patients, while there was no detectable effect on either superoxide
production or bacterial killing, both of which were used as surrogates for the life
threatening infections.
Thus, Machado et al. (1990) say that understanding the mechanism of action of a
new treatment is an important a priori condition for selection of a surrogate endpoint,
and whenever such mechanisms are unclear, data on the true endpoint seems to be
the essential source of information. However, they add that "assuming that other
agents in the same class have similar mechanisms of action, it may be possible to rely
on surrogate endpoints in later studies to evaluate these agents" ..
1.2.2 Prentice Criteria for Surrogacy
In light of the above complications of choosing a valid surrogate, formal statistical
methods to aid in this process are urgently needed. Unfortunately, they are also
scarce. Prentice was the first to adopt criteria for valid surrogacy. He defines a
surrogate endpoint as "a response variable for which a test of the null hypothesis
5
![Page 13: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/13.jpg)
~f no relationship to the treatment groups under comparison is also a valid test of
the corresponding null hypothesis based on the true endpoint." The term valid here
means that both hypotheses are parallel in the sense they would either both reject
the null hypothesis or both would accept it. Note that the surrogate endpoint in this
definition depends on the treatments being compared, and cannot be assumed to be a
universal substitute for the true endpoint. In symbols, this means that if t denotes the
time of enrollment in a clinical trial, and if Y denotes a true time-to-failure endpoint,
and if we let z = (Zl,' .. , zp) be indicators for p of the p+ 1 treatments to be compared
with respect to the corresponding hazard functions Ay(t I z), then with X(t) as the
candidate surrogate we should have
P(X(t) I z, F(t)) - P(X(t) I F(t)) {:} Ay(t I z) - Ay(t) (1.1 )
where F(t) consists of the failure and censoring histories prior to t for the true end
point, and P(.) stands for the probability distribution. This means that for a surro
gate to be valid, a test of the null hypothesis of no treatment effect on the surrogate
should be parallel to the test of no treatment effect on the true endpoint. In other
words, the two tests would either both reject or both accept the null hypothesis.
Prentice established two criteria to check the validity of (1.1), namely
Ay(t IX(t), z) =Ay(t IX(t)) (1.2)
which states that the surrogate fully captures the effect of the treatment z on Y, and
Ay(t I X(t)) -I Ay(t) (1.3)
that is, the surrogate is informative about Y . .In order to establish (=» in (1.1), it is
necessary in addition to condition (1.3), to restrict the class of alternatives to the null
hypothesis of no treatment effect on the surrogate endpoint to "those for which the
treatment effects on the surrogate response distribution have some impact on average
true endpoint risk".
Although time to response was the true endpoint here, Prentice (1989) considered
6
![Page 14: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/14.jpg)
generalizations where the true response was a stochastic process {Y(t), t 2:: O}, where
t is the time from begining of follow-up, and Y(t) = {Yi(u), Y2(u),···; 0 ~ u < t}.
Moreover, instead of considering comparison of two or more treatments in a clinical
trial setup, a surrogate response is sought in order to replace a true response in
"respect to its dependence on some general exposure or covariate history". Thus if
we denote the possibly time-dependent covariate by W(t) = {W1 (u), W2(u),···; 0 ~
u < t}, then condition (1.2) can be extended to
Pr{Y(t) IX(t), W(t), F(t)} =Pr{Y(t) IX(t), F(t)}. (1.4)
It is well known that in actual practice, condition (1.2) rarely is known to hold
since it is hard to verify or simply is not true. Moreover, measurement error in the
surrogate variable may invalidate criteria (1.2 or 1.4) and (1.3). It is possible also to
use a later response as a surrogate for an earlier true endpoint if the latter is hard to
measure, and the same criteria above can be employed to validate surrogacy.
1.2.3 Designs of Clinical Trials with Surrogate Endpoints
The information gathered in a clinical trial usually involves a number of re
sponses, some of which may be more important than others, and these are what Sen
(1994) calls primary variates. It is essential that the statistical relationship between
the primary variates and the surrogate endpoints be studied before any inference is
made about the treatment effects on the primary endpoints. But then it is often
difficult and/or costly to record data on all the primary variates (as well as surro
gate variables and concomitant variables) for all experimental units, thus creating
the need for simultaneous measurements of the primary and surrogate variables for
different subsets of experimental units. This is the genesis of incomplete multire
sponse designs (IMD) in clinical trials, as Sen (1994) has so elaborately explained.
In situations where the primary variates can be ordered in terms of their importance
or relevance, a hierarchical design (HD ) can be adopted as follows.
7
![Page 15: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/15.jpg)
Assume that Y = (Yl ,"', Yp ) is a vector of response variates such that Yl is
the most important primary variate, and the other responses have a decreasing or
der of importance. Assume also that Yo is a q-vector of surrogate variates that
may be recorded for all subjects with relative ease. If S = So denotes the set of all
experimental units, then a possible hierarchical scheme would be:
S = So ::J Sp 2 ... 2 SI, (1.5)
where on SI, all the p + q responses are recorded, on S2 \ SI (Y2,"', Yp and Yo)
are recorded, and so on; on Sp \ Sp-l, Yp and Yo are recorded, and on So \ Sp only
Yo is recorded. Here Sp \ Sp-l means all elements of Sp that are not in Sp-l' Data
pertaining to treatment as well as design variates are also recorded in all of the above
subsets. Now for the specific subsets,
(1.6)
efficient designs {D} like those discussed in chapter 9 of Roy, Gnanadesikan and
Srivastava (1971) may be adopted to draw statistical conclusions.
However, sometimes all the primary variates can be equally important and no
ordering may apply. Here the hierarchical design loses its appeal and other response
wise incomplete blocking may be applied to Y. To see this, Sen (1994) formulates
a general class of IMDs in the following manner. Let P denote {I"", p} and
.consider the totality of 2P subsets of P, defined by i r = {il,"', iT}' for all possible
1 ~ i l < '" < iT ~ p and r = 0,1"" ,p; io = O. Consider a proper subset Po of P,
determined by clinical and other factors, such that:
Po = {ir : (r,ir ) E lo}. (1. 7)
where 10 is the set corresponding to Po. Then the set of all experimental units is
partitioned into a system
S(Po) = {Sir: (r, ir) E PO}.
8
(1.8)
![Page 16: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/16.jpg)
w;here for the subset Sir' the primary variate(s) Yi l ,· •• , li r along with the surrogate
variates Yo and design and concomitant variates are recorded. For the subset Sir' an
appropriate design Dir (for the treatment as well as design variates), can be adopted
leading to the design sets
D(Po) = {Di r : (r,ir ) E Po}. (1.9)
Thus, an incomplete multiresponse design that incorporates surrogate endpoints can
be formulated in terms of the dual design sets, responsewise design set and treatment-
wise design set, namely
{S(Po), D(Po)}. (1.10)
Sen (1994) stresses that the choice of optimal designs of the type (1.10) may depend
heavily on the cost and/or difficulty of measuring the responses, whereby a cost
benefit approach will be needed.
In setups where there is more than one surrogate endpoint, and data, on all of
them for all experimental units cannot be recorded easily, Sen (1994) suggests a more
general class of IMD designs. Here the set P will be replaced by
P* = {1 ... p·1 ... q}, '" , , (1.11)
and ir by (ir , is) = {i l ,···, iT; i~,·.·, i:}, for all possible 1 ~ i l < ... < iT ~ p,
1 ~ ii < ... < i: ~ q; 1 ~ r ~ p, 1 ~ 5 ~ q, and include io or ioin this system. Also
Po is replaced by Po = (ir , i~) : (r, 5, iT' i:) E 10. Then (1.8) is extended to
S(Po)= {S(ir,is) : (r, 5, ir , is) E 1O}, (1.12)
where in the subset S(ir,is)' the primary response variates Yill ···, Yir , and the sur
rogate response variates Yi~, ... ,Yi~ are to be recorded on the experimental units.I s
However, Sen (1994) does not foresee usage of such general designs since it requires
large cardinality of S, and a sophisticated statistical analysis that may not be ap
pealing to the clinical scientist.
9
![Page 17: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/17.jpg)
1.2.4 Parametric and Semi-parametric Approach
Prentice (1989) used his criteria to draw inferences based solely on surrogate
data. The design that he considered is a special case of an IMD wit?- p = 1, q = 1;
Po refers to the set of no primary index and 5(Po) = 50 is the entire set of experimen
tal units where only the surrogate variate is recorded. Pepe (1992) considers settings
where some validation data on both the true endpoint Y and the surrogate variable
5 is available. The design here is also a special case of an IMD with p = 1, q = 1
and io = {O}, h = {I}. Thus one subset contains data only on the surrogate variate,
while the other subset contains both the primary and surrogate variates (validation
sample).
Assume that P/3(Y I Z) is the regression model that relates the true response
to the vector of covariates Z, and let P/3,o(X I Y, Z) be the model that relates the
surrogate X to Y and Z. A validation subset V of experimental units on which Y,
X, and Z are recorded is considered so that the strength of the relationship between
X and Y could be studied. Pepe (1992) discusses two approaches in order to draw
inferences about (3.
In the first approach, inference for the true outcome is based on maximum likeli
hood theory using a parametric model for P(X IY, Z). Maximum likelihood estimates
of (3 and () are based on the likelihood
L((3, ()) = II P/3(Yi I ZdP/3,o(Xi IYi, Zi) II P/3,o(Xj I Zj),iEV jEV
where V indicates the nonvalidation set of observations on which only the surrogate
and concomitant variables (as well as design variates) have been measured, and where
P/3,O(X I Z) = JP/3(y I Z)P/3,o(X I y, Z) dy.
Pepe (1992) demonstrates by an example that the maximum likelihood estimate of (3
is very sensitive and nonrobust to misspecification of P/3,o(X IY,Z).
In the second approach, no structure on P(X IY, Z) is imposed while P(Y I Z)
10
![Page 18: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/18.jpg)
i~ still parametric in nature, hence the method is termed semi-parametric. Since
P(X I Y, Z) is nonparametric in nature it can be assumed to be independent of ;3.
If zs denotes the components of Z which are thought to be informative with respect
to the association between X and Y, then an empirical estimate of P(X I Y, ZS) is
found using the validation sample:
where in the discrete case
1- ~ I[X· = X 1': = Y ZS = ZS]vL...J t ,t 't ,
n iEV
~ EI[Yi = Y,Z: = ZS]n iEV
Here 1[.] is the indicator function and n V is the number of subjects in the validation
sample. If X, Y, or ZS is continuous, then suitable smooth kernel type estimators of
the probability ~ensities P(X, Y, ZS) and P(Y, ZS) are used for those components.
Define P(3(X I Z) = f P(3(Y I Z)P(X I y, ZS) dy, or the corresponding sum if Y
is discrete. The inference about ;3 is based on the estimated likelihood
L(;3) = II P(3(Yi I Zi) II P{3(Xj I Zj)iEV jEV
which is an estimate of
L(;3) = II P{3(Yi I Zd II P{3(Xj I Zj)iEV jEV
(1.13)
had P(X I Y, Z) been known exactly. Pepe (1992) showed that, provided the valida-
tion sample fraction nV/ n has a nonzero, positive limit pV, and the usual regularity
conditions of Cox and Hinkley (1974) hold for both P{3(Y I Z) and P{3(X I Z), the
maximum estimated likelihood estimate ~ converges in distribution to a normal ran
dom variable with zero mean and a variance made of two components. The first,
I- 1 (;3)/n, is the variance of the maximum likelihood estimate based on (1.13), and
the second, K(;3), is the penalty induced by estimating (1.13) which is a decreasing
11
![Page 19: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/19.jpg)
f':lnction of the validation sample fraction pl/.
Although it is true that in large samples there is no loss in using uninformative
nonvalidation set data in the analysis of validation set only, the use of informative
surrogates does increase the efficiency over estimates based solely on true outcome
data. The estimated likelihood method is fully efficient if the surrogate is perfect,
that is if P(Y I X, Z) = 1. In other words, if X is perfect then the variance of the
maximum estimated likelihood estimate is equal to the variance of the maximum like
lihood estimate based on true outcome data for all subjects and hence the surrogate
would be as informative about (3 as the true endpoint. Pepe (1992) showed that when
(3 is a scalar parameter, the maximum estimated likelihood estimate is more efficient
than the maximum likelihood estimate based on the validation set alone if and only
if the validation fraction nl/ In is greater than !.Fleming (1992) discusses use of biological markers as auxiliary variables rather
than surrogates in order to strengthen the clinical efficacy analyses and to avoid
the risk of making false conclusions when surrogate endpoints are used. Three ap
proaches are reviewed to deal with auxiliary variables: "variance reduction", "aug
mented scores", and "estimated likelihood". The variance reduction approach was
explored by Kosorok (1991), and the latter two approaches by Fleming (1992). In
all three approaches, two conditions are needed for efficient analyses employing aux
iliary variables: in the first the auxiliary variable and the true endpoint should be
highly correlated; and in the second "one pool of patients having longer follow-up,
and another pool of patients with auxiliary information but with relatively short-term
follow-up on the clinical endpoint".
In their commentary on Fleming (1992), Farewell and Cook discuss their method,
which estimates the correlation p between the surrogate and the true endpoint from
the asymptotic covariance matrix of Lin (1991) and uses it for appropriate weights of
the test statistics. To see this, let VI (t) and V2 (t) be the standard log-rank statistics
at time t for the treatment effect on the surrogate endpoint and the true endpoint,
12
![Page 20: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/20.jpg)
r~spectively. A global test statistic is defined as
R(t) = pl(t)Vi(t) +P2(t)V2(t),
where Pl(t) and P2(t) are possible data-dependent weights that may be of the form
Pl(t) jl + p(t)
P2(t) j2 - p(t)
where jl and i2 are chosen to reflect the relative weighting of the marginal test
statistics assuming zero correlation. Their simulation studies found that even with
such weights, possible discordant treatment effects hamper the validation of surrogacy.
Also, p was found to be insensitive for assessing a surrogate candidate, whereupon
the weights simplified to a proper choice of jl and h.
Louis, in his commentary on Fleming (1992), postulates a parametric model that
relates a surrogate (X) to treatment (Z) and a true endpoint (Y) as follows
Pe(Z, X, Y) = P(Z)Pe(X IZ)Pe(Y I Z, X).
-Now, condition in two different orders and use the Fisher information decomposition
in Louis (1982) to get
Equating the right-hand sides and solving for I(TIZ) (0)
The term in square brackets determines the cost/benefit of using a surrogate. If it
is negative, the surrogate will be more efficient than the true endpoint in drawing
inferences about O. Otherwise, if it is positive and one still insists on using the
surrogate a larger sample will be needed.
13
![Page 21: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/21.jpg)
4dvantages and Disadvantages of the Parametric and Semi-Parametric
Approaches
A sketch of the general theory and statistical analysis of IMDs is f~)Und in Mona
han (1961), Srivastava (1966), and Srivastava (1968). In multiresponse situations the
multivariate general linear model (MGLM) is often adopted when there are no surro
gate variables. Two assumptions in an MGLM are questionable here. The response
variables as well as the error vectors are assumed to be multivariate normal. These
are risky assumptions that cannot be verified in practice. Usually, one checks to make
sure that the marginal distributions are univariate normal. If they are found to be
skewed, appropriate transformations are made to bring about symmetry. But then to
claim that the transformed variables are multivariate normal would not be reasonable.
In addition, sometimes some of the response variables may be categorical in nature
and they would then require other treatment, for example logistic regression. In such
a case the MGLM cannot be applied for all responses. The MGLM is valuable if all
the responses are continuous and the multinormality and other assumptions are valid,
and if our interest lies in the covariance structure of the responses, or in a secqndary
parameter and/or hypothesis that involve more than one dependent variable in an
essential way, the two main reasons for employing a MGLM.
Generalized linear models (GLM) of McCullagh and Nelder (1989) can be adopted
in clinical trials when only one response is analysed. But here also there are drawbacks
that arise due to non-identifiability of such GLM's, high dimension of the "asymptotic
covariance" matrix, and large sample sizes.
The use of Cox (1972) partial likelihood by Fleming (1992) should be viewed
with caution. In clinical trials, censoring (Type I, II or random) often occurs, and
with moderate to high censoring, the partial likehood will not have enough informa
tion for a powerful test. Moreover, if there are many covariates with a scatter that
is not very localized then any departure from the proportional hazards assumption
would invalidate the inference about the covariate parameters by introducing possible
14
![Page 22: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/22.jpg)
bias and nonrobust standard errors.
In summary, the parametric setup should be "viewed with caution" as Sen (1993)
elaborates primarily because "any departure from the assumed functional forms may
cause considerable damage to the validity and efficacy of statistical analysis based
on the assumed model." Although the semi-parametric models offer more flexibility
by allowing some arbitrariness of the distribution functions (for example the baseline
hazard in Cox's proportional hazards model is nonparametric in nature), yet they are
not robust to qepartures from the assumed models. For these reasons it seems more
safe to assume a broader model that is less restrictive, and that will preserve consis
tency of the estimates while providing a reasonably efficient test statistics. Hence the
need for "nonparametric models" which are discussed in detail in the next section.
1.2.5 N onparametric Approach
Nonparametric formulations are more flexible but also more complex. Sen (1994)
states that the reason for this is that "any reduction of the statistical information
through only a few summaritative measures merits a much more careful consideration,
and often, a finite number of such measures may not suffice the purpose." Sen (1994)
lays down the foundation of nonparametric inference using surrogate variables by
considering a uniresponse model with a true outcome, Y, a set of covariates denoted
by Z, and a surrogate variable, X.
For the majority of experimental units, data are recorded only on (Xi, Zi), i E 1.
The set 1 is called the surrogate set. On another subset of experimental units, J*,
disjoint from 1, measurements are made on Y, X, and Z so as to allow assessment of
the relation between the true outcome and the surrogate. Thus, the set of all exper
imental units would be 50 = 1 U J* and 51 = J*, which is a hierarchical design as
in (1.2.3). An IMD can also be considered by including a third subset, 1°, with data
recorded on (Y, Z), but not on X.
Denote the conditional distribution function (d.f.) ofY, given Z = z, by F(y I z),
15
![Page 23: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/23.jpg)
apd the corresponding conditional survival function (sJ.) by F (y I z) [= 1-F (y I z)].
Also, let the corresponding functions of X, given Z = z, be denoted by G(x I z) and
G (x I z) respectively. Let H(y, x I z) be the conditional joint dJ. of (Y, X).
Two main goals of the analysis of an IMD are discussed by Sen (1992). The
first is when a test of the null hypothesis of no treatment effect is desired, and the
second relates to estimation of the treatment effects.
Hypothesis Testing
2.5.1.1 ~urrogate Sample Analysis
For the sake of simplicity, Sen (1992) deals first with the surrogate set, I, alone,
and then extends the method to the validation set, 1*. The Prentice (1989) criteria
for a surrogate translate here as concordance of F (. I z) and G (. I z), viewed as
functions of the concomitant variate z. The term concordance is used in·the usual
sense of concordance between two random variables, and the term concomitant variate
is used interchangeably throughout this work with the term ~ovariates. The method
proposed makes use of nonparametric analysis of covariance (ANOCOVA) tests that
were reported in Puri and Sen (1971) and Puri and Sen (1985). For extensions of
ANOCOVA tests to survival analysis see Chapter 11 of Sen (1981).
Let the vector of concomitant variates Z be expressed as Zj = (ci, ~D' , i 2: 1,
where the Ci are non-stochastic r(2: 1) vectors mostly relating to the design variables,
and the ~i are stochastic covariates. Note that random assignment of treatments to
subjects ensures the independent and identically distributed nature of the ~i's, which
is the basic assumption of ANOCOVA (in the parametric as well as the nonparametric
setups). Now, let II(~) be the marginal distribution of ~i, and let
16
![Page 24: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/24.jpg)
l'he null hypothesis of no treatment effect is formulated in a nonparametric way as
Ho : G i (. I ~) = G(. I~)' i E I (1.14)
where G is a suitable yet unknown d.£. which is assumed to be continuous everywhere.
Let ~i's be p x I-vectors, and define the (p + 1) x I-vectors Xi = (Xi,~D', i E I. If
the cardinality of I is n then Xi leads to a (p + 1) x n matrix. Arrange the elements
in order of magnitude within each row of this matrix, and denote the corresponding
ranks by Rji,j = 0,··· ,p; i = 1,···, n. This (p + 1) x n matrix is called the rank
collection matrix. Define the r-row vectors linear rank statistics (for each j = 0"" ,p
separately) as
T nj = l)Ci - c)anj(Rji ), j = 0,··· ,p,iEI
(1.15)
where c = C£iEI ci)/n, and the anj(k), k = 1,···, n, are suitable scores (for example
Wilcoxon scores: anj(k) = k/(n + 1), k = 1,·'·, n). Let V n be the (p + 1) x (p + 1)
matrix with elements Vnjjl given by
Vnjjl = n-1I: anj(Rji)anjl(Rjli)- an/injl,iEI
for j,l = 0,·,· ,p, and anj= n-12::k=1 anj(k),j = 0,,·· ,p. Also define
n
C n = I:(Ci - C)(Ci - c)',i=l
(1.16)
(1.17)
and assume that Rank(Cn ) = r(2:: 1), and as n increases n-1C n converges to a
positive definite matrrix C. Let T~ = (T~ll'" , T~o)' be an p x r matrix, and write
((vnjjl))j,jl=l,... ,P as V nOO , and let Vno = (VnOl,···,Vnop ). Then proceeding as on page
365 of Sen (1981), fit a linear regression of the surrogate variate rank statistics on the
concomitant part in order to eliminate the effects of the concomitant variates. Define
the residual rank statistics-vector
and let
* (V )-1'VnOO = VnOO - VnO nOO V nO •
17
(1.18)
(1.19)
![Page 25: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/25.jpg)
~inally, let
(1.20)
Sen (1994) proposes L no as a test statistic for testing the nul! hypothesis in
(1.14). Moreover, under (1.14), Lno is a permutationally (conditionally) distribution-
free test, and hence for small n, one may use an exact test, whereas for large n, L no
can" be approximated by the central chi squared distribution with r degrees of free
dom under Ha. When r = 1, one can perform a one-sided test if one pleases based
on T* J(v* )1/2nO nOD .
2.5.1.2 Censoring
A little adjustment is needed in the above discussion if the design incorporates cen
soring. In particular, if there is Type II censoring, i.e., if only data on the kn (out
of n) smallest ordered values of the Xi variable are available, where n-1 kn is close
to some pre-fixed a (0 < a < 1), then as in Chapter 11 of Sen (1981), the censored
version of (1.15) should be considered by replacing anO(Roi) in (1.15) by bnO,kn(Rod,
where
•
,i = 1,"" kn ,
for j > kn .
(1.21)
The censored version of Vnjj' also has to be considered, and proceeding as in (1.18-
1.20), one can define the statistics T~o(kn), v~ao(kn) and Lno(kn ), for every kn(1 ::;
kn ::; n). When kn is large, the exact (permutational) conditional distribution of
Lno(kn) can be approximated by the chi-squared distribution with r degrees offreedom
under Ho.
When there is Type I censoring (i.e., truncation at a prefixed timepoint T), then
if kn(T) = L,iEII(Xi ::; T), where I(.) is the indicator function, then under Ha, as
n ~ 00
-lk (T) a.sn n ~ aT,
18
0< aT < 1. (1.22)
![Page 26: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/26.jpg)
(see Chapter 11 of Sen (1981)). The test based on Lno(kn(T)) is also a conditionally
(given kn(T) = kn) distribution-free test under Ho that has a large sample chi-squared
approximation as well. Moreover, if the Prentice (1989) criteria for a surrogate are
fulfilled, then to circumvent the loss in efficiency brought about by censoring, time
sequential tests (such as the progressive censoring scheme (PCS)) of Chapter 11 of
Sen (1981) could be employed. Group sequential testing (GST) in the context of
repeated significance tests could be adopted also. However, if the Prentice (1989)
criteria can not be justified then validation sample analysis will be urgently needed if
valid inference about differential treatment effects on the primary endpoint is desired.
2.4.1.3 Validation Sample Analysis
Let the set of experimental units on which we have data for the primary, surrogate,
and concomitant (as well as design) variates be denoted by Iv with cardinality nv.
If C stands for the concordance between Y and X, then one way to incorporate the
validation set into the analysis is to perform first a preliminary test of
Ho : Pr(C) ~ 1/2. (1.23)
Now, corresponding to Xi, we have a (p + 2)-variate observation (Yi, Xi, ~i)', i E Iv,
and T no becomes here T nvo which is a 2 x r matrix with r elements for each of Y and
X. Also, V n is now V nv, a (p + 2) x (p + 2) matrix that can be partitioned as
where V nvOO is a 2 x 2 matrix, V nvO+ is 2 x p and V nv ++ is a p x p matrix. Then,
proceeding as in (1.15) through 1.19, let
T nvo - Vnvo+(Vnv++)-l~v'
V nvOO - VnvO+(Vnv++tlv~vo+
19
(1.24)
(1.25)
![Page 27: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/27.jpg)
'I;he permutational covariance matrix of T~"o is given by Cn" ®Vn"OO, where Cn" IS
defined as in (1.17) with n replaced by nv and I by Iv. Then 1.23 can be reframed as
Ho : V~O,XY 2: 0 ver.sus HI: V~O,XY < 0 (1.26)
where V~O,XY is the population counterpart of V~.oo. An asymptotically normal test,
L~", of (1.26) can be performed using the theory developed in Chapter 8 of Puri
and Sen (1971). Now, if Z; is the critical level of L~. with TJ (0 < TJ < 1) being
the level of significance for this test, then if L~" < Z;, we reject the null hypothesis
of concordance, and we do not proceed to test the original hypothesis of treatm,ent
effects based on the surrogate set only. Alternative designs (for example, adaptive
designs) will then be needed to test the basic hypothesis in a valid and reliable man
ner. In such a situation, Sen (1994) considers another test statistic, L~", of a parallel
hypothesis constructed from Iv such that the component of Tn"o corresponding to Y
is the dependent variable, and the other component corresponding to X, along with
T~" serve as the covariate rank vector. A simple linear regression is done as before
and L~" will be permutationally (conditionally) distribution-free with a chi-squared
large sample approximation with r degrees of freedom.
If, on the other hand, the null hypothesis of concordance is accepted, that is if
L~. 2: Z;, then Sen (1994) suggests combining statistical evidence from both I and Iv
to get a test statistic to test the original hypothesis of no treatment effect. One such
combination can be L~~" = L no + L~., and this test is also permutationally (condi
tionally) distribution-free with a large sample central chi-squared distribution with
2r degrees of freedom under Ho. However, the fact that the degrees of freedom are
2r instead of r causes concern regarding the efficiency of L~~" in terms of asymptotic
power. Hence, Sen (1994) suggests combining the covariate adjusted rank statis
tics from I and Iv before constructing the quadratic forms, and then constructing a
quadratic form in such a way that under Ho, the resulting test is permutationally
conditionally distribution-free with large sample chi- squared approximation with r
degrees of freedom.
20
![Page 28: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/28.jpg)
Before outlining the details of such a test, it is worth mentioning at this stage
that if the Prentice (1989) assumption holds (based upon clinical considerations),
performing a preliminary test actually complicates matters due to the multiple tests
involved. Sen (1994) illustrates this point by pointing out that two tests are per
formed, the preliminary test at significance level 1] , and the second stage test at level
(Y2, say. Then the overall level, say (Y, is given by
(1.27)
where 1~2 and l~~ are the critical levels of L~v and L~~v respectively. The basic prob
lem here is that the pair (L~v' L~v)' and the pair (L~v' L~~v) may not be stochastically
independent, hence evaluation of (1.27) will be quite complicated (even in the asymp
totic setup). That is why performing a preliminary test of concordance may not be
very appealing.
Sen (1994) considers analogues of the standard parametric procedures discussed
in Roy et al. (1971), though in a nonparametric setup. In the normal theory setup,
specifying the mean and covariance structure completely specifies the distribution of
the test statistic, whereas here, the covariate adjusted linear ~ank statistics, although
asymptotically multivariate normal, include some unknown functionals in the mean
vector and covariance matrix that need to be known to be able to construct the
test. One option then would be to use adaptive procedures, but the large sample size
required eliminates such an option. Hence, Sen (1994) prescribes the unique combi
nation procedures which we shall describe in detail.
First, partition the residual vector T~vo in (1.24) and the dispersion matrix in
1.25 as follows
(1.28)
Also let
(1.29)
21
![Page 29: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/29.jpg)
The statistics T~vo(x) and T~vo(Y:x~ are statistically independent under the permu
tational model for large n, and permutationally uncorrelated for finite n. Moreover,
each of them, when normalized, is asymptotically multinormal. This prompted Sen
(1994) to consider the combination
T~:o = [(v~vxxt 1 + (vnvY:x t 1t 1{ (vn"xx )-lT~vo(x)+ (v~"(y:x))-lT~,,o(Y:X)}·
(1.30)
The permutational mean of T~:o is 0 and its permutational dispersion matrix is
(1.31)
The final step is to combine T~:o and T~o in (1.18) to get the suggested test statistic.
Denote the r x r matrix in (1.31) by AI, and let A 2 = v~oo.Cn where C n and v~oo
are defined in (1.17) and 1.19 respectively. Let
and let
W. - [A-1 + A-1]-1 A-It- 1 2 i' i = 1,2 (1.32) •
(1.33)
TO' is conditionally distribution-free, under the permutational model, with 0 mean
and covariance matrix [A~l + A21]-1, and the permutational multivariate central
limit theorem of Sen (1983) applies here. Hence the motivation for the overall test
statistic
(1.34)
The exact permutational distribution of LO' can be obtained by enumeration if n
and nv are small to moderate. But for large nand nv , the distribution of LO' is
asymptotically a chi-squared with r degrees of freedom. For local (i.e., Pitman-type)
alternatives, the asymptotic distribution of LO' is a noncentral chi-squared with r
degrees of freedom and noncentrality parameter t:,.L which depends upon the alter
native and the score functions. The test based on LO' has, at least asymptotically,
22
![Page 30: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/30.jpg)
~etter power than the two stage test in (1.27) if the Prentice (1989) assumption holds,
where it is not as favorable if there is reason to doubt the concordance between the
surrogate and the primary variate.
Moreover, both Type I and Type II censoring can be incorporated into these
test just like before in (1.21) and (1.22). The V matrices would be modified and we
proceed as in (1.28) through (1.34). The extension to repeated significance testing is
also conceivable albeit more complex.
In simple designs (e.g., two-sample or multi-sample models), it is possible to
construct functionals of the distribution functions F(. I z) and G(. I z) that help in
drawing statistical inferences about the treatment effects. Consider the functional
O(z) = O(F(. I z), and the functional ~(z) = ~(G(. Iz)). Examples of such functionals
could be the nonparametric regression quantiles (e.g., the conditional median), or the
conditional mean, although the latter is not preferred due to its sensitivity to the
tails of the F (. I z) and will not be considered here.
The dependence between the surrogate and the primary variates is reflected in
some suitable nonparametric functional, 'ljJ(.), where
O(z) = 'ljJ(~(z)), z E 3 z ; (1.35)
Note that in a hierarchical design, ~(z) may be estimated from the set 1, and 'ljJ(.) from
the set 1*. In an imd design, O(z) may be estimated from 1°, ~(z) from 1, and 'ljJ(.)
from 1*. Let 0(.) and ~(.) be conditional quantiles and consider the simplest model
of placebo versus treatment. The Prentice (1989) assumption may be tested here as
follows. Partition the concomitant vector Z as (Z(l), Z(2)), with Z(1) containing the
treatment and design variates, and Z(2) has the other possibly random concomitant
variates. In the placebo vs. treatment case, Z(1) takes only the values 0 for placebo,
and 1 for treatment.
The null hypothesis of no treatment relationship to the true outcome can be
formulated as
(1.36)
23
![Page 31: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/31.jpg)
f9r all Z(l) and Z(2). Similarly, the null hypothesis of no relationship of the treatment
to the surrogate can be formulated as
(1.37)
for all Z(l) and Z(2). Testing the validity of the Prentice (1989) assumption is equiv
alent here to requiring (1.36) to be testable through (1.37). Thus if O(Z(l), Z(2»)
depends on Z(l), then the concordance condition would have to be verified before
making any conclusions. If Zo and Zl are two distinct values of Z(1), then we may
require that for every Z(2)
(1.38)
This brings us to the domain of estimation of the functionals 0(.), e(.) and 'ljJ(.) in a
nonparametric way which we shall deal with in detail in the next subsection.
Estimation
Consider the simplest case of placebo versus treatment when ZP) = 0 or 1,
and when there is no concomitant variate, i.e., when Z(2) = o. For the set I, we
have two subsets corresponding to the placebo and treatment groups, from which
we estimate the quantiles e(O) and e(l) as the sample quantiles of the respective
distribution. For the validation subset, J*, Sen (1994) suggests considering the two
bivariate distributions of (X, Y) for the placebo and treatment groups, and then
estimating the sample quantiles from the marginal distributions for each sample alone.
Thus, from the validation subset, J*, we have the estimators
(1.39)
(1.40)
Also, from the surrogate set, I, we have the estimators ts(O), ts(1). By standard
multivariate nonparametric methods (see Chapter 5 of Puri and Sen (1971)), the
asymptotic distribution of the two bivariate points (tv(O),Ov(O)) and (tv(l), Ov(1)) is
~ ( tv(O) - e(O) ) '" Ar (0 r )nvo ~ .IV2 , 0,Ov(O) - 0(0)
24
![Page 32: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/32.jpg)
where nvo is the cardinality of the subset of 1* for which Z = O. Also, if nvl stands
for the cardinality of the subset of 1* for which Z = 1, we have
(1.41)
where ro and rl are to be consistently estimated from the validation subsets, though
with a better rate of convergence if we assume that ro = r1= r. Similarly, if nso
and n sl are the sizes of the subsets of I for which Z = 0 or Z = 1, then
(1.42)
(1.43)
where /50 and /51 are consistently estimated from the respective samples, also with
a better estimate if we let IsO = /sl = /5. By (1.40) and (1.41), E(Ov(O) I tv(O))
and E( Ov(1) I tv (1 )) are both linear in tv(O) and tv (1 ) respectively, and hence the
motivation for the following estimators of 0(0) and 0(1)
(1.44)
(1.45)
The regression estimates, ~o and ~I, are to be obtained from classical linear inference
procedures by using estimates of r o, rI, lOs and /ls' This procedure applies also
when Z(I) takes more than two values.
Now, with the introduction of the concomitant variate, Z(2), we may have to
deal with three possible situations. The first arises when Z(2) is categorical in nature.
The same method of estimation discussed will still apply but based on the subsets
corresponding to the possible values of Z(2). So, if these values are denoted by ar, r =
1,' .. ,I<, then the functionals to estimate are
O(Z(I), a r ), e(Z(I), a r ), r = 1,· .. ,I<.
25
(1.46)
![Page 33: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/33.jpg)
1;'he second case may arise when Z(2) is a continuous random variable. Consider a
simple model when Z(2) is a scalar random variable with a continuous distribution.
To estimate O(Z(l),XO) for a given Xo, we let
Wi = IZ?) - xol, i E I(and i E 1*).
Now, within each subset of I (and 1*) we consider a subset of observations for which
the Wi have the smallest k values, where k is not small but k/ nvo (or k / nvl, etc.)
is small. Based upon these k observations we proceed as in (1.40) through (i.45)
and estimate O(Z(1), xo). The theory for this is explained in Gangopadhyay and Sen
(1992a) and Gangopadhyay and Sen (1992b). It should be noted that the rate of
convergence in ((1.40))-(1.43) was .,jn, whereas here it is of the order n a for some
a < 2/5.
The third case is when some covariates are discrete and others are continuous. A
combination of the methods for the first and second cases will be a good prescription.
2.5.2.1 Concomitance Assumption
Up till now we have been assuming that the (Xi, Zi), i E I, has the same distribution
as (Xi, Zi) i E 1* so that O(z) and e(z) could be estimated consistently. But this may
'not be tenable in practice due to planned censoring of the surrogate or concomitant
variates that may damage the homogeneity assumption of the covariates made in
(1.14),' and hence, all the analysis made so far may be invalid. Sen (1994) gives
us some hope here by introducing some modifications that will extend the method
discussed to such situations.
Consider the case when 1= {i : Xi :::; xo} for some real xo. Here G(. I z) will be
right truncated at xo. Define
G(x I z)crx;;rz)'
26
x:::; Xo,
x> Xo
![Page 34: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/34.jpg)
and consider the functional
which is different from e(z). On the other hand variates in 1* have values that
correspond to values of Xi exceeding Xo. Thus, H(x, y I z), for this upper tail, will
be truncated from the left at Xo with respect to Xi. Define
H (x y Iz) - H(x, y I z) - H(xo, y Iz) for x > Xo, Y E R+.L, - 1 H( I) ,- Xo,oo z
Then the corresponding marginal distribution of X is
G ( I ) - H(x,oo Iz) -H(xo,y Iz)L x Z - , x > Xo.
1 - H(xo, 00 I z)
and parallel to eR(Z), let
Thus, n s and n v are nonnegative integer random variables that add up to a
known n, and such that
n s P ( ) d nv p ( )- ---t Pr X ::; Xo an - ---t Pr X > Xo, as n ~ 00.n n
The asymptotic normality results of (1.40)-(1.41) on the estimates of fh(z) and
eL(Z), and ~L(Z) [as in (1.42-1.43)] will not lead to better estimates of O(z) [as in (1.44
1.45)], unless eL(Z), eR(Z), OL(Z), and O(z) are related by some estimable functional
forms. Sen (1994) contends that although it is tempting to consider semiparamet
ric models (such as the proportional hazards model of Cox (1972)) to enforce such
estimablity conditions, these models should be avoided here on account of lack of
robustness. Sen (1994) suggests as a partial solution, extension of the validation sub
set, 1*, by including additional observations for which Xi ::; Xo. This will permit the
nonparametric estimation of the relationship between edz) and ~R(Z), and also the
relationship between O(z) and ~L(Z).
27
![Page 35: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/35.jpg)
1.3 Synopsi~ of The Work Done
There are two main issues that need to be addressed when incorporating surrogate
variables in clinical trials. The first issue is economic in nature, while the second is
statistical. A reduction in the cost of a clinical trial, although desired, should not
be at the expense of unreliable statistical inference. This could happen, for example,
if one abuses surrogate endpoints by using them without considering carefully their
association with the true endpoint. Thus, in the design stage of the clinical trial, one
has to try and balance the practicality of primary and surrogate response measure
ments (in terms of cost and difficulty), with the minimization to the extent possible
of the bias of the estimates and inefficiency of the tests involved. Hence, a proper
choice of the validation subset, 1*, seems to be essential.
Although this study will not focus on design issues of clinical trials that handle
surrogate responses, we will consider statistical analysis in incomplete multiresponse
design (IMD) settings of which the hierarchical design is a special case. The reason
for this is that it allows investigators greater flexibility than if we consider hierarchical
designs only. The contribution of this work consists in generaJizing Sen's method into
the multivariate case in a randomized block setting (complete as well as incomplete),
and to combine inter and intra-block information in a nonparametric way. This has
not been done before in the literature. Above all it is the introduction of surrogates
that is also new.
There are two scenarios that we will study in detail which I shall now describe.
As we have seen in (1.2.1), one surrogate may not suffice to draw valid inference
about the treatment effect on the primary variate. One major concern of scientists
that puts the validity of the surrogate endpoints in doubt is the possibility that treat
ment may affect the true endpoint via other pathways than the surrogate. Thus, if
the investigators think that this is the case with a certain surrogate, they can use
measurements of other surrogates to account for these other pathways of treatment
effects on the primary endpoint. In this way we will have more information about dif-
28
![Page 36: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/36.jpg)
f~rential treatment effects on the true endpoint and our inference would be valid. For
example, we mentioned in (1.2.1) that Baccheti et al. (1992) found that CD8 counts
as well as changes in hemoglobin and WBC during therapy add more information to
the effect of treatment on AIDS or death than CD4 counts alone. Thus, in many
cases use of more than one surrogate to predict treatment effect on a true outcome
is better than relying on only one surrogate.
The first scenario can thus be formulated as follows. A primary variate, Y, is
considered along with a q-vector of surrogates, X, and measurements on the concomi
tant and design variates, Z, are obtained as well. An IMD here pertains to designing
three subsets of experimental units; the first subset, I, has measurements on X and
Z, and is termed the surrogate set. The second subset, called the validation subset,
1*, has measurements on Y, X, and Z. The third subset, 1°, contains measurements
on Y and Z.
The second scenario that we will focus on deals with a vector of primary vari
ates, Y, a vector of surrogates X, and accompanying concomitant variates, Z. Here
surrogate and validation subsets will be considered also.
The methodology proposed here will be an extension of the methods in (1.2.5).
The tests proposed by Sen in (1.2.5) dealt with hierarchical designs only, and they
accomodate use of one surrogate for a primary variate. Keeping in mind that hier
archical designs are a special case of an IMD, this research generalizes these tests
to the IMD setting. As pointed out in (1.2.5), statements were made regarding test
statistics and their properties without any detailed proofs. Supplying proofs in the
multivariate setting of the two scenarios above will be one of the tasks to be carried
out during this study.
The second chapter will focus on analysis of designs of the first scenario type.
Since it would take too much space here we will not go into the technical details, but
rather we content ourselves with only a motivation. The null hypothesis of interest is
the same as (1.14) when we consider the surrogate sample alone with the difference
29
![Page 37: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/37.jpg)
that G(x I ~) is now a multivariate distribution function. The aim is to formulate
test statistics, based on linear rank statistics in (1.15), in each of the subsets 1,1*,
and 10, and then to come up with a unified test that combines information from all
these subsets similar to the test suggested in (1.34).
To be more specific, if we assume that all the surrogate variates can be recorded
with relative ease, then if we have reason to believe that the Prentice (1989) criteria
hold true, we will proceed with the surrogate sample analysis as in (1.2.5). Thus, we
will have now
which leads to a (p + q) X n matrix. The linear rank statistics are constructed as in
(1.15), but now T no will be a qr-row vector with r elements for each of the q surrogate
variables. V n is now a (p+q) x (p+q) matrix. We fit a multiple linear regression with
the q surrogates as the dependent variables and the p concomitant variables as the
independent variables. An appropriate residual rank statistic, say T~o, correspond
ing to (1.18) will then be constructed. Censoring can also be incorporated here by
replacing the censored version of the score functions as in (1.21).
Simillar modifications have to made when constructing a residual rank statistic
for the validation subset, J*. T nvo is now a (q+ l)r-row vector with r elements for Y,
and r elements for each of the q surrogate variables. V nv is now a (p+ q+1) x (p+ q+1)
matrix, where V nvOO is (q + 1) x (q + 1), V nv O+ is (q + 1) x p and V n v++ is a p x p
matrix. We regress Y and X on the p concomitant variates, and then we consider
the residual rank statistic, say T~vo' corresponding to (1.24). We then partition T~vo
and the corresponding covariance matrix V~voo as in before. V~voo now becomes
where vnvYY is variance of Y, and v~vYX is a 1 x q vector, and v~vxx is a q x q matrix
of the covariance of X. We then construct a statistic similar to T~:o in (1.30).
An analogous treatment to the subset 10 would yield a similar residual rank
30
![Page 38: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/38.jpg)
s}atistic, say T~oo, where no is the cardinality of ]0. Here we would regress Y on
the p covariates to obtain T~oo. We shall use T~oo along with the previously defined
residual rank statistics for] and 1*, namely T~o and T~:o respectively, to construct
an overall test statistic similar to (1.34). Thus, if we let TO·, as in (1.33), be
T O· W T**' W T*' W T*'. = 1 nv O + 2 nO + 3 noO (1.47)
where the weights Wi, i = 1, ... ,3 are to be defined as in (1.32), then the final step
consists of constructing the quadratic form similar to (1.34) that will be permuta
tionally distribution-free, and will have a large sample chi-squared approximation.
The third chapter will be devoted to studying the second scenario. Here we will
make use of the nonparametric multivariate techniques in Puri and Sen (1971) in
order to come up with the combined test. We consider residual rank statistics based
on regressing the q surrogate variates on the p covariates in the surrogate sample.
Moreover, if Y is an s x 1 vector, then we regress the q + s variates on the p co-
variates to get the residual rank statistic in the validation subset, 1*. In the subset
]0, we regress the s primary response variates on the p concomitant variates to get
the corresponding residual rank statistic. Then we construc,t the combined test by
constructing the appropriate quadratic form as in (1.34).
It maybe noted in (1.47), that if, based on other considerations, the weights
are prespecified, then although n1/2To· will still be asymptotically multinormal, the
quadratic form, LO·, based on TO· and the prior weights may not converge to a chi
squared distribution because the discriminant of such a form may not be equal to the
covariance matrix of the combined test-statistics, and thus Cochran's (1934) Theorem
may not apply. Here it would be appropriate to consider resampling methods, like the
bootstrap, when dealing with the distribution theory in the multivariate case. Further
note that a hierarchical design is a special case of an incomplete multiresponse design
so that the methods developed in this work will apply to such designs as well. Hence,
they will not be considered in detail.
Finally, the last chapter will study the power function of the test developed and
31
![Page 39: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/39.jpg)
it,s asymptotic relative efficiency. An example will be based on a completed double
blind placebo-controlled trial conducted by Burroughs-Wellcome, which treated 281
patients with advanced HIV disease. Of these, 137 patients were randomized to receive
placebo and 144 patients were randomized to receive a 250-mg dose of Zidovudine
(ZDV) every four hours. In this study CD4 counts were determined prior to treatment
and approximately every four weeks during therapy. The median duration of follow
up was 120 - 127 days, at which point the study was stopped due to the superior
results of the ZDV arm in decreasing mortality.
32
![Page 40: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/40.jpg)
Chapter 2
Methodology In Randomized
Block Design (RBD)
2.1 Introduction
This chapter will be devoted to studying the first scenario mentioned in (1.3) in
which a primary variate Y, a q-vector of surrogates X, and a p-vector of concomitant
variates, Z, are considered. As mentioned earlier the new treatment may affect the
primary endpoint through possibly more than one path. This model allows us to
account for these different paths.
An IMD here pertains to designing three subsets of experimental units: the
first is the surrogate set /, which has measurements on X and Z; the second is the
validation set, /*, which has measurements on Y, X, and Z; and the third subset is
/0, contains measurements on Y and Z.
Although this study will not focus on design issues of clinical trials that handle
surrogate responses, statistical analysis is considered in two design settings: the ran
domized block design (RBD), and balanced incomplete block design (BIBD). The
main reason for considering factorial designs is to reduce the variability of the esti
mates by eliminating the effect of one or more nuisance variables. Each of the above
33
![Page 41: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/41.jpg)
designs will be considered separately for the subsets f, 1* and fO. The next section
will deal with the simplest design, namely the randomized block layout in each of the
subsets f, 1* and fO.
2.2 Randomized Block Design
In classical linear models theory, the analysis of data from a randomized block setup
makes the following assumptions when estimating parameters:
i) The block and treatment effects are additive.
ii) The errors are independent and homoscedastic.
iii) Blocks and treatments do not interact unless each cell contains two or more
observations.
Moreover, the errors are assumed to be normally distributed in case confidence inter-
vals or significance tests for the parameters are desired. Here, in the nonparametric-
setup, these assumptions are relaxed for the most part. We do not assume additivity
of blocks and treatments; we drop the homoscedasticity assumption of the errors,
rather we assume that the errors are independent random vectors each of which has
a continuous distribution that is symmetric in its arguments. The normality assump-
. tion is completely relaxed.
2.2.1 RBD For The Surrogate Set I
Consider a two-way layout with s blocks of r plots each where r different treatments
are randomly assigned to the plots. Assume there are no replicates for simplicity.
Thus the cardinality of f is N(= rs). The response in the ith block receiving the
jth treatment is a p + q-vector V ij = (xg), ... ,xi1),zg), ... ,Zi1))' = (Xij' ZiJ' of
34
![Page 42: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/42.jpg)
measurements corresponding to the p covariates and the q surrogates. Consider the
model
V ij = J.L + ai +Tj + fij, j = 1,"', r, i = 1,"', S, (2.1)
where J.L is the mean effect, 01, ... ,as are the block effects, TI, ... , T r are the treat-
ment effects, and En,··', Esr are all p + q-vectors. The following assumptions are
made:
a) Vi = (Vill ···, Vir) has a continuous r(p+q)-variate c.d.£. Gi(u), u E Rr(p+q), i =
1,··" s.
b) The joint c.d.f. of Zi = (Zil,"', Zir) is symmetric in its r p-vectors. This
is the concomitance assumption of the covariate distribution in the analysis of
covarIance.
We wish to test the null hypothesis,
Ho : Tj = 0 j = 1,' .. ,r.
while the set of alternatives relates to shifts in location due to treatment effects.
The linear rank statistics discussed in (1.2.5) will be based upon the method of
ranking after alignment described in detail in Chapter 7 of Puri and Sen (1971). The
alignment procedure eliminates the block effect by subtracting from each observation
in a block a translation invariant symmetric function of the observations like the
block average, block median, the Winsorized or trimmed mean, etc. Let Vi. be such
a function. Define the aligned observations as
J. = 1 ... r. '" i = 1,'" ,So (2.2)
For the kth variate, rank the observations ug)*, ... , U};)* in ascending order of magni
tude and denote by R~J) the rank of Ui~k)* in this set, for i = 1," . ,S, j = 1,' .. ,r, and
k = 1,'·· ,p+ q. Thus, there is a rank vector R ij = (RU),···, R~f+q))', corresponding
to V~· = (U(~)* ... U~~+q)*), i = 1 '" S J' = 1 ... r~J ~J'.' ~J " , " •
35
![Page 43: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/43.jpg)
For each N(= rs) and each k = 1,' .. ,p + q, define suitable rank scores a~) =
(a~~ll"" a~:N)' where a~:i = J};)(jj(N + 1)), 1 :::; j :::; N. Moreover, J};)(u) is
defined in accordance with the Chernoff-Savage convention, that is J};)(u) satisfies
the following conditions:
(a) limN ---+ ooJ};)(u) = J(k)(u) exists for 0 < u < 1 and is not constant,
(b)
where
(k) 1[ (k)*]GN[i] (x) = -; number of Uii :::; x, k = 1, ... ,p + q, j = 1, ... , r,
and
(k)( 1~ (k) ( )HN x) = - LJ GN[jl x , k = 1, ... ,p + q.r i=l
Define
(k,k') 1 [ ( (k)* (k')* . ]GN[j.il(x, y) = -; number of Uii ,Uii ):::; (x, y) ,
for k, k' = 1,' .. ,p + q, j, 1= 1,' .. ,r with either j -=/:- I or k -=/:- k' or both.
Let a~)= N- 1 L~=l a~!a' Also let C}j~ = 1 ifthe a th smallest abservation among
the N values of Ui~k)* belong to the ph treatment, and let C}j~ = 0 otherwise, for
a = 1,' .. ,N, j = 1,' .. ,r. Now construct the linear rank order statistic for the kth
variate and the ph treatment, j = 1,"', r, k = 1,'" ,p + q
N
T (k) _ -1 "" CU) (k)N,i - s LJ NaaN,o.,
0.=1
This leads to the r x (p +q) matrix
TN = ((T~;))i=l,. .. ,T, k=l"",p+q
36
(2.3)
(2.4)
![Page 44: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/44.jpg)
Define the rank collection matrix of order (p + q) x N by R';v
Partition RjV into s submatrices of order (p + q) X reach
(Rn ,' .. ,Rsr ).
R* = (R(p+q)xr '" R(p+q)xr) . . D. = (R. ". D.)N 1 "S ,.LLi ~1, ,.LLir, i = 1-"", s
Under the null hypothesis the distribution of U71'···' U7r is symmetric in the r
vectors and hence remains invariant under any permutation of the r vectors. Thus
the joint distribution of
remains invariant under the finite group 9s of transformations {9s} which maps the
sample space onto itself. The cardinality of 9s is equal to (r!Y. Typically a 9s is such
that
where (U?1'···' U?r) is a permutation of U71" . " U7n etc. i = 1,"', s. Let R~
denote the rank collection matrix corresponding to U~. Note that for every 9s E 9s,
there exists a R~ = 9sRN which is permutationally equivalent to RjV.
The distribution of R';v over its (N!)(p+q) possible realizations will depend on the
unknown c.dJ Gi , even when the null hypothesis holds. However, under Ho, U~ has
the same distribution as UN for all 9s E 9s, and hence, the conditional distribution of
U';v over {U~ = 9sU';v; 9s E 9s} will be uniform, each realization having the common
conditional probability (d)-s. This leads to the probability law, g:Js:
Under Ho, the conditional distribution of R';v over the (r!)S realizations {R~ =
9sR';v;9s E 9s} is uniform, each realization having the conditional probability (d)-s.
Since g:Js is completely specified, the existence of conditionally distribution-free tests
for Ho is thus established.
Letr
-(k) -1"'" (k)a (k)= r ~a (k)N,R N,R•. j=1 'J
be the intrablock averages for the kth variate, k =1,,·' ,p + q.
37
![Page 45: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/45.jpg)
Theorem 2.1 Let V N be the (p + q) x (p + q) matrix with elements Vkk' given by
[1 ~~( (k) :;(k)
Vkk' = s(r -1) LL aN,R(k) - aN,R(k»)1=1 t=1 It l.
(k /) :;(k/)]x(a (k') - a (kl))
N,Rlt N,RI.(2.5)
k, k' = 1,'" ,p + q. Moreover, let eN to be the r x r matrix with elements Cjjl given
by
1 (C 1)'" 1C' 'I = - 0 "/r - ),) = ,"', rJJ sr JJ
where 8jj , is the usual Kronecker delta. Then,
(2.6)
E(TN)
Var [Vec (TN)]
(~)" ", aX;+q)) 0 J
where J is an r X 1 vector of ones and Var [Vec (TN)] is a r(p+q) x r(p+q) dispersion
matrix.
Proof: Note that
and
E ((k) )2 1 ~ ( (k) )21". aN,R(k) = N L aN,ex .
'J ex=1
Also, for all i,i'= 1,···,s j,l = 1,···,r and k,k' = 1,···,p+q
To justify the first equality above note that 1can take any of 1, ... ,s with probability
~, m can take any of the remaining s - 1 numbers with probability S~1' t, u can take
38
![Page 46: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/46.jpg)
-apy of 1, ... ,r with probability ~, and under ps the blocks are independent. On the
other hand if i = i',j =I j', then
1 ~ ~ (k) (k)
( 1) L.... L.... aN R(k)aN R(k)sr r - 1=1 t:;eu=1 'It 'I"
1 {~(~( (k) ))2 ~ ~((k) 2}sr(r -1) ~ ~ aN,R~;) - ~~ aN,R~;)) .
_ 1 {~r2(a(k»)2 _ ~(a(k) )2}sr(r _ 1) L.... N,R(k) L.... N,OI
1=1 l. 01=1
and
1 s T
"'" "'" (k) (k') if k -I- k',)' -I- ).,-(--1-) L.... L.... aN R(k)a (k') r rsr r - 1=1 t:;eu=1 'It N,RI "
and
1 ST
"'" "'" (k) (k'). ,-L....L.... a (k)a (k') If k =I k,sr 1=1 t=1 N,R/t N,RIt
Thus, we have
and
. (k»)Var p , (TN,j
39
![Page 47: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/47.jpg)
(2.7)
Moreover,
The expected value in the first term in (2.7) above contains products of scores in the
same block and same treatment. Such a product occurs wjth probability ~ under
~S. The expected value in the second term consists of products of scores in different
blocks but same treatment. Keep in mind that the blocks are independent under ~S'
Also,
E [(8-1 ;'" C(j) a(k) )(8-1 ;... C(j') a(k/»)]
Ps L...J NOI N,OI L...J NOI N,OI01=1 01=1
8-2 [;... C(j) C(j')E (a(k) a(k')) + ;... C(j) C(j') E (a(k) a(k'))] (2.8)L...J NOI NOI Ps N,OI N,OI L...J NOI N{3 Ps N,OI N,{301=1 0If:{3=1
-2 [ 1 ~ ~ (k) (k') ~ E (k) (k') )]8 r(r _ 1) L...J L...J aN,R(k)aN R(k') + L...J Ps aN,R(k)aN R(k')
1=1 tf:u=1 It 'Z" If:m=1 I)' m)'
-2 [ 1 ~ ~ (k) (k') ~ 1 ~ (k) ~ (k') ]8 r(r -1) ~ t/:::l aN,R~~)aN,R~~/)+1~1 r 2 ~aN,R)~) ~ aN,R~2
-2 [ 1 ~ (~~ (k) (k') ~ (k) (k') )8 r(r _ 1) (:: 8 ~ aN,R~~)aN,R~~') - ~ aN,R~~)aN,R)~/)
40
![Page 48: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/48.jpg)
The expected value in the first term in (2.8) above contains products of scores in the
same block but different treatments. Such a product occurs with probability T(T~l)
under ps. The expected value in the second term consists of products of scores in
different blocks and different treatments. Hence,
(k) (k'))Covps(TN,j' TN,j
-(k)-(k')-aN aN
Also, for j =1= j' we have
•Denote the marginal c.d.f. of Ui~k)* and of (Ui~k)*,USk')*) by G~VJ(x) ~nd G~V.l](X,y)
respectively, for j,l = 1,·· . ,r, k, k' = 1,· .. ,p + q, with at least one of j =1= 1, k =1= k'
41
![Page 49: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/49.jpg)
~eing true, and let
1 S T
jj~) (x) = - L L G~VJ(x), fork = 1,'" ,p + q.sr i=l j=l
Define the monotone transformation
W " - (W(l) W(p+q)),·· - 1 . - 1 .tJ - ij' ... 'ij ,J - ,,"', r, Z - ,"', s,
bkkl 'I' - E(W(k)W(k' )) k k' - 1 ... p + q J' - 1 ... r'.J ,t - ij iJ ' , -" ,-",
S
b(s) - -1" b k k' - 1 . - 1 .kk'.jl-S .L..J kk'.jl,i , - ,"',p+q, J- ,"',r,i=l
..
Finally let
I T 1 T T
" (s) "" (s)Vkk',N = - .L..J bkkl. jj - 2' .L..J.L..J bkkl. jl ,r j=l r j=l 1=1
k,k'= 1,"',p+q; (2.9)
VN = ((Vkkl ,N))k,k'=l'''',P+q' (2.10)
Now by Lemma 7.3.10 of Puri and Sen (1971) V N :e.., VN, and V N is positive definite in
probability when VN is positive definite. Now, using this fact and conditions (a), (b),
and (c) we arrive at the following thoerem.
Theorem 2.2 For each j = 1,"', r, the 1 x (p + q) row- vector of TN in (2.4)
is asymptotically normal with mean equal to (aW,· .. ,~+q)), and dispersion matrix
Cjj V N, where Cjj is the diagonal element of eN.
For proof see Theorem 7.3.3 of Puri and Sen (1971). For each j separately, j =
1,"', r, let us partition the corresponding 1 x (p + q) row vector of TN in (2.4) into
two components after subtracting the mean; the first component, TNo,j, is a q x 1
vector of the centered linear rank statistics corresponding to the surrogate variables
that is
TNo . = (TN(1) - a}N1) ... TN(q) - a}Nq)),,J ,J ",J '
and the second component, Tt,j, is a p x 1 vector of centered linear rank statistics
corresponding to the concomitant variates that is
TO . = (T(q+1) - a(q+l) ... T(p+q) - a}p+q)),N,J N,J N" N,J N .
42
..
![Page 50: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/50.jpg)
S;imillarly, we partition the matrix V N as follows
(VNOO VNO+)V N =
. V~ci+ V N++(2.11)
where V NOO is a q x q matrix, V NO+ is q x p and V N++ is a p x p matrix. Then, from
the classical normal theory, (see Theorem 2.5.1 of Anderson (1984)), for large N we
have
Define the q x 1 residual rank-statistics
V NOO V V V (-l) V'NOO - NO+ N++ NO+ (2.12)
Now, for each j = 1,···, r, obtain similarly a residual vector, consider the q x r
residual matrix corresponding to the different treatments
(2.13)
where
and
Moreover, the dispersion matrix of sl/2Vec (TNO ) is the qr x qr matrix eN ® V NOO .
It maybe noted that censoring can also be incorporated in the design and we would
then use the censored version of the score functions as in (1.21).
2.2.2 RBD For The Validation Set 1*
Consider a two-way layout with n* blocks of r plots each where r different treat
ments are randomly assigned to the plots. Since there are no replicates the cardinality
43
![Page 51: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/51.jpg)
o,f 1* is N v (= n*r), say, thus allowing the number of subjects in the validation set to be
different from the number in the surrogate set. The response in the ith block receiving
h ·th t . 1 U·· - ("l-":. X(I) X(q) Z(I) Z(p))' ft e J reatment IS a p + q + -vector tJ - .I iJ' ij'···' ij' ij'···' ij 0
measurements corresponding to the primary variate Yij, the p covariates and the q
surrogates.
Consider the same model and assumptions as in (2.1) with the only difference
that now Ell,·· . ,En • r are all p + q + 1 vectors. That is to say we regress the pri
mary variate and surrogates on the concomitant variates. The method of ranking
after alignment will also be used here. Proceeding as in proof of Theorem 2.1 with
N replaced by N v , and s by n*, so that k ranges now from 1 to p + q + 1 and hence
V N v is a (p + q + 1) x (p + q + 1) matrix, then, corresponding to (2.11) we have
V NvO+ )
V Nv ++
(2.14)
where V NvOO is ~ (q + 1) x (q + 1) matrix, V NvO+ is (q + 1) x p and V N v++ is a p x p
matrix. Moreover, corresponding to (2.13) let the (q + 1) x 1 residual rank-statistics
and dispersion matrix be
T*NvO,j
V V V(-I) V'NvOO - NvO+· N v++ NvO+ (2.15)
Now, for each j = 1,···, r, we obtain similarly a residual vector. Consider the
(q + 1) x r residual matrix corresponding to the different treatments
(2.16)
Partition the residual matrix in (2.16) and the (q +1) x (q + 1) dispersion matrix in
(2.15) as follows:
44
![Page 52: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/52.jpg)
'Yhere vNvYY is 1 xl,. vNvYX is 1 x q, and V NvXX is q x q. Also let
(2.17)
(2.18)
Note that the dispersion matrix of the 1Xr vector n*1/2 TNvO(Y:X) is given by vNv(Y:X)CNv '
where the elements of CNv are given by (2.10) with N replaced by Nv .
2.2.3 RBD For The Set 1°
Consider a two-way layout with nO blocks of r plots each where r different treat-
ments are randomly assigned to the plots. Since there are no replicates the cardinality
of 1° is N°( = nOr). The response in the ith block receiving the jth treatment is a
p + I-vector Uij = (lij, zg), ... ,Zi~»)' of measurements corresponding to the pri
mary variate lij, and the p covariates.
Consider the same model and assumptions as in (2.1) with the only difference
that now en,···, enO r are all p + 1 vectors. The method of ranking after alignment
will also be used here. Proceeding as in the proof of Theorem 1 with N replaced by
N°, and s by nO, also with k ranging from 1 to p+ 1 and hence V NO is (p+ 1) x (p+ 1)
matrix which is partitioned as we partitioned V N Next partition V NO as
where vNOOO is 1 xl, vNoo+is 1 x p, and V NO++ is P x p. Moreover, corresponding to
(2.12) let the (p + 1) x 1 residual rank-statistics and its dispersion matrix be
(2.19)
Now, for each j = 1,· .. ,r, obtain similarly a residual rank statistic. Finally, let the
1 x r residual vector corresponding to the different treatments be
(2.20)
45
![Page 53: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/53.jpg)
penote by VNoooCNo, the dispersion matrix of nOl/2TNOO' where CNo is the r x r
matrix whose elements are given by (2.6) with N replaced by N°.
2.2.4 Construction of the Test Statistics"
The next step consists of combining information from all subsets. Since the tests
developed in the different subsets are independent, we will take a weighted linear
combination of these tests with the weights being the inverses of the corresponding
dispersion matrices. Note that for such a combination to make sense the dimensions
of the tests should be the same.
One possible way to get around this difficulty would be to reduce the order of
the q x r matrix in 2.13 into a 1 x r vector by taking a linear combination of this
matrix, say aTiVo, where a is of order 1 x q, subject to the two restrictions, i) a'a = 1,
and ii) the variance given in (2.12) is minimum. Although by doing this we are losing
information, the minimum variance condition maximizes the noncentrality parameter
of the chi-square distribution of the quadratic form based on the transformed residual
vector, which in turn maximizes the power of this test. Moreover, a can be ordered
if there is inherent ordering in the surrogate vector. Note that
..
TNO(X) aTNO
(aTNO,I' ... , aTNO,r)
.with dispersion vNXCN, where VNX = aViVooa'. To see this
Cov(aTNO,j' aTNO,jl) aCov(TNO,j' TNO,jl )a'
cjj'aVNOOa'
where cjj' is the corresponding element of CN. Let Al = VNXCN, A2 = VN"(Y:X)CN,,,
and A 3 = vNO OO CNO. Let .
W · = [A-I +A-I + A-I]-IA~I . 1 2 3t 1 2 3 t' Z= , ,
46
(2.21 )
![Page 54: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/54.jpg)
~ow, noting that the mean of each of the three statistics TNO(Xl' TNvO(Y:X), and T NOO
is 0, following the Gauss-Markov Theorem we consider the linear combination
T O· W T*' W T*' W *'= I NO(X) + 2 NvO(Y:X) + 3T NOO (2.22)
Assume that :. ~ PI, and :0 ~ P2 for positive and finite real PI and P2. Then SI/2To·
is conditionally distribution-free, under the permutational model, with °mean and
covariance matrix
and the permutational multivariate central limit theorem of Sen (1983) applies here.
Thus, for large N, Nv , and N°, SI/2To· "" N,. (0, A). Finally consider the overall test
statistic
(2.23)
The exact permutational distribution of LO· can be obtained by enumeration if N,
Nv , and N° are small to moderate. It maybe noted that rank(CN) = rank(CNv ) =
rank(CNo) = r - 1. Thus for large N, Nv , and N° the null distribution of LO· by
Cochran (1934) is asymptotically a chi-squared with r -1 degrees of freedom since it
is a quadratic form of asymptotically normal variates with discriminant of rank r - 1.
2.2.5 Asymptotic Non-null Distribution of LO·
For the study of the non-null distribution of LO· we will consider local alterna
tives only. Fixed alternatives lead to consistent tests in the sense that the power
goes to one in large samples, which makes it hard to compare them. Thus they
will not be considered further. Let us first work with the surrogate set I. Write
Gi(u) = Gi(X,Z), x E n:q, z E Rrp
, and consider the following sequence of alterna
tive hypotheses:
(2.24)
47
![Page 55: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/55.jpg)
where ..x = ((.\;k»)) stands for an r x q matrix of treatment effects. Let G~~]
S-1 Li=1 G~~~, 1 :s; j :s; r, 1 :s; k :s; q, and let
Thus, under {HN }
j=l,···,r, k=l,···,q;
Assume that G~VJ(x), j = 1,"', r, k = 1,"', q, i = 1"·,, s are all absolutely
continuous. Moreover if each G~k) ~ G(k) then lims-+oo S-1 Li=1 G~k)(x) = G(k)(x).
Also, asymptotically jj~) (x) is equal to G(k)(x), which is the limit of G~~](x). Let
g(k) (x) be the density function corresponding to G(k) (x).
Now expand G(k)(X - N-1/2>..Y») in a Taylor series around x for a fixed x. We
have for large N
(k)J1N,j JOO (k) -(k) -(k)
-00 J [HN (x)]dGs[j] (x),
1: J(k) [G(k)(x)] dG(k)(x - N-1/2>..;k») + O(N-1/2)o
i: J(k) [G(k)(x)] dG(k)(x) +i: J(k) [G(k)(x)] d [G(k)(x - N-1/2>..;k») - G(k)(x)]
+o(N-1/
2)
•
Integrate by parts the second integral in the last term
J1~!j lo1 J(k)(u)du -1: [G(k)(x - N-1/2>..;k») - G(k)(x)] :x J(k)(G(k)(x)) + o(N-1/2)
[1 J(k)(u)du + (N-1/2)>..;k) JOO ~J(k)(G(k)(x))g(k)(x)dx + O(N-1/2)Jo -00 dx
[1 J(k)(u)du + (N-1/2)>..;k) joo ~J(k)(G(k)(x))dG(k)(x) + O(N-1/2)Jo -00 dx
where the last two steps resulted from a Taylor series expansion of G(k)(x_(N-1/2>..Y»)
around x for a fixed x. Thus, if we let
48
![Page 56: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/56.jpg)
then from the above we have
N1/2[J1~!j - fa1 J(k)(U)dU] -+ A~k)B(G(k)), j = 1,···,r as N -+ 00
By Theorem (7.3.12) of Puri and Sen (1971) [N1/2(T~} - J1~!J, j = 1,···, r, k =
1, ... , p+q] has asymptotically a multinormal distribution with null mean and disper
sion matrix that converges to the same dispersion matrix under the null hypothesis
namely, VN. Also by condition (a) of (2.2.1) we have
Thus in the light of the above discussion
(2.26)
Now, by Lemma (7.3.10) and by the convergence model in Section (7.2.4) of Puri
and Sen (1971), 'VN '" v defined in (2.10) with bkkl.jjl all defined for the limiting
distributions of the average c.d.f. 'so Let v I be the version of v in the surrogate set.
Partition VI as in (2.11) and corresponding to (2.12) and (2.13) let
T * T -1 TONO = NO - VI0+ V l++ N
Thus, let 111 = ((1])k))) where 1])k) = A)k)B(G(k)), j = 1,···,r, k = 1,···,q. Partition
11 I into 1110 corresponding to the surrogates, and 11~ corresponding to the covariates.
Let TNO(x) = aTNO' then for large n
E ( 1/2T* IH ) ( -1 0)' *S NO(X) N -+ a 1110 - vlo+vI++111 = #-tID
and its dispersion matrix is vixeN where vix = avjooa'.
In a similar fashion, define {HN} for the validation set 1* such that ((A)k))) is
now an r x (q + 1) matrix. Also, 111. is defined in the same way as 111 with the
49
![Page 57: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/57.jpg)
difference that k = 1,· .. ,q + 1. V Nv rv VI* which is the version of VI defined in the
validation set ]*. Thus, corresponding to (2.14) we have
Partition the r x (q + 1) expected value matrix "71* and the (q +1) x (q +1) dispersion
matrix V 1* as follows
* ( vj*yyv I • OO =
*'vI*YX
where vj*yy is 1 x 1, vj.yX is 1 x q, and vj.xx is q x q. Hence, for large n*
d h d·· . f *1/2 T * . * C han t e IsperSlOn matrIX 0 n NvO(Y:X) IS vI*(Y:X) 1* were
Finally, along the same lines, we define {HN } for the subset ]0 such that .\ is now
an r x 1 vector. Also, "710 is defined in the same way as "7-[. V NO rv V 10 which is
the version of V I defined in the subset ]0. Thus, for large n the expected value of
no1
/2Tl'voo under {HN } converges to p,joo = "710 with dispersion matrix VIOOOCIO.
Consider the weights Wi defined in (2.21) where now we have Al = vjXCN,
A 2 = vj*(y:X)CNv ' and A 3 = vjoooCNo. Thus, under {HN }, the mean of n1/
2To* in
(??) is for large n
Therefore, from Theorem (7.3.12), Lemma (7.3.10), and Theorem (2.8.2) of Puri and
Sen (1971) and under {HN}, LO· defined in (2.23) has asymptotically a noncentral
chi-squared distribution with r -1 degrees of freedom and the noncentrality parameter
50
![Page 58: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/58.jpg)
Chapter 3
Nonparametric Intra-Block
Inference
Blocking in experimental design is .done in order to use experimental units as nearly
homogeneous as possible. Complete block designs lose their efficincy due to the fail
ure to eliminate heterogeneity among units when the number of treatments to be
compared is not small. In such cases, incomplete block designs are used where ex
perimental units are divided into blocks containing fewer units than the number of
treatments to be compared. Comparisons of treatments with equal accuracy may re
quire equal number of replicates and other constraints leading to balanced incomplete
block designs (BIBD).
3.1 Balanced Incomplete Block Designs
3.1.1 BIBD for Surrogate Set I
Consider n replications of a BIBD consisting of s blocks of constant size r(~ 2) to
which v treatments are applied such that:
(i) No treatment occurs more than once in any block,
51
![Page 59: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/59.jpg)
(ii) The ph treatment occurs in rj(~ s) blocks j = 1,··· ,v, and
(iii) The (j, j')th treatments occur together in rjj'(> 0) blocks (j =I- j' = 1, ... ,v).
Let Si stand for the set of treatments occurring in the ith block, i = 1; ... ,s. For the
a th replicate, the response in the ith block receiving the jth treatment is a stochastic
V (X(l) X(q) Z(l) Z(p))' (X' z' )' fp + q-vector exij = exij, .•. , exij, exij,···, exij = exij, exij 0 measurements
corresponding to the p covariates and the q surrogates. Consider the model
V exij = JL ex + f3 exi + Tj + Eexij, j E Si, i = 1,···, s, a = 1,···, n (3.1)
where JL ex is the replicate effect, f3 exi is the block effect, TI,···, Tv are the treatmentI
effects, and Eexij are the error vectors. The following assumptions are made:
a) V exi = (Vexij,j E Si) has a continuous r(p + q)-variate c.d.f. Gi(u), u E
Rr(p+q), i = 1,··· ,So
b) The joint c.d.f. of [Zexij, j E Si] is symmetric in its r p-vectors. This IS the
concomitance assumption in ANOCOVA.
c) [Eexij,j E Si] have a jointly continuous cumulative c.d.f. G(2:I,···,2: r ) which is
symmetric in its r vectors. This includes the i.i.d. assumption of the distribution
of [Eexij,j E Si, i = 1,··· ,s] as a special case.
d) The joint distribution of [Zexij, Eexij,j E Si] is exchangeable and is the same for
all i = 1,··· ,So
Note that the Tj is a (p + q)-vector with two components: Tjl is a q-vector of
treatment effects corresponding to the surrogates, and Tj2 a p-null vector of treatment
effects for the covariates, since we assume no interaction between treatment and
concomitant variates. We wish to test the null hypothesis, How,
How: Tjl = 0 j = 1,·",v,
52
..
![Page 60: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/60.jpg)
while the set of alternatives relates to shifts in location due to treatment effects. To
adopt the method of ranking after alignment [see Chapter 7 of Puri and Sen (1971)],
define the aligned observations by
U:ij = U Olij - r-1 L U Olil j E Si, i = 1,"', S, a= 1"·,, n.
IESi
Let
-1 "" . S . 1Tj,i = Tj - r L..J TI, J E i, Z = ,.", S.
IESi
-1 ""eOlij = EOlij - r L..J Eoil,
IESi
Then we have
j E Si, i = 1, ... ,S, a = 1,' .. ,n.
U:ij = Tj,i + eOlij, j E Si, i = 1,,·, ,S, a = 1",· ,no
Thus if F(zl,···, zr) is the c.d.f. of eOlij,j E Si, and if Fr(p+.q) stands for the class of
all G(Zl,···, zr) which are symmetric in their r (p + q)-vectors, then
Moreover, the joint distribution of (Z:ij' eOlij, j E Si) is als0 exchangeable. For the
kth variate, rank the observations ugi*,···, u~~~* in ascending order of magnitude
and denote by R~~} the rank of Ui7]* in this set, for i = 1, ... ,S, j E Si, a = 1, ... ,n,
d k 1 Th d· U* (U(l)* U(p+.q)*), han =, ... ,p + q. us, correspon mg to Olij = Olij,···, Olij we ave a
k R (R(l) R(p+.q)),· 1 . S d - 1ran vector Olij = Olij, ... , Olij , Z = ,... ,s, J E i, an a - , ... , n.
For each N(= nsr) and each k = 1,'" ,p + q, we define suitable rank scores
a (k) - (a(k) ... a(k) ) where a(k). - fk)(J'j(N + 1)) 1 < J' < N Moreover J(k)(u)N - N,l' 'N,N' N,] - N , - -' , N
is defined in accordance with the Chernoff-Savage convention, i.e., we assume that
JJ.P (u) satisfies the conditions a, b, c of Section 2.2.1. For notational simplicity, let
TJ (k) - a(k) J' E S· ,; - 1· S '" - 1 ... n'Olij - (k)' ., • - , •• , , ~ -, "N,R"'iJ
s n
TJ~:~ = r-1 L TJ~:;, TJ~~! = S-l L1J~:~, and 1J.\~) = n-
1 L 1J;:'!jESi i=l 0l=1
53
![Page 61: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/61.jpg)
f<?r k = 1,' .. ,p + q. The proposed test is based on the statistics
(k) _ 1~" (k) . _ _TN,j -;; ~.~ T/aij' J -l,···,v,k -l,···,p+q.
a=l zEPJ
where Pj = {i : j E Sil, j = 1," " v. Let TN = ((T~~))j=l,. .. ,v,k=l'."'P+q.
Tests based on aligned ranks are only conditionally distribution free. Define
U~i = (U~ij,j E Si) ,Ua = (U~l""'U~s), a = 1,···,n, and URr = (Ui,···,U~).
Hence, by (3.2) we have the following three arms of the permutational law:
(i) The joint distribution of U~i remains invariant under any permutation of the r
vectors among themselves, there being r! such permutations, i = 1,' .. ,s,
(ii) As U~i' i = 1,' .. , s are i.i.d., the joint distribution of U~ remains invariant under
any permutation of the s sets U~i' i = 1," " s, among themselves, there being
s! such permutations,
(iii) As replicates are independent, the joint distribution of URr remains invariant
under any permutation of the n sets U~, a = 1,"', n among themselves.
Thus, if we define (In to be the compound group of transformations {9n} by
U o . U o* [(l)U* (n)u*] (a) r. 1On N = N = 9n 1"" ,9n n' 9n E ~n, a = ,"', n.
Then (In contains [s!(r!yt transformations, and under How it leaves the joint distri
bution of U~ invariant. Let S(U~) = {U% = OnU~ : gn E (In}. Then, it follows
from the above discussion that
P [U?v = uo; IS(U?v), How] = l/N*;N* = [s!(rwt,
for all uo; E S(U?v), whenever G E Fr(p+q). Let us denote this conditional (permu
tational) probability measure by rn. As rn is completely specified the existence of
conditionally distribution-free tests for How is thus established.
54
"
![Page 62: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/62.jpg)
Theorem 3.1 Let
I ns(1) _ "" "" "" [(k) (k)] [(k') (k')]
VN,kk' - -s- L.J L.J L.J "lOlij - "lai. "lOlij - "lai. ,n r 01=1 i=l je8i
and
k, k' = 1, ... ,p + q,
I ns(2) _ "" "" [(k) (k)] [(k
l
) (kl)]
VN,kk' - - L.J L.J "lai. - "la.. "lai. - "la.. ,ns 01=1 i=l
k k' = 1 .. , p + q', " ,
Y (l) (( (1»)) d y(2) (( (2) ))N = VN,kk' an N = VN,kk' .
Also, let
A(l) = ((aJ~~)) and A(2) = ((aJ~~))
where
aJ~~ = [srjj' - rjrj'] /(s - 1), j,j' = 1,"', v,
where hjj , is the usual Kronecker delta. Finally, let
BN is a (p + q)v x (p + q)v matrix. Then
Proof: Note that for a given 0:',0:' = 1," . ,n, under rn the probability that i will be
any of 1" .. ,s is ~, and the probability that j E Si is ~, hence
1 ~ "" ( (k»);:; L.J L.J E pn "lOlij. 01=1 iePj
H; i~ Ur t, j~, ~~~l)r .71(k) i = 1 .. , s J' = 1 ... v k = 1, ... ,p + q.J'/... , , " ,
55
![Page 63: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/63.jpg)
¥oreover, let A = Hi, i') : i E Pj, i' E Pj" i =I- i'}, and note that then the cardinality
of A is rjrj' - rjA= rj(rj - 1) if j = j'). Then
nE [r(k) - E (r(k))] [r(k') - E (r(k'))]Pn N,) N,) N,) N,)
[(1~"( (k) (k))) (1 ~" ((k') (k/)))]nEpn - L- L- 1]OIij - 1]01.. X - L- L- 1]{3i'j - 1]{3 ..
n 01=1 iEPj n {3=1 i'EPj
n
n-1 I: I: E pn [(1]~:} -1]i~~) (1]~:; - 1]i~:))]01=1 iEP}
n
+n-1 " " E [(1](~). _1](k)) (1](~;) _1](k' ))]L- L- Pn at) 01.. 01' ) 01 ..01=1 (i,i')EA
n-1 "" [( (k) (k)) ((k') (k'))]+n L- L- E pn 1]OIij - 1]01.. 1]{3ij - 'fJ{3 ..
0I#{3=1 iEPjn-1"" [( (k) (k)) ((k') (k'))]+n L- L- E pn 'fJOIij - 'fJ0I .. 'fJ{3i'j - 1]{3 ..
0I#{3=1 (i,i')EA
Note that the last two terms are null because the replicates are independent and
E pn ('fJ~:; - 'fJi~:) = O. The first term yields
n
-1 " " [( (k) (k)) ((k') (k'))]n L- L- E pn 'fJOIij - 'fJ 0I .. 'fJOIij - 1]01 ..01=1 iEP)
56
..
![Page 64: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/64.jpg)
The second term gives
n-1 "'" "'" E [( (k) (k)) ((k
/) (k
/))]n ~ ~ Pn TJOIij - TJ OI .. TJOIilj - TJOI ..
01=1 (i,i/)EA
-1 ~ "'" E ((~) (~/).) _ rj(rj - 1) ~ ( (k) (k /))n ~ ~ Pn TJOI,)TJOI,I) ~ TJOI .. TJOI ..
01=1 (i,i/)EA n 01=1
- n-1t 2: [ 1 t (~2: (k)) (~L (~/))]01=1 (i,i/)EA S(S - 1) i#i
'=1 k jEBi TJm) k jEBi, TJOI,,)
_ rj(rj - 1) ~ ( (k) (k /))~ TJ OI .. TJOI ..
n 01=1
-1 ~ "'" [ 1 (~~ (k) (k/) ~ (k) (k
l))]
n ~ ~ S(S _ 1) ~ ~ TJOIi. TJOIi ' . - ~ TJOIi. TJOIi.01=1 (i,i/)EA i=1 i ' =1 i=1
_ rj(rj - 1) ~ ( (k) (k /))n ~ TJ OI .. TJOI ..
rj(rj - 1) [~82 (k) (k /) _ ~ ~ (~) (k /)]
( 1) ~ TJOI.. TJOI.. ~ ~ TJOI'. TJOI'.n8 S - 01=1 01=1 i=1
_ rj(rj - 1) ~ ( (k) (k /))n ~ TJOI .. TJ OI ..
rj(rj - 1) ~~ ( (k) (k /) _ ~ ~ (~) (~/))]( 1) ~ TJ OI .. TJOI.. ~ TJOI .. TJm.
n 8 - =1 8 i=1
rj(rj - 1) ~ ~ ( (k) (k /)) (k) (k /)- (8 _ 1) ~ ~ TJOIi. TJOIi. - 8TJOI .. TJOI ..
n8 01=1 i=1 .
_ rj(rj - 1) {~ ~~ [ (~) _ (k)] [ (~/) _ (kl)]}
( 1) ~~ TJw. TJOI.. TJOI'. TJOI ..8 - n8 01=1 i=1
r·(r· -1) (2))) v- (8 - 1) N,kk'
Hence, the sum of the two terms give
C (T (k) T(k/))n ovPn N,j' N,j
(1) (1) (2) rj(rj - 1) (2)ajj'vN,kk' + rjVN,kk' - (8 _ 1) vN,kk'
(1) (1) (2) (2)a· "vN kk' +a· 'IvN kk')), )),
On the other hand, let B = {(i, i') : i E Pj, i' E Pjl, i = i'}. The cardinality of B is
rjj', Thus
nE [T(k). - E (T(k))] [T(k'), - E (T(k"~)]Pn N,) N,) N,) N,)
_ [(1~ "'" ((k) (k))) [1 ~ "'" ((k/) (k
/)))]- nEpn ;:; ~.~ TJOIij - TJOI.. x;:;~ .,~ TJ{3i l j' - TJ{3 ..
01=1 'EP) {3=1 , EPj'
57
![Page 65: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/65.jpg)
n
-1 "" "" E [( (k) (k») ((kl
) (kl
»)]n L- L- Pn 'rJo:ij - 'rJ0:.. 'rJo:i'jl - 'rJ0: ..0:=1 (i,i/)EB
n
+n-1 L L Epn [( 'rJ~:} - 'rJi~!) (7]~::}, - 7]i~~»)]
0:=1 (i,i')EA .
n
-1 "" "" E [( (k) (k») ((kl
) . (kl
»)]+n L- L- Pn 7]o:ij - 'rJ0:.. 7]f3i' jl - 7]13 ..o:f:.f3=l (i,i/)EB
n
-1 "" "" E [( (k) (k») ((kl
) (kl
»)]+n L- L- Pn 'rJo:ij - 7]0/.. 7]f3i ' jl - 7]13 ..o:f:.f3=l (i,i/)EA
Note that the first term in this last equality is the sum of products in blocks where
pairs of treatments (j,j') occur together, while the second term consists of the sum
of products in blocks where treatments (j, j') occur in different blocks. The last two
terms are null due to the independence of replicates. The first term then yields
n
-1 "" "" E [( (k) (k») ( (k' ) (kl
»)]n L- L- Pn 7]o:ij - 'rJ0:.. 7]o:il jl - 7]0: ..0/=1 (i,i / )EB
n nn-1"" "" E ((~), (k:~/) _ rjjl "" (k) (k' )L- L- pn 7]0:1J 'rJ0:1 J L- 'rJ0/ .. 'rJ0: ..
0/=1 (i,i/)EB n 0:=1
rjjl ~ [ 1 ~ "" (k) (k l),] _ rjjl ~ (k) (k')
n L- sr(r _ 1)~ ,L- 7]0:1)7]0:1J n L- 'rJ0: .. 7]0: ..0:=1 1=1 Jf:.J/ES; 0/=1
rjjl 1 ~~ ["" "" (k) (k' )' (k) (k l)]
---;:;: sr(r - 1) L- L- L- ,L- 7]o:ij7]o:ij' - L 7]O/ij'rJo:ij0/=11=1 JES; J/ES; JESi
n_ r jj' "" (k) (k' )
n ~ 7]0/ .. 'rJ0: ..
I ns nrjjl _ "" "" r 2 (~) (~') _ rjjl "" (k) (k')
1 L- L- 'rJm. 'rJ0:1. L- 'rJ0: .. 7]0: ..r - nsr 0:=1 i=l n 0:=1
_ rjjl 1 ~ ~ ["" (k) (k ' ) (k) (k' ) + (k) (~/)]r _ 1 nsr L- ~ L- 7]O/ij7]o:ij - r'rJo:i. 'rJo:i. r7]o:i.7]o:i.
0/=1 1=1 JESi
r' 'I [~ ~ (~ (~) (~/) _ s (k) (k l»)] _ rjjl V(l) IJJ ns L- ~ 7]0:1. 'rJm. 7]0: .. 7]0:.. _ 1 N,jj
. 0/=1 1=1 r
rjjl (1) (2)---VN "I + r'J'IvN "Ir - 1 JJ J JJ
And the second t~rm simplifies into
n
-1 "" "" E [( (k) (k») ((kl
) (kl»)]n L- L- Pn 7]o:ij - 7]0:.. 'rJO/iljl - 'rJ0: ..
0:=1 (i,i/)EA
58
..
..
•
![Page 66: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/66.jpg)
n , n .
n-l'" '" E (~~ (~:~/) _ TjTj - Tjjl '" (k) (k /)L...J L...J l"n "lellt) "lat ) L...J "la .. "la ..a=l (i,i/)EA n a=l
TjT; - Tjjl ~ [ 1 ~ (~) (~/)] _ TjT; - Tjjl ~ (k) (k /)n L...J s(s _ 1) L...J "l0it. "l0it. L...J "lau "la..
a=l i:;t:i'=l n . a=l
TjT; - Tjjl ~ [S2 (k) (k/) _ ~ (~) (~/)] _ TjT; - Tjjl ~ (k) (k/)- ns(s - 1) L...J "lau "la.. ~ "lat. "lat. n L...J "la .. "la ..
a=l t=l a=l
_ (TjT; - Tjj/) [~ ~ (~ (~) (~/) _ (k) (k/»)]- S - 1 ns L...J ~ 1]OH. "'Ctz. TJa .. TJa ..
a=l t=l
(TjT; - Tjj/) (2)- s -1 VN,kk'
Therefore, the sum of the two terms gives
Hence the theorem.
Define the following•
•
I ns
(k,k/)( ) _ '" '" '" [( (k)* (k
/)*) ]HN,l x,y - N L...J L...J L...J I Uaij ,Uaij :::; (x,y)
a=li=ljE~ .
I ns
(k,k') ( ) '" '" '" [( (k)* (k/)*) ]
H N ,2 X, Y = nT(T _ 1)~ f:t #7es; I Uaij ' Uaijl :::; (x, y)
Moreover, denote the marginal c.dJ of ul;j* by Fi)k)(x) and note that under How it is
equal to F(k)(x) which is independent of i and j. Also, for every i = 1,,", s, denote
(k,k /)( (_ (k,k /)( ) d H ) . (k)* (k /)*) .by F ij x) - F l x, y un er ow the margmal c.dJ. of Uaij ,Uaij . Fmally,
for j =1= j' = 1,'" ,v, let Fj~~;k')(x)(= F?,k/)(x,y) under How) be the marginal c.d.f.
f (U(k)* U(kl)*) D fio aij, aij' . e ne
i = 1,2, for k,k' = 1,"',p+q, where Ilk = J~J(k)(u)du. Note that Ffk,k)(x,y)
reduces to the univariate c.d.£. F(k)(x) as x = y almost everywhere. Further let
. _ (( (kk /»)) . _ .V t - Vi k,k'=l,.u,p+q, Z - 1,2,
59
![Page 67: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/67.jpg)
A = [(r ~ 1)] A(I) + [(s:. 1)] A(2); ,
B = [(r ~ 1)] A(I) _ [(s -ll~r - 1)] A(2);
{} = A ® VI - B ® V2
Theorem 3.2 Under How, B N as defined in Theorem 3.1 converges in probability
(as n -+ (0) to {} defined above.
Proof: Note that R~~~ = NH<;) (U~7]*). Hence, TJ~~} = J<;) [~IH<;) (U~7]*)]. It
follows that
Let us rewrite V~!kk' as follows
..
(1 )vN,kk'
1 n s
- L L L [TJ~~} - TJ~:~] [TJ~::) - TJ~~?]nsr 0=1 i=1 jESj
n~r ttL TJ~:}TJ~:j - :s t t [~ L TJ~:}] [~ .L TJ~~:~]0=1 t=1 JESj 0=1 t=1 JESj J'ESj
r - 1[_1 ~~ '" (~). (~')] _r - 1[ 1 ~ ~ '" (~). (~')]L..J L..J L..J TJOtJTJOtJ ( 1) L..J L..J L..J TJOtJTJOItJ ,
r nsr 0=1 i=1 jESj r nsr r - 01=1 i=1 j=f::j'ESj
r - 1Jf fk) [ N H(k)(x)] f k') [ N H(k')()] dH(k,k')(x )_r JR2 N N + 1 N N N + 1 N Y N,1 , Y
r - 1Jf fk) [ N H(k)(x)] f k') [ N H(k')()] dH(k,k')(x )r JR2 N N + 1 N N N + 1 N Y N,2 ,Y
p r - 1 [kk' kk']-+ -- lI1 - lI2r
where R2 is the the Euclidean plane, and the last term follows from Theorem 5.4.2 of
Puri and Sen (1971). Note that H<;) -+ H(k) (= p(k)under How), where H(k) is the
population combined c.dJ. Hence we have
60
..
![Page 68: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/68.jpg)
L 't (2).et US rewn e VN kk' as,
(2)VN,kk'
The first term in the last equality gives
1 n s
- L L [(1]~~~ -1]~.~)) (TJ~~:) - TJ~~'))]ns 0'=1 ;=1
1 n s
- L L TJ~~~TJ~::) - TJ~~)TJ~.~')ns 0'=1 ;=1
2-~ ~ [~" (~).] [~" (~'>,] _ (k) (k')L.J L.J L.J TJm) L.J TJm) TJ ... TJ ...ns 0'=1 ;=1 r jESi r j'ESi
1 [ 1 ~~" (k) (k')] r - 1 [ 1 ~~" (k) (k')]- - L.J L.J L.J TJO';j1]O';j + -- (. 1) L.J L.J L.J TJO';j1]O';j'r nsr 0'=1 ;=1 jESi r nsr r - 0'=1 ;=1 j1=j'ESi
-TJ.(.~)TJ~.~')
~Jr J(k) [ N H(k)(x)] J(k') [ N H(k')(y)] dH(k,k')(x y) +r JR2 N N + 1 N N N + 1 N N,I'
r - 1 Jr J(k) [ N H(k)(x)] J(k') [ N H(k')()] dH(k,k')(x )r JR2 N N + 1 N N N + 1 N Y N,2 ,y
-TJ.\~)1].\~')
p 1 kk' r - 1 kk'-+ -VI + --V2r r
where the last term follows from Theorem 5.4.2 of Puri and Sen (1971). The second
term gives
61
![Page 69: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/69.jpg)
_..!.. {_I~ ~ " [( (~). _(k)) ( (~') _ (k'))]}- LJ LJ L.-i 'fJcx~J TJ... TJO:~J 77...sr nsr 0'=1 i=1 jESi .
_ r - 1 { 1 ~~ " [( (k). _ (k)) ( (k') _ (k'»)]}sr nsr(r - 1) L.J~ . ~ 'rJQ<J 'rJ... 'rJQij' 'rJ ...
0'=1 <=1 J::f:.J'ESi
S - 1 { 1 ~ ~ [( (k) (k») (k') (k'»)] }--s- ns(s _ 1) L.J .~ 'rJQi. - 'rJ... 'rJQi'. - 'rJ ...
0'=1 <::f:.<'=1
p 1 kk' r - 1 kk'-+ --v - --v
sr 1 sr 2
Note that the last term in the last equality converges to zero in probability. To
see this, remember that {'rJ~~~} are independent for every Q - 1,"', n, i = 1,"', s.
Moreover, if we let FnQi(x) = ~ 'LjESi I(Ui7J* $ x), then
( 'rJ(~) - 'rJ(k») = joo J(k) [ N H(k)(x)] dFnQi(x) _ 'rJ(k)w. ... -00 N N + 1 N ...
~ fal J(k)(u )du - J.l(k) = o.
Thus, if we let ('rJ~~~ - 'rJ~.~)) = Mi7) ,then E pn Mi7) ~ 0, and .
1 ~ ~ [( (k) (k)) (le') (k'))]( _ 1) L.J L.J 'rJQi. - 'rJ... 'rJQi'. - 'rJ ...
ns s 0'=1 i::f:.i'=1
((k) (k'))E pn MQi MQi, .
p-+ O.
Therefore,
(2) P [(s - 1)] (kk') [(S - l)(r - 1)] (kk')vN kk' --+ VI +. V2's sr
Hence
vW~ [s:' 1] [VI + (r - 1)V2] .
Thus it is clear now how under How, B N ~ n.
62
•
![Page 70: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/70.jpg)
Theorem 3.3 LetVe:c(TN)' a (p+t 1 )vx1-vector, denote the rolled out form ofTN.
Then n ~ (Vec (TN) - 1/) converges, in probability, to a multivariate normal distribu
tion with null mean vector and dispersion matrix B N.
The proof is found in Theorem 4:1 of Sen (1969). Now; construct the resid
ual rank statistics as we did in the complete block design. Partition [Vec (TN) - 1/]
into two components: the first component, TNO, is a qv X 1 vector of the centered
linear rank statistics corresponding to the surrogate variables, and the second com
ponent, T~, is a pv X 1 vector of centered linear rank statistics corresponding to the
concomitant variates. Simillarly, partition the matrix B N as we partitioned V N in
(2.11)
(
BNOOBN=
B'rvo+
B NO+ )
B N ++
(3.3)
where B NOO is a qv X qv matrix, B NO+ is qv X pv and B N++ is a pv X pv matrix.
Then, from the classical normal theory, (see Theorem 2.5.1 of Anderson (1984)), for
large N we have
Define the qv X 1 residual rank-statistics TN-O' and BN-OO' the covariance matrix of
n1/2T'No as follows
B'Noo B B B (-l) B'NOD - NO+ N++ NO+ (3.4)
3.1.2 BIBD For The Validation Set 1*
Consider n* replications of a BIBD consisting of s* blocks of constant size r*(2:: 2) to
which v treatments are applied such that the conditions i, ii, and iii in Section 3.1.1
are satisfied.
63
![Page 71: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/71.jpg)
Let Si stand for the set of treatments occuring in the ith block, i = 1, ... , s*. For
the a th replicate, the response in the ith block receiving the jth treatment is a stochas-
. U (lJ X(l) X(q) Z(l) Z(p))' ("t.r X' Z')' ftIc P+q + 1-vector OIii = .L OIi;"l exij'" • , exij' exii'·'" exii = .L i;"l OIij' exij 0
measurements corresponding to the primary variate, p covariates and the q surrogates.
Consider the the same model in (3.1) and the assumptions following it with
the difference that here we regress the primary variate and the surrogates on the
concomitant variates. We basically repeat what we have done in the surrogate set with
N replaced by N v(= n*s*r*). Thus B N in Theorem 3 is now a (p+q+l)v x (p+q+l)v
matrix ,BNv ' which we partition as in (3.3). Then, corresponding to (3.4) we have
B NvOO B B B(-l) B'NvOO - NvO+ Nv++ NvO+ (3.5)
Partition the (q + l)v x I-vector T NvO ' and the (q + l)v x (q + l)v matrix B NvOO as
follows
T NvO = (T~vo(Y), T~vo(X))'
(BN.,Yy B NVY.X ).B NvOO =
B~vYX B NvXX
where B NvYY is v x v, B NvYX is v x qv, and B NvXX is qv x qv. Also let
TNvO(Y:X)
BNv(Y:x)
T* B* B*(-l) T*NvO(Y) - NvYX NvXX NvO(X) ,
B * B* B* B*'NvYY - NvYX NvXX NvYX·
(3.6)
(3.7)
Note that the dispersion matrix of v x 1 vector n*1/2 TNvO(Y:X) is given by the v x v
matrix BNv(Y:X).
3.1.3 BIBD For The Set [0
Consider nO replications of a BIBD consisting of SO blocks of constant size
rOC~ 2) to which v treatments are applied such that the conditions i, ii, and iii in
(3.1.1) are satisfied.
64
![Page 72: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/72.jpg)
Let Si stand for the set of treatments occuring in the i th block, i = 1,"', so.
For the a th replicate, the response in the ith block receiving the jth treatment is a
. ((1) (p)) I ( ') IstochastIc p + I-vector U aij = Yaij,"', Zaij, .. " Zaij = }ij, Zaij of measure-
ments corresponding to the primary variate, and the p covariates.
Consider the same model in (3.1) and the assumptions following it with the
difference that here we regress the primary variate on the concomitant variates. We
basically repeat what we have done in the surrogate set with N replaced by N°(=
nOsOrO). Thus BN in Theorem 3 is now a (p + l)v x (p + l)v matrix ,BNo, which is
partitioned as in (3.3)
where B'N0oo is v x v, BNoO+ is v x pv, and V NO++ is pv x pv. Thus, corresponding
to (3.5) we now have the v x 1 residual rank-statistics T'N0o and the v x v dispersion
. B* f Ol/2 T * f 11matnx NOOO 0 n NOO as 0 ows
B'N°oo (3.8)
3.2 Construction of the Test Statistic
The next step consists of combining information from all subsets. Since the tests
developed in the different subsets are independent, we will take a weighted linear
combination of these tests with the weights being the inverses of the corresponding
dispersion matrices. Note that for such a combination to make sense the dimensions
of the tests should be the same.
We follow a similar approach as in the complete block case. Write the qv x 1-
vector, T'No, obtained from the surrogate set I, as a q x v matrix. Then take a linear
combination of this matrix, say aT'No, where a is of order 1 x q, subject to the two
restrictions, i) a'a = 1, and ii) the variance given in (3.4) is minimum. Although by
65
![Page 73: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/73.jpg)
qoing this we are losing information, the minimum variance condition maximizes the
noncentrality parameter of the chi-squared distribution of the quadratic form based
on the transformed residual vector, which in turn maximizes the power of this test.
Moreover, a can be ordered if there is inherent ordering in the surrogate vector.
Note that if we partition vW, and vW as we did for B N in (3.3), then we get
V~bo and V~bo each of which is a q x q matrix. Now write
TNO(X) - aTNO
(aTNO 1"'" aTNO v), ,
where {TN-O'i' j = 1, ... , v} are the q x 1 columns of the q x vmatrix T NO' Moreover,
Cov(aTNO,i' aTNO,i') aCov(TNO,j, TNO,j/)a'
[ (I)V(I) (2)V(2)]'a ajjl NOO + ajjl NOO a
Thus, denote the dispersion matrix of n I/2TNO(x) by B NX which is a v x v matrix.
Let Al = B NX' A2 = BNv(Y:X), and A 3 = B NOOO ' Let
W· . - [A-I + A-I + A-I]-IA:- I . 1 2 3,- 1 2 3 t' z= , , (3.9)
Now, noting that the mean of each of the three statistics TNO(x), TNvO(Y:X), and T NOO
is 0, following the Gauss-Markov theorem we consider the linear combination
(3.10)
Assume that nn. -t PI, and ~ -t P2 for positive and finite real PI and P2. Then nI/2To'
is conditionally distribution-free, under the permutational model, with °mean and
covariance matrix
and the permutational multivariate central limit theorem of Sen (1983) applies here.
Thus, for large N, Nv , and N°, nI/2To' ,..., Nv (0, A). Finally consider the overall test
statistic
(3.11)
66
![Page 74: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/74.jpg)
The exact permutational distribution of LO· can be obtained by enumeration
if N, Nv , and N° are small to moderate. Thus for large N, Nv , and N° the null
distribution of LO· by Cochran (1934) is asymptotically a chi-squared with v-I
degre.es of freedom since it is a quadratic form of asymptotically normal variates with
discriminant of rank v-I.
3.3 Asymptotic Non-null Distribution of L O*
For the study of the non-null distribution of LO· we will consider local alternatives
only. Let us first work with the surrogate set I. Write Gi(u) = Gi(x, z), x E Rrq , z E
Rrp , and consider the following sequence of alternative hypotheses:
(3.12)
where ~ = ((Ay))) stands for an v X q matrix of treatment effects. Thus, under {HN}
Fi\') (x);" F(') [x - n-1/
2 (A\') - ~ ~ AI'») ], j E S" k= 1,· .. ,q.
Assume that all Fi~k) (x) are absolutely continuous. Let f(k) (x} be the density function
corresponding to F(k) (x). Let
(3.13)
where1 s
H(k)(x) = - I: I: Fi~k)(x), k = 1,"" q.sr i=1 jeSi
Now expand F(k) [x - n-1/
2 (A)k) - ~ LIes; A~k))] in a Taylor series around x for a
fixed x. We will have
Thus, for large n we have
67
![Page 75: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/75.jpg)
I: J(') [F(')( x)] dF(') [x - n-1/' (Aj') - ~ I~ Ai'))]i: J(k) [F(k)(x)] dF(k)(x) +
I: J(') [F(') (x)] d {F(') [x - n-II' ( Aj') - ~ I~ Aj')) ] - F(')(x) }
+o(n-1/2
)
Now integrate by parts the second integral in the last term
(k)I-ln,ij r1
J(k)(u)du _jOO {F(k) [x _ n-1/ 2 ().)k) _ ~ L ).~k))] _ F(k)(x)} ~J(k)(F(k)(X))Jo -00 r IESi dx
+ o(n-1/
2)
r1J(k)(u)du + (n-1/ 2 ) ().)k) _ ~ L ).}k)) JOO ~J(k)(F(k)(X))f(k)(x)dx + o(n-1/ 2 )
Jo r IESi -00 dx
r1J(k)(u)du + (n-1/ 2 ) ().)k) _ ~ L ).}k)) JOO ~J(k)(F(k)(X))dF(k)(X) + o(n-1/ 2 )
Jo r lESi -00 dx
where the last two steps resulted from a Taylor series exapnsion of
around x for a fixed x. Thus, if we let
then from the above we have
Now, let Q~~lj = ~ I:~=l 7]~:}, j E Si, {= 1,"', s, k = 1,'" ,p + q. Note that
r;;3 = I:iEPj Q~~lj' Then by Theorem 5.1 of Sen (1969) [n 1/
2( Q~~lj -I-l~~lj)' j E Si, i =
1,"', s, k = 1,'" ,p + q] has asymptotically a multinormal distribution with null ..mean and dispersion matrix whose entries for i # if are zero, and for j = jf converge
to V~kl, and for j =I- jf converge to v;kl. Thus, by condition (a) of Section 2.2.1 and
68
![Page 76: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/76.jpg)
Theorem 5.1 of Sen (1969) we have
lim E [n1/2 (T(k) - r'''l(k l ) I HN]n-+oo N,J J ...
lim E[n1/2 (Tjp - " /1(k).) IHN]+ lim E[n1/2 (" (/1(k). - '1l(k»)) I HN]n--+oo ,J L..J n,tJ n--+oo L...J n,tJ 'I ...
iEPj iEPj
lim E [n1/2 " (Q(k). _ I/(k).) I HN] +n--+oo L..J n,tJ rn;"J
iEPj
n~~E [nIl' (~(I'~:!j - 1.' J(kJ(U)dU)) I HN]
- J~ E[n1/2(.L("l.~~l _fa1 J(kl(U)dU)) IHN]~EP}
EH~ Jim n'l' (I'~:!j - /.' jlkJ(U)dU)] IHN}
B(F(kl ) .L [>.Yl - ~ L >.}kl]~EP} lESi
B(F(kl ) " [>.(kl _ ~>.(kl _ ~ " >.(kl]LJ J rJ rLJ IiEP) I#jESi
B(F(kl ) {[rj(r -1)] >.;kl _ t rjjl >.;~l}r j':f.j=l r
Let (J = ((eY»)) j=l, ... ,v,k=l,. .. ,q , where
Then, by Theorem 5.2 of Sen (1969), {n1/
2 (T;t,} - rj"l~~»), k= 1, ... ,p + q, j = 1,' .. ,v}have jointly asymptotically a multinormal distribution with mean (J and dispersion
matrix fl. Denote by fl I the version of fl pertaining to the surrogate set, I, and by
(JI the version of (J in I. Partition (JI as ((JIO, (J~). Also partition flI as we partitioned
B N in (3.3) and corresponding to (3.4) let
1")* _ I") I") 1")(-1) 1")1uIwOO - UIwOO - UIO+UI++UIO+'
69
![Page 77: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/77.jpg)
Then for large n
and its dispersion matrix is njwx which is the population version of B'Nx.
In a similar fashion, define {HN} for the validation set 1iv such that ((>.~k))) is
now an v x (q + 1) matrix. Also, (J [tv is defined in the same way as (J [ with the
difference that k = 1,,' . ,q + 1. B Nv rv n[w which is the version of n defined in the
validation set 1iv. Thus, corresponding to (4.5) and (4.6) we have
TN-vo(y:X)
OJ,(y:X)
Hence for large n*
T * £")* £")*(-1) T*NvO(Y) - U ['YX U ['XX NvO(X) ,
n* £")* £")*(-1) £")*'HiI*YY - "~['YX"~['XX"~['YX'
E[ *1/2T* IH] (J £")* £")*(-1) (J' *n NvO(Y:X) N ---+ ['(Y) - U['YXU[,XX ['(X) = J..t['O(Y:X)
and the dispersion matrix of n*1/2TN-vO(Y:X) under {HN } is nj,(y:X)'
Finally, along the same lines, define {HN } for the subset 1° such that ~ is now-
an v x 1 vector. Also, (J [0 is defined in the same way as (J [ . Denote the expected
value of n01
/2 TN-oo under {HN } by /Ljoo, and its dispersion matrix is njoo'
Now, we consider the weights Wi defined in (3.9) where now we have Al = nN-X,
A 2 = nj,(y:X)' and A 3 = njoo' Thus, under {HN }, the mean of TO' in (3.10) is
Therefore, from Theorem 5.3 of Sen (1969) and under {HN }, LO' defined in (3.11)
has asymptotically a noncentral chi-square distribution with v-I degrees of freedom
and the noncentrality parameter
70
•
![Page 78: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/78.jpg)
Chapter 4
Recovery of Inter-Block
Information (RIBI)
The model introduced in (3.1) and the analysis following it extracts intrablock in
formation. However, when block comparisons are taken into account, the varietal
effects of treatments become more accurate; see Rao (1947). This section deals with
combined inference based on intra- and interblock information. A full treatment of
the parametric case is also found in Scheffe (1959).
The need for the recovery of inter-block information was first felt by Yates (1940)
in the context of incomplete block designs. Since the treatments are randomly al
located to incomplete blocks, it is reasonable to assume that the block effects are
random variables instead of fixed. Moreover, Yates (1940) noted that if the experi
mental material is fairly heterogeneous, treating block effects as fixed will result in
loss of information contained in the block totals.
71
![Page 79: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/79.jpg)
4.1 RIBI for Surrogate Set I
The model considered in (3.1) assumes that the block effects f301i, a = 1"", n, Z =
1,' .. ,s, are fixed. In some instances this may not be viable. It may be more rational
to assume that blocks are random variables. For example, consider blocks made
up of litters of cats where each litter, and hence each block, come from a different
mother. The mothers are assumed to be a random sample from an infinite population.
Consider the same setup as in (3). Denote the block totals by hOli, a = 1, ... ,n, i =
1,"', s. Then, (3.1) gives
where
hOli - L XijjESi
L [ltOi + f301i + Tj + EOIij]jESi
rltOi + L Tj + f OIijES;
rltOi + T OIi + f OIi (4.1)
fOli = rf301i + L EOIij, i = I,···,s, a = I,···,n.jESi
For each a = 1,' .. ,n, the block effects f3 01i' i = 1,'" ,s are' assumed to be random
variables with a joint distribution that is symmetric in its s arguments. Moreover,
in addition to the assumptions following (3.1), f3 01i and LjES; EOIij are assumed to
be independent. T OIi is a p + q-vector with two components, T OIil a p x I-vector of
treatment effects corresponding to the surrogates, and T OIi2, a q x I-zero vector of
treatment effects for the covariates since we assume no interaction between treatment
and concomitant variables. We wish to test the null hypothesis
HOB: TOIil = 0 i = 1,"', s.
while the set of alternatives relates to shifts in location due to treatment effects. Let
hOi = (hOlI,"', hOls). Also, let h N = (hI,"', h n ). Under HOB, we have the following
(i) The joint distribution of hOi remains invariant under any permutation of the s
vectors among themselves, there being s! such permutations.
72
![Page 80: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/80.jpg)
(ii) As replicates are independent, the joint distribution of h N remains invariant
under any permutation of the n sets hOI, a = 1,' .. ,n.
Thus, if we define On to be the compound group of transformations {gn} by
9 hN - h* - [g(l)h ... g(n)h] g(OI) E t: rv - 1 ... nn - N - n 1, 'n n, n ~n, \...< -, ,.
Then On contains [s!]n transformations, and under HOB it leaves the joint distribution
of hN invariant. Let S(hN) = {hN = 9nhN : gn E On}. Then, it follows from the
above discussion that
for all hNE S(hN). Denote this conditional (permutational) probability measure by
~~. As ~~ is completely specified the existence of conditionally distribution-free tests
for HOB is thus established.
For the kth variate, we rank the observations h~~), ... ,h~k) in ascending order of
magnitude and denote by S~~) the rank of h~~) in this set, for i = 1," . ,s, a = 1,' .. ,n,
and k = 1,'" ,p + q. Thus, corresponding to hOli = (h~l/, ... : h~+q»)' we have a rank
vector SOli = (S~~),· ., ,S~+q»)', i = 1,' .. ,s, a = 1,' .. ,n.
For each N'(= ns) and each k = 1,'" ,p + q, we define suitable rank scores
b~~ = (b~~,l,···,b~~,NI)' where b~~,j = J;;)(j/(N' + 1)), 1 :::; j :::; N'. Moreover,
J;;)(u) is defined in accordance with the Chernoff-Savage convention, i.e., we assume
that J;;) (u) satisfies the conditions a, b, c of Section 2.2.1. For notational simplicity,
we let
t(k) - b(k) ~ - 1 ... s '" - 1 '" n'"'OIi - I (k)' 0 -, ." \...< -, "N ,sa;s n
ei~) = S-l Le~~), e~.k) = n-1 Lei~)i=l i=l
for k = 1, ... ,p +q. The proposed test is based on the statistics
T (k). = _1 ~"t(~) . 1 k 1N',) . L.J L.J "'0/1 , J = ,"', v, = ,'" ,p + q.
nr) 0'=1 iEPj
73
![Page 81: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/81.jpg)
where Pj = {i : j E Sd, j = 1,' .. ,v. Let TN, = ((T;;'~J)j=I ..",v, k=I"",p+q,
Theorem 4.1 Let
(1) = ~~~ [t(~) _ dk)] [t(~') _ d k')]wN',kk' L-J L-J ~m ~o. ~o~ ~o. ,
ns 0=1 i=1
and
k k' = 1 ... p + q, " ,
W(2~ I = ~ ~ [d k) _ d k)] [dk' ) - t(k')] k k' = 1 ... p + q'N kk L-J ~o. ~.. c"o. c".. " " ,, n 0=1
(1) (( (1) )) (2) (( (2) ))W N' = wN,kk' and W N' = wN,kk' .
Also, let
where
d(l) _ srjjl - rjrjl .., .jjl- 2( 1)' J,J=l,···,v;
rj s-
d(~), = rjrjl .., 1JJ 2 ' J,J = ,"',v,
rj
Finally, let
C - V(l) iOI W(I) + V(2) iOI W(2)N' - '<Y N' '<Y N" -
eN' is a (p + q)v X (p + q)v matrix. Then
where J is a v x I-vector of ones.
Proof: Note that for a given Q, Q = 1,'" ,n, the probability under p~ that i will be
any of 1 ... s is 1. hence, , s'
74
![Page 82: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/82.jpg)
¥oreover, let A = {(i, i') : i E Pj, i' E Pj, i =I- i'}, and note that then the cardinality
of A is rjrjl - rjj/(= rj(rj - 1) if j = j'). Then
nEp ;. [Tt~j - E (Tt~j)] [T~~:~ - E (Tt:n]
nEp ;. [(~ t L: (e~~) - e~.k))) x (~ t L (e~::) - f~kl)))]nrJ a=1 iEPj nrJ f3=1 i'EPj
n~2 t L: Ep ;. [(e~~) - f\k)) (e~~/) - e~kl)) ]J a=1 iEPj
+n~~ t L: Ep ;. [(e~~) - e~.k)) (e~::) - e~kl)) ]J a=1 (i,i/)EA
+_1 ~ "E. [(e(~) - e(k)) (e(~/) - e(kl))]nr2 L.. L.. Pn at .. f3t ..
J a#f3=1 iEPj
+~ t L: Ep ;. [(e~~) - f~k)) (eh~:) - f\k l
))]
nrJ a#f3=1 (i,i/)EA
Note that the last two terms are null because the replicates are independent and
Ep ;. (e~:) - e~k)) = o. The first term yields
~ t L Ep ;. [(e~~) - e~k)) (e~:/) - e~.kl)) ]nrj a=1 iEPj
n~~ t .L: E p ;. (e~~) e~:/)) - r1 (e~.k) e~.k'))
J a=1 tEPJ J .
n~2 t L: [~t e~:) e~~/)] - :. ((~k) e.~kl))J a=1 tEPj t=1 J
_1_ t [~ te~:)e~~') - d~)d~/) + ei~)ei~')]nrj a=1 S i=1
-~ t (e~k)e~kl))r J a=1
_1_. t t [e~:) - ei~)] [ei~/) - d~/)]nsrJ a=1 i=1
+_1 t ei~)ei~') - ~ (f(.k)e~kl))nrj a=1 rj
_1_ ~~ [e(k) _e(k)] [t(~/) _ t<kl)]nsr' L.. L.. at a. ~at ~a.
J a=1 t=1
+n~' t [d~) - e~.k)] [e!:') - e~.k')]J a=1
1 [ (1) (2)]- - WN' kk' +WN' kk'rj' ,
75
![Page 83: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/83.jpg)
The second term gives
Hence, the sum of the two terms give
((k) (k')) S - rj (1) (2)
nCov p • TN' j' TN' j = ( 1) wN' kk' +wN' kk"n , , rj S - ' ,
On the other hand, let B = Hi, i') : i E Pj, i' E Pj', i = i'}. The cardinality of B is
rjj', Thus
[(k) ( (k))] [(k') ((k') )]nEpi. TN',j - E TN',j TN',j' - E TN',j'
76
•
![Page 84: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/84.jpg)
nEp:. [(~t I: (~~~) - ~~k))) X (~t I: (~}j;:) - ~~.kl)))]nr) a=1 iEPJ nr) {3=1 i/EPJI
_.1~ " E. [(~(~) _ d k )) (t(~;) _ d k/ ))]nr2 LJ LJ Pn m ":... <"m ":. ..
) a=1 (i,i/)EB
+_1~ " E. [(t(~) _ ~(k)) (t(~;) _ t(k'))]nr~ LJ LJ P n ":.at .. ":.at ":. ..
) a=1 (i,i/)EA
+_1 ~ "E. [(t(~) _ d k)) (t(~:) _ d kl ))]nr~ LJ LJ P n ":.at ":... ":.{3t ":. ..
) a#={3=1 (i,i/)EB
+_1 ~ "E. [(t(~) _ d k)) (t(k;) _ d kl ))]nr~ LJ LJ P n ":.at ":... ":.{3t ":. ..
) a#={3=1 (i,i/)EA
The first term then yields
_1 ~ " E. [(~(~) _ ~(k)) (t(~I) _ dk/))]nr~ LJ LJ P n at .. ":.at <" ..
) a=1 (i,i/)EB
_1 ~ " E. (~(~)~(k')) _ rjjl (dk)dkl))nr~ LJ LJ Pn O!t O!t 2 ":... ":. ..
) a=1 (i,i/)EB r)
~ t I: [~t~~~)~~:/)]- rj~' (~~.k)~~.kl))nrj a=1 (i,i/)EB S i=1 rj
rjjl ~ [~~ ~(~)~(~/) _ ~(k)~(kl) + ~(k)~(kl)]nr2 LJ s ~ at at a. a. a. a.
) a=1 t=1n
_ r:~, I: ((~k)(~kl))) a=1
rjjl [~ ~ ~ [~(~) _ ~(k)] [t(~I) _ d kl )]]r~ ns LJ~ at a. ":.at ":.a.
) a=1 t=1
+r:r [~~ d~) ~~~/) - (~~.k) ~~.k/))]
rjjl [~ ~ ~ [~(k) _ ~(k)] [~W) _ ~(kl)]]r2 ns LJ~ at a. at a.
) a=1 t=1
+r:r [~~ [~~~) - (~k)] [~~~') _ ~~.kl)]]rjjl [ (1) (2)]r~ WNI,kk' +WNI,kk'
)
The second term gives
_1~ " E. [(~(~) _ d k)) (t(~:) _ d kl ))]n ~ LJ LJ P n at ":... ":.at ":. ..r) a=1 (i,i/)EA
77
![Page 85: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/85.jpg)
The details were skipped because they were done earlier. Therefore, the sum of the
two terms gives
C (T (k) T(k')) ,(1) (1) d(2) (2)n OV P:; N',j' N',j' = ajj'wN',kk' + jj'WN',kk'
Hence the theorem.
Define the following
•
1 n s
H};i,;') (x, y) = ns L ~ I [(h~~)*, h~~')*) ~ (x, y)]a=l,=l
(k,k') . 1 ~ ~ [( (k)* (k')*) ( )]HN',2 (x,y) = (-1) L..J L..J I hai ,hai ~ x,yns s a=l i#i'=l
Moreover, denote the marginal c.d.f of h~~)* by F}k)(x) and note that under HOB it
is equal to F(k)(x) which is independent of i. Also, for every i = 1,"', s, denote by
F?,k')(x)(= F1(k,k')(x,y) under HOB) the marginal c.d.f. of (h~~)*,h~~')*). Finally, for
i =I i' = 1,···,s, let F}:"k')(x)(= FJk,k')(X,y) under HOB) be the marginal c.d.f. of,
(h(k)* h(k')*) D fiai , ai' . e ne
z = 1,2, for k,k' = 1,"',p+q, where J-lk = f~J(k)(u)du. Note that F1(k,k)(x,y)
reduces to the univariate c.d.f. F(k) (x) as x = y almost everywhere. Further let
(kk') ) .Wi=((Wi )k,k'=l, ..·.,P+q, z=1,2;
78
•
![Page 86: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/86.jpg)
E = (8 - 1) D(1) _ ~D(2).,8 8
Theorem 4.2 Under HOB, eN defined in Theorem 3.3, converges in probability (as
n --+ 00) to 1P defined above.
Proof: Note that S~7) = NHc;) (h~~)). Hence, ~~)
follows that
Rewrite w~J kk' as follows,
(1 )wN',kk'
where R2 is the Euclidean plane, and the last term follows from Theorem 5.4.2 of
Puri and Sen (1971). Note that Hc;J --+ H(k)(= p(k)under HOB), where H(k) is the
population combined c.d.f. Hence we have
(1) P s - 1W N' --t -- [WI - W2] .
s
L . (2)et us rewnte wN',kk' as
(2)WN',kk' ~ f [~~~) - ~~k)] [~~~') _~~.k')]
0'=1
79
![Page 87: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/87.jpg)
~ t t t [(~~~) -~~.k») - (~~~:) _~~.k'»)]ns 0'=1 i=1 i'=1
~ t t [(~~:) - ~~.k») _ (~~~') _~~.k'»)]ns 0'=1 i=1
+_1 ~~ [(~(~) _ ~(k») _ (~(~:) _ ~(k'»)]ns 2 LJ LJ at.. at ..
0'=1 i=1
~ ~ [w~k' + w;k']Hence we have
f
Thus the theorem. •Theorem 4.3 n1
/2 [Vec(TN') - eJ converges in probability to a multivariate normal
distribution with null mean vector and dispersion matrix C N ,.
The proof is found in Theorem 4.1 of Sen (1969). Now, construct the residual rank
statistics as we did in the intra-block case. Partition [Vec (TN') - eJ into two com-
ponents, the first component, T N'O, is a qv x 1 vector of the centered linear rank
statistics corresponding to the surrogate variables, and the second component, T~"
is a pv x 1 vector of centered linear rank statistics corresponding to the concomitant
CN,O+ )
C N ,++
variates. Similarly, partition the matrix C N' as follows
(
CN'OOC N ,=
C~I\T'o+
(4.2)
where C N'OO is a qv x qv matrix, C N'O+ is qv x pv and C N'++ is a pv x pv matrix.
Then, from the classical normal theory, (see Theorem 2.5.1 of Anderson (1984)), for
large n we have
Define the qv x 1 residual rank-statistics TN,o, and the covariance matrix of n 1/
2T N,o,
CN,oo, as follows
C C C (-1) C'N'OO - N'O+ N'++ N'O+
80
(4.3)
![Page 88: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/88.jpg)
4.1.1 RIBI For The Validation Set 1*
Consider n* replications of a BIBD consisting of s* blocks of constant size r*(2:: 2) to
which v treatments are applied such that, all the assumptions made i~ the surrogate
set I are satisfied.
Let Si stand for the set of treatments occuring in the ith block, i = 1, ... , s*. For the
a th replicate, the response in the ith block receiving the jth treatment is a stochastic
p+q+l-vectorUoij = (lij,X~~J, ... ,X~~J,Z~~J, ... ,Z~~J)' = (Y;j,X~ij,Z~ij)' of mea-
suremeIits corresponding to the primary variate, p covariates and the q surrogates.
Consider the the same model in (4.1) and the assumptions following it with the differ-
ence that here we regress the primary variate and the surrogates on the concomitant
variates. We basically repeat what we have done in the surrogate set with N' replaced
by N'v(= n*s*). Thus CN' in theorem 6 is now a (p + q + l)v x (p + q + l)v matrix
,CN'v' which we partition as in (4.2). Then, corresponding to (4.3) we have
(4.4)
Partition the (q + l)v x I-vector T"N,v o' and the (q + l)v x (q + l)v matrix C"N,voo as
follows
(
C*C* _ N'vYY
N'vOO - ,CN'vYX
where CN'vYY is v x V, C"N,vYX is v x qv, and C"N,vxx is qv x qv. Also let
T * C* C*(-l) T*N'vO(Y) - N'vYX N'vXX N'vO(X),
C * C* C* C*'N'vYY - N'vYX N'vXX N'vYX·
(4.5)
(4.6)
Note that the dispersion matrix of v x 1 vector n*1/2T"N,vO(Y:X) is given by the v x v
matrix C"N,v(Y:X).
81
![Page 89: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/89.jpg)
4.1.2 Construction of The Test Statistics
For the reasons mentioned in the begining of Section 3.2 we work with TN,o(x)
bTN,o instead of TN,o, where b is a 1 x q- vector subject to the two, restrictions, i)
b'b = 1, and ii) the variance given in (4.3) is minimum. Denote the v x v dispersion
matrix of n I/2T;,Tlo(x) by CN,x'
The next step consists of combining the interblock and intrablock information.
Since the tests developed in Chapte:s 3 and 4 are independent, we will take a weighted
linear combination of the four tests where the weights are inverses of the corresponding
dispersion matrices. Recall that we let Al = B NX' A 2 = BNv(Y:X) developed in
Chapter 3. Let A 3 = CN,x, and let A 4 = CN'v(Y:X) developed in this chapter. Then
let the weights be defined by
•
W · - [A-I +A-I +A-I +A-Ij-IA-I . 1 2 3 4,- I 2 3 4 i' z= , , , (4.7)
Now, noting that the mean of each of the four statistics TNO(x), TNvO(Y:X), TN,o(x),
and TN,vO(Y:X), is 0, consider the linear combination
Assume that ;:. -+ p for positive and finite real p. Then n I / 2To· is conditionally
distribution-free, under the permutational model, with °mean and covariance matrix
and the permutational multivariate central limit theorem of Sen (1983) applies here.
Thus, for large N, Nv , N', and N~, nI/
2To· '" Nv (0, A). Finally consider the overall
test statistic
(4.9)
The exact permutational distribution of LO· can be obtained by enumeration if N,
N v , N', and N~ are small to moderate. Thus for large N, Nv , N', and N~ the null
distribution of LO· by Cochran (1934) is asymptotically a chi-squared with v - 1
degrees of freedom.
82
![Page 90: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/90.jpg)
4.1.3 Asymptotic Non-null Distribution of LO*
For the study of the non-null distribution of LO· we will consider local alternatives
only. Recall that we have found in Section 3.3 that the the covariance .matrices of the
linear rank statistics in both sets I and 1* under HN, converge to the same dispersion
matrices under How. On the other hand, the test statistics had a different mean under
HN than under How. The same results hold for the tests using block totals. To see this
let us first work with the surrogate set I. Write Gi(u) = Gi(x, z), x E Rq, z E RP,
consider the following sequence of alternative hypotheses:
(4.10)
where A = ((>.;k))) stands for an v x q matrix of treatment effects. Thus, under {HN}
y(k)(X) = F(k) [x - n-1/ 2 )..(k)] k = 1 '" qt J" ,.
We assume that F}k\x) are all absolutely continuous. Let j(k)(x) be the density
function corresponding to F(k)(x). Let
(4.11)
where
H(k)(X) = ~ t F?)(x), k = 1,"', q.S i=l
Now expand F(k) [x - n-1/ 2>.;k)] in a Taylor series around x for a fixed x. We will
have
Thus, for large n we have
J1S~l 1.: J(k) [H(k)(x)] dF?)(x)
1: J(k) [F(k)(x)] dF(k) [x - n-1/ 2 )..;k)]
1: J(k) [F(k)(x)] dF(k)(x) +
1: J(k) [F(k)(x)] d{F(k) [x - n- 1/ 2>.;k)] - F(k)(x)}
+o(n-1/2
)
83
![Page 91: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/91.jpg)
~ow integrate by parts the second integral in the last term
(k)f-ln,i [1 J(k)(u)du -100
{F(k) [x _ n- 1/ 2 A)k)] _ F(k)(x)} ~J(k)(F(k)(x))k -00 dx
+ o(n-1/2)
[1 J(k)(u)du + (n-1/2)A)k)100
~J(k)(F(k)(X))f(k)(x)dx+ o(n-1/2)Jo -00 dx
[1 J(k)(u)du + (n- 1/2)A(k)100
~J(k)(F(k)(X))dF(k)(x)+ o(n-1/2)Jo J -00 ·dx
where the last two steps resulted from a Taylor series expansion of F(k) [x - n-1/ 2 A)k)]
around x for a fixed x. Thus, if we let
then from the above we have
Now, let Q~~l = ~ L::=1 ~~:), i = 1,···, s, k = 1,·" ,p + q. Note that Tj;3 =1.. L:iEP Q~k]. Then along the same lines of theorem 5.1 of Sen (1969) [nl/2(Q~kl_T J ), ,
f-l~~l), i = 1,···, s, k = 1,··· ,p + q] has asymptotically a IIlUltinormal distribution
with null mean and dispersion matrix whose entries for i = i' converge to wtkl, and
for i =I- i' converge to W~kl. Thus, by condition (a) of Section 2.2.1 and Theorem 5.1
of Sen (1969) we have
lim E [n 1/ 2 (T(~). - ~(k)) IH N ]n-+oo N ,J ..
84
![Page 92: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/92.jpg)
- E {[:..2: J~~ n1/2(Ji~:l- fal ](k)(U)dU)] I HN}
J te~ 0
B(F(k)) :. 2: )..}k)J iePj
= B(F(k)))..}k)
Let I = ((,Y))) j=l, ...,v,k=l, ...,q , where IY) = B(F(k)))..}k) Then, by Theorem 5.2 of Sen
(1969), {n1/
2 (T;;'~j - ~~k)), k= 1,·,· ,p + q, j = 1,·'·, v} have jointly asymptot
ically a multinormal distribution with mean I and dispersion matrix W. Denote by
WI the version of W pertaining to the surrogate set, I, and by II the version of I in
I. Partition WI as we partitioned CN' in (4.2) and corresponding to (4.3) let
.Tr* _ .Tr .Tr .TA-l).Trl'£' 100 - '£' 100 - '£' [0+ '£' 1++ '£' [0+·
Then for large n
and its dispersion matrix is 1P~x which is the population version of eN-,x.
In a similar fashion, we define {HN} for the validation set 1* such that (()..}k)))
is now an v x (q + 1) matrix. Also, 1[" is defined in the same way as T I with the
difference that k = 1, ... , q+ 1. CN'v f"V 1P[" which is the version of 1P defined in the
validation set 1*. Thus, corresponding to (4.7) and (5.1) we have
Hence for large n*
TNvO(Y:X)
W~"(Y:X)
T * .Tr* .Tr*(-l) T*NvO(Y) - '£' I"YX '£' I"XX NvO(X),
.T.* .Tr* . .Tr*(-l) .Tr*'~ ["YY - '.I!' I"YX '£' I·XX '£' I·YX·
d h d·· . f *1/2 T * . .Tr*an t e IsperSlOn matrIX 0 n NvO(Y:X) IS '.I!' I"(Y:X)'
85
![Page 93: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/93.jpg)
For now, focus only on the surrogate and validation subsets. Consider the weights
Wi defined in (3.9) where now we have Al = .njx, A 2 = .nj.(y:X), A 3 = qi'jx, and
A 4 = qi'j.(y:X). Thus, under {HN }, the mean of n I/
2T o• in (4.9) is
Therefore, from Theorem 5.3 of Sen (1969) and under {HN}, La· has asymptotically a
noncentral chi-squared distribution with v-I degrees of freedom and the noncentrality
parameter
86
•
![Page 94: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/94.jpg)
Chapter 5
General Case of a Vector of
Primary Variates
It is hard to find an ideal surrogate that fully captures the treatment effect on the
true endpoint, because a new treatment may affect the true endpoint through other
potential surrogate variates. Hence, there is a need to consider all possible surrogates
so as to avoid losing information about treatment effects on the primary variates.
Moreover, it is often difficult/costly to record data on all primary variates (as
well as surrogates and covariates) for all experimental units. Consider a vector
of primary response variates (y(l), ... , y(m))', and a vector of surrogate responses
(X(l),'" ,X(q))' that we partition into various subsets on which measurements are
obtained for different numbers of experimental units. Such a partition is done based
upon practical consideration of medical as well as economic factors. This is the genesis
of incomplete multiresponse designs (IMD).
Sen (1994) formulates a general class of IMDs in the following manner. Let
P denote {1,'" , q + m} and consider the totality of 2q+m subsets of P, defined by
(ir , i;) = {ill' .. ,ir ; ii', ... ,i:}, for all possible 1 ::; il < ... < ir ::; m, 1 ::; ii' < ... <
i: ::; q; 1 ::; r ::; m, 1 ::; s ::; q, and include the null sets io or i~ in this system.
Consider a proper subset Po of P, determined by clinical and other factors, such that
87
![Page 95: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/95.jpg)
P.o = {(ir,i;) : (r,s,ir,i:) E Io}, where 10 is the set corresponding to Po. Then the
set of all experimental units is partitioned into the system
where in the subset S(ir,is) the primary response variates y(il),.", y(ir) , and the
surrogate response variates XUi), ... ,XU:) are to be recorded on the experimental
units.
For the subset S(ir,is ) efficient design D(ir,is) (for the treatment as well as design
variates) may be adopted leading to the design sets
Thus, an incomplete multiresponse design that incorporates surrogate endpoints
can be formulated in terms of the dual design sets, responsewise design set and
treatmentwise design set, namely {S(Po),D(Po)}.
Assume that there are II, ... ,hI surrogate subsets of sizes nI, ... , nkll that con
tain measurements on surrgate and concomitant variates, and I;"" ,Ik2 validation
subsets of sizes ni',·· . ,nk2
, with measurements on both primary and surrgate end
points along with covariates. Assume also that a p-vector of covariates is easily
measured on all subjects in the respective subsets.
5.1 Intra And Inter-Block inference
Consider n* replications of a BIBD consisting of s blocks of constant size r(~ 2) to
which v treatments are applied in subset II, such that all the previous assumptions
discussed in Chapter 3 apply. For the a th replicate, the response in the i th block receiv- ~
. h 'th . h' U .. - (Xi~ Xi: Z(I) ZIp))' -mg t e J treatment IS a stoc astIc vector en) - e<ij' ••• , e<ij" e<ij'···, e<ij -
(X~ij' Z~ij)' of measurements corresponding to the p covariates and the surrogate re
sponses in {SOo,i:): (r,s,io,i~) E Po}.
88
![Page 96: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/96.jpg)
The focus here is, on hypothesis testing. Moreover, since a BIBD design is a
connected design, testability of hypotheses is ensured (see Bose (1947)).
Apply the same method developed in Chapter 3 to create a test statistic in each
of the surrogate subsets. Simillarly, obtain a test statistic in each of the validation
sets. Moreover, obtain a test statistic in each of the surrogate as well as validation
subsets by modelling the block totals as explained in Chapter 4.
Thus, for within block inference we conceive of a vector of test statistics (T~ w,"', T~ w)1, kl'
in the surrogate sets, and another vector of test statistics,
(T~. w, .. "T~. w), in the validation sets. Also, for between block inference wel' k 2 '
have a vector of tests (T~ B"", T~ B) in the surrogate sets, and a vector of tests1, kl'
(T~. B,' .. ,T~. B) in the validation subsets. Let n = nl +... + nkl + n~ +... + n k .l' k2' . 2
5.2 Construction of The Test Statistic
The next step consists of combining information from all subsets. Since the tests
developed in the different subsets are independent, we will take a weighted linear
combination of these tests with the weights being the inverses of the corresponding
dispersion matrices. Note that for such a combination to make sense the dimensions
of the tests should be the same. Thus, each test is appropriately transformed so as
to allow a meaningful linear combination.
For simplicity of notation replace the within block tests in the surrogate and
validations subsets by their transformed version (Tl,w,"', Tn,w), and the between
block tests in all subsets by (Tl,B,"', Tn,B). Also replace their corresponding dis
persion matrices by (Al,w,"', An,w), and (Al,B,"" An,B)
Define
for i = 1,' .. ,n, j E {W, B}. Consider the linear combination
89
![Page 97: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/97.jpg)
Assume that ~ -+ Pi for j = 2,' .. ,kt, and !!t -+ PI for 1 = 1,'" ,k2 , where then J nz
p's are all positive and finite real numbers. Then n~/2To* is conditionally distribution-
free under the permutational model, and the permutational multivariate central limit
theorem of Sen (1983) applies here. Hence n~/2To* is multinormal with 0 mean and
covariance matrix
Finally consider the overall test statistic
(5.1 )
The exact permutational distribution of LO* can be obtained by enumeration if n,
is small to moderate, while for large n, the null distribution of LO* by the Cochran
(1934) Theorem is asymptotically a chi-squared with v-I degrees of freedom.
5.3 Asymptotic Non-Null Distribution of LO*
The dispersion matrix of the test developed in the different subsets converge, under
local alternatives to the same limit as under the null. The mean of the test-statistics
however, is shifted under local alternatives. The distribution of LO* under local alter-
natives is a non-central chi-squared. The details follow in the same way as has been
done in Chapters 4 and 5.
Assume that the mean of the respective tests under HN defined in Chapter 4 is
a 2n x 1- vector, (PI w,' ", Pn w, PI B,' .. ,Pn B)', " ,
Define the weights Wi using the population versions of the covariance matrices.
Thus, under {HN }, the mean of n~/2To* is
Therefore, from Theorem 5.3 of Sen (1969) and under {HN }, LO* has asymptotically a
noncentral chi-squared distribution with v-I degrees of freedom and the noncentrality
90
![Page 98: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/98.jpg)
Chapter 6
Properties of The Test Statistics
And an Illustration
6.1 Introduction
This chapter discusses studies the asymptotic relative efficiency (ARE) of the test
statistics LO·, developed in Chapters 2,3, and 4 with respect to the parametric version
constructed in exactly the same way using the least squares estimates lse's of the
treatment effects instead of the linear rank statistics T;;'~.
Also, calculations for the complete block case will be done on the Burroughs
Wellcome data mentioned at the end of Chapter 1 to illustrate the method developed.
6.2 ARE for the Complete Block Case
Consider the same setup in the surrogate set as in Section 2.2.1. The model in
2.1 assumes now the errors to be normally and independently distributed. This
means that fij has now a (p+q)-variate multinormal distribution with null mean and
covariance ~, independently for all i = 1,· .. ,s, j = 1,· .. ,r.
92
![Page 99: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/99.jpg)
The least squares estimates of the treatment effects are defined by
which is the average of the aligned variable Ui~k) over blocks. The alignment is done
by subtracting the block average from Ui~k). Moreover, V N, the (p + q) x (p + q)
matrix defined in Theorem 2.1 has now elements VH' given by
k,k'=1,···,p+q,and
where eN is the matrix of coefficients defined in Theorem 2.1, and TN is the r x
(p +q) matrix of least square estimates corresponding to the different treatments and
variates.
Repeat the same method developed in the nonparametric setup. Partition V N
as in 2.11, then obtain TNo, a q x r matrix of residual 1se's in the surrogate set.
Similarly, obtain TNvO(Y:X)' a 1 x r vector of residual1se's in the validation set. Build
the weighted linear combination TO· as in (2.22). Finally construct the quadratic
form La· using TO· and its covariance matrix.
Whenever the errors have a finite dispersion matrix (~), the 1se's are asymptot
ically normal, and hence the null distribution of La· is asymptotically a chi-squared
with r - 1 degrees of freedom. However, under the sequence of alternative hypothe
ses {HN } in (2.24), T = n-1/
2>., and thus the mean of T undergoes a location shift
leading to the following shift in the mean of T
(6.1)
Let Als be the version similar to A, the covariance of the weighted linear com
bination of the test TO· under {HN }. Note that the limit of the expected value of VN
for large sis r;l~ under both Ho and HN . Thus, construct A as was done in Section
93
![Page 100: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/100.jpg)
2.2.5 but using r-1 eN 0 ~ instead of v. Let /-t?: be the mean ofthe TO· under {HN }.. r
Then LO· has asymptotically a noncentral chi-squared distribution with r-1 degrees
of freedom and noncentrality
( 0·)' (-1)( 0·)~LO· = /-tIs A ls /-t Is .Zs
Thus, the ARE is given by
ARE = ~LO•.
~Lo·Zs
Rewrite /-to· = M1~*' and /-t?s· = M2~*, where M1 and M 2 are both 1 x (2q+ 1), and
~* is the (2q+ 1) x r matrix of the shifts in treatment effects in the surrogate set stacked
over the (q + 1) x r matrix in the validation set. Moreover, let B~ = M~A(-l)Ml'
and let Bg = M~A~;1)M2. Then the ARE becomes
In the univ~riatecase A*, B~, and Bg) are all scalars. The ratio depends only on
the scalars B? and Bg. No common value can be assigned for all possible directions.
In the multivariate case however, the ARE depends upon A*, B~, and Bg) and a
unique solution may not exist. Therefore, one considers the maximum and minimum
possible values over the variation of ~ *.
If cm(G,J) and CM(G,J) are respectively the minimum and maximum charac
teristic roots of B~B~(-l), then by virtue of the well-known Courant theorem on the
extremum of the ratio of two non-negative definite quadratic forms we have
Cm(G,J) ~ e (A*,B~,B~) ~ CM(G,J), VA*.
It may be noted that if the normal scores are used for J, and if G is normal then
B(G(k)) = O"kk1/2
, where O"kk is the (k, kyh entry of ~ corresponding to the kth variate.
v defined in (2.10) is the same as ~. These facts imply that B~B~(-l) = I, and hence
cm ( G, J) = CM(G, J) = 1, and hence the normal scores LO· and L?: are asymptotically
power equivalent.
94
..
![Page 101: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/101.jpg)
It may be noted that in such a multivariate setup there is no generally uniformly
most powerful test of the null hypothesis. The least squares test developed here is
asymptotically equivalent to the likelihood ratio test when the distribution of the
errors is normal. But any departure from normality makes it nonrobust. Moreover,
the least squares test is not invariant to monotone transformation of the data, e.g.,
the log transformation. This is not the case with the rank procedure developed in
this work which remains invariant under coordinate-wise strictly monotone transfor-
mation, although it is not affine invariant. Therfore, there may exist some situations
when rank statistics fair much better especially with heavy tails when the distribution
is strictly non-normal.
The same mechanism can be applied to the balanced incomplete block design to
find the ARE and the bounds on it. Consider the same model in (3.1). Assume that
the errors fOiij have a p + q multinormal distribution with null mean and covariance
!: which is assumed to be nonnegative definte.
The intra as well as the inter-block tests developed in Chapters 4 and 5 are re
placed here with the least square version of these test. Thus, for intra-block inference,
the v x (p + q) matrix of least square estimates, TN,W has entries given by
which is the average of the aligned variable Ul7J over blocks containing the ph treat
ment and over all replicates. The alignment is done by subtracting the block average
from Ul7J. Moreover, v~) defined in Theorem 3.1 has now entries defined by
(1) = _1 ~~ '" [U(~~* _ U(~)*] [U(~~)* - U(~/)*]vN,kk' L.J L.J L.J W) Oit. Oit) OiL'
nsr 0i=1 i=1 jESi
and vW defined in Theorem 3.1 has entries defined by
k, k' = 1, . ". ,p + q,
V(2) ,= ~~ ~ [U(~)* - U(k)*] [U(~/)* - U(k')*] k, k' = 1,·· . ,p + q",N,kk L.J L.J OiL Oi.. OiL Oi..'
ns 0i=1 i=1
where the dots in the subscripts indicate averages over the dotted subscript. Then,
95
![Page 102: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/102.jpg)
wjth A (1) and A (2) defined as Theorem 3.1 we have
BN is a (p + q)v x (p + q)v matrix, and
nV[Vec(TN,w)] = B N.
The Expected value of vW for large n converges to (r~I)~. Similarly, The
limit of the mean of V~2) for large n is (S~I) (r~). Thus if A = (r~l) A(I) and
B = r (1 - ~) A(2), then the limit of the mean of B N for large n is (A + B) @ ~.
This limit can be now used to bulid the noncentrality parameter as was done in
Section 3.3 or in Section 4.1.3.
On the other hand, for inter-block inference, consider the model in (4.1) and
note that LjES; €aij has covariancer~. Let TN,B be the v x (p + q) matrix of least
square estimates with entries given by
A(k) 1 ~" (k) .TN' J" = - LJ LJ hai , J = 1,"', v, k = 1,'" ,p + q.
, nr" "J a=1 tEPj
The covariance matrix of f;;'~j is the same as that defined in Theorem 4.1 with
the ~~~) replaced by h~~). In such a case the limit of the mean of the matrix W~~
defined in Theorem 4.1 converges for large n to (S~I) (r~), and the limit of the
expected value of W~! converges for large n to (n~l) (;~) .
Define D = ~D(1) and E = r(n-l)D(2) where D(I) and D(2) are defined ins' ns'
Theorem 4.1. The limit of the expected value of the least square version of eN'
defined in Theorem 4.1 converges for large n to (D +E) @~. This limit can be used
to compute the noncentrality parameter under HN as done in Section 4.1.3 using the
least squares estimates. The ARE will be the ratio of the noncentrality parameters
as before and the same conclusions arrived at in the complete case apply here too.
96
..
![Page 103: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/103.jpg)
6.3 Example: The Effect of Zidovudine on Sur
vival in Patients with AIDS
The example uses data from a double blind placebo-controlled clinical trial (study
02) conducted by Burroughs-Wellcome in the year 1986 to evaluate the effect of
Zidovudine (ZDV) on survival in patients with acquired immuno-deficiency syndrome
(AIDS). A total of 281 patients were enrolled in the study; 144 patients were assigned
to the ZDV group and 137 patients were assigned to the placebo group (See Tsiatis,
Gruttola and Wulfsohn (1995)).
The primary response is the length of survival, but because there is a need to
evaluate new therapies in a shorter period of time, surrogate outcomes are of interest.
Entry into the study was staggered over a period of five months between 19 February
1986 and 7 July 1986. The study was stopped after seven months because of a
significant decrease in mortality in patients treated with ZDV.
At that time Burroughs Wellcome conducted a second study (study 08) where
patients receiving placebo were then offered ZDV and followed for clinical outcomes.
Entry into study 08 was staggered over time between 16 Sept'ember 1986 and 6 April
1987.
The survival time for this illustration was calculated differently for patients on
placebo and those on ZDV. (Tsiatis et al. (1995)). Thanks to Dr. Michael Wulfsohn
for sending me the data sets and the SAS program that calculates the survival time.
The SAS code that calculates the survival time is attached in the Appendix. A
description of the SAS code that calulates the survival time follows.
Since most of the placebo patients were dying of AIDS after approximately 80
weeks of entry into study 02, their sample size became too small to allow a powerful
comparison with the ZDV group. Hence the survival time was truncated at 80 weeks
of entry into study 02. Let a denote the time of entry into study 02 plus 80 weeks.
If the death date, the date of study termination, and the last date the patient
97
![Page 104: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/104.jpg)
w;as known to be alive were misssing then the survival time was calculated as the
difference between 16 September 1986 and entry into study 02 irrespective of the
treatment group.
Let m stand for the minimum of entry into study 08 and 16 October 1986 where
the minimum is taken over nonmissing values of entry into study 08. By October
16 nearly 90% of placebo patients switched to ZDV. Let M denote the maximum of
the death date, the date off-study, the last date the patient was known to be alive,
and entry into study 08, where the maximum is taken over the nomissing values of
these variables. The date off-study for a patient is the date when that patient was
no longer followed because of AIDS complications. Patients died shortly afterwards.
For placebo patients whose death date was not missing and was less than or
equal to m, the survival time is the time from entry into study 02 to the death date.
Otherwise, if either the death date was missing or was greater than m, then the
survival time was calculated as the time from entry into study 02 to the minimum
of entry into study 08, M, and 16 October 1986. Placebo patients were censored if
either their death date was missing or if it exceeded entry into study 08 or 16 October
1986.
For patients receiving ZDV whose death date was not missing and was less than
a, the survival time is the time from entry into study 02 to the death date. Otherwise,
if either their death date was missing or was greater than a, then the survival time
is the time from entry into study 02 to the minimum of M and a. Patients on ZDV
were censored if either their death date was missing or if it exceeded a.
As mentioned in Section 1.2.1 both CD4 and CD8 cell counts (counts/Liter) have
been proposed as surrogates for survival in AIDS patients. The CD4-lymphocyte
count has been proposed as a potential surrogate for human immunodefficiency virus
(HIV), the cause of AIDS, in many studies because of its observed correlation with
the clinical outcome, survival. CD4 counts decrease soon after infection with HIV
because the virus attacks the immune system. They continue to decline during the
98
•
![Page 105: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/105.jpg)
<l.9ymptomatic stage of the disease until full-blown AIDS develops and terminates its
victims.
Patients with advanced disease have lower CD4 counts than those in the early
stage of infection. In this study CD4 counts were determined prior to treatment and
aproximately every four weeks during therapy. The initial CD4 measurement was
used in this example. Moreover, as mentioned in the Section 1.2.1, Baccheti et al.
(1992) found that CD8 counts add predictive power to the CD4 counts and thus would
produce more valid and useful surrogates than the CD4 counts alone. The CD8 cells
act both as deterents of virus multiplication by a mechanism that is not very well
understood, and as accelerators of AIDS by expressing CD38, CD57, or HLA-DR. (
See Romagnani (1994) for a review of CD4 and CD8 cells in the progression of AIDS.
Thus, CD8 count is considered as another surrogate in this example, and again only
the initial measurement was used.
At the time of this analysis covariate data were not available. Patients were
assigned random values for age generated as psuedo-random numbers independently
and uniformly distributed between 18 and 55.
To illustrate the theory an artificial responsewise as well as treatmentwise design
were imposed. With regard to the responsewise design, only those patients who were
not censored were included in the validation set. The validation set data consist of
the survival time, CD4 count, CDS count, and age. There were 24 patients on placebo
that were not censored. These were matched randomly with 24 out of 55 patients
who were not censored and were on ZDV. Thus a randomized block treatmentwise
design with 24 blocks was formed for the validation set.
The surrogate set data consisted of the variables CD4 count, CDS count, and
age. There were 113 patients on placebo and 89 patients on ZDV who were censored.
Twenty four out of 55 patients who were not censored and were on ZDV were added to
the 89 patients re'sulting in 113 patients on ZDV who were censored. These were ran
domly matched with the 113 patients on placebo who were censored. A randomized
99
![Page 106: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/106.jpg)
b~ock treatmentwise design with 113 blocks was formed for the surrogate set.
Descriptive statistics for the overall data, surrogate set, and validation set IS
given in Table 1 in the Appendix. In the validation set, the overall correlation between
the survival time and the surrogate responses is given in Table 2 in the Appendix.
Moreover, the correlation between survival and the surrogates in each treatment group
in the validation set is given in Tables 3 and 4 in the Appendix. In Tables 3 and 4 the
negative correlation is due to chance fluctuations. The Spearman rank correlation
did not give negative values.
The mean of the survival time for placebo group is 109 days, whereas for the
ZDV group it is 360 days. A two sample t-test to test the equality of means of the
survival time of the placebo group and that of the ZDV group in the validation set
gave a p-value of 0.0001. The normal approximation for the Wilcoxon rank sum
test gave a p-value of 0.0001. This shows that the data support a more beneficial
treatment effect over placebo.
In the surrogate subset, CD4 count, CDS count, and age were aligned by sub
tracting their block averages, then ranked and Wilcoxon scores were assigned to the
ranks. A 2 x 3 matrix of linear rank statistics was constructed by taking the mean
over blocks of the scores belonging to the ph treatment. Each of these statistics
were centered by subtracting its expected value under the null hypothesis and the
permutational model.
The scores were aligned by subtracting the block score average. The estimated
covariance matrix VN defined in Theorem 2.1 was computed yielding the following
3 x 3 matrix of sums of squares and cross products of the aligned scores summed
over all blocks and treatments and divided by 113(= s( r - 1)), where s = 113 is the
number of blocks, and r = 2 is the number of treatments.
The matrix of coefficients eN for the surrogate set defined in Theorem 2.1 IS
100
![Page 107: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/107.jpg)
g~ven by
0.004424779
-0.004424779
-0.004424779 },
0.004424779
the entries being ±1/226, where sr = 226 is the cardinality of the surrogate set. The
estimated covariance matrix of the surrogates and age scores is
0.16519 0.076239 0.013159
VN = 0.076239 0.16520 0.010146
0.013159 0.010146 0.16502
where the columns and rows correspond to CD4, CD8, and age respectively. Next,
partition VN and calculate the residual statistics, TN-a defined in 2.13
* _ {-0.023765 0.0237655}T No -
-0.002536 0.0025356
where the rows correspond to the CD4 and CD8 rank statistics, respectively, and the
columns correspond to the treatments, placebo and ZDV.
A similar manipulation was done in the validation set. Thus a 2 x 4 matrix of
linear rank statistics was formed. In addition, residuals were calculated by regressing
the survival time on the surrogates after regressing both the survival time and sur
rogates on age as described in Section 2.2.2. This resulted in a 1 x· 2 vector of linear
rank statistics of survival time corresponding to placebo and ZDV.
Weights were calculated as in (2.21), the weighted linear combination of the
test statistics, TO·, constructed as in (2.22), and the quadratic form La· in (2.23)
computed. The following table gives the values of La· for both the parametric and
nonparametric versions with and without surrogates.
Least Squares Test
Rank Based Test
With Surrogates
70
125
101
Without Surrogates
346
388
![Page 108: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/108.jpg)
The value of the rank based Lo· was 125 when surrogates were incorporated, which
obviously leads to rejection of the null hypothesis of no treatment effect on the primary
endpoint, survival, when compared with the chi-squared distribution with 1 degree
of freedom.
Separate analyses were done using the validation set but ignoring surrogates in
order to compare the with and without surrogate analyses. The value of the quadratic
form was 388 which also leads to rejection of Ho : Tj = 0 j = 1"", r. But the
significance level is smaller when surrogates are ignored. Thus, if significance level is
used as a criterion to see what was gained by using surrogates, this example shows
that one gets a more conservative significance level by using surrogates thus guarding
against faulty conclusions.
Parametric analyses were done using the least square estimates for comparison
with the nonparametric analyses. The same surrogate and validation sets were used.
It may be noted that the Shapiro-Wilk test of normality for the survival time, CD4
count, and CD8 count led to rejection of the null hypothesis that the distribution of
each of these variables is normal (p-value=O.OOOl). Even after taking the logarithm
of these variables, the Shapiro-Wilk test gave a p-value less' than 0.0001. Analyses
were carried on the logged variables. The value of the quadratic form in (2.23) for
the with surrogate analysis was 70.15, and for the without surrogate analysis it was
345.57.
Note that both the parametric and nonparametric methods led to very high
values of the quadratic form when surrogates were ignored. The small significance
level in such a case may lead researchers to optimistic conclusions that may be faulty.
6.4 Further Research
The method developed in this work handles only continuous data. However, it is
common for researchers to be interested in using discrete response variates. If the
102
![Page 109: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/109.jpg)
sample size is fairly large, one may repeat the same analyses described in this work for
each category and compare categories accordingly. However, if there isn't sufficient
data in each category this appraoch may not work. Further research is needed to
incorporate discrete data in the methodology expounded in this study.
The linear models considered in (2.1), (3.1), and (4.1) do not accomodate inter
action between blocks and treatments. The method discussed here can be extended
to cover such a possibility.
Moreover, the replicates may not be independent in the balanced incomplete
block case. This may happen if the replicates are measured on the same person
over time. Appropriate changes in the covariance matrices should be made to allow
dependent replicates.
A more involved problem is that of estimation of conditional quantiles of both the
surrogate and primary variates (conditional on the covariates). Sen (1994) discussed
such an estimation problem. It is more complex than the hypothesis testing problem.
This has been touched upon in the introductory Chapter. More work in that direction
is needed.
This procedure does not handle missing data. It would be worthwhile to incor
porate missing cells in such a methodology. Rubin (1976) formulates three conditions
on the process that causes missing data which enable the statistician to ignore this
process when making inferences about the distribution of the data. If () is the param
eter of the data, and ¢ is the parameter of the conditional distribution of the missing
data indicator given the data, then the missing data mechanism is ignorable if the
following conditons hold:
a) the data are missing at random ifmissingness depends on the data only through
observed values
b) The observed data are observed at random if for each possible value of the missing
data and the parameter ¢, the conditional probability of the observed pattern
103
![Page 110: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/110.jpg)
of missing data, given the missing data and the observed data, is the same for
all possible values of the observed data.
c) The parameters () and </> are distinct in the sense that their joint p?-rameter space
factorizes into a </>-space and a ()-space.
Moreover, Rubin (1976) defines data to be missing completely at random if the
indicator for missing data is independent of the observed data. However, sometimes
the process that caused missing data cannot be ignored for example when the missing
pattern depends upon the observed data.
A nice account of the parametric approach to missing data is found in Rubin
(1976), Little and Rubin (1987), Little (1993), and Little (1994) among others. Their
approach is based on maximum likelihood techniques and the assumption of normality
is essential in the models they select. However, as the example shows that even when
data are transformed, normality may not hold. This leads the researcher to investigate
other, more widely applicable nonparametric methods.
The nonparametric approach to missing variables in multi-sample rank permu
tation tests for MANOVA and MANOCOVA is developed in: Servy and Sen (1987).
The multi-sample rank permutation tests for the complete data case are extended to
random missing patterns. Define the scores of the observed variables as they were
defined in the complete case (See Puri and Sen (1971)), disregarding the blanks, and
filling up any blank with the mean of the scores of the non-missing values of the
variable it belongs to. They show that under a random missing scheme, the rank
permutation principle holds and yields conditionally distribution-free test statistics.
They study in detail the asymptotic relative efficiency of the proposed test statistics.
An extention of their methodology to the block case is planned to make this study
gain wide applicability.
104
![Page 111: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/111.jpg)
* ENTRY02= DATE OF ENTRY INTO STUDY 02 (MMDDYY)
* ENTRY08= DATE OF ENTRY INTO STUDY 08. ALL PATIENTS ENTERING THIS
* STUDY WERE PUT ON RETROVIR THERAPY. (MMDDYY)
* DDEATH = DATE OF DEATH (MMDDYY)
* DTERM = DATE OFF STUDY (MMDDYY)
* DALIVE = DATE PATIENT WAS LAST KNOWN TO BE ALIVE (MMDDYY)
**********************************************************************;
data D;
set D;
last_al='160CT86'd;
last_azt=entry02+7*80;
if ddeath ne . then death=1;else death=O;
*placebo;
if trt=O then do;
cens=(death=1 and .<ddeath<=min(entry08,last_al));
if cens=l then survt=ddeath-entry02;
else survt=min(entry08,max(dterm,dalive,ddeath,entry08),last_al)-entry02;
end;
if trt=O and ddeath=. and dalive=. and dterm=. then do;
survt='16Sep86'd-entry02;
end;
*AZT;
if trt=l then do;
cens=(death=l) and ddeath<=last_azt;
if cens=1 then survt=ddeath-entry02;
106
![Page 112: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/112.jpg)
else survt=min(max(dterm,dalive,entry08,ddeath),last_azt)-entry02;
end;
if trt=1 and ddeath=. and dalive=. and dterm=. then do;
survt='16Sep86'd-entry02;
end;
107
![Page 113: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/113.jpg)
Table 1
Means and Standard Deviations
Survival
(days)
CD4
(cells/L)
CD8
(cells/L)
Age
(years)
Surrogate
(N=226)
276
(189)
130
(129)
556
(351)
37
(11)
Validation
(N=48)
235
(164)
75
(99)
492
(409)
38
(11)
108
Overall
(N=274)
268
(185)
120
(126)
545
(362)
37
(11)
•
![Page 114: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/114.jpg)
Table 2
Overall Correlation Between
Survival Time and Surrogates
in The Validation Set (N=48)
Pearson Correlation
(p-value)
Spearman Correlation
(p-value)
Survival CD8 Survival CD8
CD4
CD8
0.0146
(0.9216)
0.193
(0.1885)
0.634
(0.0001)
109
CD4
CD8
0.0137
(0.9259)
0.2609
(0.0732)
0.5868
(0.0001)
![Page 115: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/115.jpg)
Table 3
Correlation Between Survival
Time and Surrogates in The
Validation Set For the Placebo Group
(N=24)
\.
Pearson Correlation
(p-value)
Spearman Correlation
(p-value)
CD4
CD8
Survival
-0.2688
(0.2041)
0.2925
(0.1654)
CD8
0.3562
(0.0876)
no
CD4
CD8
Survival
0.0461
(0.8306)
0.2301
(0.2795)
CD8
0.7149
(0.0001)•
..
![Page 116: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/116.jpg)
Table 4
Correlation Between Survival
Time and Surrogates in The
Validation Set For the ZDV Group
(N=24)
Pearson Correlation
(p-value)
Spearman Correlation
(p-value)
CD4
CD8
Survival
-0.0207
(0.9237)
0.0619
(0.7735)
CD8
0.7435
(0.0001)
111
CD4
CD8
Survival
0.1818
(0.3952)
0.2057
(0.3349)
CD8
0.5127
(0.0104)
![Page 117: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/117.jpg)
Bibliography
Anderson, T. (1984). An Introduction to Multivariate Statistical Analysis. second edn.John Wiley, New York.
Baccheti, P., Moss, A. R., Andrews, J. C. and Jacobson, M. (1992). Early predictors of survival in symptomatic HIV- infected persons treated with high-dosezidovudine. Journal of Acquired Immune Deficiency Syndromes 5, 732-736.
Bose, R. C. (1947). Proceedings of the 34th Indian Science Congress 11, 1-25.
Carroll, R. (1989). Covariance analysis in Generalized Linear Measurement ErrorModels. Statistics in Medicine 8, 1075-1093.
Carroll, R. and Ruppert, D. (1988). Transformation and Weighting in Regression.London: Chapman and Hall.
Choi, S., Lagakos, S. W., Schooley, R. T. and Volberding, P. (1993). CD4+ lymphocytes are an incomplete surrogate marker for clinical progression in personswith asymptomatic HIV infection taking zidovudine. Annals of Internal Medicine118, 674-680.
Cochran, W. G. (1934). The distribution of quadratic forms in a normal system,with application to the analysis of covariance.. Proceedings of the CambridgePhilosophical Society 30, 178-191.
Cox, D. R. (1972). Regression models and life tables (with discussion). Journal of theRoyal Statistical Society Ser. B 34, 187-220.
Cox, D. R. and Hinkley, D. V. (1974). Theoretical Statistics. London: Chapman andHall.
Ellenberg, S. S. (1991). Surrogate endpoints in clinical trials. British Medical Journal302,63-64.
Ellenberg, S. S. and Hamilton, J. M. (1989). Surrogate endpoints in clinical trials:Cancer. Statistics in Medicine 8, 405-413.
Fleming, T. R. (1992). Evaluating therapeutic interventions: Some issues and experiences. Statistical Science 7, 428-456.
Fuller, W. (1987). Measurement Error Models. John Wiley, New York.
112
•
...
![Page 118: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/118.jpg)
--. -Gangopadhyay, A. K. and Sen, P. (1992a). Nonparametric statistics and related topics.
A.K.M.E. Saleh, North Holland, Amsterdam.
Gangopadhyay, A. K. and Sen, P. (1992b). Nonparametric statistics and related topics.A.K.M.E. Saleh, North Holland, Amsterdam.
Gruttola, V., Wulfsohn, M., Fischl, M. and Tsiatis, A. (1993). Modeling the relationship between survival and CD4 lymphocytes in patients with AIDS and AIDSRelated Complex. Journal of Acquired Immune Deficiency Syndromes 6, 359365.
Hillis, A. and Seigel, D. (1989). Surrogate endpoints in clinical trials: Opthalmologicdisorders. Statistics in Medicine 8, 427-430.
Kosorok, M. (1991). A variance reduction method for combining multivariate failure times in order to increase power in clinical trials. PhD thesis. University ofWashington.
Kupper, 1. (1984). Effects of the use of unreliable surrogate variables on the validity ofepidemiologic research studies. American Journal of Epidemiology 120,643-648.
Lagakos, S. and Hoth, D. (1992). Surrogate markers in AIDS: where are we? whereare we going? Annals of Internal Medicine 116, 599-601.
Lin, D. (1991). Nonparametric sequential testing in clinical trials with incompletemultivariate observations. Biometrika 78, 123-131.
Little, R. J. A. (1993). Pattern-mixture models for multivariate incomplete data.Journal of the American Statistical Association 88, 125-34.
Little, R. J. A. (1994). A class of pattern-mixture models for normal incomplete data.Biometrika 81, 471-83.
Little, R. J. and Rubin, D. B. (1987). Statistical analysis and missing data. JohnWiley, New York.
Louis, T. A. (1982). Finding the observed information using the EM algorithm. Journal of the Royal Statistical Society Ser. B pp. 226-233.
Machado, S. G., Gail, M. H. and Ellenberg, S. S. (1990). On the use of laboratorymarkers as surrogates for clinical endpoints in the evaluation of treatment forHIV infection. Journal of Acquired Immune Deficiency Syndromes 3, 1065-1073.
McCullagh, P. and NeIder, J. (1989). Generalized Linear Models. London: Chapmanand Hall.
Monahan, I. P. (1961). Incomplete-variable designs in multivariate experiments. PhDthesis. Virginia Polytechnic Institute, Blacksburg, Virginia.
113
![Page 119: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/119.jpg)
Pepe, M. S. (1992). Inference using surrogate outcome data and a validation sample.Biometrika 79, 355-365.
Pepe, M. S. and Fleming, T. (1991). A nonparametric method for dealing withmismeasured covariate data. Journal of the American Statistical Association86, 108-113.
Piedbois, et al. (1992). Modulation of fluorouracil by leucovorin in patients with ad- ..vanced colorectal cancer: Evidence in terms of response rate. Journal of ClinicalOncology 10, 896-903.
Prentice, R. L. (1989). Surrogate endpoints in clinical trials: Definition and opera- ,tional criteria. Statistics in Medicine 8, 431-440.
Puri, M. 1. and Sen, P. (1971). Nonparametric Methods in Multivariate Analysis.John Wiley, New York.
Puri, M. 1. and Sen, P. K. (1985). Nonparamefric Methods in General Linear Models.John Wiley, New York.
Rao, C. R. (1947). General methods of analysis for incomplete block designs. Journalof the American Statistical Association 42, 541-561.
Romagnani, S. (1994). Immunologic and clinical aspects of human immunodeficiencyvirus infection. Allergy 49, 685-695.
Roy, S., Gnanadesikan, R. and Srivastava, J. (1971). Analysis and Design of CertainQuantitative Multiresponse Experiments. Pergamon Press, New York.
Rubin, D. B. (1976). Inference and missing data. Biometrika 63,581-592.
Satten, G. and Kupper, 1. (1993). Inferences about exposure-disease associatios usingprobability of exposure values. Journal of the American Statistical Association88, 200-208.
Scheffe, H. (1959). The Analysis of Variance. John Wiley, New York.
Sen, P. K. (1969). On a class of aligned rank order tests for multiresponse experimentsin some incomplete block designs; See also Annals of Mathematical Statistics,42 1971. Mimeo Series 607. Department of Biostatistics, University of NorthCarolina.
Sen, P. K. (1981). Sequential Nonparametrics. John Wiley, New York.
Sen, P. K. (1983). On permutational central limit theorems for general multivariatelinear rank statistics. Sankhya A 45, 141-149.
Sen, P. K. (1992). Incomplete multiresponse designs and surrogate endpoints in clinh
ical trials. Mimeo Series 2106. Department of Biostatistics, University of NorthCarolina.
114
•
![Page 120: pdfs.semanticscholar.org · ~well as surrogate responses taken along with concomitant variates. On the other hand, the set of surrogate samples consists of measurements on a subset](https://reader034.fdocuments.us/reader034/viewer/2022042205/5ea7b5204109341eb3561f6c/html5/thumbnails/120.jpg)
Sen, P. K. (1994). Incomplete multiresponse designs and surrogaie endpoints in clinical trials. Journal of Statistical Planning and Inference 42, 161-186.
Servy, E. C. and Sen, P. K. (1987). Missing variables in multi-sample rank permutation tests for manova and manocova. Sankhya A 49, 78-95.
Srivastava, J. N. (1966). Incomplete multiresponse designs. Sankhya A 28, 377-388.
Srivastava, J. N. (1968). On a general class of designs for multiresponse experiments.Annals of Mathematical Statistics 39, 1825-1843.
Tsiatis, A., Gruttola, V. and Wulfsohn, M. (1995). Modeling the relationship of survival to longitudinal data measured with error. applications to survival and cd4counts in patients with aids." Journal of the American Statistical Association90,27-37.
Wittes, J., Lakatos, E. and Probstfield, J. (1989). Surrogate endpoints in clinicaltrials: Cardiovascular Diseases. Statistics in Medicine 8, 415-425.
Yates, F. (1940). The recovery of interblock information in balanced incomplete blockdesigns. Annals of Eugenics 10, 317-325.
115