Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (2024)

Article Navigation

Volume 64 Issue 1 January 2015

Article Contents

  • Summary

  • 1. Introduction

  • 2. Model and missing mark data

  • 3. Estimation procedure with missing marks

  • 4. Testing of mark-specific vaccine efficacy

  • 5. Simulation study

  • 6. Analysis of the RV144 Thai trial

  • Acknowledgements

  • References

  • Appendix A: symptotic results

  • A.1. Proof of theorem 1

  • < Previous
  • Next >

Journal Article

,

Peter B. Gilbert

University of Washington

, Seattle,

USA

Fred Hutchinson Cancer Research Center

, Seattle,

USA

Address for correspondence: Peter B. Gilbert, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, PO Box 19024, Seattle, WA 98109, USA. E-mail: pgilbert@scharp.org

Search for other works by this author on:

Oxford Academic

Yanqing Sun

University of North Carolina at Charlotte

,

USA

Search for other works by this author on:

Oxford Academic

Journal of the Royal Statistical Society Series C: Applied Statistics, Volume 64, Issue 1, January 2015, Pages 49–73, https://doi.org/10.1111/rssc.12067

Published:

03 July 2014

Article history

Received:

01 February 2013

Accepted:

01 March 2014

Published:

03 July 2014

  • PDF
  • Split View
  • Views
    • Article contents
    • Figures & tables
    • Video
    • Audio
    • Supplementary Data
  • Cite

    Cite

    Peter B. Gilbert, Yanqing Sun, Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials, Journal of the Royal Statistical Society Series C: Applied Statistics, Volume 64, Issue 1, January 2015, Pages 49–73, https://doi.org/10.1111/rssc.12067

    Close

Search

Close

Search

Advanced Search

Search Menu

Summary

The paper develops hypothesis testing procedures for the stratified mark-specific proportional hazards model in the presence of missing marks. The motivating application is preventive human immunodeficiency virus (HIV) vaccine efficacy trials, where the mark is the genetic distance of an infecting HIV sequence to an HIV sequence represented inside the vaccine. The test statistics are constructed on the basis of two-stage efficient estimators, which utilize auxiliary predictors of the missing marks. The asymptotic properties and finite sample performances of the testing procedures are investigated, demonstrating double robustness and effectiveness of the predictive auxiliaries to recover efficiency. The methods are applied to the RV144 vaccine trial.

Augmented inverse probability weighting, Auxiliary marks, Competing risks failure time data, Genetic data, Proportional hazards model, Semiparametric model

1. Introduction

The primary objective of a preventive human immunodeficiency virus (HIV) vaccine efficacy trial is to assess vaccine efficacy VE to prevent HIV infection, where typically VE is defined as 1 minus the hazard ratio (vaccine/placebo) of HIV infection diagnosis. However, the great genetic variability of HIV poses a central challenge to developing a highly efficacious vaccine (Fauci et al., 2008). The trial population is exposed to many HIV genotypes but the vaccine contains only a few, and the vaccine is less likely to protect against HIVs with greater genetic distance from the sequences inside the vaccine (Gilbert et al., 1999). The trial has objectives to assess whether and how the vaccine impacts the infection rate with any HIV genotype and whether and how the vaccine effect varies by HIV genotype; assessment of this objective has been named ‘sieve analysis’ (Gilbert et al., 1998). Gilbert et al. (2008), Sun et al. (2009) and Sun and Gilbert (2012) developed sieve analysis methods using the competing risks failure time framework (Prentice et al., 1978), which attach a continuous ‘mark’ variable to HIV-infected subjects that measures the genetic distance of an infecting HIV sequence to a sequence inside the vaccine. The goal of the sieve analysis methods is evaluation of mark-specific vaccine efficacy, which here is defined as 1 minus the mark-specific hazard ratio (vaccine/placebo) of infection. Beyond HIV, the methods apply generally to any preventative vaccine efficacy trial for which the pathogen targeted by the vaccine is genetically diverse, which includes influenza, malaria, tuberculosis, dengue, streptococcus pneumoniae, human papilloma virus and hepatitis C virus.

Gilbert et al. (2008) and Sun et al. (2009) assumed no missing mark data in infected subjects, whereas Sun and Gilbert (2012) allowed marks missing at random. In practice there are missing marks; for example in the Vax004 trial 32 of 368 infected subjects had no HIV sequence data (Gilbert et al., 2008), owing to drop-out or to inability of the HIV sequencing technology to measure the infecting HIV sequence, and in the ‘Step’ trial 22 of 88 infected subjects had no HIV sequence data (Rolland et al., 2011). Whereas it is of scientific interest to evaluate a mark defined on the basis of the earliest available HIV sequence, a mark of particular scientific interest is defined on the basis of an HIV sequence measured near the time of acquisition, which is missing in a much larger fraction of infected subjects owing to the periodic (typically 6-monthly) diagnostic tests for HIV infection. Specifically, HIV sequences are measured from the earliest available post-infection blood sample, and a ‘near acquisition’ or ‘early’ sample may be defined as a sample that has been documented to be sufficiently near acquisition. In the Step trial, only 23 of the 66 infected subjects with sequence data had an early mark measured, defined as sampling within 3 weeks. Sun and Gilbert (2012) have provided details on the HIV testing algorithm that is used to define an early mark.

Sun and Gilbert (2012) is the only reference on sieve analysis that accommodates missing continuous marks. It develops two valid estimation approaches based on the stratified mark-specific proportional hazards model. The first uses inverse probability weighting (IPW) of the complete-case estimator, which leverages auxiliary predictors of whether the mark is observed, whereas the second, adapting Robins et al. (1994), augments the IPW complete-case estimator with auxiliary predictors of the missing marks. Sun and Gilbert (2012) restricted attention to estimation methods, and this paper is a sequel that develops corresponding inferential or hypothesis testing methods based on the augmented IPW estimator. An important new component of this work compared with the previous work is to centre it on the sieve analysis of the RV144 Thai trial, which recently delivered the landmark result that a prime boost HIV vaccine appeared to provide partial protection against HIV infection (estimated VE =31%; 95% confidence interval 1–51%; p = 0.04; Rerks-Ngarm et al. (2009)). This result has stimulated intense interest in sieve analysis, for two reasons. First, there is controversy about whether the vaccine is really partially working versus a false positive result (Gilbert et al., 2011), and the sieve analysis of HIV sequences can help to resolve this question. In particular, if evidence is found that the vaccine efficacy declines with genetic distance, and the distance is defined on the basis of known parts of HIV that contain putatively protective antibody epitopes, then an interpretation of real vaccine efficacy is supported. Secondly, the HIV vaccine field is grappling with how to modify the tested vaccine to increase its potential vaccine efficacy for the next efficacy trial, and understanding the relationship between vaccine efficacy and the genetic distance provides direct guidance on which HIV sequences to put inside the next generation vaccines.

This paper is organized as follows. Notation, assumptions and the stratified mark-specific proportional hazards model are introduced in Section 2. Background on the estimation procedures that are needed for the testing procedures are described in Section 3. The testing procedures are developed, and asymptotic properties described, in Section 4. The finite sample performances of the tests are evaluated via simulations in Section 5. The application to the Thai trial is given in Section 6, and the asymptotic results and their proofs are placed in Appendix A.

The programs that were used to analyse the data can be obtained from

https://academic.oup.com/jrsssc/issue/

2. Model and missing mark data

2.1. Stratified mark-specific proportional hazards model

Let T be the failure time, V a continuous mark variable with bounded support [0,1] and Z(t) a possibly time-dependent p-dimensional covariate. The mark V is observable only when T is observed. Suppose that the conditional mark-specific hazard function at time t given the covariate history Z(s), for st, depends on the current value Z(t) only. We consider the stratified mark-specific proportional hazards model

λk{t,v|z(t)}=λ0k(t,v)exp{β(v)Tz(t)},k=1,,K,

(1)

where λk{t,v|z(t)} is the conditional mark-specific hazard function given covariate z(t) for an individual in the kth stratum, λ0k(·,v)=λk{t,v|z(t)=0} is the unspecified baseline hazard function for the kth stratum, β(v) is the p-dimensional unknown regression coefficient function of v and K is the number of strata. Model (1) allows different baseline functions for different strata and flexibly allows for arbitrary mark-specific infection hazards over time in the placebo group. In practice, different key subgroups (e.g. men and women in the Thai trial) are assigned different baseline mark-specific hazards of HIV infection.

Arranging β(v)=(β1(v),β2T(v))T, so that β1(v) is the coefficient for vaccination status and β2(v) for other covariates, the covariate- and stratum-adjusted mark-specific vaccine efficacy VE(v)=1− exp {β1(v)}. Sun et al. (2009) developed some statistical procedures for model (1) under K = 1 based on observations of the random variables (X,Z(·),V) for δ = 1 and (X,Z(·)) for δ = 0, where X = min{T,C}, δ = I(TC) and C is a censoring random variable. Sun and Gilbert (2012) developed estimation procedures for model (1) allowing V to be missing for some subjects with δ = 1; these methods incorporate auxiliary covariates and/or auxiliary mark variables that inform about the probability V that is observed and about the distribution of V. This paper develops parallel hypothesis testing procedures for assessing VE(v). As summarized in Section 1, the two objectives are to assess whether the vaccine efficacy ever deviates from 0 (i.e. test VE(v)=0) and to assess whether the vaccine efficacy changes with the mark (i.e. test VE(v)=VE).

2.2. Missing data assumptions

Let R be the indicator of whether all possible data are observed for a subject; R = 1 if either δ = 0 (right censored) or if δ = 1 and V is observed; and R = 0 otherwise. Auxiliary variables A may be helpful for predicting missing marks. Since the mark can only be missing for failures, supplemental information is potentially useful only for failures, for predicting missingness and for informing about the distribution of missing marks. For example, if V is defined on the basis of the early virus, then V*, the auxiliary mark information, may include sequences of later sampled viruses and can be considered a subset of A. In general, A could include multiple viral sequences per infected subject at multiple time points, giving information on intrasubject HIV evolution. The relationship between A and V can be modelled to help to predict V (see Section 5 for a simulated example).

We assume that C is conditionally independent of (T,V) given Z(·) and the stratum. We also assume that V is missing at random (Rubin, 1976), i.e., given δ = 1 and W=(T,Z(T),A), the probability that V is missing depends only on the observed W, not on the value of V; this assumption is expressed as

rk(W)P(R=1|δ=1,W)=P(R=1|V,δ=1,W).

(2)

Let πk(Q)=P(R=1|Q) where Q=(δ,W). Then πk(Q)=δrk(W)+1−δ. The missingness at random (MAR) assumption (2) also implies that V is independent of R given Q:

ρk(v,W)P(Vv|δ=1,W)=P(Vv|R=1,δ=1,W).

(3)

Define rk(w)=P(R=1|δ=1,W=w) and ρk(v,w)=P(Vv|δ=1,W=w). The stratum-specific definitions of rk(w) and ρk(v,w) allow the models of the probability of complete-case and of the mark distribution to differ across strata.

Let τ be the end of the follow-up period, and nk be the number of subjects in the kth stratum; the total sample size is n=Σk=1Knk. Let {Xki,Zki(·),δki,Rki,Vki,Aki;i=1,,nk} be independent and identically distributed replicates of {X,Z(·),δ, R,V,A} from the kth stratum. The observed data are {Oki;i=1,,nk,k=1,,K}, where Oki={Xki,Zki(·),Rki, RkiVki,Aki} for δki=1 and Oki={Xki,Zki(·),Rki=1} for δki=0. We assume that the Oki are independent for all subjects.

2.3. Hypotheses to test

We develop procedures for testing the following two sets of hypotheses. Let [a,b]⊂(0,1). The first set of hypotheses is

H10:VE(v)=0forv[a,b]versusH1a:VE(v)0for somev(general alternative)orH1m:VE(v)0with strict inequality for somev(monotone alternative).

The second set of hypotheses is

H20:VE(v)does not depend onv[a,b]versusH2a:VE(v)depends onv(general alternative)orH2m:VE(v)decreases asvincreases (monotone alternative).

The null hypothesis H10 implies that the vaccine affords no protection (nor increased risk) against any HIV genotype. The ordered alternative H1m indicates that the vaccine provides protection for at least some of the HIV genotypes, whereas H1a indicates that the vaccine provides protection and/or increased risk for some HIV genotypes. The null hypothesis H20 implies that there is no difference in vaccine protection against different HIV genotypes. The ordered alternative H2m indicates that vaccine efficacy decreases with v and H2a indicates that the vaccine efficacy changes with v. With β1(v) the first component of β(v), the first set of hypotheses is equivalent to H10:β1(v)=0 for v ∈ [a,b] versusH1a:β1(v)≠0 for some v or H1m:β1(v)≤0 with strict inequality for some v. The second set of hypotheses is equivalent to H20:β1(v) does not depend on v ∈ [a,b] versusH2a:β1(v) depends on v or H2m:β1(v) increases as v increases. We develop testing procedures for detecting departures from H10 in the direction of H1m and H1a and for detecting departures from H20 in the direction of H2m and H2a. The procedures are developed on the basis of the augmented IPW complete-case estimator that was developed by Sun and Gilbert (2012).

3. Estimation procedure with missing marks

The augmented IPW (AIPW) estimator for model (1) is obtained in two stages. First the IPW complete-case estimator is derived and second the AIPW estimator is obtained, which improves efficiency by accounting for information in the conditional distribution of V given the auxiliaries.

Let rk(Wki, ψk) be the parametric model for the probability of the complete case, with rk(Wki) defined in equation (2), where Wki=(Tki,Zki(Tki),Aki) and ψk is a q-dimensional parameter. For example, we can assume the logistic model with logit{rk(Wki,ψk)}=ψkTWki for those with δki=1, where Wki=(Tki,Zki(Tki),Aki). By equation (2), the maximum likelihood estimator ψ^=(ψ^1,,ψ^K) of ψ=(ψ1,…,ψK) is obtained by maximizing the observed data likelihood

k,irk(Wki,ψk)Rkiδki{1rk(Wki,ψk)}(1Rki)δki.

(4)

Let K(x) be a kernel function with support [−1,1] and let h = hn be a bandwidth. Let Nki(t,v)=I(Xkit,δki=1,Vkiv) and Yki(t)=I(Xkit). Let Qki=(δki,Wki) and πk(Qki, ψk)=δkirk(Wki, ψk)+1−δki. The first-stage estimator is the IPW estimator β^ipw(v), which solves the estimating equation for βUipw(v,β,ψ^)=0, where

Uipw(v,β,ψ^)=k=1Ki=1nk010τKh(uv){Zki(t)Z~k(t,β,ψ^k)}Rkiπk(Qki,ψ^k)Nki(dt,du).

(5)

Here

Kh(x)=K(x/h)h,

Z~k(t,β,ψk)=S~k(1)(t,β,ψk)S~k(0)(t,β,ψk)

and

S~k(j)(t,β,ψk)=nk1i=1nkRki{πk(Qki,ψk)}1Yki(t)exp{βTZki(t)}Zki(t)j

for j = 0,1, where z⊗0=1 and z⊗1=z for any zRp. The score function (5) can be viewed as an extension of the score function that is used for the cause-specific Cox model (Prentice et al., 1978) for a particular failure cause J = j, for which the counting process counts only events of type j. It borrows strength from observations having marks in the neighbourhood of v. The kernel function is designed to give greater weight to observations with marks near v than those further away.

The baseline function λ0k(t,v) can be estimated by λ^0kipw(t,v), obtained by smoothing the increments of the following estimator of the doubly cumulative baseline function Λ0k(t,v)=0t0vλ0k(s,u)dsdu:

Λ^0kipw(t,v)=i=1nk0t0vRkiπk(Qki,ψ^k)Nki(ds,du)nkS~k(0){s,β^ipw(u),ψk^}.

(6)

For example, one can use the following kernel smoothing:

λ^0kipw(t,v)=0τ01Kh1(1)(ts)Kh2(2)(vu)Λ^0kipw(ds,du),

(7)

where Kh1(1)(x)=K(1)(x/h1)/h1 and Kh2(2)(x)=K(2)(x/h2)/h2, with K(1)(·) and K(2)(·) the kernel functions and h1 and h2 the bandwidths.

Following Robins et al. (1994), Sun and Gilbert (2012) proposed a more efficient procedure for estimating equation (1) by incorporating the knowledge of ρk(w,v) in the estimation procedure. Let w=(t,z,a) and gk(a|t,v,z)=P(Aki=a|Tki=t,Vki=v,Zki=z,δki=1). Then

ρk(w,v)=0vλk(t,u|z)gk(a|t,u,z)du/01λk(t,u|z)gk(a|t,u,z)du.

(8)

If no auxiliary variables are available or if Aki is conditionally independent of Vki given (Tki,Zki, δki), then

ρk(w,v)=0vλk(t,u|z)du/01λk(t,u|z)du.

In this case, ρk(w,v) can be estimated by

ρ^kipw(w,v)=0vλ^kipw(t,u|z)du/01λ^kipw(t,u|z)du,

where λ^kipw(t,u|z)=λ^0kipw(t,u)exp{β^ipw(u)Tz}. When the auxiliary marks Aki are correlated with Vki conditional on Tki,Zki and δki=1, the conditional distribution ρk(w,v) involves the function gk(a|t,u,z), for which a parametric or semiparametric model may be developed to describe the dependence between Aki and Vki. Let g^k(a|t,u,z) be an estimator of gk(a|t,u,z) with a convergence rate of at least (nh)−1/2. Then ρk(w,v) can be estimated by

ρ^kipw(w,v)=0vλ^kipw(t,u|z)g^k(a|t,u,z)du/01λ^kipw(t,u|z)g^k(a|t,u,z)du.

(9)

Let Nkix(t)=I(Xkit,δki=1) and Nkiv(v)=I(Vkiv). The AIPW estimating equation for β is Uaug{v,β,ψ^,ρ^(·)}=0, where

Uaug{v,β,ψ^,ρ^()}=k=1Ki=1nk010τKh(uv){Zki(t)Z¯k(t,β)}×[Rkiπk(Qki,ψ^k)Nki(dt,du)+{1Rkiπk(Qki,ψ^k)}Nkix(dt)d{ρ^kipw(Wki,u)}],

(10)

and

Z¯k(t,β)=Sk(1)(t,β)/Sk(0)(t,β)

and

Sk(j)(t,β)=nk1i=1nkYki(t)exp{βTZki(t)}Zki(t)j

for j = 0,1. The AIPW estimator of β(v) solves equation (10) and is denoted by β^aug(v). The estimator of the cumulative function B(v)=0vβ(u)du is given by B^aug(v)=0vβ^aug(u)du. Note that there is no ψ^k in Z¯k(t,β); this is a difference between the IPW and AIPW estimators.

To implement the estimation procedures in practice, one can use arbitrary auxiliaries for estimating ψ^k; these auxiliaries may include covariates and marks at multiple time points pre infection and post infection respectively. In contrast, although in principle arbitrary auxiliaries may also be used for the terms g^k(a|t,u,z) in equation (9), owing to the curse of dimensionality the method is expected to perform best in practice with a univariate auxiliary, where semiparametric or fully parametric models for gk(a|t,u,z) would be required to include multivariate auxiliaries.

Sun and Gilbert (2012) proved that the estimators β^ipw(t,v) and β^aug(t,v) are consistent and that β^aug(v) is more efficient than β^ipw(v). In the next section, we develop some hypothesis testing procedures for assessing mark-specific vaccine efficacy based on B^aug(v).

4. Testing of mark-specific vaccine efficacy

The covariate-adjusted vaccine efficacy VE(v) is defined through the first component of β(v). Let B1(v) be the first component of the cumulative coefficient function B(v). The hypothesis tests concerning VE(v) are constructed on the basis of the first component B^1aug(v) of the AIPW estimator B^aug(v). The cumulative estimator B^aug(v) has more stable large sample behaviour and a faster convergence rate than β^aug(v).

Let WB(v)=n1/2{B^aug(v)B^aug(a)}n1/2{B(v)B(a)} for v ∈ [a,b]. In Appendix A we show that WB(v), v ∈ [a,b], converges weakly to a p-dimensional mean 0 Gaussian process with continuous sample paths on v ∈ [a,b]. Further, the distribution of WB(v), for v ∈ [a,b], can be approximated by using the Gaussian multipliers resampling method (of Lin et al. (1993)) based on

WB*(v)=n1/2k=1Ki=1nkξkiH^ki(v),v[a,b],

where {ξki,i=1,,nk,k=1,,K} are independent and identically distributed standard normal random variables and H^ki(v) is defined in expression (22) in Appendix A. Let WB1(v) and WB1*(v) be the first component of WB(v) and WB*(v) respectively. With the Gaussian multipliers method, the variance var{B^1aug(v)B^1aug(a)} can be consistently estimated by var^{B^1aug(v)B^1aug(a)}=n1var*{WB1*(v)}, where var*{WB1*(v)} is the first component on the diagonal of the covariance given in expression (23) in Appendix A.

4.1. Testing the null hypothesis H10

Consider the test process Q(1)(v)=n1/2{B^1aug(v)B^1aug(a)}, v ∈ [a,b]. Then Q(1)(v)=WB1(v)+n1/2{B1(v)B1(a)}, v ∈ [a,b]. Under H10, B1(v)B1(a)=0 for v ∈ [a,b], which motivates the following test statistics for testing H10:

Ta1(1)=supv[a,b]|Q(1)(v)|,Ta2(1)=abQ(1)(v)2dvar*{WB1*(v)},Tm1(1)=infv[a,b]Q(1)(v),Tm2(1)=abQ(1)(v)dvar*{WB1*(v)}.

The test statistics Ta1(1) and Ta2(1) capture general departures H1a, whereas the test statistics Tm1(1) and Tm2(1) are sensitive to the monotone departures H1m. It is easy to derive that all the test statistics Ta1(1), Ta2(1), Tm1(1) and Tm2(1) are consistent against their respective alternative hypotheses, and Appendix A derives their limiting distributions under H10.

Under H10, the distribution of Q(1)(v), v ∈ [a,b], can be approximated by the conditional distribution of WB1*, v ∈ [a,b], given the observed data sequence. Hence, the distributions of Ta1(1), Ta2(1), Tm1(1) and Tm2(1) under H10 can be approximated by the conditional distributions of Ta1*(1)=supv[a,b]|WB1*(v)|, Ta2*(1)=abWB1*(v)2dvar*{WB1*(v)}, Tm1*(1)=infv[a,b]WB1*(v) and Tm2*(1)=abWB1*(v)dvar*{WB1*(v)}, given the observed data sequence, respectively. The critical values ca1(1) and ca2(1) of the test statistics Ta1(1) and Ta2(1) can be approximated by the (1−α)-quantile of Ta1*(1) and Ta2*(1), which can be obtained by repeatedly generating a large number, say 500, of independent sets of normal samples {ξki,i=1,,nk,k=1,,K} while holding the observed data sequence fixed. Similarly, the critical values cm1(1) and cm2(1) of the test statistics Tm1(1) and Tm2(1) can be approximated by the α-quantile of Tm1*(1) and Tm2*(1), which again can be obtained by repeatedly generating independent sets of normal samples {ξki,i=1,,nk,k=1,,K}. At the significance level α, the tests based on Ta1(1) and Ta2(1) reject H10 in favour of H1a if Ta1(1)>ca1(1) and Ta2(1)>ca2(1) respectively, and the tests based on Tm1(1) and Tm2(1) reject H10 in favour of H1m if Tm1(1)<cm1(1) and Tm2(1)<cm2(1) respectively.

4.2. Testing the null hypothesis H20

Let

Q(2)(v)=(va)1n1/2{B^1aug(v)B^1aug(a)}(ba)1n1/2{B^1aug(b)B^1aug(a)}.

Then

Q(2)(v)=Γ(v,WB1)+n1/2Γ(v,B1)fora<vb,

(11)

where

Γ(v,F1)=(va)1{F1(v)F1(a)}(ba)1{F1(b)F1(a)}

is a transformation of F1(·). We note that Γ(·,B1)=0 under H20 and Γ(·,B1)0 under the alternatives, motivating Q(2)(v) as the test process and the following test statistics for testing H20:

Ta1(2)=supv[a,b]|Q(2)(v)|,Ta2(2)=abQ(2)(v)2dvar*{WB1*(v)},Tm1(2)=infv[a,b]Q(2)(v),Tm2(2)=abQ(2)(v)dvar*{WB1*(v)},

where a<a<b. We choose a>a to avoid 0 in the denominator of Q(2)(v). In practice, one can choose a close to a to make use of available data and to ensure that the tests are consistent.

By the asymptotic results shown in Appendix A and the continuous mapping theorem, under H20 the distribution of Q(2)(v), v ∈ [a,b], can be approximated by the conditional distribution of Γ(v,WB1*), v ∈ [a,b], given the observed data sequence. Hence, the distributions of Ta1(2), Ta2(2), Tm1(2) and Tm2(2) under H20 can be approximated by the conditional distributions of

Ta1*(2)=supv[a,b]|Γ(v,WB1*)|,

Ta2*(2)=abΓ(v,WB1*)2dvar*{WB1*(v)},

Tm1*(2)=infv[a,b]Γ(v,WB1*)

and

Tm2*(2)=abΓ(v,WB1*)dvar*{WB1*(v)},

given the observed data sequence, respectively. Similarly to Section 4.1, the respective critical values ca1(2) and ca2(2) of the test statistics Ta1(2) and Ta2(2) can be approximated by the (1−α)-quantiles of the conditional distributions of Ta1*(2) and Ta2*(2) obtained through repeatedly generating independent sets of normal samples {ξki,i=1,,nk,k=1,,K} while holding the observed data sequence fixed. The critical values cm1(2) and cm2(2) for Tm1(2) and Tm2(2) can be approximated similarly. At the significance level α, the tests based on Ta1(2) and Ta2(2) reject H20 in favour of H2a if Ta1(2)>ca1(2) and Ta2(2)>ca2(2) respectively, and the tests based on Tm1(2) and Tm2(2) reject H20 in favour of H2m if Tm1(2)<cm1(2) and Tm2(2)<cm2(2) respectively.

The tests Ta1(2) and Ta2(2) capture general departures H2a whereas the tests Tm1(2) and Tm2(2) are sensitive to the monotone departure H2m. Note that the derivative dΓ(v,B1)/dv=(va)1{β1(v)(va)1B1(v)}0 under H2m with strict inequality for at least some v ∈ [a,b]. This plus the fact that Γ(v,B1) is non-decreasing with Γ(b,B1)=0 lead to the results that the tests based on Tm1(2) and Tm2(2) are consistent against H2m and the tests based on Ta1(2) and Ta2(2) are consistent against H2a. The proofs are given in the second paragraph following theorem 1 in Appendix A.

In Sections 4.1 and 4.2, we considered two types of test statistics, namely the integration-based test statistics and the supremum-based test statistics, for each pair of hypotheses. The former are generalizations of the Cramér–von Mises test statistic and involve integration of deviations over the whole range of the mark, whereas the latter are extensions of the classic Kolmogorov–Smirnov test statistic for testing the goodness of fit of a distribution function, and take the supremum of such deviations. As demonstrated in a comprehensive analysis of the relative powers of the classic Kolmogorov–Smirnov test and the Cramér–von Mises test by Stephens (1974), we expect that the two types of test statistic have different powers for different true alternative distributions. The integration-based test statistics are best suited for situations where the true alternative distribution deviates a little over the whole support of the mark and the supremum-based test statistics may have more power against situations where the true alternative has large deviations over a small section of the support. For example, for testing differential VE(v), H20, the supremum-based tests will tend to be relatively more powerful if VE^(v) is very high for a small range of marks near a and declines sharply to 0 and is constant at 0 for all other marks.

5. Simulation study

5.1. Numerical assessment of the tests under correctly specified models

We conduct a simulation study to evaluate the finite sample performance of the testing procedures proposed. The empirical sizes and powers of the test statistics are assessed for various models, sample sizes (500 and 800) and choices of bandwidths. The powers of the tests are evaluated in both situations where a correlated auxiliary variable is used and where it is absent.

We consider K = 1 stratum. Let Zki be the treatment indicator with P(Zki=1)=0.5. The (Tki, Vki) are generated from the following mark-specific proportional hazards model:

λ(t,v|z)=exp{γv+(α+βv)z},t0,0v1,

(12)

where α, β and γ are constants. Under model (12), λ0(t,v)= exp (γv) and VE(v)=1− exp (α+βv). For α = 0 and β = 0, VE(v)=0, indicating no vaccine efficacy, and, for β = 0, VE(v)=VE, indicating mark invariant vaccine efficacy, whereas β>0 indicates VE(v) decreasing in v. We examine the hypothesis testing procedures for the following specific models M1–M5 respectively:

(α,β,γ)=(0,0,0.3),

implying that VE(v)=0;

(α,β,γ)=(0.69,0,0.3),

implying that VE(v) does not depend on v;

(α,β,γ)=(0.6,0.6,0.3),

implying that VE(v) decreases;

(α,β,γ)=(1.2,1.2,0.3),

implying that VE(v) decreases;

(α,β,γ)=(1.5,1.5,0.3),

implying that VE(v) decreases.

We generate the censoring times from an exponential distribution, independent of (T,V), with censoring rates ranging from 20% to 30%. We take τ = 2.0. The complete-case indicator Rki is generated with conditional probability rk(Wki)=P(Rki=1|δki=1,Wki), where

logit{rk(Wki)}=ψk0+ψk1Zki,i=1,,nk,k=1,,K.

(13)

With ψk0=0.2 and ψk1=−0.2 about 50% of observed failures are missing marks.

Conditionally on (Tki, Zki, Vki), we assume that the auxiliary marks follow the model

Aki=(θ+1)1(Vki+θUki),θ>0,

(14)

for i=1,,nk,k = 1,…,K, where Vki are the possibly missing marks, Uki is uniformly distributed on [0,1] independent of Vki and θ>0 is an association parameter between Aki and Vki. The correlation coefficient ρ between Aki and Vki is 1 for θ = 0. Since Aki is observed for all observed failure times, the AIPW estimator in this case is the full data estimator. The Aki and Vki are independent for θ = ∞, yielding ρ = 0. In addition, the θ-values of 0.8, 0.4 and 0.2 correspond to ρ = 0.78, 0.92, 0.98.

Under model (14), the conditional density of Aki given (Tki, Zki, Vki) is

gk(at,v,z;θ)=1+θθI(v1+θav+θ1+θ),0a1,0v1.

(15)

The likelihood function for θ is

L(θ)=δki=1,Rki=11+θθI(Vki1+θAkiVki+θ1+θ)forθ>0.

It is easy to show that the maximum likelihood estimator equals

θ^=maxδki=1,Rki=1{Vki/Aki,(1Vki)/(1Aki)}1.

The density estimator gk(a|t,v,z;θ^) is plugged into equation (9) to obtain ρ^kipw(w,v), which is used to construct the AIPW estimator of β in equation (10).

The performances of the test procedures proposed are evaluated through simulations for the models described in expressions (12), (13) and (14) under the settings M1–M5, where M1 is a setting under the null hypothesis H10 and M2 is a setting under the null hypothesis H20. We consider the situations where no auxiliary information is provided and where the correlation between the auxiliary mark and the mark of interest is ρ = 0.92 (under model (14) with θ = 0.4). Table 1 presents the empirical sizes and powers of the tests Ta1(1), Ta2(1), Tm1(1) and Tm2(1) for testing H10 at the nominal level 0.05. Table 2 presents the empirical sizes and powers of the tests Ta1(2), Ta2(2), Tm1(2) and Tm2(2) for testing H20 at the nominal level 0.05. The results are presented for n = 500 with h1=0.1 and h=h2=0.15, 0.2, and for n = 800 with h1=0.1 and h=h2=0.1, 0.15. We take a = 0, b = 1 and a=0.5 for the tests. The Epanechnikov kernel K(x)=0.75(1x2)I(|x|1) is used throughout the numerical analysis.

Table 1

Open in new tab

Empirical sizes and powers of the tests T1(1), T2(1), T1(1) and T2(1) for testing H10 at the nominal level 0.05 for ρ = 0 and 0.92 when 50% of the marks are missing

Model(α,β,γ)nhSize/power
ρ = 0ρ = 0.92
Ta1(1)Ta2(1)Tm1(1)Tm2(1)Ta1(1)Ta2(1)Tm1(1)Tm2(1)
M1(0,0,0.3)5000.155.44.04.05.04.64.23.84.2
0.205.04.44.65.24.84.04.23.6
8000.103.83.64.24.23.83.85.44.8
0.154.03.84.64.65.04.45.45.6
M3(−0.6,0.6,0.3)5000.1568.267.079.476.073.274.683.285.4
0.2063.265.075.874.269.271.479.882.6
8000.1088.286.294.690.492.093.095.097.2
0.1587.486.692.890.889.290.693.495.2
M4(−1.2,1.2,0.3)5000.1599.699.499.899.899.810099.8100
0.2099.499.099.699.899.699.899.8100
8000.10100100100100100100100100
0.15100100100100100100100100
M2(−0.69,0,0.3)5000.1510010010099.8100100100100
0.20100100100100100100100100
8000.1010099.810010099.899.899.899.8
0.15100100100100100100100100
Model(α,β,γ)nhSize/power
ρ = 0ρ = 0.92
Ta1(1)Ta2(1)Tm1(1)Tm2(1)Ta1(1)Ta2(1)Tm1(1)Tm2(1)
M1(0,0,0.3)5000.155.44.04.05.04.64.23.84.2
0.205.04.44.65.24.84.04.23.6
8000.103.83.64.24.23.83.85.44.8
0.154.03.84.64.65.04.45.45.6
M3(−0.6,0.6,0.3)5000.1568.267.079.476.073.274.683.285.4
0.2063.265.075.874.269.271.479.882.6
8000.1088.286.294.690.492.093.095.097.2
0.1587.486.692.890.889.290.693.495.2
M4(−1.2,1.2,0.3)5000.1599.699.499.899.899.810099.8100
0.2099.499.099.699.899.699.899.8100
8000.10100100100100100100100100
0.15100100100100100100100100
M2(−0.69,0,0.3)5000.1510010010099.8100100100100
0.20100100100100100100100100
8000.1010099.810010099.899.899.899.8
0.15100100100100100100100100

The bandwidths are h1=0.1 and h2=h. Each entry is based on 500 Gaussian multipliers samples and 500 repetitions.

Table 1

Open in new tab

Empirical sizes and powers of the tests T1(1), T2(1), T1(1) and T2(1) for testing H10 at the nominal level 0.05 for ρ = 0 and 0.92 when 50% of the marks are missing

Model(α,β,γ)nhSize/power
ρ = 0ρ = 0.92
Ta1(1)Ta2(1)Tm1(1)Tm2(1)Ta1(1)Ta2(1)Tm1(1)Tm2(1)
M1(0,0,0.3)5000.155.44.04.05.04.64.23.84.2
0.205.04.44.65.24.84.04.23.6
8000.103.83.64.24.23.83.85.44.8
0.154.03.84.64.65.04.45.45.6
M3(−0.6,0.6,0.3)5000.1568.267.079.476.073.274.683.285.4
0.2063.265.075.874.269.271.479.882.6
8000.1088.286.294.690.492.093.095.097.2
0.1587.486.692.890.889.290.693.495.2
M4(−1.2,1.2,0.3)5000.1599.699.499.899.899.810099.8100
0.2099.499.099.699.899.699.899.8100
8000.10100100100100100100100100
0.15100100100100100100100100
M2(−0.69,0,0.3)5000.1510010010099.8100100100100
0.20100100100100100100100100
8000.1010099.810010099.899.899.899.8
0.15100100100100100100100100
Model(α,β,γ)nhSize/power
ρ = 0ρ = 0.92
Ta1(1)Ta2(1)Tm1(1)Tm2(1)Ta1(1)Ta2(1)Tm1(1)Tm2(1)
M1(0,0,0.3)5000.155.44.04.05.04.64.23.84.2
0.205.04.44.65.24.84.04.23.6
8000.103.83.64.24.23.83.85.44.8
0.154.03.84.64.65.04.45.45.6
M3(−0.6,0.6,0.3)5000.1568.267.079.476.073.274.683.285.4
0.2063.265.075.874.269.271.479.882.6
8000.1088.286.294.690.492.093.095.097.2
0.1587.486.692.890.889.290.693.495.2
M4(−1.2,1.2,0.3)5000.1599.699.499.899.899.810099.8100
0.2099.499.099.699.899.699.899.8100
8000.10100100100100100100100100
0.15100100100100100100100100
M2(−0.69,0,0.3)5000.1510010010099.8100100100100
0.20100100100100100100100100
8000.1010099.810010099.899.899.899.8
0.15100100100100100100100100

The bandwidths are h1=0.1 and h2=h. Each entry is based on 500 Gaussian multipliers samples and 500 repetitions.

Table 2

Open in new tab

Empirical sizes and powers of the tests T1(2), T2(2), T1(2) and T2(2) for testing H20 at the nominal level 0.05 for ρ = 0 and 0.92 when 50% of the marks are missing

Model(α,β,γ)nhSize/power
ρ = 0ρ = 0.92
Ta1(2)Ta2(2)Tm1(2)Tm2(2)Ta1(2)Ta2(2)Tm1(2)Tm2(2)
M2(−0.69,0,0.3)5000.155.64.85.85.87.67.27.47.0
0.205.84.85.45.26.66.66.67.4
8000.106.45.05.65.86.25.87.27.0
0.156.65.25.85.66.05.66.06.6
M3(−0.6,0.6,0.3)5000.1516.817.022.425.220.625.832.637.4
0.2014.215.822.224.819.424.231.834.6
8000.1026.025.835.236.436.038.046.049.2
0.1525.425.834.835.634.036.045.447.4
M4(−1.2,1.2,0.3)5000.1544.446.259.063.263.668.476.480.2
0.2042.244.057.259.661.465.873.275.8
8000.1066.267.675.278.082.886.690.691.8
0.1564.666.274.077.080.684.488.491.2
M5(−1.5,1.5,0.3)5000.1564.566.575.076.581.085.688.890.4
0.2061.062.672.272.277.882.486.889.4
8000.1080.885.687.691.494.696.297.698.4
0.1578.684.887.891.494.495.695.897.8
Model(α,β,γ)nhSize/power
ρ = 0ρ = 0.92
Ta1(2)Ta2(2)Tm1(2)Tm2(2)Ta1(2)Ta2(2)Tm1(2)Tm2(2)
M2(−0.69,0,0.3)5000.155.64.85.85.87.67.27.47.0
0.205.84.85.45.26.66.66.67.4
8000.106.45.05.65.86.25.87.27.0
0.156.65.25.85.66.05.66.06.6
M3(−0.6,0.6,0.3)5000.1516.817.022.425.220.625.832.637.4
0.2014.215.822.224.819.424.231.834.6
8000.1026.025.835.236.436.038.046.049.2
0.1525.425.834.835.634.036.045.447.4
M4(−1.2,1.2,0.3)5000.1544.446.259.063.263.668.476.480.2
0.2042.244.057.259.661.465.873.275.8
8000.1066.267.675.278.082.886.690.691.8
0.1564.666.274.077.080.684.488.491.2
M5(−1.5,1.5,0.3)5000.1564.566.575.076.581.085.688.890.4
0.2061.062.672.272.277.882.486.889.4
8000.1080.885.687.691.494.696.297.698.4
0.1578.684.887.891.494.495.695.897.8

The bandwidths are h1=0.1 and h2=h. Each entry is based on 500 Gaussian multipliers samples and 500 repetitions.

Table 2

Open in new tab

Empirical sizes and powers of the tests T1(2), T2(2), T1(2) and T2(2) for testing H20 at the nominal level 0.05 for ρ = 0 and 0.92 when 50% of the marks are missing

Model(α,β,γ)nhSize/power
ρ = 0ρ = 0.92
Ta1(2)Ta2(2)Tm1(2)Tm2(2)Ta1(2)Ta2(2)Tm1(2)Tm2(2)
M2(−0.69,0,0.3)5000.155.64.85.85.87.67.27.47.0
0.205.84.85.45.26.66.66.67.4
8000.106.45.05.65.86.25.87.27.0
0.156.65.25.85.66.05.66.06.6
M3(−0.6,0.6,0.3)5000.1516.817.022.425.220.625.832.637.4
0.2014.215.822.224.819.424.231.834.6
8000.1026.025.835.236.436.038.046.049.2
0.1525.425.834.835.634.036.045.447.4
M4(−1.2,1.2,0.3)5000.1544.446.259.063.263.668.476.480.2
0.2042.244.057.259.661.465.873.275.8
8000.1066.267.675.278.082.886.690.691.8
0.1564.666.274.077.080.684.488.491.2
M5(−1.5,1.5,0.3)5000.1564.566.575.076.581.085.688.890.4
0.2061.062.672.272.277.882.486.889.4
8000.1080.885.687.691.494.696.297.698.4
0.1578.684.887.891.494.495.695.897.8
Model(α,β,γ)nhSize/power
ρ = 0ρ = 0.92
Ta1(2)Ta2(2)Tm1(2)Tm2(2)Ta1(2)Ta2(2)Tm1(2)Tm2(2)
M2(−0.69,0,0.3)5000.155.64.85.85.87.67.27.47.0
0.205.84.85.45.26.66.66.67.4
8000.106.45.05.65.86.25.87.27.0
0.156.65.25.85.66.05.66.06.6
M3(−0.6,0.6,0.3)5000.1516.817.022.425.220.625.832.637.4
0.2014.215.822.224.819.424.231.834.6
8000.1026.025.835.236.436.038.046.049.2
0.1525.425.834.835.634.036.045.447.4
M4(−1.2,1.2,0.3)5000.1544.446.259.063.263.668.476.480.2
0.2042.244.057.259.661.465.873.275.8
8000.1066.267.675.278.082.886.690.691.8
0.1564.666.274.077.080.684.488.491.2
M5(−1.5,1.5,0.3)5000.1564.566.575.076.581.085.688.890.4
0.2061.062.672.272.277.882.486.889.4
8000.1080.885.687.691.494.696.297.698.4
0.1578.684.887.891.494.495.695.897.8

The bandwidths are h1=0.1 and h2=h. Each entry is based on 500 Gaussian multipliers samples and 500 repetitions.

Tables 1 and 2 show that all the tests have satisfactory empirical sizes close to the nominal level 0.05. The powers of the tests increase with sample size and they are not overly sensitive to the bandwidths selected. The powers of the tests for testing H10 increase as the model moves in the direction M1→M3→M4→M2, representing increased departure from the null hypothesis H10. The powers of the tests for testing H20 increase as the model moves in the direction M2→M3→M4→M5, representing increased departure from the null hypothesis H20. The tests utilizing the auxiliary marks have higher power than those without using the auxiliary marks.

As with any non-parametric smoothing procedure, one needs to select bandwidths carefully. In practice, the appropriate bandwidth selection can be based on a K-fold cross-validation method (e.g. Efron and Tibshirani (1993), Hoover et al. (1998), Cai et al. (2000) and Tian et al. (2005)).

The testing procedures proposed properly handle missing marks under MAR with asymptotically correct significance levels. However, if only the observations with complete information are used, i.e. the complete-case analysis, then the testing procedures are expected often not to provide correct type I error control. We conduct a simulation study to evaluate the observed sizes of the proposed tests using the complete cases under two different models for missing the indicator Rki—model (13) and the model

logit{rk(Wki)}=0.8Zki0.3Tki,i=1,,nk,k=1,,K.

(16)

For K = 1 both model (13) and model (16) yield about 50% missing marks among the observed failures. The sizes of Ta1(1), Ta2(1), Tm1(1) and Tm2(1) for testing H10 are evaluated under model M1 and the sizes of Ta1(2), Ta2(2), Tm1(2) and Tm2(2) for testing H20 are evaluated under model M2 (Table 3). Under model (13), the observed sizes for testing H10 are elevated (around 7–15%), whereas those for testing H20 remain around 5%. Under model (16), the observed sizes for testing H10 exceed 37% for all tests, whereas those for testing H20 reach 12% and 14% for tests Tm1(2) and Tm2(2) when n = 800.

Table 3

Open in new tab

Empirical sizes of the tests for H10 and H20 at the nominal level 0.05 using the complete cases under missingness completely at random when 50% of the marks are missing

ModelMissingness modelnhSize
Ta1(1)Ta2(1)Tm1(1)Tm2(1)
TestingH10
M1(13)5000.200.140.100.120.15
8000.150.100.070.110.11
(16)5000.200.390.370.500.42
8000.150.500.460.630.55
Ta1(2)Ta2(2)Tm1(2)Tm2(2)
Testing H20
M2(13)5000.200.080.040.080.05
8000.150.060.090.060.10
(16)5000.200.080.070.080.05
8000.150.070.060.120.14
ModelMissingness modelnhSize
Ta1(1)Ta2(1)Tm1(1)Tm2(1)
TestingH10
M1(13)5000.200.140.100.120.15
8000.150.100.070.110.11
(16)5000.200.390.370.500.42
8000.150.500.460.630.55
Ta1(2)Ta2(2)Tm1(2)Tm2(2)
Testing H20
M2(13)5000.200.080.040.080.05
8000.150.060.090.060.10
(16)5000.200.080.070.080.05
8000.150.070.060.120.14

The bandwidths are h1=0.1 and h2=h. Each entry is based on 100 Gaussian multipliers samples and 100 repetitions.

Table 3

Open in new tab

Empirical sizes of the tests for H10 and H20 at the nominal level 0.05 using the complete cases under missingness completely at random when 50% of the marks are missing

ModelMissingness modelnhSize
Ta1(1)Ta2(1)Tm1(1)Tm2(1)
TestingH10
M1(13)5000.200.140.100.120.15
8000.150.100.070.110.11
(16)5000.200.390.370.500.42
8000.150.500.460.630.55
Ta1(2)Ta2(2)Tm1(2)Tm2(2)
Testing H20
M2(13)5000.200.080.040.080.05
8000.150.060.090.060.10
(16)5000.200.080.070.080.05
8000.150.070.060.120.14
ModelMissingness modelnhSize
Ta1(1)Ta2(1)Tm1(1)Tm2(1)
TestingH10
M1(13)5000.200.140.100.120.15
8000.150.100.070.110.11
(16)5000.200.390.370.500.42
8000.150.500.460.630.55
Ta1(2)Ta2(2)Tm1(2)Tm2(2)
Testing H20
M2(13)5000.200.080.040.080.05
8000.150.060.090.060.10
(16)5000.200.080.070.080.05
8000.150.070.060.120.14

The bandwidths are h1=0.1 and h2=h. Each entry is based on 100 Gaussian multipliers samples and 100 repetitions.

These simulation results verify that the testing procedures applied to complete cases generally do not have nominal size, although for some of the scenarios the sizes are nominal. To explain this, it can be shown that, under MAR, λk(t,v|z,Rki=1)=λk(t,v|z)hk(t,z), where hk(t,z)=P(Rki=1|Tki=t,Zki=z)/P(Rki=1|Tkit,Zki=z). If hk(t,z) does not depend on z and MAR holds, then the observations for individuals with the observed marks only can be viewed as a random sample from a mark-specific proportional hazards model with a different baseline hazard function but the same regression function β(v). In this case, the tests for both H10 and H20 based on the complete cases are valid. If hk(t,z) depends on z but not on t and MAR holds, then hk(t,z) can be expressed as hk(t,z)=exp(ϑkz) (the scenario under model (13)), and the tests of H10 based on the complete cases will be biased. However, the tests of H20 remain unbiased since the biases in the estimation of β(v) do not depend on v, such that the test process Q(2)(v) is still asymptotically a mean 0 process. In general, if hk(t,z) depends on both z and t and MAR holds, which is the scenario under the missingness model (16), then the test process Q(2)(v) is not an asymptotically mean 0 process. The magnitude of departure of the asymptotic sizes of the test statistics of H20 from the nominal level depends on hk(t,z) in a complicated manner.

5.2. Numerical assessment of the tests under misspecified models

This subsection evaluates robustness of the proposed test procedures to misspecifications of rk(w) and/or gk(a|t,v,z), and to violation of the MAR assumption. The Zki, (Tki, Vki) and Cki are generated by using the same models as above, again with approximately 30% censoring.

Robustness of the tests to misspecification of rk(w) is examined by assuming model (13) whereas the actual complete-case indicator Rki is generated with the conditional probability rk(Wki)=P(Rki=1|δki=1,Wki), where

logit{rk(Wki)}=1.1+ZkiTki,i=1,,nk.

(17)

This model yields approximately 50% missing marks among observed failures under models M1–M5.

Robustness of the tests is also examined when gk(a|t,v,z) is misspecified. This is carried out by assuming model (14) for the auxiliary mark, or, equivalently, model (15) for gk(a|t,v,z), whereas the actual mark for δki=1 is generated from

Aki=(1.4+2τ)1(Vki+0.4Uki+2Xki),

(18)

for i=1,,nk. Here Uki is uniformly distributed on [0,1] and is independent of Vki.

Robustness of the tests to violation of the MAR assumption (2) is examined by assuming model (13), whereas the actual Rki depends on Vki through the model

The proportion of missing marks among the observed failures is kept around 50% in all scenarios.

Models (17), (18) and (19) are similar to those used in Sun and Gilbert (2012) for examining robustness of the AIPW estimator. However, instead of examining biases and standard errors of the estimators, here we check whether the empirical sizes of the tests are close to their nominal level 0.05 and how the powers of the tests are affected by these misspecifications. For sample size n = 500 and bandwidths h1=0.1 and h=h2=0.20, Table 4 shows the empirical sizes and powers of the tests of H10 and Table 5 shows the empirical sizes and powers of the tests of H20. In both Table 4 and Table 5, the first block shows the results when rk(w) is misspecified following model (17) and gk(a|t,v,z) is correctly specified by model (15) with θ = 0.4; the second block shows the results when gk(a|t,v,z) is misspecified following model (18) and rk(w) is correctly specified by model (13) with ψk1=0.2 and ψk1=−0.2; the third block shows the results when rk(w) is misspecified following model (17) and gk(a|t,v,z) is misspecified following model (18); and the fourth block shows the results when rk(w) depends on Vki following model (19) and gk(a|t,v,z) is correctly specified by model (15) with θ = 0.4.

Table 4

Open in new tab

Robustness of the tests for H10

Model(α,β,γ)Size/power
Ta1(1)Ta2(1)Tm1(1)Tm2(1)
rk(w) is misspecified
M1(0,0,0.3)4.25.23.64.2
M3(−0.6,0.6,0.3)62.074.474.081.8
M4(−1.2,1.2,0.3)99.699.899.899.8
M2(−0.69,0,0.3)100100100100
gk()a|t,v,z()is misspecified
M1(0,0,0.3)3.44.25.84.6
M3(−0.6,0.6,0.3)59.664.472.874.4
M4(−1.2,1.2,0.3)99.299.499.699.6
M2(−0.69,0,0.3)10099.810099.8
rk(w) andgk()a|t,v,z()are misspecified
M1(0,0,0.3)4.04.03.83.4
M3(−0.6,0.6,0.3)61.861.871.873.8
M4(−1.2,1.2,0.3)99.698.699.899.8
M2(−0.69,0,0.3)100100100100
MAR assumption is violated
M1(0,0,0.3)3.43.83.65.0
M3(−0.6,0.6,0.3)60.667.073.077.8
M4(−1.2,1.2,0.3)99.299.699.899.6
M2(−0.69,0,0.3)100100100100
Model(α,β,γ)Size/power
Ta1(1)Ta2(1)Tm1(1)Tm2(1)
rk(w) is misspecified
M1(0,0,0.3)4.25.23.64.2
M3(−0.6,0.6,0.3)62.074.474.081.8
M4(−1.2,1.2,0.3)99.699.899.899.8
M2(−0.69,0,0.3)100100100100
gk()a|t,v,z()is misspecified
M1(0,0,0.3)3.44.25.84.6
M3(−0.6,0.6,0.3)59.664.472.874.4
M4(−1.2,1.2,0.3)99.299.499.699.6
M2(−0.69,0,0.3)10099.810099.8
rk(w) andgk()a|t,v,z()are misspecified
M1(0,0,0.3)4.04.03.83.4
M3(−0.6,0.6,0.3)61.861.871.873.8
M4(−1.2,1.2,0.3)99.698.699.899.8
M2(−0.69,0,0.3)100100100100
MAR assumption is violated
M1(0,0,0.3)3.43.83.65.0
M3(−0.6,0.6,0.3)60.667.073.077.8
M4(−1.2,1.2,0.3)99.299.699.899.6
M2(−0.69,0,0.3)100100100100

Empirical sizes and powers of the tests Ta1(1), Ta2(1), Tm1(1) and Tm2(1) for testing H10 at the nominal level 0.05 for n = 500 and h = 0.2 when 50% of the marks are missing. The bandwidths are h1=0.1 and h2=h. Each entry is based on 500 Gaussian multipliers samples and 500 repetitions.

Table 4

Open in new tab

Robustness of the tests for H10

Model(α,β,γ)Size/power
Ta1(1)Ta2(1)Tm1(1)Tm2(1)
rk(w) is misspecified
M1(0,0,0.3)4.25.23.64.2
M3(−0.6,0.6,0.3)62.074.474.081.8
M4(−1.2,1.2,0.3)99.699.899.899.8
M2(−0.69,0,0.3)100100100100
gk()a|t,v,z()is misspecified
M1(0,0,0.3)3.44.25.84.6
M3(−0.6,0.6,0.3)59.664.472.874.4
M4(−1.2,1.2,0.3)99.299.499.699.6
M2(−0.69,0,0.3)10099.810099.8
rk(w) andgk()a|t,v,z()are misspecified
M1(0,0,0.3)4.04.03.83.4
M3(−0.6,0.6,0.3)61.861.871.873.8
M4(−1.2,1.2,0.3)99.698.699.899.8
M2(−0.69,0,0.3)100100100100
MAR assumption is violated
M1(0,0,0.3)3.43.83.65.0
M3(−0.6,0.6,0.3)60.667.073.077.8
M4(−1.2,1.2,0.3)99.299.699.899.6
M2(−0.69,0,0.3)100100100100
Model(α,β,γ)Size/power
Ta1(1)Ta2(1)Tm1(1)Tm2(1)
rk(w) is misspecified
M1(0,0,0.3)4.25.23.64.2
M3(−0.6,0.6,0.3)62.074.474.081.8
M4(−1.2,1.2,0.3)99.699.899.899.8
M2(−0.69,0,0.3)100100100100
gk()a|t,v,z()is misspecified
M1(0,0,0.3)3.44.25.84.6
M3(−0.6,0.6,0.3)59.664.472.874.4
M4(−1.2,1.2,0.3)99.299.499.699.6
M2(−0.69,0,0.3)10099.810099.8
rk(w) andgk()a|t,v,z()are misspecified
M1(0,0,0.3)4.04.03.83.4
M3(−0.6,0.6,0.3)61.861.871.873.8
M4(−1.2,1.2,0.3)99.698.699.899.8
M2(−0.69,0,0.3)100100100100
MAR assumption is violated
M1(0,0,0.3)3.43.83.65.0
M3(−0.6,0.6,0.3)60.667.073.077.8
M4(−1.2,1.2,0.3)99.299.699.899.6
M2(−0.69,0,0.3)100100100100

Empirical sizes and powers of the tests Ta1(1), Ta2(1), Tm1(1) and Tm2(1) for testing H10 at the nominal level 0.05 for n = 500 and h = 0.2 when 50% of the marks are missing. The bandwidths are h1=0.1 and h2=h. Each entry is based on 500 Gaussian multipliers samples and 500 repetitions.

Table 5

Open in new tab

Robustness of the tests for H20

Model(α,β,γ)Size/power
Ta1(2)Ta2(2)Tm1(2)Tm2(2)
rk(w) is misspecified
M2(−0.69,0,0.3)5.03.85.46.2
M3(−0.6,0.6,0.3)24.025.234.236.0
M4(−1.2,1.2,0.3)60.866.672.878.4
M5(−1.5,1.5,0.3)76.682.085.888.8
gk()a|t,v,z()is misspecified
M2(−0.69,0,0.3)4.86.66.05.8
M3(−0.6,0.6,0.3)17.218.028.428.2
M4(−1.2,1.2,0.3)44.847.256.461.0
M5(−1.5,1.5,0.3)58.060.468.673.2
rk(w) andgk()a|t,v,z()are misspecified
M2(−0.69,0,0.3)4.04.84.44.4
M3(−0.6,0.6,0.3)16.619.626.826.6
M4(−1.2,1.2,0.3)43.246.655.660.6
M5(−1.5,1.5,0.3)53.858.867.471.4
MAR assumption is violated
M2(−0.69,0,0.3)6.86.07.67.8
M3(−0.6,0.6,0.3)28.633.639.642.0
M4(−1.2,1.2,0.3)61.867.074.078.4
M5(−1.5,1.5,0.3)77.481.685.489.2
Model(α,β,γ)Size/power
Ta1(2)Ta2(2)Tm1(2)Tm2(2)
rk(w) is misspecified
M2(−0.69,0,0.3)5.03.85.46.2
M3(−0.6,0.6,0.3)24.025.234.236.0
M4(−1.2,1.2,0.3)60.866.672.878.4
M5(−1.5,1.5,0.3)76.682.085.888.8
gk()a|t,v,z()is misspecified
M2(−0.69,0,0.3)4.86.66.05.8
M3(−0.6,0.6,0.3)17.218.028.428.2
M4(−1.2,1.2,0.3)44.847.256.461.0
M5(−1.5,1.5,0.3)58.060.468.673.2
rk(w) andgk()a|t,v,z()are misspecified
M2(−0.69,0,0.3)4.04.84.44.4
M3(−0.6,0.6,0.3)16.619.626.826.6
M4(−1.2,1.2,0.3)43.246.655.660.6
M5(−1.5,1.5,0.3)53.858.867.471.4
MAR assumption is violated
M2(−0.69,0,0.3)6.86.07.67.8
M3(−0.6,0.6,0.3)28.633.639.642.0
M4(−1.2,1.2,0.3)61.867.074.078.4
M5(−1.5,1.5,0.3)77.481.685.489.2

Empirical sizes and powers of the tests Ta1(2), Ta2(2), Tm1(2) and Tm2(2) for testing H20 at the nominal level 0.05 for n = 500 and h = 0.2 when 50% of the marks are missing. The bandwidths are h1=0.1 and h2=h. Each entry is based on 500 Gaussian multipliers samples and 500 repetitions.

Table 5

Open in new tab

Robustness of the tests for H20

Model(α,β,γ)Size/power
Ta1(2)Ta2(2)Tm1(2)Tm2(2)
rk(w) is misspecified
M2(−0.69,0,0.3)5.03.85.46.2
M3(−0.6,0.6,0.3)24.025.234.236.0
M4(−1.2,1.2,0.3)60.866.672.878.4
M5(−1.5,1.5,0.3)76.682.085.888.8
gk()a|t,v,z()is misspecified
M2(−0.69,0,0.3)4.86.66.05.8
M3(−0.6,0.6,0.3)17.218.028.428.2
M4(−1.2,1.2,0.3)44.847.256.461.0
M5(−1.5,1.5,0.3)58.060.468.673.2
rk(w) andgk()a|t,v,z()are misspecified
M2(−0.69,0,0.3)4.04.84.44.4
M3(−0.6,0.6,0.3)16.619.626.826.6
M4(−1.2,1.2,0.3)43.246.655.660.6
M5(−1.5,1.5,0.3)53.858.867.471.4
MAR assumption is violated
M2(−0.69,0,0.3)6.86.07.67.8
M3(−0.6,0.6,0.3)28.633.639.642.0
M4(−1.2,1.2,0.3)61.867.074.078.4
M5(−1.5,1.5,0.3)77.481.685.489.2
Model(α,β,γ)Size/power
Ta1(2)Ta2(2)Tm1(2)Tm2(2)
rk(w) is misspecified
M2(−0.69,0,0.3)5.03.85.46.2
M3(−0.6,0.6,0.3)24.025.234.236.0
M4(−1.2,1.2,0.3)60.866.672.878.4
M5(−1.5,1.5,0.3)76.682.085.888.8
gk()a|t,v,z()is misspecified
M2(−0.69,0,0.3)4.86.66.05.8
M3(−0.6,0.6,0.3)17.218.028.428.2
M4(−1.2,1.2,0.3)44.847.256.461.0
M5(−1.5,1.5,0.3)58.060.468.673.2
rk(w) andgk()a|t,v,z()are misspecified
M2(−0.69,0,0.3)4.04.84.44.4
M3(−0.6,0.6,0.3)16.619.626.826.6
M4(−1.2,1.2,0.3)43.246.655.660.6
M5(−1.5,1.5,0.3)53.858.867.471.4
MAR assumption is violated
M2(−0.69,0,0.3)6.86.07.67.8
M3(−0.6,0.6,0.3)28.633.639.642.0
M4(−1.2,1.2,0.3)61.867.074.078.4
M5(−1.5,1.5,0.3)77.481.685.489.2

Empirical sizes and powers of the tests Ta1(2), Ta2(2), Tm1(2) and Tm2(2) for testing H20 at the nominal level 0.05 for n = 500 and h = 0.2 when 50% of the marks are missing. The bandwidths are h1=0.1 and h2=h. Each entry is based on 500 Gaussian multipliers samples and 500 repetitions.

Tables 4 and 5 show that the empirical sizes of the tests are very close to the nominal level 0.05 when one of rk(w) and gk(a|t,v,z) is misspecified, reflecting the double-robustness property of the AIPW estimator. The empirical sizes are also close to 0.05 when both rk(w) and gk(a|t,v,z) are misspecified and when the MAR assumption is violated, which is intriguing. When only rk(w) is misspecified and MAR holds, the empirical powers in Tables 4 and 5 closely track the corresponding powers in Tables 1 and 2 under correct model specifications. The empirical powers are lower than those observed in Table 1 and 2 when gk(a|t,v,z) is misspecified or when both rk(w) and gk(a|t,v,z) are misspecified, whereas the empirical powers in Tables 4 and 5 are very close to those in Tables 1 and 2 when MAR is violated. Apparently for our particular data simulation, the bias due to the MAR violation counterbalances the bias due to misspecification of both rk(w) and gk(a|t,v,z); however, in general these violations could distort sizes and powers.

5.3. Simulation study for the Thai trial

We conduct a simulation of the Thai trial, to gain insight about the power that is available for this real trial. Specifically, we simulated data to yield about the numbers of infections observed (74 in the placebo group and 51 in the vaccine group), the overall vaccine efficacy from the proportional hazards model is about 31%, and the true VE(v) curve decreases with v to be around 65–70% for v close to 0 and around 0% for v close to 1. The actual infection rate was only 0.3% over 3.5 years; to speed the simulations we use a 20% placebo infection rate and retain 74 infections on average.

Again with K = 1 stratum, the (Tki, Vki) are generated from the model

λ(t,v|z)=γexp{(α+βv)z},t0,0v1,

(20)

where α, β and γ are constants. Under model (20), VE(v)=1− exp (α+βv), the marginal hazards are λ0(t)=γ for z = 0 and λ1(t)=γ exp (α){ exp (β)−1}/β for z = 1, and the Cox proportional hazards vaccine efficacy is VEC=1−λ1(t)/λ0(t)=1− exp (α){ exp (β)−1}/β. We choose (α,β,γ)=(−1.1,1.3,0.068), yielding VEC=0.32, VE(0)=0.67 and VE(0.85)=0. We study 400 subjects each in the vaccine and placebo groups. Matching the actual trial, the censoring rate before τ is kept very low, just under 5%. The missing mark indicator is generated from model (13), with (ψk0, ψk1) set to yield about 0%, 25% (−1.2, −0.2), 50% (0.2, −0.2) and 75% (−1.0, −0.2) missing marks among observed failures. We assume that the auxiliary variable Aki follows model (14) given in Section 5.1, where the θ-values of ∞, 0.8, 0.4 and 0.2 correspond to ρ = 0, 0.78, 0.92, 0.98 for the correlation coefficient between Aki and Vki.

Because of lost information on the mark, we choose larger bandwidths for higher percentages of missing marks. We use h = 0.4 for the case with 75% missing marks, h = 0.3 for the case with 50% missing marks, h = 0.2 for the case with 25% missing marks and h = 0.15 for the case with 0% missing marks. The bandwidths h1 and h2 in equation (7) in the estimation of λ^0kipw(t,v) are taken to be 0.50 and h2=h in each case. Powers of the proposed tests Ta1(1), Ta2(1), Tm1(1), Tm2(1), Ta1(2), Ta1(2), Tm1(2) and Tm2(2) for the simulations based on the Thai trial at the nominal level 0.05 are reported in Table 6. The tests show similar performance as was found in the simulation study of Section 5.1. As only 10% of infected subjects had missing marks in the RV144 trial and the auxiliary was very weakly predictive, we focus on the entries with 0% or 25% missing marks and ρ = 0. There is 67–95% power to reject H10, and 33–60% power to reject H20. These results show that a fairly strong sieve effect with VE(v) declining from 67% to 0% could readily be missed in the Thai trial because of limited power. The only slightly improved power with an excellent auxiliary ρ = 0.98 shows that greater numbers of events would be needed to achieve high power for testing H20.

Table 6

Open in new tab

Power of the tests T1(1), T2(1), T1(1), T2(1), T1(2), T1(2), T1(2) and T2(2) for the Thai trial at the nominal level 0.05

ρ% missing markshPower
TestingH10TestingH20
Ta1(1)Ta2(1)Tm1(1)Tm2(1)Ta1(2)Ta2(2)Tm1(2)Tm2(2)
00.157785869548485960
0250.26776798536335047
500.36371718229273742
750.44151595821183531
0.78250.26779828936394650
500.36071748428284139
750.44953636525253434
0.92250.27080849137415056
500.36171738735395051
750.45458627130334044
0.98250.27181829139475355
500.36676758644425052
750.45666687641435149
ρ% missing markshPower
TestingH10TestingH20
Ta1(1)Ta2(1)Tm1(1)Tm2(1)Ta1(2)Ta2(2)Tm1(2)Tm2(2)
00.157785869548485960
0250.26776798536335047
500.36371718229273742
750.44151595821183531
0.78250.26779828936394650
500.36071748428284139
750.44953636525253434
0.92250.27080849137415056
500.36171738735395051
750.45458627130334044
0.98250.27181829139475355
500.36676758644425052
750.45666687641435149

Each entry is based on 100 Gaussian multipliers samples and 100 repetitions.

Table 6

Open in new tab

Power of the tests T1(1), T2(1), T1(1), T2(1), T1(2), T1(2), T1(2) and T2(2) for the Thai trial at the nominal level 0.05

ρ% missing markshPower
TestingH10TestingH20
Ta1(1)Ta2(1)Tm1(1)Tm2(1)Ta1(2)Ta2(2)Tm1(2)Tm2(2)
00.157785869548485960
0250.26776798536335047
500.36371718229273742
750.44151595821183531
0.78250.26779828936394650
500.36071748428284139
750.44953636525253434
0.92250.27080849137415056
500.36171738735395051
750.45458627130334044
0.98250.27181829139475355
500.36676758644425052
750.45666687641435149
ρ% missing markshPower
TestingH10TestingH20
Ta1(1)Ta2(1)Tm1(1)Tm2(1)Ta1(2)Ta2(2)Tm1(2)Tm2(2)
00.157785869548485960
0250.26776798536335047
500.36371718229273742
750.44151595821183531
0.78250.26779828936394650
500.36071748428284139
750.44953636525253434
0.92250.27080849137415056
500.36171738735395051
750.45458627130334044
0.98250.27181829139475355
500.36676758644425052
750.45666687641435149

Each entry is based on 100 Gaussian multipliers samples and 100 repetitions.

6. Analysis of the RV144 Thai trial

In the RV144 Thai trial, 125 subjects (51 of 8197 in the vaccine group and 74 of 8198 in the placebo group) were diagnosed with HIV infection over a 42-month follow-up period, from whom full length HIV genomes were measured from 121; three missed data because their HIV viral load was too low for the Sanger sequencing technology to work, and one dropped out (Rerks-Ngarm et al., 2009; Rolland et al., 2012). We focus on the gp120 region of the HIV Env protein, because this region stimulates anti-HIV antibody responses which are the putative cause of the observed partial vaccine efficacy. Three gp120 sequences were included in the vaccine: 92TH023 in the ALVAC canarypox vector prime component, and CM244, MN in the AIDSVAX gp120 protein boost component. 92TH023 and CM244 are subtype E HIVs whereas MN is subtype B, and 110 of the 121 subjects were infected with subtype E sequences. The subtype E vaccine insert sequences are much closer genetically to the infecting (and regional circulating) sequences than MN and thus are more likely to stimulate protective immune responses. Accordingly, the analysis focuses on the 92TH023 and CM244 reference sequences, and right-censors the 15 subjects who were HIV infected with subtype B or with unknown subtype. One subject who acquired HIV infection during the trial was documented to have acquired HIV from another trial participant who had previously become HIV infected; the analysis excludes this subject because his or her inclusion would violate the independent observations assumption. In the context of our model set-up, T is the time to diagnosis of HIV infection with subtype E HIV. The time to diagnosis of HIV infection with subtype B or with unknown HIV subtype is treated as censoring.

We define V based on HIV sequence data measured from a blood sample drawn at or before the date of HIV diagnosis. (The trial documented acute phase or preseroconversion infection in only a few subjects, prohibiting defining the mark based on acute phase sequences.) 11 of the 109 (11%) infected subjects have sequences measured from a post-diagnosis sample and hence are missing V. To maximize biological relevance and statistical power, we restrict the gp120 distances to the published set of gp120 sites in contact with known broadly neutralizing monoclonal antibodies (Moore et al., 2009; Wei et al., 2003). For each HIV sequence from a subject and each of the two reference vaccine sequences, V is computed as a weighted Hamming distance by using the point accepted mutation between scoring matrix (Nickle et al., 2007). Between two and 13 sequences (a total of 1030 sequences) were measured per infected subject, and V is defined as the subject's sequence closest to his or her consensus sequence (the consensus sequence is comprised of the majority amino acids at each site, one site at a time). Finally, the distances are rescaled to values between 0 and 1.

In total, 109 infected subjects (43 vaccine; 66 placebo) are included in the analysis, of which 98 (39 vaccine; 59 placebo) have an observed mark V; Fig. 1 displays the observed Vs.

Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (3)

Fig. 1.

Scatter plots of the marks V versus the HIV-infection time T for the 98 HIV-infected subjects in the Thai trial with an observed mark: the mark V is the HIV-specific point accepted mutation matrix (Nickle et al., 2007) weighted Hamming distances between a subject's HIV envelope gp120 amino acid sequence (nearest to his or her consensus sequence) and (a) the 92TH023 or (b) CM244 vaccine reference sequence; the distances restrict to the 172 amino acid sites in gp120 documented to contact broadly neutralizing monoclonal antibodies (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (4), vaccine LOWESS smooth fit (Cleveland, 1979); Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (5), placebo LOWESS smooth fit)

Open in new tabDownload slide

To predict the probability of observing V among the 109 infected subjects, we use all-subsets logistic regression model selection considering demographics, host genetics and biomarker data post infection. The best model by the Bayesian information criterion BIC includes only the years from entry until diagnosis of HIV infection (X1), with model fit logit{P^(R=1|δ=1,X1)}=1.17+0.70X1 for the CM244 reference sequence. The model was very similar for the 92TH023 reference sequence (which is not shown). In addition, we consider linear and logistic regression models for relating the mean of various potential auxiliary variables (A) to V, X1 and treatment indicator Z. Model selection did not reveal any significantly predictive auxiliary variables; we expect that HIV sequence information measured after V has been defined would be a good predictor, but these data were not collected. Nevertheless, to implement the AIPW method we select the best available auxiliary variable, gender (A = X2; 1≡male; 0≡female) and use the logistic regression model that results; for CM244 the fitted model g^(A=a|V,X1,Z) is logit{P^(X2=1|δ=1,V,X1,Z)}=0.240.33V+0.16X1+0.38Z, and the model was very similar for 92TH023 (which is not shown).

The AIPW estimation and testing procedures are applied to the Thai trial data set with bandwidths h1=0.5 and h2=h=0.3, a = 0.05, b = 1 and a=a+0.01 (a and a are near the minimum observed marks). As in the simulation study, 500 simulated Gaussian multipliers are used. Because the results are nearly identical with and without the auxiliary variable, only the latter results are presented. Fig. 2 shows the estimated VE(v) along with 95% pointwise confidence bands, indicating that vaccine efficacy appears to be high against HIVs near the 92TH023 reference sequence (estimated VE(0.01) = 56%) and declines to 0 against HIVs farthest from the 92TH023 reference sequence (estimated VE(1.0) = 2.4%). The decline is similar for the CM244 reference sequence, with estimated VE(0.01) = 45% and estimated VE(0.95) = −9.1%.

Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (6)

Fig. 2.

AIPW estimation of VE(v) and 95% pointwise confidence bands without using auxiliary variables for the Thai trial with bandwidths h1=0.5 and h2=h=0.3 for the monoclonal antibody contact site distances to (a) the 92TH023 and (b) CM244 reference sequences

Open in new tabDownload slide

Figs 3(a) and 3(b) show the test processes Q(1)(v) versus 20 realizations from the Gaussian multiplier process WB1*(v) given the observed data, and Figs 3(c) and 3(d) show the parallel results for the test process Q(2)(v), each suggesting departures from the null hypothesis H10 and from the null hypothesis H20 for each reference sequence. The p-values of the tests based on the test statistics Tm1(1) and Tm2(1) for testing H10 against the monotone alternative over v ∈ [0,1] are 0.032 and 0.008 for 92TH023, and 0.014 and 0.010 for CM244. The p-values of the test statistics Ta1(1) and Ta2(1) for testing H10 against the general alternative are 0.054 and 0.018 for 92TH023 and 0.030 and 0.010 for CM244. For testing H20 over v ∈ [0,1], the p-values of the supremum-type tests based on the test statistics Ta1(2) and Tm1(2) are 0.53 and 0.27 for 92TH023 and 0.37 and 0.18 for CM244. The p-values of the integrated square type of tests based on the test statistics Ta2(2) and Tm2(2) are 0.35 and 0.14 for 92TH023 and 0.44 and 0.19 for CM244.

Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (7)

Fig. 3.

Diagnostic plots of the test processes for the Thai trial data set with bandwidths h1=0.5, h2=h=0.3 and a = 0.05, b = 1 and a=a+0.01: (a) Q(1)(v) (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (8)) versus 20 realizations (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (9)) from the Gaussian multiplier process WB1*(v) (92TH023, CM244 reference; H10); (b) Q(1)(v) (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (10)) versus 20 realizations (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (11)) from the Gaussian multiplier process WB1*(v) (92TH023, CM244 reference; H20); (c) Q(2)(v) (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (12)) versus 20 realizations (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (13)) from the Gaussian multiplier process Γ(v,WB1*) (92TH023, CM244 reference; H10): (d) Q(2)(v) (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (14)) versus 20 realizations (Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (15)) from the Gaussian multiplier process Γ(v,WB1*) (92TH023, CM244 reference, H20)

Open in new tabDownload slide

These analyses provide more evidence that the vaccine had some protective efficacy than the original primary analysis that did not account for the mark information (Rerks-Ngarm et al., 2009): the primary analysis test for any vaccine efficacy yielded p = 0.04 whereas the tests for any vaccine efficacy against any mark reported here yielded a median p-value of 0.016 across the four test statistics and two reference sequences. The analyses also showed a non-significant trend (p-values around 0.14–0.19) that the vaccine protected better against HIVs closely matched to the vaccine strain HIVs in the monoclonal antibody contact sites but had less or absent protection against HIVs with many mismatches in these sites. Although the levels of significance are not compelling, the simulation study presented in Section 5.5.3 of the power available for detecting a vaccine sieve effect in the Thai trial showed that the study is well powered only to detect large sieve effects (with greater decline of VE(v) in v than was observed in the estimated VE(v) curves); thus a moderate-to-large sieve effect is consistent with the observed results. These results may guide future vaccine research by suggesting modifications of future vaccine candidates to include HIV sequences more closely matched to circulating HIVs in the monoclonal antibody contact sites. They may also motivate the design of future experiments to understand functional effects of amino acid mutations at the monoclonal antibody contact sites.

Acknowledgements

The authors thank Hasan Ahmed and Paul Edlefsen for generating the HIV sequence distances, and they thank the participants, investigators and sponsors of the RV144 Thai trial, including the US Military HIV Research Program, US Army Medical Research and Materiel Command, National Institute of Allergy and Infectious Diseases US and Thai Components, Armed Forces Research Institute of Medical Science Ministry of Public Health, Thailand, Mahidol University, SanofiPasteur and Global Solutions for Infectious Diseases. The authors thank the Joint Editor, Associate Editor and two referees for their helpful suggestions. The research of Yanqing Sun was partially supported by National Science Foundation grants DMS-0905777 and DMS-1208978, and the research of Dr Sun and Dr Peter Gilbert was partially supported by National Institutes of Health National Institute of Allergy and Infectious Diseases grant R37AI054165. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

References

Aalen

,

O. O.

and

Johansen

,

S.

(

1978

)

An empirical transition matrix for non-hom*ogeneous Markov chains based on censored observations

.

Scand. J. Statist.

,

5

,

141

150

.

Google Scholar

OpenURL Placeholder Text

Cai

,

Z.

,

Fan

,

J.

and

Li

,

R.

(

2000

)

Efficient estimation and inferences for varying-coefficient models

.

J. Am. Statist. Ass.

,

95

,

888

902

.

Cleveland

,

W. S.

(

1979

)

Robust locally weighted regression and smoothing scatterplots

.

J. Am. Statist. Ass.

,

74

,

829

836

.

Efron

,

B.

and

Tibshirani

,

R. J.

(

1993

)

An Introduction to the Bootstrap

.

New York

:

Chapman and Hall

.

Fauci

,

A. S.

,

Johnston

,

M. I.

,

Dieffenbach

,

C. W.

,

Burton

,

D. R.

,

Hammer

,

S. M.

,

Hoxie

,

J. A.

,

Martin

,

M.

,

Overbaugh

,

J.

,

Watkins

,

D. I.

,

Mahmoud

,

A.

and

Greene

,

W. C.

(

2008

)

HIV vaccine research: the way forward

.

Science

,

321

,

530

532

.

Gilbert

,

P. B.

,

Berger

,

J. O.

,

Stablein

,

D.

,

Becker

,

S.

,

Essex

,

M.

,

Hammer

,

S. M.

,

Kim

,

J. H.

and

Degruttola

,

V. G.

(

2011

)

Statistical interpretation of the RV144 HIV vaccine efficacy trial in Thailand: a case study for statistical issues in efficacy trials

.

J. Infect. Dis.

,

203

,

969

975

.

Gilbert

,

P. B.

,

Lele

,

S.

and

Vardi

,

Y.

(

1999

)

Maximum likelihood estimation in semiparametric selection bias models with application to AIDS vaccine trials

.

Biometrika

,

86

,

27

43

.

Gilbert

,

P. B.

,

McKeague

,

I. W.

and

Sun

,

Y.

(

2008

)

The two-sample problem for failure rates depending on a continuous mark: an application to vaccine efficacy

.

Biostatistics

,

9

,

263

276

.

Gilbert

,

P. B.

,

Self

,

S. G.

and

Ashby

,

M. A.

(

1998

)

Statistical methods for assessing differential vaccine protection against human immunodeficiency virus types

.

Biometrics

,

54

,

799

814

.

Hoover

,

D. R.

,

Rice

,

J. A.

,

Wu

,

C. O.

and

Yang

,

P.-L.

(

1998

)

Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data

.

Biometrika

,

85

,

809

822

.

Lin

,

D. Y.

,

Wei

,

L. J.

and

Ying

,

Z.

(

1993

)

Checking the Cox model with cumulative sums of martingale-based residuals

.

Biometrika

,

80

,

557

572

.

Moore

,

P. L.

,

Ranchobe

,

N.

,

Lambson

,

B. E.

,

Gray

,

E. S.

,

Cave

,

E.

,

Abrahams

,

M.-R.

,

Bandawe

,

G.

,

Mlisana

,

K.

,

Abdool Karim

,

S. S.

,

Williamson

,

C.

,

Morris

,

L.

,

the CAPRISA 002 Study and the NIAID Center for HIV/AIDS Vaccine Immunology (CHAVI)

(

2009

)

Limited neutralizing antibody specificities drive neutralization escape in early HIV-1 subtype C infection

.

PLOS Path.

,

5

, article e1000598.

Google Scholar

OpenURL Placeholder Text

Nickle

,

D. C.

,

Heath

,

L.

,

Jensen

,

M. A.

,

Gilbert

,

P. B.

,

Mullins

,

J. I.

and

Kosakovsky Pond

,

S. L.

(

2007

)

HIV-specific probabilistic models of protein evolution

.

PLOS ONE

,

2

, no. 6, article e503.

Google Scholar

OpenURL Placeholder Text

Prentice

,

R. L.

,

Kalbfleisch

,

J. D.

,

Peterson, Jr

,

A. V.

,

Flournoy

,

N.

,

Farewell

,

V. T.

and

Breslow

,

N. E.

(

1978

)

The analysis of failure times in the presence of competing risks

.

Biometrics

,

34

,

541

554

.

Rerks-Ngarm

,

S.

,

Pitisuttithum

,

P.

,

Nitayaphan

,

S.

,

Kaewkungwal

,

J.

,

Chiu

,

J.

,

Paris

,

R.

,

Premsri

,

N.

,

Namwat

,

C.

,

de Souza

,

M.

,

Adams

,

E.

,

Benenson

,

M.

,

Gurunathan

,

S.

,

Tartaglia

,

J.

,

McNeil

,

J. G.

,

Francis

,

D. P.

,

Stablein

,

D.

,

Birx

,

D. L.

,

Chunsuttiwat

,

S.

,

Khamboonruang

,

C.

,

Thongcharoen

,

P.

,

Robb

,

M. L.

,

Michael

,

N. L.

,

Kunasol

,

P.

and

Kim

,

J. H.

(

2009

)

Vaccination with ALVAC and AIDSVAX to prevent HIV-1 infection in Thailand

.

New Engl. J. Med.

,

361

,

2209

2220

.

Robins

,

J. M.

,

Rotnitzky

,

A.

and

Zhao

,

L. P.

(

1994

)

Estimation of regression coefficients when some regressors are not always observed

.

J. Am. Statist. Ass.

,

89

,

846

866

.

Rolland

,

M.

,

Edlefsen

,

P. T.

,

Larsen

,

B. B.

,

Tovanabutra

,

S.

,

Sanders-Buell

,

E.

,

Hertz

,

T.

,

deCamp

,

A. C.

,

Carrico

,

C.

,

Menis

,

S.

,

Magaret

,

C. A.

,

Ahmed

,

H.

,

Juraska

,

M.

,

Chen

,

L.

,

Konopa

,

P.

,

Nariya

,

S.

,

Stoddard

,

J. N.

,

Wong

,

K.

,

Zhao

,

H.

,

Deng

,

W.

,

Maust

,

B. S.

,

Bose

,

M.

,

Howell

,

S.

,

Bates

,

A.

,

Lazzaro

,

M.

,

O’Sullivan

,

A.

,

Lei

,

E.

,

Bradfield

,

A.

,

Ibitamuno

,

G.

,

Assawadarachai

,

V.

,

O’Connell

,

R. J.

,

deSouza

,

M. S.

,

Nitayaphan

,

S.

,

Rerks-Ngarm

,

S.

,

Robb

,

M. L.

,

McLellan

,

J. S.

,

Georgiev

,

I.

,

Kwong

,

P. D.

,

Carlson

,

J. M.

,

Michael

,

N. L.

,

Schief

,

W. R.

,

Gilbert

,

P. B.

,

Mullins

,

J. I.

and

Kim

,

J. H.

(

2012

)

Increased HIV-1 vaccine efficacy against viruses with genetic signatures in Env V2

.

Nature

,

490

,

417

420

.

Rolland

,

M.

,

Tovanabutra

,

S.

,

Decamp

,

A. C.

,

Frahm

,

N.

,

Gilbert

,

P. B.

,

Sanders-Buell

,

E.

,

Heath

,

L.

,

Magaret

,

C. A.

,

Bose

,

M.

,

Bradfield

,

A.

,

O’Sullivan

,

A.

,

Crossler

,

J.

,

Jones

,

T.

,

Nau

,

M.

,

Wong

,

K.

,

Zhao

,

H.

,

Raugi

,

D. N.

,

Sorensen

,

S.

,

Stoddard

,

J. N.

,

Maust

,

B. S.

,

Deng

,

W.

,

Hural

,

J.

,

Dubey

,

S.

,

Michael

,

N. L.

,

Shiver

,

J.

,

Corey

,

L.

,

Li

,

F.

,

Self

,

S. G.

,

Kim

,

J.

,

Buchbinder

,

S.

,

Casimiro

,

D. R.

,

Robertson

,

M. N.

,

Duerr

,

A.

,

McElrath

,

M. J.

,

McCutchan

,

F. E.

and

Mullins

,

J. I.

(

2011

)

Genetic impact of vaccination on breakthrough HIV-1 sequences from the STEP trial

.

Nat. Med.

,

17

,

366

371

.

Rubin

,

D. B.

(

1976

)

Inference and missing data

.

Biometrika

,

63

,

581

592

.

Stephens

,

M. A.

(

1974

)

Edf statistics for goodness of fit and some comparisons

.

J. Am. Statist. Ass.

,

69

,

730

737

.

Sun

,

Y.

and

Gilbert

,

P. B.

(

2012

)

Estimation of stratified mark-specific proportional hazards models with missing marks

.

Scand. J. Statist.

,

39

,

34

52

.

Sun

,

Y.

,

Gilbert

,

P. B.

and

McKeague

,

I. W.

(

2009

)

Proportional hazards models with continuous marks

.

Ann. Statist.

,

37

,

394

426

.

Sun

,

Y.

and

Wu

,

H.

(

2005

)

Semiparametric time-varying coefficients regression model for longitudinal data

.

Scand. J. Statist.

,

32

,

21

47

.

Tian

,

L.

,

Zucker

,

D.

and

Wei

,

L. J.

(

2005

)

On the Cox model with time-varying regression coefficients

.

J. Am. Statist. Ass.

,

100

,

172

183

.

Wei

,

X.

,

Decker

,

J. M.

,

Wang

,

S.

,

Hui

,

H.

,

Kappes

,

J. C.

,

Wu

,

X.

,

Salazar-Gonzalez

,

J. F.

,

Salazar

,

M. G.

,

Kilby

,

J. M.

,

Saag

,

M. S.

,

Komarova

,

N. L.

,

Nowak

,

M. A.

,

Hahn

,

B.

,

Kwong

,

P. D.

and

Shaw

,

G. M.

(

2003

)

Antibody neutralization and escape by HIV-1

.

Nature

,

422

,

307

312

.

Appendix A: symptotic results

The following regularity conditions from Sun and Gilbert (2012) are assumed.

Condition 1.

  • (a)

    β(v) has componentwise continuous second derivatives on [0,1]. For each k = 1,…,K, the second partial derivative of λ0k(t,v) with respect to v exists and is continuous on [0,τ]×[0,1]. The covariate process Zk(t) has paths that are left continuous and of bounded variation, and satisfies the moment condition

    E[Zk(t)4exp{2MZk(t)}]<,

    where M is a constant such that (v,β(v)) ∈ [0,1]×(−M,M)p for all v and A=maxk,l|akl| for a matrix A = (akl).

  • (b)

    Each component of sk(j)(t,θ) is continuous on [0,τ]×[−M,M]p, s~k(j)(t,θ,ψ) is continuous on [0,τ]×[−M,M]p×[−L,L]q for some M,L>0 and j = 0,1,2. supt[0,τ],θ[M,M]pSk(j)(t,θ)sk(j)(t,θ)=Op(n1/2) and supt[0,τ],θ[M,M]p,ψk[L,L]qS~k(j)(t,θ,ψk)s~k(j)(t,θ,ψ)=Op(n1/2).

  • (c)

    The limit pk=limnnk/n exists and 0<pk<∞. sk(0)(t,θ)>0 on [0,τ]×[−M,M]p and the matrix

    Σ(v)=k=1KpkΣk(v)

    is positive definite, where

    Σk(v)=k=1K0τIk{t,β(v)}λ0k(t,v)sk(0){t,β(v)}dt

    and Ik(t,β)=sk(2)(t,β)/sk(0)(t,β)z¯k(t,β)2.

  • (d)

    The kernel function K(·) is symmetric with support [−1,1] and of bounded variation. The bandwidth h satisfies nh2→∞ and nh4→0 as n→∞.

  • (e)

    There is a σ>0 such that rk(Wki)≥σ for all k and i with δki=1.

Let Ft=σ{I(Xkis,δki=1),I(Xkis,δki=0),VkiI(Xkis,δki=1),Zki(s);0st,i=1,,nk,k=1,,K} be the (right continuous) filtration generated by the full data processes {Nki(s,v),Yki(s),Zki(s);0st,0v1,i=1,,nk,k=1,,K}. Assume that E{Nki(dt,dv)|Ft}=E{Nki(dt,dv)|Yki(t),Zki(t)}, i.e. the mark-specific instantaneous failure rate at time t given the observed information up to time t depends on only the failure status and the current covariate value. By the definition of the conditional mark-specific hazard function, E{Nki(dt,dv)|Ft}=Yki(t)λk{t,v|Zki(t)}dtdv. Hence, the mark-specific intensity of Nki(t,v) with respect to Ft equals Yki(t)λki{st,v|Zki(t)}. Let

Mki(t,u)=0t0u[Nki(ds,dx)Yki(s)λk{s,x|Zki(s)}dsdx].

By Aalen and Johansen (1978), Mki(·,v1) and Mki(·,v2)−Mki(·,v1) are orthogonal square integrable martingales with respect to Ft for any 0≤v1v2≤1.

The weak convergence of WB(v)=n1/2{B^aug(v)B(v)}n1/2{B^aug(a)B(a)} for v ∈ [a,b] is given in theorem 1 below.

Theorem 1. Under conditions 1–5, WB(v)=n1/2Σk=1KΣi=1nkHki(v)+op(1),uniformly in v ∈ [a,b], where

Hki(v)=av0τΣ(u)1[Zki(t)z¯k{t,β(u)}][Rkiπk(Qki)Mki(dt,du)n+{1Rkiπk(Qki)}E{Mki(dt,du)Qki}].

(21)

The processes WB(v) converge weakly to a p-dimensional mean 0 Gaussian process with continuous sample paths on v ∈ [a,b], where z¯k(t,β)=sk(1)(t,β)/sk(0)(t,β) and sk(j)(t,β)=E{Sk(j)(t,β)}.

Theorem (2) provides the basis for obtaining asymptotically correct critical values for the testing procedures for H10 and for H20. In particular, let G(v) be the limiting Gaussian process of WB1(v), v ∈ [a,b], as n→∞. Then, under H10,Q(1)(v)DG(v),v[a,b], as n→∞. By theorem 1 and the continuous mapping theorem, Ta1(1)Dsupv[a,b]|G(v)|, Ta2(1)DabG(v)2dvar{G(v)}, Tm1(1)Dinfv[a,b]G(v) and Tm2(1)DabG(v)dvar{G(v)} under H10 as n→∞. Under H20, Q(2)(v)=Γ(v,WB1)DΓ(v,G), v ∈ [a,b], as n→∞. Applying the continuous mapping theorem, under H20, Ta1(2)Dsupv[a,b]|Γ(v,G)|, Ta2(2)DabΓ(v,G)2dvar{G(v)}Tm1(2)Dinfv[a,b]Γ(v,G) and Tm2(2)DabΓ(v,G)dvar{G(v)}, as n→∞.

The proof of the consistency of the tests for testing H10 are straightforward. To show the consistency of the tests for testing H20, we note that the derivative dΓ(v,B1)/dv=(va)1{β1(v)(va)1B1(v)}0 under H2m with strict inequality for at least some v ∈ [a,b]. The function Γ(v,B1) is non-decreasing with Γ(b,B1)=0. We have, under H2m, Γ(v,B1)0 with strict inequality for at least some v ∈ [a,b]. Let v0 ∈ [a,b] be such that Γ(v0,B1)<0. Then Γ(v,B1)<0 for vv0. Now, defining vm*=sup{v:Γ(v,B1)<0,avb}, we have Γ(v,B1)<0 for v<vm* and Γ(v,B1)=0 for v*v<b. It follows from expression (11) and theorem 1 that Tm1(2)P and Tm2(2)P under H2m as n→∞ for a<vm*. Thus the tests based on Tm1(2) and Tm2(2) are consistent against H2m. Similarly, let va*=sup{v:supu[v,b]|Γ(u,B1)|>0,avb}. Then, under H2a, |Γ(v,B1)|>0 for v<va*, and |Γ(v,B1)|=0 for va*vb. Hence Ta1(2)P and Ta2(2)P under H2a as n→∞ for a<va*, resulting in the consistent tests against H2a.

We use the Gaussian multiplier resampling method (Lin et al., 1993) to approximate the distribution of WB(v), v ∈ [a,b]. Let {ξki,i=1,,nk,k=1,,K} be independent and identically distributed standard normal random variables. Replacing each term of equation (26), which is asymptotically equivalent to equation !(21), by its empirical counterpart and multiplying by ξki, we obtain WB*(v)=n1/2Σk=1KΣi=1nkξkiH^ki(v), where

H^ki(v)=010τH(v,u)[Zki(t)Z¯k{t,β^aug(u)}]{Rkiπk(Qki,ψ^k)Nki(dt,du)+{1Rkiπk(Qki,ψ^k)}Nkix(dt)d{ρ^kaug(Wki,u)}Yki(t)exp{β^aug(u)TZki(t)}Λ^0kaug(dt,du)},

(22)

where H(v,u)=avΣ^aug(x)1Kh(ux)dx.

Following an application of lemma 1 of Sun and Wu (2005), the distribution of WB(v), v ∈ [a,b], can be approximated by the conditional distribution of WB*(v), v ∈ [a,b], given the observed data sequence, which can be obtained through repeatedly generating independent sets of {ξki,i=1,,nk,k=1,,K}. Hence, the distribution of Q(1)(v), v ∈ [a,b], under H10, can be approximated by the conditional distribution of WB1*, v ∈ [a,b], given the observed data sequence. By the continuous mapping theorem, the distribution of Q(2)(v), v ∈ [a,b], under H20, can be approximated by the conditional distribution of Γ(v,WB1*), v ∈ [a,b], given the observed data sequence.

With the Gaussian multiplier method, the variance var{B^1aug(v)B^1aug(a)} can be consistently estimated by var^{B^1aug(v)B^1aug(a)}=n1var*{WB1*(v)}, where var*{WB1*(v)} is the first component on the diagonal of

cov*{WB*(v)}=cov{WB*(v)observeddata}=n1k=1Ki=1nk(010τH(v,u)[Zki(t)Z¯k{t,β^aug(u)}]×[Rkiπk(Qki,ψ^k)Nki(dt,du)+{1Rkiπk(Qki,ψ^k)}Nkix(dt)dρ^k(Wki,u)Yki(t)exp{β^aug(u)TZki(t)}Λ^0kaug(dt,du)])2.

(23)

A.1. Proof of theorem 1

Let

Aki(v)=010τKh(uv)[Zki(t)z¯k{t,β(u)}]Rkiπk(Qki)Mki(dt,du),ki(v)=010τKh(uv)[Zki(t)z¯k{t,β(u)}]{1Rkiπk(Qki)}E{Mki(dt,du)Qki}.

(24)

Following the proof of theorem 4 of Sun and Gilbert (2012) (their Web appendix (W.19)) and under nh4→0,

n1/2{β^aug(v)β(v)}=Σ(v)1n1/2k=1Ki=1nk{Aki(v)+Bki(v)}+op(1).

(25)

Hence

n1/2{B^aug(v)B(v)}=n1/2k=1Ki=1nk0vΣ(u)1{Aki(u)+Bki(u)}du+op(1),

which, by exchanging the order of integrations, equals

n1/2k=1Ki=1nk(010τ{0vKh(ux)Σ(x)1dx}[Zki(t)z¯k{t,β(u)}][Rkiπk(Qki)Mki(dt,du)+{1Rkiπk(Qki)}E{Mki(dt,du)Qki}]).

(26)

Let

η˜n(v)=n1/2k=1Ki=1nk0v0τ[Zki(t)Z¯k{t,β(u)}][Rkiπk(Qki)Mki(dt,du)+{1Rkiπk(Qki)}E{Mki(dt,du)Qki}].

It follows that

n1/2{B^aug(v)B(v)}=0vΣ(u)101Kh(xu)η˜n(dx)du+op(1)=01(0v,Σ,(u)1,Kh,(xu),d,u)η˜n(dx)+op(1).

(27)

Since the kernel function K(·) has compact support on [−1,1], equation (27) equals

hvh{0vΣ(u)1Kh(xu)du}η˜n(dx)hh{0vΣ(u)1Kh(xu)du}η˜n(dx)vhv+h{0vΣ(u)1Kh(xu)du}η˜n(dx)+op(1).

(28)

It can be shown that η~n(x) converges weakly to a mean 0 Gaussian process with continuous paths. Under assumption 4, 0vΣ(u)1Kh(xu)du has bounded variation and converges uniformly to Σ(x)−1 for x ∈ (h,vh). By lemma 2 of Gilbert et al. (2008), the first term in expression (28) is equal to 0vΣ(x)1η~n(dx)+op(1). Similar arguments lead to the second and the third terms in expression (28) to be op(1). Hence,

n1/2{B^aug(v)B(v)}=n1/2k=1Ki=1nk(0v0τΣ(u)1[Zki(t)z¯k{t,β(u)}][Rkiπk(Qki)Mki(dt,du)+{1Rkiπk(Qki)}E{Mki(dt,du)Qki}])+op(1),

which converges weakly to a p-dimensional mean 0 Gaussian process on v ∈ [a,b] with continuous sample paths by lemma 1 of Sun and Wu (2005). Theorem 1 follows since WB(v)=n1/2{B^aug(v)B(v)}n1/2{B^aug(a)B(a)} is a linear transformation of n1/2{B^aug(·)B(·)}.

© 2014 Royal Statistical Society

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Issue Section:

Original Article

Download all slides

Advertisem*nt

Citations

Views

124

Altmetric

More metrics information

Metrics

Total Views 124

94 Pageviews

30 PDF Downloads

Since 3/1/2023

Month: Total Views:
March 2023 5
April 2023 9
June 2023 5
July 2023 6
August 2023 5
September 2023 2
October 2023 4
November 2023 1
December 2023 10
January 2024 9
February 2024 27
March 2024 2
April 2024 10
May 2024 17
June 2024 12

Citations

Powered by Dimensions

11 Web of Science

Altmetrics

×

Email alerts

Article activity alert

Advance article alerts

New issue alert

Receive exclusive offers and updates from Oxford Academic

Citing articles via

Google Scholar

  • Latest

  • Most Read

  • Most Cited

Two-phase biomarker studies for disease progression with multiple registries
Bats monitoring: a classification procedure of bats behaviours based on Hawkes processes
Inverse set estimation and inversion of simultaneous confidence intervals
Exponential tilting for zero-inflated interval regression with applications to cyber security survey data
Dynamic factor models for binary data in circular spaces: an application to the US Supreme Court

More from Oxford Academic

Mathematics

Probability and Statistics

Science and Mathematics

Books

Journals

Advertisem*nt

Inferences on Relative Failure Rates in Stratified Mark-Specific Proportional Hazards Models with Missing Marks, with Application to Human Immunodeficiency Virus Vaccine Efficacy Trials (2024)

References

Top Articles
Latest Posts
Article information

Author: Francesca Jacobs Ret

Last Updated:

Views: 5746

Rating: 4.8 / 5 (68 voted)

Reviews: 91% of readers found this page helpful

Author information

Name: Francesca Jacobs Ret

Birthday: 1996-12-09

Address: Apt. 141 1406 Mitch Summit, New Teganshire, UT 82655-0699

Phone: +2296092334654

Job: Technology Architect

Hobby: Snowboarding, Scouting, Foreign language learning, Dowsing, Baton twirling, Sculpting, Cabaret

Introduction: My name is Francesca Jacobs Ret, I am a innocent, super, beautiful, charming, lucky, gentle, clever person who loves writing and wants to share my knowledge and understanding with you.