ur¨ um,1 Runze Li,2 Saul Shiffman,3 and Weixin Yao1 1 Department of Statistics, University of California, Riverside 2Department of Statistics and The Methodology Center, The Pennsylvania State University 3 Department of Psychology, University of Pittsburgh
one of the leading preventable causes of coronary heart disease (CHD) and cancer (U.S. Department of Health and Human Services, 1982, 1983, and 2004) Source: The Methodology Center, Penn State University.
one of the leading preventable causes of coronary heart disease (CHD) and cancer (U.S. Department of Health and Human Services, 1982, 1983, and 2004) 304 participants Smoking for at least 2 years with 10 cigarettes/day Source: The Methodology Center, Penn State University.
setting, such as coffee consumption, alcohol use, and presence of other smokers Current mood and urge to smoke Source: http://mhealth.jmir.org/2014/1/e4/
situations or contexts, such as alcohol consumption Alcohol consumption (Shiffman and Balabanis, 1995; Shiffman et al., 2002) increases the odds of smoking is associated with increased risk of lapsing back to smoking Source: The Methodology Center, Penn State University.
and urge to smoke controlling for confounders such as mood variables Primary interest: Estimate the time-varying association between alcohol use and urge to smoke controlling for the temporal patterns of mood variables
and urge to smoke controlling for confounders such as mood variables Primary interest: Estimate the time-varying association between alcohol use and urge to smoke controlling for the temporal patterns of mood variables Approach: Joint modeling of longitudinal binary and continuous responses 1 1K¨ ur¨ um, E., Li, R., Shiffman, S., and Yao, W. (2016). Time-varying coefficient models for joint modeling binary and continuous outcomes in longitudinal data. Statistica Sinica
a natural multivariate distribution Solution: Latent variable approach Introduce a continuous latent outcome underlying the binary outcome Assume that the latent variable and the continuous response follow a joint normal distribution
a natural multivariate distribution Solution: Latent variable approach Introduce a continuous latent outcome underlying the binary outcome Assume that the latent variable and the continuous response follow a joint normal distribution Factorize the joint model: a marginal model for the continuous variable and a conditional model for the binary variable given the continuous variable
Q(t) = 1 if Y (t) > 0 and Q(t) = 0 if Y (t) ≤ 0 Factorize the joint model—a marginal model for the continuous variable W (t) and a conditional model for Q(t) given W (t) f {q(t), w(t)} = fW {w(t)} f {q(t)|w(t)}
Applying the standard normal theory Y (t)|W (t) ∼ N µ(t), σ2 2 (t) 1 − τ2(t) where µ(t) = XT(t)α(t) + σ2 (t) σ1 (t) τ(t)ε1 (t) and ε1 (t) = W (t) − XT(t)β(t) (2)
0 and Q(t) = 0 if Y (t) ≤ 0 Y (t)|W (t) ∼ N µ(t), σ2 2 (t) 1 − τ2(t) The conditional model for Q(t) given W (t) is P {Q(t) = 1|W (t)} = Φ µ(t) σ2 2 (t) {1 − τ2(t)}
0 and Q(t) = 0 if Y (t) ≤ 0 Y (t)|W (t) ∼ N µ(t), σ2 2 (t) 1 − τ2(t) The conditional model for Q(t) given W (t) is P {Q(t) = 1|W (t)} = Φ µ(t) σ2 2 (t) {1 − τ2(t)} Reparameterize P {Q(t) = 1|W (t)} = Φ XT(t)α∗(t) + α∗ p+1 (t)ε1 (t) (3) where α∗(t) = α∗ 1 (t), . . . , α∗ p (t) T
outcome: Urge to smoke (Recorded on a scale ranging from 0 to 11) Predictors: Mood variables (Negative affect factor, arousal factor, attention disturbance factor) (Shiffman et al., 2002)
outcome: Urge to smoke (Recorded on a scale ranging from 0 to 11) Predictors: Mood variables (Negative affect factor, arousal factor, attention disturbance factor) (Shiffman et al., 2002) The Model: W (t) = β0 (t) + β1 (t)X1 (t) + β2 (t)X2 (t) + β3 (t)X3 (t) + ε1 (t) (6) where W (t) : The score of urge to smoke of the ith subject at time t X1 (t) : The centered score of negative affect of the ith subject at time t X2 (t) : The centered score of arousal of the ith subject at time t X3 (t) : The centered score of attention disturbance of the ith subject at time t
we fit the following generalized time-varying coefficient model: P {Q(t) = 1 | W (t)} = Φ α∗ 0 (t) + α∗ 1 (t)X1 (t) + α∗ 2 (t)X2 (t) + α∗ 3 (t)X3 (t) + α∗ 4 (t)e(t) where Q(t) : the alcohol usage of the ith subject at time t X1 (t) : the centered score of negative affect of the ith subject at time t X2 (t) : the centered score of arousal of the ith subject at time t X3 (t) : the centered score of attention disturbance of the ith subject at time t
-5 0 5 10 15 -0.4 -0.2 0.0 0.2 0.4 Days since quit smoking τ(T) Before quit day, the relationship is positive, i.e, increased drinking is associated with increased urge in smoking
time-varying association between longitudinal binary and continuous responses allow all parameters to be time-varying: response–predictor relationships, all associations, and variances be applied to longitudinal data sets with irregular time points
dimension and any type of outcome 2 3K¨ ur¨ um, E., Hughes, J., Li, R., and Shiffman, S. (2017). Time-varying copula models for longitudinal mixed data, Statistics and Its Interface
Drug Abuse/National Institute of Health grants P50-DA10075 and P50 DA039838, National Cancer Institute grant R01 CA168676, and National Science Foundation grants DMS-1512422 and DMS-1461677 The authors acknowledge the Research Computing and Cyberinfrastructure unit of Information Technology Services at The Pennsylvania State University for providing advanced computing resources and services that have contributed to the research. URL: http://rcc.its.psu.edu