Analyzing repeated measures semi-continuous data, with application to an alcohol dependence study

Lei Liu*, Robert L. Strawderman, Bankole A. Johnson, John M. O'Quigley

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

44 Scopus citations

Abstract

Two-part random effects models (Olsen and Schafer,1 Tooze et al.2) have been applied to repeated measures of semi-continuous data, characterized by a mixture of a substantial proportion of zero values and a skewed distribution of positive values. In the original formulation of this model, the natural logarithm of the positive values is assumed to follow a normal distribution with a constant variance parameter. In this article, we review and consider three extensions of this model, allowing the positive values to follow (a) a generalized gamma distribution, (b) a log-skew-normal distribution, and (c) a normal distribution after the Box-Cox transformation. We allow for the possibility of heteroscedasticity. Maximum likelihood estimation is shown to be conveniently implemented in SAS Proc NLMIXED. The performance of the methods is compared through applications to daily drinking records in a secondary data analysis from a randomized controlled trial of topiramate for alcohol dependence treatment. We find that all three models provide a significantly better fit than the log-normal model, and there exists strong evidence for heteroscedasticity. We also compare the three models by the likelihood ratio tests for non-nested hypotheses (Vuong3). The results suggest that the generalized gamma distribution provides the best fit, though no statistically significant differences are found in pairwise model comparisons.

Original languageEnglish (US)
Pages (from-to)133-152
Number of pages20
JournalStatistical Methods in Medical Research
Volume25
Issue number1
DOIs
StatePublished - Feb 1 2016

Funding

The author thanks the editor, the associate editor and the referee for valuable suggestions. We are grateful to Mr Xin-Qun Wang for help in data management, to Kevin Zhou and Jinsong Chen for computational assistance, and to Robert Cormier Jr for assistance in manuscript preparation.

Keywords

  • Hierarchical model
  • generalized linear mixed model
  • longitudinal data analysis
  • mixed model
  • model comparison
  • non-nested model

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability
  • Health Information Management

Fingerprint

Dive into the research topics of 'Analyzing repeated measures semi-continuous data, with application to an alcohol dependence study'. Together they form a unique fingerprint.

Cite this