Statistical learning with time series dependence: An application to scoring sleep in mice

Blakeley B. McShane, Shane T. Jensen, Allan I. Pack, Abraham J. Wyner

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

We develop methodology that combines statistical learning methods with generalized Markov models, thereby enhancing the former to account for time series dependence. Our methodology can accommodate very general and very long-term time dependence structures in an easily estimable and computationally tractable fashion. We apply our methodology to the scoring of sleep behavior in mice. As methods currently used to score sleep in mice are expensive, invasive, and labor intensive, there is considerable interest in developing high-throughput automated systems which would allow many mice to be scored cheaply and quickly. Previous efforts at automation have been able to differentiate sleep from wakefulness, but they are unable to differentiate the rare and important state of rapid eye movement (REM) sleep from non-REM sleep. Key difficulties in detecting REM are that (i) REM is much rarer than non-REM and wakefulness, (ii) REM looks similar to non-REM in terms of the observed covariates, (iii) the data are noisy, and (iv) the data contain strong time dependence structures crucial for differentiating REM from non-REM. Our new approach (i) shows improved differentiation of REM from non-REM sleep and (ii) accurately estimates aggregate quantities of sleep in our application to video-based sleep scoring of mice. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)1147-1162
Number of pages16
JournalJournal of the American Statistical Association
Volume108
Issue number504
DOIs
StatePublished - 2013

Funding

Blakeley B. McShane is Assistant Professor, Kellogg School of Management, Northwestern University, Evanston, IL 60611 (E-mail: [email protected]). Shane T. Jensen is Associate Professor, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104 (E-mail: [email protected]). Allan I. Pack is John Miclot Professor, Center for Sleep and Circadian Neurobiology, University of Pennsylvania, Philadelphia, PA 19104 (E-mail: [email protected]). Abraham J. Wyner is Professor, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104 (E-mail: [email protected]). This research was supported in part by the National Institutes of Health grants T32 HL07713, P01 AG17628, and MH081491.

Keywords

  • Categorical
  • Classification
  • Machine learning
  • Markov
  • REM
  • Sequence

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Statistical learning with time series dependence: An application to scoring sleep in mice'. Together they form a unique fingerprint.

Cite this