Replicates in high dimensions, with applications to latent variable graphical models

Kean Ming Tan, Yang Ning, Daniela M. Witten, Han Liu

Research output: Contribution to journal (Article)

4 Scopus citations

Abstract

In classical statistics, much thought has been put into experimental design and data collection. In the high-dimensional setting, however, experimental design has been less of a focus. In this paper, we stress the importance of collecting multiple replicates for each subject in the high-dimensional setting. We consider learning the structure of a graphical model with latent variables, under the assumption that these variables take a constant value across replicates within each subject. By collecting multiple replicates for each subject, we can estimate the conditional dependence relationships among the observed variables given the latent variables. To test the hypothesis of conditional independence between two observed variables, we propose a pairwise decorrelated score test. Theoretical guarantees are established for parameter estimation and for this test. We show that our method is able to estimate latent variable graphical models more accurately than some existing methods, and we apply it to a brain imaging dataset.

Original language: English (US)
Pages (from-to): 761-777
Number of pages: 17
Journal: Biometrika
Volume: 103
Issue number: 4
State: Published - Dec 1 2016

Keywords

  • Experimental design
  • Nuisance parameter
  • Pairwise decorrelated score test
  • Semiparametric exponential family graphical model

ASJC Scopus subject areas

  • Statistics and Probability
  • Mathematics (all)
  • Agricultural and Biological Sciences (miscellaneous)
  • Agricultural and Biological Sciences (all)
  • Statistics, Probability and Uncertainty
  • Applied Mathematics
