Abstract
Gaussian graphical models (GGMs) provide a framework for modeling conditional dependencies in multivariate data. In this tutorial, we provide an overview of GGM theory and a demonstration of various GGM tools in R. The mathematical foundations of GGMs are introduced with the goal of enabling the researcher to draw practical conclusions by interpreting model results. Background literature is presented, emphasizing methods recently developed for high-dimensional applications such as genomics, proteomics, or metabolomics. The application of these methods is illustrated using a publicly available dataset of gene expression profiles from 578 participants with ovarian cancer in The Cancer Genome Atlas. Stand-alone code for the demonstration is available as an RMarkdown file at https://github.com/katehoffshutta/ggmTutorial.
Original language | English (US) |
---|---|
Pages (from-to) | 5150-5187 |
Number of pages | 38 |
Journal | Statistics in Medicine |
Volume | 41 |
Issue number | 25 |
DOIs | |
State | Published - Nov 10 2022 |
Funding
information U.S. National Library of Medicine, R01LM013444-01 Research reported in this publication was supported by the U.S. Natural Library of Medicine of the National Institutes of Health under award number R01LM013444‐01.
Keywords
- Gaussian graphical models
- graphical lasso
- network medicine
- omics
- partial correlation networks
ASJC Scopus subject areas
- Epidemiology
- Statistics and Probability