TY - JOUR
T1 - Improving visualization and interpretation of metabolome-wide association studies
T2 - An application in a population-based cohort using untargeted 1H NMR metabolic profiling
AU - Castagné, Raphaële
AU - Boulangé, Claire Laurence
AU - Karaman, Ibrahim
AU - Campanella, Gianluca
AU - Ferreira, Diana L.Santos
AU - Kaluarachchi, Manuja R.
AU - Lehne, Benjamin
AU - Moayyeri, Alireza
AU - Lewis, Matthew R.
AU - Spagou, Konstantina
AU - Dona, Anthony C.
AU - Evangelos, Vangelis
AU - Tracy, Russell
AU - Greenland, Philip
AU - Lindon, John C.
AU - Herrington, David
AU - Ebbels, Timothy M.D.
AU - Elliott, Paul
AU - Tzoulaki, Ioanna
AU - Chadeau-Hyam, Marc
N1 - Funding Information:
This work has been carried out as part of the of the FP7 project COMBI-BIO [305422 to P. E.]. MESA was supported by Contract Nos. HHSN268201500003I, N01-HC-95159, N01-HC-95160, N01-HC-95161, N01-HC-95162, N01-HC-95163, N01-HC-95164, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168, and N01-HC-95169 from the National Heart, Lung, and Blood Institute, and by Grant Nos. UL1-TR-000040, UL1-TR-001079, and UL1-TR-001420 from NCATS. P.E. is Director of the MRC-PHE Centre for Environment and Health and acknowledges support from the Medical Research Council and Public Health England (MR/L01341X/1). P.E. acknowledges support from the NIHR Biomedical Research Centre at Imperial College Healthcare NHS Trust and Imperial College London, and the NIHR Health Protection Research Unit in Health Impact of Environmental Hazards (HPRU-2012− 10141). This work used the computing resources of the UK MEDical BIOinformatics partnership (UKMED-BIO) supported by the Medical Research Council (MR/L01632X/1). The authors wish to thank all the centres that took part in the study and the additional members of the COMBI-BIO consortium. The authors thank the other investigators, the staff, and the participants of the MESA study for their valuable contributions. A full list of participating MESA investigators and institutions can be found at http://www.mesa-nhlbi.org.
Publisher Copyright:
© 2017 American Chemical Society.
PY - 2017/10/6
Y1 - 2017/10/6
N2 - 1H NMR spectroscopy of biofluids generates reproducible data allowing detection and quantification of small molecules in large population cohorts. Statistical models to analyze such data are now well-established, and the use of univariate metabolome wide association studies (MWAS) investigating the spectral features separately has emerged as a computationally efficient and interpretable alternative to multivariate models. The MWAS rely on the accurate estimation of a metabolome wide significance level (MWSL) to be applied to control the family wise error rate. Subsequent interpretation requires efficient visualization and formal feature annotation, which, in-Turn, call for efficient prioritization of spectral variables of interest. Using human serum 1H NMR spectroscopic profiles from 3948 participants from the Multi-Ethnic Study of Atherosclerosis (MESA), we have performed a series of MWAS for serum levels of glucose. We first propose an extension of the conventional MWSL that yields stable estimates of the MWSL across the different model parameterizations and distributional features of the outcome. We propose both efficient visualization methods and a strategy based on subsampling and internal validation to prioritize the associations. Our work proposes and illustrates practical and scalable solutions to facilitate the implementation of the MWAS approach and improve interpretation in large cohort studies.
AB - 1H NMR spectroscopy of biofluids generates reproducible data allowing detection and quantification of small molecules in large population cohorts. Statistical models to analyze such data are now well-established, and the use of univariate metabolome wide association studies (MWAS) investigating the spectral features separately has emerged as a computationally efficient and interpretable alternative to multivariate models. The MWAS rely on the accurate estimation of a metabolome wide significance level (MWSL) to be applied to control the family wise error rate. Subsequent interpretation requires efficient visualization and formal feature annotation, which, in-Turn, call for efficient prioritization of spectral variables of interest. Using human serum 1H NMR spectroscopic profiles from 3948 participants from the Multi-Ethnic Study of Atherosclerosis (MESA), we have performed a series of MWAS for serum levels of glucose. We first propose an extension of the conventional MWSL that yields stable estimates of the MWSL across the different model parameterizations and distributional features of the outcome. We propose both efficient visualization methods and a strategy based on subsampling and internal validation to prioritize the associations. Our work proposes and illustrates practical and scalable solutions to facilitate the implementation of the MWAS approach and improve interpretation in large cohort studies.
KW - Cohort studies
KW - Full resolution H NMR
KW - High-Throughput analysis
KW - MESA
KW - Metabolic profiling
KW - Metabolome wide association study
KW - Molecular epidemiology
KW - Multiple testing correction
KW - Results visualization and prioritization
KW - Significance level
UR - http://www.scopus.com/inward/record.url?scp=85041110991&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85041110991&partnerID=8YFLogxK
U2 - 10.1021/acs.jproteome.7b00344
DO - 10.1021/acs.jproteome.7b00344
M3 - Article
C2 - 28823158
AN - SCOPUS:85041110991
SN - 1535-3893
VL - 16
SP - 3623
EP - 3633
JO - Journal of Proteome Research
JF - Journal of Proteome Research
IS - 10
ER -