Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)

Ruey Leng Loo, Queenie Chan, Henrik Antti, Jia V. Li, H. Ashrafian, Paul Elliott, Jeremiah Stamler, Jeremy K. Nicholson, Elaine Holmes, Julien Wist*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation: Large-scale population omics data can provide insight into associations between gene-environment interactions and disease. However, existing dimension reduction modelling techniques are often inefficient for extracting detailed information from these complex datasets. Results: Here, we present an interactive software pipeline for exploratory analyses of population-based nuclear magnetic resonance spectral data using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) within the R-library hastaLaVista framework. Principal component analysis models are generated for a sequential series of spectral regions (blocks) to provide more granular detail defining sub-populations within the dataset. Molecular identification of key differentiating signals is subsequently achieved by implementing Statistical TOtal Correlation SpectroscopY on the full spectral data to define feature patterns. Finally, the distributions of cross-correlation of the reference patterns across the spectral dataset are used to provide population statistics for identifying underlying features arising from drug intake, latent diseases and diet. The COMPASS method thus provides an efficient semi-automated approach for screening population datasets.

Original languageEnglish (US)
Pages (from-to)5229-5236
Number of pages8
JournalBioinformatics
Volume36
Issue number21
DOIs
StatePublished - Nov 1 2020

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint

Dive into the research topics of 'Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)'. Together they form a unique fingerprint.

Cite this