Simplified methods for variance estimation in microbiome abundance count data analysis

Yiming Shi, Lili Liu, Jun Chen, Kristine M. Wylie, Todd N. Wylie, Molly J. Stout, Chan Wang, Haixiang Zhang, Ya Chen T. Shih, Xiaoyi Xu, Ai Zhang, Sung Hee Park, Hongmei Jiang, Lei Liu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The complex nature of microbiome data has made the differential abundance analysis challenging. Microbiome abundance counts are often skewed to the right and heteroscedastic (also known as overdispersion), potentially leading to incorrect inferences if not properly addressed. In this paper, we propose a simple yet effective framework to tackle the challenges by integrating Poisson (log-linear) regression with standard error estimation through the Bootstrap method and Sandwich robust estimation. Such standard error estimates are accurate and yield satisfactory inference even if the distributional assumption or the variance structure is incorrect. Our approach is validated through extensive simulation studies, demonstrating its effectiveness in addressing overdispersion and improving inference accuracy. Additionally, we apply our approach to two real datasets collected from the human gut and vagina, respectively, demonstrating the wide applicability of our methods. The results highlight the efficacy of our covariance estimators in addressing the challenges of microbiome data analysis. The corresponding software implementation is publicly available at https://github.com/yimshi/robustestimates.

Original languageEnglish (US)
Article number1458851
JournalFrontiers in Genetics
Volume15
DOIs
StatePublished - 2024

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. Research reported in this publication was supported by NIH UL1 TR002345 and R01 HD095986.

Keywords

  • bootstrap
  • heteroscedasticity
  • microbiome abundance count
  • robust variance estimation
  • sandwich estimates

ASJC Scopus subject areas

  • Molecular Medicine
  • Genetics
  • Genetics(clinical)

Fingerprint

Dive into the research topics of 'Simplified methods for variance estimation in microbiome abundance count data analysis'. Together they form a unique fingerprint.

Cite this