TY - JOUR
T1 - An improved workflow for accurate and robust healthcare environmental surveillance using metagenomics
AU - Shen, Jiaxian
AU - McFarland, Alexander G.
AU - Blaustein, Ryan A.
AU - Rose, Laura J.
AU - Perry-Dow, K. Allison
AU - Moghadam, Anahid A.
AU - Hayden, Mary K.
AU - Young, Vincent B.
AU - Hartmann, Erica M.
N1 - Funding Information:
This work was supported by the Centers for Disease Control and Prevention (BAA FY2018-OADS-01 Contract 02915). This research was supported in part through the computational resources and staff contributions provided for the Quest high performance computing facility at Northwestern University which is jointly supported by the Office of the Provost, the Office for Research, and Northwestern University Information Technology. We thank Thelma Dangana and Khaled Aboushaala for their help in collecting samples and the clinical staff of the medical intensive care unit at Rush University Medical Center for their cooperation. We are grateful to Pamela B. Bell and Rachel Beers for their contributions in performing MALDI-TOF MS. We would like to thank the anonymous reviewers for greatly improving this manuscript as well.
Funding Information:
This work was supported by the Centers for Disease Control and Prevention (BAA FY2018-OADS-01 Contract 02915).
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - Background: Effective surveillance of microbial communities in the healthcare environment is increasingly important in infection prevention. Metagenomics-based techniques are promising due to their untargeted nature but are currently challenged by several limitations: (1) they are not powerful enough to extract valid signals out of the background noise for low-biomass samples, (2) they do not distinguish between viable and nonviable organisms, and (3) they do not reveal the microbial load quantitatively. An additional practical challenge towards a robust pipeline is the inability to efficiently allocate sequencing resources a priori. Assessment of sequencing depth is generally practiced post hoc, if at all, for most microbiome studies, regardless of the sample type. This practice is inefficient at best, and at worst, poor sequencing depth jeopardizes the interpretation of study results. To address these challenges, we present a workflow for metagenomics-based environmental surveillance that is appropriate for low-biomass samples, distinguishes viability, is quantitative, and estimates sequencing resources. Results: The workflow was developed using a representative microbiome sample, which was created by aggregating 120 surface swabs collected from a medical intensive care unit. Upon evaluating and optimizing techniques as well as developing new modules, we recommend best practices and introduce a well-structured workflow. We recommend adopting liquid-liquid extraction to improve DNA yield and only incorporating whole-cell filtration when the nonbacterial proportion is large. We suggest including propidium monoazide treatment coupled with internal standards and absolute abundance profiling for viability assessment and involving cultivation when demanding comprehensive profiling. We further recommend integrating internal standards for quantification and additionally qPCR when we expect poor taxonomic classification. We also introduce a machine learning-based model to predict required sequencing effort from accessible sample features. The model helps make full use of sequencing resources and achieve desired outcomes. [MediaObject not available: see fulltext.] Conclusions: This workflow will contribute to more accurate and robust environmental surveillance and infection prevention. Lessons gained from this study will also benefit the continuing development of methods in relevant fields.
AB - Background: Effective surveillance of microbial communities in the healthcare environment is increasingly important in infection prevention. Metagenomics-based techniques are promising due to their untargeted nature but are currently challenged by several limitations: (1) they are not powerful enough to extract valid signals out of the background noise for low-biomass samples, (2) they do not distinguish between viable and nonviable organisms, and (3) they do not reveal the microbial load quantitatively. An additional practical challenge towards a robust pipeline is the inability to efficiently allocate sequencing resources a priori. Assessment of sequencing depth is generally practiced post hoc, if at all, for most microbiome studies, regardless of the sample type. This practice is inefficient at best, and at worst, poor sequencing depth jeopardizes the interpretation of study results. To address these challenges, we present a workflow for metagenomics-based environmental surveillance that is appropriate for low-biomass samples, distinguishes viability, is quantitative, and estimates sequencing resources. Results: The workflow was developed using a representative microbiome sample, which was created by aggregating 120 surface swabs collected from a medical intensive care unit. Upon evaluating and optimizing techniques as well as developing new modules, we recommend best practices and introduce a well-structured workflow. We recommend adopting liquid-liquid extraction to improve DNA yield and only incorporating whole-cell filtration when the nonbacterial proportion is large. We suggest including propidium monoazide treatment coupled with internal standards and absolute abundance profiling for viability assessment and involving cultivation when demanding comprehensive profiling. We further recommend integrating internal standards for quantification and additionally qPCR when we expect poor taxonomic classification. We also introduce a machine learning-based model to predict required sequencing effort from accessible sample features. The model helps make full use of sequencing resources and achieve desired outcomes. [MediaObject not available: see fulltext.] Conclusions: This workflow will contribute to more accurate and robust environmental surveillance and infection prevention. Lessons gained from this study will also benefit the continuing development of methods in relevant fields.
KW - Environmental surveillance
KW - Infection prevention
KW - Low biomass
KW - Machine learning
KW - Metagenomics
KW - Quantification
KW - Sequencing depth prediction
KW - Viability
UR - http://www.scopus.com/inward/record.url?scp=85143181641&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85143181641&partnerID=8YFLogxK
U2 - 10.1186/s40168-022-01412-x
DO - 10.1186/s40168-022-01412-x
M3 - Article
C2 - 36457108
AN - SCOPUS:85143181641
SN - 2049-2618
VL - 10
JO - Microbiome
JF - Microbiome
IS - 1
M1 - 206
ER -