Biologically informed deep learning to query gene programs in single-cell atlases

Mohammad Lotfollahi, Sergei Rybakov, Karin Hrovatin, Soroor Hediyeh-zadeh, Carlos Talavera-López, Alexander V. Misharin, Fabian J. Theis*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The increasing availability of large-scale single-cell atlases has enabled the detailed description of cell states. In parallel, advances in deep learning allow rapid analysis of newly generated query datasets by mapping them into reference atlases. However, existing data transformations learned to map query data are not easily explainable using biologically known concepts such as genes or pathways. Here we propose expiMap, a biologically informed deep-learning architecture that enables single-cell reference mapping. ExpiMap learns to map cells into biologically understandable components representing known ‘gene programs’. The activity of each cell for a gene program is learned while simultaneously refining them and learning de novo programs. We show that expiMap compares favourably to existing methods while bringing an additional layer of interpretability to integrative single-cell analysis. Furthermore, we demonstrate its applicability to analyse single-cell perturbation responses in different tissues and species and resolve responses of patients who have coronavirus disease 2019 to different treatments across cell types.

Original languageEnglish (US)
Pages (from-to)337-350
Number of pages14
JournalNature Cell Biology
Volume25
Issue number2
DOIs
StatePublished - Feb 2023

ASJC Scopus subject areas

  • Cell Biology

Fingerprint

Dive into the research topics of 'Biologically informed deep learning to query gene programs in single-cell atlases'. Together they form a unique fingerprint.

Cite this