TY - JOUR
T1 - Biologically informed deep learning to query gene programs in single-cell atlases
AU - Lotfollahi, Mohammad
AU - Rybakov, Sergei
AU - Hrovatin, Karin
AU - Hediyeh-zadeh, Soroor
AU - Talavera-López, Carlos
AU - Misharin, Alexander V.
AU - Theis, Fabian J.
N1 - Funding Information:
We are grateful to all members of the Theis laboratory. M.L. is grateful for valuable feedback on the text from F. Curion and L. Zapia. M.L. is thankful for feedback from A. Gayoso on amortized inference. M.L. and K.H. acknowledge financial support from the Joachim Herz Stiftung via Add-on Fellowships for Interdisciplinary Life Science. K.H. acknowledges support from Helmholtz Association under the joint research school ‘Munich School for Data Science’. This work was supported by the BMBF (01IS18036A and 01IS18036B), by the European Union’s Horizon 2020 research and innovation program (grant 874656), by Helmholtz Association’s Initiative and Networking Fund through Helmholtz AI (ZT-I-PF-5-01) and sparse2big (ZT-I-0007), all to F.J.T.
Publisher Copyright:
© 2023, The Author(s).
PY - 2023/2
Y1 - 2023/2
N2 - The increasing availability of large-scale single-cell atlases has enabled the detailed description of cell states. In parallel, advances in deep learning allow rapid analysis of newly generated query datasets by mapping them into reference atlases. However, existing data transformations learned to map query data are not easily explainable using biologically known concepts such as genes or pathways. Here we propose expiMap, a biologically informed deep-learning architecture that enables single-cell reference mapping. ExpiMap learns to map cells into biologically understandable components representing known ‘gene programs’. The activity of each cell for a gene program is learned while simultaneously refining them and learning de novo programs. We show that expiMap compares favourably to existing methods while bringing an additional layer of interpretability to integrative single-cell analysis. Furthermore, we demonstrate its applicability to analyse single-cell perturbation responses in different tissues and species and resolve responses of patients who have coronavirus disease 2019 to different treatments across cell types.
AB - The increasing availability of large-scale single-cell atlases has enabled the detailed description of cell states. In parallel, advances in deep learning allow rapid analysis of newly generated query datasets by mapping them into reference atlases. However, existing data transformations learned to map query data are not easily explainable using biologically known concepts such as genes or pathways. Here we propose expiMap, a biologically informed deep-learning architecture that enables single-cell reference mapping. ExpiMap learns to map cells into biologically understandable components representing known ‘gene programs’. The activity of each cell for a gene program is learned while simultaneously refining them and learning de novo programs. We show that expiMap compares favourably to existing methods while bringing an additional layer of interpretability to integrative single-cell analysis. Furthermore, we demonstrate its applicability to analyse single-cell perturbation responses in different tissues and species and resolve responses of patients who have coronavirus disease 2019 to different treatments across cell types.
UR - http://www.scopus.com/inward/record.url?scp=85147371442&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85147371442&partnerID=8YFLogxK
U2 - 10.1038/s41556-022-01072-x
DO - 10.1038/s41556-022-01072-x
M3 - Article
C2 - 36732632
AN - SCOPUS:85147371442
SN - 1465-7392
VL - 25
SP - 337
EP - 350
JO - Nature Cell Biology
JF - Nature Cell Biology
IS - 2
ER -