TY - JOUR
T1 - Leveraging single-cell sequencing to classify and characterize tumor subgroups in bulk RNA-sequencing data
AU - Shetty, Arya
AU - Wang, Su
AU - Khan, A. Basit
AU - English, Collin W.
AU - Nouri, Shervin Hosseingholi
AU - Magill, Stephen T.
AU - Raleigh, David R.
AU - Klisch, Tiemo J.
AU - Harmanci, Arif O.
AU - Patel, Akash J.
AU - Harmanci, Akdes Serin
N1 - Publisher Copyright:
© This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2024. corrected publication 2024.
PY - 2024/7
Y1 - 2024/7
N2 - Purpose: Accurate classification of cancer subgroups is essential for precision medicine, tailoring treatments to individual patients based on their cancer subtypes. In recent years, advances in high-throughput sequencing technologies have enabled the generation of large-scale transcriptomic data from cancer samples. These data have provided opportunities for developing computational methods that can improve cancer subtyping and enable better personalized treatment strategies. Methods: Here in this study, we evaluated different feature selection schemes in the context of meningioma classification. To integrate interpretable features from the bulk (n = 77 samples) and single-cell profiling (∼ 10 K cells), we developed an algorithm named CLIPPR which combines the top-performing single-cell models, RNA-inferred copy number variation (CNV) signals, and the initial bulk model to create a meta-model. Results: While the scheme relying solely on bulk transcriptomic data showed good classification accuracy, it exhibited confusion between malignant and benign molecular classes in approximately ∼ 8% of meningioma samples. In contrast, models trained on features learned from meningioma single-cell data accurately resolved the sub-groups confused by bulk-transcriptomic data but showed limited overall accuracy. CLIPPR showed superior overall accuracy and resolved benign-malignant confusion as validated on n = 789 bulk meningioma samples gathered from multiple institutions. Finally, we showed the generalizability of our algorithm using our in-house single-cell (∼ 200 K cells) and bulk TCGA glioma data (n = 711 samples). Conclusion: Overall, our algorithm CLIPPR synergizes the resolution of single-cell data with the depth of bulk sequencing and enables improved cancer sub-group diagnoses and insights into their biology.
AB - Purpose: Accurate classification of cancer subgroups is essential for precision medicine, tailoring treatments to individual patients based on their cancer subtypes. In recent years, advances in high-throughput sequencing technologies have enabled the generation of large-scale transcriptomic data from cancer samples. These data have provided opportunities for developing computational methods that can improve cancer subtyping and enable better personalized treatment strategies. Methods: Here in this study, we evaluated different feature selection schemes in the context of meningioma classification. To integrate interpretable features from the bulk (n = 77 samples) and single-cell profiling (∼ 10 K cells), we developed an algorithm named CLIPPR which combines the top-performing single-cell models, RNA-inferred copy number variation (CNV) signals, and the initial bulk model to create a meta-model. Results: While the scheme relying solely on bulk transcriptomic data showed good classification accuracy, it exhibited confusion between malignant and benign molecular classes in approximately ∼ 8% of meningioma samples. In contrast, models trained on features learned from meningioma single-cell data accurately resolved the sub-groups confused by bulk-transcriptomic data but showed limited overall accuracy. CLIPPR showed superior overall accuracy and resolved benign-malignant confusion as validated on n = 789 bulk meningioma samples gathered from multiple institutions. Finally, we showed the generalizability of our algorithm using our in-house single-cell (∼ 200 K cells) and bulk TCGA glioma data (n = 711 samples). Conclusion: Overall, our algorithm CLIPPR synergizes the resolution of single-cell data with the depth of bulk sequencing and enables improved cancer sub-group diagnoses and insights into their biology.
KW - Meningioma
KW - Single-cell RNA sequencing
KW - Tumor classification
UR - http://www.scopus.com/inward/record.url?scp=85194777787&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85194777787&partnerID=8YFLogxK
U2 - 10.1007/s11060-024-04710-6
DO - 10.1007/s11060-024-04710-6
M3 - Article
C2 - 38811523
AN - SCOPUS:85194777787
SN - 0167-594X
VL - 168
SP - 515
EP - 524
JO - Journal of Neuro-Oncology
JF - Journal of Neuro-Oncology
IS - 3
ER -