High throughput single cell long-read sequencing analyses of same-cell genotypes and phenotypes in human tumors

Cheng Kai Shiau, Lina Lu, Rachel Kieser, Kazutaka Fukumura, Timothy Pan, Hsiao Yun Lin, Jie Yang, Eric L. Tong, Ga Hyun Lee, Yuanqing Yan, Jason T. Huse, Ruli Gao*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

25 Scopus citations

Abstract

Single-cell nanopore sequencing of full-length mRNAs transforms single-cell multi-omics studies. However, challenges include high sequencing errors and dependence on short-reads and/or barcode whitelists. To address these, we develop scNanoGPS to calculate same-cell genotypes (mutations) and phenotypes (gene/isoform expressions) without short-read nor whitelist guidance. We apply scNanoGPS onto 23,587 long-read transcriptomes from 4 tumors and 2 cell-lines. Standalone, scNanoGPS deconvolutes error-prone long-reads into single-cells and single-molecules, and simultaneously accesses both phenotypes and genotypes of individual cells. Our analyses reveal that tumor and stroma/immune cells express distinct combination of isoforms (DCIs). In a kidney tumor, we identify 924 DCI genes involved in cell-type-specific functions such as PDE10A in tumor cells and CCL3 in lymphocytes. Transcriptome-wide mutation analyses identify many cell-type-specific mutations including VEGFA mutations in tumor cells and HLA-A mutations in immune cells, highlighting the critical roles of different mutant populations in tumors. Together, scNanoGPS facilitates applications of single-cell long-read sequencing technologies.

Original languageEnglish (US)
Article number4124
JournalNature communications
Volume14
Issue number1
DOIs
StatePublished - Dec 2023

Funding

We thank research funding supports to R.G. from Northwestern University and National Heart, Lung, and Blood Institute (NIH 1R01HL160552-01). The nanopore sequencing was carried out by the DNA Technologies and Expression Analysis Core at the UC Davis Genome Center, supported by NIH Shared Instrumentation Grant (1S10OD010786-01). The next-generation sequencing was carried out by the ATGC core at the University of Texas MD Anderson Cancer Center, supported by the Core grant (CA016672) and NIH Shared Instrument Grant (1S10OD024977-01), and by the Genomic and RNA Profiling Core at Baylor College of Medicine with funding support from an NIH S10 grant (1S10OD023469).

ASJC Scopus subject areas

  • General Chemistry
  • General Biochemistry, Genetics and Molecular Biology
  • General Physics and Astronomy

Fingerprint

Dive into the research topics of 'High throughput single cell long-read sequencing analyses of same-cell genotypes and phenotypes in human tumors'. Together they form a unique fingerprint.

Cite this