TY - JOUR
T1 - Semi-Supervised, Attention-Based Deep Learning for Predicting TMPRSS2:ERG Fusion Status in Prostate Cancer Using Whole Slide Images
AU - Omar, Mohamed
AU - Xu, Zhuoran
AU - Rand, Sophie B.
AU - Alexanderani, Mohammad K.
AU - Salles, Daniela C.
AU - Valencia, Itzel
AU - Schaeffer, Edward M.
AU - Robinson, Brian D.
AU - Lotan, Tamara L.
AU - Loda, Massimo
AU - Marchionni, Luigi
N1 - Publisher Copyright:
© 2024 American Association for Cancer Research Inc.. All rights reserved.
PY - 2024/4
Y1 - 2024/4
N2 - Prostate cancer (PCa) harbors several genetic alterations, the most prevalent of which is TMPRSS2:ERG gene fusion, affecting nearly half of all cases. Capitalizing on the increasing availability of whole-slide images (WSIs), this study introduces a deep learning (DL) model designed to detect TMPRSS2:ERG fusion from H&E-stained WSIs of radical prostatectomy specimens. Leveraging the TCGA prostate adenocarcinoma cohort, which comprises 436 WSIs from 393 patients, we developed a robust DL model, trained across ten different splits, each consisting of distinct training, validation, and testing sets. The model’s best performance achieved an Area Under the ROC curve (AUC) of 0.84 during training, and 0.72 on the TCGA test set. This model was subsequently validated on an independent cohort comprising 314 WSIs from a different institution, in which it has a robust performance at predicting TMPRSS2:ERG fusion with an AUC of 0.73. Importantly, the model identifies highly-attended tissue regions associated with TMPRSS2:ERG fusion, characterized by higher neoplastic cell content and altered immune and stromal profiles compared to fusion-negative cases. Multivariate survival analysis revealed that these morphological features correlate with poorer survival outcomes, independent of Gleason grade and tumor stage. This study underscores the potential of DL in deducing genetic alterations from routine slides and identifying their underlying morphological features which might harbor prognostic information.
AB - Prostate cancer (PCa) harbors several genetic alterations, the most prevalent of which is TMPRSS2:ERG gene fusion, affecting nearly half of all cases. Capitalizing on the increasing availability of whole-slide images (WSIs), this study introduces a deep learning (DL) model designed to detect TMPRSS2:ERG fusion from H&E-stained WSIs of radical prostatectomy specimens. Leveraging the TCGA prostate adenocarcinoma cohort, which comprises 436 WSIs from 393 patients, we developed a robust DL model, trained across ten different splits, each consisting of distinct training, validation, and testing sets. The model’s best performance achieved an Area Under the ROC curve (AUC) of 0.84 during training, and 0.72 on the TCGA test set. This model was subsequently validated on an independent cohort comprising 314 WSIs from a different institution, in which it has a robust performance at predicting TMPRSS2:ERG fusion with an AUC of 0.73. Importantly, the model identifies highly-attended tissue regions associated with TMPRSS2:ERG fusion, characterized by higher neoplastic cell content and altered immune and stromal profiles compared to fusion-negative cases. Multivariate survival analysis revealed that these morphological features correlate with poorer survival outcomes, independent of Gleason grade and tumor stage. This study underscores the potential of DL in deducing genetic alterations from routine slides and identifying their underlying morphological features which might harbor prognostic information.
KW - ERG fusion
KW - Prostate cancer
KW - artificial intelligence
KW - deep learning
KW - digital pathology
KW - semi-supervised learning
KW - tissue morphology
UR - http://www.scopus.com/inward/record.url?scp=85189751647&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85189751647&partnerID=8YFLogxK
U2 - 10.1158/1541-7786.MCR-23-0639
DO - 10.1158/1541-7786.MCR-23-0639
M3 - Article
C2 - 38284821
AN - SCOPUS:85189751647
SN - 1541-7786
VL - 22
JO - Molecular Cancer Research
JF - Molecular Cancer Research
IS - 4
ER -