TY - JOUR
T1 - Cancer digital slide archive
T2 - An informatics resource to support integrated in silico analysis of TCGA pathology data
AU - Gutman, David A.
AU - Cobb, Jake
AU - Somanna, Dhananjaya
AU - Park, Yuna
AU - Wang, Fusheng
AU - Kurc, Tahsin
AU - Saltz, Joel H.
AU - Brat, Daniel J.
AU - Cooper, Lee A D
PY - 2013
Y1 - 2013
N2 - Background: The integration and visualization of multimodal datasets is a common challenge in biomedical informatics. Several recent studies of The Cancer Genome Atlas (TCGA) data have illustrated important relationships between morphology observed in whole-slide images, outcome, and genetic events. The pairing of genomics and rich clinical descriptions with whole-slide imaging provided by TCGA presents a unique opportunity to perform these correlative studies. However, better tools are needed to integrate the vast and disparate data types. Objective: To build an integrated web-based platform supporting whole-slide pathology image visualization and data integration. Materials and methods: All images and genomic data were directly obtained from the TCGA and National Cancer Institute (NCI) websites. Results: The Cancer Digital Slide Archive (CDSA) produced is accessible to the public (http://cancer. digitalslidearchive.net) and currently hosts more than 20 000 whole-slide images from 22 cancer types. Discussion: The capabilities of CDSA are demonstrated using TCGA datasets to integrate pathology imaging with associated clinical, genomic and MRI measurements in glioblastomas and can be extended to other tumor types. CDSA also allows URL-based sharing of wholeslide images, and has preliminary support for directly sharing regions of interest and other annotations. Images can also be selected on the basis of other metadata, such as mutational profile, patient age, and other relevant characteristics. Conclusions: With the increasing availability of wholeslide scanners, analysis of digitized pathology images will become increasingly important in linking morphologic observations with genomic and clinical endpoints.
AB - Background: The integration and visualization of multimodal datasets is a common challenge in biomedical informatics. Several recent studies of The Cancer Genome Atlas (TCGA) data have illustrated important relationships between morphology observed in whole-slide images, outcome, and genetic events. The pairing of genomics and rich clinical descriptions with whole-slide imaging provided by TCGA presents a unique opportunity to perform these correlative studies. However, better tools are needed to integrate the vast and disparate data types. Objective: To build an integrated web-based platform supporting whole-slide pathology image visualization and data integration. Materials and methods: All images and genomic data were directly obtained from the TCGA and National Cancer Institute (NCI) websites. Results: The Cancer Digital Slide Archive (CDSA) produced is accessible to the public (http://cancer. digitalslidearchive.net) and currently hosts more than 20 000 whole-slide images from 22 cancer types. Discussion: The capabilities of CDSA are demonstrated using TCGA datasets to integrate pathology imaging with associated clinical, genomic and MRI measurements in glioblastomas and can be extended to other tumor types. CDSA also allows URL-based sharing of wholeslide images, and has preliminary support for directly sharing regions of interest and other annotations. Images can also be selected on the basis of other metadata, such as mutational profile, patient age, and other relevant characteristics. Conclusions: With the increasing availability of wholeslide scanners, analysis of digitized pathology images will become increasingly important in linking morphologic observations with genomic and clinical endpoints.
UR - http://www.scopus.com/inward/record.url?scp=84886280529&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84886280529&partnerID=8YFLogxK
U2 - 10.1136/amiajnl-2012-001469
DO - 10.1136/amiajnl-2012-001469
M3 - Article
C2 - 23893318
AN - SCOPUS:84886280529
SN - 1067-5027
VL - 20
SP - 1091
EP - 1098
JO - Journal of the American Medical Informatics Association : JAMIA
JF - Journal of the American Medical Informatics Association : JAMIA
IS - 6
ER -