Abstract
We provide a generalizable framework for the systematic analysis of complicated, longitudinal clinical features in pediatric cancer. We use a threefold pipeline of exploratory data analysis, ontological categorization through a multi-modal data transformation process towards predictive analytics. We derive a data-driven phenotype from a subset of a sample of over 1900 brain tumor cases focused specifically on High-Grade Gliomas. We implement an analyst-friendly process to make machine learning-ready data sets based on domain ontologies ready for enumeration and vectorization. The results are clinical domain expert readable data points from 4.3 million observational events across 16,000 patient days. In this research, we address the gap in phenotypic data features by utilizing extensive harmonized observational clinical data and identify resources and specific processes for their use in rare tumor research.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020 |
Editors | Taesung Park, Young-Rae Cho, Xiaohua Tony Hu, Illhoi Yoo, Hyun Goo Woo, Jianxin Wang, Julio Facelli, Seungyoon Nam, Mingon Kang |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1943-1950 |
Number of pages | 8 |
ISBN (Electronic) | 9781728162157 |
DOIs | |
State | Published - Dec 16 2020 |
Event | 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020 - Virtual, Seoul, Korea, Republic of Duration: Dec 16 2020 → Dec 19 2020 |
Publication series
Name | Proceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020 |
---|
Conference
Conference | 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020 |
---|---|
Country/Territory | Korea, Republic of |
City | Virtual, Seoul |
Period | 12/16/20 → 12/19/20 |
Funding
This research is supported by the iFellowship Program administered by the University of Pittsburgh, and funded by The Andrew W. Mellon Foundation.
Keywords
- biomedical research
- biospecimen banking
- data-driven phenotype
- electronic medical records
- rare disease
ASJC Scopus subject areas
- Computer Science Applications
- Information Systems and Management
- Medicine (miscellaneous)
- Health Informatics