Abstract
Open-set recognition generalizes a classification task by classifying test samples as one of the known classes from training or “unknown.” As novel cancer drug cocktails with improved treatment are continually discovered, classifying patients by treatments can naturally be formulated in terms of an open-set recognition problem. Drawbacks, due to modeling unknown samples during training, arise from straightforward implementations of prior work in healthcare open-set learning. Accordingly, we reframe the problem methodology and apply a recent Gaussian mixture variational autoencoder model, which achieves state-of-the-art results for image datasets, to breast cancer patient data. Not only do we obtain more accurate and robust classification results (14% average F1 increase compared to recent methods), but we also reexamine open-set recognition in terms of deployability to a clinical setting.
Original language | English (US) |
---|---|
Article number | 102451 |
Journal | Artificial Intelligence In Medicine |
Volume | 135 |
DOIs | |
State | Published - Jan 2023 |
Funding
The work of the first author is supported by the Predoctoral Training Program in Biomedical Data Driven Discovery (BD3) at Northwestern University (National Library of Medicine Grant 5T32LM012203 ). The work of the last author is supported in part by NIH Grant R01LM013337 .
Keywords
- Autoencoder
- Breast cancer treatments
- Deep neural networks
- Open-set recognition
ASJC Scopus subject areas
- Medicine (miscellaneous)
- Artificial Intelligence