Receiver operating characteristic curves and confidence bands for support vector machines

Daniel J. Luckett*, Eric B. Laber, Samer S. El-Kamary, Cheng Fan, Ravi Jhaveri, Charles M. Perou, Fatma M. Shebl, Michael R. Kosorok

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

Many problems that appear in biomedical decision-making, such as diagnosing disease and predicting response to treatment, can be expressed as binary classification problems. The support vector machine (SVM) is a popular classification technique that is robust to model misspecification and effectively handles high-dimensional data. The relative costs of false positives and false negatives can vary across application domains. The receiving operating characteristic (ROC) curve provides a visual representation of the trade-off between these two types of errors. Because the SVM does not produce a predicted probability, an ROC curve cannot be constructed in the traditional way of thresholding a predicted probability. However, a sequence of weighted SVMs can be used to construct an ROC curve. Although ROC curves constructed using weighted SVMs have great potential for allowing ROC curves analyses that cannot be done by thresholding predicted probabilities, their theoretical properties have heretofore been underdeveloped. We propose a method for constructing confidence bands for the SVM ROC curve and provide the theoretical justification for the SVM ROC curve by showing that the risk function of the estimated decision rule is uniformly consistent across the weight parameter. We demonstrate the proposed confidence band method using simulation studies. We present a predictive model for treatment response in breast cancer as an illustrative example.

Original languageEnglish (US)
Pages (from-to)1422-1430
Number of pages9
JournalBiometrics
Volume77
Issue number4
DOIs
StatePublished - Dec 2021

Funding

The authors gratefully acknowledge the following funding sources: NIH T32 CA201159, NIH P01 CA142538, National Center for Advancing Translational Sciences UL1 TR001111, NSF DMS-1555141, NSF DMS-1557733, NSF DMS-1513579, NIH 1R01DE024984, NIH U01HD39164, NIH U01AI58372, an investigator initiated grant from Merck, NCI Breast SPORE program (P50-CA58223-09A1), the Breast Cancer Research Foundation, and the V Foundation for Cancer\u00A0Research The authors gratefully acknowledge the following funding sources: NIH T32 CA201159, NIH P01 CA142538, National Center for Advancing Translational Sciences UL1 TR001111, NSF DMS\u20101555141, NSF DMS\u20101557733, NSF DMS\u20101513579, NIH 1R01DE024984, NIH U01HD39164, NIH U01AI58372, an investigator initiated grant from Merck, NCI Breast SPORE program (P50\u2010CA58223\u201009A1), the Breast Cancer Research Foundation, and the V Foundation for Cancer Research

ASJC Scopus subject areas

  • Statistics and Probability
  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology
  • General Agricultural and Biological Sciences
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Receiver operating characteristic curves and confidence bands for support vector machines'. Together they form a unique fingerprint.

Cite this