Training the ACRIN 6666 investigators and effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS ultrasound feature analysis

Wendie A. Berg*, Jeffrey D. Blume, Jean B. Cormack, Ellen B. Mendelson

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

31 Scopus citations

Abstract

OBJECTIVE. Qualification tasks in mammography and breast ultrasound were developed for the American College of Radiology Imaging Network (ACRIN) 6666 Investigators. We sought to assess the effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS feature analysis among a subset of these experienced observers. MATERIALS AND METHODS. After a 1-hour didactic session on BI-RADS: Ultrasound, an interpretive skills quiz set of 70 orthogonal sets of breast ultrasound images including 25 (36%) malignancies was presented to 100 experienced breast imaging observers. Thirty-five observers reviewed the quiz set twice: first without and then with immediate feedback of consensus feature analysis, management recommendations, and pathologic truth. Observer performance (sensitivity, specificity, area under the curve [AUC]) was calculated without feedback and with feedback. Kappas were determined for agreement on feature analysis and assessments. RESULTS. For 35 observers without feedback, the mean sensitivity was 89% (range, 68-100%); specificity, 62% (range, 42-82%); and AUC, 82% (range, 73-89%). With feedback, the mean sensitivity was 93% (range, 80-100%; mean increase, 4%; range of increase, 0-12%; p < 0.0001), the mean specificity was 61% (range, 45-73%; mean decrease, 1%; range of change, -18% to 11%; p = 0.19), and the mean AUC was 84% (range, 78-90%; mean increase, 2%; range of change, -3% to 9%; p < 0.0001). Three breast imagers in the lowest quartile of initial performance showed the greatest improvement in sensitivity with no change or improvement in AUC. The kappa values for feature analysis did not change, but there was improved agreement about final assessments, with the kappa value increasing from 0.53 (SE, 0.02) without feedback to 0.59 (SE, 0.02) with feedback (p < 0.0001). CONCLUSION. Most experienced breast imagers showed excellent breast ultrasound interpretive skills. Immediate feedback of consensus BI-RADS: Ultrasound features and histopathologic results improved performance in ultrasound interpretation across all experience variables.

Original languageEnglish (US)
Pages (from-to)224-235
Number of pages12
JournalAmerican Journal of Roentgenology
Volume199
Issue number1
DOIs
StatePublished - Jul 2012

Keywords

  • BI-RADS
  • Breast ultrasound
  • Feedback
  • Observer agreement
  • Observer performance
  • Training

ASJC Scopus subject areas

  • Radiology Nuclear Medicine and imaging

Fingerprint Dive into the research topics of 'Training the ACRIN 6666 investigators and effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS ultrasound feature analysis'. Together they form a unique fingerprint.

Cite this