TY - JOUR
T1 - Training the ACRIN 6666 investigators and effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS ultrasound feature analysis
AU - Berg, Wendie A.
AU - Blume, Jeffrey D.
AU - Cormack, Jean B.
AU - Mendelson, Ellen B.
PY - 2012/7
Y1 - 2012/7
N2 - OBJECTIVE. Qualification tasks in mammography and breast ultrasound were developed for the American College of Radiology Imaging Network (ACRIN) 6666 Investigators. We sought to assess the effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS feature analysis among a subset of these experienced observers. MATERIALS AND METHODS. After a 1-hour didactic session on BI-RADS: Ultrasound, an interpretive skills quiz set of 70 orthogonal sets of breast ultrasound images including 25 (36%) malignancies was presented to 100 experienced breast imaging observers. Thirty-five observers reviewed the quiz set twice: first without and then with immediate feedback of consensus feature analysis, management recommendations, and pathologic truth. Observer performance (sensitivity, specificity, area under the curve [AUC]) was calculated without feedback and with feedback. Kappas were determined for agreement on feature analysis and assessments. RESULTS. For 35 observers without feedback, the mean sensitivity was 89% (range, 68-100%); specificity, 62% (range, 42-82%); and AUC, 82% (range, 73-89%). With feedback, the mean sensitivity was 93% (range, 80-100%; mean increase, 4%; range of increase, 0-12%; p < 0.0001), the mean specificity was 61% (range, 45-73%; mean decrease, 1%; range of change, -18% to 11%; p = 0.19), and the mean AUC was 84% (range, 78-90%; mean increase, 2%; range of change, -3% to 9%; p < 0.0001). Three breast imagers in the lowest quartile of initial performance showed the greatest improvement in sensitivity with no change or improvement in AUC. The kappa values for feature analysis did not change, but there was improved agreement about final assessments, with the kappa value increasing from 0.53 (SE, 0.02) without feedback to 0.59 (SE, 0.02) with feedback (p < 0.0001). CONCLUSION. Most experienced breast imagers showed excellent breast ultrasound interpretive skills. Immediate feedback of consensus BI-RADS: Ultrasound features and histopathologic results improved performance in ultrasound interpretation across all experience variables.
AB - OBJECTIVE. Qualification tasks in mammography and breast ultrasound were developed for the American College of Radiology Imaging Network (ACRIN) 6666 Investigators. We sought to assess the effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS feature analysis among a subset of these experienced observers. MATERIALS AND METHODS. After a 1-hour didactic session on BI-RADS: Ultrasound, an interpretive skills quiz set of 70 orthogonal sets of breast ultrasound images including 25 (36%) malignancies was presented to 100 experienced breast imaging observers. Thirty-five observers reviewed the quiz set twice: first without and then with immediate feedback of consensus feature analysis, management recommendations, and pathologic truth. Observer performance (sensitivity, specificity, area under the curve [AUC]) was calculated without feedback and with feedback. Kappas were determined for agreement on feature analysis and assessments. RESULTS. For 35 observers without feedback, the mean sensitivity was 89% (range, 68-100%); specificity, 62% (range, 42-82%); and AUC, 82% (range, 73-89%). With feedback, the mean sensitivity was 93% (range, 80-100%; mean increase, 4%; range of increase, 0-12%; p < 0.0001), the mean specificity was 61% (range, 45-73%; mean decrease, 1%; range of change, -18% to 11%; p = 0.19), and the mean AUC was 84% (range, 78-90%; mean increase, 2%; range of change, -3% to 9%; p < 0.0001). Three breast imagers in the lowest quartile of initial performance showed the greatest improvement in sensitivity with no change or improvement in AUC. The kappa values for feature analysis did not change, but there was improved agreement about final assessments, with the kappa value increasing from 0.53 (SE, 0.02) without feedback to 0.59 (SE, 0.02) with feedback (p < 0.0001). CONCLUSION. Most experienced breast imagers showed excellent breast ultrasound interpretive skills. Immediate feedback of consensus BI-RADS: Ultrasound features and histopathologic results improved performance in ultrasound interpretation across all experience variables.
KW - BI-RADS
KW - Breast ultrasound
KW - Feedback
KW - Observer agreement
KW - Observer performance
KW - Training
UR - http://www.scopus.com/inward/record.url?scp=84863653585&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863653585&partnerID=8YFLogxK
U2 - 10.2214/AJR.11.7324
DO - 10.2214/AJR.11.7324
M3 - Article
C2 - 22733916
AN - SCOPUS:84863653585
SN - 0361-803X
VL - 199
SP - 224
EP - 235
JO - American Journal of Roentgenology
JF - American Journal of Roentgenology
IS - 1
ER -