TY - JOUR
T1 - Comparison of accuracy and reproducibility of colposcopic impression based on a single image versus a two-minute time series of colposcopic images
AU - Perkins, Rebecca
AU - Jeronimo, Jose
AU - Hammer, Anne
AU - Novetsky, Akiva
AU - Guido, Richard
AU - del Pino, Marta
AU - Louwers, Jaqueline
AU - Marcus, Jenna
AU - Resende, Ceres
AU - Smith, Katie
AU - Egemen, Didem
AU - Befano, Brian
AU - Smith, Debi
AU - Antani, Sameer
AU - de Sanjose, Silvia
AU - Schiffman, Mark
N1 - Funding Information:
The work was supported by the National Institutes of Health . No commercial support was obtained.
Publisher Copyright:
© 2022 The Authors
PY - 2022/10
Y1 - 2022/10
N2 - Objective: Colposcopy is an important part of cervical screening/management programs. Colposcopic appearance is often classified, for teaching and telemedicine, based on static images that do not reveal the dynamics of acetowhitening. We compared the accuracy and reproducibility of colposcopic impression based on a single image at one minute after application of acetic acid versus a time-series of 17 sequential images over two minutes. Methods: Approximately 5000 colposcopic examinations conducted with the DYSIS colposcopic system were divided into 10 random sets, each assigned to a separate expert colposcopist. Colposcopists first classified single two-dimensional images at one minute and then a time-series of 17 sequential images as ‘normal,’ ‘indeterminate,’ ‘high grade,’ or ‘cancer’. Ratings were compared to histologic diagnoses. Additionally, 5 colposcopists reviewed a subset of 200 single images and 200 time series to estimate intra- and inter-rater reliability. Results: Of 4640 patients with adequate images, only 24.4% were correctly categorized by single image visual assessment (11% of 64 cancers; 31% of 605 CIN3; 22.4% of 558 CIN2; 23.9% of 3412 < CIN2). Individual colposcopist accuracy was low; Youden indices (sensitivity plus specificity minus one) ranged from 0.07 to 0.24. Use of the time-series increased the proportion of images classified as normal, regardless of histology. Intra-rater reliability was substantial (weighted kappa = 0.64); inter-rater reliability was fair ( weighted kappa = 0.26). Conclusion: Substantial variation exists in visual assessment of colposcopic images, even when a 17-image time series showing the two-minute process of acetowhitening is presented. We are currently evaluating whether deep-learning image evaluation can assist classification.
AB - Objective: Colposcopy is an important part of cervical screening/management programs. Colposcopic appearance is often classified, for teaching and telemedicine, based on static images that do not reveal the dynamics of acetowhitening. We compared the accuracy and reproducibility of colposcopic impression based on a single image at one minute after application of acetic acid versus a time-series of 17 sequential images over two minutes. Methods: Approximately 5000 colposcopic examinations conducted with the DYSIS colposcopic system were divided into 10 random sets, each assigned to a separate expert colposcopist. Colposcopists first classified single two-dimensional images at one minute and then a time-series of 17 sequential images as ‘normal,’ ‘indeterminate,’ ‘high grade,’ or ‘cancer’. Ratings were compared to histologic diagnoses. Additionally, 5 colposcopists reviewed a subset of 200 single images and 200 time series to estimate intra- and inter-rater reliability. Results: Of 4640 patients with adequate images, only 24.4% were correctly categorized by single image visual assessment (11% of 64 cancers; 31% of 605 CIN3; 22.4% of 558 CIN2; 23.9% of 3412 < CIN2). Individual colposcopist accuracy was low; Youden indices (sensitivity plus specificity minus one) ranged from 0.07 to 0.24. Use of the time-series increased the proportion of images classified as normal, regardless of histology. Intra-rater reliability was substantial (weighted kappa = 0.64); inter-rater reliability was fair ( weighted kappa = 0.26). Conclusion: Substantial variation exists in visual assessment of colposcopic images, even when a 17-image time series showing the two-minute process of acetowhitening is presented. We are currently evaluating whether deep-learning image evaluation can assist classification.
KW - Cervical cancer prevention
KW - Colposcopy
KW - Visual image evaluation
UR - http://www.scopus.com/inward/record.url?scp=85136312752&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85136312752&partnerID=8YFLogxK
U2 - 10.1016/j.ygyno.2022.08.001
DO - 10.1016/j.ygyno.2022.08.001
M3 - Article
C2 - 36008184
AN - SCOPUS:85136312752
SN - 0090-8258
VL - 167
SP - 89
EP - 95
JO - Gynecologic oncology
JF - Gynecologic oncology
IS - 1
ER -