Evaluation of ChatGPT-Generated Educational Patient Pamphlets for Common Interventional Radiology Procedures

Soheil Kooraki, Melina Hosseiny, Mohamamd H. Jalili, Amir Ali Rahsepar, Amir Imanzadeh, Grace Hyun Kim, Cameron Hassani, Fereidoun Abtin, John M. Moriarty, Arash Bedayat*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Rationale and Objectives: This study aimed to evaluate the accuracy and reliability of educational patient pamphlets created by ChatGPT, a large language model, for common interventional radiology (IR) procedures. Methods and Materials: Twenty frequently performed IR procedures were selected, and five users were tasked to independently request ChatGPT to generate educational patient pamphlets for each procedure using identical commands. Subsequently, two independent radiologists assessed the content, quality, and accuracy of the pamphlets. The review focused on identifying potential errors, inaccuracies, the consistency of pamphlets. Results: In a thorough analysis of the education pamphlets, we identified shortcomings in 30% (30/100) of pamphlets, with a total of 34 specific inaccuracies, including missing information about sedation for the procedure (10/34), inaccuracies related to specific procedural-related complications (8/34). A key-word co-occurrence network showed consistent themes within each group of pamphlets, while a line-by-line comparison at the level of users and across different procedures showed statistically significant inconsistencies (P < 0.001). Conclusion: ChatGPT-generated education pamphlets demonstrated potential clinical relevance and fairly consistent terminology; however, the pamphlets were not entirely accurate and exhibited some shortcomings and inter-user structural variabilities. To ensure patient safety, future improvements and refinements in large language models are warranted, while maintaining human supervision and expert validation.

Original languageEnglish (US)
Pages (from-to)4548-4553
Number of pages6
JournalAcademic radiology
Volume31
Issue number11
DOIs
StatePublished - Nov 2024

Keywords

  • Chat GPT
  • Co-occurrence network graph
  • Education
  • Interventional radiology
  • Large language models

ASJC Scopus subject areas

  • Radiology Nuclear Medicine and imaging

Fingerprint

Dive into the research topics of 'Evaluation of ChatGPT-Generated Educational Patient Pamphlets for Common Interventional Radiology Procedures'. Together they form a unique fingerprint.

Cite this