Society of Interventional Radiology bests ChatGPT at informing patients—but contest reveals shortcomings on both sides

Dave Pearson | June 19, 2023 | Health Imaging | Artificial Intelligence

ChatGPT large language AI radiology patient information

Photo by Ray Shrewsberry via Unsplash

As a source of patient information, human-authored SIRweb.org beats ChatGPT on readability and, in a word, helpfulness. However, the website needs work on those scores too.

Harvard researchers at Beth Israel Deaconess Medical Center found as much when they organized the website’s content into questions and posed them to ChatGPT.

Analyzing the outputs for word and sentence count, accuracy, readability (per several established scales) and suitability for patient education (per the HHS’s PEMAT instrument), the team found ChatGPT produced materials that ran longer, contained more difficult words and posed a greater reading challenge than competing content on SIRweb.org.

Also hurting the AI’s performance were a lack of visual aids and a rate of incorrect or incomplete content of 11.5% (12 of 104 answers).

At the same time, the SIR website tied ChatGPT on accessibility: Both were written at a grade level higher than the guideline-recommended fifth or sixth grade.

Corresponding study author Colin McCarthy, MD, and colleagues had their work published June 15 in the Journal of Vascular and Interventional Radiology [1].

In total, the researchers analyzed more than 21,000 words. These included almost 8,000 from the website and more than 13,000 generated by ChatGPT (across 22 text passages).

Four of five readability scales judged ChatGPT harder to read than SIRweb.org, and PEMAT scored the former lower than the latter.

In their discussion, McCarthy and co-authors remark that, as patients and their caregivers continue turning to digital outlets offering health information, the medical community “should recognize that their first stop may not be a societal website or other sources of ground truth. As was seen in this study, such sources may themselves benefit from improvements, specifically to ensure the content is understandable by the majority of readers, ensuring equitable access to healthcare information.”

The authors note the likelihood that additional tests of generative AI’s utility for patient education will soon follow.

That’s a good thing, given the speed at which large-language AI has evolved and captured the public’s imagination since ChatGPT’s introduction last fall.

“We propose that the use of existing, validated instruments such as those outlined herein may serve as a framework for future research in this field,” McCarthy and co-authors write.

For now, the early indicators suggest that this technology may have potential use cases for both physicians and patients alike. However, current versions of [ChatGPT] may produce incomplete or inaccurate patient educational content, and therefore opportunities may exist to develop customized chatbots for patient education, based on finetuning existing large language models.”

Abstract here, full text behind paywall.

ChatGPT effectively simplifies radiology reports, presents 'real opportunity' to better inform patients

ChatGPT's radiology board success has experts rethinking resident education

Google's latest large language model is poised to give ChatGPT a run for its money in imaging

ChatGPT helps radiologist churn out 16 papers in 4 months

ChatGPT to be utilized in new medical imaging app for patients

'Fictitious references' and 'significant inaccuracies' could hinder ChatGPT's medical writing career

ChatGPT offers 'pretty amazing' recommendations on breast cancer screening, but oversight remains critical

Dave Pearson

Dave P. has worked in journalism, marketing and public relations for more than 30 years, frequently concentrating on hospitals, healthcare technology and Catholic communications. He has also specialized in fundraising communications, ghostwriting for CEOs of local, national and global charities, nonprofits and foundations.

Around the web

Radiology Business

The impact of Trump tariffs on iodine contrast media costs

GE HealthCare said the price of iodine contrast increased by more than 200% between 2017 to 2023. Will new Chinese tariffs drive costs even higher?

Cardiovascular Business

COVID-19 linked to accelerated plaque growth, long-term risk of heart attack or stroke

These risks appear to be present regardless of a person's age or health at the time of infection.