Age, race and density status influence AI performance on mammogram reads

Hannah Murphy | May 21, 2024 | Health Imaging | Artificial Intelligence

Doctors have increasingly been seeing breast exams with swollen lymph nodes imitating cancer in patients who have received a vaccine, prompting Penn Medicine providers to offer up guidance. mammography mammogram breast cancer

Certain patient demographic characteristics could affect the outcomes of AI-interpreted breast cancer screenings, a new study suggests.

Screening mammography is an area where artificial intelligence has great potential to reduce radiologists’ workloads. And though studies have shown it to be effective as a support tool, several have also highlighted issues related to the potential for bias in algorithms that have not been trained on diverse datasets.

This issue was again brought to light in this most recent study, published in the journal Radiology.

“There are few demographically diverse databases for AI algorithm training, and the FDA does not require diverse datasets for validation,” Derek L. Nguyen, MD, assistant professor at Duke University in Durham, North Carolina, said in a release on the findings. “Because of the differences among patient populations, it’s important to investigate whether AI software can accommodate and perform at the same level for different patient ages, races and ethnicities.”

For the study, experts tested for bias using an algorithm approved the by the U.S. Food and Drug Administration for generating breast cancer risk scores based on mammographic findings. They included nearly 5,000 cases with confirmed negative screenings to see if the algorithm would flag any as suspicious.

“Our goal was to evaluate whether an AI algorithm’s performance was uniform across age, breast density types and different patient race/ethnicities,” Nguyen said.

The team found that the algorithm was significantly more likely to flag the imaging of Black women as having suspicious findings. These false positives were also more common among older women between 71-80 and in those who had extremely dense breasts.

Asian women and younger women (ages 41-50) had fewer false positives in comparison to white women and those in their 50s and 60s.

“This study is important because it highlights that any AI software purchased by a healthcare institution may not perform equally across all patient ages, races/ethnicities and breast densities. Moving forward, I think AI software upgrades should focus on ensuring demographic diversity,” Nguyen said, adding that institutions looking to purchase similar software need to take their patients’ demographics into consideration before doing so.

“Having a baseline knowledge of your institution’s demographics and asking the vendor about the ethnic and age diversity of their training data will help you understand the limitations you’ll face in clinical practice,” he said.

The study abstract is available here.

FDA adds more than 120 new AI-enabled medical devices focused on radiology to list of approvals

Radiographers are apprehensive about integrating AI into their workflow

GPT-4 now has vision—can it actually read chest X-rays?

Hannah Murphy

In addition to her background in journalism, Hannah also has patient-facing experience in clinical settings, having spent more than 12 years working as a registered rad tech. She began covering the medical imaging industry for Innovate Healthcare in 2021.

Around the web

Cardiovascular Business

Bracco updates HeartSee coronary flow capacity software with new diagnostic features

Clinicians have been using HeartSee to diagnose and treat coronary artery disease since the technology first debuted back in 2018. These latest updates, set to roll out to existing users, are designed to improve diagnostic performance and user access.

Cardiovascular Business

Key trends in diagnostic heart testing: CT on the rise as some traditional techniques fall out of favor

The cardiac technologies clinicians use for CVD evaluations have changed significantly in recent years, according to a new analysis of CMS data. While some modalities are on the rise, others are being utilized much less than ever before.

Cardiovascular Business

ASE updates recommendations for assessing right heart function in patients with pulmonary hypertension

The new guidelines were designed to ensure sonographers and other members of the heart team have the information they need to screen patients when appropriate and identify early warnings signs of PH.

Age, race and density status influence AI performance on mammogram reads

Related:

FDA adds more than 120 new AI-enabled medical devices focused on radiology to list of approvals

Radiographers are apprehensive about integrating AI into their workflow

GPT-4 now has vision—can it actually read chest X-rays?

Related Content

Around the web