A Semiautonomous Deep Learning System to Reduce False Positives in Screening Mammography

  • Stefano Pedemonte
  • , Trevor Tsue
  • , Brent Mombourquette
  • , Yen Nhi Truong Vu
  • , Thomas Matthews
  • , Rodrigo Morales Hoil
  • , Meet Shah
  • , Nikita Ghare
  • , Naomi Zingman-Daniels
  • , Susan Holley
  • , Catherine M. Appleton
  • , Jason Su
  • , Richard L. Wahl

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Purpose: To evaluate the ability of a semiautonomous artificial intelligence (AI) model to identify screening mammograms not suspicioufor breast cancer and reduce the number of false-positive examinations. Materials and Methods: The deep learning algorithm was trained using 123 248 two-dimensional digital mammograms (6161 cancers) and a retrospective study was performed on three nonoverlapping datasets of 14 831 screening mammography examinations (1026 cancers) from two U.S. institutions and one U.K. institution (2008–2017). The stand-alone performance of humans and AI was compared. Human plus AI performance was simulated to examine reductions in the cancer detection rate, number of examinations, false-positive callbacks, and benign biopsies. Metrics were adjusted to mimic the natural distribution of a screening population, and bootstrapped CIs and P values were calculated. Results: Retrospective evaluation on all datasets showed minimal changes to the cancer detection rate with use of the AI device (noninferiority margin of 0.25 cancers per 1000 examinations: U.S. dataset 1, P = .02; U.S. dataset 2, P < .001; U.K. dataset, P < .001). On U.S. dataset 1 (11 592 mammograms; 101 cancers; 3810 female patients; mean age, 57.3 years ± 10.0 [SD]), the device reduced screening examinations requiring radiologist interpretation by 41.6% (95% CI: 40.6%, 42.4%; P < .001), diagnostic examinations callbacks by 31.1% (95% CI: 28.7%, 33.4%; P < .001), and benign needle biopsies by 7.4% (95% CI: 4.1%, 12.4%; P < .001). U.S. dataset 2 (1362 mammograms; 330 cancers; 1293 female patients; mean age, 55.4 years ± 10.5) was reduced by 19.5% (95% CI: 16.9%, 22.1%; P < .001), 11.9% (95% CI: 8.6%, 15.7%; P < .001), and 6.5% (95% CI: 0.0%, 19.0%; P = .08), respectively. The U.K. dataset (1877 mammograms; 595 cancers; 1491 female patients; mean age, 63.5 years ± 7.1) was reduced by 36.8% (95% CI: 34.4%, 39.7%; P < .001), 17.1% (95% CI: 5.9%, 30.1%: P < .001), and 5.9% (95% CI: 2.9%, 11.5%; P < .001), respectively. Conclusion: This work demonstrates the potential of a semiautonomous breast cancer screening system to reduce false positives, unnecessaprocedures, patient anxiety, and medical expenses.

Original languageEnglish
Article numbere230033
JournalRadiology: Artificial Intelligence
Volume6
Issue number3
DOIs
StatePublished - May 2024

Keywords

  • Artificial Intelligence
  • Breast Cancer
  • Screening Mammography
  • Semiautonomous Deep Learning

Fingerprint

Dive into the research topics of 'A Semiautonomous Deep Learning System to Reduce False Positives in Screening Mammography'. Together they form a unique fingerprint.

Cite this