
Young-Tak Kim, PhD
A major challenge in integrating artificial intelligence (AI) into clinical settings is understanding when it is safe to trust AI-based decisions.
Traditional metrics for evaluating AI performance, such as accuracy, often do not address critical aspects of operational safety (i.e., meeting pre-specified reliability targets for rule-in and rule-out decisions), which leads to hesitance to adopt such technologies.
To overcome this barrier, a new study led by Young-Tak Kim, PhD, and Synho Do, PhD, of the Department of Radiology at Mass General Brigham, introduces the Safety-Aware Receiver Operating Characteristic (SA-ROC) framework.
This tool shows providers when it is, and is not, appropriate to trust AI to help make clinical decisions, while also identifying a “Gray Zone” of cases that require human review.
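The paper's full SA-ROC construction is more involved, but the core idea of setting score thresholds against pre-specified safety targets can be sketched in a few lines of Python. Everything below is an illustrative assumption rather than the authors' implementation: the function name gray_zone_bounds, the target values, and the synthetic data are all hypothetical.

```python
import numpy as np

def gray_zone_bounds(scores, labels, rule_out_sens=0.99, rule_in_spec=0.95):
    """Illustrative sketch: pick score thresholds that meet pre-specified
    safety targets, leaving an intermediate "Gray Zone" for human review.

    scores : model probabilities for the positive class
    labels : ground-truth binary labels (1 = disease present)
    rule_out_sens : minimum sensitivity the automated rule-out must preserve
    rule_in_spec  : minimum specificity the automated rule-in must preserve
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    pos, neg = scores[labels == 1], scores[labels == 0]

    # Rule-out threshold: cases scoring below it are auto-cleared, while at
    # least `rule_out_sens` of true positives still score above it.
    t_low = np.quantile(pos, 1.0 - rule_out_sens)

    # Rule-in threshold: cases scoring above it are auto-flagged, while at
    # least `rule_in_spec` of true negatives still score below it.
    t_high = np.quantile(neg, rule_in_spec)

    # Note: a weak model can yield t_low > t_high, meaning no case can be
    # safely automated at these targets and everything needs human review.
    return t_low, t_high

# Example on synthetic validation data: triage cases into automated
# rule-out, the Gray Zone (human review), and automated rule-in.
rng = np.random.default_rng(0)
labels = rng.integers(0, 2, 1000)
scores = np.clip(rng.normal(0.3 + 0.4 * labels, 0.15), 0, 1)
t_low, t_high = gray_zone_bounds(scores, labels)
gray_fraction = np.mean((scores >= t_low) & (scores <= t_high))
print(f"rule-out < {t_low:.2f}, rule-in > {t_high:.2f}, "
      f"Gray Zone covers {gray_fraction:.0%} of cases")
```

The size of the Gray Zone is what distinguishes two models with similar ROC curves: a model whose Gray Zone shrinks under strict safety targets permits more safe automation than one whose Gray Zone balloons.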
The researchers examined two FDA-cleared AI algorithms used for cancer screening. Surprisingly, the model with better standard performance metrics proved less safe for clinical use under the most stringent safety requirements than the one with slightly poorer metrics, revealing that accuracy metrics alone can be misleading.
By offering a clearer understanding of how AI models operate in real-world scenarios, the SA-ROC framework could ultimately improve patient care and reduce physician workload with safer automation.
Published in npj Digital Medicine on February 20, 2026 | Read the paper: “Defining Operational Safety in Clinical Artificial Intelligence Systems”
Summary reviewed by: Synho Do, PhD, senior author
