Robustness tests for biomedical foundation models should tailor to specification

11 Pages Posted: 19 Nov 2024 Last revised: 7 Jan 2025

See all articles by Patrick Xian

Patrick Xian

University of California, San Francisco (UCSF)

Noah R. Baker

University of California, San Francisco (UCSF)

Tom David

PRISM Eval

Qiming Cui

University of California, San Francisco (UCSF)

A. Jay Holmgren

University of California, San Francisco (UCSF)

Stefan Bauer

Technical University of Munich

Madhumita Sushil

University of California, San Francisco (UCSF)

Reza Abbasi-Asl

University of California, San Francisco (UCSF)

Date Written: January 06, 2025

Abstract

Existing regulatory frameworks for biomedical AI include robustness as a key component but lack detailed implementational guidance. The recent rise of biomedical foundation models creates new hurdles in testing and certification given their broad capabilities and susceptibility to complex distribution shifts. To balance test feasibility and effectiveness, we suggest a priority-based, task-oriented approach to tailor robustness evaluation objectives to a predefined specification. We urge concrete policies to adopt a granular categorization of robustness concepts in the specification. The approach promotes the standardization of risk assessment and monitoring and guides technical developments and mitigation efforts.

Keywords: foundation model, health AI, robustness, AI policy

JEL Classification: I18, C52

Suggested Citation

Xian, R. Patrick and Baker, Noah R. and David, Tom and Cui, Qiming and Holmgren, A. Jay and Bauer, Stefan and Sushil, Madhumita and Abbasi-Asl, Reza, Robustness tests for biomedical foundation models should tailor to specification (January 06, 2025). Available at SSRN: https://ssrn.com/abstract=5013799 or http://dx.doi.org/10.2139/ssrn.5013799

R. Patrick Xian (Contact Author)

University of California, San Francisco (UCSF) ( email )

Third Avenue and Parnassus
San Francisco, CA CA 94143
United States

Noah R. Baker

University of California, San Francisco (UCSF) ( email )

Tom David

PRISM Eval ( email )

Qiming Cui

University of California, San Francisco (UCSF) ( email )

A. Jay Holmgren

University of California, San Francisco (UCSF) ( email )

Stefan Bauer

Technical University of Munich ( email )

Madhumita Sushil

University of California, San Francisco (UCSF) ( email )

Reza Abbasi-Asl

University of California, San Francisco (UCSF) ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
71
Abstract Views
304
Rank
669,697
PlumX Metrics