Replacing Cross-Validation with Interrogation: A Universal Test for Underfitting and Overfitting

16 Pages Posted: 7 May 2025 Last revised: 1 May 2025

See all articles by Huili Song

Huili Song

State Street Global Markets - State Street Associates

Megan Czasonis

State Street Corporate

David Turkington

State Street Associates

Yin Li

State Street Global Markets - State Street Associates

Date Written: April 23, 2025

Abstract

Gauging the reliability of a prediction routine for use outside its training data is a fundamental challenge in machine learning. Model training typically relies on cross-validation to avoid overfitting, whereby alternate calibrations of a model are trained on subsamples of the available data and tested on the corresponding validation samples. However, cross-validation has inherent limitations: it cannot directly evaluate the model trained on all available data, sample slicing limits the statistical power of subsample training and testing, and it is computationally expensive. We propose an alternative approach based on interrogating the primary model trained on all available data. In our simulations, the interrogation-based method successfully identified near-optimal model calibrations without using any validation samples. Our model-agnostic method works by explicitly decomposing prediction logic into linear, nonlinear, pairwise, and high-order interaction components, and then testing for the statistical identifiability of these separate components in the presence of noise. Poorly calibrated models reveal statistical problems analogous to harmful collinearity in linear regression. Interrogation can be applied in a wide variety of contexts to evaluate model reliability.

Keywords: Model validation, Overfitting, Underfitting, Cross-validation, Neural network, Machine learning, Model interpretability, Agnostic model

JEL Classification: C51, C52

Suggested Citation

Song, Huili and Czasonis, Megan and Turkington, David and Li, Yin, Replacing Cross-Validation with Interrogation: A Universal Test for Underfitting and Overfitting (April 23, 2025). Available at SSRN: https://ssrn.com/abstract=5229414 or http://dx.doi.org/10.2139/ssrn.5229414

Huili Song (Contact Author)

State Street Global Markets - State Street Associates ( email )

140 Mt. Auburn St.
Cambridge, MA Cambridge 02138
United States

Megan Czasonis

State Street Corporate ( email )

1 Lincoln Street
Boston, MA 02111
United States

David Turkington

State Street Associates ( email )

United States

Yin Li

State Street Global Markets - State Street Associates ( email )

165 Tremont Street
Unit 705
Boston, MA 02111
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
68
Abstract Views
203
Rank
728,813
PlumX Metrics