Machine Learning from a "Universe" of Signals: The Role of Feature Engineering 

64 Pages Posted: 9 May 2025

See all articles by Bin Li

Bin Li

Wuhan University

Alberto G. Rossi

Georgetown University - McDonough School of Business

Xuemin Sterling Yan

Lehigh University - College of Business

Lingling Zheng

Renmin University of China

Date Written: January 01, 2021

Abstract

We construct real-time machine learning strategies based on a "universe" of fundamental signals. The out-of-sample performance of these strategies is economically meaningful and statistically significant, but considerably weaker than those documented by prior studies that use curated sets of signals as predictors. Strategies based on a simple recursive ranking of each signal's past performance also yield substantially better out-of-sample performance. We find qualitatively similar results when examining past-return-based signals. Our results underscore the key role of feature engineering and, more broadly, inductive biases in enhancing the economic benefits of machine learning investment strategies.

Suggested Citation

Li, Bin and Rossi, Alberto G. and Yan, Xuemin Sterling and Zheng, Lingling, Machine Learning from a "Universe" of Signals: The Role of Feature Engineering  (January 01, 2021). Available at SSRN: https://ssrn.com/abstract=5248179 or http://dx.doi.org/10.2139/ssrn.5248179

Bin Li

Wuhan University ( email )

Economics and Management School
Wuhan University
Wuhan, Hubei 430072
China

HOME PAGE: http://libinli.com

Alberto G. Rossi (Contact Author)

Georgetown University - McDonough School of Business ( email )

3700 O Street, NW
Washington, DC 20057
United States

Xuemin Sterling Yan

Lehigh University - College of Business ( email )

Bethlehem, PA 18015
United States

Lingling Zheng

Renmin University of China ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
160
Abstract Views
381
Rank
401,752
PlumX Metrics