Feature Selection in Jump Models
32 Pages Posted: 18 Mar 2021
Date Written: March 16, 2021
Jump models switch infrequently between states to fit a sequence of data while taking the ordering of the data into account. We propose a new framework for joint feature selection, parameter and state-sequence estimation in jump models. Feature selection is necessary in high-dimensional settings where the number of features is large compared to the number of observations and the underlying states differ only with respect to a subset of the features. We develop and implement a coordinate descent algorithm that alternates between selecting the features and estimating the model parameters and state sequence, which scales to large data sets with large numbers of (noisy) features. We demonstrate the usefulness of the proposed framework by comparing it with a number of other methods on both simulated and real data in the form of financial returns, protein sequences, and text. The resulting sparse jump model outperforms all other methods considered and is remarkably robust to noise.
Keywords: High-dimensional; sequential data; time series; clustering; unsupervised learning; regime switching
JEL Classification: C22, C38, C51, C58, C61, G11
Suggested Citation: Suggested Citation