Mortality Forecasting Using Stacked Regression Ensembles

33 Pages Posted: 12 Apr 2021 Last revised: 8 Oct 2021

See all articles by Salvatory KESSY

Salvatory KESSY

University of New South Wales

Michael Sherris

UNSW Business School

Andrés Villegas

University of New South Wales (UNSW) - ARC Centre of Excellence in Population Ageing Research (CEPAR)

Jonathan Ziveyi

University of New South Wales; ARC Centre of Excellence in Population Ageing Research and School of Risk & Actuarial Studies

Date Written: April 10, 2021

Abstract

There are many alternative approaches to selecting mortality models and forecasting mortality. The standard practice is to produce forecasts using a single model such as the Lee-Carter, the Cairns-Blake-Dowd, or the Age- Period-Cohort model, with model selection based on in-sample goodness of fit measures. However, increasingly cross-validation measures based on forecasts are used in mortality model selection, and model combination methods such as Bayesian Model Averaging and Model Confidence Set have been proposed as alternatives to using a single model. We present a stacked regression ensemble method that optimally combines different mortality models to reduce the mean squared errors of mortality rate forecasts and mitigate model selection risk. Stacked regression uses a supervised machine learning algorithm to approximate the horizon-specific weights by minimizing the cross-validation criterion for each forecasting horizon. The horizon-specific weights facilitate the development of a mortality model combination customized to each horizon. Unlike other model combination methods, stacked regression simultaneously solves model selection and estimates model combinations to improve model forecasts. We use 44 populations from the Human Mortality Database to compare the stacked regression mortality approach with other alternative methods. We show that using one-year-ahead to 15−year-ahead out-of-sample mean squared errors, the stacked regression ensemble approach improves mortality forecast accuracy by 13% - 49% and 19% - 90% for females over individual mortality models. The stacked regression ensemble has better predictive accuracy than other model combination methods, including Simple Model Averaging, Bayesian Model Averaging, and Model Confidence Set. We provide a user-friendly open-source R package, CoMoMo, that estimates models and provides mortality rate forecasts for different mortality model combination methods.

Keywords: Stacked regression, ensemble learning, cross-validation, model uncertainty, model combination, age-period-cohort model, mortality forecasting

JEL Classification: G22, C02, C10, C31, C45, C52, C88

Suggested Citation

KESSY, Salvatory and Sherris, Michael and Villegas, Andrés and Ziveyi, Jonathan, Mortality Forecasting Using Stacked Regression Ensembles (April 10, 2021). Available at SSRN: https://ssrn.com/abstract=3823511 or http://dx.doi.org/10.2139/ssrn.3823511

Salvatory KESSY (Contact Author)

University of New South Wales ( email )

Sydney, New South Wales
Australia

Michael Sherris

UNSW Business School ( email )

Sydney, NSW 2052
Australia

Andrés Villegas

University of New South Wales (UNSW) - ARC Centre of Excellence in Population Ageing Research (CEPAR) ( email )

Level 6, Central Lobby (enter via East Lobby)
Australian School of Business Building
Sydney, New South Wales NSW 2052
Australia

Jonathan Ziveyi

University of New South Wales; ARC Centre of Excellence in Population Ageing Research and School of Risk & Actuarial Studies ( email )

School of Risk and Actuarial Studies
UNSW Business School
Sydney, NSW 2000
Australia
+61 2 9065 8254 (Phone)
+61 2 9385 1883 (Fax)

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
381
Abstract Views
1,642
Rank
142,786
PlumX Metrics