Opening the Black Box: An Interpretability Evaluation Framework Using Land-Use Probing Tasks to Understand Deep Learning Traffic Prediction Models
31 Pages Posted: 14 May 2022
Abstract
Emerging Machine Learning-based models are increasingly being deployed in critical traffic prediction problems. Due to the ‘black box’ nature of these models, there is an increasing emphasis on understanding their working process and analysing the physical meaning of their hidden layers. Most existing academic studies on Deep Learning (DL) tools in transportation engineering, however, are over-focusing on statistical accuracy, while little attention has been paid to analysing model interpretability. To address this limitation, we propose a novel two-step interpretability evaluation framework with land-use Probing Tasks to facilitate understanding the internal functions of a given DL-based traffic prediction model. This framework is built to understand if one or several hidden layers inside a DL-based traffic prediction model can spontaneously capture the spatial characteristics and if such spatial characteristics significantly affect the predicted outcome. In the case study, the interpretability of three types of self-supervised pre-training models (i.e., Auto-encoders, Variational Auto-encoders and Graph Convolutional Auto-encoders) is evaluated within the proposed framework. These DL models are trained and tested using parking and land-use data collected from the San Francisco Bay Area, California. The results of the experiments justify the reasonability of the proposed framework from both a global and a local perspective. All the land use indexes used in the testing are spontaneously captured in the hidden layers and the indexes of office and retail land use are the main contributors to prediction accuracy. Based on the measurements of spatial interpretability from different dimensions, we also provide a hidden layer-level post-hoc explanation for why the Graph Convolutional Network outperforms traditional DL-based models.
Keywords: Traffic prediction, Deep learning, Interpretability evaluation, land-use Probing tasks, Partial Dependence Plot
Suggested Citation: Suggested Citation