A Multimodal Fusion Model with Multi-Level Attention Mechanism for Depression Detection

22 Pages Posted: 26 Jul 2022

See all articles by Ming Fang

Ming Fang

Northeast Normal University

Siyu Peng

Northeast Normal University

Yujia Liang

Northeast Normal University

Chih-Cheng Hung

Kennesaw State University

Shuhua Liu

Northeast Normal University - School of Information Science and Technology

Multiple version iconThere are 2 versions of this paper

Abstract

Depression is a common mental illness that affects the physical and mental health of hundreds of millions of people around the world. Therefore, designing an efficient and robust depression detection model is an urgent research task. In order to fully extract depression features, we systematically analyze audio-visual and text data related to depression, and proposes a multimodal fusion model with multi-level attention mechanism (MFM-Att) for depression detection. The method is mainly divided into two stages: the first stage utilizes two LSTMs and a Bi-LSTM with attention mechanism to learn multi-view audio feature, visual feature and rich text feature, respectively. In the second stage, the output features of the three modalities are sent into the attention fusion network (AttFN) to obtain effective depression information, aiming to make use of the diversity and complementarity between modalities for depression detection. It is worth noting that the multi-level attention mechanism can not only extract valuable depressive features of intra-modality, but also learn the correlations of inter-modality, thereby improving the overall performance of the model by reducing the influence of redundant information. MFM-Att model is evaluated on the DAIC-WOZ dataset, and the result outperforms state-of-the-art models in terms of root mean square error (RMSE).

Keywords: Depression detection, Multimodal, Attention Mechanism

Suggested Citation

Fang, Ming and Peng, Siyu and Liang, Yujia and Hung, Chih-Cheng and Liu, Shuhua, A Multimodal Fusion Model with Multi-Level Attention Mechanism for Depression Detection. Available at SSRN: https://ssrn.com/abstract=4172609 or http://dx.doi.org/10.2139/ssrn.4172609

Ming Fang

Northeast Normal University ( email )

Changchun
China

Siyu Peng

Northeast Normal University ( email )

Changchun
China

Yujia Liang

Northeast Normal University ( email )

Changchun
China

Chih-Cheng Hung

Kennesaw State University ( email )

1000 Chastain Rd
Kennesaw, GA 30144
United States

Shuhua Liu (Contact Author)

Northeast Normal University - School of Information Science and Technology ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
246
Abstract Views
367
Rank
133,403
PlumX Metrics