A multimodal fusion model with multi-level attention mechanism for depression detection. (April 2023)
- Record Type:
- Journal Article
- Title:
- A multimodal fusion model with multi-level attention mechanism for depression detection. (April 2023)
- Main Title:
- A multimodal fusion model with multi-level attention mechanism for depression detection
- Authors:
- Fang, Ming
Peng, Siyu
Liang, Yujia
Hung, Chih-Cheng
Liu, Shuhua - Abstract:
- Highlights: A multimodal depression detection model (MFM-Att) is proposed. MFM-Att combines audio, visual and textual modality. Multi-level attention extracts effective feature of intra and inter modality. Abstract: Depression is a common mental illness that affects the physical and mental health of hundreds of millions of people around the world. Therefore, designing an efficient and robust depression detection model is an urgent research task. In order to fully extract depression features, we systematically analyze audio-visual and text data related to depression, and proposes a multimodal fusion model with multi-level attention mechanism (MFM-Att) for depression detection. The method is mainly divided into two stages: the first stage utilizes two LSTMs and a Bi-LSTM with attention mechanism to learn multi-view audio feature, visual feature and rich text feature, respectively. In the second stage, the output features of the three modalities are sent into the attention fusion network (AttFN) to obtain effective depression information, aiming to make use of the diversity and complementarity between modalities for depression detection. It is worth noting that the multi-level attention mechanism can not only extract valuable depressive features of intra-modality, but also learn the correlations of inter-modality, thereby improving the overall performance of the model by reducing the influence of redundant information. MFM-Att model is evaluated on the DAIC-WOZ dataset, and theHighlights: A multimodal depression detection model (MFM-Att) is proposed. MFM-Att combines audio, visual and textual modality. Multi-level attention extracts effective feature of intra and inter modality. Abstract: Depression is a common mental illness that affects the physical and mental health of hundreds of millions of people around the world. Therefore, designing an efficient and robust depression detection model is an urgent research task. In order to fully extract depression features, we systematically analyze audio-visual and text data related to depression, and proposes a multimodal fusion model with multi-level attention mechanism (MFM-Att) for depression detection. The method is mainly divided into two stages: the first stage utilizes two LSTMs and a Bi-LSTM with attention mechanism to learn multi-view audio feature, visual feature and rich text feature, respectively. In the second stage, the output features of the three modalities are sent into the attention fusion network (AttFN) to obtain effective depression information, aiming to make use of the diversity and complementarity between modalities for depression detection. It is worth noting that the multi-level attention mechanism can not only extract valuable depressive features of intra-modality, but also learn the correlations of inter-modality, thereby improving the overall performance of the model by reducing the influence of redundant information. MFM-Att model is evaluated on the DAIC-WOZ dataset, and the result outperforms state-of-the-art models in terms of root mean square error (RMSE). … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 82(2023)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 82(2023)
- Issue Display:
- Volume 82, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 82
- Issue:
- 2023
- Issue Sort Value:
- 2023-0082-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-04
- Subjects:
- Depression detection -- Multimodal -- Attention mechanism
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.104561 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26009.xml