Abstract
Depression is a widespread mental health issue requiring efficient automated detection methods. Traditional single-modality approaches are less effective due to the disorder's complexity, leading to a focus on multimodal analysis. Recent advancements include transformer-based fusion methods, yet their application in depression detection is often limited by the dominant text modality. To address this, we propose the Text-Guided Multimodal Cross-Attention Transformer, enhancing cross-modal interactions between text, audio, and video for more effective depression detection. Our approach uniquely pre-trains encoders on a large sentiment dataset to better capture emotion-related features crucial for identifying depression-related sentiment changes. Our method demonstrates superior performance on the AVEC2019 benchmark, outperforming current state-of-the-art depression detection techniques.
| Original language | English |
|---|---|
| Title of host publication | 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 - Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798350371499 |
| DOIs | |
| Publication status | Published - 2024 |
| Event | 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 - Orlando, United States Duration: 15-07-2024 → 19-07-2024 |
Publication series
| Name | Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS |
|---|---|
| ISSN (Print) | 1557-170X |
Conference
| Conference | 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 |
|---|---|
| Country/Territory | United States |
| City | Orlando |
| Period | 15-07-24 → 19-07-24 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
All Science Journal Classification (ASJC) codes
- Signal Processing
- Biomedical Engineering
- Computer Vision and Pattern Recognition
- Health Informatics
Fingerprint
Dive into the research topics of 'A Sentiment Pre-trained Text-Guided Multimodal Cross-Attention Transformer for Improved Depression Detection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver