DepressionLLM: Emotion- and causality-aware depression detection with foundation models

  • Shiyu Teng
  • , Jiaqing Liu
  • , Hao Sun
  • , Yue Huang
  • , Rahul Kumar Jain
  • , Shurong Chai
  • , Ruibo Hou
  • , Tomoko Tateyama
  • , Lanfen Lin
  • , Lang He
  • , Yen Wei Chen

Research output: Contribution to journalArticlepeer-review

Abstract

Depression is a complex mental health issue often reflected through subtle multimodal signals in speech, facial expressions, and language. However, existing approaches using large language models (LLMs) face limitations in integrating these diverse modalities and providing interpretable insights, restricting their effectiveness in real-world and clinical settings. This study presents a novel framework that leverages foundation models for interpretable multimodal depression detection. Our approach follows a three-stage process: First, pseudo-labels enriched with emotional and causal cues are generated using a pretrained language model (GPT-4o), expanding the training signal beyond ground-truth labels. Second, a coarse-grained learning phase employs another model (Qwen2.5) to capture relationships among depression levels, emotional states, and inferred reasoning. Finally, a fine-grained tuning stage fuses video, audio, and text inputs via a multimodal prompt fusion module to construct a unified depression representation. We evaluate our framework on benchmark datasets – E-DAIC, CMDC, and EATD – demonstrating consistent improvements over state-of-the-art methods in both depression detection and causal reasoning tasks. By integrating foundation models with multimodal video understanding, our work offers a robust and interpretable solution for mental health analysis, contributing to the advancement of multimodal AI in clinical and real-world applications.

Original languageEnglish
Article number103304
JournalDisplays
Volume92
DOIs
Publication statusPublished - 04-2026
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Human-Computer Interaction
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'DepressionLLM: Emotion- and causality-aware depression detection with foundation models'. Together they form a unique fingerprint.

Cite this