Automatic Depression Detection: An Emotional Audio-Textual Corpus and a GRU/BiLSTM-based Model
About
Depression is a global mental health problem, the worst case of which can lead to suicide. An automatic depression detection system provides great help in facilitating depression self-assessment and improving diagnostic accuracy. In this work, we propose a novel depression detection approach utilizing speech characteristics and linguistic contents from participants' interviews. In addition, we establish an Emotional Audio-Textual Depression Corpus (EATD-Corpus) which contains audios and extracted transcripts of responses from depressed and non-depressed volunteers. To the best of our knowledge, EATD-Corpus is the first and only public depression dataset that contains audio and text data in Chinese. Evaluated on two depression datasets, the proposed method achieves the state-of-the-art performances. The outperforming results demonstrate the effectiveness and generalization ability of the proposed method. The source code and EATD-Corpus are available at https://github.com/speechandlanguageprocessing/ICASSP2022-Depression.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Depression Detection | DAIC-WOZ (dev) | F1 (avg)77.34 | 43 | |
| Depression Recognition | EATD | F1 Score71 | 17 | |
| Depression Detection | MODMA | F1 Score60.4 | 15 | |
| Depression Binary Classification | DepSeverity (test) | Accuracy84.6 | 10 | |
| Depression Binary Classification | MMDA (test) | Accuracy74.5 | 10 | |
| Depression Severity Estimation | MMDA (test) | Precision (Normal)38.1 | 10 | |
| Depression Binary Classification | DR (test) | Accuracy30.5 | 10 | |
| Depression Severity Estimation | DepSeverity | Precision (Normal)58.2 | 10 |