CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering
About
The recent advancements in artificial intelligence highlight the potential of language models in psychological health support. While models trained on data from mental health service platform have achieved preliminary success, challenges persist in areas such as data scarcity, quality, and ensuring a solid foundation in psychological techniques. To address these challenges, this study introduces a novel approach to enhance the precision and efficacy of psychological support through large language models. Specifically, we design a specific prompt derived from principles of Cognitive Behavioral Therapy (CBT) and have generated the CBT QA dataset, specifically for Chinese psychological health Q&A based on CBT structured intervention strategies. Unlike previous methods, our dataset emphasizes professional and structured response. Utilizing this dataset, we fine-tuned the large language model, giving birth to CBT-LLM, the large-scale language model specifically designed for Cognitive Behavioral Therapy techniques. Empirical evaluations demonstrate that CBT-LLM excels in generating structured, professional, and highly relevant responses in psychological health support tasks, showcasing its practicality and quality. The model is available on Hugging Face: https://huggingface.co/Hongbin37/CBT-LLM.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| CBT Conversation Generation | EmoLLM (test) | Bert Score0.89 | 20 | |
| Clinical Counseling Dialogue Evaluation | Session Rating Scale | Helpful Score (SRS)2.71 | 18 | |
| CBT Conversation Generation | CBT conversation evaluation dataset | Semantic Coherence1.74 | 10 | |
| Counselor Competence Assessment | Counseling Dialogues (test) | Understanding2.93 | 9 | |
| Clinical Counseling Dialogue Evaluation | Session Rating Scale Resistant | Helpful Rating2.32 | 9 | |
| Clinical Counseling Dialogue Evaluation | Session Rating Scale Engaged | Helpful Rating3.02 | 9 | |
| Psychological counseling evaluation | Core Evaluation Set N=180 (test) | Relevance4.25 | 5 | |
| Psychiatric dialogue evaluation | XInsight-Bench | Proficiency Score6.16 | 5 | |
| Psychological Counseling | Psychological Counseling Interaction | Average Turns1 | 4 |