Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions
About
Creating effective dialogue systems for mental health support requires high-quality multi-turn counseling dialogue data, yet collecting real counselor-client conversations presents significant challenges, including privacy concerns, high costs, and limited scalability. We present \textbf{Interactive Agents}, a novel framework that simulates naturalistic counseling dialogues through controlled LLM-to-LLM interactions. The framework introduces two key innovations: (1) a personalized client agent that maintains consistent psychological characteristics throughout a session, and (2) a counselor agent that implements a theoretically grounded three-stage therapeutic model comprising the exploration, insight, and action phases. Through rigorous evaluation using both automatic metrics and professional-counselor assessments based on the Working Alliance Inventory, we demonstrate that our framework generates therapeutically valid dialogues that are comparable in quality to human-generated sessions. Models fine-tuned on our proposed synthetic dataset (SimPsyDial) achieve state-of-the-art performance in a standard pairwise chatbot-arena evaluation of LLM-based counselors. Our framework provides a scalable, privacy-preserving method for generating high-quality counseling dialogue data while maintaining professional therapeutic standards.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Emotional Support Dialogue | Cooperative Emotional Support Dialogue Average-case | Human-likeness4.54 | 17 | |
| Emotional Support Dialogue | Emotional Support Dialogue Worst-case 1.0 (test) | Human-likeness2.26 | 17 | |
| Seeker Simulation | D4 | Precision64.73 | 12 | |
| Psychological Counseling | CPsyCounR Multi-Session | Coherence (Coh)2.39 | 12 | |
| Psychological Counseling | CPsyCounR Single-Session | T.Alli1.424 | 12 | |
| Seeker Simulation | DAIC | Precision33.14 | 12 | |
| Psychological Counseling Dialogue Evaluation | Multi-response competition dataset | Total Upvotes808 | 6 | |
| Dialogue Authenticity Evaluation | PsyCLIENT-CP (test) | Fluency4.371 | 4 |