SocialLM: Social Signal Processing of Patient-Provider Communication using LLMs and Contextual Aggregation
About
Effective patient-provider communication is difficult to assess at scale. We examine whether large language models (LLMs) can track 20 social behaviors from clinical transcripts without fine-tuning. Across three model families and multiple prompting strategies, LLMs reliably detect social signals, though performance varies by patient race and visit segment. To address this variability under query-only API constraints, we introduce an agreement-weighted ensemble using group-level agreement patterns. This approach improves both accuracy and stability over the best individual model, demonstrating a practical pathway for scalable social signal tracking in clinical conversations.
Manas Satish Bedmutha, Feng Chen, Andrea Hartzler, Trevor Cohen, Nadir Weibel• 2025
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Social Signal Inference | EF dataset | Provider Dominance0.603 | 11 | |
| Social Signal Classification | SocialLM provider-dominance | Balanced Accuracy60.9 | 3 | |
| Social Signal Classification | SocialLM provider-warmth | Balanced Accuracy62.6 | 3 | |
| Social Signal Classification | SocialLM provider-engagement | Balanced Accuracy63.2 | 3 | |
| Social Signal Classification | SocialLM provider-empathy | Balanced Accuracy67.6 | 3 | |
| Social Signal Classification | SocialLM provider-respect | Balanced Accuracy63.5 | 3 | |
| Social Signal Classification | SocialLM provider-interactivity | Balanced Accuracy67.3 | 3 | |
| Social Signal Classification | SocialLM patient-attentiveness | Balanced Accuracy61.6 | 3 | |
| Social Signal Classification | SocialLM patient-engagement | Balanced Accuracy62.7 | 3 | |
| Social Signal Classification | SocialLM patient-respect | Balanced Accuracy75.4 | 3 |
Showing 10 of 21 rows