Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Empathetic Dialogue on MSD
Loading...
85.3
Success Rate (SR)
gemini-2.5-flash
73.756
76.753
79.75
82.747
Mar 17, 2026
Success Rate (SR)
Average Turns (AT)
Empathy Score (ES)
Effectiveness Score (EA)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Success Rate (SR)
Average Turns (AT)
Empathy Score (ES)
Effectiveness Score (EA)
gemini-2.5-flash
Type=Proprietary LLM API
2026.03
85.3
2.68
4.23
3.76
EmoLLM
Type=open-weight model
2026.03
83.2
2.86
4.17
3.71
gpt-5-mini
Type=Proprietary LLM API
2026.03
81.4
2.97
4.01
3.53
gemini-3.1-flash-lite
Type=Proprietary LLM API
2026.03
76.4
2.62
4.11
3.46
gpt-5-nano
Type=Proprietary LLM API
2026.03
74.2
3.84
4.12
3.57
Feedback
Search any
task
Search any
task