Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EQ-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Emotional IntelligencePolish EQ-Bench
Overall Score78.07
106
Long-form writingEQ-Bench Long-form writing (leaderboard)
Score0.798
23
Reverse Chain-of-Thought GenerationEQ-Bench 3
Score0.896
20
Emotional Intelligence evaluationEQ-Bench
Overall Score86.4
15
Emotional Intelligence EvaluationEQ-Bench3
DoI13.8
8
Creative Writing EvaluationEQ-Bench Creative Writing v3
EQ-Bench83.8
6
Empathetic response generationEQ-Bench 3
Metric-
0
Showing 7 of 7 rows