Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Text Quality Meta-evaluation on SummEval & Topical-Chat Combined

69.5Overall Score

DeepSeek-V3

40.58848.09455.663.106Feb 17, 2025
Updated 4d ago

Evaluation Results

MethodLinks
69.5
2025.02
69.4
68.9
68.4
67.4
66.9
66.7
2025.02
65.8
65
62.5
59.7
59.6
59
2025.02
51.4
48.4
2025.02
41.7