Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Task-Focused Dialogue on TmallBrand-A
Loading...
3.757
G-Eval Score
GOPO-Qwen3-14B
3.51572
3.57836
3.641
3.70364
Jan 24, 2026
G-Eval Score
Updated 4d ago
Evaluation Results
Method
Method
Links
G-Eval Score
GOPO-Qwen3-14B
Number of parameters=14B
2026.01
3.757
Qwen-235B
Number of parameters=235B
2026.01
3.753
DeepSeek-R1
2026.01
3.728
Gemini-2.5
2026.01
3.653
GPT-5.2
2026.01
3.596
GLM-4.7
2026.01
3.525
Feedback
Search any
task
Search any
task