Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Task-Focused Dialogue on TmallBrand-B
Loading...
3.761
G-Eval Score
GOPO-Qwen3-14B
3.49996
3.56773
3.6355
3.70327
Jan 24, 2026
G-Eval Score
Updated 4d ago
Evaluation Results
Method
Method
Links
G-Eval Score
GOPO-Qwen3-14B
Number of parameters=14B
2026.01
3.761
Qwen-235B
Number of parameters=235B
2026.01
3.747
DeepSeek-R1
2026.01
3.745
Gemini-2.5
2026.01
3.534
GPT-5.2
2026.01
3.517
GLM-4.7
2026.01
3.51
Feedback
Search any
task
Search any
task