Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
NLG Evaluation on SumPubMed
Loading...
0.3
Spearman Correlation
GPT-4o
-0.1056
-0.0003
0.105
0.2103
Feb 9, 2026
Spearman Correlation
Updated 4d ago
Evaluation Results
Method
Method
Links
Spearman Correlation
GPT-4o
Prompting=Few-shot
2026.02
0.3
Mixtral
Prompting=Zero-shot
2026.02
0.3
Mixtral
Prompting=Few-shot
2026.02
0.28
Qwen
Prompting=Few-shot
2026.02
0.28
Llama
Prompting=Zero-shot
2026.02
0.24
Llama
Prompting=Few-shot
2026.02
0.2
Qwen
Prompting=Zero-shot
2026.02
0.19
GPT-4o
Prompting=Zero-shot
2026.02
0.14
GPT-4o-mini
Prompting=Few-shot
2026.02
0.06
GPT-4o-mini
Prompting=Zero-shot
2026.02
-0.09
Feedback
Search any
task
Search any
task