NLG Meta-evaluation on CUS-QA en cs
[Chart: Kendall Correlation over time. Top result: Llama 4 Scout, 0.73, Mar 10, 2026]
Evaluation Results
Method          Setting     Date      Kendall Correlation
Llama 4 Scout   Shot=Few    2026.03   0.73
Qwen 3 30B      Shot=Zero   2026.03   0.725
Llama 4 Scout   Shot=Zero   2026.03   0.656
Qwen 3 30B      Shot=Few    2026.03   0.651
Llama 3.3 70B   Shot=Few    2026.03   0.619
Llama 3.3 70B   Shot=Zero   2026.03   0.524