NLG Meta-evaluation on CUS-QA orig. (cs)
[Chart: Kendall Correlation by date. Top result: Qwen 3 30B, 0.804 (Mar 10, 2026).]
Evaluation Results
Method           Shot    Date       Kendall Correlation
Qwen 3 30B       Zero    2026.03    0.804
Llama 3.3 70B    Few     2026.03    0.757
Llama 4 Scout    Few     2026.03    0.751
Qwen 3 30B       Few     2026.03    0.693
Llama 4 Scout    Zero    2026.03    0.688
Llama 3.3 70B    Zero    2026.03    0.672
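
For readers unfamiliar with the metric, here is a minimal sketch of how a Kendall correlation between an automatic evaluator's scores and human ratings could be computed. The tau-b variant, the function name kendall_tau_b, and the toy scores are assumptions for illustration only; this page does not state which variant or which data the benchmark uses.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(xs, ys):
    """Kendall tau-b between two equal-length score lists (hypothetical helper)."""
    if len(xs) != len(ys):
        raise ValueError("score lists must have the same length")

    concordant = discordant = ties_x_only = ties_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both lists: contributes to neither count
        if dx == 0:
            ties_x_only += 1
        elif dy == 0:
            ties_y_only += 1
        elif dx * dy > 0:
            concordant += 1  # both lists order this pair the same way
        else:
            discordant += 1  # the lists order this pair in opposite ways

    untied = concordant + discordant
    denom = sqrt((untied + ties_x_only) * (untied + ties_y_only))
    if denom == 0:
        return float("nan")  # undefined when one list is constant
    return (concordant - discordant) / denom


# Toy data (made up, not taken from the benchmark)
human_ratings = [1, 2, 3, 4, 5]
metric_scores = [0.1, 0.4, 0.3, 0.8, 0.9]
tau = kendall_tau_b(human_ratings, metric_scores)
```

In this toy case one of the ten pairs is ordered differently by the two lists, so the correlation is (9 - 1) / 10 = 0.8. A higher value in the table means the evaluator ranks outputs more like the human judges do.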