Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Insight-level Evaluation on SCOpE-QA Reinforcement Learning collection

4.52Insight-level Score

INSIGHTGEN

1.1922.0562.923.784Apr 21, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
4.52
2026.04
4.18
2026.04
3.82
2026.04
3.72
2026.04
3.41
2026.04
3.35
2026.04
3.34
2026.04
3.31
2026.04
3.2
2026.04
3.2
2026.04
2.92
2026.04
2.85
2026.04
2.54
2026.04
2.51
2026.04
2.49
2026.04
2.27
2026.04
2.11
2026.04
1.7
2026.04
1.32
2026.04
1.32