Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Intrinsic Reasoning on UNLI
Loading...
0.813
Spearman Correlation
Always Tell Me The Odds
0.62164
0.67132
0.721
0.77068
May 2, 2025
Spearman Correlation
Updated 1mo ago
Evaluation Results
Method
Method
Links
Spearman Correlation
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.813
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.812
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.812
Always Tell Me The Odds
Backbone=Qwen2.5-8B-In...
2025.05
0.804
Always Tell Me The Odds
Backbone=Qwen2.5-7B-In...
2025.05
0.802
RoBERTa-L
Type=Encoder
2025.05
0.707
GPT-4o
Evaluation Protocol=0-...
2025.05
0.699
Llama-3-Instruct
Evaluation Protocol=Pr...
2025.05
0.681
DeepSeek-R1-Distill-Qwen-32B
Evaluation Protocol=0-...
2025.05
0.629
Feedback
Search any
task
Search any
task