Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Intrinsic Reasoning on EntailmentBank
Loading...
0.789
Spearman Correlation
Always Tell Me The Odds
0.49156
0.56878
0.646
0.72322
May 2, 2025
Spearman Correlation
Updated 1mo ago
Evaluation Results
Method
Method
Links
Spearman Correlation
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.789
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.787
DeepSeek-R1-Distill-Qwen-32B
Evaluation Protocol=0-...
2025.05
0.783
GPT-4o
Evaluation Protocol=0-...
2025.05
0.76
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.735
Always Tell Me The Odds
Backbone=Qwen2.5-7B-In...
2025.05
0.687
Always Tell Me The Odds
Backbone=Qwen2.5-8B-In...
2025.05
0.659
Llama-3-Instruct
Evaluation Protocol=Pr...
2025.05
0.558
RoBERTa-L
Type=Encoder
2025.05
0.503
Feedback
Search any
task
Search any
task