Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Intrinsic Reasoning on circa
Loading...
0.747
Spearman Correlation
Always Tell Me The Odds
0.41732
0.50291
0.5885
0.67409
May 2, 2025
Spearman Correlation
Updated 1mo ago
Evaluation Results
Method
Method
Links
Spearman Correlation
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.747
GPT-4o
Evaluation Protocol=0-...
2025.05
0.734
DeepSeek-R1-Distill-Qwen-32B
Evaluation Protocol=0-...
2025.05
0.663
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.564
Llama-3-Instruct
Evaluation Protocol=Pr...
2025.05
0.553
Always Tell Me The Odds
Backbone=Qwen2.5-7B-In...
2025.05
0.544
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
0.536
Always Tell Me The Odds
Backbone=Qwen2.5-8B-In...
2025.05
0.474
RoBERTa-L
Type=Encoder
2025.05
0.43
Feedback
Search any
task
Search any
task