Logical Deduction on BBH-LD
Best result: 87.1 BCA, Qwen2.5-7B (May 12, 2026)
Updated 21d ago
Evaluation Results
| Method | Model Size | Date | BCA |
| Qwen2.5-7B | 7B | 2026.05 | 87.1 |
| Qwen2.5-32B | 32B | 2026.05 | 82.7 |
| Gemma 3 27B | 27B | 2026.05 | 81.9 |
| DS-R1-7B | 7B | 2026.05 | 77.5 |
| Qwen 3.5 9B | 9B | 2026.05 | 77.4 |
| Qwen2.5-14B | 14B | 2026.05 | 70.4 |
| Llama 3.1 8B | 8B | 2026.05 | 66.9 |
| DS-R1-32B | 32B | 2026.05 | 53.8 |
| DS-R1-14B | 14B | 2026.05 | 48.6 |