Causal Reasoning on Com2
[Chart: Accuracy. Highlighted entry: Qwen3-32B, 79.8. Date shown: May 24, 2026.]
Updated 8d ago
Evaluation Results
| Method | Method details | Date | Accuracy |
|---|---|---|---|
| Qwen3-32B | Base Model=Qwen3-32B | 2026.05 | 79.8 |
| Olmo3.1-32B-Instruct | Base Model=Olmo3.1-32B... | 2026.05 | 79.6 |
| UNICO | Base Model=Qwen3-8B, D... | 2026.05 | 78.3 |
| Original | Base Model=Olmo3-7B-In... | 2026.05 | 77.6 |
| CauGym | Base Model=Qwen3-8B, D... | 2026.05 | 77.2 |
| UNICO | Base Model=Olmo3-7B-In... | 2026.05 | 76.7 |
| CauGym | Base Model=Olmo3-7B-In... | 2026.05 | 76.5 |
| CDCR | Base Model=Olmo3-7B-In... | 2026.05 | 75.7 |
| Original | Base Model=Qwen3-8B, D... | 2026.05 | 75.5 |
| UNICO | Base Model=Qwen3-4B, D... | 2026.05 | 74.6 |
| Original | Base Model=Qwen3-4B, D... | 2026.05 | 72.9 |
| CauGym | Base Model=Qwen3-4B, D... | 2026.05 | 72.6 |
| CDCR | Base Model=Qwen3-8B, D... | 2026.05 | 70.2 |
| CDCR | Base Model=Qwen3-4B, D... | 2026.05 | 69.5 |