Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Causal Variable Identification on CRASS
Loading...
92.3
F1 (X)
GPT-5
87.1
88.45
89.8
91.15
May 17, 2025
F1 (X)
F1 (Z)
F1 (M)
F1 (Y)
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 (X)
F1 (Z)
F1 (M)
F1 (Y)
GPT-5
Setting=Decompositiona...
2025.05
92.3
91.1
87.4
91.7
GPT-o4
Setting=Decompositiona...
2025.05
91
89.2
84.1
89.3
Llama4-M
Setting=Decompositiona...
2025.05
90.6
89.1
82.4
88.6
DeepSeek
Setting=Decompositiona...
2025.05
89.5
87.1
76.2
83.5
Gemini2.5
Setting=Decompositiona...
2025.05
88.6
86.2
74.1
81.6
Llama4-S
Setting=Decompositiona...
2025.05
88.5
86.9
81.2
87.4
Qwen3
Setting=Decompositiona...
2025.05
87.3
85.4
72.8
79.9
Feedback
Search any
task
Search any
task