Causal Variable Identification on MalAlgoQA
[Chart: top F1 (X) score is 84.1, GPT-5 (May 17, 2025). Available metrics: F1 (X), F1 (Z), F1 (M), F1 (Y).]
Updated 1mo ago
Evaluation Results
Method     Details                    Date     F1 (X)  F1 (Z)  F1 (M)  F1 (Y)
GPT-5      Setting=Decompositiona...  2025.05  84.1    81.3    79.2    83.4
GPT-o4     Setting=Decompositiona...  2025.05  81.5    78.9    76.8    80.4
Llama4-M   Setting=Decompositiona...  2025.05  80.6    78.2    77.8    82.4
Llama4-S   Setting=Decompositiona...  2025.05  79.5    77.1    76.6    81.2
DeepSeek   Setting=Decompositiona...  2025.05  75.9    73.4    63.8    72.3
Gemini2.5  Setting=Decompositiona...  2025.05  73.5    71.2    60.2    70.5
Qwen3      Setting=Decompositiona...  2025.05  72.9    70.6    61.5    69.3