Causal Discovery on Asia (d=8, |E|=8) small-scale (test)
[Chart: Precision for AVICI (Baseline), 100 as of Jan 20, 2026. Available metrics: Precision, Recall, F1-Score, SHD.]
Evaluation Results
Method                   | Configuration             | Date    | Precision | Recall | F1-Score | SHD
-------------------------|---------------------------|---------|-----------|--------|----------|----
AVICI (Baseline)         | Base Algorithm=AVICI      | 2026.01 | 100       | 57.5   | 72.8     | 3.4
CauScientist (Qwen3-32B) | Base Algorithm=AVICI,...  | 2026.01 | 100       | 95     | 97.3     | 0.4
CauScientist (Qwen3-14B) | Base Algorithm=AVICI,...  | 2026.01 | 97.8      | 92.5   | 94.6     | 0.8
Qwen3-32B                | Evaluation Protocol=Ze... | 2026.01 | 90.6      | 92.5   | 91.5     | 1.4
CauScientist (Qwen3-32B) | Base Algorithm=FCI, LL... | 2026.01 | 83.1      | 92.5   | 87.5     | 2.2
CauScientist (Qwen3-14B) | Base Algorithm=FCI, LL... | 2026.01 | 76.5      | 87.5   | 81.3     | 3.2
Qwen3-14B                | Evaluation Protocol=Ze... | 2026.01 | 67.4      | 82.5   | 73.7     | 5
FCI (Baseline)           | Base Algorithm=FCI        | 2026.01 | 66.7      | 25     | 36.4     | 6
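For readers unfamiliar with the four reported metrics, the following is a minimal sketch of how edge-level Precision, Recall, F1-Score, and SHD (structural Hamming distance) are commonly computed for a predicted graph against the ground-truth Asia network. It is not the benchmark's own evaluation code: the function names `edge_metrics` and `shd` and the example prediction are hypothetical, and the SHD convention used here (a reversed edge counts as one error) is an assumption, since some implementations count it as two.

```python
from typing import Set, Tuple

Edge = Tuple[str, str]  # directed edge (parent, child)

# Ground-truth Asia network: 8 nodes, 8 directed edges.
ASIA_TRUE_EDGES: Set[Edge] = {
    ("asia", "tub"),
    ("smoke", "lung"),
    ("smoke", "bronc"),
    ("tub", "either"),
    ("lung", "either"),
    ("either", "xray"),
    ("either", "dysp"),
    ("bronc", "dysp"),
}


def edge_metrics(true_edges: Set[Edge], pred_edges: Set[Edge]) -> Tuple[float, float, float]:
    """Return (precision, recall, f1) over directed edges, as fractions in [0, 1]."""
    tp = len(true_edges & pred_edges)  # edges predicted with the correct direction
    precision = tp / len(pred_edges) if pred_edges else 0.0
    recall = tp / len(true_edges) if true_edges else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def shd(true_edges: Set[Edge], pred_edges: Set[Edge]) -> int:
    """Structural Hamming distance.

    Assumed convention: one error per missing edge, per extra edge,
    and per reversed edge (a reversal is NOT double-counted).
    """
    true_pairs = {frozenset(e) for e in true_edges}
    pred_pairs = {frozenset(e) for e in pred_edges}
    # Node pairs connected in exactly one of the two graphs.
    missing_or_extra = len(true_pairs ^ pred_pairs)
    # Node pairs connected in both graphs but with opposite direction.
    reversed_count = sum(
        1
        for (a, b) in pred_edges
        if (b, a) in true_edges and (a, b) not in true_edges
    )
    return missing_or_extra + reversed_count


# Hypothetical prediction: misses asia -> tub and reverses bronc -> dysp.
example_pred: Set[Edge] = (ASIA_TRUE_EDGES - {("asia", "tub"), ("bronc", "dysp")}) | {
    ("dysp", "bronc")
}

precision, recall, f1 = edge_metrics(ASIA_TRUE_EDGES, example_pred)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
print(f"SHD={shd(ASIA_TRUE_EDGES, example_pred)}")
```

In this sketch, Precision, Recall, and F1 are fractions; the leaderboard appears to report them on a 0 to 100 scale. Lower SHD is better, with 0 meaning the predicted graph matches the ground truth exactly.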