Object Stacking on Stack Spuriousness S (test)
[Chart: Success Rate over time. Top result: GRADER, 97.6 (Jul 19, 2022)]
Evaluation Results
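| Method  | Configuration             | Date    | Success Rate |
|---------|---------------------------|---------|--------------|
| GRADER  |                           | 2022.07 | 97.6         |
| Offline | data_source=offline ra... | 2022.07 | 95.4         |
| Score   | discovery_method=score... | 2022.07 | 90.5         |
| TICSA   |                           | 2022.07 | 88.8         |
| Full    | causal_graph=full         | 2022.07 | 86           |
| ICIL    |                           | 2022.07 | 81.2         |
| PETS    |                           | 2022.07 | 77.7         |
| ICIN    |                           | 2022.07 | 71           |
| GNN     |                           | 2022.07 | 39           |
| SAC     |                           | 2022.07 | 22.1         |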