Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TOY

Benchmarks

Task NameDataset NameSOTA ResultTrend
Explicit AttackToy
Avg Queries (E)500
17
RationalizationToy (test)
HI-F176.02
12
2d multi-goalToy
Recovery Time (%)3.2
8
ClassificationToy Synthetic Skew (test)
F1 Score99.93
7
ClassificationToy (test)
F1 Score99.92
5
Cell detectionTOY
AP @ IoU=0.5099.98
4
Real2Sim Reconstruction and Interaction PredictionToy4K real-world experiment
Stability73.3
2
High-dimensional predictionToy-512
Average Regret0.29
2
High-dimensional predictionToy-256
Average Regret1.29
2
High-dimensional predictionToy-128
Average Regret4.18
2
High-dimensional predictionToy-64
Average Regret5.61
2
Counterfactual PredictionToy 4
MAE (do(n1) -> n2)0.443
2
Counterfactual PredictionToy 3
MAE (do(n1), n2)0.451
2
Counterfactual PredictionToy 2
MAE (do(n1) -> n2)0.303
2
Counterfactual PredictionToy 1
MAE (do(n1) -> n2)0.434
2
Counterfactual PredictionToy 4 (test)
MAE (do(n1) -> n2)0.158
2
Counterfactual PredictionToy 3 (test)
MAE (do(n1) -> n2)0.443
2
Showing 17 of 17 rows