Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TOY

Benchmarks

Task NameDataset NameSOTA ResultTrend
MSI CompletionToy MSI database
PSNR48.878
28
Explicit AttackToy
Avg Queries (E)500
17
RationalizationToy (test)
HI-F176.02
12
Single-object generationToy4K
PSNR23.98
11
2d multi-goalToy
Recovery Time (%)3.2
8
ClassificationToy Synthetic Skew (test)
F1 Score99.93
7
ClassificationToy (test)
F1 Score99.92
5
Cell detectionTOY
AP @ IoU=0.5099.98
4
Set-level AttributionToy
Shape Accuracy100
3
Real2Sim Reconstruction and Interaction PredictionToy4K real-world experiment
Stability73.3
2
High-dimensional predictionToy-512
Average Regret0.29
2
High-dimensional predictionToy-256
Average Regret1.29
2
High-dimensional predictionToy-128
Average Regret4.18
2
High-dimensional predictionToy-64
Average Regret5.61
2
Counterfactual PredictionToy 4
MAE (do(n1) -> n2)0.443
2
Counterfactual PredictionToy 3
MAE (do(n1), n2)0.451
2
Counterfactual PredictionToy 2
MAE (do(n1) -> n2)0.303
2
Counterfactual PredictionToy 1
MAE (do(n1) -> n2)0.434
2
Counterfactual PredictionToy 4 (test)
MAE (do(n1) -> n2)0.158
2
Counterfactual PredictionToy 3 (test)
MAE (do(n1) -> n2)0.443
2
Showing 20 of 20 rows