Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scenario

Benchmarks

Task NameDataset NameSOTA ResultTrend
Perceived Risk PredictionScenario MB
RMSE0.2391
81
RegressionScenario IS2
Size0
24
ClassificationScenario IS1
Model Size0
24
Change point localizationScenario 5
Mismatch Proportion (K!=K)0.055
20
Change point localizationScenario 3
Error Proportion (K_hat != K)70.5
20
Policy Value EstimationScenario 4
Policy Value Mean6.714
15
Policy Value EstimationScenario 3
Policy Value (mean)1.879
15
Individualized Treatment Rule EstimationScenario 2
Policy Value (PV)1.095
15
Individualized Treatment Rule EstimationScenario 1
Policy Value (PV)1.017
15
Multi-agent trajectory planning10 agent scenario (ground truth goals)
Trajectory Success Rate2.31
12
Change point localizationScenario 1 T=300
Prop. K_hat != K1
10
Change point localizationScenario 1 T=150
Error Proportion0
10
Trajectory TrackingScenario A Out-of-Distribution 500 g
RMSE (Slow 0.5 m/s)0.301
8
Trajectory TrackingScenario A In-Distribution 300 g
RMSE (Slow 0.5 m/s)0.215
8
Economic Decision-MakingScenario S3 Crisis Shock
Average Reward8.18
8
Constrained Motion PlanningScenario 3 (two Franka Panda manipulators) 1.0 (test)
Success Rate100
8
Constrained motion planningScenario 2 (Two Franka Panda manipulators with closed-chain constraints) (test)
Success Rate100
8
Text-to-Image GenerationScenario 4
Similarity (95th Percentile)0.9262
8
Traffic Signal ControlScenario VISSIM corridor 1
ANP1,749.57
7
Simultaneous Exploration and InspectionScenario C
Finish Rate Avg98.7
7
Intent RecognitionScenario Static S1
Selection Accuracy98
6
Intent RecognitionScenario 1 Dynamic
Tracking Rate92
6
Multi-robot motion planningScenario 3 Four-arm setup
Planning Time (Q1)0.075
6
Multi-robot motion planningScenario 2 Two-arm setup with obstacle
Time Q10.057
6
Multi-robot motion planningScenario 1 Two-arm setup
Planning Time (Q1)0.013
6
Showing 25 of 78 rows