Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RW

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-step outcome predictionRW MIMIC-extract (Latino)
RMSE4.57
36
Multi-step outcome predictionRW MIMIC-extract (Asian)
RMSE4.69
36
Off-policy predictionRW tabular
Tail-average RMSE0.023
16
Word SimilarityEN RW
Spearman Correlation48
10
Off-policy predictionRW inverted
Tail-average RMSE0.035
8
Integer Linear Programming SolvingRW
Objective Value77.5
7
Semantic SegmentationRW-10
mIoU44.8
4
Autonomous NavigationRW Baseline Difficult Forest 1.0
Distance (m)57
2
Word SimilarityRW (test)
Spearman Correlation58.12
2
Showing 9 of 9 rows