Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RBM

Benchmarks

Task NameDataset NameSOTA ResultTrend
Trajectory RankingRBM OOD 1.0 (test)
Kendall's Tau-a0.66
8
Reward alignmentRBM-EVAL ID
Pearson r (VOC)0.92
8
Goodness-of-fit testingRBM perturbation=0.06
Null Rejection Rate100
7
Goodness-of-fit testingRBM (perturbation=0.04)
Null Rejection Rate100
7
Goodness-of-fit testingRBM perturbation=0.02
Null Rejection Rate100
7
Goodness-of-fit testingRBM perturbation=0
Null Rejection Rate0
7
Showing 6 of 6 rows