Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Policy Fine-Tuning on Gaussian-mixture environment (G1 landscape)

100Success Rate (SR)

DSRL

30.3248.4166.584.59May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
100250.250
2026.05
100580.58250.4
2026.05
10010010.99
2026.05
10010010.99
2026.05
989811
2026.05
66160.08250
2026.05
33330.33250.46