Multi-Objective Reinforcement Learning on Queue
[Chart: MER and Success Rate (%) by method. Highlighted entry: HEURISTIC, MER 1.6, Mar 24, 2026.]
Evaluation Results
Method        Details               Date     MER    Success Rate (%)
HEURISTIC     -                     2026.03  1.6    10.05
ENVELOPE      -                     2026.03  3.54   25.1
DPI           Algorithm=Q-learning  2026.03  3.74   29.09
FIXED         -                     2026.03  4.19   10.05
RS            -                     2026.03  4.29   11.43
SR-PPO        -                     2026.03  5.64   46.91
DPI           Algorithm=PPO         2026.03  10.34  39.95
DPI-PPO       -                     2026.03  10.34  39.95
Dense Oracle  -                     2026.03  14.41  49.27
MER-PPO       -                     2026.03  15.01  0.98
RANDOM        -                     2026.03  24.24  17.25