Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Generative Recommendation on RecSim Extreme Noisy Noise Dominated
Loading...
0.318
Reward
DRPO-Exp
0.11624
0.16862
0.221
0.27338
Feb 11, 2026
Reward
eCPM
Updated 4d ago
Evaluation Results
Method
Method
Links
Reward
eCPM
DRPO-Exp
2026.02
0.318
1.59
DRPO
2026.02
0.308
1.54
Adapt. BC
mechanism=Adaptive fil...
2026.02
0.297
1.48
IQL
2026.02
0.289
1.44
BC
2026.02
0.268
1.34
AWR
2026.02
0.254
1.27
CRR
2026.02
0.217
1.08
BPPO
2026.02
0.124
0.62
Feedback
Search any
task
Search any
task