Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Regret Minimization on Episodic MNL mixture MDP
Loading...
1
Regret Bound
Hwang and Oh
0.92
1.46
2
2.54
May 27, 2026
Regret Bound
Regret
Variance-Adaptive Configuration
Optimal Regret
Updated 6d ago
Evaluation Results
Method
Method
Links
Regret Bound
Regret
Variance-Adaptive Configuration
Optimal Regret
Hwang and Oh
Bound Type=Upper Bound
2026.05
1
-
-
-
This work (Theorem 5)
Bound Type=Upper Bound
2026.05
1
-
-
-
This work (Theorem 7)
Bound Type=Lower Bound
2026.05
1
-
-
-
Cho et al.
Bound Type=Upper Bound
2026.05
2
-
-
-
Li et al.
Bound Type=Upper Bound
2026.05
2
-
-
-
This work (Corollary 9)
Bound Type=Upper Bound
2026.05
3
-
-
-
Park et al.
Bound Type=Lower Bound
2026.05
3
-
-
-
Hwang and Oh (2023)
Bound Type=Upper bound
2026.05
-
2
-
-
Li et al. (2024)
Bound Type=Upper bound
2026.05
-
2
-
-
LIVAROT (Theorem 3)
Bound Type=Upper bound
2026.05
-
2
-
-
Park et al. (2024)
Bound Type=Lowerbound
2026.05
-
3
-
-
LIVAROT (Theorem 4)
Bound Type=Lowerbound
2026.05
-
2
-
-
Feedback
Search any
task
Search any
task