| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Vanishing-mismatch experiment | TACC | Mean Cost-Weighted Pseudo-Regret75.7 | 15 | 22d ago | |
| Wheel Bandit delta=0.9 | Gradient-Laplace | Final Cumulative Regret (Mean)7,654.6 | 10 | 22d ago | |
| Wheel Bandit delta=0.95 | Gradient-Laplace | Mean Cumulative Regret8,211.9 | 10 | 22d ago | |
| Checkpoint-based (evaluation) | MF-UCB | Mean Difference427.7 | 5 | 22d ago |