Question Answering on HotpotQA (Mean per-step regret)
[Chart: Mean Per-Step Regret over time. Plotted point: ϵ-FTRL, 0.188, Feb 23, 2026.]
Evaluation Results
Method               Strategy Category   Date      Mean Per-Step Regret
ϵ-FTRL               Non-...             2026.02   0.188
Disamb               Stat...             2026.02   0.191
LinUCB               Cont...             2026.02   0.191
LinEXP3              Cont...             2026.02   0.192
TS                   Cont...             2026.02   0.194
LinFTPL              Cont...             2026.02   0.196
EXP3                 Non-...             2026.02   0.197
TS                   Non-...             2026.02   0.197
LinUCB+KL            Cont...             2026.02   0.197
No-Rewrite (NoRw)    Base                2026.02   0.198
Simpl                Stat...             2026.02   0.199
FTPL                 Non-...             2026.02   0.199
Expand               Stat...             2026.02   0.201
Para                 Stat...             2026.02   0.203
Clarify              Stat...             2026.02   0.206