Full-Information Online Learning on FOL Sine-trend rewards Horizon Generalization [T=15 -> T=25] 1.0
[Chart: Max LR by date. Highlighted data point: GPT-4o mini, 40.62 Max LR (Nov 6, 2025). Selectable metrics: Max LR, Avg LR, Beta (β).]
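Evaluation Results

| Method              | Setting                   | Date    | Max LR | Avg LR | Beta (β) |
|---------------------|---------------------------|---------|--------|--------|----------|
| GPT-4o mini         | Horizon (T)=25, Action... | 2025.11 | 40.62  | 11.24  | 0.43     |
| Trained GPT-4o mini | Horizon (T)=25, Action... | 2025.11 | 39.64  | 9.83   | 0.38     |
| FTRL                | Horizon (T)=25, Action... | 2025.11 | 38.09  | 11.32  | 0.43     |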