Share your thoughts, 1 month free Claude Pro on usSee more

Full-Information Online Learning on FOL Sine-trend rewards Horizon Generalization [T=15 -> T=25] 1.0

40.62Max LR

GPT-4o mini

Updated 1mo ago

Evaluation Results

Method	Links
GPT-4o mini 2025.11		40.62	11.24	0.43
Trained GPT-4o mini 2025.11		39.64	9.83	0.38
FTRL 2025.11		38.09	11.32	0.43