
Full-Information Online Learning on FOL Sine-trend rewards Horizon Generalization [T=15 -> T=25] 1.0

Max LR: 40.62

GPT-4o mini

Nov 6, 2025
Updated 2d ago

Evaluation Results

Method    Links
2025.11   40.62   11.24   0.43
2025.11   39.64   9.83    0.38
2025.11   38.09   11.32   0.43