Fine-tuning Timeseries Predictors Using Reinforcement Learning
About
This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters. We propose a clear implementation plan for backpropagating the loss of a reinforcement learning task to a model trained using supervised learning, and compare the performance before and after the fine-tuning. We find an increase in performance after fine-tuning, and transfer learning properties to the models, indicating the benefits of fine-tuning. We also highlight the tuning process and empirical results for future implementation by practitioners.
Hugo Cazaux, Ralph Rudd, Hlynur Stef\'ansson, Sverrir \'Olafsson, Eyj\'olfur Ingi \'Asgeirsson• 2026
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Timeseries Forecasting | Financial | MSE0.145 | 12 | |
| Timeseries Forecasting | Industrials | MSE0.119 | 12 | |
| Timeseries Forecasting | Technology | MSE0.118 | 12 |
Showing 3 of 3 rows