Fine-tuning Timeseries Predictors Using Reinforcement Learning

About

This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters. We propose a clear implementation plan for backpropagating the loss of a reinforcement learning task to a model trained using supervised learning, and compare the performance before and after the fine-tuning. We find an increase in performance after fine-tuning, and transfer learning properties to the models, indicating the benefits of fine-tuning. We also highlight the tuning process and empirical results for future implementation by practitioners.

Hugo Cazaux, Ralph Rudd, Hlynur Stef\'ansson, Sverrir \'Olafsson, Eyj\'olfur Ingi \'Asgeirsson• 2026

Related benchmarks

Task	Dataset	Result
Timeseries Forecasting	Financial	MSE0.145	12
Timeseries Forecasting	Industrials	MSE0.119	12
Timeseries Forecasting	Technology	MSE0.118	12

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord