Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fine-tuning Timeseries Predictors Using Reinforcement Learning

About

This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters. We propose a clear implementation plan for backpropagating the loss of a reinforcement learning task to a model trained using supervised learning, and compare the performance before and after the fine-tuning. We find an increase in performance after fine-tuning, and transfer learning properties to the models, indicating the benefits of fine-tuning. We also highlight the tuning process and empirical results for future implementation by practitioners.

Hugo Cazaux, Ralph Rudd, Hlynur Stef\'ansson, Sverrir \'Olafsson, Eyj\'olfur Ingi \'Asgeirsson• 2026

Related benchmarks

TaskDatasetResultRank
Timeseries ForecastingFinancial
MSE0.145
12
Timeseries ForecastingIndustrials
MSE0.119
12
Timeseries ForecastingTechnology
MSE0.118
12
Showing 3 of 3 rows

Other info

Follow for update