Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading

About

Large language models (LLMs) fine-tuned on multimodal financial data have demonstrated impressive reasoning capabilities in various financial tasks. However, they often struggle with multi-step, goal-oriented scenarios in interactive financial markets, such as trading, where complex agentic approaches are required to improve decision-making. To address this, we propose \textsc{FLAG-Trader}, a unified architecture integrating linguistic processing (via LLMs) with gradient-driven reinforcement learning (RL) policy optimization, in which a partially fine-tuned LLM acts as the policy network, leveraging pre-trained knowledge while adapting to the financial domain through parameter-efficient fine-tuning. Through policy gradient optimization driven by trading rewards, our framework not only enhances LLM performance in trading but also improves results on other financial-domain tasks. We present extensive empirical evidence to validate these enhancements.

Guojun Xiong, Zhiyang Deng, Keyi Wang, Yupeng Cao, Haohang Li, Yangyang Yu, Xueqing Peng, Mingquan Lin, Kaleb E Smith, Xiao-Yang Liu, Jimin Huang, Sophia Ananiadou, Qianqian Xie• 2025

Related benchmarks

TaskDatasetResultRank
Single Asset TradingTSLA (test)
CR %50.394
24
Stock TradingJNJ stock
Calmar Ratio33.724
15
Stock TradingBTC (test)
Compound Return (CR)0.4551
15
Stock TradingUVV stock
Compound Return (CR)46.799
15
Stock TradingMSFT stock
Compound Return (CR)20.106
15
Stock TradingHON (test)
Calmar Ratio (%)34.342
15
Showing 6 of 6 rows

Other info

Follow for update