Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Autonomous AI Agents for Option Hedging: Enhancing Financial Stability through Shortfall Aware Reinforcement Learning

About

The deployment of autonomous AI agents in derivatives markets has widened a practical gap between static model calibration and realized hedging outcomes. We introduce two reinforcement learning frameworks, a novel Replication Learning of Option Pricing (RLOP) approach and an adaptive extension of Q-learner in Black-Scholes (QLBS), that prioritize shortfall probability and align learning objectives with downside sensitive hedging. Using listed SPY and XOP options, we evaluate models using realized path delta hedging outcome distributions, shortfall probability, and tail risk measures such as Expected Shortfall. Empirically, RLOP reduces shortfall frequency in most slices and shows the clearest tail-risk improvements in stress, while implied volatility fit often favors parametric models yet poorly predicts after-cost hedging performance. This friction-aware RL framework supports a practical approach to autonomous derivatives risk management as AI-augmented trading systems scale.

Minxuan Hu, Ziheng Chen, Jiayu Yi, Wenxi Sun• 2026

Related benchmarks

TaskDatasetResultRank
Option HedgingSPY ATM 2020Q1
Shortfall Probability91
5
Option PricingSPY 2025Q2 τ=14d (Whole sample)
IVRMSE9.49
5
Option Pricing AccuracySPY Whole sample 28d maturity 2025Q2
IVRMSE7.34
5
Option Pricing AccuracySPY Moneyness < 1, 28d maturity 2025Q2
IVRMSE7.55
5
Option PricingSPY 2020Q2 τ=56d
IVRMSE7.05
5
Option PricingXOP 2025Q2 τ=14d (Whole sample)
IVRMSE15.16
5
Option Pricing AccuracyXOP Whole sample 28d maturity 2020Q1
IVRMSE10.99
5
Option Pricing AccuracyXOP 2020Q1, Moneyness < 1, 28d maturity
IVRMSE12.48
5
Option Pricing AccuracyXOP 2025Q2, Moneyness > 1, 28d maturity
IVRMSE6.6
5
Option Pricing AccuracySPY 2020Q1, Moneyness > 1.03, 28d maturity
IVRMSE4.17
5
Showing 10 of 38 rows

Other info

Follow for update