
Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search

About

Auto-bidding is a critical tool for advertisers to improve advertising performance. Recent progress has demonstrated that AI-Generated Bidding (AIGB), which learns a conditional generative planner from offline data, achieves superior performance compared to typical offline reinforcement learning (RL)-based auto-bidding methods. However, existing AIGB methods still face a performance bottleneck because they cannot explore beyond the static dataset with feedback. To address this, we propose AIGB-Pearl (Planning with EvaluAtor via RL), a novel method that integrates generative planning and policy optimization. The core of AIGB-Pearl lies in constructing a trajectory evaluator to score the quality of generated trajectories and designing a provably sound KL-Lipschitz-constrained score-maximization scheme to ensure safe and efficient exploration beyond the offline dataset. A practical algorithm that incorporates the synchronous coupling technique is further developed to ensure the model regularity required by the proposed scheme. Extensive experiments on both simulated and real-world advertising systems demonstrate the state-of-the-art performance of our approach.
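
To make the idea of KL-constrained score maximization concrete, the following is a minimal toy sketch, not the AIGB-Pearl algorithm itself. It assumes a small discrete set of candidate trajectories, a hypothetical behavior distribution `behavior` standing in for the offline planner, and hypothetical evaluator scores `scores`. It uses the standard closed-form maximizer of E_q[score] - beta * KL(q || p), namely q proportional to p * exp(score / beta). The Lipschitz constraint, the generative planner, and the synchronous coupling technique mentioned in the abstract are not modeled here.

```python
import math


def kl_regularized_policy(behavior_probs, scores, beta):
    """Closed-form maximizer of E_q[score] - beta * KL(q || p) over discrete candidates.

    behavior_probs: probabilities p of each candidate under the offline (behavior) planner.
    scores: evaluator score of each candidate (hypothetical values in this sketch).
    beta: positive penalty weight; larger beta keeps q closer to p.
    """
    if beta <= 0:
        raise ValueError("beta must be positive")
    max_s = max(scores)  # subtract the max score for numerical stability
    weights = [p * math.exp((s - max_s) / beta) for p, s in zip(behavior_probs, scores)]
    total = sum(weights)
    return [w / total for w in weights]


def kl_divergence(q, p):
    """KL(q || p) for discrete distributions, skipping zero-mass entries of q."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, p) if qi > 0)


def expected_score(q, scores):
    """Expected evaluator score under distribution q."""
    return sum(qi * si for qi, si in zip(q, scores))


# Hypothetical toy data: three candidate trajectories.
behavior = [0.5, 0.3, 0.2]
scores = [1.0, 2.0, 3.0]

for beta in (10.0, 1.0, 0.1):
    q = kl_regularized_policy(behavior, scores, beta)
    print(f"beta={beta}: expected score={expected_score(q, scores):.3f}, "
          f"KL from behavior={kl_divergence(q, behavior):.3f}")
```

In this toy setting, lowering `beta` raises the expected score while moving the distribution further from the behavior data, which is the trade-off a KL constraint is meant to control.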

Zhiyu Mou, Yiqin Lv, Miao Xu, Qi Wang, Yixiu Mao, Jinghao Chen, Qichen Ye, Chao Li, Rongquan Bai, Chuan Yu, Jian Xu, Bo Zheng • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Auto-bidding | Simulated Offline Advertising System 1.5k budget 30 advertisers | GMV 503 | 9 |
| Auto-bidding | Simulated Offline Advertising System 2.0k Budget 30 Advertisers | GMV 521.8 | 9 |
| Auto-bidding | Simulated Offline Advertising System 2.5k Budget 30 Advertisers | GMV 545 | 9 |
| Auto-bidding | Simulated Offline Advertising System 3.0k Budget 30 Advertisers | GMV 574.2 | 9 |
| Auto-bidding | TaoBao real-world A/B (test) | GMV 7.87e+7 | 9 |
| Auto-bidding | Real-world A/B tests 4k unseen advertisers over 19 days | GMV 6.93e+7 | 4 |
| Auto-bidding | TargetROAS real-world A/B test 300k advertisers (22 days) | GMV 8.20e+8 | 2 |
| Auto-bidding | Simulated First-Price Auction | GMV 1.61e+3 | 2 |
