GAS: Generative Auto-bidding with Post-training Search

About

Auto-bidding is essential to online advertising, automatically placing bids on behalf of advertisers. Generative auto-bidding, which generates bids based on an adjustable condition using models like transformers and diffusers, has recently emerged as a new trend due to its potential to learn optimal strategies directly from data and adjust flexibly to preferences. However, generative models suffer from low-quality data, which leads to a mismatch between the condition (e.g., return-to-go) and the true action value, especially in long sequential decision-making. In addition, the majority preference in the dataset may hinder a model's ability to generalize to minority advertisers' preferences. While it is possible to collect high-quality data and retrain multiple models for different preferences, the cost is prohibitive, hindering the advancement of auto-bidding into the era of large foundation models. To address this, we propose a flexible and practical Generative Auto-bidding scheme using post-training Search, termed GAS, to refine a base policy model's output and adapt it to various preferences. We use weak-to-strong search alignment, training small critics for different preferences and applying an MCTS-inspired search to refine the model's output. Specifically, a novel voting mechanism with transformer-based critics trained with policy indications enhances search alignment performance. Additionally, building on the search, we provide a fine-tuning method for high-frequency preference scenarios where computational efficiency matters. Extensive experiments on a real-world dataset and an online A/B test on the Kuaishou advertising platform demonstrate the effectiveness of GAS, achieving significant improvements, e.g., a 4.60% increase in target cost.
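
The idea of refining a base policy's output with several small critics and a vote can be illustrated with a minimal single-step sketch. This is not the paper's algorithm: the abstract describes an MCTS-inspired search with transformer-based critics, whereas the code below only perturbs one base action and takes a majority vote. The function name `refine_bid`, the scalar state and action, the multiplicative perturbation, and the tie-breaking rule are all assumptions made for illustration.

```python
import random
from collections import Counter
from typing import Callable, Optional, Sequence

# Hypothetical critic signature: (state, action) -> estimated value.
Critic = Callable[[float, float], float]


def refine_bid(
    state: float,
    base_action: float,
    critics: Sequence[Critic],
    num_candidates: int = 8,
    noise_scale: float = 0.1,
    rng: Optional[random.Random] = None,
) -> float:
    """Pick a refined action near the base policy's action by critic voting."""
    rng = rng or random.Random(0)

    # Candidate generation: keep the base action (index 0) and add
    # multiplicative perturbations of it (an assumed expansion rule).
    candidates = [base_action] + [
        base_action * (1.0 + rng.uniform(-noise_scale, noise_scale))
        for _ in range(num_candidates - 1)
    ]

    # Voting: each critic casts one vote for the candidate it rates highest.
    votes: Counter = Counter()
    for critic in critics:
        best_idx = max(
            range(len(candidates)),
            key=lambda i: critic(state, candidates[i]),
        )
        votes[best_idx] += 1

    # Most votes wins; ties go to the lowest index, which favors the
    # unmodified base action.
    winner = min(votes, key=lambda i: (-votes[i], i))
    return candidates[winner]
```

With critics that all prefer actions near some target, the returned action is never further from that target than the base action, because the base action is always among the candidates. Swapping in critics trained for a different preference changes the selected action without retraining the base policy, which is the property the abstract emphasizes.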

Yewen Li, Shuai Mao, Jingtong Gao, Nan Jiang, Yunjian Xu, Qingpeng Cai, Fei Pan, Peng Jiang, Bo An • 2024

Related benchmarks

Task	Dataset	Result
Auto-bidding	AuctionNet	Score 525	150
Auto-bidding	AuctionNet-Sparse	Score 46.57	112
Auto-bidding	AuctionNet Medium reward sparsity	Score 358.5	50
Auto-bidding	AuctionNet High reward sparsity	Score 498.8	50
Auto-bidding	AuctionNet Low reward sparsity	Score 41.8	50
Auto-bidding	AuctionNet (Dense)	Conversions 371	10
Auto-bidding	AuctionNet (75% budget)	Score 27.5	9
Auto-bidding	AuctionNet 100% budget	Score 36.1	9
Auto-bidding	AuctionNet 125% budget	Score 40	9
Auto-bidding	AuctionNet 150% budget	Score 46.5	9

Showing 10 of 14 rows
