
From Scaling to Structured Expressivity: Rethinking Transformers for CTR Prediction

About

Despite massive investments in scale, deep models for click-through rate (CTR) prediction often exhibit rapidly diminishing returns, in stark contrast to the predictable scaling laws seen in large language models (LLMs). We identify the root cause as a fundamental structural misalignment: standard Transformers assume sequential compositionality, whereas CTR data demand combinatorial reasoning over heterogeneous fields. To restore alignment, we introduce the Field-Aware Transformer (FAT). By reconstructing the standard Transformer block with field-centric parameters, FAT achieves structured expressivity, shifting the dependence of model complexity from the total vocabulary size n to the number of fields F (n ≫ F). Crucially, to decouple model capacity from field cardinality, FAT employs a Basis-Composed Hypernetwork that synthesizes field-specific parameters from shared bases, further reducing parameter complexity. Theoretically, we ground this scaling behavior in a formal scaling law based on Rademacher complexity. Empirically, FAT outperforms existing state-of-the-art methods with up to +4.38% AUC improvement, and delivers +2.33% CTR and +0.66% RPM in live production. Our work establishes that scalable recommendation arises not from size alone, but from structured expressivity: architectural coherence with data semantics.
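To make the idea of synthesizing field-specific parameters from shared bases concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify how the hypernetwork combines bases, so the linear mixing rule, the function names (compose_field_weights, param_counts), and all sizes below are illustrative assumptions.

```python
import numpy as np

def compose_field_weights(coeffs, bases):
    """Mix K shared basis matrices into one weight matrix per field.

    coeffs: (F, K) per-field mixing coefficients (assumed learnable).
    bases:  (K, d_in, d_out) basis matrices shared across all fields.
    Returns an array of shape (F, d_in, d_out).
    Assumption: a plain linear combination; the paper may use another rule.
    """
    return np.einsum("fk,kio->fio", coeffs, bases)

def param_counts(num_fields, num_bases, d_in, d_out):
    """Parameter count: independent per-field weights vs. basis composition."""
    independent = num_fields * d_in * d_out
    composed = num_bases * d_in * d_out + num_fields * num_bases
    return independent, composed

# Hypothetical sizes chosen only for illustration.
rng = np.random.default_rng(0)
F, K, d = 8, 2, 4
coeffs = rng.normal(size=(F, K))
bases = rng.normal(size=(K, d, d))

W = compose_field_weights(coeffs, bases)    # one (d, d) matrix per field
x = rng.normal(size=(F, d))                 # one embedding per field
projected = np.einsum("fi,fio->fo", x, W)   # field-specific projection

independent, composed = param_counts(F, K, d, d)
print(W.shape, projected.shape, independent, composed)
```

In this toy setting, independent per-field matrices would need F·d·d parameters, while the composed version needs K·d·d for the bases plus F·K for the coefficients, so the per-field cost grows with K rather than with d·d. Whether FAT's savings take exactly this form is an assumption here, not something the abstract states.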

Bencheng Yan, Yuejie Lei, Zhiyuan Zeng, Zheye Deng, Di Wang, Kaiyi Lin, Pengjie Wang, Chuan Yu, Jian Xu, Bo Zheng • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| CTR Prediction | Taobao | AUC | 78.2 | 13 |
| CTR Prediction | MovieLens 20M | AUC | 84.5 | 13 |
| CTCVR Prediction | E-commerce Douyin | ΔAUC | 0.82 | 12 |
| User Retention | Kuaishou advertising dataset | AUC | 0.7449 | 12 |
| CTR Prediction | Taobao sponsored search system large-scale (Online A/B test) | P99 Latency (ms) | 48 | 3 |