Towards Accurate and Calibrated Classification: Regularizing Cross-Entropy From A Generative Perspective
About
Reliable classification requires not only high predictive accuracy but also well-calibrated confidence estimates. Yet modern deep neural networks (DNNs) are often overconfident, largely due to overfitting on the negative log-likelihood (NLL). While focal-loss variants alleviate this issue, they typically reduce accuracy, revealing a persistent trade-off between calibration and predictive performance. Motivated by the complementary strengths of generative and discriminative classifiers, we propose Generative Cross-Entropy (GCE), which maximizes $p(x|y)$ and is equivalent to cross-entropy augmented with a class-level confidence regularizer. Under mild conditions, GCE is strictly proper. Across CIFAR-10/100, Tiny-ImageNet, and a medical imaging benchmark, GCE improves both accuracy and calibration over cross-entropy, especially in long-tailed scenarios. Combined with adaptive piecewise temperature scaling (ATS), GCE attains calibration competitive with focal-loss variants without sacrificing accuracy.
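The abstract does not give the exact form of the regularizer, but the generative view follows from Bayes' rule: $-\log p(x|y) = -\log p(y|x) + \log p(y) - \log p(x)$, where $-\log p(y|x)$ is the usual cross-entropy and $\log p(y)$ contributes a class-level term. A minimal NumPy sketch of this decomposition (not the paper's exact loss; the weight `lam` and the batch estimate of the class marginal are assumptions for illustration):

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def gce_loss(logits, labels, lam=1.0):
    """Sketch of a generative cross-entropy style objective:
        -log p(x|y) = -log p(y|x) + log p(y) - log p(x).
    The class marginal p(y) is estimated from batch-averaged model
    predictions; -log p(x) is constant w.r.t. the classifier and dropped.
    `lam` is a hypothetical regularization weight, not from the paper.
    """
    probs = softmax(logits)
    n = len(labels)
    # standard cross-entropy term, -log p(y|x)
    ce = -np.log(probs[np.arange(n), labels] + 1e-12).mean()
    # class-level term: model's marginal class distribution over the batch
    marginal = probs.mean(axis=0)
    reg = np.log(marginal[labels] + 1e-12).mean()
    return ce + lam * reg
```

With `lam=0` this reduces to plain cross-entropy; a positive `lam` penalizes concentrating the model's marginal class mass, acting as a class-level confidence regularizer.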
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Image Classification | CIFAR-10 (test) | -- | -- | 410 |
| Image Classification | Tiny ImageNet (test) | -- | -- | 362 |
| Image Classification | CIFAR-100 (test) | -- | -- | 175 |
| Image Classification Calibration | CIFAR-100 | Classwise ECE | 0.2 | 90 |
| Image Classification Calibration | CIFAR-10 | Classwise ECE | 0.39 | 84 |
| Model Calibration | CIFAR-100 | ECE | 1.25 | 81 |
| Model Calibration | CIFAR-10 | ECE | 66 | 68 |
| Classwise Calibration | CIFAR-10-LT | Average Classwise ECE | 1.07 | 56 |
| Image Classification | CIFAR-10 LT-100 (test) | Error Rate | 23.75 | 40 |
| Model Calibration | Tiny-ImageNet | Expected Calibration Error | 1.15 | 32 |
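The ECE figures above are conventionally computed with the standard binned estimator: partition confidences into equal-width bins and average the gap between per-bin accuracy and per-bin mean confidence, weighted by bin size. A minimal sketch (the bin count of 15 is a common default, not stated on this page):

```python
import numpy as np

def expected_calibration_error(confidences, predictions, labels, n_bins=15):
    """Binned ECE: sum over bins of (bin weight) * |accuracy - confidence|."""
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            acc = (predictions[mask] == labels[mask]).mean()
            conf = confidences[mask].mean()
            ece += mask.mean() * abs(acc - conf)
    return ece
```

Classwise ECE applies the same estimator per class (using each class's predicted probability) and averages over classes; the table's values appear to be reported as percentages.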