Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generative Cross-Entropy: A Strictly Proper Loss for Data-Efficient Classification

About

Cross-entropy (CE) is the default training loss for supervised classification, but its sample efficiency is limited when labels are scarce. Existing remedies primarily act on the data side, via augmentation, synthesis, or transfer from pretrained models; the training objective itself is rarely revisited. We revisit it here. Drawing on the classical observation that generative classifiers reach their asymptotic error with fewer samples than discriminative ones, we propose Generative Cross-Entropy (GenCE), a drop-in replacement for CE that introduces a generative learning principle into a standard discriminative network without altering the architecture or fitting a separate density model. GenCE follows from a Bayesian rewrite of the class-conditional likelihood and, in the mini-batch approximation, reduces to normalizing each sample's softmax score against the model's predictions on the batch, coupling the training signal across examples sharing a class. We extend the proper-scoring-rule framework to such non-local losses and prove that GenCE is strictly proper under a mild completeness condition: its population risk is uniquely minimized at the true posterior. Across three datasets, on two architectures and in both balanced small-data and class-imbalanced regimes, GenCE outperforms CE and other widely used losses, while also producing better-calibrated probabilities and stronger out-of-distribution detection.

Qipeng Zhan, Zhuoping Zhou, Li Shen• 2026

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)--
882
Image ClassificationTiny ImageNet (test)--
722
Image ClassificationCIFAR-100 (test)--
395
Image Classification CalibrationCIFAR100
Classwise ECE0.2
99
Image Classification CalibrationCIFAR10
Classwise ECE0.39
84
Model CalibrationCIFAR-100
ECE1.25
81
Model CalibrationCIFAR-10
ECE66
68
Classwise CalibrationCIFAR-10-LT
Average Classwise ECE1.07
56
Image ClassificationCIFAR-10 LT-100 (test)
Error Rate23.75
40
Model CalibrationTiny-ImageNet
Expected Calibration Error1.15
32
Showing 10 of 17 rows

Other info

Follow for update