Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Data Distribution Valuation Using Generalized Bayesian Inference

About

We investigate the data distribution valuation problem, which aims to quantify the values of data distributions from their samples. This is a recently proposed problem that is related to but different from classical data valuation and can be applied to various applications. For this problem, we develop a novel framework called Generalized Bayes Valuation that utilizes generalized Bayesian inference with a loss constructed from transferability measures. This framework allows us to solve, in a unified way, seemingly unrelated practical problems, such as annotator evaluation and data augmentation. Using the Bayesian principles, we further improve and enhance the applicability of our framework by extending it to the continuous data stream setting. Our experiment results confirm the effectiveness and efficiency of our framework in different real-world scenarios.

Cuong N. Nguyen, Cuong V. Nguyen• 2026

Related benchmarks

TaskDatasetResultRank
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy73.92
543
Fine-grained Image ClassificationStanford Dogs (test)
Accuracy73.2
124
Annotator EvaluationCIFAR-10 (test)
Test Accuracy78.32
6
Annotator EvaluationCUB-200-2011 (test)
Test Accuracy58.08
5
Data distribution valuationCIFAR-10
Time (s)0.85
4
Pearson correlation between valuation and test accuracyCUB-200-2011 (test)
Pearson Correlation Coefficient0.98
2
Pearson correlation between valuation and test accuracyCIFAR-10 (test)
Pearson Correlation Coefficient0.99
2
Showing 7 of 7 rows

Other info

Follow for update