
Cost-Effective Model Evaluation with Meta-Learning

About

The rapid growth of machine learning has produced an ever-expanding ecosystem of models, making it increasingly challenging to verify the reliability of newly released models on unseen, unlabeled data. Conventional evaluation pipelines depend on expensive annotation, repeated fine-tuning, or narrow assumptions that fail to transfer across model families. We present MetaEvaluator, a cost-effective, model-agnostic framework for rapid, label-free assessment of unseen models spanning diverse architectures and modalities. MetaEvaluator leverages meta-learning over a pool of reference models to obtain a transferable initialization, enabling accurate evaluation of new models while amortizing cost across the pool and removing the need for per-model retraining. To the best of our knowledge, this is the first model-agnostic framework capable of evaluating new models on entirely unlabeled datasets. Extensive experiments show that MetaEvaluator produces stable and accurate performance estimates at substantially reduced cost compared to conventional approaches, making scalable benchmarking of emerging models on unlabeled data practical.
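The abstract does not spell out how the transferable initialization is learned, so the following is only a minimal sketch of the general idea, using a Reptile-style first-order meta-learning loop as a stand-in technique. Everything in it is an assumption made for illustration: the 1-D linear "evaluator", the confidence-to-accuracy data for each reference model, and the names `sgd_adapt`, `reptile_init`, and `reference_pool` are hypothetical and are not taken from MetaEvaluator.

```python
import random

def sgd_adapt(w, b, data, lr=0.1, steps=5):
    """Adapt a 1-D linear evaluator y ~ w*x + b to one reference model's data."""
    for _ in range(steps):
        gw = gb = 0.0
        for x, y in data:
            err = (w * x + b) - y
            gw += 2 * err * x / len(data)  # gradient of mean squared error w.r.t. w
            gb += 2 * err / len(data)      # gradient of mean squared error w.r.t. b
        w, b = w - lr * gw, b - lr * gb
    return w, b

def reptile_init(tasks, meta_lr=0.5, epochs=200, seed=0):
    """Reptile-style loop: nudge the shared init toward each task's adapted weights."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    for _ in range(epochs):
        data = rng.choice(tasks)           # sample one reference model from the pool
        w_t, b_t = sgd_adapt(w, b, data)   # inner-loop adaptation from the shared init
        w += meta_lr * (w_t - w)           # outer-loop interpolation step
        b += meta_lr * (b_t - b)
    return w, b

def pooled_mse(w, b, tasks):
    """Mean squared error of the evaluator over all reference models' data."""
    pairs = [p for data in tasks for p in data]
    return sum(((w * x + b) - y) ** 2 for x, y in pairs) / len(pairs)

# Hypothetical pool: each reference model maps a confidence score x to accuracy y.
xs = [0.2, 0.4, 0.6, 0.8]
reference_pool = [
    [(x, a * x + c) for x in xs]
    for a, c in [(0.8, 0.10), (0.9, 0.05), (0.7, 0.15)]
]

meta_w, meta_b = reptile_init(reference_pool)
mse_zero = pooled_mse(0.0, 0.0, reference_pool)
mse_meta = pooled_mse(meta_w, meta_b, reference_pool)
```

In this toy setup the learned initialization fits the pooled reference data better than an untrained one, and a new model would start from `(meta_w, meta_b)` and need only a few adaptation steps. That is the sense in which the cost is amortized across the pool, though the actual framework may differ substantially.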

Trinh Pham, Viet Huynh, Hongzhi Yin, Quoc Viet Hung Nguyen, Thanh Tam Nguyen • 2026

Related benchmarks

Task | Dataset | Result | Rank
Accuracy Estimation | Text2SQL source-target transfers (Spider, BIRD, WikiSQL, SParC, CoSQL, SynSQL-2.5M) | MAE 3.41 | 42
Accuracy Estimation | MNIST, USPS, SVHN, COCO, PASCAL, ImageNet source-target transfers | MAE 3.58 | 42
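The results above are reported as MAE, the mean absolute error between estimated and ground-truth accuracy. As a small illustration of how such a figure is computed, here is a sketch with made-up numbers. It assumes accuracies are expressed in percentage points, which the listing does not state, and the function name and values are hypothetical.

```python
def mean_absolute_error(estimated, actual):
    """Mean absolute error between estimated and ground-truth accuracies."""
    if len(estimated) != len(actual) or not estimated:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(e - a) for e, a in zip(estimated, actual)) / len(estimated)

# Hypothetical accuracies (percentage points) for three source-target transfers.
estimated_acc = [71.0, 64.5, 80.0]
true_acc = [74.0, 62.5, 84.0]

# Absolute errors are 3.0, 2.0 and 4.0, so the mean is 3.0.
mae = mean_absolute_error(estimated_acc, true_acc)
```

A lower MAE means the label-free estimates track the true accuracies more closely across the evaluated transfers.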
