Principled Algorithms for Optimizing Generalized Metrics in Multi-Label Learning

About

Many real-world classification tasks require predicting multiple labels per instance, necessitating the optimization of complex evaluation metrics such as the $F$-measure and Jaccard index. While the Empirical Utility Maximization (EUM) framework is natural for these population-level metrics, existing theoretical results are largely limited to asymptotic Bayes-consistency. In this paper, we develop principled learning algorithms for optimizing a broad class of generalized metrics within the EUM framework, grounded in the stronger notion of $H$-consistency. Our key contribution is the design of novel surrogate loss functions for multi-label learning that admit provable $H$-consistency bounds, enabling optimization with non-asymptotic guarantees tailored to the hypothesis class and finite samples. Crucially, we prove these combinatorially formulated surrogates decompose exactly, operating in strictly $O(l)$ time without approximations. Building on this foundation, we introduce MMO (Multi-Label Metric Optimization), a new family of algorithms for optimizing generalized linear-fractional metrics. We validate our approach through extensive experiments, demonstrating robust scalability and superior performance over state-of-the-art continuous baselines on large-scale datasets (MS-COCO, Reuters-21578) in high-sparsity, deep learning regimes. Our results offer both theoretical rigor and practical effectiveness for general multi-label metric optimization.

Mehryar Mohri, Yutao Zhong• 2026

Related benchmarks

Task	Dataset	Result
Multi-Label Classification	MS-COCO 2014 (test)	--	81
Multi-Label Classification	Scene	F1 Score70.39	12
Multi-Label Classification	Birds	F1 Score42.41	12
Multi-Label Classification	EMOTIONS	F1 Score66.89	12
Multi-Label Classification	CAL500	F1 Score50.21	12
Multi-Label Classification	Reuters-21578 1995 (test)	Micro-F186.34	6

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord