Principled Algorithms for Optimizing Generalized Metrics in Multi-Label Learning
About
Many real-world classification tasks require predicting multiple labels per instance, necessitating the optimization of complex evaluation metrics such as the $F$-measure and Jaccard index. While the Empirical Utility Maximization (EUM) framework is natural for these population-level metrics, existing theoretical results are largely limited to asymptotic Bayes-consistency. In this paper, we develop principled learning algorithms for optimizing a broad class of generalized metrics within the EUM framework, grounded in the stronger notion of $H$-consistency. Our key contribution is the design of novel surrogate loss functions for multi-label learning that admit provable $H$-consistency bounds, enabling optimization with non-asymptotic guarantees tailored to the hypothesis class and finite samples. Crucially, we prove these combinatorially formulated surrogates decompose exactly, operating in strictly $O(l)$ time without approximations. Building on this foundation, we introduce MMO (Multi-Label Metric Optimization), a new family of algorithms for optimizing generalized linear-fractional metrics. We validate our approach through extensive experiments, demonstrating robust scalability and superior performance over state-of-the-art continuous baselines on large-scale datasets (MS-COCO, Reuters-21578) in high-sparsity, deep learning regimes. Our results offer both theoretical rigor and practical effectiveness for general multi-label metric optimization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-Label Classification | MS-COCO 2014 (test) | -- | 81 | |
| Multi-Label Classification | Scene | F1 Score70.39 | 12 | |
| Multi-Label Classification | Birds | F1 Score42.41 | 12 | |
| Multi-Label Classification | EMOTIONS | F1 Score66.89 | 12 | |
| Multi-Label Classification | CAL500 | F1 Score50.21 | 12 | |
| Multi-Label Classification | Reuters-21578 1995 (test) | Micro-F186.34 | 6 |