
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis

About

We introduce Resilient Multiple Choice Learning (rMCL), an extension of the MCL approach for conditional distribution estimation in regression settings where multiple targets may be sampled for each training input. Multiple Choice Learning is a simple framework to tackle multimodal density estimation, using the Winner-Takes-All (WTA) loss for a set of hypotheses. In regression settings, the existing MCL variants focus on merging the hypotheses, thereby eventually sacrificing the diversity of the predictions. In contrast, our method relies on a novel learned scoring scheme underpinned by a mathematical framework based on Voronoi tessellations of the output space, from which we can derive a probabilistic interpretation. After empirically validating rMCL with experiments on synthetic data, we further assess its merits on the sound source localization problem, demonstrating its practical usefulness and the relevance of its interpretation.
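The abstract describes two ingredients: a Winner-Takes-All (WTA) loss over a set of hypotheses, and a learned score per hypothesis. A minimal sketch of how these might combine for a single training input follows. It is an illustration, not the authors' implementation. Everything not stated in the abstract is an assumption: the squared Euclidean distance, the binary cross-entropy term on the scores, the averaging, and the hypothetical names `rmcl_loss` and `beta`.

```python
import math

def rmcl_loss(hyps, scores, targets, beta=1.0, eps=1e-12):
    """Illustrative WTA + scoring loss for one input (assumed form).

    hyps:    list of K hypothesis vectors
    scores:  list of K predicted scores in (0, 1)
    targets: list of M target vectors sampled for this input
    """
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    # WTA term: each target only penalizes its closest hypothesis,
    # i.e. the hypothesis whose Voronoi cell contains the target.
    winners = []
    wta = 0.0
    for t in targets:
        dists = [sq_dist(h, t) for h in hyps]
        k = min(range(len(hyps)), key=dists.__getitem__)
        winners.append(k)
        wta += dists[k]
    wta /= len(targets)

    # Scoring term (assumption): a hypothesis is labeled 1 if it wins
    # at least one target and 0 otherwise; scores are fit with BCE.
    won = set(winners)
    active = [1.0 if k in won else 0.0 for k in range(len(hyps))]
    bce = 0.0
    for s, a in zip(scores, active):
        s = min(max(s, eps), 1.0 - eps)  # keep log() finite
        bce -= a * math.log(s) + (1.0 - a) * math.log(1.0 - s)
    bce /= len(hyps)

    return wta + beta * bce, winners, active

# Toy example: three 2-D hypotheses, two targets.
hyps = [[0.0, 0.0], [10.0, 10.0], [-10.0, -10.0]]
scores = [0.9, 0.8, 0.1]
targets = [[0.1, 0.0], [9.0, 10.0]]
loss, winners, active = rmcl_loss(hyps, scores, targets)
```

In this toy case the third hypothesis wins no target, so only its score is pushed toward zero and its position is left untouched. This is one way to keep unused hypotheses from being merged into the others, in line with the diversity argument in the abstract.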

Victor Letzelter, Mathieu Fontaine, Mickaël Chen, Patrick Pérez, Slim Essid, Gaël Richard • 2023

Related benchmarks

Task                 Dataset                  Result           Rank
Source Localization  RESYN reverberant (D2)   EMD 24.45        7
Source Localization  RESYN reverberant (D3)   EMD 32.28        7
Source Localization  ANSYN 1.0 (D2)           EMD 13.87        7
Source Localization  ANSYN D3 1.0             EMD 20.76        7
Source Localization  RESYN reverberant (D1)   EMD 12.14        7
Source Localization  ANSYN 1.0 (D1)           EMD 7.04         7
Source Separation    WSJ0 3mix (eval)         SI-SDR 10.06     4
Source Separation    WSJ0-2mix (eval)         SI-SDR 16.3      4
Regression           UCI Wine (20 folds)      Distortion 0.02  3
Regression           UCI Concrete (20 folds)  Distortion 5.13  3
Showing 10 of 18 rows
