Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw

About

We present a number of low-resource approaches to the tasks of the Zero Resource Speech Challenge 2021. We build on the unsupervised representations of speech proposed by the organizers as a baseline, derived from Contrastive Predictive Coding (CPC) and clustered with the k-means algorithm. We demonstrate that simple methods of refining those representations can narrow the gap to, or even improve upon, solutions that use a high computational budget. The results lead to the conclusion that the CPC-derived representations are still too noisy for training language models, but stable enough for simpler forms of pattern matching and retrieval.

Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski, Adrian Łańcucki, Ricard Marxer, Mateusz Opala, Piotr Pusz, Paweł Rychlikowski, Michał Stypułkowski • 2021

Related benchmarks

Task                     Dataset                 Result                Rank
Acoustic unit discovery  ZeroSpeech 2021 (dev)   SS Clean Error: 2.95  7