LIWhiz: A Non-Intrusive Lyric Intelligibility Prediction System for the Cadenza Challenge

About

We present LIWhiz, a non-intrusive lyric intelligibility prediction system submitted to the ICASSP 2026 Cadenza Challenge. LIWhiz leverages Whisper for robust feature extraction and a trainable back-end for score prediction. Tested on the Cadenza Lyric Intelligibility Prediction (CLIP) evaluation set, LIWhiz achieves a root mean square error (RMSE) of 27.07%, a 22.4% relative RMSE reduction over the STOI-based baseline, yielding a substantial improvement in normalized cross-correlation.

Ram C. M. C. Shekar, Iv\'an L\'opez-Espejo• 2025

Related benchmarks

Task	Dataset	Result	Rank
Lyric Intelligibility Prediction	CLIP (val)	RMSE (%)27.13		3
Lyric Intelligibility Prediction	CLIP (evaluation)	RMSE (%)27.07		3

Showing 2 of 2 rows

Other info

Follow for update

@wizwand_team Discord