LIWhiz: A Non-Intrusive Lyric Intelligibility Prediction System for the Cadenza Challenge
About
We present LIWhiz, a non-intrusive lyric intelligibility prediction system submitted to the ICASSP 2026 Cadenza Challenge. LIWhiz leverages Whisper for robust feature extraction and a trainable back-end for score prediction. Tested on the Cadenza Lyric Intelligibility Prediction (CLIP) evaluation set, LIWhiz achieves a root mean square error (RMSE) of 27.07%, a 22.4% relative RMSE reduction over the STOI-based baseline, yielding a substantial improvement in normalized cross-correlation.
Ram C. M. C. Shekar, Iv\'an L\'opez-Espejo• 2025
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Lyric Intelligibility Prediction | CLIP (val) | RMSE (%)27.13 | 3 | |
| Lyric Intelligibility Prediction | CLIP (evaluation) | RMSE (%)27.07 | 3 |
Showing 2 of 2 rows