Speech Intelligibility Classifiers from 550k Disordered Speech Samples
About
We developed dysarthric speech intelligibility classifiers on 551,176 disordered speech samples contributed by a diverse set of 468 speakers, with a range of self-reported speaking disorders and rated for their overall intelligibility on a five-point scale. We trained three models following different deep learning approaches and evaluated them on ~94K utterances from 100 speakers. We further found the models to generalize well (without further training) on the TORGO database (100% accuracy), UASpeech (0.93 correlation), ALS-TDI PMP (0.81 AUC) datasets as well as on a dataset of realistic unprompted speech we gathered (106 dysarthric and 76 control speakers,~2300 samples).
Subhashini Venugopalan, Jimmy Tobin, Samuel J. Yang, Katie Seaver, Richard J.N. Cave, Pan-Pan Jiang, Neil Zeghidour, Rus Heywood, Jordan Green, Michael P. Brenner• 2023
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Dysarthric speech severity assessment | UASpeech Cross-domain (test) | SRCC0.936 | 10 | |
| Dysarthric speech severity assessment | DysArinVox Cross-domain (test) | SRCC0.538 | 10 | |
| Dysarthric speech severity assessment | EasyCall Cross-domain (test) | SRCC0.205 | 10 | |
| Dysarthric speech severity assessment | EWA-DB Cross-domain (test) | SRCC0.393 | 10 | |
| Dysarthric speech severity assessment | NeuroVoz Cross-domain (test) | SRCC0.18 | 10 | |
| Dysarthric speech severity assessment | SAP In-domain (test) | SRCC0.473 | 10 |
Showing 6 of 6 rows