Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hierarchical Pronunciation Assessment with Multi-Aspect Attention

About

Automatic pronunciation assessment is a major component of a computer-assisted pronunciation training system. To provide in-depth feedback, scoring pronunciation at various levels of granularity such as phoneme, word, and utterance, with diverse aspects such as accuracy, fluency, and completeness, is essential. However, existing multi-aspect multi-granularity methods simultaneously predict all aspects at all granularity levels; therefore, they have difficulty in capturing the linguistic hierarchy of phoneme, word, and utterance. This limitation further leads to neglecting intimate cross-aspect relations at the same linguistic unit. In this paper, we propose a Hierarchical Pronunciation Assessment with Multi-aspect Attention (HiPAMA) model, which hierarchically represents the granularity levels to directly capture their linguistic structures and introduces multi-aspect attention that reflects associations across aspects at the same level to create more connotative representations. By obtaining relational information from both the granularity- and aspect-side, HiPAMA can take full advantage of multi-task learning. Remarkable improvements in the experimental results on the speachocean762 datasets demonstrate the robustness of HiPAMA, particularly in the difficult-to-assess aspects.

Heejin Do, Yunsu Kim, Gary Geunbae Lee• 2022

Related benchmarks

TaskDatasetResultRank
Phoneme Pronunciation Assessmentspeechocean762 official (test)
PCC0.616
24
Pronunciation AssessmentSpeechocean762 (test)
Utterance Fluency (PCC)74.9
18
Utterance-level Pronunciation AssessmentSpeechocean762
PCC (Total)0.754
9
Word-level Pronunciation AssessmentSpeechocean762
PCC (Total)0.591
7
Word Pronunciation Assessmentspeechocean762 official (test)
Accuracy (PCC)57.5
4
Utterance Pronunciation Assessmentspeechocean762 official (test)--
4
Showing 6 of 6 rows

Other info

Follow for update