Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Attention model for articulatory features detection

About

Articulatory distinctive features, as well as phonetic transcription, play important role in speech-related tasks: computer-assisted pronunciation training, text-to-speech conversion (TTS), studying speech production mechanisms, speech recognition for low-resourced languages. End-to-end approaches to speech-related tasks got a lot of traction in recent years. We apply Listen, Attend and Spell~(LAS)~\cite{Chan-LAS2016} architecture to phones recognition on a small small training set, like TIMIT~\cite{TIMIT-1992}. Also, we introduce a novel decoding technique that allows to train manners and places of articulation detectors end-to-end using attention models. We also explore joint phones recognition and articulatory features detection in multitask learning setting.

Ievgen Karaulov, Dmytro Tkanov• 2019

Related benchmarks

TaskDatasetResultRank
Phoneme RecognitionTIMIT (test)
PER20.4
31
Articulatory Feature DetectionTIMIT (test)
Anterior Feature Accuracy0.94
4
Showing 2 of 2 rows

Other info

Code

Follow for update