Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces
About
This paper presents the machine learning architecture of the Snips Voice Platform, a software solution to perform Spoken Language Understanding on microprocessors typical of IoT devices. The embedded inference is fast and accurate while enforcing privacy by design, as no personal user data is ever collected. Focusing on Automatic Speech Recognition and Natural Language Understanding, we detail our approach to training high-performance Machine Learning models that are small enough to run in real-time on small devices. Additionally, we describe a data generation procedure that provides sufficient, high-quality training data without compromising user privacy.
Alice Coucke, Alaa Saade, Adrien Ball, Th\'eodore Bluche, Alexandre Caulier, David Leroy, Cl\'ement Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Ma\"el Primet, Joseph Dureau• 2018
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Automatic Speech Recognition | LibriSpeech (test-other) | WER17.5 | 966 | |
| Automatic Speech Recognition | LibriSpeech clean (test) | WER6.6 | 833 | |
| Automatic Speech Recognition | LibriSpeech (dev-other) | WER16.8 | 411 | |
| Automatic Speech Recognition | LibriSpeech (dev-clean) | WER (%)6.4 | 319 | |
| NLU Intent Classification and Slot Filling | Braun corpora overall (test) | Precision89.6 | 6 | |
| NLU Intent Classification and Slot Filling | Chatbot Corpus (test) | Precision96.3 | 6 | |
| NLU Intent Classification and Slot Filling | Ask Ubuntu Corpus (test) | Precision81.2 | 6 | |
| NLU Intent Classification and Slot Filling | web apps corpus (test) | Precision65.5 | 6 |
Showing 8 of 8 rows