IMS-Speech: A Speech to Text Tool

About

We present IMS-Speech, a web-based tool for German and English speech transcription that aims to facilitate research in disciplines that require access to lexical information in spoken-language materials. The tool is built on a modern open-source software stack, advanced speech recognition methods, and public data resources, and it is freely available to academic researchers. The models are built to be generic so that they provide transcriptions of competitive accuracy across a diverse set of tasks and conditions.

Pavel Denisov, Ngoc Thang Vu • 2019

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Automatic Speech Recognition | LibriSpeech (test-other) | WER 12.5 | 966 |
| Automatic Speech Recognition | LibriSpeech clean (test) | WER 4.3 | 833 |
| Speech Recognition | WSJ (92-eval) | WER 3.8 | 131 |
| Automatic Speech Recognition | Tuda-DE (test) | WER 10 | 26 |
| Automatic Speech Recognition | Tuda-DE (dev) | WER 11.1 | 24 |
| Automatic Speech Recognition | AMI SDM English (eval) | WER 38.5 | 8 |
| Automatic Speech Recognition | Verbmobil 1 German (test) | WER 7.3 | 4 |
| Automatic Speech Recognition | AMI IHM English (eval) | WER 17.4 | 2 |
| Automatic Speech Recognition | AMI MDM English (eval) | WER 34.1 | 2 |
| Automatic Speech Recognition | Verbmobil German 1 (dev) | WER 0.067 | 2 |
Showing 10 of 12 rows
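
The results above are all word error rates (WER). As a reminder of what the metric measures, here is a minimal sketch of the standard definition: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a hypothesis, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration; the scoring tool and text normalization behind the reported numbers are not stated on this page.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a fraction: word-level edit distance / reference length.

    Illustrative only: tokenization is plain whitespace splitting, with no
    case folding or punctuation handling (an assumption, not the paper's setup).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)
```

The function returns a fraction; multiply by 100 for the percentage form. Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.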
