What is WER/CER of DeepSpeech v0.7.1 (or any other models) on Common Voice English

qdenisq · August 12, 2020, 3:24pm

Hey there. Sorry for a newbie question but could someone point me in a direction where I could find some info about the performance of DeepSpeech on the Common Voice English dataset (is it yet considered as a benchmark?). I saw some mentioned WER ~40% on the test set for one of the DeepSpeech versions. And that’s it. Couldn’t find anything else. So, will appreciate any links to blogs, papers that report performance on English Common Voice

baconator · August 12, 2020, 3:59pm

search for “word error rate” here:

othiele · August 12, 2020, 3:59pm

As Mozilla offers a great model as @baconator mentions, few people are training just on Common Voice alone. Depends heavily on the test set, but I would guess you end up at 0.15-0.20 for most versions of STT. See some WERs for other languages here

qdenisq · August 12, 2020, 4:13pm

Thanks for the link. I already checked it out and found only WER reported for the LibriSpeech test-clean which, I assume, is way lower than the same model would get on the CommonVoice. Do you know if it makes sense to speculate about WER on CommonVoice based on WER for LibriSpeech test-other? both sets have a fair degree of the accented speech, recording conditions are different though

qdenisq · August 12, 2020, 4:14pm

Thanks @othiele, will have a look. Cheers

dabinat · August 12, 2020, 7:37pm

The reason Librispeech is used as a benchmark is because the test set is very accurately transcribed. Currently Common Voice has errors in the validated dataset so it’s less suitable for benchmarking.

Topic		Replies	Views
Documentation about WER CER and loss value on Test Set of LibriSpeech for pre trained models? DeepSpeech	0	637	September 16, 2019
Help with understanding benchmarks - are we at 5.6% word error rate on Librespeech Clean+Other? DeepSpeech	1	1023	May 25, 2018
DeepSpeech Latest Results with English DeepSpeech	10	1295	July 14, 2019
DeepSpeech WER on librispeech clean dataset DeepSpeech	3	616	December 10, 2019
Benchmark results with v0.3.0? DeepSpeech	4	525	October 27, 2018

What is WER/CER of DeepSpeech v0.7.1 (or any other models) on Common Voice English

Related topics