Deep Speech v0.4.1 Released

carlfm01 · February 1, 2019, 1:16am

What about packaging rnnoise into the current C++ client and adding an option to enable denoising on the fly? Do you think it would be worth trying to make them work together? For my use case, using ffmpeg with a band filter is not practical, at least with the streaming feature from C#. I think it would be great to have the same noise filter for all the clients.

Here’s the GitHub repository: https://github.com/xiph/rnnoise

kdavis · February 1, 2019, 9:10am

It’s certainly possible.

However, there are a few reasons we have not added in rnnoise:

1. We’ve created, but have yet to use, a tool, voice-corpus-tool, to supplement our audio with noise so that the model itself becomes capable of denoising the audio. In this case rnnoise is not needed.
2. Adding in rnnoise with the current model will systematically modify the audio in ways not seen at train time and could increase WER.
3. Adding in rnnoise could require retraining the model with rnnoise in the pipeline to combat the previous issue.
4. Adding in another dependency where it may not be needed is something we try to avoid.

This is our current take on rnnoise. I’d be curious as to your opinion/experience with it in the pipeline.

carlfm01 · February 1, 2019, 8:17pm

I think in the end only testing will tell us which performs better: the model handling the noise itself, or the model handling the artifacts from rnnoise. I’ll add this to my roadmap. Thanks for sharing your opinion.
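A minimal sketch of how such a test could be scored, assuming you already have reference transcripts plus the transcripts produced with and without rnnoise in front of the model. The function names and the sample sentences are hypothetical placeholders, not part of DeepSpeech or rnnoise. It computes WER as the word-level edit distance divided by the number of reference words, summed over the whole test set.

```python
from typing import Iterable, Sequence, Tuple


def word_edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two word sequences."""
    # prev[j] holds the distance between the ref prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


def corpus_wer(pairs: Iterable[Tuple[str, str]]) -> float:
    """WER over (reference, hypothesis) pairs: total errors / total reference words."""
    errors = 0
    words = 0
    for reference, hypothesis in pairs:
        ref_words = reference.split()
        errors += word_edit_distance(ref_words, hypothesis.split())
        words += len(ref_words)
    if words == 0:
        raise ValueError("need at least one reference word")
    return errors / words


if __name__ == "__main__":
    # Placeholder data only: replace with real transcripts from both pipelines.
    references = ["turn the lights on", "play some music"]
    raw_output = ["turn the light on", "play some music"]
    denoised_output = ["turn the lights on", "play sum music"]

    print("WER without rnnoise:", corpus_wer(zip(references, raw_output)))
    print("WER with rnnoise:   ", corpus_wer(zip(references, denoised_output)))
```

Running both pipelines over the same noisy recordings and comparing the two numbers would show whether the denoiser helps or whether its artifacts hurt the current model.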
