As of now, is Deep Speech viable for real-world applications?

nishthajain1611 · January 9, 2020, 7:46am

Shouldn’t adding a buffer time before and after the webrtcvad output solve the problem?

For example, if the VAD says the voice lies between 4.20 s (start) and 6.80 s (end), we can cut the chunk from 4.18 s to 6.82 s instead, i.e. add a 20 ms buffer before the start time and after the end time.

The only difficulty here would be choosing the right buffer time to use.
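To make the idea concrete, here is a rough sketch of the padding step. It does not call webrtcvad itself. It assumes the segment boundaries have already been obtained as millisecond offsets and that the audio is raw 16-bit mono PCM. The names `pad_segment` and `slice_pcm` are made up for illustration, and `pad_ms` is the buffer time that would still need tuning.

```python
def pad_segment(start_ms, end_ms, pad_ms=20, audio_len_ms=None):
    """Widen a VAD segment by pad_ms on each side, clamped to the audio bounds."""
    if end_ms < start_ms:
        raise ValueError("end_ms must not be before start_ms")
    padded_start = max(0, start_ms - pad_ms)  # never go before the start of the audio
    padded_end = end_ms + pad_ms
    if audio_len_ms is not None:
        padded_end = min(audio_len_ms, padded_end)  # never run past the end
    return padded_start, padded_end


def slice_pcm(pcm, start_ms, end_ms, sample_rate=16000, sample_width=2):
    """Cut raw mono PCM bytes between two millisecond offsets (assumed format)."""
    start_byte = (start_ms * sample_rate // 1000) * sample_width
    end_byte = (end_ms * sample_rate // 1000) * sample_width
    return pcm[start_byte:end_byte]


# The example from this post: 4.20 s to 6.80 s with a 20 ms buffer.
print(pad_segment(4200, 6800, pad_ms=20))  # (4180, 6820)
```

Working in integer milliseconds avoids floating-point rounding on the boundaries, and the clamping covers segments that start or end very close to the edges of the recording.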

Am I correct in following this approach to deal with this error?
Thanks in advance.
