Transcription having lot of spelling errors and giving wrong spaces for words

raghavk92 · January 15, 2019, 8:33am

@dan0 the models for 0.4.0 has resolved most of the transcription errors the errors. Thanks for the response but i have a few issues with transfer learning.

I have 3 questions regarding a few problems i am facing :

We tried to do transfer learning on the model with a few of our samples(4 large audio samples (technical talks) converted to 740 (around 5 sec chunks and 500 training samples and 100 for dev and 140 for test) that we created with around 5 sec audio with transcription in the csv file created from voice activity detection.

So some transcriptions got better some got worse.
So how many files are needed for a good transfer learning to happen?

While transcribing with 04.0 i found that when the person speaks fast the transcription goes wrong either two words merge and form a wrong word or to seperate wrong words. So how do i improve this or this will also happen with transfer learning with people speaking fast and how many samples are ideal
I tried transcribing with files with background music but got around 75% accuracy. I removed the noise with audacity:
procedure :

remove voice from audio
get noise profile
and remove noise from original sample with noise profile

The accuracy was 85% after this.

But i tried to automate this with sox package for ubuntu
procedure:

remove voice
sox audio.wav music.wav oops
create noise profile
sox music.wav -n noiseprof noise.prof
Remove noise from wav using profile
sox audio.wav output.wav noisered noise.prof 0.21

(i also tried with different levels of aggressivness like 0.3,0.05,0.1 etc but not much change in transcription)

The trancription became bad. I think it damaged the voice audio while noise reducing with sox.Do you know a better way for noise reduction and get better transscription.? And if i need to better transcribe a file which has background music is there any other way(like would training help and how many samples would i be needing)?

Thanks

Topic		Replies	Views
Question with DeepSpeech Transfer Learning DeepSpeech	40	2837	March 28, 2020
How to get good transcription results with only a specific English vocabulary? DeepSpeech	15	1772	June 3, 2020
Transcription Results very bad in english DeepSpeech	16	1190	October 7, 2020
Fine-tuning DeepSpeech Model (CommonVoice-DATA) DeepSpeech	60	6201	August 20, 2019
Trained model on my own data DeepSpeech	48	4645	May 29, 2021

Transcription having lot of spelling errors and giving wrong spaces for words

Related topics