Why are the words combined when we use language model

dbanka · December 27, 2017, 7:54am

if I infer using my model without language model, I am getting this output:

on a istanash seeeelad foundre of asios sociats smalls a ce firm sand marets texes whe o engierating architectua o serven primarily servin cowe recourte this conversation sermey so lo we’re here fore isaske so questions about the wayyuecannoct meetings wire then your business we are in t trig to get insight on aa application were developing thit takes the audio

If I infer with language model, I am getting:

on a istanashseeeeladfoundreofasiossociatssmallsacefirmsandmaretstexeswheyoengieratingaarchitectuaoservenprimarilyservintcoweyrecourtethis conversation serve so what we are here for is ask some questions about the way you can not meetings were then your business we are in a try to get insight on an application were developing that takes the audio

My question is that why are many words combined/joined when we use language model?

kdavis · December 28, 2017, 9:48am

Not answering your question, but making a suggestion.

As documented in the README

Once everything is installed you can then use the deepspeech binary to do speech-to-text on short, approximately 5 second, audio files (currently only WAVE files with 16-bit, 16 kHz, mono are supported in the Python client)

The sentences you are feeding the system seem longer than 5 seconds, assuming they’re not from Steve Woodmore.

To improve the performance of the acoustic and language model you should limit your audio files to about 5 second in length.

panybj · December 29, 2017, 7:55am

someone said that the gpu deepspeech will not combine words.

Topic		Replies	Views
Text produced has long strings of words with no spaces DeepSpeech	22	4045	April 30, 2018
DeepSpeech generates long nonsense tokens as output DeepSpeech	1	607	July 3, 2018
How language model is used in deepspeech DeepSpeech	5	8332	February 26, 2018
Model misses some of the word during inference DeepSpeech	15	1025	November 4, 2020
Using Deep Speech DeepSpeech	34	12908	August 20, 2019

Why are the words combined when we use language model

Related topics