Recognizing phone numbers

singpolyma · March 28, 2018, 7:52pm

I’m interested in using deep speech to do transcriptions on voicemails I receive. So far the biggest limitation in the recognition is phone numbers – deep speech seems biased against the audio containing a string of digits in a row, which makes sense since saying “five five five three four four …” is not common in normal speech, but very common in voicemails.

Is it possible to create a training model that starts with the pre-trained model, but adds more known-good transcripts I have on top of that? Maybe if I just feed it a lot of people saying phone numbers this will improve.

kdavis · March 29, 2018, 8:35am

This is definitely possible and one of the reasons we release the checkpoints[1].

What one does is to “fine tune”, continue training the checkpointed model using your data set containing phone numbers, the model.

In addition you may need to recreate a language model, using KenLM[2], and trie from text that contains more numbers.

Topic		Replies	Views
How can we recognize speech to text with numbers as well as? any possibilities is there? DeepSpeech	0	418	October 3, 2018
How to do Contact Name Recognition using DeepSpeech DeepSpeech	3	756	November 7, 2018
Predicting mobile number DeepSpeech	3	483	April 14, 2019
Some words getting skipped in whole sentence DeepSpeech	9	680	May 22, 2019
Converting numbers in textual form, to numerical values in STT output DeepSpeech	13	3208	November 19, 2020

Recognizing phone numbers

Related topics