Existing scorer and acoustic model further tweaking/training possibilities

Jakub_Muzik · September 15, 2020, 10:02am

Hi All, I am very new to all this, so please accept my apology if anything below does not make sense…

First I would like to ask if I understand two things correctly:

1

The whole solution architecture is composed from three layers/components:

Acoustic model + decoder (.pbmm file)

Language model (included in the .scorer file)

Classification model (TRIE) (included included in the .scorer file)

2

In newer versions the Language model and TRIE are combined into one component called Scorer.

3

Usually the best way how to create the Acoustic model + decoder (pbmm) and Scorer (scorer) components is to use the same text input set (and corresponding WAV files in case of the Acoustic model).

Given the answers to previous questions are yes, I would like to ask few more more specific questions:

1

a) Is it possible to ammend the pre-created Scorer (based on Libri Speech Corpus) with the custom text input? I.e. keep using it but „tweak it“ by adding specific content (text input)?

b) If so, is it possible to increase the statistical weight of the addition so it takes precedence before the „standard“ content (from Libri Speech Corpus)?

2

Is it possible to do something similar with the Acoustic model + decoder component? I.e. use the supplied (example) model and train it further on specific content (text + audiofiles) to increase its efficiency in specific area of usage?

Thanks a lot for any input on this.

reuben · September 15, 2020, 10:07am

The trie is just a prefix tree encoding the vocabulary of the model.

Incorrect, it’s best to use more text data to build a scorer than just your training transcripts.

Yes, LibriSpeech is open source and so are the scripts we use to build the Scorer, just modify things according to your needs.

Not sure, probably not without writing some code.

Yes, it’s called fine tuning.

Jakub_Muzik · September 15, 2020, 4:41pm

Thank you for the clarification!

Topic		Replies	Views
Question regarding the new scorer function instead of LM+trie DeepSpeech	8	826	May 20, 2020
How does the scorer in DeepSpeech 0.7 work? DeepSpeech	0	889	April 30, 2020
Assigning weights to certain words while training DeepSpeech Model DeepSpeech participation , feedback	25	2276	June 25, 2020
Scorer hyperparameters DeepSpeech	3	879	May 13, 2020
Can I have the option of generating a new scorer file along with the new model post training? DeepSpeech	1	316	May 20, 2020

Existing scorer and acoustic model further tweaking/training possibilities

Related topics