Does DeepSpeech use any phonetic sublayer while transforming speech to text? Is there a way to see that intermediate representation, i.e., the output before the final text, rather than only the result after applying DeepSpeech to speech?
If not, is there maybe some other neural network for this goal, i.e., converting speech to phonetics?
With DeepSpeech this doesn't work with the normal models. The lowest processing level you can access is the output without a scorer (= the result of the acoustic model alone), and the acoustic model already maps directly to orthography (characters), not phonemes. A special acoustic model trained on phoneme targets would be necessary (and it would be incompatible with the DeepSpeech scorer). As far as I know such a model doesn't exist - if I'm wrong -> I'd be interested too.
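For reference, you can inspect that lowest level by simply not loading a scorer; the transcript then comes straight from the acoustic model's character probabilities. A minimal sketch with the DeepSpeech Python API (the model and audio file names are placeholders, and it assumes a 16 kHz mono 16-bit WAV, the format the released models expect):

```python
import wave

import numpy as np
import deepspeech

# Load only the acoustic model; skipping enableExternalScorer()
# means no language-model rescoring is applied during decoding.
model = deepspeech.Model('deepspeech-0.9.3-models.pbmm')

# Read a 16 kHz mono 16-bit PCM WAV into an int16 buffer.
with wave.open('audio.wav', 'rb') as w:
    audio = np.frombuffer(w.readframes(w.getnframes()), dtype=np.int16)

# Decoding over the raw acoustic-model output - note the labels
# are still orthographic characters, not phonemes.
print(model.stt(audio))
```

Even at this level the output alphabet is letters, which illustrates the point above: there is no phoneme layer to expose.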