Potential for IPA Voice to Text

sribbleinc · March 31, 2021, 11:39pm

Hi, I’m exploring the possibility of using Mozilla’s DeepSpeech for general phonetic transcription (ideally transcription of connected speech). The audio collection would have a slightly different format than Common Voice: audio (of any language) would be transcribed into the International Phonetic Alphabet (IPA), including diacritics, etc. This would potentially require more review, given that most people are not versed in phonetic transcription and that transcription conventions vary somewhat by region and language. It’s just a thought and I’d love to hear what people think.
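To make the data format idea a bit more concrete, here is a minimal sketch of how IPA transcripts with diacritics might be split into symbols and collected into a symbol inventory. The clip names, the transcripts, and the helper names `ipa_symbols` and `build_alphabet` are all made up for illustration and are not part of DeepSpeech or Common Voice; whether a given toolkit accepts multi-codepoint labels like these would need to be checked separately.

```python
import unicodedata

# Hypothetical (clip, IPA transcript) pairs, invented for illustration only.
samples = [
    ("clip_0001.wav", "h\u0259\u02c8lo\u028a w\u025c\u02d0ld"),
    ("clip_0002.wav", "b\u0254\u0303\u0292u\u0281"),
]


def ipa_symbols(transcript):
    """Split a transcript into symbols, keeping combining diacritics
    attached to the base character they modify."""
    symbols = []
    # NFD decomposes precomposed characters into base + combining marks,
    # so the same sound is always represented the same way.
    for ch in unicodedata.normalize("NFD", transcript):
        if unicodedata.combining(ch) and symbols:
            symbols[-1] += ch  # attach the diacritic to the previous symbol
        else:
            symbols.append(ch)
    return symbols


def build_alphabet(rows):
    """Collect the sorted set of distinct symbols used across all transcripts."""
    return sorted({sym for _, text in rows for sym in ipa_symbols(text)})


if __name__ == "__main__":
    for name, text in samples:
        print(name, ipa_symbols(text))
    print(build_alphabet(samples))
```

Note that spacing modifier characters such as the stress mark and the length mark are not Unicode combining characters, so this sketch treats them as separate symbols. That is one of the convention choices reviewers would have to agree on.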

noetits · April 29, 2021, 8:12am

Hello, that would be very interesting. Did you continue to explore that topic?

ftyers · April 29, 2021, 6:43pm

I wrote a number of phonemisers for different languages; they might be useful.

sribbleinc · May 3, 2021, 6:27pm

I’m still looking into it! I’m not in a rush, since I think what’s needed right now is reliable input/training data.

sribbleinc · May 3, 2021, 6:28pm

Ooh I’ll take a look when I get a chance! Thank you for sharing!
