Generate Phonemes From Text

bschuss02 · April 2, 2021, 6:53pm

Is there a way to have the TTS model generate spoken phonemes in addition to spoken words? For example, to have the model read out the sentence “hɛˈləʊ wɜːld hello world” and have it say “hello world” twice in a row.

nmstoker · April 8, 2021, 4:12pm

If you’re happy to write some code, this shouldn’t be particularly hard, although I suspect it’ll be marginally easier if each sentence is written purely in either the regular alphabet or IPA characters, rather than mixing the two within one sentence.

Take a look at the code that converts the text to phonemes. You should check the sentence for the presence of IPA characters and if any are found then you bypass the step that turns the text into phonemes and pass the IPA characters directly to the TTS model.
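As a rough illustration of that check, here is a minimal sketch in Python. It is not taken from the TTS codebase: `contains_ipa`, `to_model_input`, and the `text_to_phonemes` callback are hypothetical names standing in for whatever the real phonemization step is called, and the Unicode ranges are an assumption about which characters count as IPA.

```python
# Unicode blocks that hold IPA-specific symbols. This is an assumed
# definition of "IPA character": plain Latin letters such as "h" or "l"
# appear in both ordinary text and IPA, so they cannot act as a signal.
IPA_RANGES = (
    (0x0250, 0x02AF),  # IPA Extensions, e.g. ɛ ə ʊ ɜ
    (0x02B0, 0x02FF),  # Spacing Modifier Letters, e.g. ˈ ˌ ː
    (0x1D00, 0x1D7F),  # Phonetic Extensions
)


def contains_ipa(sentence: str) -> bool:
    """Return True if any character falls inside one of IPA_RANGES."""
    return any(
        low <= ord(ch) <= high
        for ch in sentence
        for low, high in IPA_RANGES
    )


def to_model_input(sentence: str, text_to_phonemes) -> str:
    """Bypass phonemization when the sentence already looks like IPA.

    `text_to_phonemes` is a hypothetical callable standing in for the
    library's own text-to-phoneme conversion step.
    """
    if contains_ipa(sentence):
        # Already phonemes: hand the string to the model unchanged.
        return sentence
    # Ordinary text: run the usual conversion first.
    return text_to_phonemes(sentence)
```

This only looks at the sentence as a whole, which is why keeping each sentence purely alphabetic or purely IPA makes life easier. It will also miss IPA transcriptions that happen to use only characters outside these ranges (for example æ, ð, ŋ or θ), so the ranges may need extending for your language.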

BTW, your subject line is confusing, as it implies you’re trying to do something else. To be consistent, I’d suggest “Generate audio from phonemes” would make more sense.
