Syntethic voices generation in Hindi

harveenchadha · April 14, 2020, 10:44am

Hi,

I want to create a dataset with synthetic voices in Hindi Language, I suppose tacotron and wavenet are trained on English. Any idea how can I go about collecting data to train a production grade speech to text engine?

baconator · April 14, 2020, 6:07pm

You should look into the Common Voice Project.

nmstoker · April 14, 2020, 10:45pm

Could you explain a little more about what you’re trying as your overall objective?

You seem to be mixing comments that interchangeably imply an interest in TTS and STT and you’ve posted this in the STT forum in spite of a title that looks more appropriate for TTS. I appreciate they’re not completely disconnected but would be good to understand if you’re clear on your own understanding first (no offence intended )

xhtm · April 15, 2020, 2:05am

Hi @nmstoker

The OP wants to train a speech-to-text model (STT). However, he needs to generate data to train it. He has the text files and wants to generate the speech files from it (i.e. do TTS)

nmstoker · April 15, 2020, 9:49am

Thanks @xhtm - I guessed as much but don’t know why they didn’t just say that more clearly.

Also if they have (or can conceivably get) the data to produce a TTS but ultimately want to produce STT why not start with at least trying to use that data for transfer learning / fine tuning with the STT model?

xhtm · April 15, 2020, 9:54am

I think they only have the text files, and are looking for a software that can produce speech for them. So, right now they can’t really do STT.

Topic		Replies	Views
Multispeaker versus transfer learning TTS (Text-to-Speech)	10	1318	October 5, 2020
Requesting Guidance for Training TTS (Text-to-Speech)	0	640	January 25, 2021
New to the TTS field and i have some questions (about the necessary data) TTS (Text-to-Speech) learning	3	829	February 12, 2021
My Success with Mozilla TTS TTS (Text-to-Speech)	7	7096	January 21, 2021
Training 2 New Custom Datasets with TTS-recipes, need suggestions for inference/synthesis TTS (Text-to-Speech) learning	2	1668	January 28, 2022

Syntethic voices generation in Hindi

Related topics