I am new to NLP field. So, please excuse me, if I am asking about some obvious stuff.
Are there any fundametal differences in the datasets used for training ASR and TTS models?
In case, this is possible, is there anything still to pay attention to when using the dataset for TTS training?