Persian/Farsi TTS

hkalbasi · December 16, 2020, 6:59pm

I just made a pull request for adding Farsi in num2words library.

Why we don’t use common voice sentences? There are near 7000 sentences in Farsi, which is reviewed and does not have numbers.

My friend is volunteer for recording the dataset. But I think we don’t have good microphone. Is IPhone’s microphone considered good? Can we solve this by software? Please message me for more details in an instant messaging app so I can send you samples of my friend voice. (Matrix: @hkalbasi:mozilla.org , telegram: @hkalbasi)

And another concern: In Farsi, characters like e, a, o are optional. For example کِتاب which means book is ketaab and کَتاب which is an invalid word is kataab (notice the small character moves from down to up, I hope your browser show that, but I can also send image) and everyone write کتاب without any a or e. This maybe is possible to handle by a dictionary. But the problem becomes more difficult in genitive case, which is connected in Farsi by e. For example my book in farsi is کِتابِ مَن = ketaabe man. But every one write it کتاب من and espeak read it ketaab man which is wrong. How we can handle that so our machine can read simple texts without ـَ ـِ ـُ explicity declared?

Topic		Replies	Views
Questions about phonemizer for a non-English language TTS (Text-to-Speech)	1	2104	July 7, 2022
Potential for IPA Voice to Text DeepSpeech	4	909	May 3, 2021
Speech to tex persian by deep speech DeepSpeech learning	10	3281	September 27, 2020
Help a user in need of a TTS TTS (Text-to-Speech)	2	2831	February 22, 2023
A basque university is looking for a company to take ove their version of common voice TTS (Text-to-Speech)	6	488	January 7, 2020

Persian/Farsi TTS

Related topics