I just made a pull request for adding Farsi in num2words library.
Why we don’t use common voice sentences? There are near 7000 sentences in Farsi, which is reviewed and does not have numbers.
My friend is volunteer for recording the dataset. But I think we don’t have good microphone. Is IPhone’s microphone considered good? Can we solve this by software? Please message me for more details in an instant messaging app so I can send you samples of my friend voice. (Matrix: @hkalbasi:mozilla.org
, telegram: @hkalbasi
)
And another concern: In Farsi, characters like e, a, o are optional. For example کِتاب which means book is ketaab and کَتاب which is an invalid word is kataab (notice the small character moves from down to up, I hope your browser show that, but I can also send image) and everyone write کتاب without any a or e. This maybe is possible to handle by a dictionary. But the problem becomes more difficult in genitive case, which is connected in Farsi by e. For example my book in farsi is کِتابِ مَن = ketaabe man. But every one write it کتاب من and espeak read it ketaab man which is wrong. How we can handle that so our machine can read simple texts without ـَ ـِ ـُ explicity declared?