I am planning to build a language model for Devanagari script, but it has a lot of letters and variations to cover: base letters such as क, ख, अ, आ, and their combined forms such as के, का, खे, खा, etc.
There are also half letters, for example क् in क्या.
How would you propose building alphabet.txt in cases like these?
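
To make the question concrete, here is a minimal sketch of one possible approach. It relies on the fact that in Unicode the combined forms are sequences of code points (के is क followed by the vowel sign े, and the half letter in क्या is क followed by the virama ्). It assumes that alphabet.txt lists one symbol per line and that treating each code point as a symbol is acceptable; both are assumptions on my part, and `code_points` and `build_alphabet` are just illustrative helper names.

```python
import unicodedata


def code_points(text):
    """Return (character, Unicode name) pairs for every code point in text."""
    return [(ch, unicodedata.name(ch, "UNKNOWN")) for ch in text]


def build_alphabet(lines):
    """Collect the unique code points used in a corpus, sorted by code point."""
    symbols = set()
    for line in lines:
        # Normalize so the same text is always stored as the same code points.
        symbols.update(unicodedata.normalize("NFC", line.strip()))
    return sorted(symbols)


if __name__ == "__main__":
    # Show how a combined form and a half-letter word break down.
    for word in ["के", "क्या"]:
        print(word, code_points(word))

    # Hypothetical corpus; in practice this would be the training transcripts.
    sample = ["क्या", "के का", "खे खा"]
    for symbol in build_alphabet(sample):
        print(repr(symbol))
```

With this, the alphabet only needs the base letters, the vowel signs, and the virama rather than every combined form. Whether a given toolkit handles combining marks well as separate symbols is something I have not verified.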