Following on from here Creating a github page for hosting community trained models
@synesthesiam - thanks a lot for those details. I will certainly have a go with gruut.
It sounds like we’ve got some similar aspirations around pronunciation!
Regarding the handling of read, I’d experimented earlier in the year with a simple (ish) language model approach that could generally infer the most applicable pronunciation for heteronyms by use in the context. It is hardly production ready but there are a few details in the thread below. The data I’d gathered is by no means complete, but demonstrates reasonable effectiveness. The manual set up is a clear downside but the upside is that it doesn’t need guidance with input text, it will make a pretty reasonable guess usually.
Generally my focus is on British English / RP so it’d be interesting to see how easily gruut could be adapted for that. I saw you had a way to add accents (eg French) but with so many different pronunciation differences, I’m wondering if a distinct en-UK dictionary might make sense (although clearly I’d need to figure out a way to source it!)