I see - I hadn’t picked that up from your initial question.
0.4.1 did include a snapshot of the English Common Voice data. There was some discussion of this in the thread “Any reason 0.5.x models weren't trained on Common Voice data this time?”: leaving Common Voice out of 0.5 was an oversight, and it’s likely to be included in the released models for 0.6 once that’s out of alpha.
If you’re trying to improve it specifically for Indian English, some fine-tuning might help (I know others on here have been looking at that for Indian accents, but I don’t know how they’ve got on). Another approach to consider would be including Indian-sourced text in the language model (LM). That could help it cope with “Indianisms” that aren’t typically found in the American / British English text that likely makes up the bulk of the LM data.
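To make the LM idea a bit more concrete, here is a rough sketch of how you might prepare a combined text corpus before building a language model from it. It only covers the text preparation, not the LM build itself. The function names and the `indian_repeat` weighting parameter are my own illustrations, not part of DeepSpeech, and I'm assuming the model alphabet is lowercase a-z, space and apostrophe, so check that against the alphabet file you actually train with.

```python
import re
from typing import Iterable, Iterator

# Assumption: the acoustic model's alphabet is lowercase a-z, space and apostrophe.
_DISALLOWED = re.compile(r"[^a-z' ]+")
_SPACES = re.compile(r" +")


def normalize_line(line: str) -> str:
    """Lowercase a line and replace anything outside the assumed alphabet with a space."""
    text = _DISALLOWED.sub(" ", line.lower())
    return _SPACES.sub(" ", text).strip()


def merge_corpora(
    general: Iterable[str],
    indian: Iterable[str],
    indian_repeat: int = 1,
) -> Iterator[str]:
    """Yield normalized lines from both corpora.

    indian_repeat (hypothetical knob) repeats the Indian-sourced lines so they
    are not swamped by a much larger general corpus.
    """
    for line in general:
        cleaned = normalize_line(line)
        if cleaned:
            yield cleaned
    indian_lines = [normalize_line(line) for line in indian]
    for _ in range(indian_repeat):
        for cleaned in indian_lines:
            if cleaned:
                yield cleaned


if __name__ == "__main__":
    general_text = ["Hello, world!"]
    indian_text = ["Kindly do the needful, na?"]
    for out in merge_corpora(general_text, indian_text, indian_repeat=2):
        print(out)
```

You would write the output to a plain text file, one sentence per line, and feed that to whatever LM tooling you are using. Repeating lines is a crude way to weight the Indian text; whether it helps is something you'd need to test on your own audio.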