Hi everyone,
Can we use Deepspeech to build a multi accent S2T model?
example: I want to build an ASR model that can transcribe US and Indian accent English language. I have 3-4k hours of labelled data for each accent. Is it possible to build a single model for these accents which can give me good enough WER(<15)?
Any suggestion would be of great help!
Thanks