I setup and used the web-mic example, which included using the models
these are about 1GB
the results were sub-optimal, and i’m wondering about how to improve it.
i wondered regarding the datasets from common voice, 50GB, available for download here - https://commonvoice.mozilla.org/en/datasets. would it help?
if so, how would i go about replacing the above files with it?
any other ideas for improvements?
thanks very much for any idea