Hi all!
I found some time to play around with training a model. I put together this tutorial on how I trained a Dutch model with DeepSpeech and Common Voice using Google Colab’s free resources:
https://colab.research.google.com/drive/12AVZWydY07O2k2eGRlri4pwPMJibs_0v
This is mostly a re-creation of the existing DeepSpeech documentation on training a model / creating a scorer. I used comments in Colab to indicate whenever I had to add or change a step from what was in the docs.
I know this isn’t the most accurate model, and I ran into a few problems! I’d love any advice or suggestions for improving the results. Still, I’m impressed by what open source, open data, and the free hardware on Colab can do.
As the Mozilla Foundation focuses more on trustworthy AI, I’d love to do more with DeepSpeech and support the work you’re doing. I’m happy to put together other tutorials, or to update the docs where appropriate based on the comments in Colab. Let me know how I can help!