I am training/fine-tuning DeepSpeech 0.7.0 on Ubuntu 16.04 with Python 3.6.5, TensorFlow 1.15.2, and CUDA 10.0 / cuDNN 7.6.5.
I am also looking into training from scratch on conversational datasets only.
I would like some details on the 1,700 hours of transcribed WAMU (NPR) audio that was part of the original training corpora. Is it podcast-style, conversational speech? Is the dataset available for download, either free or paid? If so, how can I get access to it?