I have installed deepspeech and could successfully perform live transcription with mic_vad_streaming. Now, I desire to train with my data which consists of about 15 words (i.e., 15 commands). I have the following difficulties:
- I am using windows. I am finding DeepSpeech.EXE and not DeepSpeech.py. Executing DeepSpeech.exe throws a message with options which does NOT incude --train-files
- DeepSpeech-0.9.3 archive is obtained separately. If I execute DeepSpeech.py, I am getting the following error:
from deepspeech_training import train as ds_train
ModuleNotFoundError: No module named ‘deepspeech_training’
Since my language contains only 15 words, do I need GPU or does CPU suffices?
The command to train deep speech (python3 DeepSpeech.py --train_files …/data/CV/en/clips/train.csv --dev_files …/data/CV/en/clips/dev.csv --test_files …/data/CV/en/clips/test.csv) is obtained from https://deepspeech.readthedocs.io/en/r0.9/TRAINING.html. The command does not include ‘alphabet.txt’. Is it implied that ‘alphabet.txt’ exists in the current directory?
The wav file size in CSV represents actual size of file? I mean, what is obtained from ‘dir’ command?
Is there any simplified data set (say about 10 words) to ILLUSTRATE training?
Thanks and Regards