Hey there,
I’m currently trying to implement the model for my native language.
Can anyone help me with the dataset format?
In particular, what should the format of the train, dev and test files be?
I’ve currently referred to -
Here is the run command I used -
./DeepSpeech.py --train_files /content/drive/My\ Drive/stt_dataset/train/train.csv \
  --dev_files /content/drive/My\ Drive/stt_dataset/dev/dev.csv \
  --test_files /content/drive/My\ Drive/stt_dataset/test/test.csv \
  --export_dir /content/drive/My\ Drive/model_export/ \
  --checkpoint_dir /content/drive/My\ Drive/checkout/ \
  --alphabet_config_path /content/drive/My\ Drive/alphabet.txt
Train, dev and test are usually split 80/10/10 or 75/20/5 and all have the same layout:
wav_filename,wav_filesize,transcript
/path/audio/common_voice_de_18530569.wav,185900,zahlen sie monatlich vierteljaehrlich halbjaehrlich oder jaehrlich ihren beitrag
/path/audio/common_voice_de_18530616.wav,184364,auf derartige belastungen ist mein gepaecktraeger nicht ausgelegt
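If it helps, here is a rough Python sketch of how CSV files with that layout could be generated. It assumes you already have a list of (wav path, transcript) pairs for your language. The helper names build_rows, split_rows and write_csv are made up for illustration and are not part of DeepSpeech, and the 80/10/10 ratio is just the first of the two splits mentioned above.

```python
import csv
import os
import random

FIELDS = ["wav_filename", "wav_filesize", "transcript"]


def build_rows(clips):
    """Turn (wav_path, transcript) pairs into rows with the three CSV columns."""
    rows = []
    for wav_path, transcript in clips:
        rows.append({
            "wav_filename": wav_path,
            "wav_filesize": os.path.getsize(wav_path),  # file size in bytes
            "transcript": transcript.strip(),
        })
    return rows


def split_rows(rows, train=0.8, dev=0.1, seed=0):
    """Shuffle reproducibly and cut into train/dev/test (the rest goes to test)."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_train = int(len(rows) * train)
    n_dev = int(len(rows) * dev)
    return (rows[:n_train],
            rows[n_train:n_train + n_dev],
            rows[n_train + n_dev:])


def write_csv(path, rows):
    """Write one split with the header line wav_filename,wav_filesize,transcript."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)
```

You would call build_rows once on all clips, pass the result to split_rows, and then call write_csv three times for train.csv, dev.csv and test.csv. Every character that appears in the transcripts should also be listed in your alphabet.txt.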
But please read the docs first. Your command is missing a trie or scorer, and you should use a higher batch size. Search the forum a bit, give more information, post text instead of images, and read here: