I was trying to train my own model on an English dataset
(50 GB from Common Voice),
but after the first epoch I got the following error:
Alphabet cannot encode transcript “” while processing sample “…/DataBase/en/clips/common_voice_en_16556180.wav”, check that your alphabet contains all characters in the training corpus. Missing characters are: .
Does anybody know why this happened?