Hello,
Similar to issue #1083 on Github, I encountered
Invalid label <symbol>
Aborted (core dumped)
when running
path/to/folder/generate_trie alphabet.txt transcript.binary transcript.txt trie
I fixed the above error message for many symbols and characters I didn’t think were in the transcript by simply adding them to my alphabet.txt file. This is a temporary solution before I go back and figure out where the symbols are.
However, now I’ve encountered the following:
Invalid label
Aborted (core dumped)
when running
path/to/folder/generate_trie alphabet.txt transcript.binary transcript.txt trie
In other words, it seems like the invalid label is empty. I added an “empty” to alphabet.txt by just creating a line with two spaces and nothing else, but that didn’t seem to help.
It seems like the problem is my transcript has some unknown character messing up the alphabet.h file (seen here at https://github.com/mozilla/DeepSpeech/blob/master/native_client/alphabet.h) at line 50.
Any suggestions as to how I can debug this on my end?