I am following the French model training tutorial, but I am stuck at the step that creates the trie file.
I was looking for native_client.tar.xz, which contains the generate_trie binary, but I just can’t find it. My native_client folder has only the generate_trie.cpp file and no tar.xz.
I also tried to build the binary with the command:
bazel build --workspace_status_command="bash native_client/bazel_workspace_status_cmd.sh" --config=monolithic -c opt --copt=-O3 --copt="-D_GLIBCXX_USE_CXX11_ABI=0" --copt=-fvisibility=hidden //native_client:libdeepspeech.so //native_client:generate_trie
Extracting Bazel installation...
ERROR: The 'build' command is only supported from within a workspace (below a directory having a WORKSPACE file).
See documentation at https://docs.bazel.build/versions/master/build-ref.html#workspace
But the above error appears (I ran the command from inside the native_client folder).
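The error message says bazel only builds from a directory at or below one containing a WORKSPACE file. A minimal sketch of that check, assuming only what the message states (the helper name find_workspace_root is made up for illustration and is not part of bazel or DeepSpeech):

```python
from pathlib import Path
from typing import Optional


def find_workspace_root(start: Path) -> Optional[Path]:
    """Return the nearest directory at or above `start` that holds a WORKSPACE file."""
    start = start.resolve()
    # Check the starting directory first, then each parent up to the filesystem root.
    for directory in (start, *start.parents):
        if (directory / "WORKSPACE").is_file():
            return directory
    return None


if __name__ == "__main__":
    root = find_workspace_root(Path.cwd())
    if root is None:
        print("No WORKSPACE file at or above this directory; 'bazel build' will fail here.")
    else:
        print(f"Bazel workspace root: {root}")
```

If this reports no workspace from inside native_client, the build has to be started from a directory that does have one. As far as I know, the DeepSpeech build instructions run bazel from the TensorFlow checkout with native_client linked into it, but please verify that against the documentation for your version.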
Can someone help me?
Python: 3.7.3
OS: Debian buster
GPU: RTX 2060 SUPER
Using the Common Voice Portuguese dataset.
When training finishes and I try to transcribe any audio, I only get the letters “a”, “e” and “o”. For example, this is the command I use to transcribe:
deepspeech --model /home/deep_train/portugues/output_graph.pb --audio /home/deep_train/portugues/clips/common_voice_pt_19887570.wav
I appreciate any help with the trie file.
Best regards.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
The cat command is there to create one big file from all the dev, train and test CSVs.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
I guess it might work with some pre-filtering. But here you are also passing the CSV metadata, filenames and file sizes. That’s likely to confuse your LM a lot.
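A rough sketch of that kind of pre-filtering, keeping only the sentences and dropping filenames and file sizes. It assumes the CSVs have a header row with a column named transcript (as the DeepSpeech importers usually produce); the function name write_lm_corpus and the output filename in the usage line are just illustrative:

```python
import csv
import sys
from pathlib import Path
from typing import Iterable


def write_lm_corpus(csv_paths: Iterable[Path], out_path: Path, column: str = "transcript") -> int:
    """Write one transcript per line to out_path; return the number of lines written."""
    written = 0
    with out_path.open("w", encoding="utf-8") as out:
        for csv_path in csv_paths:
            with csv_path.open(newline="", encoding="utf-8") as handle:
                # DictReader consumes the header row, so it never reaches the output.
                for row in csv.DictReader(handle):
                    text = (row.get(column) or "").strip()
                    if text:  # skip rows with an empty or missing transcript
                        out.write(text + "\n")
                        written += 1
    return written


if __name__ == "__main__":
    if len(sys.argv) < 3:
        print("usage: python make_corpus.py OUTPUT.txt INPUT.csv [INPUT.csv ...]")
    else:
        count = write_lm_corpus([Path(p) for p in sys.argv[2:]], Path(sys.argv[1]))
        print(f"Wrote {count} sentences to {sys.argv[1]}")
```

Called as, for example, python make_corpus.py vocabulary.txt train.csv dev.csv test.csv, this gives a text file with only the phrases, which is what the LM and trie should be built from instead of the concatenated CSVs.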
I Exporting the model…
I Models exported at /home/deep_train/portugues
The Common Voice Portuguese dataset has only 27h of validated audio. Is that so little that the model can’t learn anything at all?
Also, I used 100 iterations, but I am not sure that can improve things much (since 75 iterations is recommended).
I fixed the trie and it is clean now: only the phrases.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
Oh yeah, with 27h you can’t expect anything really. Your best bet here would be to use transfer learning. This is on master, though.