I started using DS 0.7. As usual, to verify that the environment is set up correctly, I ran
./bin/run-ldc93s1.sh
That worked, and it still works.
Now, to fine-tune on my own data, I wanted to start from the released DS checkpoints. I’m using the checkpoints extracted from “deepspeech-0.7.0-checkpoint.tar.gz”.
However, when I start training with the following command (some parameters removed to keep this short):
python DeepSpeech.py --n_hidden 2048 --checkpoint_dir ../deepspeech-0.7.0-checkpoint --export_dir ../trained_model/ --epochs 2 --train_files my-train.csv --dev_files my-dev.csv --test_files my-test.csv --train_cudnn=True --automatic_mixed_precision=True
I’m getting the following error:
tensorflow.python.framework.errors_impl.NotFoundError: Key cond_1/beta1_power not found in checkpoint
The full traceback is:
File "DeepSpeech.py", line 12, in <module>
ds_train.run_script()
File "/home/sayantan/Desktop/ai_learning/deepspeech_0_7/DeepSpeech/training/deepspeech_training/train.py", line 939, in run_script
absl.app.run(main)
File "/home/sayantan/.local/lib/python3.6/site-packages/absl/app.py", line 299, in run
_run_main(main, args)
File "/home/sayantan/.local/lib/python3.6/site-packages/absl/app.py", line 250, in _run_main
sys.exit(main(argv))
File "/home/sayantan/Desktop/ai_learning/deepspeech_0_7/DeepSpeech/training/deepspeech_training/train.py", line 911, in main
train()
File "/home/sayantan/Desktop/ai_learning/deepspeech_0_7/DeepSpeech/training/deepspeech_training/train.py", line 511, in train
load_or_init_graph_for_training(session)
File "/home/sayantan/Desktop/ai_learning/deepspeech_0_7/DeepSpeech/training/deepspeech_training/util/checkpoints.py", line 132, in load_or_init_graph_for_training
_load_or_init_impl(session, methods, allow_drop_layers=True)
File "/home/sayantan/Desktop/ai_learning/deepspeech_0_7/DeepSpeech/training/deepspeech_training/util/checkpoints.py", line 97, in _load_or_init_impl
return _load_checkpoint(session, ckpt_path, allow_drop_layers)
File "/home/sayantan/Desktop/ai_learning/deepspeech_0_7/DeepSpeech/training/deepspeech_training/util/checkpoints.py", line 70, in _load_checkpoint
v.load(ckpt.get_tensor(v.op.name), session=session)
File "/home/sayantan/.local/lib/python3.6/site-packages/tensorflow_core/python/pywrap_tensorflow_internal.py", line 915, in get_tensor
return CheckpointReader_GetTensor(self, compat.as_bytes(tensor_str))
tensorflow.python.framework.errors_impl.NotFoundError: Key cond_1/beta1_power not found in checkpoint
Could you help me understand why this is happening? Is the checkpoint missing some variable?
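In case it helps with diagnosing this, here is a minimal sketch of how the variable names stored in the checkpoint could be listed and compared against the key from the error. It assumes that `tf.train.list_variables` can resolve the latest checkpoint from the extracted directory. The helper names (`find_missing_keys`, `list_checkpoint_names`, `report`) are made up for illustration and are not part of DeepSpeech, and the un-prefixed `beta1_power` key is only a guess at what the checkpoint might contain instead.

```python
def find_missing_keys(wanted_names, checkpoint_names):
    """Return the wanted names that are absent from the checkpoint, in order."""
    available = set(checkpoint_names)
    return [name for name in wanted_names if name not in available]


def list_checkpoint_names(checkpoint_dir):
    """Read the variable names stored in the latest checkpoint of a directory."""
    # Imported lazily so find_missing_keys can be used without TensorFlow.
    import tensorflow as tf

    # tf.train.list_variables yields (name, shape) pairs.
    return [name for name, _shape in tf.train.list_variables(checkpoint_dir)]


def report(checkpoint_dir):
    """Print every stored variable name, then the keys of interest that are missing."""
    names = list_checkpoint_names(checkpoint_dir)
    for name in names:
        print(name)
    # First key is taken from the error message; the second, un-prefixed
    # variant is an assumption about how the optimizer variable may be stored.
    wanted = ["cond_1/beta1_power", "beta1_power"]
    print("missing:", find_missing_keys(wanted, names))
```

Calling `report("../deepspeech-0.7.0-checkpoint")` from a Python shell in the same environment should show whether the optimizer variables are stored under a different name than the one the training graph asks for.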