Noticing Slow testing steps in --utf8 mode

Orion · May 23, 2020, 12:57am

Hi,
I am training the model with common voice Arabic data - 12Hrs.

With following parameters:

!python3 DeepSpeech.py \
--train_files data/ar/clips/train.csv \
--dev_files data/ar/clips/dev.csv \
--test_files data/ar/clips/test.csv \
--train_batch_size 1 \
--dev_batch_size 1 \
--test_batch_size 1 \
--summary_dir logs/ \
--export_dir model_1024/ \
--checkpoint_dir checkpoint_dir \
--epochs 20 \
--utf8 \
--learning_rate 0.0001 \
--dropout_rate 0.00 \
--n_hidden 1024 \
--scorer ' '

I am noticing very slow progress with steps while testing.
Please advise, I have tested with normal mode (without --utf8). And seems fast while testing.

othiele · May 23, 2020, 8:11am

Use higher batch sizes for training if your CPU/GPU can do that: 4 or 8

Use dropout of 0.25 or higher. Very important if nothing is moving.

And 12 hrs is very few data, don’t expect much

reuben · May 23, 2020, 10:12am

UTF-8 increases the alphabet size to 256, and since you’re not passing a scorer, it also can’t trim out of vocabulary beams during decoding, so the decoding process gets slower. If you build a scorer it should counteract any slowness you’re seeing.

reuben · May 23, 2020, 10:13am

The out of vocabulary trimming thing is also made worse by UTF-8 because without a scorer the decoder does not trim invalid UTF-8 sequences, so it’s spending time doing useless work.

Orion · May 23, 2020, 2:59pm

So even with scorer the utf-8 mode will be slower than normal mode?

reuben · May 23, 2020, 6:12pm

No. There’s no universal rule as it depends on what you’re comparing it to. Alphabet mode can be super slow with very large alphabets, for example. With a scorer it should be just as fast in most cases, and much faster if you have a large alphabet.

In general, unless you have a specific issue with alphabet mode that can be fixed by using UTF-8 mode, you should stick to using an alphabet. Some examples of these issues:

My alphabet is too large and makes the model big and slow.
I want to easily train on several languages at once.
I want to easily transfer from one language to another including the final layer.

Orion · May 23, 2020, 6:35pm

@reuben Thank you for these suggestion.

Topic		Replies	Views
Step, epoch, hardware, weird Duration DeepSpeech	8	598	July 1, 2020
Too many steps? DeepSpeech	8	1291	May 5, 2020
Question for korean transfer learning DeepSpeech learning	6	1041	October 7, 2020
How to make the testing process more quickly? DeepSpeech	20	3028	October 18, 2018
Alphabet training issue DeepSpeech	3	1297	September 1, 2020

Noticing Slow testing steps in --utf8 mode

Related topics