Printing filename in evaluate.py report

dko · July 23, 2019, 11:58am

Hi,

I use evaluate.py to batch process a dataset, and I want to compare how the results differ when I plug in different language models. To do this I changed evaluate.py to output the full report (all results) as a simple CSV (I only changed the print format). I plan to import the CSVs from the different runs as tables and join them. Unfortunately, the report has no column of unique values to join on.

The filenames would work perfectly; however, I cannot figure out how to extract them from the TF iterator that consumes the dataset, or how to match them to the results if I import them manually as a list.

Any ideas for a straightforward solution to include filenames in the report produced by evaluate.py?

lissyx · July 23, 2019, 12:31pm

Isn’t this what you need? https://github.com/mozilla/DeepSpeech/issues/2180 Looks like @Tilman_Kamp has patches for that.

Tilman_Kamp · July 23, 2019, 1:09pm

@lissyx Put up a PR for it.

lissyx · July 23, 2019, 1:11pm

@dko Can you check whether the patch that @Tilman_Kamp shared covers your use case? If so, we will merge it.

dko · July 23, 2019, 1:38pm

@Tilman_Kamp @lissyx Very thankful for this. It’s exactly what I needed.

I am using v0.5.1 and there was one small change I had to make. I had to delete line 60 of evaluate.py (the batch_size argument), because create_model() complained that it was getting an extra argument:

Original:
59 logits, _ = create_model(batch_x=batch_x,
60                          batch_size=FLAGS.test_batch_size,
61                          seq_length=batch_x_len,
62                          dropout=no_dropout)

What worked for me on 0.5.1:
59 logits, _ = create_model(batch_x=batch_x,
60                          seq_length=batch_x_len,
61                          dropout=no_dropout)

Not sure if this is caused by version mismatch or something else.

Tilman_Kamp · July 23, 2019, 3:23pm

@dko The change got merged to master.
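
Once each report carries a filename column, the comparison described in the original question (one CSV per language model, joined on the filename) can be sketched with the standard library alone. This is only an illustration: the column names wav_filename, wer and res, the labels lm_a and lm_b, and all values are assumptions, and the actual layout depends on how the print format in evaluate.py was changed.

```python
import csv
import io


def load_report(csv_file, key="wav_filename"):
    """Read one report CSV into a dict keyed by the filename column."""
    return {row[key]: row for row in csv.DictReader(csv_file)}


def join_reports(reports, key="wav_filename"):
    """Inner-join several reports on the filename column.

    reports maps a label (for example a language-model name) to a dict
    produced by load_report. Non-key columns are prefixed with the label.
    """
    if not reports:
        return []
    labels = list(reports)
    # Keep only the files that appear in every report.
    shared = set.intersection(*(set(reports[label]) for label in labels))
    joined = []
    for name in sorted(shared):
        merged = {key: name}
        for label in labels:
            for column, value in reports[label][name].items():
                if column != key:
                    merged[f"{label}_{column}"] = value
        joined.append(merged)
    return joined


# In-memory stand-ins for two report files; all values are made up.
lm_a = io.StringIO("wav_filename,wer,res\na.wav,0.10,hello world\nb.wav,0.50,foo\n")
lm_b = io.StringIO("wav_filename,wer,res\nb.wav,0.25,foo bar\na.wav,0.00,hello world\n")

rows = join_reports({"lm_a": load_report(lm_a), "lm_b": load_report(lm_b)})
for row in rows:
    print(row)
```

With real files, open(path, newline="") would replace the io.StringIO objects. Row order within each report does not matter, because rows are matched by filename rather than by position.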
