Case 1: (Urdu Language)
Dataset used : Own (15 hr)
Version: v.0.6
Experiment 1: Tested the trained model, with LM, built using train and val.csv produces Avg. WER=0.138
Experiment 2: Tested the trained model, with LM built using Wiki text, produces Avg. WER = 0.128
Comments: Experiment 2 has better inference, assuming my intuition, that better LM produces better inference.
Case 1: (Tamil Language)
Dataset used : Common Voice(~12 hr)
Version: v.0.7
Experiment 1: Tested the trained model, with LM, built using train and val.csv produces Avg. WER=0.17
Experiment 2: Tested the trained model, with LM built using Wiki text, produces Avg. WER = 0.8
Comments: Experiment 2 has worst inference, contradicting case 1.
I can’t figure out, where the error may be.
If the results of case 1 are expected, then why does case 2 contradicts it.
@lissyx