I am on version 0.7.0 and just testing on LibriSpeech test-clean. I picked the first 18 files from the csv and combined them into a single wav file using ffmpeg. I am just using the downloaded deepspeech-0.7.0-models.scorer together with either deepspeech-0.7.0-checkpoint or deepspeech-0.7.0-models.pbmm.
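For reference, the concatenation was done roughly like this (list.txt and the wav names here are illustrative, not my actual paths):

# list.txt holds one line per input file, in order: file 'xxxx-yyyy-0000.wav'
printf "file '%s'\n" *.wav > list.txt
# All LibriSpeech wavs share the same format, so a stream copy is enough
ffmpeg -f concat -safe 0 -i list.txt -c copy test.wav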
I am running the following three commands on the same input audio:
deepspeech --model models/deepspeech-0.7.0-models.pbmm --scorer models/deepspeech-0.7.0-models.scorer --audio bin/librispeech/LibriSpeech/test-clean-wav/test.wav
python transcribe.py -load_checkpoint_dir deepspeech-0.7.0-checkpoint --scorer models/deepspeech-0.7.0-models.scorer -src bin/librispeech/LibriSpeech/test-clean-wav/test.wav
python evaluate.py --n_hidden 2048 --checkpoint_dir deepspeech-0.7.0-checkpoint --test_files bin/librispeech/LibriSpeech/test-clean-wav/test.csv --test_output_file output.json
Why is there a discrepancy in the transcribed output from the 3 methods? I have attached a pdf with the output in each case and the ground truth. Shouldn't the results from evaluate.py and the DeepSpeech (0.7.0) Python binding at least be the same? I do see that transcribe.py does an additional split of the audio; I am not sure if that is causing any issues.
Also, I see that transcribe.py very often misplaces a block; for instance, the first 15 seconds of the audio ended up as the second-to-last line of the transcribed file.
Can someone please help me understand why there is so much difference between the outputs in the three cases?
Thanks!

comparison.pdf (105.6 KB)