I am trying to create a voice corpus for DeepSpeech training. I have downloaded YouTube videos along with their subtitles, but I noticed that for many videos the subtitles are not exactly aligned with the video frames.
Is there a tool available that can align the subtitles with the frames? Please advise.