Is it possible to do word alignment?

Given some audio, I can get text. But is it possible to align each word to a start/end time?

Have you had a look at the Metadata API?

@shamoons https://deepspeech.readthedocs.io/en/latest/Structs.html#_CPPv48Metadata
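For anyone landing here later, below is a rough sketch of how the per-character timing in that metadata could be turned into word-level start/end times. It assumes each token exposes a `text` character and a `start_time` in seconds, which is how I understand the `TokenMetadata` struct in recent releases. The `Token` namedtuple, the `words_with_times` helper, and the `final_char_duration` padding are my own stand-ins for illustration, not part of DeepSpeech.

```python
from collections import namedtuple

# Hypothetical stand-in for DeepSpeech's per-character token metadata.
# Assumption: the real tokens expose `text` and `start_time` (seconds).
Token = namedtuple("Token", ["text", "start_time"])


def words_with_times(tokens, final_char_duration=0.02):
    """Group per-character tokens into words with approximate start/end times.

    A word starts at its first character's start_time and ends at the
    start_time of the space that follows it. The last word has no trailing
    space, so its end is the last character's start_time plus
    `final_char_duration` (an assumed padding, not a DeepSpeech value).
    """
    words = []
    chars = []
    word_start = None
    last_time = None

    for token in tokens:
        if token.text == " ":
            # A space closes the current word, if there is one.
            if chars:
                words.append({
                    "word": "".join(chars),
                    "start": word_start,
                    "end": token.start_time,
                })
                chars = []
                word_start = None
        else:
            if not chars:
                word_start = token.start_time
            chars.append(token.text)
        last_time = token.start_time

    # Flush the final word, which is not followed by a space.
    if chars:
        words.append({
            "word": "".join(chars),
            "start": word_start,
            "end": last_time + final_char_duration,
        })
    return words


if __name__ == "__main__":
    demo = [Token("h", 0.1), Token("i", 0.2), Token(" ", 0.3),
            Token("y", 0.5), Token("o", 0.6)]
    for word in words_with_times(demo):
        print(word)
```

With the Python bindings, I believe the tokens would come from something like `model.sttWithMetadata(audio).transcripts[0].tokens`. In older releases the metadata was laid out differently (as far as I recall, `items` with a `character` field), so check the docs for the version you have installed. Note that the end times are approximations, since the metadata only reports when each character starts.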

Perfect. Thank you so much!