Step, epoch, hardware, weird Duration

How much data do you have? You mention “DeepSpeech English dataset”, but your command line points to some Common Voice English data, and you don't say which release.
Depending on that, I guess 12h on your GPU might be expected.
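
To see why the amount of data matters here, this rough sketch relates dataset size to steps and training time. It assumes one step processes one batch and that an epoch is one full pass over the training set. The function names and all numbers are hypothetical, not taken from your setup or from DeepSpeech itself.

```python
import math


def steps_per_epoch(num_samples: int, batch_size: int) -> int:
    # Assumption: one step consumes one batch; the last batch may be partial.
    return math.ceil(num_samples / batch_size)


def estimated_hours(num_samples: int, batch_size: int,
                    epochs: int, seconds_per_step: float) -> float:
    # Total wall-clock estimate, ignoring validation and checkpointing time.
    total_steps = steps_per_epoch(num_samples, batch_size) * epochs
    return total_steps * seconds_per_step / 3600


if __name__ == "__main__":
    # Hypothetical figures; replace them with your own sample count,
    # batch size, epoch count and measured time per step.
    print(steps_per_epoch(num_samples=1000, batch_size=32))
    print(estimated_hours(num_samples=1000, batch_size=32,
                          epochs=10, seconds_per_step=0.5))
```

If you plug in your real sample count and the time per step you see in the logs, you can check whether 12h is in the expected range.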

Yes, please read the documentation and the output of --helpfull; it’s all documented there.