Support for audio with background music

I am using this implementation of DeepSpeech on a dataset of around 600-700 audio files that have background music, such as guitar. It does not seem to work with this kind of data, despite my attempts to suppress the music components.
I would like to know whether this DeepSpeech implementation is supposed to work with narrated audio that has background music. Thanks.