Well, in fact, you should start with clean voices, and then, add modified voices, with noise add, and other deformations. (with voice-corpus-tool)
- ex : you live near a noisy road;
you made a lot of clean voices,
you recorded many noisy road sounds (ex : 10s each)
you duplicated those voices, adding this noise inside. (augment param in voice-corpus-tool)
Finally, you obtain a model working in your own environnment.
- Now, about cuts in your existing recs, have a look at the ‘-silence’ function of SOX
(but it’s not miraculous with noisy voice : sox will not only remove noise, it will surely remove important parts of your voice… Have a try.