DC-TTS has not been updated for a long time, so I doubt you'd get better results with it than with Mozilla TTS, which is actively maintained.
Right now I am fine-tuning my TTS model with r=1 and running another fine-tuning session with r=2 and BN. With r=1 I am able to get rid of all the background noise and get a very clear voice (with HiFiGAN), but the metallic breathing is still there. Also, with r=1, training on my dataset is very fragile.
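
For anyone not familiar with r: I am assuming it means the Tacotron-style reduction factor, i.e. how many mel frames the decoder predicts per step. A rough sketch of what changes between r=1 and r=2 (the function name and the 800-frame utterance are made up for illustration, not taken from any TTS codebase):

```python
import math


def decoder_steps(num_mel_frames: int, r: int) -> int:
    """Decoder iterations needed when each step emits r mel frames."""
    if r < 1:
        raise ValueError("r must be a positive integer")
    # The last step may be only partially filled, hence the ceiling.
    return math.ceil(num_mel_frames / r)


# Hypothetical utterance of 800 mel frames.
frames = 800
for r in (1, 2):
    print(f"r={r}: {decoder_steps(frames, r)} decoder steps")
```

With r=1 the decoder runs twice as many steps for the same utterance as with r=2, so the attention has a longer alignment to hold together. That may be part of why r=1 feels more fragile, but I have not verified this.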