I have Russian dataset, where directly point stress letter with special symbol “+”.
Example:
wav1|Тогд+а у мен+я
wav2|+Это б+удут вес+ёлые дн+и
After training it doesn’t work. tts generate same wav’s for words with different stress: “тв+орог твор+ог”.
I think it’s happened because I don’t specify anything in VOCABULARY PARAMETERS in config file. Where I can read about the rules of VOCABULARY PARAMETERS?
Is this config good for Russian?
// VOCABULARY PARAMETERS
// if custom character set is not defined,
// default set in symbols.py is used
// “characters”:{
// “pad”: “_”,
// “eos”: “~”,
// “bos”: “^”,
// “characters”: "АБВГДЕЁЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯабвгдеёжзийклмнопрстуфхцчшщъыьэюя!’(),-.:;? “,
// “punctuations”:”!’(),-.:;? ",
// “phonemes”:“iyɨʉɯuɪʏʊeøɘəɵɤoɛœɜɞʌɔæɐaɶɑɒᵻʘɓǀɗǃʄǂɠǁʛpbtdʈɖcɟkɡqɢʔɴŋɲɳnɱmʙrʀⱱɾɽɸβfvθðszʃʒʂʐçʝxɣχʁħʕhɦɬɮʋɹɻjɰlɭʎʟˈˌːˑʍwɥʜʢʡɕʑɺɧɚ˞ɫ”
// },