I must be doing something stupid here. I'm trying to run one of the released multi-speaker models:
Multi-Speaker-Tacotron2 DDC VCTK 2136433
When I check out 2136433 and try a test with synthesize.py, I still get a bunch of layer size mismatches:
size mismatch for decoder.attention_rnn.weight_ih: copying a param with shape torch.Size([4096, 1024]) from checkpoint, the shape in current model is torch.Size([4096, 896])
(it seems to be a consistent delta of 128). I tried a few other commits from around the time the model was released, with no luck. Can anyone point out my mistake?
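For reference, here's roughly how I'm inspecting the checkpoint shapes (a minimal sketch; the file paths are placeholders for wherever the released model was downloaded, and I'm assuming the usual layout where the state dict is nested under a "model" key):

import torch

# Load the released checkpoint on CPU and look at the mismatched layer.
ckpt = torch.load("checkpoint.pth.tar", map_location="cpu")
state = ckpt.get("model", ckpt)  # fall back if the state dict isn't nested

for name, tensor in state.items():
    if "attention_rnn.weight_ih" in name:
        print(name, tuple(tensor.shape))
# -> decoder.attention_rnn.weight_ih (4096, 1024) in the checkpoint,
#    while the model I instantiate expects (4096, 896): a delta of 128
#    on the input dimension.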
EDIT: I also noticed that the link to Universal WaveGrad on the wiki doesn't point to a Google Drive link like the rest of the vocoder models do. I'd like to try it if possible.