When i see it right there’s no need for a special “emotional” corpus. I can reuse existing phrases and just pronounce them in an emotional way. This would surely be easier if the text is emotional by itself, but will work with all phrases.
So next step would be taking random 300 phrases and record these phrases in four different emotions. I just need to borrow mic equipment again.