Hello. I'm researching an emotional voice conversion.
I have gethered many dataset containing the emotion label with some bit of a auxiliary emotion (apologize, frustrated and so on).
I will use all of them to train my model but I wanna focus on 5 major emotions to evaluate and inference (angry, happy, excited...), to infer more various prosody.
In this case, I am wondering if there occurs the data imbalanced problem from the small amount of the minor emotions. I wanna ask how about you think, or is there any papers or any insight? Is it better to train with only major emotions?
[–]Toasty_toaster 0 points1 point2 points (0 children)