Uncertainty-Based Learning of a Lightweight Model for Multimodal Emotion Recognition