Multimodal Speech Emotion Recognition
Integration of language and audio encoders for emotion recognition in speech using attention pooling and token compression.
Integration of language and audio encoders for emotion recognition in speech using attention pooling and token compression.
jaime@workspace:~$ ./welcome.sh