Fusion of Modalities for Emotion Recognition with Deep Learning

Please use this identifier to cite or link to this item: http://hdl.handle.net/2080/4627

Title:	Fusion of Modalities for Emotion Recognition with Deep Learning
Authors:	Gyanvarsha, Shourya Mohanty, Swayamjit Sahoo, Himanshu Sekhar Patra, Dipti
Keywords:	audio-visual emotion recognition xlsrWav2Vec2.0 transformer transfer learning Action Units RAVDESS speech emotion recognition facial emotion recognition
Issue Date:	Jul-2024
Citation:	IEEE International Conference on Smart Power Control and Renewable Energy ((ICSPCRE), NIT Rourkela, India, 19-21 July 2024
Abstract:	The paper delineates a pioneering advancement in emotion recognition technology, showcasing a sophisticated multimodal system adept at discerning human emotions through the fusion of video, audio, and facial features, facilitated by state-of-the-art deep learning methodologies. By amalgamating information from diverse sensory modalities, the system attains remarkable precision in classifying emotions, surpassing the efficacy of unimodal approaches. Through a comprehensive array of experiments and evaluations, the study substantiates the system’s prowess in accurately deciphering emotional states. Notably, the fusion of multimodal cues enables nuanced insights into human affective responses, transcending the limitations of individual modalities and furnishing a holistic understanding of emotional dynamics.
Description:	Copyright belongs to proceeding publisher
URI:	http://hdl.handle.net/2080/4627
Appears in Collections:	Conference Papers

Files in This Item:

File	Description	Size	Format
2024_ICSPCRE_SGyanvarsha_Fusion.pdf		776.6 kB	Adobe PDF	View/Open