A Web-Based Platform for Real-Time Speech Emotion Recognition using CNN

Damir Kabdualiyev, Askar Madiyev, Adil Rakhaliyev, Balgynbek Dikhan, Kassymzhan Gizhduan, Hashim Ali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This pilot study presents a web-based real-time speech emotion recognition platform using a convolutional neural network algorithm. The study aims to develop a reliable tool for predicting emotions in speech with a user-friendly design to enable easy access and display of recognition results. The platform recognizes seven emotions (angry, disgust, fear, happy, neutral, sad, and surprise) and has two functionalities: static and real-time speech signals analysis. The static analysis allows users to upload pre-recorded audio files for analysis, while the real-time analysis provides continuous audio processing as it is being recorded. The study also focuses on developing a reliable model with minimal features to predict emotions while accurately identifying various emotions detected in speech. The algorithmic performance of the model was evaluated using publicly available datasets (RAVDESS, TESS, and SAVEE). It achieved an accuracy of 86.46% in static analysis using the selected spectral feature: i.e., MFCC. The performance of the real-time analysis was validated through a user study involving 20 participants. It achieved an accuracy of 65% in recognizing emotions in real-time due to possible known factors. An interesting finding was the discrepancy between how individuals perceived their emotions and those detected by the ML model. The accuracy of the ML model was higher in pre-recorded audio recognition and about the same in real-time recognition compared to previous works. The user-friendly design and CNN algorithm make it a promising solution to address challenges in emotion recognition and highlight the importance of further research in this field.
Original languageEnglish
Title of host publication2023 International Conference on Smart Applications, Communications and Networking (SmartNets)
PublisherIEEE
Pages1-6
Edition2023
ISBN (Electronic)979-8-3503-0252-3
ISBN (Print)979-8-3503-0253-0
DOIs
Publication statusPublished - Aug 2023
EventIEEE International Conference on Smart Applications, Communications and Networking - Yeditepe University, Istanbul, Turkey
Duration: Jul 25 2023Jul 27 2023
https://smartnets.ieee.tn

Conference

ConferenceIEEE International Conference on Smart Applications, Communications and Networking
Abbreviated titleSmartNets23
Country/TerritoryTurkey
CityIstanbul
Period7/25/237/27/23
Internet address

Keywords

  • Speech Emotion Recognition
  • Convolutional Neural Network
  • Web-Based Platform
  • Real-time Analysis

Fingerprint

Dive into the research topics of 'A Web-Based Platform for Real-Time Speech Emotion Recognition using CNN'. Together they form a unique fingerprint.

Cite this