Перейти к основной навигации Перейти к поиску Перейти к основному содержанию

A Web-Based Platform for Real-Time Speech Emotion Recognition using CNN

  • Damir Kabdualiyev
  • , Askar Madiyev
  • , Adil Rakhaliyev
  • , Balgynbek Dikhan
  • , Kassymzhan Gizhduan
  • , Hashim Ali
  • School of Engineering and Digital Sciences

Результат исследований

1   !!Link opens in a new tab Цитирования (Scopus)

Аннотация

This pilot study presents a web-based real-time speech emotion recognition platform using a convolutional neural network algorithm. The study aims to develop a reliable tool for predicting emotions in speech with a user-friendly design to enable easy access and display of recognition results. The platform recognizes seven emotions (angry, disgust, fear, happy, neutral, sad, and surprise) and has two functionalities: static and real-time speech signals analysis. The static analysis allows users to upload pre-recorded audio files for analysis, while the real-time analysis provides continuous audio processing as it is being recorded. The study also focuses on developing a reliable model with minimal features to predict emotions while accurately identifying various emotions detected in speech. The algorithmic performance of the model was evaluated using publicly available datasets (RAVDESS, TESS, and SAVEE). It achieved an accuracy of 86.46% in static analysis using the selected spectral feature: i.e., MFCC. The performance of the real-time analysis was validated through a user study involving 20 participants. It achieved an accuracy of 65% in recognizing emotions in real-time due to possible known factors. An interesting finding was the discrepancy between how individuals perceived their emotions and those detected by the ML model. The accuracy of the ML model was higher in pre-recorded audio recognition and about the same in real-time recognition compared to previous works. The user-friendly design and CNN algorithm make it a promising solution to address challenges in emotion recognition and highlight the importance of further research in this field.
Язык оригиналаEnglish
Название основной публикации2023 International Conference on Smart Applications, Communications and Networking, SmartNets 2023
ИздательIEEE
Страницы1-6
Издание2023
ISBN (электронное издание)979-8-3503-0252-3
ISBN (печатное издание)979-8-3503-0253-0
DOI
СостояниеPublished - авг. 2023
СобытиеIEEE International Conference on Smart Applications, Communications and Networking - Yeditepe University, Istanbul
Продолжительность: июл. 25 2023июл. 27 2023
https://smartnets.ieee.tn

Серия публикаций

Название2023 International Conference on Smart Applications, Communications and Networking, SmartNets 2023

Conference

ConferenceIEEE International Conference on Smart Applications, Communications and Networking
Сокращенный заголовокSmartNets23
Страна/TерриторияTurkey
ГородIstanbul
Период7/25/237/27/23
Адрес в сети Интернет

ЦУР ООН

Работа этого автора способствует достижению следующих Целей устойчивого развития

  1. Good health and well being
    Good health and well being

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Industrial and Manufacturing Engineering
  • Safety, Risk, Reliability and Quality
  • Education

Fingerprint

Подробные сведения о темах исследования «A Web-Based Platform for Real-Time Speech Emotion Recognition using CNN». Вместе они формируют уникальный семантический отпечаток (fingerprint).

Цитировать