Automated rating of recorded classroom presentations using speech analysis in Kazakh

Akzharkyn Izbassarova, Aidana Irmanova, Alex Pappachen James

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Effective presentation skills can help to succeed in business, career and academy. This paper presents the design of speech assessment during the oral presentation and the algorithm for speech evaluation based on criteria of optimal intonation. As the pace of the speech and its optimal intonation varies from language to language, developing an automatic identification of language during the presentation is required. Proposed algorithm was tested with presentations delivered in Kazakh language. For testing purposes the features of Kazakh phonemes were extracted using MFCC and PLP methods and created a Hidden Markov Model (HMM) [5], [5] of Kazakh phonemes. Kazakh vowel formants were defined and the correlation between the deviation rate in fundamental frequency and the liveliness of the speech to evaluate intonation of the presentation was analyzed. It was established that the threshold value between monotone and dynamic speech is 0.16 and the error for intonation evaluation is 19%.

Original languageEnglish
Title of host publication2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages393-397
Number of pages5
Volume2017-January
ISBN (Electronic)9781509063673
DOIs
Publication statusPublished - Nov 30 2017
Event2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017 - Manipal, Mangalore, India
Duration: Sep 13 2017Sep 16 2017

Conference

Conference2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017
CountryIndia
CityManipal, Mangalore
Period9/13/179/16/17

Fingerprint

Speech analysis
Hidden Markov models
Testing
Industry

Keywords

  • Images
  • MFCC
  • PLP
  • Presentations
  • Recognition
  • Speech

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems

Cite this

Izbassarova, A., Irmanova, A., & James, A. P. (2017). Automated rating of recorded classroom presentations using speech analysis in Kazakh. In 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017 (Vol. 2017-January, pp. 393-397). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICACCI.2017.8125872

Automated rating of recorded classroom presentations using speech analysis in Kazakh. / Izbassarova, Akzharkyn; Irmanova, Aidana; James, Alex Pappachen.

2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017. Vol. 2017-January Institute of Electrical and Electronics Engineers Inc., 2017. p. 393-397.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Izbassarova, A, Irmanova, A & James, AP 2017, Automated rating of recorded classroom presentations using speech analysis in Kazakh. in 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017. vol. 2017-January, Institute of Electrical and Electronics Engineers Inc., pp. 393-397, 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017, Manipal, Mangalore, India, 9/13/17. https://doi.org/10.1109/ICACCI.2017.8125872
Izbassarova A, Irmanova A, James AP. Automated rating of recorded classroom presentations using speech analysis in Kazakh. In 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017. Vol. 2017-January. Institute of Electrical and Electronics Engineers Inc. 2017. p. 393-397 https://doi.org/10.1109/ICACCI.2017.8125872
Izbassarova, Akzharkyn ; Irmanova, Aidana ; James, Alex Pappachen. / Automated rating of recorded classroom presentations using speech analysis in Kazakh. 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017. Vol. 2017-January Institute of Electrical and Electronics Engineers Inc., 2017. pp. 393-397
@inproceedings{6e4b8289047d45beaf8798ab5583e591,
title = "Automated rating of recorded classroom presentations using speech analysis in Kazakh",
abstract = "Effective presentation skills can help to succeed in business, career and academy. This paper presents the design of speech assessment during the oral presentation and the algorithm for speech evaluation based on criteria of optimal intonation. As the pace of the speech and its optimal intonation varies from language to language, developing an automatic identification of language during the presentation is required. Proposed algorithm was tested with presentations delivered in Kazakh language. For testing purposes the features of Kazakh phonemes were extracted using MFCC and PLP methods and created a Hidden Markov Model (HMM) [5], [5] of Kazakh phonemes. Kazakh vowel formants were defined and the correlation between the deviation rate in fundamental frequency and the liveliness of the speech to evaluate intonation of the presentation was analyzed. It was established that the threshold value between monotone and dynamic speech is 0.16 and the error for intonation evaluation is 19{\%}.",
keywords = "Images, MFCC, PLP, Presentations, Recognition, Speech",
author = "Akzharkyn Izbassarova and Aidana Irmanova and James, {Alex Pappachen}",
year = "2017",
month = "11",
day = "30",
doi = "10.1109/ICACCI.2017.8125872",
language = "English",
volume = "2017-January",
pages = "393--397",
booktitle = "2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - Automated rating of recorded classroom presentations using speech analysis in Kazakh

AU - Izbassarova, Akzharkyn

AU - Irmanova, Aidana

AU - James, Alex Pappachen

PY - 2017/11/30

Y1 - 2017/11/30

N2 - Effective presentation skills can help to succeed in business, career and academy. This paper presents the design of speech assessment during the oral presentation and the algorithm for speech evaluation based on criteria of optimal intonation. As the pace of the speech and its optimal intonation varies from language to language, developing an automatic identification of language during the presentation is required. Proposed algorithm was tested with presentations delivered in Kazakh language. For testing purposes the features of Kazakh phonemes were extracted using MFCC and PLP methods and created a Hidden Markov Model (HMM) [5], [5] of Kazakh phonemes. Kazakh vowel formants were defined and the correlation between the deviation rate in fundamental frequency and the liveliness of the speech to evaluate intonation of the presentation was analyzed. It was established that the threshold value between monotone and dynamic speech is 0.16 and the error for intonation evaluation is 19%.

AB - Effective presentation skills can help to succeed in business, career and academy. This paper presents the design of speech assessment during the oral presentation and the algorithm for speech evaluation based on criteria of optimal intonation. As the pace of the speech and its optimal intonation varies from language to language, developing an automatic identification of language during the presentation is required. Proposed algorithm was tested with presentations delivered in Kazakh language. For testing purposes the features of Kazakh phonemes were extracted using MFCC and PLP methods and created a Hidden Markov Model (HMM) [5], [5] of Kazakh phonemes. Kazakh vowel formants were defined and the correlation between the deviation rate in fundamental frequency and the liveliness of the speech to evaluate intonation of the presentation was analyzed. It was established that the threshold value between monotone and dynamic speech is 0.16 and the error for intonation evaluation is 19%.

KW - Images

KW - MFCC

KW - PLP

KW - Presentations

KW - Recognition

KW - Speech

UR - http://www.scopus.com/inward/record.url?scp=85042804589&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85042804589&partnerID=8YFLogxK

U2 - 10.1109/ICACCI.2017.8125872

DO - 10.1109/ICACCI.2017.8125872

M3 - Conference contribution

VL - 2017-January

SP - 393

EP - 397

BT - 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017

PB - Institute of Electrical and Electronics Engineers Inc.

ER -