Towards building an intelligent voice system for kazakh: Acoustic database and system design

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper we describe our initiative to build an intelligent voice system for Kazakh over the telephone lines. In particular, we collected the first acoustic database of Kazakh telephone speech containing common words and phrases uttered by 169 native speakers to train an acoustic model. The database has more than 17 hours of speech and is balanced according to the gender, region and age groups. The training was performed using CMU Sphinx Toolkits and exploited the context-dependent tied-state continuous Hidden Markov Models with 8 Gaussian mixtures per state. The experiments show that the best WER of 4, 1% on test data is obtained with 2000 senones and the dimension of the feature vectors of 23. Later, this model was used in the system's implementation. While designing the system, we tried to focus on friendly graphical user interface and all-in-one functionality. The system is intended to help easy and fast deployment of speech-enabled applications for the industry, governmental and educational institutions.

Original languageEnglish
Title of host publicationProceedings - 8th EUROSIM Congress on Modelling and Simulation, EUROSIM 2013
EditorsDavid Al-Dabass, Richard Cant, Richard Zobel, Khalid Al-Begain, Richard Cant, Alessandra Orsoni, Khalid Al-Begain, Alessandra Orsoni, David Al-Dabass
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages393-397
Number of pages5
ISBN (Electronic)9780769550732
DOIs
Publication statusPublished - Jan 8 2015
Event8th EUROSIM Congress on Modelling and Simulation, EUROSIM 2013 - Cardiff, Wales, United Kingdom
Duration: Sep 10 2013Sep 13 2013

Publication series

NameProceedings - 8th EUROSIM Congress on Modelling and Simulation, EUROSIM 2013

Other

Other8th EUROSIM Congress on Modelling and Simulation, EUROSIM 2013
CountryUnited Kingdom
CityCardiff, Wales
Period9/10/139/13/13

Keywords

  • Acoustic model for Kazakh
  • acoustic database
  • intelligent voice system
  • telephone speech recognition

ASJC Scopus subject areas

  • Modelling and Simulation
  • Computational Theory and Mathematics
  • Computer Science Applications

Fingerprint Dive into the research topics of 'Towards building an intelligent voice system for kazakh: Acoustic database and system design'. Together they form a unique fingerprint.

Cite this