Robust segmentation of speech signal using MFCC and acoustic parameters

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

In the current work, we investigate the effect of combining the mel-frequency cepstral coefficients (MFCC) with the acoustic parameters (AP) in the task of segmentation of continuous speech into sonorant and obstruent regions using Hidden Markov Models (HMM) with Gaussian Mixture Models (GMM). Along with the influence of APs to the performance of the model built, we analyze the set of acoustic features extracted for each phoneme to see how robust they are in the noise. All the experiments were conducted on TIMIT database. The results of the experiments show that there are APs, which have nice separating property and, therefore, improve the performance of a system if used with MFCCs, however, they are not robust to noise. On the other hand, there are APs, which do not have this property, but possess the intrinsic stability in noisy conditions and, as a result, add some robustness to a system.

Original languageEnglish
Title of host publicationProceedings - 6th Asia International Conference on Mathematical Modelling and Computer Simulation, AMS 2012
Pages103-108
Number of pages6
DOIs
Publication statusPublished - Sep 27 2012
Event6th Asia International Conference on Mathematical Modelling and Computer Simulation, AMS 2012 - Bali, Indonesia
Duration: May 29 2012May 31 2012

Publication series

NameProceedings - 6th Asia International Conference on Mathematical Modelling and Computer Simulation, AMS 2012

Other

Other6th Asia International Conference on Mathematical Modelling and Computer Simulation, AMS 2012
CountryIndonesia
CityBali
Period5/29/125/31/12

Keywords

  • GMM
  • HMM
  • MFCC
  • acoustic parameters
  • robust segmentation

ASJC Scopus subject areas

  • Modelling and Simulation

Fingerprint Dive into the research topics of 'Robust segmentation of speech signal using MFCC and acoustic parameters'. Together they form a unique fingerprint.

  • Cite this

    Yessenbayev, Z. (2012). Robust segmentation of speech signal using MFCC and acoustic parameters. In Proceedings - 6th Asia International Conference on Mathematical Modelling and Computer Simulation, AMS 2012 (pp. 103-108). [6243930] (Proceedings - 6th Asia International Conference on Mathematical Modelling and Computer Simulation, AMS 2012). https://doi.org/10.1109/AMS.2012.26