Audio feature and classifier analysis for efficient recognition of environmental sounds

Cigdem Okuyucu, Mustafa Sert, Adnan Yazici

Research output: Chapter in Book/Report/Conference proceedingConference contribution

22 Citations (Scopus)

Abstract

Environmental sounds (ES) have different characteristics, such as unstructured nature and typically noise-like and flat spectrums, which make recognition task difficult compared to speech or music sounds. Here, we perform an exhaustive feature and classifier analysis for the recognition of considerably similar ES categories and propose a best representative feature to yield higher recognition accuracy. In the experiments, thirteen (13) ES categories, namely emergency alarm, car horn, gun, explosion, automobile, helicopter, water, wind, rain, applause, crowd, and laughter are detected and tested based on eleven (11) audio features (MPEG-7 family, ZCR, MFCC, and combinations) by using the HMM and SVM classifiers. Extensive experiments have been conducted to demonstrate the effectiveness of these joint features for ES classification. Our experiments show that, the joint feature set ASFCS-H (Audio Spectrum Flatness, Centroid, Spread, and Audio Harmonicity) is the best representative feature set with an average F-measure value of 80.6%.

Original languageEnglish
Title of host publicationProceedings - 2013 IEEE International Symposium on Multimedia, ISM 2013
Pages125-132
Number of pages8
DOIs
Publication statusPublished - 2013
Event15th IEEE International Symposium on Multimedia, ISM 2013 - Anaheim, CA, United States
Duration: Dec 9 2013Dec 11 2013

Publication series

NameProceedings - 2013 IEEE International Symposium on Multimedia, ISM 2013

Conference

Conference15th IEEE International Symposium on Multimedia, ISM 2013
Country/TerritoryUnited States
CityAnaheim, CA
Period12/9/1312/11/13

Keywords

  • Environmental sound classification
  • HMM
  • MFCC
  • MPEG-7
  • SVM

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'Audio feature and classifier analysis for efficient recognition of environmental sounds'. Together they form a unique fingerprint.

Cite this