Structural and semantic modeling of audio for content-based querying and browsing

Mustafa Sert, Buyurman Baykal, Adnan Yazici

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

A typical content-based audio management system deals with three aspects namely audio segmentation and classification, audio analysis, and content-based retrieval of audio. In this paper, we integrate the three aspects of content-based audio management into a single framework and propose an efficient method for flexible querying and browsing of auditory data. More specifically, we utilize two robust feature sets namely MPEG-7 Audio Spectrum Flatness (ASF) and Mel Frequency Cepstral Coefficients (MFCC) as the underlying features in order to improve the content-based retrieval accuracy, since both features have some advantages for distinct types of audio (e.g., music and speech). The proposed system provides a wide range of opportunities to query and browse an audio data by content, such as querying and browsing for a chorus section, sound effects, and query-by-example. In addition, the clients can express their queries in the form of point, ronge, and k-neanst neighbor, which are particularly significant in the multimedia domain.

Original languageEnglish
Title of host publicationFlexible Query Answering Systems - 7th International Conference, FQAS 2006, Proceedings
PublisherSpringer Verlag
Pages319-330
Number of pages12
ISBN (Print)3540346384, 9783540346388
DOIs
Publication statusPublished - 2006
Event7th International Conference on Flexible Query Answering Systems, FQAS 2006 - Milan, Italy
Duration: Jun 7 2006Jul 10 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4027 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th International Conference on Flexible Query Answering Systems, FQAS 2006
Country/TerritoryItaly
CityMilan
Period6/7/067/10/06

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Structural and semantic modeling of audio for content-based querying and browsing'. Together they form a unique fingerprint.

Cite this