TY - GEN
T1 - A robust and time-efficient fingerprinting model for musical audio
AU - Sert, M.
AU - Baykal, B.
AU - Yazici, A.
PY - 2006
Y1 - 2006
N2 - An audio fingerprinting system deals with four challenging tasks: The robustness, the reliability, the compactness, and the scalability. By preserving the others, we explore the compactness and robustness aspects of audio fingerprinting systems and propose a description and storage model based on structural analysis of audio clips. The proposed method constructs the fingerprints from the most representative section of an audio clip. Contrary to similar studies, there is no need to construct and store all the fingerprints of each frame within the database; only one fingerprint per clip is sufficient. We make use of the Audio Spectrum Flatness (ASF) and the Audio Signature (AS) features of the MPEG-7 standard, which are new to the audio feature family and have not been considered as much as other feature types. The fingerprints are stored in the form of XML, thus providing the interoperability on a world-wide scale. XML-based representation of fingerprints is very suitable particularly for portable devices such as a PDA or a mobile phone due to the transportation issues. The proposed approach is evaluated on a test bed consisting of 540 musical clips based on the MPEG-7 features. The well known MFCC feature set is also considered in the experiments for the evaluation of features.
AB - An audio fingerprinting system deals with four challenging tasks: The robustness, the reliability, the compactness, and the scalability. By preserving the others, we explore the compactness and robustness aspects of audio fingerprinting systems and propose a description and storage model based on structural analysis of audio clips. The proposed method constructs the fingerprints from the most representative section of an audio clip. Contrary to similar studies, there is no need to construct and store all the fingerprints of each frame within the database; only one fingerprint per clip is sufficient. We make use of the Audio Spectrum Flatness (ASF) and the Audio Signature (AS) features of the MPEG-7 standard, which are new to the audio feature family and have not been considered as much as other feature types. The fingerprints are stored in the form of XML, thus providing the interoperability on a world-wide scale. XML-based representation of fingerprints is very suitable particularly for portable devices such as a PDA or a mobile phone due to the transportation issues. The proposed approach is evaluated on a test bed consisting of 540 musical clips based on the MPEG-7 features. The well known MFCC feature set is also considered in the experiments for the evaluation of features.
KW - Audio excerpt analysis
KW - Audio fingerprint
KW - Audio signature
KW - Audio spectrum flatness
UR - http://www.scopus.com/inward/record.url?scp=34547380150&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547380150&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:34547380150
SN - 1424402166
SN - 9781424402168
T3 - Proceedings of the International Symposium on Consumer Electronics, ISCE
SP - 628
EP - 633
BT - 2006 IEEE Tenth International Symposium on Consumer Electronics, ISCE 2006 - Proceedings
T2 - 2006 IEEE 10th International Symposium on Consumer Electronics, ISCE 2006
Y2 - 28 June 2006 through 1 July 2006
ER -