PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Mel Frequency Cepstral Coefficients: An Evaluation of Robustness of MP3 Encoded Music
S. Sigurdson, K.B. Petersen and Jan Larsen
In: ISMIR 2006, Vicoria, Canada(2006).


In large MP3 databases, files are typically generated with different parameter settings, i.e., bit rate and sampling rates. This is of concern for MIR applications, as encoding difference can potentially confound meta-data estimation and similarity evaluation. In this paper we will discuss the influence of MP3 coding for the Mel frequency cepstral coeficients (MFCCs). The main result is that the widely used subset of the MFCCs is robust at bit rates equal or higher than 128 kbits/s, for the implementations we have investigated. However, for lower bit rates, e.g., 64 kbits/s, the implementation of the Mel filter bank becomes an issue.

EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
Information Retrieval & Textual Information Access
ID Code:2820
Deposited By:Tue Lehn-Schiøler
Deposited On:22 November 2006