Efficient Index-Based Audio Matching

Kurth, Frank; Müller, Meinard

doi:10.1109/TASL.2007.911552

Kurth, F., & Müller, M. (2008). Efficient Index-Based Audio Matching. IEEE Transactions on Audio, Speech, and Language Processing, 16(2), 382-395. doi:10.1109/TASL.2007.911552.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-1B7B-0 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-1B7C-E

Genre: Journal Article

Creators

Creators:
Kurth, Frank, Author
Müller, Meinard¹, Author

Affiliations:
¹ Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047

Content

Free keywords: -

Abstract: Given a large audio database of music recordings, the goal of classical audio identification is to identify a particular audio recording by means of a short audio fragment. Even though recent identification algorithms show a significant degree of robustness towards noise, MP3 compression artifacts, and uniform temporal distortions, the notion of similarity is rather close to the identity. In this paper, we address a higher level retrieval problem, which we refer to as audio matching: given a short query audio clip, the goal is to automatically retrieve all excerpts from all recordings within the database that musically correspond to the query. In our matching scenario, opposed to classical audio identification, we allow semantically motivated variations as they typically occur in different interpretations of a piece of music. To this end, this paper presents an efficient and robust audio matching procedure that works even in the presence of significant variations, such as nonlinear temporal, dynamical, and spectral deviations, where existing algorithms for audio identification would fail. Furthermore, the combination of various deformation- and fault-tolerance mechanisms allows us to employ standard indexing techniques to obtain an efficient, index-based matching procedure, thus providing an important step towards semantically searching large-scale real-world music collections.

Details

Language(s): eng - English

Dates: Modified: 2009-03-16; Date issued: 2008

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: Peer

Identifiers: eDoc: 428141
DOI: 10.1109/TASL.2007.911552
URI: http://dx.doi.org/10.1109/TASL.2007.911552
Other: Local-ID: C125756E0038A185-3AD2D78184F392C5C125753E0058C93D-Müller2008

Degree: -

Source 1

Title: IEEE Transactions on Audio, Speech, and Language Processing

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: -

Pages: -

Volume / Issue: 16 (2)

Sequence Number: -

Start / End Page: 382 - 395

Identifier: ISSN: 1558-7916