PICASSO - Automated Soundtrack Suggestion for Multi-modal Data

Stupar, Aleksandar; Michel, Sebastian

doi:10.1145/2063576.2064027

Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Released

Conference Paper

PICASSO - Automated Soundtrack Suggestion for Multi-modal Data

MPS-Authors

/persons/resource/persons45568

Stupar, Aleksandar
Databases and Information Systems, MPI for Informatics, Max Planck Society;

/persons/resource/persons45041

Michel, Sebastian
Databases and Information Systems, MPI for Informatics, Max Planck Society;

External Resource

No external resources are shared

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

There are no public fulltexts stored in PuRe

Supplementary Material (public)

There is no public supplementary material available

Citation

Stupar, A., & Michel, S. (2011). PICASSO - Automated Soundtrack Suggestion for Multi-modal Data. In B. Berendt, A. de Vries, W. Fan, & C. Macdonald (Eds.), CIKM’11 (pp. 2589-2592). New York, NY: ACM. doi:10.1145/2063576.2064027.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0010-149A-0

Abstract

We demonstrate PICASSO, a novel approach to soundtrack recommendation. Given text, video, or image documents, PICASSO selects the best fitting music pieces, out of a given set of files, for instance, a user's personal mp3 collection. This task, commonly referred to as soundtrack suggestion, is non-trivial as it requires a lot of human attention and a good deal of experience, with master pieces distinguished, e.g., with the Academy Award for Best Original Score. We put forward PICASSO to solve this task in a fully automated way. We address the problem by extracting the required information, in form of music/screenshot samples, from available contemporary movies, making the training set easily obtainable. The training set is further extended with information acquired from movie scripts and subtitles, giving us a richer description of the action and atmosphere expressed in a particular movie scene. Although the number of applications for this approach is very large, we focus on two selected applications. First, we consider recommendation of the soundtrack for the slide show generation based on the given set of images. Second, we consider recommending a soundtrack as the background music for given audio books.