Topic models for semantics-preserving video compression

Wanke, J; Ulges, A; Lampert, CH; Breuel, TM

doi:10.1145/1743384.1743433

Lokale TagsFreigabegeschichteDetailsÜbersicht

Topic models for semantics-preserving video compression

Wanke, J., Ulges, A., Lampert, C., & Breuel, T. (2010). Topic models for semantics-preserving video compression. In J. Wang, N. Boujemaa, N. Ramirez, & A. Natsev (Eds.), MIR '10: Proceedings of the international conference on Multimedia information retrieval (pp. 275-284). New York, NY, USA: ACM Press.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/21.11116/0000-0002-94D8-3 Versions-Permalink: https://hdl.handle.net/21.11116/0000-0002-94D9-2

Genre: Konferenzbeitrag

Dateien

einblenden: Dateien

Externe Referenzen

einblenden:

ausblenden:

externe Referenz:
https://dl.acm.org/citation.cfm?doid=1743384.1743433 (Verlagsversion) Open Access Status unbekannt

Beschreibung:
-

OA-Status:

Urheber

einblenden:

ausblenden:

Urheber:
Wanke, J, Autor
Ulges, A, Autor
Lampert, CH^{1, 2}, Autor
Breuel, TM, Autor

Affiliations:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795
2Max Planck Institute for Biological Cybernetics, Max Planck Society, Spemannstrasse 38, 72076 Tübingen, DE, ou_1497794

Inhalt

einblenden:

ausblenden:

Schlagwörter: -

Zusammenfassung:

Most state-of-the-art systems for content-based video understanding tasks require video content to be represented as collections of many low-level descriptors, e.g. as histograms of the color, texture or motion in local image regions. In order to preserve as much of the information contained in the original video as possible, these representations are typically high-dimensional, which conflicts with the aim for compact descriptors that would allow better efficiency and lower storage requirements.

In this paper, we address the problem of semantic compression of video, i.e. the reduction of low-level descriptors to a small number of dimensions while preserving most of the semantic information. For this, we adapt topic models - which have previously been used as compact representations of still images - to take into account the temporal structure of a video, as well as multi-modal components such as motion information.

Experiments on a large-scale collection of YouTube videos show that we can achieve a compression ratio of 20 : 1 compared to ordinary histogram representations and at least 2 : 1 compared to other dimensionality reduction techniques without significant loss of prediction accuracy. Also, improvements are demonstrated for our video-specific extensions modeling temporal structure and multiple modalities.

Details

einblenden:

ausblenden:

Sprache(n):

Datum: Erschienen: 2010-03

Publikationsstatus: Erschienen

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: DOI: 10.1145/1743384.1743433

Art des Abschluß: -

Veranstaltung

einblenden:

ausblenden:

Titel: 2010 ACM SIGMM International Conference on Multimedia Information Retrieval (MIR 2010)

Veranstaltungsort: Philadelphia, PA, USA

Start-/Enddatum: 2010-03-29 - 2010-03-31

ausblenden:

Titel: MIR '10: Proceedings of the international conference on Multimedia information retrieval

Genre der Quelle: Konferenzband

Urheber:
Wang, JZ, Herausgeber
Boujemaa, N, Herausgeber
Ramirez, NO, Herausgeber
Natsev, A, Herausgeber

Affiliations:
-

Ort, Verlag, Ausgabe: New York, NY, USA : ACM Press

Seiten: - Band / Heft: - Artikelnummer: - Start- / Endseite: 275 - 284 Identifikator: ISBN: 978-1-60558-815-5

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle 1