Visual Salience and Perceptual Grouping in Multimodal Interactivity

Landragin, Frédéric; Bellalem, Nadia; Romary, Laurent

Conference Paper

MPG Authors

Romary, Laurent
Max Planck Digital Library, Max Planck Society

Full Texts (freely accessible)

landragin.pdf
(any full text), 89 KB

Citation

Landragin, F., Bellalem, N., & Romary, L. (2001). Visual Salience and Perceptual Grouping in Multimodal Interactivity. In International Workshop on Information Presentation and Natural Multimodal Dialogue (pp. 151-155).

Citation link: https://hdl.handle.net/11858/00-001M-0000-0013-877D-8

Abstract

This paper deals with the pragmatic interpretation of multimodal referring expressions in man-machine dialogue systems. We show the importance of building up a structure of the visual context at a semantic level, in order to enrich the significant possibilities of interpretations and to make possible the fusion of this structure with the ones obtained from the linguistic and gesture semantic analyses. Visual salience and perceptual grouping are two notions that guide such a structuring. We thus propose a hierarchy of salience criteria linked to an algorithm that detects salient objects, as well as guidelines for grouping algorithms. We show how the integration of the results of all these algorithms is a complex problem. We propose simple heuristics to reduce this complexity and we conclude on the usability of such heuristics in actual systems.