Mid-level motion features for the recognition of biological movements

Sigala, RA; Serre, T; Poggio, T; Casile, A

doi:10.1177/03010066050340S101

Poster

External Resource

https://journals.sagepub.com/doi/pdf/10.1177/03010066050340S101
(Publisher version)

Citation

Sigala, R., Serre, T., Poggio, T., & Casile, A. (2005). Mid-level motion features for the recognition of biological movements. Poster presented at 28th European Conference on Visual Perception (ECVP 2005), A Coruña, Spain.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-D4D9-2

Abstract

Recognition of biological motion probably needs the integration of form and motion information. For recognition and categorisation of complex static shapes, recognition performance can be significantly increased by optimisation of the extracted mid-level form features. Several algorithms for the learning of optimised mid-level features from image data have been proposed. It seems likely that the visual recognition of complex movements is also based on optimised features. Exploiting a new physiologically inspired algorithm and classical unsupervised learning methods, we have tried to determine mid-level motion features that are maximally useful for the recognition of body movements from image sequences. We optimised mid-level neural detectors in a hierarchical model for the recognition of human actions (Giese and Poggio, 2003, Nature Reviews Neuroscience 4, 179-192) by unsupervised learning. Learning is based on a memory trace learning rule: each detector is associated with a memory variable that increases when the detector is activated during correct classifications, and that decreases otherwise. Detectors whose memory variable falls below a critical threshold 'die', and are eliminated from the model. In addition, we tested a classical principal-components approach. The model is trained with movies showing different human actions, from which optic flow fields are computed. The tested learning algorithms extract mid-level motion features that lead to a substantial improvement of the recognition performance. For the special case of walking, many of the extracted motion features are characterised by horizontal opponent motion. This result is consistent with psychophysical data showing that opponent horizontal motion is a dominant mid-level feature that accounts for high recognition rates, even for strongly impoverished stimuli (Casile and Giese, 2005, Journal of Vision 5, 348-360).
As for the categorisation of static shapes, recognition performance for human actions is improved by choosing optimised mid-level features. The learned features might predict receptive field properties of complex motion-selective neurons (e.g. in area KO/V3B).
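
The memory trace rule described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no parameter values or update equations, so the names `Detector` and `update_memory`, the step sizes `gain` and `decay`, the initial memory value and the `threshold` are all assumptions chosen for illustration. "Decreases otherwise" is read here as: the trace decreases whenever the detector was not active or the classification was wrong.

```python
from dataclasses import dataclass


@dataclass
class Detector:
    """Hypothetical mid-level detector with a memory trace (illustrative only)."""
    name: str
    memory: float = 1.0  # assumed initial value of the memory variable


def update_memory(detectors, active, correct, gain=0.5, decay=0.25, threshold=0.0):
    """Apply one trial of the memory trace rule and return the surviving detectors.

    detectors: list of Detector objects currently in the model
    active:    set of names of detectors activated on this trial
    correct:   True if the model classified this trial correctly
    gain, decay, threshold: assumed values, not taken from the abstract
    """
    survivors = []
    for d in detectors:
        if correct and d.name in active:
            d.memory += gain   # activated during a correct classification
        else:
            d.memory -= decay  # every other case
        if d.memory >= threshold:
            survivors.append(d)  # detectors below threshold 'die' and are dropped
    return survivors


# Toy run: detector_a fires on every correctly classified trial,
# detector_b never fires, so its trace decays until it is eliminated.
survivors = [Detector("detector_a"), Detector("detector_b")]
for _ in range(6):
    survivors = update_memory(survivors, active={"detector_a"}, correct=True)

print([d.name for d in survivors])
```

In this toy run only the detector that contributes to correct classifications remains in the model; how activation is thresholded and how the trace is scaled in the actual model is not specified in the abstract.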