Fine-Grained Semantic Segmentation of Motion Capture Data using Convolutional 
Neural Networks

Cheema, Noshaba

Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Released

Thesis

Fine-Grained Semantic Segmentation of Motion Capture Data using Convolutional Neural Networks

MPS-Authors

Cheema, Noshaba
International Max Planck Research School, MPI for Informatics, Max Planck Society;

External Resource

No external resources are shared

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

There are no public fulltexts stored in PuRe

Supplementary Material (public)

There is no public supplementary material available

Citation

Cheema, N. (2019). Fine-Grained Semantic Segmentation of Motion Capture Data using Convolutional Neural Networks. Master Thesis, Universität des Saarlandes, Saarbrücken.

Cite as: https://hdl.handle.net/21.11116/0000-0005-9C5E-3

Abstract

Human motion capture data has been widely used in data-driven character animation. In
order to generate realistic, natural-looking motions, most data-driven approaches require
considerable eﬀorts of pre-processing, including motion segmentation, annotation, and
so on. Existing (semi-) automatic solutions either require hand-crafted features for
motion segmentation or do not produce the semantic annotations required for motion
synthesis and building large-scale motion databases. In this thesis, an approach for a
semi-automatic framework for semantic segmentation of motion capture data based on
(semi-) supervised machine learning techniques is developed. The motion capture data is
ﬁrst transformed into a “motion image” to apply common convolutional neural networks
for image segmentation. Convolutions over the time domain enable the extraction of
temporal information and dilated convolutions are used to enlarge the receptive ﬁeld
exponentially using comparably few layers and parameters. The ﬁnally developed dilated
temporal fully-convolutional model is compared against state-of-the-art models in action
segmentation, as well as a popular network for sequence modeling. The models are
further tested on noisy and inaccurate training labels and the developed model is found
to be surprisingly robust and self-correcting.