English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling, and cloud-native open source tools

Whiteway, M., Biderman, D., Hurwitz, C., Greenspan, N., Lee, R., Vishnubhotla, A., et al. (2023). Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling, and cloud-native open source tools. Poster presented at 52nd Annual Meeting of the Society for Neuroscience (Neuroscience 2023), Washington, DC, USA.

Item is

Files

show Files

Creators

show
hide
 Creators:
Whiteway, M, Author
Biderman, D, Author
Hurwitz, CL, Author
Greenspan, N, Author
Lee, RS, Author
Vishnubhotla, A, Author
Schartner, M, Author
Huntenburg, JM1, Author                 
Khanal, A, Author
Meijer, G, Author
Noel, J, Author
Pan-Vazquez, A, Author
Socha, K, Author
Urai, AE, Author
Laboratory, I, Author
Warren, R, Author
Noone, D, Author
Pedraja, F, Author
Cunningham, JP, Author
Sawtell, N, Author
Paninski, L, Author more..
Affiliations:
1Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_3017468              

Content

show
hide
Free keywords: -
 Abstract: Pose estimation algorithms are shedding new light on animal behavior and intelligence. Most existing models are only trained with labeled frames (supervised learning). Although effective in many cases, the fully supervised approach requires extensive image labeling, struggles to generalize to new videos, and produces noisy outputs that hinder downstream analyses. We address each of these limitations with a semi-supervised approach that leverages the spatiotemporal statistics of unlabeled videos in two different ways. First, we introduce unsupervised training objectives that penalize the network whenever its predictions violate smoothness of physical motion, multiple-view geometry, or depart from a low- dimensional subspace of plausible body configurations. Second, we design a new network architecture that predicts pose for a given frame using temporal context from surrounding unlabeled frames. These context frames help resolve brief occlusions or ambiguities between nearby and similar-looking body parts. The resulting pose estimation networks achieve better performance with fewer labels, generalize better to unseen videos, and provide smoother and more reliable pose trajectories for downstream analysis; for example, these improved pose trajectories exhibit stronger correlations with neural activity. We also propose a Bayesian post- processing approach based on deep ensembling and Kalman smoothing that further improves tracking accuracy and robustness. We demonstrate our results on a range of datasets, including head-fixed mice running on a treadmill, freely swimming fish, and head-fixed mice data from the International Brain Lab. In addition, we release a deep learning package that adheres to industry best practices, supporting easy model development and accelerated training and prediction. Our package is accompanied by a cloud application that allows users to annotate data, train networks, and predict new videos at scale, directly from the browser.

Details

show
hide
Language(s):
 Dates: 2023-11
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: -
 Degree: -

Event

show
hide
Title: 52nd Annual Meeting of the Society for Neuroscience (Neuroscience 2023)
Place of Event: Washington, DC, USA
Start-/End Date: 2023-11-11 - 2023-11-15

Legal Case

show

Project information

show

Source 1

show
hide
Title: 52nd Annual Meeting of the Society for Neuroscience (Neuroscience 2023)
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: - Sequence Number: PSTR512.04 Start / End Page: 1332 - 1333 Identifier: -