English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Book Chapter

Dataset generation and Bonobo classification from weakly labelled videos

MPS-Authors
/persons/resource/persons266715

Martin,  Pierre-Etienne       
Department of Comparative Cultural Psychology, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Martin, P.-E. (2024). Dataset generation and Bonobo classification from weakly labelled videos. In K. Arai (Ed.), Intelligent Systems and Applications: Lecture Notes in Networks and Systems: Proceedings of the 2023 Intelligent Systems Conference (IntelliSys), Volume 2 (pp. 689-700). Cham: Springer Nature Switzerland. doi:10.1007/978-3-031-47724-9_45.


Cite as: https://hdl.handle.net/21.11116/0000-000F-3960-5
Abstract
This paper presents a bonobo detection and classification pipeline built from the commonly used machine learning methods. Such application is motivated by the need to test bonobos in their enclosure using touch screen devices without human assistance. This work introduces a newly acquired dataset based on bonobo recordings generated semi-automatically. The recordings are weakly labelled and fed to a macaque detector in order to spatially detect the individual present in the video. Handcrafted features coupled with different classification algorithms and deep-learning methods using a ResNet architecture are investigated for bonobo identification. Performance is compared in terms of classification accuracy on the splits of the database using different data separation methods. We demonstrate the importance of data preparation and how a wrong data separation can lead to false good results. Finally, after a meaningful separation of the data, the best classification performance is obtained using a fine-tuned ResNet model and reaches 75% of accuracy.