Akata, Zeynep Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society;
https://rdcu.be/c2vwp (Any fulltext)
Mercea, O.-B., Hummel, T., Koepke, A. S., & Akata, Z. (2022). Temporal and Cross-modal Attention for Audio-Visual Zero-Shot Learning. In S. Avidan, G. Brostow, M. Cissé, G. M. Farinella, & T. Hassner (Eds.), Computer Vision -- ECCV 2022 (pp. 488-505). Berlin: Springer. doi:10.1007/978-3-031-20044-1_28.