Akata, Zeynep: Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
arXiv:2105.01517.pdf (Preprint), 7 MB
Chen, Y., Hummel, T., Koepke, A. S., & Akata, Z. (2021). Where and When: Space-Time Attention for Audio-Visual Explanations. Retrieved from https://arxiv.org/abs/2105.01517.