Dai, Dengxin: Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Vasudevan_Sound_and_Visual_Representation_Learning_With_Multiple_Pretraining_Tasks_CVPR_2022_paper.pdf (Preprint), 745KB
Vasudevan, A. B., Dai, D., & Van Gool, L. (2022). Sound and Visual Representation Learning with Multiple Pretraining Tasks. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 14596-14606). Piscataway, NJ: IEEE. doi:10.1109/CVPR52688.2022.01421.