Akata, Zeynep, Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society
Kayser_E-ViL_A_Dataset_and_Benchmark_for_Natural_Language_Explanations_in_ICCV_2021_paper.pdf (Preprint), 702KB
Kayser, M., Camburu, O.-M., Salewski, L., Emde, C., Do, V., Akata, Z., & Lukasiewicz, T. (2021). e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks. In ICCV 2021 (pp. 1224-1234). Piscataway, NJ: IEEE. doi:10.1109/ICCV48922.2021.00128.