Vision-Language Transformers Learning Multimodal Representations? A Probing Perspective
Emmanuelle SALIN
LIS, Aix-Marseille Université
https://college-doctoral.univ-amu.fr/inscrit/12991
Date(s) : 04/02/2022 iCal
14h30 - 15h30
In recent years, joint text-image embeddings have significantly improved thanks to the development of transformer-bas
Catégories