WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning
Srinivasan, Krishna; Raman, Karthik; Chen, Jiecao; Bendersky, Michael; Najork, Marc. arXiv.org, Mar 3, 2021.