Document Preview Unavailable

When and why vision-language models behave like bags-of-words, and what to do about it?

Yuksekgonul, Mert; Bianchi, Federico; Kalluri, Pratyusha; Jurafsky, Dan; Zou, James.  arXiv.org, Mar 23, 2023.

You might have access to this document