
Enhancing Multimodal Large Language Models with Multi-instance Visual Prompt Generator for Visual Representation Enrichment

Zhong, Wenliang; Wu, Wenyi; Li, Qi; Barton, Rob; Du, Boxin; et al.  arXiv.org, Jun 5, 2024.
