We gratefully acknowledge support from
the Simons Foundation and member institutions.

Jie-Neng Chen is qualified to endorse.

LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression

Jie-Neng Chen: Is registered as an author of this paper.
Can endorse for cs.CV. (why?)

Luoxin Ye, Ju He, Zhao-Yang Wang, Daniel Khashabi and Alan Yuille are not registered as owners of this paper. (why?)