We gratefully acknowledge support from
the Simons Foundation and member institutions.

Dongxu Li is qualified to endorse.

LAVIS: A Library for Language-Vision Intelligence

Dongxu Li: Is registered as an author of this paper.
Can endorse for cs.CL, cs.CV. (why?)

Junnan Li, Hung Le, Guangsen Wang, Silvio Savarese and Steven C. H. Hoi are not registered as owners of this paper. (why?)