Xijun Wang is qualified to endorse.

ViLA: Efficient Video-Language Alignment for Video Question Answering

Xijun Wang is registered as an author of this paper and can endorse for cs.AI, cs.CV, cs.LG, and cs.RO.

Junbang Liang, Chun-Kai Wang, Kenan Deng, Yu Lou, Ming Lin and Shan Yang are not registered as owners of this paper.