We gratefully acknowledge support from
the Simons Foundation and member institutions.

Mustafa Shukor is qualified to endorse.

UnIVAL: Unified Model for Image, Video, Audio and Language Tasks

Mustafa Shukor: Is registered as an author of this paper.
Can endorse for cs.CV. (why?)

Corentin Dancette, Alexandre Rame and Matthieu Cord are not registered as owners of this paper. (why?)