We gratefully acknowledge support from
the Simons Foundation and member institutions.

Mayug Maniparambil is qualified to endorse.

Do Vision and Language Encoders Represent the World Similarly?

Mayug Maniparambil: Is registered as an author of this paper.
Can endorse for cs.CV. (why?)

Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam and Noel E. O'Connor are not registered as owners of this paper. (why?)