Weixuan Sun and Yiran Zhong are qualified to endorse.

Vicinity Vision Transformer

Weixuan Sun: Is registered as an author of this paper.
Can endorse for cs.CL, cs.CV, cs.MM.
Yiran Zhong: Is registered as an author of this paper.
Can endorse for cs.CL, cs.CV.

Zhen Qin, Hui Deng, Jianyuan Wang, Yi Zhang, Kaihao Zhang, Nick Barnes, Stan Birchfield, and Lingpeng Kong are not registered as owners of this paper.