We gratefully acknowledge support from
the Simons Foundation and member institutions.

Tomasz Limisiewicz is qualified to endorse.

Tokenization Impacts Multilingual Language Modeling: Assessing Vocabulary Allocation and Overlap Across Languages

Tomasz Limisiewicz: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.LG. (why?)

Jiří Balhar and David Mareček are not registered as owners of this paper. (why?)