We gratefully acknowledge support from
the Simons Foundation and member institutions.

Li Lyna Zhang is qualified to endorse.

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference

Li Lyna Zhang: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.CV, cs.IR, cs.LG. (why?)

Junyan Li, Jiahang Xu, Yu**g Wang, Shaoguang Yan, Yunqing Xia, Yuqing Yang, Ting Cao, Hao Sun, Weiwei Deng, Qi Zhang and Mao Yang are not registered as owners of this paper. (why?)