We gratefully acknowledge support from
the Simons Foundation and member institutions.

Yequan Wang is qualified to endorse.

Not all Layers of LLMs are Necessary during Inference

Yequan Wang: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL. (why?)

Siqi Fan, Xin Jiang, Xiang Li, Xuying Meng, Peng Han, Shuo Shang, Aixin Sun and Zhongyuan Wang are not registered as owners of this paper. (why?)