Showing 1–1 of 1 results for author: Hadad, O

  1. arXiv:2307.08771  [pdf, other]

    cs.CV

    UPSCALE: Unconstrained Channel Pruning

    Authors: Alvin Wan, Hanxiang Hao, Kaushik Patnaik, Yueyang Xu, Omer Hadad, David Güera, Zhile Ren, Qi Shan

    Abstract: As neural networks grow in size and complexity, inference speeds decline. To combat this, one of the most effective compression techniques -- channel pruning -- removes channels from weights. However, for multi-branch segments of a model, channel removal can introduce inference-time memory copies. In turn, these copies increase inference latency -- so much so that the pruned model can be slower th…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 29 pages, 26 figures, accepted to ICML 2023
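
The abstract's point about multi-branch segments can be illustrated with a toy example. The sketch below is not the paper's method; it assumes each "layer" is a plain channel-mixing matrix (like a 1x1 convolution evaluated at one pixel), and every name in it (`apply_layer`, `prune_output_channels`, `prune_input_channels`, `gather`) is hypothetical. Two branches read the same producer output but keep different channels, so the producer has to keep the union and each branch has to gather its own subset, which builds a new buffer, i.e. a copy.

```python
# Toy illustration only (assumed setup, hypothetical names): layers are
# channel-mixing matrices where weight[o][i] maps input channel i to output o.

def apply_layer(weight, x):
    """Apply a channel-mixing matrix to one activation vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weight]

def prune_output_channels(weight, keep):
    """Keep only the listed output channels (rows of the matrix)."""
    return [weight[o] for o in keep]

def prune_input_channels(weight, keep):
    """Keep only the listed input channels (columns of the matrix)."""
    return [[row[i] for i in keep] for row in weight]

def gather(x, positions):
    """Select channels from an activation. This allocates a new list: a copy."""
    return [x[p] for p in positions]

# Producer: 2 input channels -> 4 output channels.
producer = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, -1.0]]
# Two branches consume the producer's 4 channels.
branch_a = [[1.0, 2.0, 3.0, 4.0]]
branch_b = [[4.0, 3.0, 2.0, 1.0]]

# Each branch is pruned independently and keeps a different channel set.
keep_a = [0, 2]
keep_b = [1, 2]

# The shared producer can only drop channels that no branch needs.
union = sorted(set(keep_a) | set(keep_b))          # [0, 1, 2]
producer_pruned = prune_output_channels(producer, union)
branch_a_pruned = prune_input_channels(branch_a, keep_a)
branch_b_pruned = prune_input_channels(branch_b, keep_b)

x = [3.0, 5.0]
h = apply_layer(producer_pruned, x)                # shared 3-channel activation

# Each branch must gather its own channels from the shared activation.
# These two gathers are the extra inference-time copies.
h_a = gather(h, [union.index(c) for c in keep_a])
h_b = gather(h, [union.index(c) for c in keep_b])

out_a = apply_layer(branch_a_pruned, h_a)
out_b = apply_layer(branch_b_pruned, h_b)
print(h, out_a, out_b)
```

If both branches kept the same channels, the producer could emit exactly that set and neither gather would be needed; the copies appear only because the kept sets differ.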