Showing 1–2 of 2 results for author: Benbaki, R

  1. arXiv:2403.07094  [pdf, other]

    cs.LG

    FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning

    Authors: Xiang Meng, Wenyu Chen, Riade Benbaki, Rahul Mazumder

    Abstract: The increasing computational demands of modern neural networks present deployment challenges on resource-constrained devices. Network pruning offers a solution to reduce model size and computational cost while maintaining performance. However, most current pruning methods focus primarily on improving sparsity by reducing the number of nonzero parameters, often neglecting other deployment costs suc…

    Submitted 11 March, 2024; originally announced March 2024.

  2. arXiv:2302.14623  [pdf, other]

    cs.LG cs.CV math.OC

    Fast as CHITA: Neural Network Pruning with Combinatorial Optimization

    Authors: Riade Benbaki, Wenyu Chen, Xiang Meng, Hussein Hazimeh, Natalia Ponomareva, Zhe Zhao, Rahul Mazumder

    Abstract: The sheer size of modern neural networks makes model serving a serious computational challenge. A popular class of compression techniques overcomes this challenge by pruning or sparsifying the weights of pretrained networks. While useful, these techniques often face serious tradeoffs between computational requirements and compression quality. In this work, we propose a novel optimization-based pru…

    Submitted 28 February, 2023; originally announced February 2023.