
Showing 1–2 of 2 results for author: Nascimento, M G d

  1. arXiv:2401.15024  [pdf, other]

    cs.LG cs.CL

    SliceGPT: Compress Large Language Models by Deleting Rows and Columns

    Authors: Saleh Ashkboos, Maximilian L. Croci, Marcelo Gennari do Nascimento, Torsten Hoefler, James Hensman

    Abstract: Large language models have become the cornerstone of natural language processing, but their use comes with substantial costs in terms of compute and memory resources. Sparsification provides a solution to alleviate these resource constraints, and recent works have shown that trained models can be sparsified post-hoc. Existing sparsification techniques face challenges as they need additional data s…

    Submitted 9 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 22 pages, 8 figures, accepted at ICLR24

  2. arXiv:2007.07743  [pdf, other]

    cs.CV

    Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes

    Authors: Marcelo Gennari do Nascimento, Theo W. Costain, Victor Adrian Prisacariu

    Abstract: We propose a novel method for neural network quantization that casts the neural architecture search problem as one of hyperparameter search to find non-uniform bit distributions throughout the layers of a CNN. We perform the search assuming a Multi-Task Gaussian Processes prior, which splits the problem into multiple tasks, each corresponding to a different number of training epochs, and explore the s…

    Submitted 20 July, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: Accepted for publication at ECCV 2020. Code available at https://code.active.vision. Updated for typo