
Showing 1–4 of 4 results for author: Chuang, P I

  1. arXiv:2106.11890  [pdf, other]

    cs.LG

    Latency-Aware Neural Architecture Search with Multi-Objective Bayesian Optimization

    Authors: David Eriksson, Pierce I-Jen Chuang, Samuel Daulton, Peng Xia, Akshat Shrivastava, Arun Babu, Shicong Zhao, Ahmed Aly, Ganesh Venkatesh, Maximilian Balandat

    Abstract: When tuning the architecture and hyperparameters of large machine learning models for on-device deployment, it is desirable to understand the optimal trade-offs between on-device latency and model accuracy. In this work, we leverage recent methodological advances in Bayesian optimization over high-dimensional search spaces and multi-objective Bayesian optimization to efficiently explore these trad…

    Submitted 25 June, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: To Appear at the 8th ICML Workshop on Automated Machine Learning, ICML 2021

  2. arXiv:2008.09916  [pdf, other]

    cs.LG cs.CV eess.IV

    One Weight Bitwidth to Rule Them All

    Authors: Ting-Wu Chin, Pierce I-Jen Chuang, Vikas Chandra, Diana Marculescu

    Abstract: Weight quantization for deep ConvNets has shown promising results for applications such as image classification and semantic segmentation and is especially important for applications where memory storage is limited. However, when aiming for quantization without accuracy degradation, different tasks may end up with different bitwidths. This creates complexity for software and hardware support and t…

    Submitted 28 August, 2020; v1 submitted 22 August, 2020; originally announced August 2020.

    Comments: Accepted at ECCV 2020 Embedded Vision Workshop (Best paper)

  3. arXiv:1807.06964  [pdf, other]

    cs.CV

    Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)

    Authors: Jungwook Choi, Pierce I-Jen Chuang, Zhuo Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan

    Abstract: Deep learning algorithms achieve high classification accuracy at the expense of significant computation cost. In order to reduce this cost, several quantization schemes have gained attention recently with some focusing on weight quantization, and others focusing on quantizing activations. This paper proposes novel techniques that target weight and activation quantizations separately resulting in a…

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1805.06085

  4. arXiv:1805.06085  [pdf, other]

    cs.CV cs.AI

    PACT: Parameterized Clipping Activation for Quantized Neural Networks

    Authors: Jungwook Choi, Zhuo Wang, Swagath Venkataramani, Pierce I-Jen Chuang, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan

    Abstract: Deep learning algorithms achieve high classification accuracy at the expense of significant computation cost. To address this cost, a number of quantization schemes have been proposed - but most of these techniques focused on quantizing weights, which are relatively smaller in size compared to activations. This paper proposes a novel quantization scheme for activations during training - that enabl…

    Submitted 17 July, 2018; v1 submitted 15 May, 2018; originally announced May 2018.