Skip to main content

Showing 1–4 of 4 results for author: Gudovskiy, D A

.
  1. arXiv:2407.03442  [pdf, other

    cs.CV

    Fisher-aware Quantization for DETR Detectors with Critical-category Objectives

    Authors: Huanrui Yang, Yafeng Huang, Zhen Dong, Denis A Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Yuan Du, Kurt Keutzer, Shanghang Zhang

    Abstract: The impact of quantization on the overall performance of deep learning models is a well-studied problem. However, understanding and mitigating its effects on a more fine-grained level is still lacking, especially for harder tasks such as object detection with both classification and regression objectives. This work defines the performance for a subset of task-critical categories, i.e. the critical… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Poster presentation at the 2nd Workshop on Advancing Neural Network Training: Computational Efficiency, Scalability, and Resource Optimization (WANT@ICML 2024)

  2. arXiv:2312.09148  [pdf, other

    cs.LG cs.CV

    Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting

    Authors: Anthony Chen, Huanrui Yang, Yulu Gan, Denis A Gudovskiy, Zhen Dong, Haofan Wang, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang

    Abstract: Uncertainty estimation is crucial for machine learning models to detect out-of-distribution (OOD) inputs. However, the conventional discriminative deep learning classifiers produce uncalibrated closed-set predictions for OOD data. A more robust classifiers with the uncertainty estimation typically require a potentially unavailable OOD dataset for outlier exposure training, or a considerable amount… ▽ More

    Submitted 27 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: ICML2024. Project website is available at https://antonioo-c.github.io/projects/split-ensemble

  3. arXiv:1808.05285  [pdf, other

    cs.CV

    DNN Feature Map Compression using Learned Representation over GF(2)

    Authors: Denis A. Gudovskiy, Alec Hodgkinson, Luca Rigazio

    Abstract: In this paper, we introduce a method to compress intermediate feature maps of deep neural networks (DNNs) to decrease memory storage and bandwidth requirements during inference. Unlike previous works, the proposed method is based on converting fixed-point activations into vectors over the smallest GF(2) finite field followed by nonlinear dimensionality reduction (NDR) layers embedded into a DNN. S… ▽ More

    Submitted 15 August, 2018; originally announced August 2018.

    Comments: CEFRL2018

  4. arXiv:1706.02393  [pdf, other

    cs.CV cs.NE

    ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks

    Authors: Denis A. Gudovskiy, Luca Rigazio

    Abstract: In this paper we introduce ShiftCNN, a generalized low-precision architecture for inference of multiplierless convolutional neural networks (CNNs). ShiftCNN is based on a power-of-two weight representation and, as a result, performs only shift and addition operations. Furthermore, ShiftCNN substantially reduces computational cost of convolutional layers by precomputing convolution terms. Such an o… ▽ More

    Submitted 7 June, 2017; originally announced June 2017.