Skip to main content

Showing 1–10 of 10 results for author: Egiazarian, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.06118  [pdf, other

    cs.LG cs.CL

    Extreme Compression of Large Language Models via Additive Quantization

    Authors: Vage Egiazarian, Andrei Panferov, Denis Kuznedelev, Elias Frantar, Artem Babenko, Dan Alistarh

    Abstract: The emergence of accurate open large language models (LLMs) has led to a race towards performant quantization techniques which can enable their execution on end-user devices. In this paper, we revisit the problem of ``extreme'' LLM compression -- defined as targeting extremely low bit counts, such as 2 to 3 bits per parameter -- from the point of view of classic methods in Multi-Codebook Quantizat… ▽ More

    Submitted 8 June, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: ICML, 2024

  2. arXiv:2306.03078  [pdf, other

    cs.CL cs.LG

    SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression

    Authors: Tim Dettmers, Ruslan Svirschevski, Vage Egiazarian, Denis Kuznedelev, Elias Frantar, Saleh Ashkboos, Alexander Borzunov, Torsten Hoefler, Dan Alistarh

    Abstract: Recent advances in large language model (LLM) pretraining have led to high-quality LLMs with impressive abilities. By compressing such LLMs via quantization to 3-4 bits per parameter, they can fit into memory-limited devices such as laptops and mobile phones, enabling personalized use. However, quantization down to 3-4 bits per parameter usually leads to moderate-to-high accuracy losses, especiall… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Extended preprint

  3. arXiv:2205.15403  [pdf, other

    cs.LG

    Neural Optimal Transport with General Cost Functionals

    Authors: Arip Asadulaev, Alexander Korotin, Vage Egiazarian, Petr Mokrov, Evgeny Burnaev

    Abstract: We introduce a novel neural network-based algorithm to compute optimal transport (OT) plans for general cost functionals. In contrast to common Euclidean costs, i.e., $\ell^1$ or $\ell^2$, such functionals provide more flexibility and allow using auxiliary information, such as class labels, to construct the required transport map. Existing methods for general costs are discrete and have limitation… ▽ More

    Submitted 30 May, 2024; v1 submitted 30 May, 2022; originally announced May 2022.

  4. arXiv:2201.12245  [pdf, other

    cs.LG stat.ML

    Wasserstein Iterative Networks for Barycenter Estimation

    Authors: Alexander Korotin, Vage Egiazarian, Lingxiao Li, Evgeny Burnaev

    Abstract: Wasserstein barycenters have become popular due to their ability to represent the average of probability measures in a geometrically meaningful way. In this paper, we present an algorithm to approximate the Wasserstein-2 barycenters of continuous measures via a generative model. Previous approaches rely on regularization (entropic/quadratic) which introduces bias or on input convex neural networks… ▽ More

    Submitted 9 January, 2023; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  5. arXiv:2011.15081  [pdf, other

    cs.CV cs.CG

    DEF: Deep Estimation of Sharp Geometric Features in 3D Shapes

    Authors: Albert Matveev, Ruslan Rakhimov, Alexey Artemov, Gleb Bobrovskikh, Vage Egiazarian, Emil Bogomolov, Daniele Panozzo, Denis Zorin, Evgeny Burnaev

    Abstract: We propose Deep Estimators of Features (DEFs), a learning-based framework for predicting sharp geometric features in sampled 3D shapes. Differently from existing data-driven methods, which reduce this problem to feature classification, we propose to regress a scalar field representing the distance from point samples to the closest feature line on local patches. Our approach is the first that scale… ▽ More

    Submitted 26 May, 2022; v1 submitted 30 November, 2020; originally announced November 2020.

  6. Deep Vectorization of Technical Drawings

    Authors: Vage Egiazarian, Oleg Voynov, Alexey Artemov, Denis Volkhonskiy, Aleksandr Safin, Maria Taktasheva, Denis Zorin, Evgeny Burnaev

    Abstract: We present a new method for vectorization of technical line drawings, such as floor plans, architectural drawings, and 2D CAD images. Our method includes (1) a deep learning-based cleaning stage to eliminate the background and imperfections in the image and fill in missing parts, (2) a transformer-based network to estimate vector primitives, and (3) optimization procedure to obtain the final primi… ▽ More

    Submitted 30 July, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

  7. Latent-Space Laplacian Pyramids for Adversarial Representation Learning with 3D Point Clouds

    Authors: Vage Egiazarian, Savva Ignatyev, Alexey Artemov, Oleg Voynov, Andrey Kravchenko, Youyi Zheng, Luiz Velho, Evgeny Burnaev

    Abstract: Constructing high-quality generative models for 3D shapes is a fundamental task in computer vision with diverse applications in geometry processing, engineering, and design. Despite the recent progress in deep generative modelling, synthesis of finely detailed 3D surfaces, such as high-resolution point clouds, from scratch has not been achieved with existing approaches. In this work, we propose to… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

  8. arXiv:1910.03524  [pdf, other

    cs.LG stat.ML

    Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs

    Authors: Denis Mazur, Vage Egiazarian, Stanislav Morozov, Artem Babenko

    Abstract: Learning useful representations is a key ingredient to the success of modern machine learning. Currently, representation learning mostly relies on embedding data into Euclidean space. However, recent work has shown that data in some domains is better modeled by non-euclidean metric spaces, and inappropriate geometry can result in inferior performance. In this paper, we aim to eliminate the inducti… ▽ More

    Submitted 16 October, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

  9. arXiv:1909.13082  [pdf, other

    cs.LG cs.CV stat.ML

    Wasserstein-2 Generative Networks

    Authors: Alexander Korotin, Vage Egiazarian, Arip Asadulaev, Alexander Safin, Evgeny Burnaev

    Abstract: We propose a novel end-to-end non-minimax algorithm for training optimal transport map**s for the quadratic cost (Wasserstein-2 distance). The algorithm uses input convex neural networks and a cycle-consistency regularization to approximate Wasserstein-2 distance. In contrast to popular entropic and quadratic regularizers, cycle-consistency does not introduce bias and scales well to high dimensi… ▽ More

    Submitted 10 December, 2020; v1 submitted 28 September, 2019; originally announced September 2019.

    Comments: 30 pages, 21 figures, 3 tables

  10. arXiv:1812.09874  [pdf, other

    cs.CV cs.GR cs.LG

    Perceptual deep depth super-resolution

    Authors: Oleg Voynov, Alexey Artemov, Vage Egiazarian, Alexander Notchenko, Gleb Bobrovskikh, Denis Zorin, Evgeny Burnaev

    Abstract: RGBD images, combining high-resolution color and lower-resolution depth from various types of depth sensors, are increasingly common. One can significantly improve the resolution of depth maps by taking advantage of color information; deep learning methods make combining color and depth information particularly easy. However, fusing these two sources of data may lead to a variety of artifacts. If… ▽ More

    Submitted 9 September, 2019; v1 submitted 24 December, 2018; originally announced December 2018.

    Comments: 26 pages