Skip to main content

Showing 1–43 of 43 results for author: Babenko, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19380  [pdf, other

    cs.LG

    TabReD: A Benchmark of Tabular Machine Learning in-the-Wild

    Authors: Ivan Rubachev, Nikolay Kartashev, Yury Gorishniy, Artem Babenko

    Abstract: Benchmarks that closely reflect downstream application scenarios are essential for the streamlined adoption of new research in tabular machine learning (ML). In this work, we examine existing tabular benchmarks and find two common characteristics of industry-grade tabular data that are underrepresented in the datasets available to the academic community. First, tabular data often changes over time… ▽ More

    Submitted 1 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Code: https://github.com/yandex-research/tabred (V2: fix the link to the code in this comment; no changes to the PDF)

  2. arXiv:2406.14539  [pdf, other

    cs.CV

    Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps

    Authors: Nikita Starodubcev, Mikhail Khoroshikh, Artem Babenko, Dmitry Baranchuk

    Abstract: Diffusion distillation represents a highly promising direction for achieving faithful text-to-image generation in a few sampling steps. However, despite recent successes, existing distilled models still do not provide the full spectrum of diffusion abilities, such as real image inversion, which enables many precise image manipulation methods. This work aims to enrich distilled text-to-image diffus… ▽ More

    Submitted 26 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Project page: https://yandex-research.github.io/invertible-cd/

  3. arXiv:2404.05666  [pdf, other

    cs.CV

    YaART: Yet Another ART Rendering Technology

    Authors: Sergey Kastryulin, Artem Konev, Alexander Shishenya, Eugene Lyapustin, Artem Khurshudov, Alexander Tselousov, Nikita Vinokurov, Denis Kuznedelev, Alexander Markovich, Grigoriy Livshits, Alexey Kirillov, Anastasiia Tabisheva, Liubov Chubarova, Marina Kaminskaia, Alexander Ustyuzhanin, Artemii Shvetsov, Daniil Shlenskii, Valerii Startsev, Dmitrii Kornilov, Mikhail Romanov, Artem Babenko, Sergei Ovcharenko, Valentin Khrulkov

    Abstract: In the rapidly progressing field of generative models, the development of efficient and high-fidelity text-to-image diffusion systems represents a significant frontier. This study introduces YaART, a novel production-grade text-to-image cascaded diffusion model aligned to human preferences using Reinforcement Learning from Human Feedback (RLHF). During the development of YaART, we especially focus… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Prompts and additional information are available on the project page, see https://ya.ru/ai/art/paper-yaart-v1

  4. arXiv:2403.06866  [pdf, other

    cs.CV

    QUASAR: QUality and Aesthetics Scoring with Advanced Representations

    Authors: Sergey Kastryulin, Denis Prokopenko, Artem Babenko, Dmitry V. Dylov

    Abstract: This paper introduces a new data-driven, non-parametric method for image quality and aesthetics assessment, surpassing existing approaches and requiring no prompt engineering or fine-tuning. We eliminate the need for expressive textual embeddings by proposing efficient image anchors in the data. Through extensive evaluations of 7 state-of-the-art self-supervised models, our method demonstrates sup… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  5. arXiv:2401.06118  [pdf, other

    cs.LG cs.CL

    Extreme Compression of Large Language Models via Additive Quantization

    Authors: Vage Egiazarian, Andrei Panferov, Denis Kuznedelev, Elias Frantar, Artem Babenko, Dan Alistarh

    Abstract: The emergence of accurate open large language models (LLMs) has led to a race towards performant quantization techniques which can enable their execution on end-user devices. In this paper, we revisit the problem of ``extreme'' LLM compression -- defined as targeting extremely low bit counts, such as 2 to 3 bits per parameter -- from the point of view of classic methods in Multi-Codebook Quantizat… ▽ More

    Submitted 8 June, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: ICML, 2024

  6. arXiv:2312.10835  [pdf, other

    cs.CV

    Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models

    Authors: Nikita Starodubcev, Artem Fedorov, Artem Babenko, Dmitry Baranchuk

    Abstract: Knowledge distillation methods have recently shown to be a promising direction to speedup the synthesis of large-scale diffusion models by requiring only a few inference steps. While several powerful distillation methods were recently proposed, the overall quality of student samples is typically lower compared to the teacher ones, which hinders their practical usage. In this work, we investigate t… ▽ More

    Submitted 5 April, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: CVPR2024 camera ready v2

  7. arXiv:2307.14338  [pdf, other

    cs.LG

    TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023

    Authors: Yury Gorishniy, Ivan Rubachev, Nikolay Kartashev, Daniil Shlenskii, Akim Kotelnikov, Artem Babenko

    Abstract: Deep learning (DL) models for tabular data problems (e.g. classification, regression) are currently receiving increasingly more attention from researchers. However, despite the recent efforts, the non-DL algorithms based on gradient-boosted decision trees (GBDT) remain a strong go-to solution for these problems. One of the research directions aimed at improving the position of tabular DL involves… ▽ More

    Submitted 26 October, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: Code: https://github.com/yandex-research/tabular-dl-tabr

  8. arXiv:2304.04344  [pdf, other

    cs.CV cs.LG

    Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models

    Authors: Nikita Starodubcev, Dmitry Baranchuk, Valentin Khrulkov, Artem Babenko

    Abstract: Recent advances in diffusion models enable many powerful instruments for image editing. One of these instruments is text-driven image manipulations: editing semantic attributes of an image according to the provided text description. % Popular text-conditional diffusion models offer various high-quality image manipulation methods for a broad range of text prompts. Existing diffusion-based methods a… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  9. arXiv:2302.13875  [pdf, other

    cs.LG stat.ML

    Evaluating Robustness and Uncertainty of Graph Models Under Structural Distributional Shifts

    Authors: Gleb Bazhenov, Denis Kuznedelev, Andrey Malinin, Artem Babenko, Liudmila Prokhorenkova

    Abstract: In reliable decision-making systems based on machine learning, models have to be robust to distributional shifts or provide the uncertainty of their predictions. In node-level problems of graph learning, distributional shifts can be especially complex since the samples are interdependent. To evaluate the performance of graph models, it is important to test them on diverse and meaningful distributi… ▽ More

    Submitted 1 November, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  10. arXiv:2302.11640  [pdf, ps, other

    cs.LG

    A critical look at the evaluation of GNNs under heterophily: Are we really making progress?

    Authors: Oleg Platonov, Denis Kuznedelev, Michael Diskin, Artem Babenko, Liudmila Prokhorenkova

    Abstract: Node classification is a classical graph machine learning task on which Graph Neural Networks (GNNs) have recently achieved strong results. However, it is often believed that standard GNNs only work well for homophilous graphs, i.e., graphs where edges tend to connect nodes of the same class. Graphs without this property are called heterophilous, and it is typically assumed that specialized method… ▽ More

    Submitted 2 March, 2024; v1 submitted 22 February, 2023; originally announced February 2023.

  11. arXiv:2302.04841  [pdf, other

    cs.CV cs.LG

    Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics

    Authors: Anton Voronov, Mikhail Khoroshikh, Artem Babenko, Max Ryabinin

    Abstract: Text-to-image generation models represent the next step of evolution in image synthesis, offering a natural way to achieve flexible yet fine-grained control over the result. One emerging area of research is the fast adaptation of large text-to-image models to smaller datasets or new visual concepts. However, many efficient methods of adaptation have a long training time, which limits their practic… ▽ More

    Submitted 1 November, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted to Conference on Neural Information Processing Systems (NeurIPS) 2023. 20 pages, 15 figures. Code: https://github.com/yandex-research/DVAR

  12. arXiv:2209.15421  [pdf, other

    cs.LG

    TabDDPM: Modelling Tabular Data with Diffusion Models

    Authors: Akim Kotelnikov, Dmitry Baranchuk, Ivan Rubachev, Artem Babenko

    Abstract: Denoising diffusion probabilistic models are currently becoming the leading paradigm of generative modeling for many important data modalities. Being the most prevalent in the computer vision community, diffusion models have also recently gained some attention in other domains, including speech, NLP, and graph-like data. In this work, we investigate if the framework of diffusion models can be adva… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: code https://github.com/rotot0/tab-ddpm

  13. arXiv:2209.06177  [pdf, other

    cs.SI cs.DM cs.LG math.PR

    Characterizing Graph Datasets for Node Classification: Homophily-Heterophily Dichotomy and Beyond

    Authors: Oleg Platonov, Denis Kuznedelev, Artem Babenko, Liudmila Prokhorenkova

    Abstract: Homophily is a graph property describing the tendency of edges to connect similar nodes; the opposite is called heterophily. It is often believed that heterophilous graphs are challenging for standard message-passing graph neural networks (GNNs), and much effort has been put into develo** efficient methods for this setting. However, there is no universally agreed-upon measure of homophily in the… ▽ More

    Submitted 15 April, 2024; v1 submitted 13 September, 2022; originally announced September 2022.

  14. arXiv:2207.03208  [pdf, other

    cs.LG

    Revisiting Pretraining Objectives for Tabular Deep Learning

    Authors: Ivan Rubachev, Artem Alekberov, Yury Gorishniy, Artem Babenko

    Abstract: Recent deep learning models for tabular data currently compete with the traditional ML models based on decision trees (GBDT). Unlike GBDT, deep models can additionally benefit from pretraining, which is a workhorse of DL for vision and NLP. For tabular problems, several pretraining methods were proposed, but it is not entirely clear if pretraining provides consistent noticeable improvements and wh… ▽ More

    Submitted 12 July, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Code: https://github.com/puhsu/tabular-dl-pretrain-objectives

  15. arXiv:2205.03763  [pdf, other

    cs.LG cs.DB cs.DS cs.PF

    Results of the NeurIPS'21 Challenge on Billion-Scale Approximate Nearest Neighbor Search

    Authors: Harsha Vardhan Simhadri, George Williams, Martin Aumüller, Matthijs Douze, Artem Babenko, Dmitry Baranchuk, Qi Chen, Lucas Hosseini, Ravishankar Krishnaswamy, Gopal Srinivasa, Suhas Jayaram Subramanya, **gdong Wang

    Abstract: Despite the broad range of algorithms for Approximate Nearest Neighbor Search, most empirical evaluations of algorithms have focused on smaller datasets, typically of 1 million points~\citep{Benchmark}. However, deploying recent advances in embedding based techniques for search, recommendation and ranking at scale require ANNS indices at billion, trillion or larger scale. Barring a few recent pape… ▽ More

    Submitted 7 May, 2022; originally announced May 2022.

  16. arXiv:2203.05556  [pdf, other

    cs.LG

    On Embeddings for Numerical Features in Tabular Deep Learning

    Authors: Yury Gorishniy, Ivan Rubachev, Artem Babenko

    Abstract: Recently, Transformer-like deep architectures have shown strong performance on tabular data problems. Unlike traditional models, e.g., MLP, these architectures map scalar values of numerical features to high-dimensional embeddings before mixing them in the main backbone. In this work, we argue that embeddings for numerical features are an underexplored degree of freedom in tabular DL, which allows… ▽ More

    Submitted 26 October, 2023; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: NeurIPS 2022 camera-ready. Code: https://github.com/yandex-research/tabular-dl-num-embeddings (v3-v4: minor changes)

  17. arXiv:2202.08937  [pdf, other

    cs.LG cs.CV

    When, Why, and Which Pretrained GANs Are Useful?

    Authors: Timofey Grigoryev, Andrey Voynov, Artem Babenko

    Abstract: The literature has proposed several methods to finetune pretrained GANs on new datasets, which typically results in higher performance compared to training from scratch, especially in the limited-data regime. However, despite the apparent empirical benefits of GAN pretraining, its inner mechanisms were not analyzed in-depth, and understanding of its role is not entirely clear. Moreover, the essent… ▽ More

    Submitted 10 March, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

  18. arXiv:2112.03126  [pdf, other

    cs.CV cs.LG

    Label-Efficient Semantic Segmentation with Diffusion Models

    Authors: Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, Artem Babenko

    Abstract: Denoising diffusion probabilistic models have recently received much research attention since they outperform alternative approaches, such as GANs, and currently provide state-of-the-art generative performance. The superior performance of diffusion models has made them an appealing tool in several applications, including inpainting, super-resolution, and semantic editing. In this paper, we demonst… ▽ More

    Submitted 15 March, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: ICLR'2022; v3: camera ready

  19. arXiv:2111.14825  [pdf, other

    cs.CV cs.GR cs.LG

    Latent Transformations via NeuralODEs for GAN-based Image Editing

    Authors: Valentin Khrulkov, Leyla Mirvakhabova, Ivan Oseledets, Artem Babenko

    Abstract: Recent advances in high-fidelity semantic image editing heavily rely on the presumably disentangled latent spaces of the state-of-the-art generative models, such as StyleGAN. Specifically, recent works show that it is possible to achieve decent controllability of attributes in face images via linear shifts along with latent directions. Several recent methods address the discovery of such direction… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Published at ICCV 2021

  20. arXiv:2106.12699  [pdf, other

    cs.LG

    Distilling the Knowledge from Conditional Normalizing Flows

    Authors: Dmitry Baranchuk, Vladimir Aliev, Artem Babenko

    Abstract: Normalizing flows are a powerful class of generative models demonstrating strong performance in several speech and vision problems. In contrast to other generative models, normalizing flows are latent variable models with tractable likelihoods and allow for stable training. However, they have to be carefully designed to represent invertible functions with efficient Jacobian determinant calculation… ▽ More

    Submitted 5 August, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: ICML Workshop: INNF+2021

  21. arXiv:2106.11959  [pdf, other

    cs.LG

    Revisiting Deep Learning Models for Tabular Data

    Authors: Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko

    Abstract: The existing literature on deep learning for tabular data proposes a wide range of novel architectures and reports competitive results on various datasets. However, the proposed models are usually not properly compared to each other and existing works often use different benchmarks and experiment protocols. As a result, it is unclear for both researchers and practitioners what models perform best.… ▽ More

    Submitted 26 October, 2023; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera-ready. Code: https://github.com/yandex-research/tabular-dl-revisiting-models (v3-v5: minor changes)

  22. arXiv:2102.06204  [pdf, other

    cs.LG

    Disentangled Representations from Non-Disentangled Models

    Authors: Valentin Khrulkov, Leyla Mirvakhabova, Ivan Oseledets, Artem Babenko

    Abstract: Constructing disentangled representations is known to be a difficult task, especially in the unsupervised scenario. The dominating paradigm of unsupervised disentanglement is currently to train a generative model that separates different factors of variation in its latent space. This separation is typically enforced by training with specific regularization terms in the model's objective function.… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  23. arXiv:2102.04448  [pdf, other

    cs.LG

    Functional Space Analysis of Local GAN Convergence

    Authors: Valentin Khrulkov, Artem Babenko, Ivan Oseledets

    Abstract: Recent work demonstrated the benefits of studying continuous-time dynamics governing the GAN training. However, this dynamics is analyzed in the model parameter space, which results in finite-dimensional dynamical systems. We propose a novel perspective where we study the local dynamics of adversarial training in the general functional space and show how it can be represented as a system of partia… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

  24. arXiv:2011.13786  [pdf, other

    cs.LG cs.CV

    Navigating the GAN Parameter Space for Semantic Image Editing

    Authors: Anton Cherepkov, Andrey Voynov, Artem Babenko

    Abstract: Generative Adversarial Networks (GANs) are currently an indispensable tool for visual editing, being a standard component of image-to-image translation and image restoration pipelines. Furthermore, GANs are especially useful for controllable generation since their latent spaces contain a wide range of interpretable directions, well suited for semantic editing operations. By gradually changing late… ▽ More

    Submitted 21 April, 2021; v1 submitted 27 November, 2020; originally announced November 2020.

    Comments: Supplementary code: https://github.com/yandex-research/navigan

  25. arXiv:2006.04988  [pdf, other

    cs.LG cs.CV stat.ML

    Object Segmentation Without Labels with Large-Scale Generative Models

    Authors: Andrey Voynov, Stanislav Morozov, Artem Babenko

    Abstract: The recent rise of unsupervised and self-supervised learning has dramatically reduced the dependency on labeled data, providing effective image representations for transfer to downstream vision tasks. Furthermore, recent works employed these representations in a fully unsupervised setup for image classification, reducing the need for human labels on the fine-tuning stage as well. This work demonst… ▽ More

    Submitted 11 June, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

  26. arXiv:2004.00345  [pdf, other

    cs.LG stat.ML

    Editable Neural Networks

    Authors: Anton Sinitsin, Vsevolod Plokhotnyuk, Dmitriy Pyrkin, Sergei Popov, Artem Babenko

    Abstract: These days deep neural networks are ubiquitously used in a wide range of tasks, from image classification and machine translation to face identification and self-driving cars. In many applications, a single model error can lead to devastating financial, reputational and even life-threatening consequences. Therefore, it is crucially important to correct model mistakes quickly as they appear. In thi… ▽ More

    Submitted 22 July, 2020; v1 submitted 1 April, 2020; originally announced April 2020.

  27. arXiv:2002.12632  [pdf

    stat.AP cs.CY

    The Atrial Fibrillation Risk Score for Hyperthyroidism Patients

    Authors: Ilya V. Derevitskii, Daria A. Savitskaya, Alina Y. Babenko, Sergey V. Kovalchuk

    Abstract: Thyrotoxicosis (TT) is associated with an increase in both total and cardiovascu-lar mortality. One of the main thyrotoxicosis risks is Atrial Fibrillation (AF). Right AF predicts help medical personal prescribe the correct medicaments and correct surgical or radioiodine therapy. The main goal of this study is creating a method for practical treatment and diagnostic AF. This study proposes a new m… ▽ More

    Submitted 28 February, 2020; originally announced February 2020.

  28. arXiv:2002.03754  [pdf, other

    cs.LG cs.CV stat.ML

    Unsupervised Discovery of Interpretable Directions in the GAN Latent Space

    Authors: Andrey Voynov, Artem Babenko

    Abstract: The latent spaces of GAN models often have semantically meaningful directions. Moving in these directions corresponds to human-interpretable image transformations, such as zooming or recoloring, enabling a more controllable generation process. However, the discovery of such directions is currently performed in a supervised manner, requiring human labels, pretrained models, or some form of self-sup… ▽ More

    Submitted 24 June, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

  29. arXiv:1912.10920  [pdf, other

    cs.CV eess.IV

    RPGAN: GANs Interpretability via Random Routing

    Authors: Andrey Voynov, Artem Babenko

    Abstract: In this paper, we introduce Random Path Generative Adversarial Network (RPGAN) -- an alternative design of GANs that can serve as a tool for generative model analysis. While the latent space of a typical GAN consists of input vectors, randomly sampled from the standard Gaussian distribution, the latent space of RPGAN consists of random paths in a generator network. As we show, this design allows t… ▽ More

    Submitted 17 February, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

  30. arXiv:1911.12122  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Similarity Graphs Constructed by Deep Reinforcement Learning

    Authors: Dmitry Baranchuk, Artem Babenko

    Abstract: Similarity graphs are an active research direction for the nearest neighbor search (NNS) problem. New algorithms for similarity graph construction are continuously being proposed and analyzed by both theoreticians and practitioners. However, existing construction algorithms are mostly based on heuristics and do not explicitly maximize the target performance measure, i.e., search recall. Therefore,… ▽ More

    Submitted 13 February, 2020; v1 submitted 27 November, 2019; originally announced November 2019.

  31. arXiv:1910.03524  [pdf, other

    cs.LG stat.ML

    Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs

    Authors: Denis Mazur, Vage Egiazarian, Stanislav Morozov, Artem Babenko

    Abstract: Learning useful representations is a key ingredient to the success of modern machine learning. Currently, representation learning mostly relies on embedding data into Euclidean space. However, recent work has shown that data in some domains is better modeled by non-euclidean metric spaces, and inappropriate geometry can result in inferior performance. In this paper, we aim to eliminate the inducti… ▽ More

    Submitted 16 October, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

  32. arXiv:1909.06312  [pdf, other

    cs.LG stat.ML

    Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data

    Authors: Sergei Popov, Stanislav Morozov, Artem Babenko

    Abstract: Nowadays, deep neural networks (DNNs) have become the main instrument for machine learning tasks within a wide range of domains, including vision, NLP, and speech. Meanwhile, in an important case of heterogenous tabular data, the advantage of DNNs over shallow counterparts remains questionable. In particular, there is no sufficient evidence that deep learning machinery allows constructing methods… ▽ More

    Submitted 19 September, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

  33. arXiv:1908.06887  [pdf, other

    cs.IR

    Relevance Proximity Graphs for Fast Relevance Retrieval

    Authors: Stanislav Morozov, Artem Babenko

    Abstract: In plenty of machine learning applications, the most relevant items for a particular query should be efficiently extracted, while the relevance function is based on a highly-nonlinear model, e.g., DNNs or GBDTs. Due to the high computational complexity of such models, exhaustive search is infeasible even for medium-scale problems. To address this issue, we introduce Relevance Proximity Graphs (RPG… ▽ More

    Submitted 18 October, 2019; v1 submitted 19 August, 2019; originally announced August 2019.

  34. arXiv:1908.03883  [pdf, other

    cs.LG cs.CV stat.ML

    Unsupervised Neural Quantization for Compressed-Domain Similarity Search

    Authors: Stanislav Morozov, Artem Babenko

    Abstract: We tackle the problem of unsupervised visual descriptors compression, which is a key ingredient of large-scale image retrieval systems. While the deep learning machinery has benefited literally all computer vision pipelines, the existing state-of-the-art compression methods employ shallow architectures, and we aim to close this gap by our paper. In more detail, we introduce a DNN architecture for… ▽ More

    Submitted 11 August, 2019; originally announced August 2019.

  35. arXiv:1905.10987  [pdf, other

    cs.LG stat.ML

    Learning to Route in Similarity Graphs

    Authors: Dmitry Baranchuk, Dmitry Persiyanov, Anton Sinitsin, Artem Babenko

    Abstract: Recently similarity graphs became the leading paradigm for efficient nearest neighbor search, outperforming traditional tree-based and LSH-based methods. Similarity graphs perform the search via greedy routing: a query traverses the graph and in each vertex moves to the adjacent vertex that is the closest to this query. In practice, similarity graphs are often susceptible to local minima, when que… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: Published in ICML 2019

  36. arXiv:1806.05217  [pdf, other

    cs.CV

    Impostor Networks for Fast Fine-Grained Recognition

    Authors: Vadim Lebedev, Artem Babenko, Victor Lempitsky

    Abstract: In this work we introduce impostor networks, an architecture that allows to perform fine-grained recognition with high accuracy and using a light-weight convolutional network, making it particularly suitable for fine-grained applications on low-power and non-GPU enabled platforms. Impostor networks compensate for the lightness of its `backend' network by combining it with a lightweight non-paramet… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

  37. arXiv:1802.02422  [pdf, other

    cs.CV

    Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors

    Authors: Dmitry Baranchuk, Artem Babenko, Yury Malkov

    Abstract: This work addresses the problem of billion-scale nearest neighbor search. The state-of-the-art retrieval systems for billion-scale databases are currently based on the inverted multi-index, the recently proposed generalization of the inverted index structure. The multi-index provides a very fine-grained partition of the feature space that allows extracting concise and accurate short-lists of candi… ▽ More

    Submitted 23 July, 2018; v1 submitted 7 February, 2018; originally announced February 2018.

    Comments: Paper accepted to ECCV 18

  38. arXiv:1606.01550  [pdf, other

    cs.CV cs.IR

    Pairwise Quantization

    Authors: Artem Babenko, Relja Arandjelović, Victor Lempitsky

    Abstract: We consider the task of lossy compression of high-dimensional vectors through quantization. We propose the approach that learns quantization parameters by minimizing the distortion of scalar products and squared distances between pairs of points. This is in contrast to previous works that obtain these parameters through the minimization of the reconstruction error of individual points. The propose… ▽ More

    Submitted 5 June, 2016; originally announced June 2016.

  39. arXiv:1510.07493  [pdf, other

    cs.CV

    Aggregating Deep Convolutional Features for Image Retrieval

    Authors: Artem Babenko, Victor Lempitsky

    Abstract: Several recent works have shown that image descriptors produced by deep convolutional neural networks provide state-of-the-art performance for image classification and retrieval problems. It has also been shown that the activations from the convolutional layers can be interpreted as local features describing particular image regions. These local features can be aggregated using aggregation approac… ▽ More

    Submitted 26 October, 2015; originally announced October 2015.

    Comments: accepted for ICCV 2015

  40. arXiv:1404.1831  [pdf, other

    cs.CV

    Improving Bilayer Product Quantization for Billion-Scale Approximate Nearest Neighbors in High Dimensions

    Authors: Artem Babenko, Victor Lempitsky

    Abstract: The top-performing systems for billion-scale high-dimensional approximate nearest neighbor (ANN) search are all based on two-layer architectures that include an indexing structure and a compressed datapoints layer. An indexing structure is crucial as it allows to avoid exhaustive search, while the lossy data compression is needed to fit the dataset into RAM. Several of the most successful systems… ▽ More

    Submitted 7 April, 2014; originally announced April 2014.

    ACM Class: H.3.3

  41. arXiv:1404.1777  [pdf, other

    cs.CV

    Neural Codes for Image Retrieval

    Authors: Artem Babenko, Anton Slesarev, Alexandr Chigorin, Victor Lempitsky

    Abstract: It has been shown that the activations invoked by an image within the top layers of a large convolutional neural network provide a high-level descriptor of the visual content of the image. In this paper, we investigate the use of such descriptors (neural codes) within the image retrieval application. In the experiments with several standard retrieval benchmarks, we establish that neural codes perf… ▽ More

    Submitted 7 July, 2014; v1 submitted 7 April, 2014; originally announced April 2014.

    Comments: to appear at ECCV 2014

  42. arXiv:1212.0224  [pdf, ps, other

    math.CO cs.DM

    On Weighted Multicommodity Flows in Directed Networks

    Authors: Maxim A. Babenko, Alexander V. Karzanov

    Abstract: Let $G = (VG, AG)$ be a directed graph with a set $S \subseteq VG$ of terminals and nonnegative integer arc capacities $c$. A feasible multiflow is a nonnegative real function $F(P)$ of "flows" on paths $P$ connecting distinct terminals such that the sum of flows through each arc $a$ does not exceed $c(a)$. Given $μ\colon S \times S \to \R_+$, the \emph{$μ$-value} of $F$ is… ▽ More

    Submitted 2 December, 2012; originally announced December 2012.

    Comments: 12 pages

    MSC Class: 05C21; 05C85; 90C27

  43. arXiv:1004.2115  [pdf, ps, other

    math.CO cs.DS

    A Faster Algorithm for the Maximum Even Factor Problem

    Authors: Maxim A. Babenko

    Abstract: Given a digraph $G = (VG,AG)$, an \emph{even factor} $M \subseteq AG$ is a subset of arcs that decomposes into a collection of node-disjoint paths and even cycles. Even factors in digraphs were introduced by Geleen and Cunningham and generalize path matchings in undirected graphs. Finding an even factor of maximum cardinality in a general digraph is known to be NP-hard but for the class of \emph{o… ▽ More

    Submitted 14 April, 2010; v1 submitted 13 April, 2010; originally announced April 2010.