Skip to main content

Showing 1–19 of 19 results for author: Khrulkov, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.05666  [pdf, other

    cs.CV

    YaART: Yet Another ART Rendering Technology

    Authors: Sergey Kastryulin, Artem Konev, Alexander Shishenya, Eugene Lyapustin, Artem Khurshudov, Alexander Tselousov, Nikita Vinokurov, Denis Kuznedelev, Alexander Markovich, Grigoriy Livshits, Alexey Kirillov, Anastasiia Tabisheva, Liubov Chubarova, Marina Kaminskaia, Alexander Ustyuzhanin, Artemii Shvetsov, Daniil Shlenskii, Valerii Startsev, Dmitrii Kornilov, Mikhail Romanov, Artem Babenko, Sergei Ovcharenko, Valentin Khrulkov

    Abstract: In the rapidly progressing field of generative models, the development of efficient and high-fidelity text-to-image diffusion systems represents a significant frontier. This study introduces YaART, a novel production-grade text-to-image cascaded diffusion model aligned to human preferences using Reinforcement Learning from Human Feedback (RLHF). During the development of YaART, we especially focus… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Prompts and additional information are available on the project page, see https://ya.ru/ai/art/paper-yaart-v1

  2. arXiv:2304.04344  [pdf, other

    cs.CV cs.LG

    Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models

    Authors: Nikita Starodubcev, Dmitry Baranchuk, Valentin Khrulkov, Artem Babenko

    Abstract: Recent advances in diffusion models enable many powerful instruments for image editing. One of these instruments is text-driven image manipulations: editing semantic attributes of an image according to the provided text description. % Popular text-conditional diffusion models offer various high-quality image manipulation methods for a broad range of text prompts. Existing diffusion-based methods a… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  3. arXiv:2203.10833  [pdf, other

    cs.CV cs.LG

    Hyperbolic Vision Transformers: Combining Improvements in Metric Learning

    Authors: Aleksandr Ermolov, Leyla Mirvakhabova, Valentin Khrulkov, Nicu Sebe, Ivan Oseledets

    Abstract: Metric learning aims to learn a highly discriminative model encouraging the embeddings of similar classes to be close in the chosen metrics and pushed apart for dissimilar ones. The common recipe is to use an encoder to extract embeddings and a distance-based loss function to match the representations -- usually, the Euclidean distance is utilized. An emerging interest in learning hyperbolic data… ▽ More

    Submitted 22 March, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  4. arXiv:2202.07477  [pdf, other

    stat.ML cs.AI cs.LG math.AP math.NA

    Understanding DDPM Latent Codes Through Optimal Transport

    Authors: Valentin Khrulkov, Gleb Ryzhakov, Andrei Chertkov, Ivan Oseledets

    Abstract: Diffusion models have recently outperformed alternative approaches to model the distribution of natural images, such as GANs. Such diffusion models allow for deterministic sampling via the probability flow ODE, giving rise to a latent space and an encoder map. While having important practical applications, such as estimation of the likelihood, the theoretical properties of this map are not yet ful… ▽ More

    Submitted 5 December, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

  5. arXiv:2112.03126  [pdf, other

    cs.CV cs.LG

    Label-Efficient Semantic Segmentation with Diffusion Models

    Authors: Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, Artem Babenko

    Abstract: Denoising diffusion probabilistic models have recently received much research attention since they outperform alternative approaches, such as GANs, and currently provide state-of-the-art generative performance. The superior performance of diffusion models has made them an appealing tool in several applications, including inpainting, super-resolution, and semantic editing. In this paper, we demonst… ▽ More

    Submitted 15 March, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: ICLR'2022; v3: camera ready

  6. arXiv:2111.14825  [pdf, other

    cs.CV cs.GR cs.LG

    Latent Transformations via NeuralODEs for GAN-based Image Editing

    Authors: Valentin Khrulkov, Leyla Mirvakhabova, Ivan Oseledets, Artem Babenko

    Abstract: Recent advances in high-fidelity semantic image editing heavily rely on the presumably disentangled latent spaces of the state-of-the-art generative models, such as StyleGAN. Specifically, recent works show that it is possible to achieve decent controllability of attributes in face images via linear shifts along with latent directions. Several recent methods address the discovery of such direction… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Published at ICCV 2021

  7. arXiv:2106.11959  [pdf, other

    cs.LG

    Revisiting Deep Learning Models for Tabular Data

    Authors: Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko

    Abstract: The existing literature on deep learning for tabular data proposes a wide range of novel architectures and reports competitive results on various datasets. However, the proposed models are usually not properly compared to each other and existing works often use different benchmarks and experiment protocols. As a result, it is unclear for both researchers and practitioners what models perform best.… ▽ More

    Submitted 26 October, 2023; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera-ready. Code: https://github.com/yandex-research/tabular-dl-revisiting-models (v3-v5: minor changes)

  8. arXiv:2102.06204  [pdf, other

    cs.LG

    Disentangled Representations from Non-Disentangled Models

    Authors: Valentin Khrulkov, Leyla Mirvakhabova, Ivan Oseledets, Artem Babenko

    Abstract: Constructing disentangled representations is known to be a difficult task, especially in the unsupervised scenario. The dominating paradigm of unsupervised disentanglement is currently to train a generative model that separates different factors of variation in its latent space. This separation is typically enforced by training with specific regularization terms in the model's objective function.… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  9. arXiv:2102.04448  [pdf, other

    cs.LG

    Functional Space Analysis of Local GAN Convergence

    Authors: Valentin Khrulkov, Artem Babenko, Ivan Oseledets

    Abstract: Recent work demonstrated the benefits of studying continuous-time dynamics governing the GAN training. However, this dynamics is analyzed in the model parameter space, which results in finite-dimensional dynamical systems. We propose a novel perspective where we study the local dynamics of adversarial training in the general functional space and show how it can be represented as a system of partia… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

  10. arXiv:2008.06716  [pdf, other

    cs.IR cs.LG stat.ML

    Performance of Hyperbolic Geometry Models on Top-N Recommendation Tasks

    Authors: Leyla Mirvakhabova, Evgeny Frolov, Valentin Khrulkov, Ivan Oseledets, Alexander Tuzhilin

    Abstract: We introduce a simple autoencoder based on hyperbolic geometry for solving standard collaborative filtering problem. In contrast to many modern deep learning techniques, we build our solution using only a single hidden layer. Remarkably, even with such a minimalistic approach, we not only outperform the Euclidean counterpart but also achieve a competitive performance with respect to the current st… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: Accepted at ACM RecSys 2020; 7 pages

    ACM Class: H.3.3

  11. arXiv:2003.14210  [pdf, other

    cs.LG cs.AI stat.ML

    Sample Efficient Ensemble Learning with Catalyst.RL

    Authors: Sergey Kolesnikov, Valentin Khrulkov

    Abstract: We present Catalyst.RL, an open-source PyTorch framework for reproducible and sample efficient reinforcement learning (RL) research. Main features of Catalyst.RL include large-scale asynchronous distributed training, efficient implementations of various RL algorithms and auxiliary tricks, such as n-step returns, value distributions, hyperbolic reinforcement learning, etc. To demonstrate the effect… ▽ More

    Submitted 7 April, 2020; v1 submitted 29 March, 2020; originally announced March 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1903.00027

  12. arXiv:1905.11520  [pdf, other

    cs.LG cs.AI stat.ML

    Universality Theorems for Generative Models

    Authors: Valentin Khrulkov, Ivan Oseledets

    Abstract: Despite the fact that generative models are extremely successful in practice, the theory underlying this phenomenon is only starting to catch up with practice. In this work we address the question of the universality of generative models: is it true that neural networks can approximate any data manifold arbitrarily well? We provide a positive answer to this question and show that under mild assump… ▽ More

    Submitted 13 December, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

  13. arXiv:1904.02239  [pdf, other

    cs.CV cs.LG

    Hyperbolic Image Embeddings

    Authors: Valentin Khrulkov, Leyla Mirvakhabova, Evgeniya Ustinova, Ivan Oseledets, Victor Lempitsky

    Abstract: Computer vision tasks such as image classification, image retrieval and few-shot learning are currently dominated by Euclidean and spherical embeddings, so that the final decisions about class belongings or the degree of similarity are made using linear hyperplanes, Euclidean distances, or spherical geodesic distances (cosine similarity). In this work, we demonstrate that in many practical scenari… ▽ More

    Submitted 30 March, 2020; v1 submitted 3 April, 2019; originally announced April 2019.

  14. arXiv:1901.10801  [pdf, other

    cs.LG stat.ML

    Generalized Tensor Models for Recurrent Neural Networks

    Authors: Valentin Khrulkov, Oleksii Hrinchuk, Ivan Oseledets

    Abstract: Recurrent Neural Networks (RNNs) are very successful at solving challenging problems with sequential data. However, this observed efficiency is not yet entirely explained by theory. It is known that a certain class of multiplicative RNNs enjoys the property of depth efficiency --- a shallow network of exponentially large width is necessary to realize the same score function as computed by such an… ▽ More

    Submitted 30 January, 2019; originally announced January 2019.

    Comments: Accepted as a conference paper at ICLR 2019

  15. arXiv:1901.10787  [pdf, other

    cs.CL cs.LG

    Tensorized Embedding Layers for Efficient Model Compression

    Authors: Oleksii Hrinchuk, Valentin Khrulkov, Leyla Mirvakhabova, Elena Orlova, Ivan Oseledets

    Abstract: The embedding layers transforming input words into real vectors are the key components of deep neural networks used in natural language processing. However, when the vocabulary is large, the corresponding weight matrices can be enormous, which precludes their deployment in a limited resource setting. We introduce a novel way of parametrizing embedding layers based on the Tensor Train (TT) decompos… ▽ More

    Submitted 19 February, 2020; v1 submitted 30 January, 2019; originally announced January 2019.

  16. arXiv:1802.02664  [pdf, other

    cs.LG cs.CG stat.ML

    Geometry Score: A Method For Comparing Generative Adversarial Networks

    Authors: Valentin Khrulkov, Ivan Oseledets

    Abstract: One of the biggest challenges in the research of generative adversarial networks (GANs) is assessing the quality of generated samples and detecting various levels of mode collapse. In this work, we construct a novel measure of performance of a GAN by comparing geometrical properties of the underlying data manifold and the generated one, which provides both qualitative and quantitative means for ev… ▽ More

    Submitted 9 June, 2018; v1 submitted 7 February, 2018; originally announced February 2018.

    Comments: ICML 2018

  17. arXiv:1801.01928  [pdf, ps, other

    cs.MS math.NA

    Tensor Train decomposition on TensorFlow (T3F)

    Authors: Alexander Novikov, Pavel Izmailov, Valentin Khrulkov, Michael Figurnov, Ivan Oseledets

    Abstract: Tensor Train decomposition is used across many branches of machine learning. We present T3F -- a library for Tensor Train decomposition based on TensorFlow. T3F supports GPU execution, batch processing, automatic differentiation, and versatile functionality for the Riemannian optimization framework, which takes into account the underlying manifold structure to construct efficient optimization meth… ▽ More

    Submitted 2 March, 2020; v1 submitted 5 January, 2018; originally announced January 2018.

  18. arXiv:1711.00811  [pdf, other

    cs.LG

    Expressive power of recurrent neural networks

    Authors: Valentin Khrulkov, Alexander Novikov, Ivan Oseledets

    Abstract: Deep neural networks are surprisingly efficient at solving practical tasks, but the theory behind this phenomenon is only starting to catch up with the practice. Numerous works show that depth is the key to this efficiency. A certain class of deep convolutional networks -- namely those that correspond to the Hierarchical Tucker (HT) tensor decomposition -- has been proven to have exponentially hig… ▽ More

    Submitted 7 February, 2018; v1 submitted 2 November, 2017; originally announced November 2017.

    Comments: Accepted as a conference paper at ICLR 2018

  19. arXiv:1709.03582  [pdf, other

    cs.CV cs.AI cs.LG

    Art of singular vectors and universal adversarial perturbations

    Authors: Valentin Khrulkov, Ivan Oseledets

    Abstract: Vulnerability of Deep Neural Networks (DNNs) to adversarial attacks has been attracting a lot of attention in recent studies. It has been shown that for many state of the art DNNs performing image classification there exist universal adversarial perturbations --- image-agnostic perturbations mere addition of which to natural images with high probability leads to their misclassification. In this wo… ▽ More

    Submitted 19 November, 2017; v1 submitted 11 September, 2017; originally announced September 2017.

    Comments: Submitted to CVPR 2018