Showing 1–5 of 5 results for author: Aliev, V

  1. arXiv:2106.12699  [pdf, other]

    cs.LG

    Distilling the Knowledge from Conditional Normalizing Flows

    Authors: Dmitry Baranchuk, Vladimir Aliev, Artem Babenko

    Abstract: Normalizing flows are a powerful class of generative models demonstrating strong performance in several speech and vision problems. In contrast to other generative models, normalizing flows are latent variable models with tractable likelihoods and allow for stable training. However, they have to be carefully designed to represent invertible functions with efficient Jacobian determinant calculation…

    Submitted 5 August, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: ICML Workshop: INNF+2021

  2. arXiv:1908.02511  [pdf, other]

    cs.LG cs.AI cs.CV

    Free-Lunch Saliency via Attention in Atari Agents

    Authors: Dmitry Nikulin, Anastasia Ianina, Vladimir Aliev, Sergey Nikolenko

    Abstract: We propose a new approach to visualize saliency maps for deep neural network models and apply it to deep reinforcement learning agents trained on Atari environments. Our method adds an attention module that we call FLS (Free Lunch Saliency) to the feature extractor from an established baseline (Mnih et al., 2015). This addition results in a trainable model that can produce saliency maps, i.e., vis…

    Submitted 30 October, 2019; v1 submitted 7 August, 2019; originally announced August 2019.

    Comments: 2019 ICCV Workshop on Interpreting and Explaining Visual Artificial Intelligence Models. 15 pages, 14 figures, 5 tables

  3. arXiv:1811.11067  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning State Representations in Complex Systems with Multimodal Data

    Authors: Pavel Solovev, Vladimir Aliev, Pavel Ostyakov, Gleb Sterkin, Elizaveta Logacheva, Stepan Troeshestov, Roman Suvorov, Anton Mashikhin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Representation learning becomes especially important for complex systems with multimodal data sources such as cameras or sensors. Recent advances in reinforcement learning and optimal control make it possible to design control algorithms on these latent representations, but the field still lacks a large-scale standard dataset for unified comparison. In this work, we present a large-scale dataset a…

    Submitted 15 January, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: Fixed references

  4. arXiv:1810.02364  [pdf, other]

    cs.SD cs.HC eess.AS

    Deep Learning Approaches for Understanding Simple Speech Commands

    Authors: Roman A. Solovyev, Maxim Vakhrushev, Alexander Radionov, Vladimir Aliev, Alexey A. Shvets

    Abstract: Automatic classification of sound commands is becoming increasingly important, especially for mobile and embedded devices. Many of these devices contain both cameras and microphones, and companies that develop them would like to use the same technology for both of these classification tasks. One way of achieving this is to represent sound commands as images, and use convolutional neural networks w…

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: 12 pages, 4 figures, 1 table

  5. arXiv:1809.04403  [pdf, other]

    cs.CV cs.LG

    Label Denoising with Large Ensembles of Heterogeneous Neural Networks

    Authors: Pavel Ostyakov, Elizaveta Logacheva, Roman Suvorov, Vladimir Aliev, Gleb Sterkin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Despite recent advances in computer vision based on various convolutional architectures, video understanding remains an important challenge. In this work, we present and discuss a top solution for the large-scale video classification (labeling) problem introduced as a Kaggle competition based on the YouTube-8M dataset. We show and compare different approaches to preprocessing, data augmentation, m…

    Submitted 15 January, 2019; v1 submitted 12 September, 2018; originally announced September 2018.