Skip to main content

Showing 1–12 of 12 results for author: Atanov, A

.
  1. arXiv:2406.11769  [pdf, other

    cs.CV

    Solving Vision Tasks with Simple Photoreceptors Instead of Cameras

    Authors: Andrei Atanov, Jiawei Fu, Rishubh Singh, Isabella Yu, Andrew Spielberg, Amir Zamir

    Abstract: A de facto standard in solving computer vision problems is to use a common high-resolution camera and choose its placement on an agent (i.e., position and orientation) based on human intuition. On the other hand, extremely simple and well-designed visual sensors found throughout nature allow many organisms to perform diverse, complex behaviors. In this work, motivated by these examples, we raise t… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2403.15309  [pdf, other

    cs.CV cs.CL cs.LG

    Controlled Training Data Generation with Diffusion Models

    Authors: Teresa Yeo, Andrei Atanov, Harold Benoit, Aleksandr Alekseev, Ruchira Ray, Pooya Esmaeil Akhoondi, Amir Zamir

    Abstract: In this work, we present a method to control a text-to-image generative model to produce training data specifically "useful" for supervised learning. Unlike previous works that employ an open-loop approach and pre-define prompts to generate new data using either a language model or human expertise, we develop an automated closed-loop system which involves two feedback mechanisms. The first mechani… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Project page at https://adversarial-prompts.epfl.ch/

  3. arXiv:2312.16313  [pdf, other

    cs.LG

    Unraveling the Key Components of OOD Generalization via Diversification

    Authors: Harold Benoit, Liangze Jiang, Andrei Atanov, Oğuzhan Fatih Kar, Mattia Rigotti, Amir Zamir

    Abstract: Supervised learning datasets may contain multiple cues that explain the training set equally well, i.e., learning any of them would lead to the correct predictions on the training data. However, many of them can be spurious, i.e., lose their predictive power under a distribution shift and consequently fail to generalize to out-of-distribution (OOD) data. Recently developed "diversification" method… ▽ More

    Submitted 20 April, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: ICLR 2024

  4. arXiv:2212.00261  [pdf, other

    cs.LG

    Task Discovery: Finding the Tasks that Neural Networks Generalize on

    Authors: Andrei Atanov, Andrei Filatov, Teresa Yeo, Ajay Sohmshetty, Amir Zamir

    Abstract: When develo** deep learning models, we usually decide what task we want to solve then search for a model that generalizes well on the task. An intriguing question would be: what if, instead of fixing the task and searching in the model space, we fix the model and search in the task space? Can we find tasks that the model generalizes on? How do they look, or do they indicate anything? These are t… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: NeurIPS 2022, Project page at https://taskdiscovery.epfl.ch

  5. arXiv:2204.01678  [pdf, other

    cs.CV cs.LG

    MultiMAE: Multi-modal Multi-task Masked Autoencoders

    Authors: Roman Bachmann, David Mizrahi, Andrei Atanov, Amir Zamir

    Abstract: We propose a pre-training strategy called Multi-modal Multi-task Masked Autoencoders (MultiMAE). It differs from standard Masked Autoencoding in two key aspects: I) it can optionally accept additional modalities of information in the input besides the RGB image (hence "multi-modal"), and II) its training objective accordingly includes predicting multiple outputs besides the RGB image (hence "multi… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Project page at https://multimae.epfl.ch

  6. arXiv:2203.01441  [pdf, other

    cs.CV cs.LG

    3D Common Corruptions and Data Augmentation

    Authors: Oğuzhan Fatih Kar, Teresa Yeo, Andrei Atanov, Amir Zamir

    Abstract: We introduce a set of image transformations that can be used as corruptions to evaluate the robustness of models as well as data augmentation mechanisms for training neural networks. The primary distinction of the proposed transformations is that, unlike existing approaches such as Common Corruptions, the geometry of the scene is incorporated in the transformations -- thus leading to corruptions t… ▽ More

    Submitted 29 April, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 (Oral). Project website at https://3dcommoncorruptions.epfl.ch/

  7. arXiv:2202.03365  [pdf, other

    cs.CV cs.LG

    Simple Control Baselines for Evaluating Transfer Learning

    Authors: Andrei Atanov, Shijian Xu, Onur Beker, Andrei Filatov, Amir Zamir

    Abstract: Transfer learning has witnessed remarkable progress in recent years, for example, with the introduction of augmentation-based contrastive self-supervised learning methods. While a number of large-scale empirical studies on the transfer performance of such models have been conducted, there is not yet an agreed-upon set of control baselines, evaluation practices, and metrics to report, which often h… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: Project website: https://transfer-controls.epfl.ch

  8. arXiv:2106.08038  [pdf, other

    cs.LG cs.CV

    Mean Embeddings with Test-Time Data Augmentation for Ensembling of Representations

    Authors: Arsenii Ashukha, Andrei Atanov, Dmitry Vetrov

    Abstract: Averaging predictions over a set of models -- an ensemble -- is widely used to improve predictive performance and uncertainty estimation of deep learning models. At the same time, many machine learning systems, such as search, matching, and recommendation systems, heavily rely on embeddings. Unfortunately, due to misalignment of features of independently trained models, embeddings, cannot be impro… ▽ More

    Submitted 14 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  9. arXiv:1905.00505  [pdf, other

    stat.ML cs.LG

    Semi-Conditional Normalizing Flows for Semi-Supervised Learning

    Authors: Andrei Atanov, Alexandra Volokhova, Arsenii Ashukha, Ivan Sosnovik, Dmitry Vetrov

    Abstract: This paper proposes a semi-conditional normalizing flow model for semi-supervised learning. The model uses both labelled and unlabeled data to learn an explicit model of joint distribution over objects and labels. Semi-conditional architecture of the model allows us to efficiently compute a value and gradients of the marginal likelihood for unlabeled objects. The conditional part of the model is b… ▽ More

    Submitted 22 June, 2020; v1 submitted 1 May, 2019; originally announced May 2019.

  10. arXiv:1810.06943  [pdf, other

    stat.ML cs.LG

    The Deep Weight Prior

    Authors: Andrei Atanov, Arsenii Ashukha, Kirill Struminsky, Dmitry Vetrov, Max Welling

    Abstract: Bayesian inference is known to provide a general framework for incorporating prior knowledge or specific properties into machine learning models via carefully choosing a prior distribution. In this work, we propose a new type of prior distributions for convolutional neural networks, deep weight prior (DWP), that exploit generative models to encourage a specific structure of trained convolutional f… ▽ More

    Submitted 18 February, 2019; v1 submitted 16 October, 2018; originally announced October 2018.

    Comments: TL;DR: The deep weight prior learns a generative model for kernels of convolutional neural networks, that acts as a prior distribution while training on new datasets

  11. arXiv:1802.04893  [pdf, other

    stat.ML cs.LG

    Uncertainty Estimation via Stochastic Batch Normalization

    Authors: Andrei Atanov, Arsenii Ashukha, Dmitry Molchanov, Kirill Neklyudov, Dmitry Vetrov

    Abstract: In this work, we investigate Batch Normalization technique and propose its probabilistic interpretation. We propose a probabilistic model and show that Batch Normalization maximazes the lower bound of its marginalized log-likelihood. Then, according to the new probabilistic model, we design an algorithm which acts consistently during train and test. However, inference becomes computationally ineff… ▽ More

    Submitted 20 March, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: Under review as a workshop paper at ICLR 2018

    Journal ref: Workshop track - ICLR 2018

  12. arXiv:1401.2252  [pdf, ps, other

    math.CV

    Affine homogeneous strictly pseudoconvex hypersurfaces of the type (1/2,0) in $\mathbb{C}^3$

    Authors: A. V. Atanov, A. V. Loboda, A. V. Shipovskaya

    Abstract: In this paper we denote a type of affine homogeneous real hypersurface of $\mathbb{C}^3$ and present a classification of homogeneous surfaces of the type (1/2,0). The result was obtained by reducing the classification problem mentioned above to the problem of solving a system of nonlinear (quadratic) equations. Solutions of such system represent all the Lie algebras corresponding to the homogeneou… ▽ More

    Submitted 10 January, 2014; originally announced January 2014.

    Comments: 33 pages

    MSC Class: 32V40 (Primary); 58D19; 14R05 (Secondary)