Showing 1–19 of 19 results for author: Sanakoyeu, A

  1. arXiv:2405.05224

    cs.CV

    Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation

    Authors: Jonas Kohler, Albert Pumarola, Edgar Schönfeld, Artsiom Sanakoyeu, Roshan Sumbaly, Peter Vajda, Ali Thabet

    Abstract: Diffusion models are a powerful generative framework, but come with expensive inference. Existing acceleration methods often compromise image quality or fail under complex conditioning when operating in an extremely low-step regime. In this work, we propose a novel distillation framework tailored to enable high-fidelity, diverse sample generation using just one to three steps. Our approach compris…

    Submitted 8 May, 2024; originally announced May 2024.

  2. arXiv:2402.02736

    cs.CV cs.LG

    Using Motion Cues to Supervise Single-Frame Body Pose and Shape Estimation in Low Data Regimes

    Authors: Andrey Davydov, Alexey Sidnev, Artsiom Sanakoyeu, Yuhua Chen, Mathieu Salzmann, Pascal Fua

    Abstract: When enough annotated training data is available, supervised deep-learning algorithms excel at estimating human body pose and shape using a single camera. The effects of too little such data being available can be mitigated by using other information sources, such as databases of body shapes, to learn priors. Unfortunately, such sources are not always available either. We show that, in such cases,…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 21 pages; TMLR

  3. arXiv:2312.03209

    cs.CV

    Cache Me if You Can: Accelerating Diffusion Models through Block Caching

    Authors: Felix Wimbauer, Bichen Wu, Edgar Schoenfeld, Xiaoliang Dai, Ji Hou, Zijian He, Artsiom Sanakoyeu, Peizhao Zhang, Sam Tsai, Jonas Kohler, Christian Rupprecht, Daniel Cremers, Peter Vajda, Jialiang Wang

    Abstract: Diffusion models have recently revolutionized the field of image synthesis due to their ability to generate photorealistic images. However, one of the major drawbacks of diffusion models is that the image generation process is costly. A large image-to-image network has to be applied many times to iteratively refine an image from random noise. While many recent works propose techniques to reduce th…

    Submitted 12 January, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Project page: https://fwmb.github.io/blockcaching/

  4. arXiv:2304.11118

    cs.CV cs.AI

    BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis

    Authors: Angela Castillo, Maria Escobar, Guillaume Jeanneret, Albert Pumarola, Pablo Arbeláez, Ali Thabet, Artsiom Sanakoyeu

    Abstract: Mixed reality applications require tracking the user's full-body motion to enable an immersive experience. However, typical head-mounted devices can only track head and hand movements, leading to a limited reconstruction of full-body motion due to variability in lower body configurations. We propose BoDiffusion -- a generative diffusion model for motion synthesis to tackle this under-constrained r…

    Submitted 21 April, 2023; originally announced April 2023.

  5. arXiv:2304.08577

    cs.CV

    Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model

    Authors: Yuming Du, Robin Kips, Albert Pumarola, Sebastian Starke, Ali Thabet, Artsiom Sanakoyeu

    Abstract: With the recent surge in popularity of AR/VR applications, realistic and accurate control of 3D full-body avatars has become a highly demanded feature. A particular challenge is that only a sparse tracking signal is available from standalone HMDs (Head Mounted Devices), often limited to tracking the user's head and wrists. While this signal is resourceful for reconstructing the upper body motion,…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: CVPR 2023, project page: https://dulucas.github.io/agrol/

  6. arXiv:2303.14569

    cs.CV

    VisCo Grids: Surface Reconstruction with Viscosity and Coarea Grids

    Authors: Albert Pumarola, Artsiom Sanakoyeu, Lior Yariv, Ali Thabet, Yaron Lipman

    Abstract: Surface reconstruction has been seeing a lot of progress lately by utilizing Implicit Neural Representations (INRs). Despite their success, INRs often introduce hard to control inductive bias (i.e., the solution surface can exhibit unexplainable behaviours), have costly inference, and are slow to train. The goal of this work is to show that replacing neural networks with simple grid functions, alo…

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: Published in NeurIPS 2022

  7. arXiv:2303.08717

    cs.CV cs.GR

    Re-ReND: Real-time Rendering of NeRFs across Devices

    Authors: Sara Rojas, Jesus Zarzar, Juan Camilo Perez, Artsiom Sanakoyeu, Ali Thabet, Albert Pumarola, Bernard Ghanem

    Abstract: This paper proposes a novel approach for rendering a pre-trained Neural Radiance Field (NeRF) in real-time on resource-constrained devices. We introduce Re-ReND, a method enabling Real-time Rendering of NeRFs across Devices. Re-ReND is designed to achieve real-time performance by converting the NeRF into a representation that can be efficiently processed by standard graphics pipelines. The propose…

    Submitted 15 March, 2023; originally announced March 2023.

  8. arXiv:2206.02163

    cs.CV

    MotionCNN: A Strong Baseline for Motion Prediction in Autonomous Driving

    Authors: Stepan Konev, Kirill Brodt, Artsiom Sanakoyeu

    Abstract: To plan a safe and efficient route, an autonomous vehicle should anticipate future motions of other agents around it. Motion prediction is an extremely challenging task that recently gained significant attention within the research community. In this work, we present a simple and yet very strong baseline for multimodal motion prediction based purely on Convolutional Neural Networks. While being ea…

    Submitted 5 June, 2022; originally announced June 2022.

    Comments: CVPR Workshop on Autonomous Driving 2021. Waymo Motion Prediction Challenge 2021

  9. arXiv:2109.04003

    cs.CV

    Improving Deep Metric Learning by Divide and Conquer

    Authors: Artsiom Sanakoyeu, Pingchuan Ma, Vadim Tschernezki, Björn Ommer

    Abstract: Deep metric learning (DML) is a cornerstone of many computer vision applications. It aims at learning a mapping from the input domain to an embedding space, where semantically similar objects are located nearby and dissimilar objects far from another. The target similarity on the training data is defined by user in form of ground-truth class labels. However, while the embedding space learns to mim…

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to PAMI. Source code: https://github.com/CompVis/metric-learning-divide-and-conquer-improved

  10. arXiv:2106.09758

    cs.CV

    Discovering Relationships between Object Categories via Universal Canonical Maps

    Authors: Natalia Neverova, Artsiom Sanakoyeu, Patrick Labatut, David Novotny, Andrea Vedaldi

    Abstract: We tackle the problem of learning the geometry of multiple categories of deformable objects jointly. Recent work has shown that it is possible to learn a unified dense pose predictor for several categories of related objects. However, training such models requires to initialize inter-category correspondences by hand. This is suboptimal and the resulting models fail to maintain correct corresponden…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted at CVPR 2021; Project page: https://gdude.de/discovering-3d-obj-rel

  11. arXiv:2009.04776

    cs.CV

    Self-supervised Depth Denoising Using Lower- and Higher-quality RGB-D sensors

    Authors: Akhmedkhan Shabanov, Ilya Krotov, Nikolay Chinaev, Vsevolod Poletaev, Sergei Kozlukov, Igor Pasechnik, Bulat Yakupov, Artsiom Sanakoyeu, Vadim Lebedev, Dmitry Ulyanov

    Abstract: Consumer-level depth cameras and depth sensors embedded in mobile devices enable numerous applications, such as AR games and face identification. However, the quality of the captured depth is sometimes insufficient for 3D reconstruction, tracking and other computer vision tasks. In this paper, we propose a self-supervised depth denoising approach to denoise and refine depth coming from a low quali…

    Submitted 13 September, 2020; v1 submitted 10 September, 2020; originally announced September 2020.

  12. arXiv:2003.08407

    cs.CV

    A Content Transformation Block For Image Style Transfer

    Authors: Dmytro Kotovenko, Artsiom Sanakoyeu, Pingchuan Ma, Sabine Lang, Björn Ommer

    Abstract: Style transfer has recently received a lot of attention, since it allows to study fundamental challenges in image understanding and synthesis. Recent work has significantly improved the representation of color and texture and computational speed and image resolution. The explicit transformation of image content has, however, been mostly neglected: while artistic style affects formal characteristic…

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: Accepted to CVPR 2019

  13. arXiv:2003.00080

    cs.CV

    Transferring Dense Pose to Proximal Animal Classes

    Authors: Artsiom Sanakoyeu, Vasil Khalidov, Maureen S. McCarthy, Andrea Vedaldi, Natalia Neverova

    Abstract: Recent contributions have demonstrated that it is possible to recognize the pose of humans densely and accurately given a large dataset of poses annotated in detail. In principle, the same approach could be extended to any animal class, but the effort required for collecting new annotations for each case makes this strategy impractical, despite important applications in natural conservation, scien…

    Submitted 28 February, 2020; originally announced March 2020.

    Comments: Accepted at CVPR 2020; Project page: https://asanakoy.github.io/densepose-evolution

  14. arXiv:1906.05990

    cs.CV cs.LG

    Divide and Conquer the Embedding Space for Metric Learning

    Authors: Artsiom Sanakoyeu, Vadim Tschernezki, Uta Büchler, Björn Ommer

    Abstract: Learning the embedding space, where semantically similar objects are located close together and dissimilar objects far apart, is a cornerstone of many computer vision applications. Existing approaches usually learn a single metric in the embedding space for all available data points, which may have a very complex non-uniform distribution with different notions of similarity between objects, e.g. a…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Source code: https://github.com/CompVis/metric-learning-divide-and-conquer

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 471-480

  15. arXiv:1904.04445

    cs.CV

    Semi-Supervised Segmentation of Salt Bodies in Seismic Images using an Ensemble of Convolutional Neural Networks

    Authors: Yauhen Babakhin, Artsiom Sanakoyeu, Hirotoshi Kitamura

    Abstract: Seismic image analysis plays a crucial role in a wide range of industrial applications and has been receiving significant attention. One of the essential challenges of seismic imaging is detecting subsurface salt structure which is indispensable for identification of hydrocarbon reservoirs and drill path planning. Unfortunately, exact identification of large salt deposits is notoriously difficult…

    Submitted 5 August, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: Accepted at GCPR 2019, Source code: https://github.com/ybabakhin/kaggle_salt_bes_phalanx

  16. arXiv:1807.10201

    cs.CV

    A Style-Aware Content Loss for Real-time HD Style Transfer

    Authors: Artsiom Sanakoyeu, Dmytro Kotovenko, Sabine Lang, Björn Ommer

    Abstract: Recently, style transfer has received a lot of attention. While much of this research has aimed at speeding up processing, the approaches are still lacking from a principled, art historical standpoint: a style is more than just a single image or an artist, but previous work is limited to only a single instance of a style or shows no benefit from more images. Moreover, previous work has relied on a…

    Submitted 28 July, 2018; v1 submitted 26 July, 2018; originally announced July 2018.

    Comments: Accepted at ECCV18 (Oral)

  17. Deep Unsupervised Learning of Visual Similarities

    Authors: Artsiom Sanakoyeu, Miguel A. Bautista, Björn Ommer

    Abstract: Exemplar learning of visual similarities in an unsupervised manner is a problem of paramount importance to Computer Vision. In this context, however, the recent breakthrough in deep learning could not yet unfold its full potential. With only a single positive sample, a great imbalance between one positive and many negatives, and unreliable relationships between most samples, training of Convolutio…

    Submitted 21 February, 2018; originally announced February 2018.

    Comments: arXiv admin note: text overlap with arXiv:1608.08792

    Journal ref: Pattern Recognition Volume 78, June 2018, Pages 331-343

  18. arXiv:1704.02268

    cs.CV

    Deep Unsupervised Similarity Learning using Partially Ordered Sets

    Authors: Miguel A Bautista, Artsiom Sanakoyeu, Björn Ommer

    Abstract: Unsupervised learning of visual similarities is of paramount importance to computer vision, particularly due to lacking training data for fine-grained similarities. Deep learning of similarities is often based on relationships between pairs or triplets of samples. Many of these relations are unreliable and mutually contradicting, implying inconsistencies when trained without supervision informatio…

    Submitted 11 April, 2017; v1 submitted 7 April, 2017; originally announced April 2017.

    Comments: Accepted for publication at IEEE Computer Vision and Pattern Recognition 2017

  19. arXiv:1608.08792

    cs.CV

    CliqueCNN: Deep Unsupervised Exemplar Learning

    Authors: Miguel A. Bautista, Artsiom Sanakoyeu, Ekaterina Sutter, Björn Ommer

    Abstract: Exemplar learning is a powerful paradigm for discovering visual similarities in an unsupervised manner. In this context, however, the recent breakthrough in deep learning could not yet unfold its full potential. With only a single positive sample, a great imbalance between one positive and many negatives, and unreliable relationships between most samples, training of Convolutional Neural networks…

    Submitted 31 August, 2016; originally announced August 2016.

    Comments: Accepted for publication at NIPS 2016