Showing 1–8 of 8 results for author: Kulikov, V

Searching in archive cs.
  1. arXiv:2405.12211  [pdf, other]

    cs.CV

    Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices

    Authors: Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer Michaeli

    Abstract: Text-to-image (T2I) diffusion models achieve state-of-the-art results in image synthesis and editing. However, leveraging such pretrained models for video editing is considered a major challenge. Many existing works attempt to enforce temporal consistency in the edited video through explicit correspondence mechanisms, either in pixel space or between deep features. These methods, however, struggle…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Code and examples are available at https://matankleiner.github.io/slicedit/

  2. arXiv:2304.06140  [pdf, other]

    cs.CV cs.LG

    An Edit Friendly DDPM Noise Space: Inversion and Manipulations

    Authors: Inbar Huberman-Spiegelglas, Vladimir Kulikov, Tomer Michaeli

    Abstract: Denoising diffusion probabilistic models (DDPMs) employ a sequence of white Gaussian noise samples to generate an image. In analogy with GANs, those noise maps could be considered as the latent code associated with the generated image. However, this native noise space does not possess a convenient structure, and is thus challenging to work with in editing tasks. Here, we propose an alternative lat…

    Submitted 9 April, 2024; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: CVPR 2024. Code and examples are available at https://github.com/inbarhub/DDPM_inversion

  3. arXiv:2211.16582  [pdf, other]

    cs.CV cs.LG eess.IV

    SinDDM: A Single Image Denoising Diffusion Model

    Authors: Vladimir Kulikov, Shahar Yadin, Matan Kleiner, Tomer Michaeli

    Abstract: Denoising diffusion models (DDMs) have led to staggering performance leaps in image generation, editing and restoration. However, existing DDMs use very large datasets for training. Here, we introduce a framework for training a DDM on a single image. Our method, which we coin SinDDM, learns the internal statistics of the training image by using a multi-scale diffusion process. To drive the reverse…

    Submitted 6 June, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Updated for ICML 2023 and added the Appendix. Note that the images are lightly compressed. Visit our project page for uncompressed results: https://matankleiner.github.io/sinddm/

  4. arXiv:2209.13465  [pdf, other]

    cs.CV cs.AI cs.LG

    AdaFocusV3: On Unified Spatial-temporal Dynamic Video Recognition

    Authors: Yulin Wang, Yang Yue, Xinhong Xu, Ali Hassani, Victor Kulikov, Nikita Orlov, Shiji Song, Humphrey Shi, Gao Huang

    Abstract: Recent research has revealed that reducing the temporal and spatial redundancy are both effective approaches towards efficient video recognition, e.g., allocating the majority of computation to a task-relevant subset of frames or the most valuable image regions of each frame. However, in most existing works, either type of redundancy is typically modeled with another absent. This paper explores th…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: ECCV 2022

  5. arXiv:2112.14238  [pdf, other]

    cs.CV cs.AI cs.LG

    AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition

    Authors: Yulin Wang, Yang Yue, Yuanze Lin, Haojun Jiang, Zihang Lai, Victor Kulikov, Nikita Orlov, Humphrey Shi, Gao Huang

    Abstract: Recent works have shown that the computational efficiency of video recognition can be significantly improved by reducing the spatial redundancy. As a representative work, the adaptive focus method (AdaFocus) has achieved a favorable trade-off between accuracy and inference speed by dynamically identifying and attending to the informative regions in each video frame. However, AdaFocus requires a co…

    Submitted 11 April, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

    Comments: Accepted by CVPR-2022

  6. arXiv:1904.05257  [pdf, other]

    cs.CV

    Instance Segmentation of Biological Images Using Harmonic Embeddings

    Authors: Victor Kulikov, Victor Lempitsky

    Abstract: We present a new instance segmentation approach tailored to biological images, where instances may correspond to individual cells, organisms or plant parts. Unlike instance segmentation for user photographs or road scenes, in biological data object instances may be particularly densely packed, the appearance variation may be particularly low, the processing power may be restricted, while, on the o…

    Submitted 23 April, 2020; v1 submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted as oral to CVPR 2020

  7. arXiv:1812.05873  [pdf, ps, other]

    cs.LO math.LO

    Facets of Distribution Identities in Probabilistic Team Semantics

    Authors: Miika Hannula, Åsa Hirvonen, Juha Kontinen, Vadim Kulikov, Jonni Virtema

    Abstract: We study probabilistic team semantics which is a semantical framework allowing the study of logical and probabilistic dependencies simultaneously. We examine and classify the expressive power of logical formalisms arising by different probabilistic atoms such as conditional independence and different variants of marginal distribution equivalences. We also relate the framework to the first-order th…

    Submitted 25 February, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

  8. arXiv:1807.10007  [pdf, other]

    cs.CV

    Instance Segmentation by Deep Coloring

    Authors: Victor Kulikov, Victor Yurchenko, Victor Lempitsky

    Abstract: We propose a new and, arguably, a very simple reduction of instance segmentation to semantic segmentation. This reduction allows to train feed-forward non-recurrent deep instance segmentation systems in an end-to-end fashion using architectures that have been proposed for semantic segmentation. Our approach proceeds by introducing a fixed number of labels (colors) and then dynamically assigning ob…

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: 10 pages, 6 figures, 3 tables