Showing 1–3 of 3 results for author: Grechka, A

  1. arXiv:2309.09614 [pdf, other]

    cs.CV cs.AI cs.LG

    Gradpaint: Gradient-Guided Inpainting with Diffusion Models

    Authors: Asya Grechka, Guillaume Couairon, Matthieu Cord

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) have recently achieved remarkable results in conditional and unconditional image generation. The pre-trained models can be adapted without further training to different downstream tasks, by guiding their iterative denoising process at inference time to satisfy additional constraints. For the specific task of image inpainting, the current guiding mec…

    Submitted 18 September, 2023; originally announced September 2023.

  2. arXiv:2204.09730 [pdf, other]

    cs.CV

    Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval

    Authors: Mustafa Shukor, Guillaume Couairon, Asya Grechka, Matthieu Cord

    Abstract: Cross-modal image-recipe retrieval has gained significant attention in recent years. Most work focuses on improving cross-modal embeddings using unimodal encoders, that allow for efficient retrieval in large-scale databases, leaving aside cross-attention between modalities which is more computationally expensive. We propose a new retrieval framework, T-Food (Transformer Decoders with MultiModal Re…

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: Accepted at CVPR 2022, MULA Workshop. Code is available at https://github.com/mshukor/TFood

  3. arXiv:2203.04705 [pdf, other]

    cs.CV

    FlexIT: Towards Flexible Semantic Image Translation

    Authors: Guillaume Couairon, Asya Grechka, Jakob Verbeek, Holger Schwenk, Matthieu Cord

    Abstract: Deep generative models, like GANs, have considerably improved the state of the art in image synthesis, and are able to generate near photo-realistic images in structured domains such as human faces. Based on this success, recent work on image editing proceeds by projecting images to the GAN latent space and manipulating the latent vector. However, these approaches are limited in that only images f…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022