Showing 1–2 of 2 results for author: Moreshet, A

  1. arXiv:2404.11120  [pdf, other]

    cs.CV

    TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing

    Authors: Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Kuo-Chin Lien, Misha Sra, Pradeep Sen

    Abstract: Despite many attempts to leverage pre-trained text-to-image models (T2I) like Stable Diffusion (SD) for controllable image editing, producing good predictable results remains a challenge. Previous approaches have focused on either fine-tuning pre-trained T2I models on specific datasets to generate certain kinds of images (e.g., with a specific object or person), or on optimizing the weights, text…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  2. arXiv:2103.11247  [pdf, other]

    cs.CV cs.LG

    Attention-Based Multimodal Image Matching

    Authors: Aviad Moreshet, Yosi Keller

    Abstract: We propose an attention-based approach for multimodal image patch matching using a Transformer encoder attending to the feature maps of a multiscale Siamese CNN. Our encoder is shown to efficiently aggregate multiscale image embeddings while emphasizing task-specific appearance-invariant image cues. We also introduce an attention-residual architecture, using a residual connection bypassing the enc…

    Submitted 24 September, 2023; v1 submitted 20 March, 2021; originally announced March 2021.