Skip to main content

Showing 1–15 of 15 results for author: Arar, M

.
  1. arXiv:2401.06105  [pdf, other

    cs.CV cs.CL cs.GR cs.LG

    PALP: Prompt Aligned Personalization of Text-to-Image Models

    Authors: Moab Arar, Andrey Voynov, Amir Hertz, Omri Avrahami, Shlomi Fruchter, Yael Pritch, Daniel Cohen-Or, Ariel Shamir

    Abstract: Content creators often aim to create personalized images using personal subjects that go beyond the capabilities of conventional text-to-image models. Additionally, they may want the resulting image to encompass a specific location, style, ambiance, and more. Existing personalization methods may compromise personalization ability or the alignment to complex textual prompts. This trade-off can impe… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Project page available at https://prompt-aligned.github.io/

  2. arXiv:2311.17609  [pdf, other

    cs.CV cs.GR cs.LG

    AnyLens: A Generative Diffusion Model with Any Rendering Lens

    Authors: Andrey Voynov, Amir Hertz, Moab Arar, Shlomi Fruchter, Daniel Cohen-Or

    Abstract: State-of-the-art diffusion models can generate highly realistic images based on various conditioning like text, segmentation, and depth. However, an essential aspect often overlooked is the specific camera geometry used during image capture. The influence of different optical systems on the final scene appearance is frequently overlooked. This study introduces a framework that intimately integrate… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  3. arXiv:2311.10093  [pdf, other

    cs.CV cs.GR cs.LG

    The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

    Authors: Omri Avrahami, Amir Hertz, Yael Vinker, Moab Arar, Shlomi Fruchter, Ohad Fried, Daniel Cohen-Or, Dani Lischinski

    Abstract: Recent advances in text-to-image generation models have unlocked vast potential for visual creativity. However, the users that use these models struggle with the generation of consistent characters, a crucial aspect for numerous real-world applications such as story visualization, game development, asset design, advertising, and more. Current methods typically rely on multiple pre-existing images… ▽ More

    Submitted 5 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to SIGGRAPH 2024. Project page is available at https://omriavrahami.com/the-chosen-one/

  4. arXiv:2307.06925  [pdf, other

    cs.CV cs.GR cs.LG

    Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models

    Authors: Moab Arar, Rinon Gal, Yuval Atzmon, Gal Chechik, Daniel Cohen-Or, Ariel Shamir, Amit H. Bermano

    Abstract: Text-to-image (T2I) personalization allows users to guide the creative image generation process by combining their own visual concepts in natural language prompts. Recently, encoder-based techniques have emerged as a new effective approach for T2I personalization, reducing the need for multiple images and long training times. However, most existing encoders are limited to a single-class domain, wh… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Project page at https://datencoder.github.io

  5. arXiv:2304.05177  [pdf, other

    math.NA

    Bounds on non-linear errors for variance computation with stochastic rounding *

    Authors: E M El Arar, D Sohier, P de Oliveira Castro, E Petit

    Abstract: The main objective of this work is to investigate non-linear errors and pairwise summation using stochastic rounding (SR) in variance computation algorithms. We estimate the forward error of computations under SR through two methods: the first is based on a bound of the variance and Bienaym{é}-Chebyshev inequality, while the second is based on martingales and Azuma-Hoeffding inequality. The study… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  6. arXiv:2302.12228  [pdf, other

    cs.CV cs.GR cs.LG

    Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models

    Authors: Rinon Gal, Moab Arar, Yuval Atzmon, Amit H. Bermano, Gal Chechik, Daniel Cohen-Or

    Abstract: Text-to-image personalization aims to teach a pre-trained diffusion model to reason about novel, user provided concepts, embedding them into new scenes guided by natural language prompts. However, current personalization approaches struggle with lengthy training times, high storage requirements or loss of identity. To overcome these limitations, we propose an encoder-based domain-tuning approach.… ▽ More

    Submitted 5 March, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Project page at https://tuning-encoder.github.io/

  7. arXiv:2302.05905  [pdf, other

    cs.CV cs.AI cs.GR

    Single Motion Diffusion

    Authors: Sigal Raab, Inbal Leibovitch, Guy Tevet, Moab Arar, Amit H. Bermano, Daniel Cohen-Or

    Abstract: Synthesizing realistic animations of humans, animals, and even imaginary creatures, has long been a goal for artists and computer graphics professionals. Compared to the imaging domain, which is rich with large available datasets, the number of data instances for the motion domain is limited, particularly for the animation of animals and exotic creatures (e.g., dragons), which have unique skeleton… ▽ More

    Submitted 13 June, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: Video: https://www.youtube.com/watch?v=zuWpVTgb_0U, Project page: https://sinmdm.github.io/SinMDM-page, Code: https://github.com/SinMDM/SinMDM

  8. arXiv:2112.11435  [pdf, other

    cs.CV

    Learned Queries for Efficient Local Attention

    Authors: Moab Arar, Ariel Shamir, Amit H. Bermano

    Abstract: Vision Transformers (ViT) serve as powerful vision models. Unlike convolutional neural networks, which dominated vision research in previous years, vision transformers enjoy the ability to capture long-range dependencies in the data. Nonetheless, an integral part of any transformer architecture, the self-attention mechanism, suffers from high latency and inefficient memory utilization, making it l… ▽ More

    Submitted 19 April, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: CVPR 2022 - Oral

  9. arXiv:2104.03843  [pdf, other

    cs.CV

    InAugment: Improving Classifiers via Internal Augmentation

    Authors: Moab Arar, Ariel Shamir, Amit Bermano

    Abstract: Image augmentation techniques apply transformation functions such as rotation, shearing, or color distortion on an input image. These augmentations were proven useful in improving neural networks' generalization ability. In this paper, we present a novel augmentation operation, InAugment, that exploits image internal statistics. The key idea is to copy patches from the image itself, apply augmenta… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

  10. arXiv:2007.07723  [pdf, other

    cs.LG cs.CV stat.ML

    Focus-and-Expand: Training Guidance Through Gradual Manipulation of Input Features

    Authors: Moab Arar, Noa Fish, Dani Daniel, Evgeny Tenetov, Ariel Shamir, Amit Bermano

    Abstract: We present a simple and intuitive Focus-and-eXpand (\fax) method to guide the training process of a neural network towards a specific solution. Optimizing a neural network is a highly non-convex problem. Typically, the space of solutions is large, with numerous possible local minima, where reaching a specific minimum depends on many factors. In many cases, however, a solution which considers speci… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

  11. arXiv:2003.08073  [pdf, other

    cs.CV

    Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation

    Authors: Moab Arar, Yiftach Ginger, Dov Danon, Ilya Leizerson, Amit Bermano, Daniel Cohen-Or

    Abstract: Many applications, such as autonomous driving, heavily rely on multi-modal data where spatial alignment between the modalities is required. Most multi-modal registration methods struggle computing the spatial correspondence between the images using prevalent cross-modality similarity measures. In this work, we bypass the difficulties of develo** cross-modality similarity measures, by training an… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

  12. arXiv:1904.08475  [pdf, other

    cs.CV

    Image Resizing by Reconstruction from Deep Features

    Authors: Moab Arar, Dov Danon, Daniel Cohen-Or, Ariel Shamir

    Abstract: Traditional image resizing methods usually work in pixel space and use various saliency measures. The challenge is to adjust the image shape while trying to preserve important content. In this paper we perform image resizing in feature space where the deep layers of a neural network contain rich important semantic information. We directly adjust the image feature maps, extracted from a pre-trained… ▽ More

    Submitted 22 June, 2021; v1 submitted 17 April, 2019; originally announced April 2019.

    Comments: 13 pages, 21 figures

  13. arXiv:1711.06625  [pdf, ps, other

    cs.DS

    Dynamic Matching: Reducing Integral Algorithms to Approximately-Maximal Fractional Algorithms

    Authors: Moab Arar, Shiri Chechik, Sarel Cohen, Cliff Stein, David Wajc

    Abstract: We present a simple randomized reduction from fully-dynamic integral matching algorithms to fully-dynamic "approximately-maximal" fractional matching algorithms. Applying this reduction to the recent fractional matching algorithm of Bhattacharya, Henzinger, and Nanongkai (SODA 2017), we obtain a novel result for the integral problem. Specifically, our main result is a randomized fully-dynamic… ▽ More

    Submitted 27 February, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    ACM Class: F.2.2

  14. arXiv:1711.05444  [pdf, other

    cs.CV cs.HC

    Robust Real-Time Multi-View Eye Tracking

    Authors: Nuri Murat Arar, Jean-Philippe Thiran

    Abstract: Despite significant advances in improving the gaze tracking accuracy under controlled conditions, the tracking robustness under real-world conditions, such as large head pose and movements, use of eyeglasses, illumination and eye type variations, remains a major challenge in eye tracking. In this paper, we revisit this challenge and introduce a real-time multi-camera eye tracking framework to impr… ▽ More

    Submitted 3 January, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: Organisational changes in the main msp and supplementary info. Results unchanged. Main msp: 14 pages, 15 figures. Supplementary: 2 tables, 1 figure. Under review for an IEEE transactions publication

  15. Modeling the behavior of reinforced concrete walls under fire, considering the impact of the span on firewalls

    Authors: Nadia Otmani Benmehidi, Meriem Arar, Imene Chine

    Abstract: Numerical modeling using computers is known to present several advantages compared to experimental testing. The high cost and the amount of time required to prepare and to perform a test were among the main problems on the table when the first tools for modeling structures in fire were developed. The discipline structures-in-fire modeling is still currently the subject of important research effort… ▽ More

    Submitted 27 January, 2014; originally announced January 2014.

    Comments: 8 pages,12 figures, 4 tables

    Journal ref: International Journal of Soft Computing And Software Engineering (JSCSE), Vol.3,No.3, pp. 600-607, 2013