Skip to main content

Showing 1–49 of 49 results for author: Lischinski, D

.
  1. arXiv:2406.01594  [pdf, other

    cs.CV cs.GR cs.LG

    DiffUHaul: A Training-Free Method for Object Dragging in Images

    Authors: Omri Avrahami, Rinon Gal, Gal Chechik, Ohad Fried, Dani Lischinski, Arash Vahdat, Weili Nie

    Abstract: Text-to-image diffusion models have proven effective for solving many image editing tasks. However, the seemingly straightforward task of seamlessly relocating objects within a scene remains surprisingly challenging. Existing methods addressing this problem often struggle to function reliably in real-world scenarios due to lacking spatial reasoning. In this work, we propose a training-free method,… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Project page is available at https://omriavrahami.com/diffuhaul/

  2. arXiv:2405.12661  [pdf, other

    cs.CV

    EmoEdit: Evoking Emotions through Image Manipulation

    Authors: **gyuan Yang, Jiawei Feng, Weibin Luo, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: Affective Image Manipulation (AIM) seeks to modify user-provided images to evoke specific emotional responses. This task is inherently complex due to its twofold objective: significantly evoking the intended emotion, while preserving the original image composition. Existing AIM methods primarily adjust color and style, often failing to elicit precise and profound emotional shifts. Drawing on psych… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2401.13245  [pdf, other

    cs.HC

    GraphiMind: LLM-centric Interface for Information Graphics Design

    Authors: Qirui Huang, Min Lu, Joel Lanir, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: Information graphics are pivotal in effective information dissemination and storytelling. However, creating such graphics is extremely challenging for non-professionals, since the design process requires multifaceted skills and comprehensive knowledge. Thus, despite the many available authoring tools, a significant gap remains in enabling non-experts to produce compelling information graphics seam… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  4. arXiv:2401.02847  [pdf, other

    cs.CV cs.GR cs.LG

    Generating Non-Stationary Textures using Self-Rectification

    Authors: Yang Zhou, Rongjun Xiao, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: This paper addresses the challenge of example-based non-stationary texture synthesis. We introduce a novel twostep approach wherein users first modify a reference texture using standard image editing tools, yielding an initial rough target for the synthesis. Subsequently, our proposed method, termed "self-rectification", automatically refines this target into a coherent, seamless texture, while fa… ▽ More

    Submitted 30 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Project page: https://github.com/xiaorongjun000/Self-Rectification

  5. arXiv:2312.03766  [pdf, other

    cs.CL cs.CV

    Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

    Authors: Brian Gordon, Yonatan Bitton, Yonatan Shafir, Roopal Garg, Xi Chen, Dani Lischinski, Daniel Cohen-Or, Idan Szpektor

    Abstract: While existing image-text alignment models reach high quality binary assessments, they fall short of pinpointing the exact source of misalignment. In this paper, we present a method to provide detailed textual and visual explanation of detected misalignments between text-image pairs. We leverage large language models and visual grounding models to automatically construct a training set that holds… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  6. arXiv:2312.00116  [pdf, other

    cs.CV cs.GR cs.LG

    S2ST: Image-to-Image Translation in the Seed Space of Latent Diffusion

    Authors: Or Greenberg, Eran Kishon, Dani Lischinski

    Abstract: Image-to-image translation (I2IT) refers to the process of transforming images from a source domain to a target domain while maintaining a fundamental connection in terms of image content. In the past few years, remarkable advancements in I2IT were achieved by Generative Adversarial Networks (GANs), which nevertheless struggle with translations requiring high precision. Recently, Diffusion Models… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 17 pages, 15 figures

  7. arXiv:2311.10093  [pdf, other

    cs.CV cs.GR cs.LG

    The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

    Authors: Omri Avrahami, Amir Hertz, Yael Vinker, Moab Arar, Shlomi Fruchter, Ohad Fried, Daniel Cohen-Or, Dani Lischinski

    Abstract: Recent advances in text-to-image generation models have unlocked vast potential for visual creativity. However, the users that use these models struggle with the generation of consistent characters, a crucial aspect for numerous real-world applications such as story visualization, game development, asset design, advertising, and more. Current methods typically rely on multiple pre-existing images… ▽ More

    Submitted 5 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to SIGGRAPH 2024. Project page is available at https://omriavrahami.com/the-chosen-one/

  8. arXiv:2310.17590  [pdf, other

    cs.CV

    Noise-Free Score Distillation

    Authors: Oren Katzir, Or Patashnik, Daniel Cohen-Or, Dani Lischinski

    Abstract: Score Distillation Sampling (SDS) has emerged as the de facto approach for text-to-content generation in non-image domains. In this paper, we reexamine the SDS process and introduce a straightforward interpretation that demystifies the necessity for large Classifier-Free Guidance (CFG) scales, rooted in the distillation of an undesired noise term. Building upon our interpretation, we propose a nov… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Project page at https://orenkatzir.github.io/nfsd/

  9. arXiv:2307.07961  [pdf, other

    cs.CV

    EmoSet: A Large-scale Visual Emotion Dataset with Rich Attributes

    Authors: **gyuan Yang, Qirui Huang, Tingting Ding, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: Visual Emotion Analysis (VEA) aims at predicting people's emotional responses to visual stimuli. This is a promising, yet challenging, task in affective computing, which has drawn increasing attention in recent years. Most of the existing work in this area focuses on feature design, while little attention has been paid to dataset construction. In this work, we introduce EmoSet, the first large-sca… ▽ More

    Submitted 28 July, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV2023, similar to the final version

  10. arXiv:2306.16052  [pdf, other

    cs.CV

    SVNR: Spatially-variant Noise Removal with Denoising Diffusion

    Authors: Naama Pearl, Yaron Brodsky, Dana Berman, Assaf Zomet, Alex Rav Acha, Daniel Cohen-Or, Dani Lischinski

    Abstract: Denoising diffusion models have recently shown impressive results in generative tasks. By learning powerful priors from huge collections of training images, such models are able to gradually modify complete noise to a clean natural image via a sequence of small denoising steps, seemingly making them well-suited for single image denoising. However, effectively applying denoising diffusion models to… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  11. arXiv:2306.12760  [pdf, other

    cs.CV cs.GR cs.LG

    Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields

    Authors: Ori Gordon, Omri Avrahami, Dani Lischinski

    Abstract: Editing a local region or a specific object in a 3D scene represented by a NeRF or consistently blending a new realistic object into the scene is challenging, mainly due to the implicit nature of the scene representation. We present Blended-NeRF, a robust and flexible framework for editing a specific region of interest in an existing NeRF scene, based on text prompts, along with a 3D ROI box. Our… ▽ More

    Submitted 7 September, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: 16 pages, 14 figures. Project page: https://www.vision.huji.ac.il/blended-nerf/

  12. arXiv:2305.20062  [pdf, other

    cs.CV

    Chatting Makes Perfect: Chat-based Image Retrieval

    Authors: Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski

    Abstract: Chats emerge as an effective user-friendly approach for information retrieval, and are successfully employed in many domains, such as customer service, healthcare, and finance. However, existing image retrieval approaches typically address the case of a single query-to-image round, and the use of chats for image retrieval has been mostly overlooked. In this work, we introduce ChatIR: a chat-based… ▽ More

    Submitted 5 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Camera Ready version for NeurIPS 2023

  13. arXiv:2305.16311  [pdf, other

    cs.CV cs.GR cs.LG

    Break-A-Scene: Extracting Multiple Concepts from a Single Image

    Authors: Omri Avrahami, Kfir Aberman, Ohad Fried, Daniel Cohen-Or, Dani Lischinski

    Abstract: Text-to-image model personalization aims to introduce a user-provided concept to the model, allowing its synthesis in diverse contexts. However, current methods primarily focus on the case of learning a single concept from multiple images with variations in backgrounds and poses, and struggle when adapted to a different scenario. In this work, we introduce the task of textual scene decomposition:… ▽ More

    Submitted 4 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: SIGGRAPH Asia 2023. Project page: at: https://omriavrahami.com/break-a-scene/ Video: https://www.youtube.com/watch?v=-9EA-BhizgM

  14. arXiv:2303.09429  [pdf, other

    cs.CV

    Data Roaming and Quality Assessment for Composed Image Retrieval

    Authors: Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski

    Abstract: The task of Composed Image Retrieval (CoIR) involves queries that combine image and text modalities, allowing users to express their intent more effectively. However, current CoIR datasets are orders of magnitude smaller compared to other vision and language (V&L) datasets. Additionally, some of these datasets have noticeable issues, such as queries containing redundant modalities. To address thes… ▽ More

    Submitted 20 December, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Camera Ready version for AAAI 2024

  15. SpaText: Spatio-Textual Representation for Controllable Image Generation

    Authors: Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin

    Abstract: Recent text-to-image diffusion models are able to generate convincing results of unprecedented quality. However, it is nearly impossible to control the shapes of different regions/objects or their layout in a fine-grained fashion. Previous attempts to provide such controls were hindered by their reliance on a fixed set of labels. To this end, we present SpaText - a new method for text-to-image gen… ▽ More

    Submitted 19 March, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: CVPR 2023. Project page available at: https://omriavrahami.com/spatext

  16. arXiv:2210.06642  [pdf, other

    cs.CV cs.GR

    What's in a Decade? Transforming Faces Through Time

    Authors: Eric Ming Chen, ** Sun, Apoorv Khandelwal, Dani Lischinski, Noah Snavely, Hadar Averbuch-Elor

    Abstract: How can one visually characterize people in a decade? In this work, we assemble the Faces Through Time dataset, which contains over a thousand portrait images from each decade, spanning the 1880s to the present day. Using our new dataset, we present a framework for resynthesizing portrait images across time, imagining how a portrait taken during a particular decade might have looked like, had it b… ▽ More

    Submitted 31 January, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Project Page: https://facesthroughtime.github.io

  17. arXiv:2206.02779  [pdf, other

    cs.CV cs.GR cs.LG

    Blended Latent Diffusion

    Authors: Omri Avrahami, Ohad Fried, Dani Lischinski

    Abstract: The tremendous progress in neural image generation, coupled with the emergence of seemingly omnipotent vision-language models has finally enabled text-based interfaces for creating and editing images. Handling generic images requires a diverse underlying generative model, hence the latest works utilize diffusion models, which were shown to surpass GANs in terms of diversity. One major drawback of… ▽ More

    Submitted 30 April, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted to SIGGRAPH 2023. Project page: https://omriavrahami.com/blended-latent-diffusion-page/

  18. arXiv:2204.01159  [pdf, other

    cs.CV

    Shape-Pose Disentanglement using SE(3)-equivariant Vector Neurons

    Authors: Oren Katzir, Dani Lischinski, Daniel Cohen-Or

    Abstract: We introduce an unsupervised technique for encoding point clouds into a canonical shape representation, by disentangling shape and pose. Our encoder is stable and consistent, meaning that the shape encoding is purely pose-invariant, while the extracted rotation and translation are able to semantically align different input shapes of the same class to a common canonical pose. Specifically, we desig… ▽ More

    Submitted 3 April, 2022; originally announced April 2022.

  19. arXiv:2202.05910  [pdf, other

    cs.CV cs.LG

    Multi-level Latent Space Structuring for Generative Control

    Authors: Oren Katzir, Vicky Perepelook, Dani Lischinski, Daniel Cohen-Or

    Abstract: Truncation is widely used in generative models for improving the quality of the generated samples, at the expense of reducing their diversity. We propose to leverage the StyleGAN generative architecture to devise a new truncation technique, based on a decomposition of the latent space into clusters, enabling customized truncation to be performed at multiple semantic levels. We do so by learning to… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

  20. arXiv:2201.13433  [pdf, other

    cs.CV

    Third Time's the Charm? Image and Video Editing with StyleGAN3

    Authors: Yuval Alaluf, Or Patashnik, Zongze Wu, Asif Zamir, Eli Shechtman, Dani Lischinski, Daniel Cohen-Or

    Abstract: StyleGAN is arguably one of the most intriguing and well-studied generative models, demonstrating impressive performance in image generation, inversion, and manipulation. In this work, we explore the recent StyleGAN3 architecture, compare it to its predecessor, and investigate its unique advantages, as well as drawbacks. In particular, we demonstrate that while StyleGAN3 can be trained on unaligne… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: Project page available at https://yuval-alaluf.github.io/stylegan3-editing/

  21. arXiv:2201.10326  [pdf, other

    cs.CV cs.GR cs.LG

    ShapeFormer: Transformer-based Shape Completion via Sparse Representation

    Authors: Xingguang Yan, Liqiang Lin, Niloy J. Mitra, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: We present ShapeFormer, a transformer-based network that produces a distribution of object completions, conditioned on incomplete, and possibly noisy, point clouds. The resultant distribution can then be sampled to generate likely completions, each exhibiting plausible shape details while being faithful to the input. To facilitate the use of transformers for 3D, we introduce a compact 3D represent… ▽ More

    Submitted 22 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Project page: https://shapeformer.github.io/

  22. Blended Diffusion for Text-driven Editing of Natural Images

    Authors: Omri Avrahami, Dani Lischinski, Ohad Fried

    Abstract: Natural language offers a highly intuitive interface for image editing. In this paper, we introduce the first solution for performing local (region-based) edits in generic natural images, based on a natural language description along with an ROI mask. We achieve our goal by leveraging and combining a pretrained language-image model (CLIP), to steer the edit towards a user-provided text prompt, wit… ▽ More

    Submitted 28 March, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: CVPR 2022. Code is available at: https://omriavrahami.com/blended-diffusion-page/

  23. arXiv:2111.14792  [pdf, other

    cs.CV

    Classification-Regression for Chart Comprehension

    Authors: Matan Levy, Rami Ben-Ari, Dani Lischinski

    Abstract: Chart question answering (CQA) is a task used for assessing chart comprehension, which is fundamentally different from understanding natural images. CQA requires analyzing the relationships between the textual and the visual components of a chart, in order to answer general questions or infer numerical values. Most existing CQA datasets and models are based on simplifying assumptions that often en… ▽ More

    Submitted 11 July, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: ECCV 2022

  24. arXiv:2110.11323  [pdf, other

    cs.CV cs.GR cs.LG

    StyleAlign: Analysis and Applications of Aligned StyleGAN Models

    Authors: Zongze Wu, Yotam Nitzan, Eli Shechtman, Dani Lischinski

    Abstract: In this paper, we perform an in-depth study of the properties and applications of aligned generative models. We refer to two models as aligned if they share the same architecture, and one of them (the child) is obtained from the other (the parent) via fine-tuning to another domain, a common practice in transfer learning. Several works already utilize some basic properties of aligned StyleGAN model… ▽ More

    Submitted 5 May, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: 44 pages, 37 figures

    Journal ref: Proc. 10th International Conference on Learning Representations, ICLR 2022

  25. arXiv:2108.10528  [pdf, other

    cs.CV

    ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation

    Authors: **ming Cao, Hanchao Leng, Dani Lischinski, Danny Cohen-Or, Changhe Tu, Yangyan Li

    Abstract: RGB-D semantic segmentation has attracted increasing attention over the past few years. Existing methods mostly employ homogeneous convolution operators to consume the RGB and depth features, ignoring their intrinsic differences. In fact, the RGB values capture the photometric appearance properties in the projected image space, while the depth feature encodes both the shape of a local geometry as… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: ICCV2021

  26. arXiv:2106.03847  [pdf, other

    cs.LG cs.CV cs.GR cs.NE

    GAN Cocktail: mixing GANs without dataset access

    Authors: Omri Avrahami, Dani Lischinski, Ohad Fried

    Abstract: Today's generative models are capable of synthesizing high-fidelity images, but each model specializes on a specific target domain. This raises the need for model merging: combining two or more pretrained generative models into a single unified one. In this work we tackle the problem of model merging, given two constraints that often come up in the real world: (1) no access to the original trainin… ▽ More

    Submitted 11 July, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: ECCV 2022. Project page is available at: https://omriavrahami.com/GAN-cocktail-page/

  27. arXiv:2103.17249  [pdf, other

    cs.CV cs.CL cs.GR cs.LG

    StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery

    Authors: Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, Dani Lischinski

    Abstract: Inspired by the ability of StyleGAN to generate highly realistic images in a variety of domains, much recent work has focused on understanding how to use the latent spaces of StyleGAN to manipulate generated and real images. However, discovering semantically meaningful latent manipulations typically involves painstaking human examination of the many degrees of freedom, or an annotated collection o… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: 18 pages, 24 figures, code and video may be found here: https://github.com/orpatashnik/StyleCLIP

  28. arXiv:2012.13778  [pdf, ps, other

    eess.IV cs.CV

    Evaluation and Comparison of Edge-Preserving Filters

    Authors: Sarah Gingichashvili, Dani Lischinski

    Abstract: Edge-preserving filters play an essential role in some of the most basic tasks of computational photography, such as abstraction, tonemap**, detail enhancement and texture removal, to name a few. The abundance and diversity of smoothing operators, accompanied by a lack of methodology to evaluate output quality and/or perform an unbiased comparison between them, could lead to misunderstanding and… ▽ More

    Submitted 26 December, 2020; originally announced December 2020.

  29. arXiv:2011.12799  [pdf, other

    cs.CV cs.GR cs.LG

    StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

    Authors: Zongze Wu, Dani Lischinski, Eli Shechtman

    Abstract: We explore and analyze the latent style space of StyleGAN2, a state-of-the-art architecture for image generation, using models pretrained on several different datasets. We first show that StyleSpace, the space of channel-wise style parameters, is significantly more disentangled than the other intermediate latent spaces explored by previous works. Next, we describe a method for discovering a large… ▽ More

    Submitted 3 December, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: 25 pages, 21 figures

  30. Differentiable Refraction-Tracing for Mesh Reconstruction of Transparent Objects

    Authors: Jiahui Lyu, Bojian Wu, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: Capturing the 3D geometry of transparent objects is a challenging task, ill-suited for general-purpose scanning and reconstruction techniques, since these cannot handle specular light transport phenomena. Existing state-of-the-art methods, designed specifically for this task, either involve a complex setup to reconstruct complete refractive ray paths, or leverage a data-driven approach based on sy… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: 13 pages, 21 figures

    Journal ref: ACM Trans. on Graphics (Proc. of SIGGRAPH Asia 2020)

  31. arXiv:2009.02969  [pdf, other

    cs.GR

    Palettailor: Discriminable Colorization for Categorical Data

    Authors: Kecheng Lu, Mi Feng, Xin Chen, Michael Sedlmair, Oliver Deussen, Dani Lischinski, Zhanglin Cheng, Yunhai Wang

    Abstract: We present an integrated approach for creating and assigning color palettes to different visualizations such as multi-class scatterplots, line, and bar charts. While other methods separate the creation of colors from their assignment, our approach takes data characteristics into account to produce color palettes, which are then assigned in a way that fosters better visual discrimination of classes… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: 10 pages

  32. arXiv:2006.12075  [pdf, other

    cs.CV cs.GR cs.LG

    MotioNet: 3D Human Motion Reconstruction from Monocular Video with Skeleton Consistency

    Authors: Mingyi Shi, Kfir Aberman, Andreas Aristidou, Taku Komura, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen

    Abstract: We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human skeleton from monocular video.While previous methods rely on either rigging or inverse kinematics (IK) to associate a consistent skeleton with temporally coherent joint rotations, our method is the first data-driven approach that directly outputs a kinematic skeleton, which is a complete, commonly used… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Accepted to Transactions on Graphics (ToG) 2020. Project page: {https://rubbly.cn/publications/motioNet} Video: {https://youtu.be/8YubchlzvFA}

    ACM Class: I.4.5

    Journal ref: ACM Transaction on Graphics, 40(1), Article 1, 2020

  33. arXiv:2006.12030  [pdf, other

    cs.CV eess.IV

    DO-Conv: Depthwise Over-parameterized Convolutional Layer

    Authors: **ming Cao, Yangyan Li, Mingchao Sun, Ying Chen, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen, Changhe Tu

    Abstract: Convolutional layers are the core building blocks of Convolutional Neural Networks (CNNs). In this paper, we propose to augment a convolutional layer with an additional depthwise convolution, where each input channel is convolved with a different 2D kernel. The composition of the two convolutions constitutes an over-parameterization, since it adds learnable parameters, while the resulting linear o… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

  34. arXiv:2005.05751  [pdf, other

    cs.GR cs.CV cs.LG

    Unpaired Motion Style Transfer from Video to Animation

    Authors: Kfir Aberman, Yijia Weng, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen

    Abstract: Transferring the motion style from one animation clip to another, while preserving the motion content of the latter, has been a long-standing problem in character animation. Most existing data-driven approaches are supervised and rely on paired data, where motions with the same content are performed in different styles. In addition, these approaches are limited to transfer of styles that were seen… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: SIGGRAPH 2020. Project page: https://deepmotionediting.github.io/style_transfer , Video: https://www.youtube.com/watch?v=m04zuBSdGrc , Code: https://github.com/DeepMotionEditing/deep-motion-editing

  35. arXiv:2005.05732  [pdf, other

    cs.CV cs.GR cs.LG

    Skeleton-Aware Networks for Deep Motion Retargeting

    Authors: Kfir Aberman, Peizhuo Li, Dani Lischinski, Olga Sorkine-Hornung, Daniel Cohen-Or, Baoquan Chen

    Abstract: We introduce a novel deep learning framework for data-driven motion retargeting between skeletons, which may have different structure, yet corresponding to homeomorphic graphs. Importantly, our approach learns how to retarget without requiring any explicit pairing between the motions in the training set. We leverage the fact that different homeomorphic skeletons may be reduced to a common primal s… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: SIGGRAPH 2020. Project page: https://deepmotionediting.github.io/retargeting , Video: https://www.youtube.com/watch?v=ym8Tnmiz5N8

  36. arXiv:2001.03640  [pdf, other

    cs.GR cs.LG

    Unsupervised multi-modal Styled Content Generation

    Authors: Omry Sendik, Dani Lischinski, Daniel Cohen-Or

    Abstract: The emergence of deep generative models has recently enabled the automatic generation of massive amounts of graphical content, both in 2D and in 3D. Generative Adversarial Networks (GANs) and style control mechanisms, such as Adaptive Instance Normalization (AdaIN), have proved particularly effective in this context, culminating in the state-of-the-art StyleGAN architecture. While such models are… ▽ More

    Submitted 27 April, 2020; v1 submitted 10 January, 2020; originally announced January 2020.

  37. arXiv:1906.05526  [pdf, other

    cs.CV

    Illuminant Chromaticity Estimation from Interreflections

    Authors: Eytan Lifshitz, Dani Lischinski

    Abstract: Reliable estimation of illuminant chromaticity is crucial for simulating color constancy and for white balancing digital images. However, estimating illuminant chromaticity from a single image is an ill-posed task, in general, and existing solutions typically employ a variety of assumptions and heuristics. In this paper, we present a new, physically-based, approach for estimating illuminant chroma… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    ACM Class: I.4.8

  38. arXiv:1906.01526  [pdf, other

    cs.CV

    Cross-Domain Cascaded Deep Feature Translation

    Authors: Oren Katzir, Dani Lischinski, Daniel Cohen-Or

    Abstract: In recent years we have witnessed tremendous progress in unpaired image-to-image translation methods, propelled by the emergence of DNNs and adversarial training strategies. However, most existing methods focus on transfer of style and appearance, rather than on shape translation. The latter task is challenging, due to its intricate non-local nature, which calls for additional supervision. We miti… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

  39. Learning Character-Agnostic Motion for Motion Retargeting in 2D

    Authors: Kfir Aberman, Rundi Wu, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or

    Abstract: Analyzing human motion is a challenging task with a wide variety of applications in computer vision and in graphics. One such application, of particular importance in computer animation, is the retargeting of motion from one performer to another. While humans move in three dimensions, the vast majority of human motions are captured using video, requiring 2D-to-3D pose and camera recovery, before e… ▽ More

    Submitted 5 May, 2019; originally announced May 2019.

    Comments: SIGGRAPH 2019. arXiv admin note: text overlap with arXiv:1804.05653 by other authors

  40. arXiv:1901.04530  [pdf, other

    cs.LG cs.CV stat.ML

    CrossNet: Latent Cross-Consistency for Unpaired Image Translation

    Authors: Omry Sendik, Dani Lischinski, Daniel Cohen-Or

    Abstract: Recent GAN-based architectures have been able to deliver impressive performance on the general task of image-to-image translation. In particular, it was shown that a wide variety of image translation operators may be learned from two image sets, containing images from two different domains, without establishing an explicit pairing between the images. This was made possible by introducing clever re… ▽ More

    Submitted 26 May, 2019; v1 submitted 14 January, 2019; originally announced January 2019.

  41. arXiv:1808.06847  [pdf, other

    cs.CV cs.GR

    Deep Video-Based Performance Cloning

    Authors: Kfir Aberman, Mingyi Shi, **g Liao, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or

    Abstract: We present a new video-based performance cloning technique. After training a deep generative network using a reference video capturing the appearance and dynamics of a target actor, we are able to generate videos where this actor reenacts other performances. All of the training data and the driving performances are provided as ordinary video segments, without motion capture or depth information. O… ▽ More

    Submitted 21 August, 2018; originally announced August 2018.

  42. SAGNet:Structure-aware Generative Network for 3D-Shape Modeling

    Authors: Zhijie Wu, Xiang Wang, Di Lin, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: We present SAGNet, a structure-aware generative model for 3D shapes. Given a set of segmented objects of a certain class, the geometry of their parts and the pairwise relationships between them (the structure) are jointly learned and embedded in a latent space by an autoencoder. The encoder intertwines the geometry and structure features into a single latent code, while the decoder disentangles th… ▽ More

    Submitted 14 November, 2019; v1 submitted 12 August, 2018; originally announced August 2018.

    Comments: Accepted by SIGGRAPH 2019

    Journal ref: ACM Transactions on Graphics (SIGGRAPH 2019) 38, 4, Article 91

  43. arXiv:1805.08019  [pdf, other

    cs.CV

    DiDA: Disentangled Synthesis for Domain Adaptation

    Authors: **ming Cao, Oren Katzir, Peng Jiang, Dani Lischinski, Danny Cohen-Or, Changhe Tu, Yangyan Li

    Abstract: Unsupervised domain adaptation aims at learning a shared model for two related, but not identical, domains by leveraging supervision from a source domain to an unsupervised target domain. A number of effective domain adaptation approaches rely on the ability to extract discriminative, yet domain-invariant, latent factors which are common to both domains. Extracting latent commonality is also usefu… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

  44. Non-Stationary Texture Synthesis by Adversarial Expansion

    Authors: Yang Zhou, Zhen Zhu, Xiang Bai, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: The real world exhibits an abundance of non-stationary textures. Examples include textures with large-scale structures, as well as spatially variant and inhomogeneous textures. While existing example-based texture synthesis methods can cope well with stationary textures, non-stationary textures still pose a considerable challenge, which remains unresolved. In this paper, we propose a new approach… ▽ More

    Submitted 11 May, 2018; originally announced May 2018.

    Comments: Accepted to SIGGRAPH 2018

    Journal ref: ACM Trans. Graph. 37, 4, Article 49 (August 2018), 13 pages

  45. Neural Best-Buddies: Sparse Cross-Domain Correspondence

    Authors: Kfir Aberman, **g Liao, Mingyi Shi, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or

    Abstract: Correspondence between images is a fundamental problem in computer vision, with a variety of graphics applications. This paper presents a novel method for sparse cross-domain correspondence. Our method is designed for pairs of images where the main objects of interest may belong to different semantic categories and differ drastically in shape and appearance, yet still contain semantically related… ▽ More

    Submitted 21 August, 2018; v1 submitted 10 May, 2018; originally announced May 2018.

    Comments: SIGGRAPH 2018

  46. arXiv:1711.08278  [pdf, other

    cs.CV

    Neuron-level Selective Context Aggregation for Scene Segmentation

    Authors: Zhenhua Wang, Fanglin Gu, Dani Lischinski, Daniel Cohen-Or, Changhe Tu, Baoquan Chen

    Abstract: Contextual information provides important cues for disambiguating visually similar pixels in scene segmentation. In this paper, we introduce a neuron-level Selective Context Aggregation (SCA) module for scene segmentation, comprised of a contextual dependency predictor and a context aggregation operator. The dependency predictor is implicitly trained to infer contextual dependencies between differ… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

  47. arXiv:1608.05180  [pdf, other

    cs.CV

    A Holistic Approach for Data-Driven Object Cutout

    Authors: Huayong Xu, Yangyan Li, Wenzheng Chen, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen

    Abstract: Object cutout is a fundamental operation for image editing and manipulation, yet it is extremely challenging to automate it in real-world images, which typically contain considerable background clutter. In contrast to existing cutout methods, which are based mainly on low-level image analysis, we propose a more holistic approach, which considers the entire shape of the object of interest by levera… ▽ More

    Submitted 16 September, 2016; v1 submitted 18 August, 2016; originally announced August 2016.

  48. arXiv:1604.02703  [pdf, other

    cs.CV

    Synthesizing Training Images for Boosting Human 3D Pose Estimation

    Authors: Wenzheng Chen, Huan Wang, Yangyan Li, Hao Su, Zhenhua Wang, Changhe Tu, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen

    Abstract: Human 3D pose estimation from a single image is a challenging task with numerous applications. Convolutional Neural Networks (CNNs) have recently achieved superior performance on the task of 2D pose estimation from a single image, by training on images with 2D annotations collected by crowd sourcing. This suggests that similar success could be achieved for direct estimation of 3D poses. However, 3… ▽ More

    Submitted 5 January, 2017; v1 submitted 10 April, 2016; originally announced April 2016.

  49. arXiv:1510.03023  [pdf, other

    cs.GR

    Printed Perforated Lampshades for Continuous Projective Images

    Authors: Haisen Zhao, Lin Lu, Yuan Wei, Dani Lischinski, Andrei Sharf, Daniel Cohen-Or, Baoquan Chen

    Abstract: We present a technique for designing 3D-printed perforated lampshades, which project continuous grayscale images onto the surrounding walls. Given the geometry of the lampshade and a target grayscale image, our method computes a distribution of tiny holes over the shell, such that the combined footprints of the light emanating through the holes form the target image on a nearby diffuse surface. Ou… ▽ More

    Submitted 11 October, 2015; originally announced October 2015.

    Comments: 10 pages

    ACM Class: I.3.3; I.3.8