Skip to main content

Showing 51–94 of 94 results for author: Shechtman, E

.
  1. arXiv:2102.03141  [pdf, other

    cs.CV

    CharacterGAN: Few-Shot Keypoint Character Animation and Reposing

    Authors: Tobias Hinz, Matthew Fisher, Oliver Wang, Eli Shechtman, Stefan Wermter

    Abstract: We introduce CharacterGAN, a generative model that can be trained on only a few samples (8 - 15) of a given character. Our model generates novel poses based on keypoint locations, which can be modified in real time while providing interactive feedback, allowing for intuitive reposing and animation. Since we only have very limited training samples, one of the key challenges lies in how to address (… ▽ More

    Submitted 12 January, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Best Paper WACV 2022. Code available at https://github.com/tohinz/CharacterGAN

  2. arXiv:2012.02992  [pdf, other

    cs.CV

    Spatially-Adaptive Pixelwise Networks for Fast Image Translation

    Authors: Tamar Rott Shaham, Michael Gharbi, Richard Zhang, Eli Shechtman, Tomer Michaeli

    Abstract: We introduce a new generator architecture, aimed at fast and efficient high-resolution image-to-image translation. We design the generator to be an extremely lightweight function of the full-resolution image. In fact, we use pixel-wise networks; that is, each pixel is processed independently of others, through a composition of simple affine transformations and nonlinearities. We take three importa… ▽ More

    Submitted 5 December, 2020; originally announced December 2020.

  3. arXiv:2012.02780  [pdf, other

    cs.CV

    Few-shot Image Generation with Elastic Weight Consolidation

    Authors: Yijun Li, Richard Zhang, **gwan Lu, Eli Shechtman

    Abstract: Few-shot image generation seeks to generate more data of a given domain, with only few available training examples. As it is unreasonable to expect to fully infer the distribution from just a few observations (e.g., emojis), we seek to leverage a large, related source domain as pretraining (e.g., human faces). Thus, we wish to preserve the diversity of the source domain, while adapting to the appe… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: Accepted by NeurIPS 2020, see https://yijunmaverick.github.io/publications/ewc/

  4. arXiv:2011.12799  [pdf, other

    cs.CV cs.GR cs.LG

    StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

    Authors: Zongze Wu, Dani Lischinski, Eli Shechtman

    Abstract: We explore and analyze the latent style space of StyleGAN2, a state-of-the-art architecture for image generation, using models pretrained on several different datasets. We first show that StyleSpace, the space of channel-wise style parameters, is significantly more disentangled than the other intermediate latent spaces explored by previous works. Next, we describe a method for discovering a large… ▽ More

    Submitted 3 December, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: 25 pages, 21 figures

  5. arXiv:2008.05413  [pdf, other

    cs.CV

    Look here! A parametric learning based approach to redirect visual attention

    Authors: Youssef Alami Mejjati, Celso F. Gomez, Kwang In Kim, Eli Shechtman, Zoya Bylinskii

    Abstract: Across photography, marketing, and website design, being able to direct the viewer's attention is a powerful tool. Motivated by professional workflows, we introduce an automatic method to make an image region more attention-capturing via subtle image edits that maintain realism and fidelity to the original. From an input image and a user-provided mask, our GazeShiftNet model predicts a distinct se… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: To appear in ECCV 2020

  6. arXiv:2007.00653  [pdf, other

    cs.CV cs.GR cs.LG

    Swap** Autoencoder for Deep Image Manipulation

    Authors: Taesung Park, Jun-Yan Zhu, Oliver Wang, **gwan Lu, Eli Shechtman, Alexei A. Efros, Richard Zhang

    Abstract: Deep generative models have become increasingly effective at producing realistic images from randomly sampled seeds, but using such models for controllable manipulation of existing images remains challenging. We propose the Swap** Autoencoder, a deep model designed specifically for image manipulation, rather than random sampling. The key idea is to encode an image with two independent components… ▽ More

    Submitted 14 December, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020. Please visit https://taesung.me/Swap**Autoencoder/ for an introductory video. v2 mainly contains reorganization of the Introduction and Broader Impact section

  7. arXiv:2005.11742  [pdf, other

    cs.CV cs.MM

    High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling

    Authors: Yu Zeng, Zhe Lin, Jimei Yang, Jianming Zhang, Eli Shechtman, Huchuan Lu

    Abstract: Existing image inpainting methods often produce artifacts when dealing with large holes in real applications. To address this challenge, we propose an iterative inpainting method with a feedback mechanism. Specifically, we introduce a deep generative model which not only outputs an inpainting result but also a corresponding confidence map. Using this map as feedback, it progressively fills the hol… ▽ More

    Submitted 14 July, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

  8. arXiv:2004.14071  [pdf, other

    cs.GR cs.CV cs.LG

    Image Morphing with Perceptual Constraints and STN Alignment

    Authors: Noa Fish, Richard Zhang, Lilach Perry, Daniel Cohen-Or, Eli Shechtman, Connelly Barnes

    Abstract: In image morphing, a sequence of plausible frames are synthesized and composited together to form a smooth transformation between given instances. Intermediates must remain faithful to the input, stand on their own as members of the set, and maintain a well-paced visual transition from one to the next. In this paper, we propose a conditional GAN morphing framework operating on a pair of input imag… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    ACM Class: I.3.3

  9. MakeItTalk: Speaker-Aware Talking-Head Animation

    Authors: Yang Zhou, Xintong Han, Eli Shechtman, Jose Echevarria, Evangelos Kalogerakis, Dingzeyu Li

    Abstract: We present a method that generates expressive talking heads from a single facial image with audio as the only input. In contrast to previous approaches that attempt to learn direct map**s from audio to raw pixels or points for creating talking faces, our method first disentangles the content and speaker information in the input audio signal. The audio content robustly controls the motion of lips… ▽ More

    Submitted 25 February, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: SIGGRAPH Asia 2020, 15 pages, 13 figures

  10. arXiv:2004.03805  [pdf, other

    cs.CV cs.GR

    State of the Art on Neural Rendering

    Authors: Ayush Tewari, Ohad Fried, Justus Thies, Vincent Sitzmann, Stephen Lombardi, Kalyan Sunkavalli, Ricardo Martin-Brualla, Tomas Simon, Jason Saragih, Matthias Nießner, Rohit Pandey, Sean Fanello, Gordon Wetzstein, Jun-Yan Zhu, Christian Theobalt, Maneesh Agrawala, Eli Shechtman, Dan B Goldman, Michael Zollhöfer

    Abstract: Efficient rendering of photo-realistic virtual worlds is a long standing effort of computer graphics. Modern graphics techniques have succeeded in synthesizing photo-realistic images from hand-crafted scene representations. However, the automatic generation of shape, materials, lighting, and other aspects of scenes remains a challenging problem that, if solved, would make photo-realistic computer… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: Eurographics 2020 survey paper

  11. arXiv:2003.12649  [pdf, other

    cs.CV

    Deep CG2Real: Synthetic-to-Real Translation via Image Disentanglement

    Authors: Sai Bi, Kalyan Sunkavalli, Federico Perazzi, Eli Shechtman, Vladimir Kim, Ravi Ramamoorthi

    Abstract: We present a method to improve the visual realism of low-quality, synthetic images, e.g. OpenGL renderings. Training an unpaired synthetic-to-real translation network in image space is severely under-constrained and produces visible artifacts. Instead, we propose a semi-supervised approach that operates on the disentangled shading and albedo layers of the image. Our two-stage pipeline first learns… ▽ More

    Submitted 27 March, 2020; originally announced March 2020.

    Comments: Accepted to ICCV 2019

  12. arXiv:2003.09764  [pdf, other

    cs.CV

    Lifespan Age Transformation Synthesis

    Authors: Roy Or-El, Soumyadip Sengupta, Ohad Fried, Eli Shechtman, Ira Kemelmacher-Shlizerman

    Abstract: We address the problem of single photo age progression and regression-the prediction of how a person might look in the future, or how they looked in the past. Most existing aging methods are limited to changing the texture, overlooking transformations in head shape that occur during the human aging and growth process. This limits the applicability of previous methods to aging of adults to slightly… ▽ More

    Submitted 24 July, 2020; v1 submitted 21 March, 2020; originally announced March 2020.

    Comments: ECCV 2020 Camera-Ready version. Main Changes: 1. Added Ethics & Bias statement in the supplementary material 2. Comparison figures to PyGAN [46] and S2GAN [13] were removed due to copyright issues. These figures can be found in the project's webpage (link is provided in the paper). 3. Added links to the code and dataset (Github)

  13. arXiv:1910.02060  [pdf, other

    cs.CV cs.GR cs.LG

    Neural Puppet: Generative Layered Cartoon Characters

    Authors: Omid Poursaeed, Vladimir G. Kim, Eli Shechtman, Jun Saito, Serge Belongie

    Abstract: We propose a learning based method for generating new animations of a cartoon character given a few example images. Our method is designed to learn from a traditionally animated sequence, where each frame is drawn by an artist, and thus the input images lack any common structure, correspondences, or labels. We express pose changes as a deformation of a layered 2.5D template mesh, and devise a nove… ▽ More

    Submitted 12 October, 2020; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: WACV 2020

  14. arXiv:1909.11081  [pdf, other

    cs.CV cs.LG eess.IV

    Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation

    Authors: Arnab Ghosh, Richard Zhang, Puneet K. Dokania, Oliver Wang, Alexei A. Efros, Philip H. S. Torr, Eli Shechtman

    Abstract: We propose an interactive GAN-based sketch-to-image translation method that helps novice users create images of simple objects. As the user starts to draw a sketch of a desired object type, the network interactively recommends plausible completions, and shows a corresponding synthesized image to the user. This enables a feedback loop, where the user can edit their sketch based on the network's rec… ▽ More

    Submitted 25 September, 2019; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: ICCV 2019, Video Avaiable at https://youtu.be/T9xtpAMUDps

  15. arXiv:1908.07070  [pdf, other

    cs.CV

    UprightNet: Geometry-Aware Camera Orientation Estimation from Single Images

    Authors: Wenqi Xian, Zhengqi Li, Matthew Fisher, Jonathan Eisenmann, Eli Shechtman, Noah Snavely

    Abstract: We introduce UprightNet, a learning-based approach for estimating 2DoF camera orientation from a single RGB image of an indoor scene. Unlike recent methods that leverage deep learning to perform black-box regression from image to orientation parameters, we propose an end-to-end framework that incorporates explicit geometric reasoning. In particular, we design a network that predicts two representa… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  16. arXiv:1906.01524  [pdf, other

    cs.CV cs.GR cs.LG

    Text-based Editing of Talking-head Video

    Authors: Ohad Fried, Ayush Tewari, Michael Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan B Goldman, Kyle Genova, Zeyu **, Christian Theobalt, Maneesh Agrawala

    Abstract: Editing talking-head video to change the speech content or to remove filler words is challenging. We propose a novel method to edit talking-head video based on its transcript to produce a realistic output video in which the dialogue of the speaker has been modified, while maintaining a seamless audio-visual flow (i.e. no jump cuts). Our method automatically annotates an input talking-head video wi… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: A version with higher resolution images can be downloaded from the authors' website

  17. arXiv:1903.08682  [pdf, other

    cs.CV

    Im2Pencil: Controllable Pencil Illustration from Photographs

    Authors: Yijun Li, Chen Fang, Aaron Hertzmann, Eli Shechtman, Ming-Hsuan Yang

    Abstract: We propose a high-quality photo-to-pencil translation method with fine-grained control over the drawing style. This is a challenging task due to multiple stroke types (e.g., outline and shading), structural complexity of pencil shading (e.g., hatching), and the lack of aligned training data pairs. To address these challenges, we develop a two-branch model that learns separate filters for generatin… ▽ More

    Submitted 20 March, 2019; originally announced March 2019.

    Comments: Accepted by CVPR 2019

  18. arXiv:1903.08642  [pdf, other

    cs.CV cs.GR

    Photometric Mesh Optimization for Video-Aligned 3D Object Reconstruction

    Authors: Chen-Hsuan Lin, Oliver Wang, Bryan C. Russell, Eli Shechtman, Vladimir G. Kim, Matthew Fisher, Simon Lucey

    Abstract: In this paper, we address the problem of 3D object mesh reconstruction from RGB videos. Our approach combines the best of multi-view geometric and data-driven methods for 3D reconstruction by optimizing object meshes for multi-view photometric consistency while constraining mesh deformations with a shape prior. We pose this as a piecewise image alignment problem for each mesh face projection. Our… ▽ More

    Submitted 20 March, 2019; originally announced March 2019.

    Comments: Accepted to CVPR 2019 (project page & code: https://chenhsuanlin.bitbucket.io/photometric-mesh-optim/)

  19. arXiv:1901.03447  [pdf, other

    cs.CV

    Texture Mixer: A Network for Controllable Synthesis and Interpolation of Texture

    Authors: Ning Yu, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi, Michal Lukac

    Abstract: This paper addresses the problem of interpolating visual textures. We formulate this problem by requiring (1) by-example controllability and (2) realistic and smooth interpolation among an arbitrary number of texture samples. To solve it we propose a neural network trained simultaneously on a reconstruction task and a generation task, which can project texture examples onto a latent space where th… ▽ More

    Submitted 16 April, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

    Comments: Accepted to CVPR'19

  20. arXiv:1809.01337  [pdf, other

    cs.CV cs.CL

    Localizing Moments in Video with Temporal Language

    Authors: Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, Trevor Darrell, Bryan Russell

    Abstract: Localizing moments in a longer video via natural language queries is a new, challenging task at the intersection of language and video understanding. Though moment localization with natural language is similar to other language and vision tasks like natural language object retrieval in images, moment localization offers an interesting opportunity to model temporal dependencies and reasoning in tex… ▽ More

    Submitted 5 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018

  21. arXiv:1808.04545  [pdf, other

    cs.LG cs.AI cs.CV cs.GR stat.ML

    MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics

    Authors: Xinchen Yan, Akash Rastogi, Ruben Villegas, Kalyan Sunkavalli, Eli Shechtman, Sunil Hadap, Ersin Yumer, Honglak Lee

    Abstract: Long-term human motion can be represented as a series of motion modes---motion sequences that capture short-term temporal dynamics---with transitions between them. We leverage this structure and present a novel Motion Transformation Variational Auto-Encoders (MT-VAE) for learning motion sequence generation. Our model jointly learns a feature embedding for motion modes (that the motion sequence can… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

    Comments: Published at ECCV 2018

  22. arXiv:1808.00449  [pdf, other

    cs.CV

    Learning Blind Video Temporal Consistency

    Authors: Wei-Sheng Lai, Jia-Bin Huang, Oliver Wang, Eli Shechtman, Ersin Yumer, Ming-Hsuan Yang

    Abstract: Applying image processing algorithms independently to each frame of a video often leads to undesired inconsistent results over time. Develo** temporally consistent video-based extensions, however, requires domain knowledge for individual tasks and is unable to generalize to other applications. In this paper, we present an efficient end-to-end approach based on deep recurrent network for enforcin… ▽ More

    Submitted 1 August, 2018; originally announced August 2018.

    Comments: This work is accepted in ECCV 2018. Project website: http://vllab.ucmerced.edu/wlai24/video_consistency/

  23. arXiv:1807.03249  [pdf, other

    cs.GR

    StyleBlit: Fast Example-Based Stylization with Local Guidance

    Authors: Daniel Sýkora, Ondřej Jamriška, **gwan Lu, Eli Shechtman

    Abstract: We present StyleBlit---an efficient example-based style transfer algorithm that can deliver high-quality stylized renderings in real-time on a single-core CPU. Our technique is especially suitable for style transfer applications that use local guidance - descriptive guiding channels containing large spatial variations. Local guidance encourages transfer of content from the source exemplar to the t… ▽ More

    Submitted 9 July, 2018; originally announced July 2018.

  24. arXiv:1804.03189  [pdf, other

    cs.GR

    Deep Painterly Harmonization

    Authors: Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala

    Abstract: Copying an element from a photo and pasting it into a painting is a challenging task. Applying photo compositing techniques in this context yields subpar results that look like a collage --- and existing painterly stylization algorithms, which are global, perform poorly when applied locally. We address these issues with a dedicated algorithm that carefully determines the local statistics to be tra… ▽ More

    Submitted 26 June, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

  25. arXiv:1803.01837  [pdf, other

    cs.CV cs.LG

    ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing

    Authors: Chen-Hsuan Lin, Ersin Yumer, Oliver Wang, Eli Shechtman, Simon Lucey

    Abstract: We address the problem of finding realistic geometric corrections to a foreground object such that it appears natural when composited into a background image. To achieve this, we propose a novel Generative Adversarial Network (GAN) architecture that utilizes Spatial Transformer Networks (STNs) as the generator, which we call Spatial Transformer GANs (ST-GANs). ST-GANs seek image realism by operati… ▽ More

    Submitted 5 March, 2018; originally announced March 2018.

    Comments: Accepted to CVPR 2018 (website & code: https://chenhsuanlin.bitbucket.io/spatial-transformer-GAN/)

  26. arXiv:1801.03924  [pdf, other

    cs.CV cs.GR

    The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

    Authors: Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, Oliver Wang

    Abstract: While it is nearly effortless for humans to quickly assess the perceptual similarity between two images, the underlying processes are thought to be quite complex. Despite this, the most widely used perceptual metrics today, such as PSNR and SSIM, are simple, shallow functions, and fail to account for many nuances of human perception. Recently, the deep learning community has found that features of… ▽ More

    Submitted 10 April, 2018; v1 submitted 11 January, 2018; originally announced January 2018.

    Comments: Accepted to CVPR 2018; Code and data available at https://www.github.com/richzhang/PerceptualSimilarity

  27. arXiv:1712.00516  [pdf, other

    cs.CV

    Multi-Content GAN for Few-Shot Font Style Transfer

    Authors: Samaneh Azadi, Matthew Fisher, Vladimir Kim, Zhaowen Wang, Eli Shechtman, Trevor Darrell

    Abstract: In this work, we focus on the challenge of taking partial observations of highly-stylized text and generalizing the observations to generate unobserved glyphs in the ornamented typeface. To generate a set of multi-content images following a consistent style from very few examples, we propose an end-to-end stacked conditional GAN model considering content along channels and style along network laye… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

  28. arXiv:1711.11586  [pdf, other

    cs.CV cs.GR stat.ML

    Toward Multimodal Image-to-Image Translation

    Authors: Jun-Yan Zhu, Richard Zhang, Deepak Pathak, Trevor Darrell, Alexei A. Efros, Oliver Wang, Eli Shechtman

    Abstract: Many image-to-image translation problems are ambiguous, as a single input image may correspond to multiple possible outputs. In this work, we aim to model a \emph{distribution} of possible outputs in a conditional generative modeling setting. The ambiguity of the map** is distilled in a low-dimensional latent vector, which can be randomly sampled at test time. A generator learns to map the given… ▽ More

    Submitted 23 October, 2018; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: NIPS 2017 Final paper. v4 updated acknowledgment. Website: https://junyanz.github.io/BicycleGAN/

  29. arXiv:1709.09828  [pdf, other

    cs.CV

    Photorealistic Style Transfer with Screened Poisson Equation

    Authors: Roey Mechrez, Eli Shechtman, Lihi Zelnik-Manor

    Abstract: Recent work has shown impressive success in transferring painterly style to images. These approaches, however, fall short of photorealistic style transfer. Even when both the input and reference images are photographs, the output still exhibits distortions reminiscent of a painting. In this paper we propose an approach that takes as input a stylized image and makes it more photorealistic. It relie… ▽ More

    Submitted 28 September, 2017; originally announced September 2017.

    Comments: presented in BMVC 2017

  30. arXiv:1708.02212  [pdf, other

    cs.CV

    Training Deep Networks to be Spatially Sensitive

    Authors: Nicholas Kolkin, Gregory Shakhnarovich, Eli Shechtman

    Abstract: In many computer vision tasks, for example saliency prediction or semantic segmentation, the desired output is a foreground map that predicts pixels where some criteria is satisfied. Despite the inherently spatial nature of this task commonly used learning objectives do not incorporate the spatial relationships between misclassified pixels and the underlying ground truth. The Weighted F-measure, a… ▽ More

    Submitted 7 August, 2017; originally announced August 2017.

    Comments: ICCV 2017

  31. arXiv:1708.01641  [pdf, other

    cs.CV

    Localizing Moments in Video with Natural Language

    Authors: Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, Trevor Darrell, Bryan Russell

    Abstract: We consider retrieving a specific temporal segment, or moment, from a video given a natural language text description. Methods designed to retrieve whole video clips with natural language determine what occurs in a video but not when. To address this issue, we propose the Moment Context Network (MCN) which effectively localizes natural language queries in videos by integrating local and global vid… ▽ More

    Submitted 4 August, 2017; originally announced August 2017.

    Comments: ICCV 2017

  32. arXiv:1704.04131  [pdf, other

    cs.CV

    Neural Face Editing with Intrinsic Image Disentangling

    Authors: Zhixin Shu, Ersin Yumer, Sunil Hadap, Kalyan Sunkavalli, Eli Shechtman, Dimitris Samaras

    Abstract: Traditional face editing methods often require a number of sophisticated and task specific algorithms to be applied one after the other --- a process that is tedious, fragile, and computationally intensive. In this paper, we propose an end-to-end generative adversarial network that infers a face-specific disentangled representation of intrinsic face properties, including shape (i.e. normals), albe… ▽ More

    Submitted 13 April, 2017; originally announced April 2017.

    Comments: CVPR 2017 oral

  33. arXiv:1703.07511  [pdf, other

    cs.CV

    Deep Photo Style Transfer

    Authors: Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala

    Abstract: This paper introduces a deep-learning approach to photographic style transfer that handles a large variety of image content while faithfully transferring the reference style. Our approach builds upon the recent work on painterly transfer that separates style from the content of an image by considering different layers of a neural network. However, as is, this approach is not suitable for photoreal… ▽ More

    Submitted 10 April, 2017; v1 submitted 22 March, 2017; originally announced March 2017.

  34. arXiv:1612.02184  [pdf, other

    cs.CV

    Saliency Driven Image Manipulation

    Authors: Roey Mechrez, Eli Shechtman, Lihi Zelnik-Manor

    Abstract: Have you ever taken a picture only to find out that an unimportant background object ended up being overly salient? Or one of those team sports photos where your favorite player blends with the rest? Wouldn't it be nice if you could tweak these pictures just a little bit so that the distractor would be attenuated and your favorite player will stand-out among her peers? Manipulating images in order… ▽ More

    Submitted 17 January, 2018; v1 submitted 7 December, 2016; originally announced December 2016.

    Comments: to appear in WACV'18

  35. arXiv:1611.09969  [pdf, other

    cs.CV

    High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis

    Authors: Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li

    Abstract: Recent advances in deep learning have shown exciting promise in filling large holes in natural images with semantically plausible and context aware details, impacting fundamental image manipulation tasks such as object removal. While these learning-based methods are significantly more effective in capturing high-level features than prior techniques, they can only handle very low-resolution inputs… ▽ More

    Submitted 13 April, 2017; v1 submitted 29 November, 2016; originally announced November 2016.

  36. arXiv:1611.07865  [pdf, other

    cs.CV

    Controlling Perceptual Factors in Neural Style Transfer

    Authors: Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, Aaron Hertzmann, Eli Shechtman

    Abstract: Neural Style Transfer has shown very exciting results enabling new forms of image manipulation. Here we extend the existing method to introduce control over spatial location, colour information and across spatial scale. We demonstrate how this enhances the method by allowing high-resolution controlled stylisation and helps to alleviate common failure cases such as applying ground textures to sky r… ▽ More

    Submitted 11 May, 2017; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: Accepted at CVPR2017

  37. arXiv:1609.03552  [pdf, other

    cs.CV

    Generative Visual Manipulation on the Natural Image Manifold

    Authors: Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros

    Abstract: Realistic image manipulation is challenging because it requires modifying the image appearance in a user-controlled way, while preserving the realism of the result. Unless the user has considerable artistic skill, it is easy to "fall off" the manifold of natural images while editing. In this paper, we propose to learn the natural image manifold directly from data using a generative adversarial neu… ▽ More

    Submitted 16 December, 2018; v1 submitted 12 September, 2016; originally announced September 2016.

    Comments: In European Conference on Computer Vision (ECCV 2016)

  38. arXiv:1606.05897  [pdf, other

    cs.CV

    Preserving Color in Neural Artistic Style Transfer

    Authors: Leon A. Gatys, Matthias Bethge, Aaron Hertzmann, Eli Shechtman

    Abstract: This note presents an extension to the neural artistic style transfer algorithm (Gatys et al.). The original algorithm transforms an image to have the style of another given image. For example, a photograph can be transformed to have the style of a famous painting. Here we address a potential shortcoming of the original method: the algorithm transfers the colors of the original painting, which can… ▽ More

    Submitted 19 June, 2016; originally announced June 2016.

  39. arXiv:1603.06398  [pdf, other

    cs.CV

    Appearance Harmonization for Single Image Shadow Removal

    Authors: Liqian Ma, Jue Wang, Eli Shechtman, Kalyan Sunkavalli, Shimin Hu

    Abstract: Shadows often create unwanted artifacts in photographs, and removing them can be very challenging. Previous shadow removal methods often produce de-shadowed regions that are visually inconsistent with the rest of the image. In this work we propose a fully automatic shadow region harmonization approach that improves the appearance compatibility of the de-shadowed region as typically produced by pre… ▽ More

    Submitted 21 March, 2016; originally announced March 2016.

  40. arXiv:1510.00477  [pdf, other

    cs.CV

    Learning a Discriminative Model for the Perception of Realism in Composite Images

    Authors: Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros

    Abstract: What makes an image appear realistic? In this work, we are answering this question from a data-driven perspective by learning the perception of visual realism directly from large amounts of data. In particular, we train a Convolutional Neural Network (CNN) model that distinguishes natural photographs from automatically generated composite images. The model learns to predict visual realism of a sce… ▽ More

    Submitted 1 October, 2015; originally announced October 2015.

    Comments: International Conference on Computer Vision (ICCV) 2015

  41. arXiv:1507.03196  [pdf, other

    cs.CV

    DeepFont: Identify Your Font from An Image

    Authors: Zhangyang Wang, Jianchao Yang, Hailin **, Eli Shechtman, Aseem Agarwala, Jonathan Brandt, Thomas S. Huang

    Abstract: As font is one of the core design concepts, automatic font identification and similar font suggestion from an image or photo has been on the wish list of many designers. We study the Visual Font Recognition (VFR) problem, and advance the state-of-the-art remarkably by develo** the DeepFont system. First of all, we build up the first available large-scale VFR dataset, named AdobeVFR, consisting o… ▽ More

    Submitted 12 July, 2015; originally announced July 2015.

    Comments: To Appear in ACM Multimedia as a full paper

  42. arXiv:1504.00028  [pdf, other

    cs.CV cs.LG

    Real-World Font Recognition Using Deep Network and Domain Adaptation

    Authors: Zhangyang Wang, Jianchao Yang, Hailin **, Eli Shechtman, Aseem Agarwala, Jonathan Brandt, Thomas S. Huang

    Abstract: We address a challenging fine-grain classification problem: recognizing a font style from an image of text. In this task, it is very easy to generate lots of rendered font examples but very hard to obtain real-world labeled images. This real-to-synthetic domain gap caused poor generalization to new real data in previous methods (Chen et al. (2014)). In this paper, we refer to Convolutional Neural… ▽ More

    Submitted 31 March, 2015; originally announced April 2015.

  43. arXiv:1412.5758   

    cs.CV

    Decomposition-Based Domain Adaptation for Real-World Font Recognition

    Authors: Zhangyang Wang, Jianchao Yang, Hailin **, Eli Shechtman, Aseem Agarwala, Jonathan Brandt, Thomas S. Huang

    Abstract: We present a domain adaption framework to address a domain mismatch between synthetic training and real-world testing data. We demonstrate our method on a challenging fine-grain classification problem: recognizing a font style from an image of text. In this task, it is very easy to generate lots of rendered font examples but very hard to obtain real-world labeled images. This real-to-synthetic dom… ▽ More

    Submitted 1 April, 2015; v1 submitted 18 December, 2014; originally announced December 2014.

    Comments: This paper has been withdrawn by the author due to project concerns

  44. arXiv:1204.3367  [pdf, other

    cs.SI cs.HC

    Crowdsourcing Gaze Data Collection

    Authors: Dmitry Rudoy, Dan B. Goldman, Eli Shechtman, Lihi Zelnik-Manor

    Abstract: Knowing where people look is a useful tool in many various image and video applications. However, traditional gaze tracking hardware is expensive and requires local study participants, so acquiring gaze location data from a large number of participants is very problematic. In this work we propose a crowdsourced method for acquisition of gaze direction data from a virtually unlimited number of part… ▽ More

    Submitted 16 April, 2012; originally announced April 2012.

    Comments: Presented at Collective Intelligence conference, 2012 (arXiv:1204.2991)

    Report number: CollectiveIntelligence/2012/106