Skip to main content

Showing 1–8 of 8 results for author: Reda, F A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2203.14863  [pdf, other

    cs.CV cs.MM

    HIME: Efficient Headshot Image Super-Resolution with Multiple Exemplars

    Authors: Xiaoyu Xiang, Jon Morton, Fitsum A Reda, Lucas Young, Federico Perazzi, Rakesh Ranjan, Amit Kumar, Andrea Colaco, Jan Allebach

    Abstract: A promising direction for recovering the lost information in low-resolution headshot images is utilizing a set of high-resolution exemplars from the same identity. Complementary images in the reference set can improve the generated headshot quality across many different views and poses. However, it is challenging to make the best use of multiple exemplars: the quality and alignment of each exempla… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Technical Report

  2. arXiv:2103.01524  [pdf, other

    eess.IV cs.CV cs.LG

    Feature-Align Network with Knowledge Distillation for Efficient Denoising

    Authors: Lucas D. Young, Fitsum A. Reda, Rakesh Ranjan, Jon Morton, Jun Hu, Yazhu Ling, Xiaoyu Xiang, David Liu, Vikas Chandra

    Abstract: We propose an efficient neural network for RAW image denoising. Although neural network-based denoising has been extensively studied for image restoration, little attention has been given to efficient denoising for compute limited and power sensitive devices, such as smartphones and smartwatches. In this paper, we present a novel architecture and a suite of training techniques for high quality den… ▽ More

    Submitted 17 March, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    MSC Class: 94A08 (Primary) 68T07; 65D19 (Secondary) ACM Class: I.4.5; I.2.6

  3. arXiv:2007.07243  [pdf, other

    cs.CV cs.GR

    Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter

    Authors: Guilin Liu, Rohan Taori, Ting-Chun Wang, Zhiding Yu, Shiqiu Liu, Fitsum A. Reda, Karan Sapra, Andrew Tao, Bryan Catanzaro

    Abstract: Conventional CNNs for texture synthesis consist of a sequence of (de)-convolution and up/down-sampling layers, where each layer operates locally and lacks the ability to capture the long-term structural dependency required by texture synthesis. Thus, they often simply enlarge the input texture, rather than perform reasonable synthesis. As a compromise, many recent methods sacrifice generalizabilit… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  4. arXiv:1906.05928  [pdf, other

    cs.CV

    Unsupervised Video Interpolation Using Cycle Consistency

    Authors: Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro

    Abstract: Learning to synthesize high frame rate videos via interpolation requires large quantities of high frame rate training videos, which, however, are scarce, especially at high resolutions. Here, we propose unsupervised techniques to synthesize high frame rate videos directly from low frame rate videos using cycle consistency. For a triplet of consecutive frames, we optimize models to minimize the dis… ▽ More

    Submitted 27 March, 2021; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Published in ICCV 2019. Codes are available at https://github.com/NVIDIA/unsupervised-video-interpolation. Project website https://nv-adlr.github.io/publication/2019-UnsupervisedVideoInterpolation

  5. arXiv:1812.01593  [pdf, other

    cs.CV cs.AI cs.MM cs.RO

    Improving Semantic Segmentation via Video Propagation and Label Relaxation

    Authors: Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn Newsam, Andrew Tao, Bryan Catanzaro

    Abstract: Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels.… ▽ More

    Submitted 2 July, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: CVPR 2019 Oral. Code link: https://github.com/NVIDIA/semantic-segmentation. YouTube link: https://www.youtube.com/watch?v=aEbXjGZDZSQ

  6. arXiv:1811.11718  [pdf, other

    cs.CV

    Partial Convolution based Padding

    Authors: Guilin Liu, Kevin J. Shih, Ting-Chun Wang, Fitsum A. Reda, Karan Sapra, Zhiding Yu, Andrew Tao, Bryan Catanzaro

    Abstract: In this paper, we present a simple yet effective padding scheme that can be used as a drop-in module for existing convolutional neural networks. We call it partial convolution based padding, with the intuition that the padded region can be treated as holes and the original input as non-holes. Specifically, during the convolution operation, the convolution results are re-weighted near image borders… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

    Comments: 11 pages; code is available at https://github.com/NVIDIA/partialconv

  7. arXiv:1811.00684  [pdf, other

    cs.CV

    SDCNet: Video Prediction Using Spatially-Displaced Convolution

    Authors: Fitsum A. Reda, Guilin Liu, Kevin J. Shih, Robert Kirby, Jon Barker, David Tarjan, Andrew Tao, Bryan Catanzaro

    Abstract: We present an approach for high-resolution video frame prediction by conditioning on both past frames and past optical flows. Previous approaches rely on resampling past frames, guided by a learned future optical flow, or on direct generation of pixels. Resampling based on flow is insufficient because it cannot deal with disocclusions. Generative models currently lead to blurry results. Recent app… ▽ More

    Submitted 27 March, 2021; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Published in ECCV 2018. Codes available at https://github.com/NVIDIA/semantic-segmentation/tree/sdcnet/sdcnet. Project page available at https://nv-adlr.github.io/publication/2018-SDCNet

  8. arXiv:1804.07723  [pdf, other

    cs.CV

    Image Inpainting for Irregular Holes Using Partial Convolutions

    Authors: Guilin Liu, Fitsum A. Reda, Kevin J. Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro

    Abstract: Existing deep learning based image inpainting methods use a standard convolutional network over the corrupted image, using convolutional filter responses conditioned on both valid pixels as well as the substitute values in the masked holes (typically the mean value). This often leads to artifacts such as color discrepancy and blurriness. Post-processing is usually used to reduce such artifacts, bu… ▽ More

    Submitted 15 December, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

    Comments: Update: camera-ready; L1 loss is size-averaged; code of partial conv layer: https://github.com/NVIDIA/partialconv. Published at ECCV 2018