Skip to main content

Showing 1–13 of 13 results for author: Levinshtein, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.17880  [pdf, other

    cs.CV

    Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations

    Authors: Tristan Aumentado-Armstrong, Ashkan Mirzaei, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

    Abstract: Neural Radiance Fields (NeRFs) have proven to be powerful 3D representations, capable of high quality novel view synthesis of complex scenes. While NeRFs have been applied to graphics, vision, and robotics, problems with slow rendering speed and characteristic visual artifacts prevent adoption in many use cases. In this work, we investigate combining an autoencoder (AE) with a NeRF, in which laten… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    ACM Class: I.2.10

  2. arXiv:2309.08826  [pdf, other

    cs.CV

    Dual-Camera Joint Deblurring-Denoising

    Authors: Shayan Shekarforoush, Amanpreet Walia, Marcus A. Brubaker, Konstantinos G. Derpanis, Alex Levinshtein

    Abstract: Recent image enhancement methods have shown the advantages of using a pair of long and short-exposure images for low-light photography. These image modalities offer complementary strengths and weaknesses. The former yields an image that is clean but blurry due to camera or object motion, whereas the latter is sharp but noisy due to low photon count. Motivated by the fact that modern smartphones co… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Project webpage: http://shekshaa.github.io/Joint-Deblurring-Denoising/

  3. arXiv:2308.08947  [pdf, other

    cs.CV

    Watch Your Steps: Local Image and Scene Editing by Text Instructions

    Authors: Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

    Abstract: Denoising diffusion models have enabled high-quality image generation and editing. We present a method to localize the desired edit region implicit in a text instruction. We leverage InstructPix2Pix (IP2P) and identify the discrepancy between IP2P predictions with and without the instruction. This discrepancy is referred to as the relevance map. The relevance map conveys the importance of changing… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Project page: https://ashmrz.github.io/WatchYourSteps/

    Journal ref: European Conference on Computer Vision (ECCV) 2024

  4. arXiv:2304.09677  [pdf, other

    cs.CV

    Reference-guided Controllable Inpainting of Neural Radiance Fields

    Authors: Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

    Abstract: The popularity of Neural Radiance Fields (NeRFs) for view synthesis has led to a desire for NeRF editing tools. Here, we focus on inpainting regions in a view-consistent and controllable manner. In addition to the typical NeRF inputs and masks delineating the unwanted region in each view, we require only a single inpainted view of the scene, i.e., a reference view. We use monocular depth estimator… ▽ More

    Submitted 20 April, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: Project Page: https://ashmrz.github.io/reference-guided-3d

  5. arXiv:2301.10759  [pdf, other

    cs.CV

    Efficient Flow-Guided Multi-frame De-fencing

    Authors: Stavros Tsogkas, Fengjia Zhang, Allan Jepson, Alex Levinshtein

    Abstract: Taking photographs ''in-the-wild'' is often hindered by fence obstructions that stand between the camera user and the scene of interest, and which are hard or impossible to avoid. De-fencing is the algorithmic process of automatically removing such obstructions from images, revealing the invisible parts of the scene. While this problem can be formulated as a combination of fence segmentation and i… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 16 pages, 12 figures. Published at the Winter Conference on Application of Computer Vision (WACV) 2023

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023, pp. 1838-1847

  6. arXiv:2211.12254  [pdf, other

    cs.CV

    SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields

    Authors: Ashkan Mirzaei, Tristan Aumentado-Armstrong, Konstantinos G. Derpanis, Jonathan Kelly, Marcus A. Brubaker, Igor Gilitschenski, Alex Levinshtein

    Abstract: Neural Radiance Fields (NeRFs) have emerged as a popular approach for novel view synthesis. While NeRFs are quickly being adapted for a wider set of applications, intuitively editing NeRF scenes is still an open challenge. One important editing task is the removal of unwanted objects from a 3D scene, such that the replaced region is visually plausible and consistent with its context. We refer to t… ▽ More

    Submitted 15 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Project Page: https://spinnerf3d.github.io

    Journal ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  7. arXiv:2206.02715  [pdf, other

    cs.CV eess.IV

    Day-to-Night Image Synthesis for Training Nighttime Neural ISPs

    Authors: Abhijith Punnappurath, Abdullah Abuolaim, Abdelrahman Abdelhamed, Alex Levinshtein, Michael S. Brown

    Abstract: Many flagship smartphone cameras now use a dedicated neural image signal processor (ISP) to render noisy raw sensor images to the final processed output. Training nightmode ISP networks relies on large-scale datasets of image pairs with: (1) a noisy raw image captured with a short exposure and a high ISO gain; and (2) a ground truth low-noise raw image captured with a long exposure and low ISO tha… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  8. GraN-GAN: Piecewise Gradient Normalization for Generative Adversarial Networks

    Authors: Vineeth S. Bhaskara, Tristan Aumentado-Armstrong, Allan Jepson, Alex Levinshtein

    Abstract: Modern generative adversarial networks (GANs) predominantly use piecewise linear activation functions in discriminators (or critics), including ReLU and LeakyReLU. Such models learn piecewise linear map**s, where each piece handles a subset of the input space, and the gradients per subset are piecewise constant. Under such a class of discriminator (or critic) functions, we present Gradient Norma… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: WACV 2022 Main Conference Paper (Submitted: 18 Aug 2021, Accepted: 4 Oct 2021)

    Journal ref: 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 2432-2441

  9. arXiv:2011.08026  [pdf, other

    cs.CV cs.LG

    Cycle-Consistent Generative Rendering for 2D-3D Modality Translation

    Authors: Tristan Aumentado-Armstrong, Alex Levinshtein, Stavros Tsogkas, Konstantinos G. Derpanis, Allan D. Jepson

    Abstract: For humans, visual understanding is inherently generative: given a 3D shape, we can postulate how it would look in the world; given a 2D image, we can infer the 3D structure that likely gave rise to it. We can thus translate between the 2D visual and 3D structural modalities of a given object. In the context of computer vision, this corresponds to a learnable module that serves two purposes: (i) g… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

    Comments: 3DV 2020 (oral). Project page: https://ttaa9.github.io/genren/

    ACM Class: I.2.10; I.2.6

  10. arXiv:2009.06943  [pdf, other

    eess.IV cs.CV

    AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, **gwen He, Yu Qiao, Chao Dong, Xiaotong Luo, Liang Chen, Jiangtao Zhang, Maitreya Suin , et al. (60 additional authors not shown)

    Abstract: This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor x4 based on a set of prior examples of low and corresponding high resolution images. The goal is to devise a network that reduces one or several aspects such as runtime, parameter co… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  11. arXiv:1712.07168  [pdf, other

    cs.CV

    Real-time deep hair matting on mobile devices

    Authors: Alex Levinshtein, Cheng Chang, Edmund Phung, Irina Kezele, Wenzhangzhi Guo, Parham Aarabi

    Abstract: Augmented reality is an emerging technology in many application domains. Among them is the beauty industry, where live virtual try-on of beauty products is of great importance. In this paper, we address the problem of live hair color augmentation. To achieve this goal, hair needs to be segmented quickly and accurately. We show how a modified MobileNet CNN architecture can be used to segment the ha… ▽ More

    Submitted 10 January, 2018; v1 submitted 19 December, 2017; originally announced December 2017.

    Comments: 7 pages, 7 figures, submitted to CRV 2018

  12. arXiv:1712.02822  [pdf, other

    cs.CV

    Hybrid eye center localization using cascaded regression and hand-crafted model fitting

    Authors: Alex Levinshtein, Edmund Phung, Parham Aarabi

    Abstract: We propose a new cascaded regressor for eye center detection. Previous methods start from a face or an eye detector and use either advanced features or powerful regressors for eye center localization, but not both. Instead, we detect the eyes more accurately using an existing facial feature alignment method. We improve the robustness of localization by using both advanced features and powerful reg… ▽ More

    Submitted 7 December, 2017; originally announced December 2017.

    Comments: 12 pages, 5 figures, submitted to Journal of Image and Vision Computing

  13. arXiv:1502.01761  [pdf, other

    cs.CV

    A Framework for Symmetric Part Detection in Cluttered Scenes

    Authors: Tom Lee, Sanja Fidler, Alex Levinshtein, Cristian Sminchisescu, Sven Dickinson

    Abstract: The role of symmetry in computer vision has waxed and waned in importance during the evolution of the field from its earliest days. At first figuring prominently in support of bottom-up indexing, it fell out of favor as shape gave way to appearance and recognition gave way to detection. With a strong prior in the form of a target object, the role of the weaker priors offered by perceptual grou**… ▽ More

    Submitted 5 February, 2015; originally announced February 2015.

    Comments: 10 pages, 8 figures