Showing 1–29 of 29 results for author: Barnes, C

Searching in archive cs.
  1. arXiv:2405.15428

    cs.CV cs.LG

    Enhancing Pollinator Conservation towards Agriculture 4.0: Monitoring of Bees through Object Recognition

    Authors: Ajay John Alex, Chloe M. Barnes, Pedro Machado, Isibor Ihianle, Gábor Markó, Martin Bencsik, Jordan J. Bird

    Abstract: In an era of rapid climate change and its adverse effects on food production, technological intervention to monitor pollinator conservation is of paramount importance for environmental monitoring and conservation for global food security. The survival of the human species depends on the conservation of pollinators. This article explores the use of Computer Vision and Object Recognition to autonomo…

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2405.05967

    cs.CV cs.GR cs.LG

    Distilling Diffusion Models into Conditional GANs

    Authors: Minguk Kang, Richard Zhang, Connelly Barnes, Sylvain Paris, Suha Kwak, Jaesik Park, Eli Shechtman, Jun-Yan Zhu, Taesung Park

    Abstract: We propose a method to distill a complex multistep diffusion model into a single-step conditional GAN student model, dramatically accelerating inference, while preserving image quality. Our approach interprets diffusion distillation as a paired image-to-image translation task, using noise-to-image pairs of the diffusion model's ODE trajectory. For efficient regression loss computation, we propose…

    Submitted 13 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Project page: https://mingukkang.github.io/Diffusion2GAN/

  3. arXiv:2310.05590

    cs.CV

    Perceptual Artifacts Localization for Image Synthesis Tasks

    Authors: Lingzhi Zhang, Zhengjie Xu, Connelly Barnes, Yuqian Zhou, Qing Liu, He Zhang, Sohrab Amirghodsi, Zhe Lin, Eli Shechtman, Jianbo Shi

    Abstract: Recent advancements in deep generative models have facilitated the creation of photo-realistic images across various tasks. However, these generated images often exhibit perceptual artifacts in specific regions, necessitating manual correction. In this study, we present a comprehensive empirical examination of Perceptual Artifacts Localization (PAL) spanning diverse image synthesis endeavors. We i…

    Submitted 9 October, 2023; originally announced October 2023.

  4. arXiv:2305.17624

    cs.CV cs.AI

    SimpSON: Simplifying Photo Cleanup with Single-Click Distracting Object Segmentation Network

    Authors: Chuong Huynh, Yuqian Zhou, Zhe Lin, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi, Abhinav Shrivastava

    Abstract: In photo editing, it is common practice to remove visual distractions to improve the overall image quality and highlight the primary subject. However, manually selecting and removing these small and dense distracting regions can be a laborious and time-consuming task. In this paper, we propose an interactive distractor selection method that is optimized to achieve the task with just a single click…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: CVPR 2023. Project link: https://simpson-cvpr23.github.io

  5. arXiv:2304.00221

    cs.CV

    Automatic High Resolution Wire Segmentation and Removal

    Authors: Mang Tik Chiu, Xuaner Zhang, Zijun Wei, Yuqian Zhou, Eli Shechtman, Connelly Barnes, Zhe Lin, Florian Kainz, Sohrab Amirghodsi, Humphrey Shi

    Abstract: Wires and powerlines are common visual distractions that often undermine the aesthetics of photographs. The manual process of precisely segmenting and removing them is extremely tedious and may take up hours, especially on high-resolution photos where wires may span the entire space. In this paper, we present an automatic wire clean-up system that eases the process of wire segmentation and removal…

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: https://github.com/adobe-research/auto-wire-removal

  6. arXiv:2212.06310

    cs.CV cs.GR

    Structure-Guided Image Completion with Image-level and Object-level Semantic Discriminators

    Authors: Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, Connelly Barnes, Jianming Zhang, Qing Liu, Yuqian Zhou, Sohrab Amirghodsi, Jiebo Luo

    Abstract: Structure-guided image completion aims to inpaint a local region of an image according to an input guidance map from users. While such a task enables many practical applications for interactive editing, existing methods often struggle to hallucinate realistic object instances in complex natural scenes. Such a limitation is partially due to the lack of semantic-level constraints inside the hole reg…

    Submitted 23 April, 2024; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: 18 pages, 16 figures

  7. arXiv:2208.03552

    cs.CV

    Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-Curation

    Authors: Lingzhi Zhang, Connelly Barnes, Kevin Wampler, Sohrab Amirghodsi, Eli Shechtman, Zhe Lin, Jianbo Shi

    Abstract: Recently, deep models have established SOTA performance for low-resolution image inpainting, but they lack fidelity at resolutions associated with modern cameras such as 4K or more, and for large holes. We contribute an inpainting benchmark dataset of photos at 4K and above representative of modern sensors. We demonstrate a novel framework that combines deep learning and traditional methods. We us…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 34 pages, 15 figures, ECCV 2022

  8. arXiv:2208.03357

    cs.CV

    Perceptual Artifacts Localization for Inpainting

    Authors: Lingzhi Zhang, Yuqian Zhou, Connelly Barnes, Sohrab Amirghodsi, Zhe Lin, Eli Shechtman, Jianbo Shi

    Abstract: Image inpainting is an essential task for multiple practical applications like object removal and image editing. Deep GAN-based models greatly improve the inpainting performance in structures and textures within the hole, but might also generate unexpected artifacts like broken structures or color blobs. Users perceive these artifacts to judge the effectiveness of inpainting models, and retouch th…

    Submitted 5 August, 2022; originally announced August 2022.

  9. arXiv:2208.00776

    cs.CV

    Deep 360$^\circ$ Optical Flow Estimation Based on Multi-Projection Fusion

    Authors: Yiheng Li, Connelly Barnes, Kun Huang, Fang-Lue Zhang

    Abstract: Optical flow computation is essential in the early stages of the video processing pipeline. This paper focuses on a less explored problem in this area, the 360$^\circ$ optical flow estimation using deep neural networks to support increasingly popular VR applications. To address the distortions of panoramic representations when applying convolutional neural networks, we propose a novel multi-projec…

    Submitted 27 July, 2022; originally announced August 2022.

    Comments: Accepted to ECCV2022

  10. arXiv:2206.04271

    cs.CV

    DeepVerge: Classification of Roadside Verge Biodiversity and Conservation Potential

    Authors: Andrew Perrett, Charlie Barnes, Mark Schofield, Lan Qie, Petra Bosilj, James M. Brown

    Abstract: Open space grassland is being increasingly farmed or built upon, leading to a ramping up of conservation efforts targeting roadside verges. Approximately half of all UK grassland species can be found along the country's 500,000 km of roads, with some 91 species either threatened or near threatened. Careful management of these "wildlife corridors" is therefore essential to preventing species extinc…

    Submitted 9 June, 2022; originally announced June 2022.

    ACM Class: I.4

  11. arXiv:2203.11947

    cs.CV

    CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training

    Authors: Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, Connelly Barnes, Jianming Zhang, Ning Xu, Sohrab Amirghodsi, Jiebo Luo

    Abstract: Recent image inpainting methods have made great progress but often struggle to generate plausible image structures when dealing with large holes in complex images. This is partially due to the lack of effective network structures that can capture both the long-range dependency and high-level semantics of an image. We propose cascaded modulation GAN (CM-GAN), a new network design consisting of an e…

    Submitted 20 July, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: 32 pages, 19 figures

  12. arXiv:2201.08131

    cs.CV

    GeoFill: Reference-Based Image Inpainting with Better Geometric Understanding

    Authors: Yunhan Zhao, Connelly Barnes, Yuqian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless Fowlkes

    Abstract: Reference-guided image inpainting restores image pixels by leveraging the content from another single reference image. The primary challenge is how to precisely place the pixels from the reference image into the hole region. Therefore, understanding the 3D geometry that relates pixels between two views is a crucial step towards building a better model. Given the complexity of handling various type…

    Submitted 8 October, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted to WACV 2023

  13. arXiv:2104.05647

    cs.CV cs.LG eess.IV

    Fruit Quality and Defect Image Classification with Conditional GAN Data Augmentation

    Authors: Jordan J. Bird, Chloe M. Barnes, Luis J. Manso, Anikó Ekárt, Diego R. Faria

    Abstract: Contemporary Artificial Intelligence technologies allow for the employment of Computer Vision to discern good crops from bad, providing a step in the pipeline of selecting healthy fruit from undesirable fruit, such as those which are mouldy or gangrenous. State-of-the-art works in the field report high accuracy results on small datasets (<1000 images), which are not representative of the populatio…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 16 pages, 12 figures, 3 tables

  14. arXiv:2104.03960

    cs.CV cs.GR

    Modulated Periodic Activations for Generalizable Local Functional Representations

    Authors: Ishit Mehta, Michaël Gharbi, Connelly Barnes, Eli Shechtman, Ravi Ramamoorthi, Manmohan Chandraker

    Abstract: Multi-Layer Perceptrons (MLPs) make powerful functional representations for sampling and reconstruction problems involving low-dimensional signals like images, shapes and light fields. Recent works have significantly improved their ability to represent high-frequency content by using periodic activations or positional encodings. This often came at the expense of generalization: modern methods are t…

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: Project Page at https://ishit.github.io/modsine/

  15. arXiv:2103.15982

    cs.CV

    TransFill: Reference-guided Image Inpainting by Merging Multiple Color and Spatial Transformations

    Authors: Yuqian Zhou, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi

    Abstract: Image inpainting is the task of plausibly restoring missing pixels within a hole region that is to be removed from a target image. Most existing technologies exploit patch similarities within the image, or leverage large-scale training data to fill the hole using learned semantic and texture information. However, due to the ill-posed nature of the inpainting task, such methods struggle to complete…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: Accepted by CVPR2021

  16. arXiv:2102.04533

    cs.LG cs.GR

    Learning from Shader Program Traces

    Authors: Yuting Yang, Connelly Barnes, Adam Finkelstein

    Abstract: Deep learning for image processing typically treats input imagery as pixels in some color space. This paper proposes instead to learn from program traces of procedural fragment shaders -- programs that generate images. At each pixel, we collect the intermediate values computed at program execution, and these data form the input to the learned model. We investigate this learning task for a variety…

    Submitted 24 April, 2022; v1 submitted 8 February, 2021; originally announced February 2021.

  17. Deep Learning based Automated Forest Health Diagnosis from Aerial Images

    Authors: Chia-Yen Chiang, Chloe Barnes, Plamen Angelov, Richard Jiang

    Abstract: Global climate change has had a drastic impact on our environment. Previous study showed that pest disaster occurred from global climate change may cause a tremendous number of trees died and they inevitably became a factor of forest fire. An important portent of the forest fire is the condition of forests. Aerial image-based forest analysis can give an early detection of dead trees and living tree…

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 16 pages

    ACM Class: I.4.6; I.4.9; I.2.6; J.2

    Journal ref: IEEE Access, vol. 8, pp. 144064-144076, 2020

  18. arXiv:2007.15068

    cs.CV

    Unselfie: Translating Selfies to Neutral-pose Portraits in the Wild

    Authors: Liqian Ma, Zhe Lin, Connelly Barnes, Alexei A. Efros, Jingwan Lu

    Abstract: Due to the ubiquity of smartphones, it is popular to take photos of one's self, or "selfies." Such photos are convenient to take, because they do not require specialized equipment or a third-party photographer. However, in selfies, constraints such as human arm length often make the body pose look unnatural. To address this issue, we introduce $\textit{unselfie}$, a novel photographic transformati…

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: To appear in ECCV 2020

  19. arXiv:2005.08891

    cs.CV cs.GR

    Generative Tweening: Long-term Inbetweening of 3D Human Motions

    Authors: Yi Zhou, Jingwan Lu, Connelly Barnes, Jimei Yang, Sitao Xiang, Hao Li

    Abstract: The ability to generate complex and realistic human body animations at scale, while following specific artistic constraints, has been a fundamental goal for the game and animation industry for decades. Popular techniques include key-framing, physics-based simulation, and database methods via motion graphs. Recently, motion generators based on deep learning have been introduced. Although these lear…

    Submitted 28 May, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

  20. arXiv:2004.14071

    cs.GR cs.CV cs.LG

    Image Morphing with Perceptual Constraints and STN Alignment

    Authors: Noa Fish, Richard Zhang, Lilach Perry, Daniel Cohen-Or, Eli Shechtman, Connelly Barnes

    Abstract: In image morphing, a sequence of plausible frames are synthesized and composited together to form a smooth transformation between given instances. Intermediates must remain faithful to the input, stand on their own as members of the set, and maintain a well-paced visual transition from one to the next. In this paper, we propose a conditional GAN morphing framework operating on a pair of input imag…

    Submitted 29 April, 2020; originally announced April 2020.

    ACM Class: I.3.3

  21. arXiv:1912.03457

    cs.CL cs.CY

    Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities

    Authors: Pratik Joshi, Christain Barnes, Sebastin Santy, Simran Khanuja, Sanket Shah, Anirudh Srinivasan, Satwik Bhattamishra, Sunayana Sitaram, Monojit Choudhury, Kalika Bali

    Abstract: In this paper, we examine and analyze the challenges associated with developing and introducing language technologies to low-resource language communities. While doing so, we bring to light the successes and failures of past work in this area, challenges being faced in doing so, and what they have achieved. Throughout this paper, we take a problem-facing approach and describe essential factors whi…

    Submitted 7 December, 2019; originally announced December 2019.

    Comments: Accepted at ICON 2019; 9 pages

  22. arXiv:1901.05945

    cs.CV

    Foreground-aware Image Inpainting

    Authors: Wei Xiong, Jiahui Yu, Zhe Lin, Jimei Yang, Xin Lu, Connelly Barnes, Jiebo Luo

    Abstract: Existing image inpainting methods typically fill holes by borrowing information from surrounding pixels. They often produce unsatisfactory results when the holes overlap with or touch foreground objects due to lack of information about the actual extent of foreground and background regions within the holes. These scenarios, however, are very important in practice, especially for applications such…

    Submitted 22 April, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

    Comments: Camera Ready version of CVPR 2019 with supplementary materials

  23. arXiv:1901.03447

    cs.CV

    Texture Mixer: A Network for Controllable Synthesis and Interpolation of Texture

    Authors: Ning Yu, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi, Michal Lukac

    Abstract: This paper addresses the problem of interpolating visual textures. We formulate this problem by requiring (1) by-example controllability and (2) realistic and smooth interpolation among an arbitrary number of texture samples. To solve it we propose a neural network trained simultaneously on a reconstruction task and a generation task, which can project texture examples onto a latent space where th…

    Submitted 16 April, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

    Comments: Accepted to CVPR'19

  24. arXiv:1812.07035

    cs.LG stat.ML

    On the Continuity of Rotation Representations in Neural Networks

    Authors: Yi Zhou, Connelly Barnes, Jingwan Lu, Jimei Yang, Hao Li

    Abstract: In neural networks, it is often desirable to work with various representations of the same space. For example, 3D rotations can be represented with quaternions or Euler angles. In this paper, we advance a definition of a continuous representation, which can be helpful for training deep neural networks. We relate this to topological concepts such as homeomorphism and embedding. We then investigate…

    Submitted 8 June, 2020; v1 submitted 17 December, 2018; originally announced December 2018.

  25. arXiv:1808.07269

    hep-ex cs.CV physics.data-an physics.ins-det

    A Deep Neural Network for Pixel-Level Electromagnetic Particle Identification in the MicroBooNE Liquid Argon Time Projection Chamber

    Authors: MicroBooNE collaboration, C. Adams, M. Alrashed, R. An, J. Anthony, J. Asaadi, A. Ashkenazi, M. Auger, S. Balasubramanian, B. Baller, C. Barnes, G. Barr, M. Bass, F. Bay, A. Bhat, K. Bhattacharya, M. Bishai, A. Blake, T. Bolton, L. Camilleri, D. Caratelli, I. Caro Terrazas, R. Carr, R. Castillo Fernandez, F. Cavanna, et al. (148 additional authors not shown)

    Abstract: We have developed a convolutional neural network (CNN) that can make a pixel-level prediction of objects in image data recorded by a liquid argon time projection chamber (LArTPC) for the first time. We describe the network design, training techniques, and software tools developed to train this network. The goal of this work is to develop a complete deep neural network based data reconstruction cha…

    Submitted 22 August, 2018; originally announced August 2018.

    Journal ref: Phys. Rev. D 99, 092001 (2019)

  26. arXiv:1706.01208

    cs.GR

    Approximate Program Smoothing Using Mean-Variance Statistics, with Application to Procedural Shader Bandlimiting

    Authors: Yuting Yang, Connelly Barnes

    Abstract: This paper introduces a general method to approximate the convolution of an arbitrary program with a Gaussian kernel. This process has the effect of smoothing out a program. Our compiler framework models intermediate values in the program as random variables, by using mean and variance statistics. Our approach breaks the input program into parts and relates the statistics of the different parts, u…

    Submitted 5 June, 2017; originally announced June 2017.

    Comments: 13 pages, 6 figures

    ACM Class: D.3.4; I.3.7

  27. arXiv:1706.01021

    cs.GR cs.CV

    Where and Who? Automatic Semantic-Aware Person Composition

    Authors: Fuwen Tan, Crispin Bernier, Benjamin Cohen, Vicente Ordonez, Connelly Barnes

    Abstract: Image compositing is a method used to generate realistic yet fake imagery by inserting contents from one image to another. Previous work in compositing has focused on improving appearance compatibility of a user selected foreground segment and a background image (i.e. color and illumination consistency). In this work, we instead develop a fully automated compositing model that additionally learns…

    Submitted 2 December, 2017; v1 submitted 3 June, 2017; originally announced June 2017.

    Comments: 10 pages, 9 figures

  28. arXiv:1701.08893

    cs.GR cs.CV cs.NE

    Stable and Controllable Neural Texture Synthesis and Style Transfer Using Histogram Losses

    Authors: Eric Risser, Pierre Wilmot, Connelly Barnes

    Abstract: Recently, methods have been proposed that perform texture synthesis and style transfer by using convolutional neural networks (e.g. Gatys et al. [2015,2016]). These methods are exciting because they can in some cases create results with state-of-the-art quality. However, in this paper, we show these methods also have limitations in texture quality, stability, requisite parameter tuning, and lack o…

    Submitted 1 February, 2017; v1 submitted 30 January, 2017; originally announced January 2017.

  29. arXiv:1612.01635

    cs.CV

    Learning to Detect Multiple Photographic Defects

    Authors: Ning Yu, Xiaohui Shen, Zhe Lin, Radomir Mech, Connelly Barnes

    Abstract: In this paper, we introduce the problem of simultaneously detecting multiple photographic defects. We aim at detecting the existence, severity, and potential locations of common photographic defects related to color, noise, blur and composition. The automatic detection of such defects could be used to provide users with suggestions for how to improve photos without the need to laboriously try vari…

    Submitted 8 March, 2018; v1 submitted 5 December, 2016; originally announced December 2016.

    Comments: Accepted to WACV'18