Showing 1–18 of 18 results for author: Perez-Pellitero, E

  1. arXiv:2403.11909

    cs.CV

    RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF

    Authors: Sibi Catley-Chandar, Richard Shaw, Gregory Slabaugh, Eduardo Perez-Pellitero

    Abstract: Recent advances in neural rendering have enabled highly photorealistic 3D scene reconstruction and novel view synthesis. Despite this progress, current state-of-the-art methods struggle to reconstruct high frequency detail, due to factors such as a low-frequency bias of radiance fields and inaccurate camera calibration. One approach to mitigate this issue is to enhance images post-rendering. 2D en…

    Submitted 18 March, 2024; originally announced March 2024.

  2. arXiv:2403.11237

    cs.CV

    FORCE: Dataset and Method for Intuitive Physics Guided Human-object Interaction

    Authors: Xiaohan Zhang, Bharat Lal Bhatnagar, Sebastian Starke, Ilya Petrov, Vladimir Guzov, Helisa Dhamo, Eduardo Pérez-Pellitero, Gerard Pons-Moll

    Abstract: Interactions between human and objects are influenced not only by the object's pose and shape, but also by physical attributes such as object mass and surface friction. They introduce important motion nuances that are essential for diversity and realism. Despite advancements in recent kinematics-based methods, this aspect has been overlooked. Generating nuanced human motion presents two challenges…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 24 pages, 9 figures

  3. arXiv:2402.05532

    cs.CV

    NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction

    Authors: Zhongqun Zhang, Jifei Song, Eduardo Pérez-Pellitero, Yiren Zhou, Hyung Jin Chang, Aleš Leonardis

    Abstract: Modeling hand-object interactions is a fundamentally challenging task in 3D computer vision. Despite remarkable progress that has been achieved in this field, existing methods still fail to synthesize the hand-object interaction photo-realistically, suffering from degraded rendering quality caused by the heavy mutual occlusions between the hand and the object, and inaccurate hand-object pose estim…

    Submitted 9 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by 3DV 2024

  4. arXiv:2312.15059

    cs.CV cs.AI

    Deformable 3D Gaussian Splatting for Animatable Human Avatars

    Authors: HyunJun Jung, Nikolas Brasch, Jifei Song, Eduardo Perez-Pellitero, Yiren Zhou, Zhihao Li, Nassir Navab, Benjamin Busam

    Abstract: Recent advances in neural radiance fields enable novel view synthesis of photo-realistic images in dynamic settings, which can be applied to scenarios with human animation. Commonly used implicit backbones to establish accurate models, however, require many input views and additional annotations such as human masks, UV maps and depth maps. In this work, we propose ParDy-Human (Parameterized Dynami…

    Submitted 22 December, 2023; originally announced December 2023.

  5. arXiv:2312.13308

    cs.CV

    SWAGS: Sampling Windows Adaptively for Dynamic 3D Gaussian Splatting

    Authors: Richard Shaw, Jifei Song, Arthur Moreau, Michal Nazarczuk, Sibi Catley-Chandar, Helisa Dhamo, Eduardo Perez-Pellitero

    Abstract: Novel view synthesis has shown rapid progress recently, with methods capable of producing ever more photo-realistic results. 3D Gaussian Splatting has emerged as a particularly promising method, producing high-quality renderings of static scenes and enabling interactive viewing at real-time frame rates. However, it is currently limited to static scenes only. In this work, we extend 3D Gaussian Spla…

    Submitted 19 December, 2023; originally announced December 2023.

  6. arXiv:2312.02902

    cs.CV

    HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting

    Authors: Helisa Dhamo, Yinyu Nie, Arthur Moreau, Jifei Song, Richard Shaw, Yiren Zhou, Eduardo Pérez-Pellitero

    Abstract: 3D head animation has seen major quality and runtime improvements over the last few years, particularly empowered by the advances in differentiable rendering and neural radiance fields. Real-time rendering is a highly desirable goal for real-world applications. We propose HeadGaS, the first model to use 3D Gaussian Splats (3DGS) for 3D head reconstruction and animation. In this paper we introduce…

    Submitted 5 December, 2023; originally announced December 2023.

  7. arXiv:2311.17113

    cs.CV cs.GR

    Human Gaussian Splatting: Real-time Rendering of Animatable Avatars

    Authors: Arthur Moreau, Jifei Song, Helisa Dhamo, Richard Shaw, Yiren Zhou, Eduardo Pérez-Pellitero

    Abstract: This work addresses the problem of real-time rendering of photorealistic human body avatars learned from multi-view videos. While the classical approaches to model and render virtual humans generally use a textured mesh, recent research has developed neural body representations that achieve impressive visual quality. However, these models are difficult to render in real-time and their quality degr…

    Submitted 28 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted to CVPR 2024

  8. arXiv:2210.11153

    eess.IV cs.CV

    Reversed Image Signal Processing and RAW Reconstruction. AIM 2022 Challenge Report

    Authors: Marcos V. Conde, Radu Timofte, Yibin Huang, Jingyang Peng, Chang Chen, Cheng Li, Eduardo Pérez-Pellitero, Fenglong Song, Furui Bai, Shuai Liu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yu Zhu, Chenghua Li, Yingying Jiang, Yong A, Peisong Wang, Cong Leng, Jian Cheng, Xiaoyu Liu, Zhicun Yin, Zhilu Zhang, Junyi Li, Ming Liu, et al. (18 additional authors not shown)

    Abstract: Cameras capture sensor RAW images and transform them into pleasant RGB images, suitable for the human eyes, using their integrated Image Signal Processor (ISP). Numerous low-level vision tasks operate in the RAW domain (e.g. image denoising, white balance) due to its linear relationship with the scene irradiance, wide range of information at 12 bits, and sensor designs. Despite this, RAW image data…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: ECCV 2022 Advances in Image Manipulation (AIM) workshop

  9. arXiv:2210.03482

    cs.CV cs.LG

    CLAD: A realistic Continual Learning benchmark for Autonomous Driving

    Authors: Eli Verwimp, Kuo Yang, Sarah Parisot, Hong Lanqing, Steven McDonagh, Eduardo Pérez-Pellitero, Matthias De Lange, Tinne Tuytelaars

    Abstract: In this paper we describe the design and the ideas motivating a new Continual Learning benchmark for Autonomous Driving (CLAD), that focuses on the problems of object classification and object detection. The benchmark utilises SODA10M, a recently released large-scale dataset that concerns autonomous driving related problems. First, we review and discuss existing continual learning benchmarks, how…

    Submitted 7 October, 2022; originally announced October 2022.

  10. arXiv:2205.12633

    cs.CV eess.IV

    NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, Jin Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang, et al. (68 additional authors not shown)

    Abstract: This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR)…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: CVPR Workshops 2022. 15 pages, 21 figures, 2 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022

  11. arXiv:2204.01407

    cs.CV cs.LG

    Re-examining Distillation For Continual Object Detection

    Authors: Eli Verwimp, Kuo Yang, Sarah Parisot, Hong Lanqing, Steven McDonagh, Eduardo Pérez-Pellitero, Matthias De Lange, Tinne Tuytelaars

    Abstract: Training models continually to detect and classify objects, from new classes and new domains, remains an open problem. In this work, we conduct a thorough analysis of why and how object detection models forget catastrophically. We focus on distillation-based approaches in two-stage networks; the most-common strategy employed in contemporary continual object detection work. Distillation aims to tran…

    Submitted 7 October, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted at BMVC '22

  12. arXiv:2203.14825

    cs.CV

    HDR Reconstruction from Bracketed Exposures and Events

    Authors: Richard Shaw, Sibi Catley-Chandar, Ales Leonardis, Eduardo Perez-Pellitero

    Abstract: Reconstruction of high-quality HDR images is at the core of modern computational photography. Significant progress has been made with multi-frame HDR reconstruction methods, producing high-resolution, rich and accurate color reconstructions with high-frequency details. However, they are still prone to fail in dynamic or largely over-exposed scenes, where frame misalignment often results in visible…

    Submitted 28 March, 2022; originally announced March 2022.

  13. arXiv:2203.12311

    cs.CV

    Self-supervised HDR Imaging from Motion and Exposure Cues

    Authors: Michal Nazarczuk, Sibi Catley-Chandar, Ales Leonardis, Eduardo Pérez-Pellitero

    Abstract: Recent High Dynamic Range (HDR) techniques extend the capabilities of current cameras where scenes with a wide range of illumination cannot be accurately captured with a single low-dynamic-range (LDR) image. This is generally accomplished by capturing several LDR images with varying exposure values whose information is then incorporated into a merged HDR image. While such approaches work well for…

    Submitted 23 March, 2022; originally announced March 2022.

  14. Model-Based Image Signal Processors via Learnable Dictionaries

    Authors: Marcos V. Conde, Steven McDonagh, Matteo Maggioni, Aleš Leonardis, Eduardo Pérez-Pellitero

    Abstract: Digital cameras transform sensor RAW readings into RGB images by means of their Image Signal Processor (ISP). Computational photography tasks such as image denoising and colour constancy are commonly performed in the RAW domain, in part due to the inherent hardware design, but also due to the appealing simplicity of noise statistics that result from the direct sensor readings. Despite this, the av…

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: AAAI 2022

    Journal ref: Vol. 36 No. 1: AAAI-22 Technical Tracks 1 (2022) 481-489

  15. FlexHDR: Modelling Alignment and Exposure Uncertainties for Flexible HDR Imaging

    Authors: Sibi Catley-Chandar, Thomas Tanay, Lucas Vandroux, Aleš Leonardis, Gregory Slabaugh, Eduardo Pérez-Pellitero

    Abstract: High dynamic range (HDR) imaging is of fundamental importance in modern digital photography pipelines and used to produce a high-quality photograph with well exposed regions despite varying illumination across the image. This is typically achieved by merging multiple low dynamic range (LDR) images taken at different exposures. However, over-exposed regions and misalignment errors due to poorly com…

    Submitted 12 September, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE Transactions on Image Processing (TIP) 2022

  16. arXiv:2106.10070

    cs.CV cs.LG

    Residual Contrastive Learning for Image Reconstruction: Learning Transferable Representations from Noisy Images

    Authors: Nanqing Dong, Matteo Maggioni, Yongxin Yang, Eduardo Pérez-Pellitero, Ales Leonardis, Steven McDonagh

    Abstract: This paper is concerned with contrastive learning (CL) for low-level image restoration and enhancement tasks. We propose a new label-efficient learning paradigm based on residuals, residual contrastive learning (RCL), and derive an unsupervised visual representation learning framework, suitable for low-level vision tasks with noisy inputs. While supervised image reconstruction aims to minimize res…

    Submitted 27 April, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: Accepted by IJCAI 2022

  17. arXiv:2106.01439

    cs.CV eess.IV

    NTIRE 2021 Challenge on High Dynamic Range Imaging: Dataset, Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Aleš Leonardis, Radu Timofte

    Abstract: This paper reviews the first challenge on high-dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2021. This manuscript focuses on the newly introduced dataset, the proposed methods and their results. The challenge aims at estimating an HDR image from one or multiple respective low-dynamic range (LDR) obse…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: To appear in CVPRW 2021 (NTIRE)

  18. arXiv:1807.07930

    cs.CV

    Perceptual Video Super Resolution with Enhanced Temporal Consistency

    Authors: Eduardo Pérez-Pellitero, Mehdi S. M. Sajjadi, Michael Hirsch, Bernhard Schölkopf

    Abstract: With the advent of perceptual loss functions, new possibilities in super-resolution have emerged, and we currently have models that successfully generate near-photorealistic high-resolution images from their low-resolution observations. Up to now, however, such approaches have been exclusively limited to single image super-resolution. The application of perceptual loss functions on video processin…

    Submitted 2 May, 2019; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: Major revision and improvement of the manuscript: New network architecture, new loss function and extended experiments