Showing 1–16 of 16 results for author: Conde, M V

Searching in archive eess.
  1. arXiv:2404.16484

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu, et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  2. arXiv:2404.16223

    cs.CV eess.IV

    Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey

    Authors: Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhijing Sun, Jiaying Zhu, et al. (10 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 RAW Image Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines, however, this problem is not as explored as in the RGB domain. The goal of this challenge is to upscale RAW Bayer images by 2x, considering unknown degradations such as nois…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 - NTIRE Workshop

  3. arXiv:2404.11569

    cs.CV cs.LG eess.IV

    Simple Image Signal Processing using Global Context Guidance

    Authors: Omar Elezabi, Marcos V. Conde, Radu Timofte

    Abstract: In modern smartphone cameras, the Image Signal Processor (ISP) is the core element that converts the RAW readings from the sensor into perceptually pleasant RGB images for the end users. The ISP is typically proprietary and handcrafted and consists of several blocks such as white balance, color correction, and tone mapping. Deep learning-based ISPs aim to transform RAW images into DSLR-like RGB im…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Preprint under review

  4. arXiv:2401.16468

    cs.CV cs.LG eess.IV

    InstructIR: High-Quality Image Restoration Following Human Instructions

    Authors: Marcos V. Conde, Gregor Geigle, Radu Timofte

    Abstract: Image restoration is a fundamental problem that involves recovering a high-quality clean image from its degraded observation. All-In-One image restoration models can effectively restore images from various types and levels of degradation using degradation-specific information as prompts to guide the restoration model. In this work, we present the first approach that uses human-written instructions…

    Submitted 21 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Technical Report

  5. arXiv:2312.15487

    eess.IV cs.CV

    BSRAW: Improving Blind RAW Image Super-Resolution

    Authors: Marcos V. Conde, Florin Vasluianu, Radu Timofte

    Abstract: In smartphones and compact cameras, the Image Signal Processor (ISP) transforms the RAW sensor image into a human-readable sRGB image. Most popular super-resolution methods depart from a sRGB image and upscale it further, improving its quality. However, modeling the degradations in the sRGB domain is complicated because of the non-linear ISP transformations. Despite this known issue, only a few me…

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  6. arXiv:2307.04916

    cs.CV eess.IV

    Rapid Deforestation and Burned Area Detection using Deep Multimodal Learning on Satellite Imagery

    Authors: Gabor Fodor, Marcos V. Conde

    Abstract: Deforestation estimation and fire detection in the Amazon forest poses a significant challenge due to the vast size of the area and the limited accessibility. However, these are crucial problems that lead to severe environmental consequences, including climate change, global warming, and biodiversity loss. To effectively address this problem, multimodal satellite imagery and remote sensing offer a…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: CVPR 2023 Workshop on Multimodal Learning for Earth and Environment (MultiEarth)

  7. arXiv:2211.14040

    eess.IV cs.CV

    Real-Time Under-Display Cameras Image Restoration and HDR on Mobile Devices

    Authors: Marcos V. Conde, Florin Vasluianu, Sabari Nathan, Radu Timofte

    Abstract: The new trend of full-screen devices implies positioning the camera behind the screen to bring a larger display-to-body ratio, enhance eye contact, and provide a notch-free viewing experience on smartphones, TV or tablets. On the other hand, the images captured by under-display cameras (UDCs) are degraded by the screen in front of them. Deep learning methods for image restoration can significantly…

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: ECCV 2022 AIM Workshop. arXiv admin note: text overlap with arXiv:2210.13552

  8. arXiv:2211.04470

    cs.CV eess.IV

    Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, et al. (14 additional authors not shown)

    Abstract: Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks. Thus, it is very crucial to have efficient and accurate depth estimation models that can run fast on low-power mobile chipsets. In this Mobile AI challenge, the target was to develop deep learning-based single image depth es…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2105.08630, arXiv:2211.03885; text overlap with arXiv:2105.08819, arXiv:2105.08826, arXiv:2105.08629, arXiv:2105.07809, arXiv:2105.07825

  9. arXiv:2211.03885

    cs.CV eess.IV

    Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Ziyao Yi, Yan Xiang, Zibin Liu, Shaoqing Li, Keming Shi, Dehui Kong, Ke Xu, Minsu Kwon, Yaqi Wu, Jiesi Zheng, Zhihao Fan, Xun Wu, Feng Zhang, Albert No, Minhyeok Cho, Zewen Chen, Xiaze Zhang, Ran Li, et al. (13 additional authors not shown)

    Abstract: The role of mobile cameras increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. Th…

    Submitted 7 November, 2022; originally announced November 2022.

  10. arXiv:2210.13552

    cs.CV eess.IV

    Perceptual Image Enhancement for Smartphone Real-Time Applications

    Authors: Marcos V. Conde, Florin Vasluianu, Javier Vazquez-Corral, Radu Timofte

    Abstract: Recent advances in camera designs and imaging pipelines allow us to capture high-quality images using smartphones. However, due to the small size and lens limitations of the smartphone cameras, we commonly find artifacts or degradation in the processed images. The most common unpleasant effects are noise artifacts, diffraction artifacts, blur, and HDR overexposure. Deep learning methods for image…

    Submitted 22 November, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: IEEE/CVF WACV 2023 (Oral)

  11. arXiv:2210.11153

    eess.IV cs.CV

    Reversed Image Signal Processing and RAW Reconstruction. AIM 2022 Challenge Report

    Authors: Marcos V. Conde, Radu Timofte, Yibin Huang, Jingyang Peng, Chang Chen, Cheng Li, Eduardo Pérez-Pellitero, Fenglong Song, Furui Bai, Shuai Liu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yu Zhu, Chenghua Li, Yingying Jiang, Yong A, Peisong Wang, Cong Leng, Jian Cheng, Xiaoyu Liu, Zhicun Yin, Zhilu Zhang, Junyi Li, Ming Liu, et al. (18 additional authors not shown)

    Abstract: Cameras capture sensor RAW images and transform them into pleasant RGB images, suitable for the human eyes, using their integrated Image Signal Processor (ISP). Numerous low-level vision tasks operate in the RAW domain (e.g. image denoising, white balance) due to its linear relationship with the scene irradiance, wide-range of information at 12bits, and sensor designs. Despite this, RAW image data…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: ECCV 2022 Advances in Image Manipulation (AIM) workshop

  12. arXiv:2209.11345

    cs.CV eess.IV

    Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration

    Authors: Marcos V. Conde, Ui-Jin Choi, Maxime Burchi, Radu Timofte

    Abstract: Compression plays an important role on the efficient transmission and storage of images and videos through band-limited systems such as streaming services, virtual reality or videogames. However, compression unavoidably leads to artifacts and the loss of the original information, which may severely degrade the visual quality. For these reasons, quality enhancement of compressed images has become a…

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: European Conference on Computer Vision (ECCV 2022) Workshops

  13. arXiv:2206.11260

    cs.SD cs.CV cs.LG eess.AS

    Few-shot Long-Tailed Bird Audio Recognition

    Authors: Marcos V. Conde, Ui-Jin Choi

    Abstract: It is easier to hear birds than see them. However, they still play an essential role in nature and are excellent indicators of deteriorating environmental quality and pollution. Recent advances in Deep Neural Networks allow us to process audio data to detect and classify birds. This technology can assist researchers in monitoring bird populations and biodiversity. We propose a sound detection and…

    Submitted 4 July, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: LifeCLEF2022 (best paper award)

  14. arXiv:2204.12819

    eess.IV cs.CV

    Conformer and Blind Noisy Students for Improved Image Quality Assessment

    Authors: Marcos V. Conde, Maxime Burchi, Radu Timofte

    Abstract: Generative models for image restoration, enhancement, and generation have significantly improved the quality of the generated images. Surprisingly, these models produce more pleasant images to the human eye than other methods, yet, they may get a lower perceptual quality score using traditional perceptual quality metrics such as PSNR or SSIM. Therefore, it is necessary to develop a quantitative me…

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: CVPR NTIRE 2022

  15. Model-Based Image Signal Processors via Learnable Dictionaries

    Authors: Marcos V. Conde, Steven McDonagh, Matteo Maggioni, Aleš Leonardis, Eduardo Pérez-Pellitero

    Abstract: Digital cameras transform sensor RAW readings into RGB images by means of their Image Signal Processor (ISP). Computational photography tasks such as image denoising and colour constancy are commonly performed in the RAW domain, in part due to the inherent hardware design, but also due to the appealing simplicity of noise statistics that result from the direct sensor readings. Despite this, the av…

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: AAAI 2022

    Journal ref: Vol. 36 No. 1: AAAI-22 Technical Tracks 1 (2022) 481-489

  16. arXiv:2107.04878

    cs.SD cs.MM eess.AS

    Weakly-Supervised Classification and Detection of Bird Sounds in the Wild. A BirdCLEF 2021 Solution

    Authors: Marcos V. Conde, Kumar Shubham, Prateek Agnihotri, Nitin D. Movva, Szilard Bessenyei

    Abstract: It is easier to hear birds than see them, however, they still play an essential role in nature and they are excellent indicators of deteriorating environmental quality and pollution. Recent advances in Machine Learning and Convolutional Neural Networks allow us to detect and classify bird sounds, by doing this, we can assist researchers in monitoring the status and trends of bird populations and b…

    Submitted 10 July, 2021; originally announced July 2021.

    Comments: Proceedings Working Notes CEURWS @ CLEF 2021 - BirdCLEF 2021