
Showing 1–27 of 27 results for author: Tekalp, A M

  1. A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks

    Authors: Ronglei Ji, A. Murat Tekalp

    Abstract: Despite the fact that real-world video deinterlacing and demosaicing are well-suited to supervised learning from synthetically degraded data because the degradation models are known and fixed, learned video deinterlacing and demosaicing have received much less attention compared to denoising and super-resolution tasks. We propose a new multi-picture architecture for video deinterlacing or demosaicing b…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 13 pages, 6 figures, accepted to IMAVIS

  2. arXiv:2404.11273  [pdf, other]

    eess.IV cs.CV

    Training Transformer Models by Wavelet Losses Improves Quantitative and Visual Performance in Single Image Super-Resolution

    Authors: Cansu Korkmaz, A. Murat Tekalp

    Abstract: Transformer-based models have achieved remarkable results in low-level vision tasks including image super-resolution (SR). However, early Transformer-based approaches that rely on self-attention within non-overlapping windows encounter challenges in acquiring global information. To activate more input pixels globally, hybrid attention models have been proposed. Moreover, training by solely minimiz…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: total of 10 pages including references, 5 tables and 5 figures, accepted for NTIRE 2024 Single Image Super Resolution (x4) challenge

  3. arXiv:2404.09790  [pdf, other]

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou, et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  4. arXiv:2403.11791  [pdf, other]

    eess.IV cs.CV

    PAON: A New Neuron Model using Padé Approximants

    Authors: Onur Keleş, A. Murat Tekalp

    Abstract: Convolutional neural networks (CNN) are built upon the classical McCulloch-Pitts neuron model, which is essentially a linear model, where the nonlinearity is provided by a separate activation function. Several researchers have proposed enhanced neuron models, including quadratic neurons, generalized operational neurons, generative neurons, and super neurons, with stronger nonlinearity than that pr…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE ICIP 2024

  5. arXiv:2402.19215  [pdf, other]

    eess.IV cs.CV

    Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts

    Authors: Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan

    Abstract: Super-resolution (SR) is an ill-posed inverse problem, where the size of the set of feasible solutions that are consistent with a given low-resolution image is very large. Many algorithms have been proposed to find a "good" solution among the feasible solutions that strike a balance between fidelity and perceptual quality. Unfortunately, all known methods generate artifacts and hallucinations whil…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted for IEEE CVPR 2024, total of 11 pages, 3 pages for references, 7 figures and 2 tables

  6. arXiv:2402.08862  [pdf, other]

    eess.IV

    Saliency-aware End-to-end Learned Variable-Bitrate 360-degree Image Compression

    Authors: Oguzhan Gungordu, A. Murat Tekalp

    Abstract: Effective compression of 360$^\circ$ images, also referred to as omnidirectional images (ODIs), is of high interest for various virtual reality (VR) and related applications. 2D image compression methods ignore the equator-biased nature of ODIs and fail to address oversampling near the poles, leading to inefficient compression when applied to ODI. We present a new learned saliency-aware 360…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 7 pages with double column, 1 and a half for references, 6 figures and 4 tables, submitted to IEEE ICIP 2024

  7. arXiv:2402.08550  [pdf, other]

    eess.IV

    Motion-Adaptive Inference for Flexible Learned B-Frame Compression

    Authors: M. Akin Yilmaz, O. Ugur Ulas, Ahmet Bilican, A. Murat Tekalp

    Abstract: While the performance of recent learned intra and sequential video compression models exceeds that of respective traditional codecs, the performance of learned B-frame compression models generally lags behind traditional B-frame coding. The performance gap is bigger for complex scenes with large motions. This is related to the fact that the distance between the past and future references varies in hie…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 7 pages, submitted to IEEE ICIP 2024

  8. arXiv:2402.07597  [pdf, other]

    eess.IV cs.CV

    Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback

    Authors: Cansu Korkmaz, Ege Cirakman, A. Murat Tekalp, Zafer Dogan

    Abstract: Super-resolution (SR) is an ill-posed inverse problem with a large set of feasible solutions that are consistent with a given low-resolution image. Various deterministic algorithms aim to find a single solution that balances fidelity and perceptual quality; however, this trade-off often causes visual artifacts that bring ambiguity in information-centric applications. On the other hand, diffusion m…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: total of 7 pages with double column, 1 and a half for references, 6 figures and 2 tables, submitted to IEEE ICIP 2024

  9. arXiv:2307.01556  [pdf, other]

    eess.IV

    Spatio-Temporal Perception-Distortion Trade-off in Learned Video SR

    Authors: Nasrin Rahimi, A. Murat Tekalp

    Abstract: Perception-distortion trade-off is well-understood for single-image super-resolution. However, its extension to video super-resolution (VSR) is not straightforward, since popular perceptual measures only evaluate naturalness of spatial textures and do not take naturalness of flow (temporal coherence) into account. To this effect, we propose a new measure of spatio-temporal perceptual video quality…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP) 2023

  10. arXiv:2306.16544  [pdf, other]

    eess.IV cs.CV

    Multi-Scale Deformable Alignment and Content-Adaptive Inference for Flexible-Rate Bi-Directional Video Compression

    Authors: M. Akın Yılmaz, O. Ugur Ulas, A. Murat Tekalp

    Abstract: The lack of ability to adapt the motion compensation model to video content is an important limitation of current end-to-end learned video compression models. This paper advances the state-of-the-art by proposing an adaptive motion-compensation model for end-to-end rate-distortion optimized hierarchical bi-directional video compression. In particular, we propose two novelties: i) a multi-scale def…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP) 2023

  11. Multi-Field De-interlacing using Deformable Convolution Residual Blocks and Self-Attention

    Authors: Ronglei Ji, A. Murat Tekalp

    Abstract: Although deep learning has made a significant impact on image/video restoration and super-resolution, learned deinterlacing has so far received less attention in academia or industry. This is despite the fact that deinterlacing is well-suited for supervised learning from synthetic data since the degradation model is known and fixed. In this paper, we propose a novel multi-field full frame-rate deinterlacing netwo…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 5 pages, 4 figures, accepted to ICIP 2022

  12. arXiv:2209.08568  [pdf, other]

    cs.CV cs.LG eess.IV eess.SP

    MMSR: Multiple-Model Learned Image Super-Resolution Benefiting From Class-Specific Image Priors

    Authors: Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan

    Abstract: Assuming a known degradation model, the performance of a learned image super-resolution (SR) model depends on how well the variety of image characteristics within the training set matches those in the test set. As a result, the performance of an SR model varies noticeably from image to image over a test set depending on whether characteristics of specific images are similar to those in the trainin…

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 5 pages, 4 figures, accepted for publication in IEEE ICIP 2022 Conference

  13. arXiv:2209.08564  [pdf, other]

    cs.CV cs.LG eess.IV eess.SP

    Perception-Distortion Trade-off in the SR Space Spanned by Flow Models

    Authors: Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan, Erkut Erdem, Aykut Erdem

    Abstract: Flow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature ($τ$) of latent variables, which introduces random variations of texture among sample solutions, resulting in visual artifacts and low fidelity. In this paper, we present a simple but effective image ensembling/fus…

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 5 pages, 4 figures, accepted for publication in IEEE ICIP 2022 Conference

  14. arXiv:2206.13613  [pdf, other]

    eess.IV cs.CV

    Flexible-Rate Learned Hierarchical Bi-Directional Video Compression With Motion Refinement and Frame-Level Bit Allocation

    Authors: Eren Cetin, M. Akin Yilmaz, A. Murat Tekalp

    Abstract: This paper presents improvements and novel additions to our recent work on end-to-end optimized hierarchical bi-directional video compression to further advance the state-of-the-art in learned video compression. As an improvement, we combine motion estimation and prediction modules and compress refined residual motion vectors for improved rate-distortion performance. As a novel addition, we adapted…

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP 2022)

    Report number: 1850

  15. arXiv:2112.09529  [pdf, other]

    eess.IV cs.CV

    End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression

    Authors: M. Akın Yılmaz, A. Murat Tekalp

    Abstract: Conventional video compression (VC) methods are based on motion compensated transform coding, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the combinatorial nature of the end-to-end optimization problem. Learned VC allows end-to-end rate-distortion (R-D) optimized training of nonlinear transform, motion and entr…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in IEEE Transactions on Image Processing on 15 Dec. 2021

  16. arXiv:2106.00504  [pdf, other]

    eess.IV cs.LG eess.SP

    Two-stage domain adapted training for better generalization in real-world image restoration and super-resolution

    Authors: Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan

    Abstract: It is well-known that in inverse problems, end-to-end trained networks overfit the degradation model seen in the training set, i.e., they do not generalize to other types of degradations well. Recently, an approach to first map images downsampled by unknown filters to bicubicly downsampled look-alike images was proposed to successfully super-resolve such images. In this paper, we show that any inv…

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted for publication in IEEE ICIP 2021 Conference

  17. arXiv:2105.14926  [pdf, other]

    eess.IV

    Self-Organized Residual Blocks for Image Super-Resolution

    Authors: Onur Keleş, A. Murat Tekalp, Junaid Malik, Serkan Kıranyaz

    Abstract: It has become standard practice to use convolutional networks (ConvNets) with ReLU non-linearity in image restoration and super-resolution (SR). Although the universal approximation theorem states that a multi-layer neural network can approximate any non-linear function with the desired precision, it does not reveal the best network architecture to do so. Recently, operational neural networks…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP) 2021

  18. arXiv:2105.12794  [pdf, other]

    cs.CV eess.IV

    DFPN: Deformable Frame Prediction Network

    Authors: M. Akın Yılmaz, A. Murat Tekalp

    Abstract: Learned frame prediction is a current problem of interest in computer vision and video compression. Although several deep network architectures have been proposed for learned frame prediction, to the best of our knowledge, there is no work based on using deformable convolutions for frame prediction. To this effect, we propose a deformable frame prediction network (DFPN) for task oriented implicit…

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP) 2021

  19. arXiv:2105.12107  [pdf, other]

    eess.IV cs.CV

    Self-Organized Variational Autoencoders (Self-VAE) for Learned Image Compression

    Authors: M. Akın Yılmaz, Onur Keleş, Hilal Güven, A. Murat Tekalp, Junaid Malik, Serkan Kıranyaz

    Abstract: In end-to-end optimized learned image compression, it is standard practice to use a convolutional variational autoencoder with generalized divisive normalization (GDN) to transform images into a latent space. Recently, Operational Neural Networks (ONNs) that learn the best non-linearity from a set of alternatives, and their self-organized variants, Self-ONNs, that approximate any non-linearity via…

    Submitted 28 May, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP) 2021

  20. arXiv:2104.14868  [pdf, other]

    eess.IV cs.MM

    On the Computation of PSNR for a Set of Images or Video

    Authors: Onur Keleş, M. Akın Yılmaz, A. Murat Tekalp, Cansu Korkmaz, Zafer Dogan

    Abstract: When comparing learned image/video restoration and compression methods, it is common to report peak signal-to-noise ratio (PSNR) results. However, there does not exist a generally agreed upon practice to compute PSNR for sets of images or video. Some authors report the average of individual image/frame PSNR, which is equivalent to computing a single PSNR from the geometric mean of individual image/fra…

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: accepted for publication in Picture Coding Symposium (PCS) 2021
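
The equivalence mentioned in the abstract above can be checked numerically. The following is a minimal sketch, not the authors' code: it assumes the truncated phrase refers to per-frame mean-square error (MSE), uses the usual definition PSNR = 10·log10(peak²/MSE), and the function names and sample MSE values are hypothetical.

```python
import math


def psnr_from_mse(mse, peak=255.0):
    """PSNR in dB for a given mean-square error (8-bit peak assumed by default)."""
    return 10.0 * math.log10(peak ** 2 / mse)


def average_psnr(mses, peak=255.0):
    """Arithmetic mean of the per-frame PSNR values."""
    return sum(psnr_from_mse(m, peak) for m in mses) / len(mses)


def psnr_of_geometric_mean_mse(mses, peak=255.0):
    """Single PSNR computed from the geometric mean of the per-frame MSEs."""
    log_mean = sum(math.log(m) for m in mses) / len(mses)
    return psnr_from_mse(math.exp(log_mean), peak)


def psnr_of_arithmetic_mean_mse(mses, peak=255.0):
    """Single PSNR computed from the arithmetic mean of the per-frame MSEs."""
    return psnr_from_mse(sum(mses) / len(mses), peak)


# Hypothetical per-frame MSE values, chosen only for illustration.
frame_mses = [4.0, 25.0, 100.0]

print(average_psnr(frame_mses))
print(psnr_of_geometric_mean_mse(frame_mses))   # matches the average PSNR
print(psnr_of_arithmetic_mean_mse(frame_mses))  # never larger than the average PSNR
```

Because the logarithm turns a product into a sum, averaging PSNR values is the same as taking the PSNR of the geometric mean of the MSEs; by the AM-GM inequality, the PSNR of the arithmetic-mean MSE cannot exceed it.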

  21. arXiv:2104.14836  [pdf, ps, other]

    eess.IV

    A Practical Approach for Rate-Distortion-Perception Analysis in Learned Image Compression

    Authors: Ogun Kirmemis, A. Murat Tekalp

    Abstract: Rate-distortion optimization (RDO) of codecs, where distortion is quantified by the mean-square error, has been a standard practice in image/video compression over the years. RDO serves well for optimization of codec performance for evaluation of the results in terms of PSNR. However, it is well known that the PSNR does not correlate well with perceptual evaluation of images; hence, RDO is not wel…

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: accepted for publication in Picture Coding Symposium (PCS) 2021

  22. arXiv:2102.06531  [pdf, ps, other]

    eess.IV cs.LG

    Editorial: Introduction to the Issue on Deep Learning for Image/Video Restoration and Compression

    Authors: A. Murat Tekalp, Michele Covell, Radu Timofte, Chao Dong

    Abstract: Recent works have shown that learned models can achieve significant performance gains, especially in terms of perceptual quality measures, over traditional methods. Hence, the state of the art in image restoration and compression is getting redefined. This special issue covers the state of the art in learned image/video restoration and compression to promote further progress in innovative architec…

    Submitted 9 February, 2021; originally announced February 2021.

    Journal ref: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, vol. 15, no. 2, FEBRUARY 2021

  23. Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction

    Authors: M. Akin Yilmaz, A. Murat Tekalp

    Abstract: We analyze the performance of feedforward vs. recurrent neural network (RNN) architectures and associated training methods for learned frame prediction. To this effect, we trained a residual fully convolutional neural network (FCNN), a convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both statele…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Accepted for publication at IEEE ICIP 2019

  24. End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression

    Authors: M. Akin Yilmaz, A. Murat Tekalp

    Abstract: Conventional video compression methods employ a linear transform and block motion model, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the combinatorial nature of the end-to-end optimization problem. Learned video compression allows end-to-end rate-distortion optimized training of all nonlinear modules, quantization…

    Submitted 26 May, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: This work is accepted for publication in IEEE ICIP 2020

  25. arXiv:2007.08922  [pdf, other]

    eess.IV cs.CV cs.LG

    Can Learned Frame-Prediction Compete with Block-Motion Compensation for Video Coding?

    Authors: Serkan Sulun, A. Murat Tekalp

    Abstract: Given recent advances in learned video prediction, we investigate whether a simple video codec using a pre-trained deep model for next frame prediction based on previously encoded/decoded frames without sending any motion side information can compete with standard video codecs based on block-motion compensation. Frame differences given learned frame predictions are encoded by a standard still-imag…

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted for publication in Springer Journal of Signal, Image and Video Processing

  26. Realizing a Low-Power Head-Mounted Phase-Only Holographic Display by Light-Weight Compression

    Authors: Burak Soner, Erdem Ulusoy, A. Murat Tekalp, Hakan Urey

    Abstract: Head-mounted holographic displays (HMHD) are projected to be the first commercial realization of holographic video display systems. HMHDs use liquid crystal on silicon (LCoS) spatial light modulators (SLM), which are best suited to display phase-only holograms (POH). The performance/watt requirement of a monochrome, 60 fps Full HD, 2-eye, POH HMHD system is about 10 TFLOPS/W, which is orders of ma…

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: 10 pages, 6 figures, accepted for publication in the IEEE Transactions on Image Processing

    Journal ref: IEEE Transactions on Image Processing, vol. 29, pp. 4505-4515, 2020

  27. arXiv:1806.00333  [pdf, other]

    eess.IV

    Learned Compression Artifact Removal by Deep Residual Networks

    Authors: Ogün Kırmemiş, Gonca Bakar, A. Murat Tekalp

    Abstract: We propose a method for learned compression artifact removal by post-processing of BPG compressed images. We trained three networks of different sizes. We encoded input images using BPG with different QP values. We submitted the best combination of test images, encoded with different QP and post-processed by one of three networks, which satisfy the file size and decode time constraints imposed by…

    Submitted 1 June, 2018; originally announced June 2018.

    Comments: Accepted for publication in the CVPR 2018, Challenge on Learned Image Compression (CLIC), Salt Lake City, Utah, USA, 18 June 2018 and appears in compression.cc