Showing 1–13 of 13 results for author: Hannuksela, M

  1. arXiv:2401.10761  [pdf, other]

    eess.IV cs.CV

    NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines

    Authors: Jukka I. Ahonen, Nam Le, Honglei Zhang, Antti Hallapuro, Francesco Cricri, Hamed Rezazadegan Tavakoli, Miska M. Hannuksela, Esa Rahtu

    Abstract: The recent progress in artificial intelligence has led to an ever-increasing usage of images and videos by machine analysis algorithms, mainly neural networks. Nonetheless, compression, storage and transmission of media have traditionally been designed considering human beings as the viewers of the content. Recent research on image and video coding for machine analysis has progressed mainly in two…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: ISM 2023 Best paper award winner version

  2. Bridging the gap between image coding for machines and humans

    Authors: Nam Le, Honglei Zhang, Francesco Cricri, Ramin G. Youvalari, Hamed Rezazadegan Tavakoli, Emre Aksu, Miska M. Hannuksela, Esa Rahtu

    Abstract: Image coding for machines (ICM) aims at reducing the bitrate required to represent an image while minimizing the drop in machine vision analysis accuracy. In many use cases, such as surveillance, it is also important that the visual quality is not drastically deteriorated by the compression process. Recent works on using neural network (NN) based ICM codecs have shown significant coding gains agai…

    Submitted 19 January, 2024; originally announced January 2024.

    Journal ref: IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 2022, pp. 3411-3415

  3. arXiv:2210.04112  [pdf, other]

    cs.CV cs.LG cs.MM eess.IV

    Leveraging progressive model and overfitting for efficient learned image compression

    Authors: Honglei Zhang, Francesco Cricri, Hamed Rezazadegan Tavakoli, Emre Aksu, Miska M. Hannuksela

    Abstract: Deep learning has been overwhelmingly dominant in the field of computer vision and image/video processing for the last decade. However, for image and video compression, it lags behind the traditional techniques based on discrete cosine transform (DCT) and linear filters. Built on top of an autoencoder architecture, learned image compression (LIC) systems have drawn enormous attention in recent years. Ne…

    Submitted 8 October, 2022; originally announced October 2022.

  4. Coding of volumetric content with MIV using VVC subpictures

    Authors: Maria Santamaria, Vinod Kumar Malamal Vadakital, Lukasz Kondrad, Antti Hallapuro, Miska M. Hannuksela

    Abstract: Storage and transport of six degrees of freedom (6DoF) dynamic volumetric visual content for immersive applications requires efficient compression. ISO/IEC MPEG has recently been working on a standard that aims to efficiently code and deliver 6DoF immersive visual experiences. This standard is called the MIV. MIV uses regular 2D video codecs to code the visual data. MPEG jointly with ITU-T VCEG, h…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 6 pages, 3 figures

    Journal ref: 2021 IEEE 23rd International Workshop on Multimedia Signal Processing (MMSP)

  5. arXiv:2203.01183  [pdf]

    eess.IV cs.GR cs.HC cs.MM

    Omnidirectional MediA Format (OMAF): Toolbox for Virtual Reality Services

    Authors: Sachin Deshpande, Miska M. Hannuksela

    Abstract: This paper provides an overview of the Omnidirectional Media Format (OMAF) standard, second edition, which has been recently finalized. OMAF specifies the media format for coding, storage, delivery, and rendering of omnidirectional media, including video, audio, images, and timed text. Additionally, OMAF supports multiple viewpoints corresponding to omnidirectional cameras and overlay images or vi…

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 7 pages, 1 figure. This document is the accepted version of the paper that has been published in 2021 IEEE Conference on Standards for Communications and Networking (CSCN)

    Journal ref: 2021 IEEE Conference on Standards for Communications and Networking (CSCN), 2021, pp. 20-25

  6. arXiv:2112.08767  [pdf, other]

    eess.IV cs.CV cs.LG

    Adaptation and Attention for Neural Video Coding

    Authors: Nannan Zou, Honglei Zhang, Francesco Cricri, Ramin G. Youvalari, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, Esa Rahtu

    Abstract: Neural image coding now represents the state-of-the-art image compression approach. However, a lot of work is still to be done in the video domain. In this work, we propose an end-to-end learned video codec that introduces several architectural novelties as well as training novelties, revolving around the concepts of adaptation and attention. Our codec is organized as an intra-frame codec paired w…

    Submitted 16 December, 2021; originally announced December 2021.

  7. arXiv:2108.10551  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    Lossless Image Compression Using a Multi-Scale Progressive Statistical Model

    Authors: Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Nannan Zou, Emre Aksu, Miska M. Hannuksela

    Abstract: Lossless image compression is an important technique for image storage and transmission when information loss is not allowed. With the fast development of deep learning techniques, deep neural networks have been used in this field to achieve a higher compression rate. Methods based on pixel-wise autoregressive statistical models have shown good performance. However, the sequential processing way p…

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: Accepted ACCV 2020

  8. arXiv:2007.16054  [pdf, other]

    eess.IV cs.CV cs.LG cs.MM stat.ML

    Learning to Learn to Compress

    Authors: Nannan Zou, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Miska Hannuksela, Emre Aksu, Esa Rahtu

    Abstract: In this paper we present an end-to-end meta-learned system for image compression. Traditional machine learning based approaches to image compression train one or more neural networks for generalization performance. However, at inference time, the encoder or the latent tensor output by the encoder can be optimized for each test image. This optimization can be regarded as a form of adaptation or bene…

    Submitted 1 May, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

  9. arXiv:2007.14267  [pdf, other]

    eess.IV cs.CV cs.LG cs.MM

    Efficient Adaptation of Neural Network Filter for Video Compression

    Authors: Yat-Hong Lam, Alireza Zare, Francesco Cricri, Jani Lainema, Miska Hannuksela

    Abstract: We present an efficient finetuning methodology for neural-network filters which are applied as a postprocessing artifact-removal step in video coding pipelines. The fine-tuning is performed at encoder side to adapt the neural network to the specific content that is being encoded. In order to maximize the PSNR gain and minimize the bitrate overhead, we propose to finetune only the convolutional lay…

    Submitted 13 August, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted in ACM Multimedia 2020

  10. arXiv:2004.09226  [pdf, other]

    eess.IV cs.CV cs.LG

    End-to-End Learning for Video Frame Compression with Self-Attention

    Authors: Nannan Zou, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, Esa Rahtu

    Abstract: One of the core components of conventional (i.e., non-learned) video codecs consists of predicting a frame from a previously-decoded frame, by leveraging temporal correlations. In this paper, we propose an end-to-end learned system for compressing video frames. Instead of relying on pixel-space motion (as with optical flow), our system learns deep embeddings of frames and encodes their difference…

    Submitted 20 April, 2020; originally announced April 2020.

  11. arXiv:1905.10371  [pdf, other]

    eess.IV cs.LG stat.ML

    A Compression Objective and a Cycle Loss for Neural Image Compression

    Authors: Caglar Aytekin, Francesco Cricri, Antti Hallapuro, Jani Lainema, Emre Aksu, Miska Hannuksela

    Abstract: In this manuscript we propose two objective terms for neural image compression: a compression objective and a cycle loss. These terms are applied on the encoder output of an autoencoder and are used in combination with reconstruction losses. The compression objective encourages sparsity and low entropy in the activations. The cycle loss term represents the distortion between encoder outputs comput…

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: Accepted in Challenge and Workshop on Learned Image Compression (CLIC) as a part of CVPR 2019

  12. arXiv:1905.04079  [pdf, other]

    cs.LG cs.MM stat.ML

    Compressing Weight-updates for Image Artifacts Removal Neural Networks

    Authors: Yat Hong Lam, Alireza Zare, Caglar Aytekin, Francesco Cricri, Jani Lainema, Emre Aksu, Miska Hannuksela

    Abstract: In this paper, we present a novel approach for fine-tuning a decoder-side neural network in the context of image compression, such that the weight-updates are better compressible. At encoder side, we fine-tune a pre-trained artifact removal network on target data by using a compression objective applied on the weight-update. In particular, the compression objective encourages weight-updates which…

    Submitted 14 June, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: Submission for CHALLENGE ON LEARNED IMAGE COMPRESSION (CLIC) 2019 (updated on 14 June 2019)

  13. arXiv:1805.10887  [pdf, ps, other]

    cs.LG stat.ML

    Block-optimized Variable Bit Rate Neural Image Compression

    Authors: Caglar Aytekin, Xingyang Ni, Francesco Cricri, Jani Lainema, Emre Aksu, Miska Hannuksela

    Abstract: In this work, we propose an end-to-end block-based auto-encoder system for image compression. We introduce novel contributions to neural-network based image compression, mainly in achieving binarization simulation, variable bit rates with multiple networks, entropy-friendly representations, inference-stage code optimization and performance-improving normalization layers in the auto-encoder. We eva…

    Submitted 28 May, 2018; originally announced May 2018.

    Comments: Accepted, Workshop and Challenge on Learned Image Compression (CLIC), CVPR 2018