Skip to main content

Showing 1–50 of 175 results for author: Kaup, A

.
  1. arXiv:2407.09038  [pdf, other

    eess.IV

    High-Resolution Hyperspectral Video Imaging Using A Hexagonal Camera Array

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Retrieving the reflectance spectrum from objects is an essential task for many classification and detection problems, since many materials and processes have a unique spectral behaviour. In many cases, it is highly desirable to capture hyperspectral images due to the high spectral flexibility. Often, it is even necessary to capture hyperspectral videos or at least to be able to record a hyperspect… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.05900  [pdf, other

    eess.IV

    SVT-AV1 Encoding Bitrate Estimation Using Motion Search Information

    Authors: Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, Christian Herglotz, André Kaup

    Abstract: Enabling high compression efficiency while kee** encoding energy consumption at a low level, requires prioritization of which videos need more sophisticated encoding techniques. However, the effects vary highly based on the content, and information on how good a video can be compressed is required. This can be measured by estimating the encoded bitstream size prior to encoding. We identified the… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures, accepted for European Signal Processing Conference (EUSIPCO) 2024

  3. arXiv:2406.13709  [pdf, other

    eess.IV cs.CV

    A Study on the Effect of Color Spaces in Learned Image Compression

    Authors: Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Jürgen Seiler, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, André Kaup

    Abstract: In this work, we present a comparison between color spaces namely YUV, LAB, RGB and their effect on learned image compression. For this we use the structure and color based learned image codec (SLIC) from our prior work, which consists of two branches - one for the luminance component (Y or L) and another for chrominance components (UV or AB). However, for the RGB variant we input all 3 channels i… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepter pre-print version for ICIP 2024

  4. arXiv:2406.11284  [pdf, other

    eess.IV

    Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Multispectral imaging aims at recording images in different spectral bands. This is extremely beneficial in diverse discrimination applications, for example in agriculture, recycling or healthcare. One approach for snapshot multispectral imaging, which is capable of recording multispectral videos, is by using camera arrays, where each camera records a different spectral band. Since the cameras are… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.07938  [pdf, other

    eess.IV

    On Annotation-free Optimization of Video Coding for Machines

    Authors: Marc Windsheimer, Fabian Brand, André Kaup

    Abstract: Today, image and video data is not only viewed by humans, but also automatically analyzed by computer vision algorithms. However, current coding standards are optimized for human perception. Emerging from this, research on video coding for machines tries to develop coding methods designed for machines as information sink. Since many of these algorithms are based on neural networks, most proposals… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 7 pages, 10 figures

  6. arXiv:2405.17866  [pdf, ps, other

    eess.IV

    Towards Video Codec Performance Evaluation: A Rate-Energy-Distortion Perspective

    Authors: Geetha Ramasubbu, André Kaup, Christian Herglotz

    Abstract: The Bjøntegaard Delta rate (BD-rate) objectively assesses the coding efficiency of video codecs using the rate-distortion (R-D) performance but overlooks encoding energy, which is crucial in practical applications, especially for those on handheld devices. Although R-D analysis can be extended to incorporate encoding energy as energy-distortion (E-D), it fails to integrate all three parameters sea… ▽ More

    Submitted 11 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.12631  [pdf, other

    eess.IV

    Efficient Learned Wavelet Image and Video Coding

    Authors: Anna Meyer, Srivatsa Prativadibhayankaram, André Kaup

    Abstract: Learned wavelet image and video coding approaches provide an explainable framework with a latent space corresponding to a wavelet decomposition. The wavelet image coder iWave++ achieves state-of-the-art performance and has been employed for various compression tasks, including lossy as well as lossless image, video, and medical data compression. However, the approaches suffer from slow decoding sp… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 7 pages, 11 figures, submitted to ICIP2024

  8. arXiv:2402.17487  [pdf, other

    cs.CV cs.LG eess.IV

    Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model

    Authors: Panqi Jia, A. Burakhan Koyuncu, Jue Mao, Ze Cui, Yi Ma, Tiansheng Guo, Timofey Solovyev, Alexander Karabutov, Yin Zhao, **g Wang, Elena Alshina, Andre Kaup

    Abstract: The research on neural network (NN) based image compression has shown superior performance compared to classical compression frameworks. Unlike the hand-engineered transforms in the classical frameworks, NN-based models learn the non-linear transforms providing more compact bit representations, and achieve faster coding speed on parallel devices over their classical counterparts. Those properties… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted at (IEEE) PCS 2024; 6 pages

  9. arXiv:2402.17470  [pdf, other

    cs.CV cs.LG eess.IV

    Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization

    Authors: Panqi Jia, Jue Mao, Esin Koyuncu, A. Burakhan Koyuncu, Timofey Solovyev, Alexander Karabutov, Yin Zhao, Elena Alshina, Andre Kaup

    Abstract: Currently, there is a high demand for neural network-based image compression codecs. These codecs employ non-linear transforms to create compact bit representations and facilitate faster coding speeds on devices compared to the hand-crafted transforms used in classical frameworks. The scientific and industrial communities are highly interested in these properties, leading to the standardization ef… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 5 pages, 3 figures, 4 tables

  10. arXiv:2402.10257  [pdf, other

    eess.IV

    Analysis of Neural Video Compression Networks for 360-Degree Video Coding

    Authors: Andy Regensky, Fabian Brand, André Kaup

    Abstract: With the increasing efforts of bringing high-quality virtual reality technologies into the market, efficient 360-degree video compression gains in importance. As such, the state-of-the-art H.266/VVC video coding standard integrates dedicated tools for 360-degree video, and considerable efforts have been put into designing 360-degree projection formats with improved compression efficiency. For the… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures, 1 table, accepted for Picture Coding Symposium 2024 (PCS 2024)

  11. arXiv:2402.09926  [pdf, other

    eess.IV

    Predicting the Energy Demand of a Hardware Video Decoder with Unknown Design Using Software Profiling

    Authors: Matthias Kränzler, Christian Herglotz, André Kaup

    Abstract: Energy efficiency for video communications and video-on-demand streaming is essential for mobile devices with a limited battery capacity. Therefore, hardware decoder implementations are commonly used to significantly reduce the energetic load of video playback. The energy consumption of such a hardware implementation largely depends on a previously published recommendation document of a video codi… ▽ More

    Submitted 4 July, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Submitted to IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS), 13 Pages

  12. arXiv:2402.09001  [pdf, other

    eess.IV

    A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders

    Authors: Matthias Kränzler, Christian Herglotz, André Kaup

    Abstract: Energy and compression efficiency are two essential parts of modern video decoder implementations that have to be considered. This work comprehensively studies the following six video coding formats regarding compression and decoding energy efficiency: AVC, VP9, HEVC, AV1, VVC, and AVM. We first evaluate the energy demand of reference and optimized software decoder implementations. Furthermore, we… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: accepted as a conference paper for Picture Coding Symposium (PCS) 2024

  13. arXiv:2401.17246  [pdf, other

    eess.IV cs.CV

    SLIC: A Learned Image Codec Using Structure and Color

    Authors: Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, André Kaup

    Abstract: We propose the structure and color based learned image codec (SLIC) in which the task of compression is split into that of luminance and chrominance. The deep learning model is built with a novel multi-scale architecture for Y and UV channels in the encoder, where the features from various stages are combined to obtain the latent representation. An autoregressive context model is employed for back… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepter paper for Data Compression Conference 2024

  14. arXiv:2401.16067  [pdf, other

    eess.IV cs.MM

    Encoding Time and Energy Model for SVT-AV1 based on Video Complexity

    Authors: Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, Christian Herglotz, André Kaup

    Abstract: The share of online video traffic in global carbon dioxide emissions is growing steadily. To comply with the demand for video media, dedicated compression techniques are continuously optimized, but at the expense of increasingly higher computational demands and thus rising energy consumption at the video encoder side. In order to find the best trade-off between compression and energy consumption,… ▽ More

    Submitted 30 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 5 pages, 1 figure, accepted for IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2024

  15. arXiv:2312.14491  [pdf, ps, other

    eess.IV

    Enhanced Color Palette Modeling for Lossless Screen Content Compression

    Authors: Hannah Och, Shabhrish Reddy Uddehal, Tilo Strutz, André Kaup

    Abstract: Soft context formation is a lossless image coding method for screen content. It encodes images pixel by pixel via arithmetic coding by collecting statistics for probability distribution estimation. Its main pipeline includes three stages, namely a context model based stage, a color palette stage and a residual coding stage. Each subsequent stage is only employed if the previous stage can not be ap… ▽ More

    Submitted 9 January, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, 2 tables; accepted for IEEE International Conference on Acoustics, Speech and Signal Processing 2024 (IEEE ICASSP 2024)

  16. arXiv:2312.11209  [pdf, other

    eess.IV

    Quantized Decoder in Learned Image Compression for Deterministic Reconstruction

    Authors: Esin Koyuncu, Timofey Solovyev, Johannes Sauer, Elena Alshina, André Kaup

    Abstract: Learned image compression has a problem of non-bit-exact reconstruction due to different calculations of floating point arithmetic on different devices. This paper shows a method to achieve a deterministic reconstructed image by quantizing only the decoder of the learned image compression model. From the implementation perspective of an image codec, it is beneficial to have the results reproducibl… ▽ More

    Submitted 11 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures, 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  17. arXiv:2312.09266  [pdf, other

    eess.IV

    Geometry-Corrected Geodesic Motion Modeling with Per-Frame Camera Motion for 360-Degree Video Compression

    Authors: Andy Regensky, André Kaup

    Abstract: The large amounts of data associated with 360-degree video require highly effective compression techniques for efficient storage and distribution. The development of improved motion models for 360-degree motion compensation has shown significant improvements in compression efficiency. A geodesic motion model representing translational camera motion proved to be one of the most effective models. In… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures, 3 tables, accepted for IEEE International Conference on Acoustics, Speech and Signal Processing 2024 (IEEE ICASSP 2024)

  18. arXiv:2312.08949  [pdf, other

    eess.IV

    A Guided Upsampling Network for Short Wave Infrared Images Using Graph Regularization

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Exploiting the infrared area of the spectrum for classification problems is getting increasingly popular, because many materials have characteristic absorption bands in this area. However, sensors in the short wave infrared (SWIR) area and even higher wavelengths have a very low spatial resolution in comparison to classical cameras that operate in the visible wavelength area. Thus, in this paper a… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: 2024 IEEE International Conference on Acoustics, Speech and Signal Processing

  19. arXiv:2312.08946  [pdf, other

    eess.IV

    Color Agnostic Cross-Spectral Disparity Estimation

    Authors: Frank Sippel, Nils Genser, Hannah Och, Jürgen Seiler, André Kaup

    Abstract: Since camera modules become more and more affordable, multispectral camera arrays have found their way from special applications to the mass market, e.g., in automotive systems, smartphones, or drones. Due to multiple modalities, the registration of different viewpoints and the required cross-spectral disparity estimation is up to the present extremely challenging. To overcome this problem, we int… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: 2024 IEEE International Conference on Acoustics, Speech and Signal Processing

  20. arXiv:2310.17346  [pdf, ps, other

    eess.IV

    Extended Signaling Methods for Reduced Video Decoder Power Consumption Using Green Metadata

    Authors: Christian Herglotz, Matthias Kränzler, Xixue Chu, Edouard Francois, Yong He, André Kaup

    Abstract: In this paper, we discuss one aspect of the latest MPEG standard edition on energy-efficient media consumption, also known as Green Metadata (ISO/IEC 232001-11), which is the interactive signaling for remote decoder-power reduction for peer-to-peer video conferencing. In this scenario, the receiver of a video, e.g., a battery-driven portable device, can send a dedicated request to the sender which… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 5 pages, 2 figures

  21. Improving HEVC Encoding of Rendered Video Data Using True Motion Information

    Authors: Christian Herglotz, David Müller, Andreas Weinlich, Frank Bauer, Michael Ortner, Marc Stamminger, André Kaup

    Abstract: This paper shows that motion vectors representing the true motion of an object in a scene can be exploited to improve the encoding process of computer generated video sequences. Therefore, a set of sequences is presented for which the true motion vectors of the corresponding objects were generated on a per-pixel basis during the rendering process. In addition to conventional motion estimation meth… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 4 pages, 4 figures

    Journal ref: Proc. 2018 IEEE International Symposium on Multimedia (ISM)

  22. On Versatile Video Coding at UHD with Machine-Learning-Based Super-Resolution

    Authors: Kristian Fischer, Christian Herglotz, André Kaup

    Abstract: Coding 4K data has become of vital interest in recent years, since the amount of 4K data is significantly increasing. We propose a coding chain with spatial down- and upscaling that combines the next-generation VVC codec with machine learning based single image super-resolution algorithms for 4K. The investigated coding chain, which spatially downscales the 4K data before coding, shows superior qu… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Originally published as conference paper at QoMEX 2020

  23. Video Decoding Energy Estimation Using Processor Events

    Authors: Christian Herglotz, André Kaup

    Abstract: In this paper, we show that processor events like instruction counts or cache misses can be used to accurately estimate the processing energy of software video decoders. Therefore, we perform energy measurements on an ARM-based evaluation platform and count processor level events using a dedicated profiling software. Measurements are performed for various codecs and decoder implementations to prov… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures

    Journal ref: IEEE International Conference on Image Processing (ICIP), Bei**g, China, 2017, pp. 2493-2497

  24. arXiv:2307.12864  [pdf, other

    eess.IV

    Conditional Residual Coding: A Remedy for Bottleneck Problems in Conditional Inter Frame Coding

    Authors: Fabian Brand, Jürgen Seiler, André Kaup

    Abstract: Conditional coding is a new video coding paradigm enabled by neural-network-based compression. It can be shown that conditional coding is in theory better than the traditional residual coding, which is widely used in video compression standards like HEVC or VVC. However, on closer inspection, it becomes clear that conditional coders can suffer from information bottlenecks in the prediction path, i… ▽ More

    Submitted 26 January, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 12 pages, 8 figures Accepted for Publication in TCSVT

  25. Component-wise Power Estimation of Electrical Devices Using Thermal Imaging

    Authors: Christian Herglotz, Simon Grosche, Akarsh Bharadwaj, André Kaup

    Abstract: This paper presents a novel method to estimate the power consumption of distinct active components on an electronic carrier board by using thermal imaging. The components and the board can be made of heterogeneous material such as plastic, coated microchips, and metal bonds or wires, where a special coating for high emissivity is not required. The thermal images are recorded when the components on… ▽ More

    Submitted 18 July, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 10 pages, 8 figures

    Journal ref: IEEE Transactions on Consumer Electronics, vol. 67, no. 4, pp. 383-392, Nov. 2021,

  26. Power Modeling for Virtual Reality Video Playback Applications

    Authors: Christian Herglotz, Stéphane Coulombe, Ahmad Vakili, André Kaup

    Abstract: This paper proposes a method to evaluate and model the power consumption of modern virtual reality playback and streaming applications on smartphones. Due to the high computational complexity of the virtual reality processing toolchain, the corresponding power consumption is very high, which reduces operating times of battery-powered devices. To tackle this problem, we analyze the power consumptio… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures

    Journal ref: 2019 IEEE 23rd International Symposium on Consumer Technologies (ISCT), Ancona, Italy, 2019, pp. 105-110

  27. Power-Efficient Video Streaming on Mobile Devices Using Optimal Spatial Scaling

    Authors: Christian Herglotz, André Kaup, Stéphane Coulombe, Ahmad Vakili

    Abstract: This paper derives optimal spatial scaling and rate control parameters for power-efficient wireless video streaming on portable devices. A video streaming application is studied, which receives a high-resolution and high-quality video stream from a remote server and displays the content to the end-user.We show that the resolution of the input video can be adjusted such that the quality-power trade… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 6 pages, 7 figures

    Journal ref: Proc. IEEE 9th International Conference on Consumer Electronics (ICCE-Berlin), Berlin, Germany, 2019, pp. 233-238

  28. arXiv:2307.06102  [pdf, other

    eess.IV

    Spatially-Adaptive Learning-Based Image Compression with Hierarchical Multi-Scale Latent Spaces

    Authors: Fabian Brand, Alexander Kopte, Kristian Fischer, André Kaup

    Abstract: Adaptive block partitioning is responsible for large gains in current image and video compression systems. This method is able to compress large stationary image areas with only a few symbols, while maintaining a high level of quality in more detailed areas. Current state-of-the-art neural-network-based image compression systems however use only one scale to transmit the latent space. In previous… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 5 pages, 3 figures Accepted for presentation at ICIP 2023

  29. arXiv:2307.05208  [pdf, other

    eess.IV

    Encoder Complexity Control in SVT-AV1 by Speed-Adaptive Preset Switching

    Authors: Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, André Kaup, Christian Herglotz

    Abstract: Current developments in video encoding technology lead to continuously improving compression performance but at the expense of increasingly higher computational demands. Regarding the online video traffic increases during the last years and the concomitant need for video encoding, encoder complexity control mechanisms are required to restrict the processing time to a sufficient extent in order to… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures, accepted for IEEE International Conference on Image Processing (ICIP) 2023

  30. arXiv:2306.16755  [pdf, ps, other

    eess.IV

    Processing Energy Modeling for Neural Network Based Image Compression

    Authors: Christian Herglotz, Fabian Brand, Andy Regensky, Felix Rievel, André Kaup

    Abstract: Nowadays, the compression performance of neural-networkbased image compression algorithms outperforms state-of-the-art compression approaches such as JPEG or HEIC-based image compression. Unfortunately, most neural-network based compression methods are executed on GPUs and consume a high amount of energy during execution. Therefore, this paper performs an in-depth analysis on the energy consumptio… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures, accepted for IEEE International Conference on Image Processing (ICIP) 2023

  31. Cross Spectral Image Reconstruction Using a Deep Guided Neural Network

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Cross spectral camera arrays, where each camera records different spectral content, are becoming increasingly popular for RGB, multispectral and hyperspectral imaging, since they are capable of a high resolution in every dimension using off-the-shelf hardware. For these, it is necessary to build an image processing pipeline to calculate a consistent image data cube, i.e., it should look like as if… ▽ More

    Submitted 14 September, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Journal ref: 2023 IEEE International Conference on Image Processing (ICIP)

  32. Motion Plane Adaptive Motion Modeling for Spherical Video Coding in H.266/VVC

    Authors: Andy Regensky, Christian Herglotz, André Kaup

    Abstract: Motion compensation is one of the key technologies enabling the high compression efficiency of modern video coding standards. To allow compression of spherical video content, special map** functions are required to project the video to the 2D image plane. Distortions inevitably occurring in these map**s impair the performance of classical motion models. In this paper, we propose a novel motion… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 5 pages, 4 figures, 1 table, accepted for IEEE International Conference on Image Processing 2023 (IEEE ICIP 2023). arXiv admin note: substantial text overlap with arXiv:2202.03323

  33. Improving Spherical Image Resampling through Viewport-Adaptivity

    Authors: Andy Regensky, Viktoria Heimann, Ruoyu Zhang, André Kaup

    Abstract: The conversion between different spherical image and video projection formats requires highly accurate resampling techniques in order to minimize the inevitable loss of information. Suitable resampling algorithms such as nearest neighbor, linear or cubic resampling are readily available. However, no generally applicable resampling technique exploits the special properties of spherical images so fa… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures, 2 tables, accepted for IEEE International Conference on Image Processing 2023 (IEEE ICIP 2023)

  34. Video Decoding Energy Reduction Using Temporal-Domain Filtering

    Authors: Christian Herglotz, Matthias Kränzler, Robert Ludwig, André Kaup

    Abstract: In this paper, we study decoding energy reduction opportunities using temporal-domain filtering and subsampling methods. In particular, we study spatiotemporal filtering using a contrast sensitivity function and temporal downscaling, i.e., frame rate reduction. We apply these concepts as a pre-filtering to the video before compression and evaluate the bitrate, the decoding energy, and the visual q… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 6 pages, 5 figures

  35. Learned Wavelet Video Coding using Motion Compensated Temporal Filtering

    Authors: Anna Meyer, Fabian Brand, André Kaup

    Abstract: We present an end-to-end trainable wavelet video coder based on motion-compensated temporal filtering (MCTF). Thereby, we introduce a different coding scheme for learned video compression, which is currently dominated by residual and conditional coding approaches. By performing discrete wavelet transforms in temporal, horizontal, and vertical dimension, we obtain an explainable framework with spat… ▽ More

    Submitted 12 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 14 pages, 14 figures, Accepted for IEEE Access 2023

  36. arXiv:2305.15117  [pdf, other

    eess.IV

    Power Reduction Opportunities on End-User Devices in Quality-Steady Video Streaming

    Authors: Christian Herglotz, Werner Robitza, Alexander Raake, Tobias Hossfeld, André Kaup

    Abstract: This paper uses a crowdsourced dataset of online video streaming sessions to investigate opportunities to reduce the power consumption while considering QoE. For this, we base our work on prior studies which model both the end-user's QoE and the end-user device's power consumption with the help of high-level video features such as the bitrate, the frame rate, and the resolution. On top of existing… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 4 pages, 3 figures

  37. Image Segmentation For Improved Lossless Screen Content Compression

    Authors: Shabhrish Reddy Uddehal, Tilo Strutz, Hannah Och, André Kaup

    Abstract: In recent years, it has been found that screen content images (SCI) can be effectively compressed based on appropriate probability modelling and suitable entropy coding methods such as arithmetic coding. The key objective is determining the best probability distribution for each pixel position. This strategy works particularly well for images with synthetic (textual) content. However, usually scre… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 5 Pages, 3 Figures

  38. Multiscale Augmented Normalizing Flows for Image Compression

    Authors: Marc Windsheimer, Fabian Brand, André Kaup

    Abstract: Most learning-based image compression methods lack efficiency for high image quality due to their non-invertible design. The decoding function of the frequently applied compressive autoencoder architecture is only an approximated inverse of the encoding transform. This issue can be resolved by using invertible latent variable models, which allow a perfect reconstruction if no quantization is perfo… ▽ More

    Submitted 22 May, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 5 pages, 7 figures

  39. arXiv:2305.05440  [pdf, other

    eess.IV cs.MM

    Improved Screen Content Coding in VVC Using Soft Context Formation

    Authors: Hannah Och, Shabhrish Reddy Uddehal, Tilo Strutz, André Kaup

    Abstract: Screen content images typically contain a mix of natural and synthetic image parts. Synthetic sections usually are comprised of uniformly colored areas and repeating colors and patterns. In the VVC standard, these properties are exploited using Intra Block Copy and Palette Mode. In this paper, we show that pixel-wise lossless coding can outperform lossy VVC coding in such areas. We propose an enha… ▽ More

    Submitted 9 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 5 pages, 5 figures, 2 tables; accepted for IEEE International Conference on Acoustics, Speech and Signal Processing 2024 (IEEE ICASSP 2024)

  40. The Bjøntegaard Bible -- Why your Way of Comparing Video Codecs May Be Wrong

    Authors: Christian Herglotz, Hannah Och, Anna Meyer, Geetha Ramasubbu, Lena Eichermüller, Matthias Kränzler, Fabian Brand, Kristian Fischer, Dat Thanh Nguyen, Andy Regensky, André Kaup

    Abstract: In this paper, we provide an in-depth assessment on the Bjøntegaard Delta. We construct a large data set of video compression performance comparisons using a diverse set of metrics including PSNR, VMAF, bitrate, and processing energies. These metrics are evaluated for visual data types such as classic perspective video, 360$^\circ$ video, point clouds, and screen content. As compression technology… ▽ More

    Submitted 22 December, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 21 pages, 14 figures

  41. arXiv:2304.12412  [pdf, other

    cs.CV cs.AI

    End-to-End Lidar-Camera Self-Calibration for Autonomous Vehicles

    Authors: Arya Rachman, Jürgen Seiler, André Kaup

    Abstract: Autonomous vehicles are equipped with a multi-modal sensor setup to enable the car to drive safely. The initial calibration of such perception sensors is a highly matured topic and is routinely done in an automated factory environment. However, an intriguing question arises on how to maintain the calibration quality throughout the vehicle's operating duration. Another challenge is to calibrate mul… ▽ More

    Submitted 27 April, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Accepted for The 35th IEEE Intelligent Vehicles Symposium (IV 2023)

  42. Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model

    Authors: Dat Thanh Nguyen, Andre Kaup

    Abstract: In recent years, we have witnessed the presence of point cloud data in many aspects of our life, from immersive media, autonomous driving to healthcare, although at the cost of a tremendous amount of data. In this paper, we present an efficient lossless point cloud compression method that uses sparse tensor-based deep neural networks to learn point cloud geometry and color probability distribution… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: 12 pages, accepted to IEEE Transactions on Circuits and Systems for Video Technology

    Journal ref: EEE Transactions on Circuits and Systems for Video Technology, vol. 33, no. 8, pp. 4337-4348, Aug. 2023

  43. arXiv:2303.06517  [pdf, other

    eess.IV cs.LG

    Deep probabilistic model for lossless scalable point cloud attribute compression

    Authors: Dat Thanh Nguyen, Kamal Gopikrishnan Nambiar, Andre Kaup

    Abstract: In recent years, several point cloud geometry compression methods that utilize advanced deep learning techniques have been proposed, but there are limited works on attribute compression, especially lossless compression. In this work, we build an end-to-end multiscale point cloud attribute coding method (MNeT) that progressively projects the attributes onto multiscale latent spaces. The multiscale… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 5 pages, accepted for presentation at ICASSP 2023

  44. Multispectral Image Compression Based on HEVC Using Pel-Recursive Inter-Band Prediction

    Authors: Anna Meyer, Nils Genser, André Kaup

    Abstract: Recent developments in optical sensors enable a wide range of applications for multispectral imaging, e.g., in surveillance, optical sorting, and life-science instrumentation. Increasing spatial and spectral resolution allows creating higher quality products, however, it poses challenges in handling such large amounts of data. Consequently, specialized compression techniques for multispectral imag… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 6 pages, 4 figures, 1 table; Originally published as conference paper at IEEE MMSP 2020

    Journal ref: IEEE MMSP 2020

  45. arXiv:2303.05121  [pdf, other

    eess.IV

    A novel Cross-Component Context Model for End-to-End Wavelet Image Coding

    Authors: Anna Meyer, André Kaup

    Abstract: In contrast to traditional compression techniques performing linear transforms, the latent space of popular compressive autoencoders is obtained from a learned nonlinear map** and hard to interpret. In this paper, we explore a promising alternative approach for neural compression, with an autoencoder whose latent space represents a nonlinear wavelet decomposition. Previous work has shown that ne… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at ICASSP 2023

  46. arXiv:2303.00433  [pdf, ps, other

    eess.IV

    Motion Estimation for Fisheye Video With an Application to Temporal Resolution Enhancement

    Authors: Andrea Eichenseer, Michel Bätz, André Kaup

    Abstract: Surveying wide areas with only one camera is a typical scenario in surveillance and automotive applications. Ultra wide-angle fisheye cameras employed to that end produce video data with characteristics that differ significantly from conventional rectilinear imagery as obtained by perspective pinhole cameras. Those characteristics are not considered in typical image and video processing algorithms… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 8, pp. 2376-2390, Aug. 2019

  47. Temporal Scalability of Dynamic Volume Data Using Mesh Compensated Wavelet Lifting

    Authors: Wolfgang Schnurrer, Niklas Pallast, Thomas Richter, André Kaup

    Abstract: Due to their high resolution, dynamic medical 2D+t and 3D+t volumes from computed tomography (CT) and magnetic resonance tomography (MR) reach a size which makes them very unhandy for teleradiologic applications. A lossless scalable representation offers the advantage of a down-scaled version which can be used for orientation or previewing, while the remaining information for reconstructing the fu… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Journal ref: IEEE Transactions on Image Processing, vol. 27, no. 1, pp. 419-431, Jan. 2018

  48. arXiv:2302.13581  [pdf, other

    eess.IV

    Saliency-Driven Hierarchical Learned Image Coding for Machines

    Authors: Kristian Fischer, Fabian Brand, Christian Blum, André Kaup

    Abstract: We propose to employ a saliency-driven hierarchical neural image compression network for a machine-to-machine communication scenario following the compress-then-analyze paradigm. By that, different areas of the image are coded at different qualities depending on whether salient objects are located in the corresponding area. Areas without saliency are transmitted in latent spaces of lower spatial r… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted for publication in 2023 ICASSP

  49. arXiv:2302.01594  [pdf, ps, other

    eess.IV

    Analysis of mesh-based motion compensation in wavelet lifting of dynamical 3-D+t CT data

    Authors: Wolfgang Schnurrer, Thomas Richter, Jürgen Seiler, André Kaup

    Abstract: Factorized in the lifting structure, the wavelet transform can easily be extended by arbitrary compensation methods. Thereby, the transform can be adapted to displacements in the signal without losing the ability of perfect reconstruction. This leads to an improvement of scalability. In temporal direction of dynamic medical 3-D+t volumes from Computed Tomography, displacement is mainly given by ex… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Journal ref: IEEE 14th International Workshop on Multimedia Signal Processing (MMSP), Banff, AB, Canada, 2012, pp. 152-157

  50. Graph-Based Compensated Wavelet Lifting for Scalable Lossless Coding of Dynamic Medical Data

    Authors: Daniela Lanz, André Kaup

    Abstract: Lossless compression of dynamic 2D+t and 3D+t medical data is challenging regarding the huge amount of data, the characteristics of the inherent noise, and the high bit depth. Beyond that, a scalable representation is often required in telemedicine applications. Motion Compensated Temporal Filtering works well for lossless compression of medical volume data and additionally provides temporal, spat… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Journal ref: IEEE Transactions on Image Processing, vol. 29, pp. 2439-2451, 2020