
A Dictionary Based Approach for Removing Out-of-Focus Blur

Authors: Uditangshu Aurangabadkar, Anil Kokaram

Abstract: The field of image deblurring has seen tremendous progress with the rise of deep learning models. These models, albeit efficient, are computationally expensive and energy consuming. Dictionary based learning approaches have shown promising results in image denoising and Single Image Super-Resolution. We propose an extension of the Rapid and Accurate Image Super-Resolution (RAISR) algorithm introduced by Isidoro, Romano and Milanfar for the task of out-of-focus blur removal. We define a sharpness quality measure which aligns well with the perceptual quality of an image. A metric based blending strategy based on asset allocation management is also proposed. Our method demonstrates an average increase of approximately 13% (PSNR) and 10% (SSIM) compared to popular deblurring methods. Furthermore, our blending scheme curtails ringing artefacts post restoration.

Submitted 17 June, 2024; originally announced June 2024.

Comments: 6 pages, IEEE ICIP

arXiv:2404.05321 [pdf]

Unravelling the Power of Single-Pass Look-Ahead in Modern Codecs for Optimized Transcoding Deployment

Authors: Vibhoothi Vibhoothi, Julien Zouein, François Pitié, Anil Kokaram

Abstract: Modern video encoders have evolved into sophisticated pieces of software in which various coding tools interact with each other. In the past, single-pass encoding was not considered for Video-On-Demand (VOD) use cases. In this work, we evaluate production-ready encoders for H.264 (x264), H.265 (HEVC), AV1 (SVT-AV1) along with direct comparisons to the latest AV1 encoder inside NVIDIA GPUs (40 series), and AWS Mediaconvert's AV1 implementation. Our experimental results demonstrate that single-pass encoding inside modern encoder implementations can give us very good quality at a reasonable compute cost. The results are presented as three different scenarios targeting High, Medium, and Low complexity, accounting for quality/bitrate/compute load. Finally, a set of recommendations is presented for end-users to help decide which encoder/preset combination might be more suited to their use case.

Submitted 8 April, 2024; originally announced April 2024.

Comments: Accepted paper for NAB 2024

arXiv:2306.14432 [pdf, other]

doi 10.1109/ICIP49359.2023.10222332

Subjective assessment of the impact of a content adaptive optimiser for compressing 4K HDR content with AV1

Authors: Vibhoothi, Angeliki Katsenou, François Pitié, Katarina Domijan, Anil Kokaram

Abstract: Since 2015 video dimensionality has expanded to higher spatial and temporal resolutions and a wider colour gamut. This High Dynamic Range (HDR) content has gained traction in the consumer space as it delivers an enhanced quality of experience. At the same time, the complexity of codecs is growing. This has driven the development of tools for content-adaptive optimisation that achieve optimal rate-distortion performance for HDR video at 4K resolution. While improvements of just a few percentage points in BD-Rate (1-5%) are significant for the streaming media industry, the impact on subjective quality has been less studied, especially for HDR/AV1. In this paper, we conduct a subjective quality assessment (42 subjects) of 4K HDR content with a per-clip optimisation strategy. We correlate these subjective scores with existing popular objective metrics used in standard development and show that some perceptual metrics correlate surprisingly well even though they are not tuned for HDR. We find that the DSCQS protocol is too insensitive to categorically compare the methods but the data allows us to make recommendations about the use of experts vs non-experts in HDR studies, and explain the subjective impact of film grain in HDR content under compression.

Submitted 26 June, 2023; originally announced June 2023.

Comments: Accepted Camera-ready version for the ICIP 2023 Paper

arXiv:2305.11858 [pdf, other]

Recommendations for Verifying HDR Subjective Testing Workflows

Authors: Vibhoothi, Angeliki Katsenou, John Squires, François Pitié, Anil Kokaram

Abstract: Over the past few years, there has been an increase in the demand and availability of High Dynamic Range (HDR) displays and content. To ensure the production of high-quality materials, human evaluation is required. However, ascertaining whether the full playback pipeline is indeed HDR-compliant can be challenging. In this paper, we present a set of recommendations for conformance testing to validate various aspects of the testing workflow, including playback, displays, brightness, colours, and viewing environment. We assessed the effectiveness of HDR conversion techniques used in current standards development (3GPP) for making source materials. Additionally, we evaluate HDR display technologies, including OLED and LCD, using both consumer television and a reference monitor.

Submitted 19 May, 2023; originally announced May 2023.

Comments: Accepted Camera-ready version of QOMEX 2023 Short-paper

arXiv:2303.16163 [pdf, other]

Comparison of HDR quality metrics in Per-Clip Lagrangian multiplier optimisation with AV1

Authors: Vibhoothi, François Pitié, Angeliki Katsenou, Yeping Su, Balu Adsumilli, Anil Kokaram

Abstract: The complexity of modern codecs along with the increased need for delivering high-quality videos at low bitrates has reinforced the idea of a per-clip tailoring of parameters for optimised rate-distortion performance. While the objective quality metrics used for Standard Dynamic Range (SDR) videos have been well studied, the transitioning of consumer displays to support High Dynamic Range (HDR) videos poses a new challenge to rate-distortion optimisation. In this paper, we review the popular HDR metrics DeltaE100 (DE100), PSNRL100, wPSNR, and HDR-VQM. We measure the impact of employing these metrics in per-clip direct search optimisation of the rate-distortion Lagrange multiplier in AV1. We report, on 35 HDR videos, average Bjontegaard Delta Rate (BD-Rate) gains of 4.675%, 2.226%, and 7.253% in terms of DE100, PSNRL100, and HDR-VQM. We also show that the inclusion of chroma in the quality metrics has a significant impact on optimisation, which can only be partially addressed by the use of chroma offsets.

Submitted 26 April, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

Comments: Accepted version for ICME 2023 Special Session, "Optimised Media Delivery"

arXiv:2302.14516 [pdf, other]

Learnt Deep Hyperparameter selection in Adversarial Training for compressed video enhancement with perceptual critic

Authors: Darren Ramsook, Anil Kokaram

Abstract: Image based Deep Feature Quality Metrics (DFQMs) have been shown to correlate better with subjective perceptual scores than traditional metrics. The fundamental focus of these DFQMs is to exploit internal representations from a large scale classification network as the metric feature space. Previously, no attention has been given to the problem of identifying which layers are most perceptually relevant. In this paper we present a new method for selecting perceptually relevant layers from such a network, based on a neuroscience interpretation of layer behaviour. The selected layers are treated as a hyperparameter to the critic network in a W-GAN. The critic uses the output from these layers in the preliminary stages to extract perceptual information. A video enhancement network is trained adversarially with this critic. Our results show that the introduction of these selected features into the critic yields up to 10% (FID) and 15% (KID) performance increase against other critic networks that do not exploit the idea of optimised feature selection.

Submitted 28 February, 2023; originally announced February 2023.

arXiv:2211.05805 [pdf]

Impact of Video Compression on the Performance of Object Detection Systems for Surveillance Applications

Authors: Michael O'Byrne, Vibhoothi, Mark Sugrue, Anil Kokaram

Abstract: This study examines the relationship between H.264 video compression and the performance of an object detection network (YOLOv5). We curated a set of 50 surveillance videos and annotated targets of interest (people, bikes, and vehicles). Videos were encoded at 5 quality levels using Constant Rate Factor (CRF) values in the set {22,32,37,42,47}. YOLOv5 was applied to compressed videos and detection performance was analyzed at each CRF level. Test results indicate that the detection performance is generally robust to moderate levels of compression; using a CRF value of 37 instead of 22 leads to significantly reduced bitrates/file sizes without adversely affecting detection performance. However, detection performance degrades appreciably at higher compression levels, especially in complex scenes with poor lighting and fast-moving targets. Finally, retraining YOLOv5 on compressed imagery gives up to a 1% improvement in F1 score when applied to highly compressed footage.

Submitted 10 November, 2022; originally announced November 2022.

arXiv:2208.11150 [pdf, other]

doi 10.1117/12.2632272

Direct Optimisation of $\boldsymbol{\lambda}$ for HDR Content Adaptive Transcoding in AV1

Authors: Vibhoothi, François Pitié, Angeliki Katsenou, Daniel Joseph Ringis, Yeping Su, Neil Birkbeck, Jessie Lin, Balu Adsumilli, Anil Kokaram

Abstract: Since the adoption of VP9 by Netflix in 2016, royalty-free coding standards have continued to gain prominence through the activities of the AOMedia consortium. AV1, the latest open source standard, is now widely supported. In the early years after standardisation, HDR video tends to be underserved in open source encoders for a variety of reasons including the relatively small amount of true HDR content being broadcast and the challenges in RD optimisation with that material. AV1 codec optimisation has been ongoing since 2020 including consideration of the computational load. In this paper, we explore the idea of direct optimisation of the Lagrangian $\lambda$ parameter used in the rate control of the encoders to estimate the optimal Rate-Distortion trade-off achievable for a High Dynamic Range signalled video clip. We show that by adjusting the Lagrange multiplier in the RD optimisation process on a frame-hierarchy basis, we are able to increase the Bjontegaard difference rate gains by more than 3.98$\times$ on average without visually affecting the quality.

Submitted 7 October, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

Comments: SPIE2022:Applications of Digital Image Processing XLV accepted manuscript

arXiv:2204.09056 [pdf, ps, other]

doi 10.1109/PCS50896.2021.9477476

Near Optimal Per-Clip Lagrangian Multiplier Prediction in HEVC

Authors: Daniel J Ringis, François Pitié, Anil Kokaram

Abstract: The majority of internet traffic is video content. This drives the demand for video compression to deliver high quality video at low target bitrates. Optimising the parameters of a video codec for a specific video clip (per-clip optimisation) has been shown to yield significant bitrate savings. In previous work we have shown that per-clip optimisation of the Lagrangian multiplier leads to up to 24% BD-Rate improvement. A key component of these algorithms is modeling the R-D characteristic across the appropriate bitrate range. This is computationally heavy as it usually involves repeated video encodes of the high resolution material at different parameter settings. This work focuses on reducing this computational load by deploying a NN operating on lower bandwidth features. Our system achieves BD-Rate improvement in approximately 90% of a large corpus with comparable results to previous work in direct optimisation.

Submitted 19 April, 2022; originally announced April 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2204.09055, arXiv:2204.08966

Journal ref: 2021 Picture Coding Symposium (PCS)

arXiv:2204.09055 [pdf, ps, other]

doi 10.1117/12.2593238

Per-clip and per-bitrate adaptation of the Lagrangian multiplier in video coding

Authors: Daniel J. Ringis, François Pitié, Anil Kokaram

Abstract: In the past ten years there have been significant developments in optimization of transcoding parameters on a per-clip rather than per-genre basis. In our recent work we have presented per-clip optimization for the Lagrangian multiplier in Rate controlled compression, which yielded BD-Rate improvements of approximately 2% across a corpus of videos using HEVC. However, in a video streaming application, the focus is on optimizing the rate/distortion tradeoff at a particular bitrate and not on average across a range of performance. We observed in previous work that a particular multiplier might give BD rate improvements over a certain range of bitrates, but not the entire range. Using different parameters across the range would improve gains overall. Therefore here we present a framework for choosing the best Lagrangian multiplier on a per-operating point basis across a range of bitrates. In effect, we are trying to find the para-optimal gain across bitrate and distortion for a single clip. In the experiments presented we employ direct optimization techniques to estimate this Lagrangian parameter path for approximately 2,000 video clips. The clips are primarily from the YouTube-UGC dataset. We optimize both for bitrate savings as well as distortion metrics (PSNR, SSIM).

Submitted 19 April, 2022; originally announced April 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2204.09056, arXiv:2204.08966

Journal ref: Applications of Digital Image Processing XLIV. Vol. 11842. International Society for Optics and Photonics, 2021

arXiv:2204.08966 [pdf, ps, other]

doi 10.1117/12.2567654

Per-clip adaptive Lagrangian multiplier optimisation with low-resolution proxies

Authors: Daniel J. Ringis, François Pitié, Anil Kokaram

Abstract: This work focuses on reducing the computational cost of repeated video encodes by using a lower resolution clip as a proxy. Features extracted from the low resolution clip are used to learn an optimal Lagrange multiplier for rate control on the original resolution clip. In addition to reducing the computational cost and encode time by using lower resolution clips, we also investigate the use of older, but faster codecs such as H.264 to create proxies. This work shows that the computational load is reduced by 22 times using 144p proxies. Our tests are based on the YouTube UGC dataset, hence our results are based on a practical instance of the adaptive bitrate encoding problem. Further improvements are possible by optimising the placement and sparsity of operating points required for the rate distortion curves.

Submitted 19 April, 2022; originally announced April 2022.

Journal ref: Proc. SPIE. 11510, Applications of Digital Image Processing XLIII 2020

arXiv:2204.08965 [pdf, ps, other]

doi 10.2352/ISSN.2470-1173.2020.10.IPAS-136

Per Clip Lagrangian Multiplier Optimisation for HEVC

Authors: Daniel J Ringis, François Pitié, Anil Kokaram

Abstract: The majority of internet traffic is video content. This drives the demand for video compression in order to deliver high quality video at low target bitrates. This paper investigates the impact of adjusting the rate distortion equation on compression performance. A constant of proportionality, k, is used to modify the Lagrange multiplier used in H.265 (HEVC). Direct optimisation methods are deployed to maximise BD-Rate improvement for a particular clip. This leads to up to 21% BD-Rate improvement for an individual clip. Furthermore we use a more realistic corpus of material provided by YouTube. The results show that direct optimisation using BD-rate as the objective function can lead to further gains in bitrate savings that are not available with previous approaches.

Submitted 19 April, 2022; originally announced April 2022.

Journal ref: Electronic Imaging 2020

arXiv:1709.08763 [pdf, ps, other]

Encoding Bitrate Optimization Using Playback Statistics for HTTP-based Adaptive Video Streaming

Authors: Chao Chen, Yao-Chung Lin, Anil Kokaram, Steve Benting

Abstract: HTTP video streaming is in wide use to deliver video over the Internet. With HTTP adaptive streaming, a video playback dynamically selects a video stream from a pre-encoded representation based on available bandwidth and viewport (screen) size. The viewer's video quality is therefore influenced by the encoded bitrates. We minimize the average delivered bitrate subject to a quality lower bound on a per-chunk basis by modeling the probability that a player selects a particular encoding. Through simulation and real-world experiments, the proposed method saves 9.6% of bandwidth compared with the state of the art while keeping the average delivered video quality.

Submitted 13 October, 2017; v1 submitted 25 September, 2017; originally announced September 2017.

Showing 1–13 of 13 results for author: Kokaram, A