
Filling the gaps in video transcoder deployment in the cloud

Authors: Vibhoothi, Daniel Joseph Ringis, Xin Shu, François Pitié, Zsolt Lorincz, Philippe Brodeur, Anil Kokaram

Abstract: Cloud-based deployment of content production and broadcast workflows has continued to disrupt the industry after the pandemic. The key tools required for unlocking cloud workflows, e.g., transcoding, metadata parsing, and streaming playback, are increasingly commoditized. However, as video traffic continues to increase there is a need to consider tools which offer opportunities for further bitrate/quality gains as well as those which facilitate cloud deployment. In this paper we consider preprocessing, rate/distortion optimisation and cloud cost prediction tools which are only just emerging from the research community. These tools are posed as part of the per-clip optimisation approach to transcoding which has been adopted by large streaming media processing entities but has yet to be made more widely available for the industry.

Submitted 17 April, 2023; originally announced April 2023.

Comments: Camera-ready version of BEIT Conference at NAB 2023

arXiv:2208.11150 [pdf, other]

doi 10.1117/12.2632272

Direct Optimisation of $\boldsymbol{\lambda}$ for HDR Content Adaptive Transcoding in AV1

Authors: Vibhoothi, François Pitié, Angeliki Katsenou, Daniel Joseph Ringis, Ye** Su, Neil Birkbeck, Jessie Lin, Balu Adsumilli, Anil Kokaram

Abstract: Since the adoption of VP9 by Netflix in 2016, royalty-free coding standards continued to gain prominence through the activities of the AOMedia consortium. AV1, the latest open source standard, is now widely supported. In the early years after standardisation, HDR video tends to be underserved in open source encoders for a variety of reasons including the relatively small amount of true HDR content being broadcast and the challenges in RD optimisation with that material. AV1 codec optimisation has been ongoing since 2020 including consideration of the computational load. In this paper, we explore the idea of direct optimisation of the Lagrangian $\lambda$ parameter used in the rate control of the encoders to estimate the optimal Rate-Distortion trade-off achievable for a High Dynamic Range signalled video clip. We show that by adjusting the Lagrange multiplier in the RD optimisation process on a frame-hierarchy basis, we are able to increase the Bjontegaard difference rate gains by more than 3.98$\times$ on average without visually affecting the quality.

Submitted 7 October, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

Comments: SPIE2022:Applications of Digital Image Processing XLV accepted manuscript

arXiv:2204.09056 [pdf, ps, other]

doi 10.1109/PCS50896.2021.9477476

Near Optimal Per-Clip Lagrangian Multiplier Prediction in HEVC

Authors: Daniel J Ringis, François Pitié, Anil Kokaram

Abstract: The majority of internet traffic is video content. This drives the demand for video compression to deliver high quality video at low target bitrates. Optimising the parameters of a video codec for a specific video clip (per-clip optimisation) has been shown to yield significant bitrate savings. In previous work we have shown that per-clip optimisation of the Lagrangian multiplier leads to up to 24% BD-Rate improvement. A key component of these algorithms is modeling the R-D characteristic across the appropriate bitrate range. This is computationally heavy as it usually involves repeated video encodes of the high resolution material at different parameter settings. This work focuses on reducing this computational load by deploying a NN operating on lower bandwidth features. Our system achieves BD-Rate improvement in approximately 90% of a large corpus with comparable results to previous work in direct optimisation.

Submitted 19 April, 2022; originally announced April 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2204.09055, arXiv:2204.08966

Journal ref: 2021 Picture Coding Symposium (PCS)

arXiv:2204.09055 [pdf, ps, other]

doi 10.1117/12.2593238

Per-clip and per-bitrate adaptation of the Lagrangian multiplier in video coding

Authors: Daniel J. Ringis, François Pitié, Anil Kokaram

Abstract: In the past ten years there have been significant developments in optimization of transcoding parameters on a per-clip rather than per-genre basis. In our recent work we have presented per-clip optimization for the Lagrangian multiplier in Rate controlled compression, which yielded BD-Rate improvements of approximately 2% across a corpus of videos using HEVC. However, in a video streaming application, the focus is on optimizing the rate/distortion tradeoff at a particular bitrate and not on average across a range of performance. We observed in previous work that a particular multiplier might give BD-Rate improvements over a certain range of bitrates, but not the entire range. Using different parameters across the range would improve gains overall. Therefore here we present a framework for choosing the best Lagrangian multiplier on a per-operating point basis across a range of bitrates. In effect, we are trying to find the para-optimal gain across bitrate and distortion for a single clip. In the experiments presented we employ direct optimization techniques to estimate this Lagrangian parameter path for approximately 2,000 video clips. The clips are primarily from the YouTube-UGC dataset. We optimize both for bitrate savings as well as distortion metrics (PSNR, SSIM).

Submitted 19 April, 2022; originally announced April 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2204.09056, arXiv:2204.08966

Journal ref: Applications of Digital Image Processing XLIV. Vol. 11842. International Society for Optics and Photonics, 2021

arXiv:2204.08966 [pdf, ps, other]

doi 10.1117/12.2567654

Per-clip adaptive Lagrangian multiplier optimisation with low-resolution proxies

Authors: Daniel J. Ringis, François Pitié, Anil Kokaram

Abstract: This work focuses on reducing the computational cost of repeated video encodes by using a lower resolution clip as a proxy. Features extracted from the low resolution clip are used to learn an optimal Lagrange multiplier for rate control on the original resolution clip. In addition to reducing the computational cost and encode time by using lower resolution clips, we also investigate the use of older, but faster codecs such as H.264 to create proxies. This work shows that the computational load is reduced by 22 times using 144p proxies. Our tests are based on the YouTube UGC dataset, hence our results are based on a practical instance of the adaptive bitrate encoding problem. Further improvements are possible by optimising the placement and sparsity of operating points required for the rate distortion curves.

Submitted 19 April, 2022; originally announced April 2022.

Journal ref: Proc. SPIE. 11510, Applications of Digital Image Processing XLIII 2020

arXiv:2204.08965 [pdf, ps, other]

doi 10.2352/ISSN.2470-1173.2020.10.IPAS-136

Per Clip Lagrangian Multiplier Optimisation for HEVC

Authors: Daniel J Ringis, François Pitié, Anil Kokaram

Abstract: The majority of internet traffic is video content. This drives the demand for video compression in order to deliver high quality video at low target bitrates. This paper investigates the impact of adjusting the rate distortion equation on compression performance. A constant of proportionality, k, is used to modify the Lagrange multiplier used in H.265 (HEVC). Direct optimisation methods are deployed to maximise BD-Rate improvement for a particular clip. This leads to up to 21% BD-Rate improvement for an individual clip. Furthermore, we use a more realistic corpus of material provided by YouTube. The results show that direct optimisation using BD-Rate as the objective function can lead to further gains in bitrate savings that are not available with previous approaches.

Submitted 19 April, 2022; originally announced April 2022.

Journal ref: Electronic Imaging 2020

arXiv:2007.11948 [pdf, ps, other]

doi 10.1117/12.2322411

Using modern motion estimation algorithms in existing video codecs

Authors: Daniel J. Ringis, Davinder Singh, Francois Pitie, Anil Kokaram

Abstract: Motion estimation is a key component of any modern video codec. Our understanding of motion and the estimation of motion from video has come a very long way since 2000. More than 135 different algorithms have been recently reviewed by Scharstein et al. (http://vision.middlebury.edu/flow/). These new algorithms differ markedly from Block Matching, which has been the mainstay of video compression for some time. This paper presents comparisons of H.264 and MP4 compression using different motion estimation methods. In so doing we also present methods for adapting pre-computed motion fields for use within a codec. We do not observe significant gains to be had with the methods chosen w.r.t. Rate Distortion tradeoffs but the results reflect a significantly more complex interrelationship between motion and compression than would be expected. There remains much more to be done to improve the coverage of this comparison to the emerging standards but these initial results show that there is value in these explorations.

Submitted 23 July, 2020; originally announced July 2020.

Journal ref: Proc. SPIE 10752, Applications of Digital Image Processing XLI, 107520S (17 September 2018)

Showing 1–7 of 7 results for author: Ringis, D J