Search | arXiv e-print repository

arXiv:2007.14084 [pdf, other]

doi 10.1145/3394171.3413699

Kalman Filter-based Head Motion Prediction for Cloud-based Mixed Reality

Authors: Serhan Gül, Sebastian Bosse, Dimitri Podborski, Thomas Schierl, Cornelius Hellge

Abstract: Volumetric video allows viewers to experience highly-realistic 3D content with six degrees of freedom in mixed reality (MR) environments. Rendering complex volumetric videos can require a prohibitively high amount of computational power for mobile devices. A promising technique to reduce the computational burden on mobile devices is to perform the rendering at a cloud server. However, cloud-based… ▽ More Volumetric video allows viewers to experience highly-realistic 3D content with six degrees of freedom in mixed reality (MR) environments. Rendering complex volumetric videos can require a prohibitively high amount of computational power for mobile devices. A promising technique to reduce the computational burden on mobile devices is to perform the rendering at a cloud server. However, cloud-based rendering systems suffer from an increased interaction (motion-to-photon) latency that may cause registration errors in MR environments. One way of reducing the effective latency is to predict the viewer's head pose and render the corresponding view from the volumetric video in advance. In this paper, we design a Kalman filter for head motion prediction in our cloud-based volumetric video streaming system. We analyze the performance of our approach using recorded head motion traces and compare its performance to an autoregression model for different prediction intervals (look-ahead times). Our results show that the Kalman filter can predict head orientations 0.5 degrees more accurately than the autoregression model for a look-ahead time of 60 ms. △ Less

Submitted 28 July, 2020; originally announced July 2020.

Comments: Accepted at the ACM Multimedia Conference (ACMMM) 2020. 9 pages, 9 figures

Journal ref: Proceedings of the 28th ACM International Conference on Multimedia (2020) 3632-3641

arXiv:2003.02526 [pdf, other]

doi 10.1145/3339825.3393583

Cloud Rendering-based Volumetric Video Streaming System for Mixed Reality Services

Authors: Serhan Gül, Dimitri Podborski, Jangwoo Son, Gurdeep Singh Bhullar, Thomas Buchholz, Thomas Schierl, Cornelius Hellge

Abstract: Volumetric video is an emerging technology for immersive representation of 3D spaces that captures objects from all directions using multiple cameras and creates a dynamic 3D model of the scene. However, processing volumetric content requires high amounts of processing power and is still a very demanding task for today's mobile devices. To mitigate this, we propose a volumetric video streaming sys… ▽ More Volumetric video is an emerging technology for immersive representation of 3D spaces that captures objects from all directions using multiple cameras and creates a dynamic 3D model of the scene. However, processing volumetric content requires high amounts of processing power and is still a very demanding task for today's mobile devices. To mitigate this, we propose a volumetric video streaming system that offloads the rendering to a powerful cloud/edge server and only sends the rendered 2D view to the client instead of the full volumetric content. We use 6DoF head movement prediction techniques, WebRTC protocol and hardware video encoding to ensure low-latency in different parts of the processing chain. We demonstrate our system using both a browser-based client and a Microsoft HoloLens client. Our application contains generic interfaces that allow for easy deployment of various augmented/mixed reality clients using the same server implementation. △ Less

Submitted 16 July, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

Comments: 4 pages, 2 figures

Journal ref: 11th ACM Multimedia Systems Conference (MMSys) 2020

arXiv:2001.06466 [pdf, other]

doi 10.1145/3386290.3396933

Low-latency Cloud-based Volumetric Video Streaming Using Head Motion Prediction

Authors: Serhan Gül, Dimitri Podborski, Thomas Buchholz, Thomas Schierl, Cornelius Hellge

Abstract: Volumetric video is an emerging key technology for immersive representation of 3D spaces and objects. Rendering volumetric video requires lots of computational power which is challenging especially for mobile devices. To mitigate this, we developed a streaming system that renders a 2D view from the volumetric video at a cloud server and streams a 2D video stream to the client. However, such networ… ▽ More Volumetric video is an emerging key technology for immersive representation of 3D spaces and objects. Rendering volumetric video requires lots of computational power which is challenging especially for mobile devices. To mitigate this, we developed a streaming system that renders a 2D view from the volumetric video at a cloud server and streams a 2D video stream to the client. However, such network-based processing increases the motion-to-photon (M2P) latency due to the additional network and processing delays. In order to compensate the added latency, prediction of the future user pose is necessary. We developed a head motion prediction model and investigated its potential to reduce the M2P latency for different look-ahead times. Our results show that the presented model reduces the rendering errors caused by the M2P latency compared to a baseline system in which no prediction is performed. △ Less

Submitted 16 July, 2020; v1 submitted 17 January, 2020; originally announced January 2020.

Comments: 7 pages, 4 figures

Journal ref: 30th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV) 2020

arXiv:1903.02971 [pdf, other]

doi 10.1145/3304109.3323835

HTML5 MSE Playback of MPEG 360 VR Tiled Streaming

Authors: Dimitri Podborski, Jangwoo Son, Gurdeep Singh Bhullar, Robert Skupin, Yago Sanchez, Cornelius Hellge, Thomas Schierl

Abstract: Virtual Reality (VR) and 360-degree video streaming have gained significant attention in recent years. First standards have been published in order to avoid market fragmentation. For instance, 3GPP released its first VR specification to enable 360-degree video streaming over 5G networks which relies on several technologies specified in ISO/IEC 23090-2, also known as MPEG-OMAF. While some implement… ▽ More Virtual Reality (VR) and 360-degree video streaming have gained significant attention in recent years. First standards have been published in order to avoid market fragmentation. For instance, 3GPP released its first VR specification to enable 360-degree video streaming over 5G networks which relies on several technologies specified in ISO/IEC 23090-2, also known as MPEG-OMAF. While some implementations of OMAF-compatible players have already been demonstrated at several trade shows, so far, no web browser-based implementations have been presented. In this demo paper we describe a browser-based JavaScript player implementation of the most advanced media profile of OMAF: HEVC-based viewport-dependent OMAF video profile, also known as tile-based streaming, with multi-resolution HEVC tiles. We also describe the applied workarounds for the implementation challenges we encountered with state-of-the-art HTML5 browsers. The presented implementation was tested in the Safari browser with support of HEVC video through the HTML5 Media Source Extensions API. In addition, the WebGL API was used for rendering, using region-wise packing metadata as defined in OMAF. △ Less

Submitted 23 April, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

Comments: Accepted for the demo track of ACM MMSys'19

Showing 1–4 of 4 results for author: Podborski, D