Showing 1–1 of 1 results for author: Mehran, R

  1. arXiv:2002.03977

    eess.AS cs.LG cs.MM stat.ML

    Multimodal active speaker detection and virtual cinematography for video conferencing

    Authors: Ross Cutler, Ramin Mehran, Sam Johnson, Cha Zhang, Adam Kirk, Oliver Whyte, Adarsh Kowdle

    Abstract: Active speaker detection (ASD) and virtual cinematography (VC) can significantly improve the remote user experience of a video conference by automatically panning, tilting, and zooming a video conferencing camera: users subjectively rate an expert video cinematographer's video significantly higher than unedited video. We describe a new automated ASD and VC that performs within 0.3 MOS of an expe…

    Submitted 24 May, 2022; v1 submitted 10 February, 2020; originally announced February 2020.