Showing 1–1 of 1 results for author: Wuerkaixi, A

  1. arXiv:2206.10421

    cs.SD cs.AI cs.CV cs.MM eess.AS

    Rethinking Audio-visual Synchronization for Active Speaker Detection

    Authors: Abudukelimu Wuerkaixi, You Zhang, Zhiyao Duan, Changshui Zhang

    Abstract: Active speaker detection (ASD) systems are important modules for analyzing multi-talker conversations. They aim to detect which speakers or none are talking in a visual scene at any given time. Existing research on ASD does not agree on the definition of active speakers. We clarify the definition in this work and require synchronization between the audio and visual speaking activities. This clarif…

    Submitted 10 July, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Accepted by IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2022)