Showing 1–1 of 1 results for author: Mincato, E

  1. arXiv:2310.14859

    cs.CV cs.CL

    3M-TRANSFORMER: A Multi-Stage Multi-Stream Multimodal Transformer for Embodied Turn-Taking Prediction

    Authors: Mehdi Fatan, Emanuele Mincato, Dimitra Pintzou, Mariella Dimiccoli

    Abstract: Predicting turn-taking in multiparty conversations has many practical applications in human-computer/robot interaction. However, the complexity of human communication makes it a challenging task. Recent advances have shown that synchronous multi-perspective egocentric data can significantly improve turn-taking prediction compared to asynchronous, single-perspective transcriptions. Building on this…

    Submitted 21 December, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to ICASSP 2024