Showing 1–2 of 2 results for author: Zinonos, A

Searching in archive cs.
  1. arXiv:2404.02098  [pdf, other]

    cs.CV

    BRAVEn: Improving Self-Supervised Pre-training for Visual and Auditory Speech Recognition

    Authors: Alexandros Haliassos, Andreas Zinonos, Rodrigo Mira, Stavros Petridis, Maja Pantic

    Abstract: Self-supervision has recently shown great promise for learning visual and auditory speech representations from unlabelled data. In this work, we propose BRAVEn, an extension to the recent RAVEn method, which learns speech representations entirely from raw audio-visual data. Our modifications to RAVEn enable BRAVEn to achieve state-of-the-art results among self-supervised methods in various setting…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: ICASSP 2024. Code: https://github.com/ahaliassos/raven

  2. arXiv:2303.09455  [pdf, other]

    cs.CL cs.CV cs.LG cs.SD eess.AS

    Learning Cross-lingual Visual Speech Representations

    Authors: Andreas Zinonos, Alexandros Haliassos, **chuan Ma, Stavros Petridis, Maja Pantic

    Abstract: Cross-lingual self-supervised learning has been a growing research topic in the last few years. However, current works only explored the use of audio signals to create representations. In this work, we study cross-lingual self-supervised visual representation learning. We use the recently-proposed Raw Audio-Visual Speech Encoders (RAVEn) framework to pre-train an audio-visual model with unlabelled…

    Submitted 14 March, 2023; originally announced March 2023.