Showing 1–1 of 1 results for author: Eldeshnawy, F

  1. arXiv:2108.03543  [pdf, other]

    cs.CV cs.AI

    Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading

    Authors: Shahd Elashmawy, Marian Ramsis, Hesham M. Eraqi, Farah Eldeshnawy, Hadeel Mabrouk, Omar Abugabal, Nourhan Sakr

    Abstract: Despite the advancement in the domain of audio and audio-visual speech recognition, visual speech recognition systems are still quite under-explored due to the visual ambiguity of some phonemes. In this work, we propose a new lip-reading model that combines three contributions. First, the model front-end adopts a spatio-temporal attention mechanism to help extract the informative data from the inp…

    Submitted 7 August, 2021; originally announced August 2021.