Showing 1–1 of 1 results for author: Akula, J

  1. arXiv:2104.04598  [pdf, other]

    cs.SD cs.CV cs.LG eess.AS eess.IV

    Cross-Modal learning for Audio-Visual Video Parsing

    Authors: Jatin Lamba, Abhishek, Jayaprakash Akula, Rishabh Dabral, Preethi Jyothi, Ganesh Ramakrishnan

    Abstract: In this paper, we present a novel approach to the audio-visual video parsing (AVVP) task that demarcates events from a video separately for audio and visual modalities. The proposed parsing approach simultaneously detects the temporal boundaries in terms of start and end times of such events. We show how AVVP can benefit from the following techniques geared towards effective cross-modal learning:…

    Submitted 21 June, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Comments: Work accepted at Interspeech 2021