Skip to main content

Showing 1–1 of 1 results for author: Rawal, I S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.08889  [pdf, other

    cs.CV cs.AI

    Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion

    Authors: Ishaan Singh Rawal, Alexander Matyasko, Shantanu Jaiswal, Basura Fernando, Cheston Tan

    Abstract: While VideoQA Transformer models demonstrate competitive performance on standard benchmarks, the reasons behind their success are not fully understood. Do these models capture the rich multimodal structures and dynamics from video and text jointly? Or are they achieving high scores by exploiting biases and spurious features? Hence, to provide insights, we design $\textit{QUAG}$ (QUadrant AveraGe),… ▽ More

    Submitted 7 June, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2024