Showing 1–1 of 1 results for author: Tavazoee, F

  1. arXiv:2402.11058  [pdf, other]

    cs.CV cs.CL

    II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering

    Authors: Jihyung Kil, Farideh Tavazoee, Dongyeop Kang, Joo-Kyung Kim

    Abstract: Visual Question Answering (VQA) often involves diverse reasoning scenarios across Vision and Language (V&L). Most prior VQA studies, however, have merely focused on assessing the model's overall accuracy without evaluating it on different reasoning cases. Furthermore, some recent works observe that conventional Chain-of-Thought (CoT) prompting fails to generate effective reasoning for VQA, especia…

    Submitted 2 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024 Findings