Showing 1–1 of 1 results for author: Chaurasia, P

  1. arXiv:2406.19237  [pdf, other]

    cs.CL cs.CV cs.IR cs.LG

    FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts

    Authors: Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth

    Abstract: Existing benchmarks for visual question answering lack in visual grounding and complexity, particularly in evaluating spatial reasoning skills. We introduce FlowVQA, a novel benchmark aimed at assessing the capabilities of visual question-answering multimodal language models in reasoning with flowcharts as visual contexts. FlowVQA comprises 2,272 carefully generated and human-verified flowchart im…

    Submitted 28 June, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted in ACL 2024 (Findings), 21 pages, 7 figures, 9 Tables