Skip to main content

Showing 1–1 of 1 results for author: Tomich, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.07917  [pdf, other

    cs.AI cs.CL

    DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation

    Authors: Anna C. Doris, Daniele Grandi, Ryan Tomich, Md Ferdous Alam, Hyunmin Cheong, Faez Ahmed

    Abstract: This research introduces DesignQA, a novel benchmark aimed at evaluating the proficiency of multimodal large language models (MLLMs) in comprehending and applying engineering requirements in technical documentation. Developed with a focus on real-world engineering challenges, DesignQA uniquely combines multimodal data-including textual design requirements, CAD images, and engineering drawings-deri… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.