Skip to main content

Showing 1–1 of 1 results for author: Khanfar, N O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13898  [pdf

    cs.CV cs.CL cs.CY

    The Use of Multimodal Large Language Models to Detect Objects from Thermal Images: Transportation Applications

    Authors: Huthaifa I. Ashqar, Taqwa I. Alhadidi, Mohammed Elhenawy, Nour O. Khanfar

    Abstract: The integration of thermal imaging data with Multimodal Large Language Models (MLLMs) constitutes an exciting opportunity for improving the safety and functionality of autonomous driving systems and many Intelligent Transportation Systems (ITS) applications. This study investigates whether MLLMs can understand complex images from RGB and thermal cameras and detect objects directly. Our goals were… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.