Skip to main content

Showing 1–2 of 2 results for author: Muhtar, D

.
  1. arXiv:2402.02544  [pdf, other

    cs.CV cs.AI cs.LG

    LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model

    Authors: Dilxat Muhtar, Zhenshi Li, Feng Gu, Xueliang Zhang, Pengfeng Xiao

    Abstract: The revolutionary capabilities of large language models (LLMs) have paved the way for multimodal large language models (MLLMs) and fostered diverse applications across various specialized domains. In the remote sensing (RS) field, however, the diverse geographical landscapes and varied objects in RS imagery are not adequately considered in recent MLLM endeavors. To bridge this gap, we construct a… ▽ More

    Submitted 18 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: 36 pages, 10 figures. Github https://github.com/NJU-LHRS/LHRS-Bot

  2. CMID: A Unified Self-Supervised Learning Framework for Remote Sensing Image Understanding

    Authors: Dilxat Muhtar, Xueliang Zhang, Pengfeng Xiao, Zhenshi Li, Feng Gu

    Abstract: Self-supervised learning (SSL) has gained widespread attention in the remote sensing (RS) and earth observation (EO) communities owing to its ability to learn task-agnostic representations without human-annotated labels. Nevertheless, most existing RS SSL methods are limited to learning either global semantic separable or local spatial perceptible representations. We argue that this learning strat… ▽ More

    Submitted 3 August, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: Accepted by IEEE TGRS. The codes and models are released at https://github.com/NJU-LHRS/official-CMID

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1-17, 2023, Art no. 5607817