Showing 1–5 of 5 results for author: Shahgir, H S

  1. arXiv:2407.00416  [pdf, other]

    cs.CL

    Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs

    Authors: Tamzeed Mahfuz, Satak Kumar Dey, Ruwad Naswan, Hasnaen Adil, Khondker Salman Sayeed, Haz Sameen Shahgir

    Abstract: Each new generation of English-oriented Large Language Models (LLMs) exhibits enhanced cross-lingual transfer capabilities and significantly outperforms older LLMs on low-resource languages. This prompts the question: Is there a need for LLMs dedicated to a particular low-resource language? We aim to explore this question for Bengali, a low-to-moderate resource Indo-Aryan language native to the Be…

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2403.15952  [pdf, other]

    cs.CV cs.CL

    IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models

    Authors: Haz Sameen Shahgir, Khondker Salman Sayeed, Abhik Bhattacharjee, Wasi Uddin Ahmad, Yue Dong, Rifat Shahriyar

    Abstract: The advent of Vision Language Models (VLM) has allowed researchers to investigate the visual understanding of a neural network using natural language. Beyond object classification and detection, VLMs are capable of visual comprehension and common-sense reasoning. This naturally led to the question: How do VLMs respond when the image itself is inherently unreasonable? To this end, we present Illusi…

    Submitted 30 March, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

  3. arXiv:2401.12210  [pdf, other]

    cs.CV

    Connecting the Dots: Leveraging Spatio-Temporal Graph Neural Networks for Accurate Bangla Sign Language Recognition

    Authors: Haz Sameen Shahgir, Khondker Salman Sayeed, Md Toki Tahmid, Tanjeem Azwad Zaman, Md. Zarif Ul Alam

    Abstract: Recent advances in Deep Learning and Computer Vision have been successfully leveraged to serve marginalized communities in various contexts. One such area is Sign Language - a primary means of communication for the deaf community. However, so far, the bulk of research efforts and investments have gone into American Sign Language, and research activity into low-resource sign languages - especially…

    Submitted 22 January, 2024; originally announced January 2024.

  4. arXiv:2312.14440  [pdf, other]

    cs.LG cs.CR

    Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks

    Authors: Haz Sameen Shahgir, Xianghao Kong, Greg Ver Steeg, Yue Dong

    Abstract: The widespread use of Text-to-Image (T2I) models in content generation requires careful examination of their safety, including their robustness to adversarial attacks. Despite extensive research on adversarial attacks, the reasons for their effectiveness remain underexplored. This paper presents an empirical study on adversarial attacks against T2I models, focusing on analyzing factors associated…

    Submitted 14 February, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: preprint version

  5. arXiv:2303.09306  [pdf, ps, other]

    cs.CL cs.AI

    BanglaCoNER: Towards Robust Bangla Complex Named Entity Recognition

    Authors: HAZ Sameen Shahgir, Ramisa Alam, Md. Zarif Ul Alam

    Abstract: Named Entity Recognition (NER) is a fundamental task in natural language processing that involves identifying and classifying named entities in text. But much work hasn't been done for complex named entity recognition in Bangla, despite being the seventh most spoken language globally. CNER is a more challenging task than traditional NER as it involves identifying and classifying complex and compou…

    Submitted 17 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Winning Solution for the Bangla Complex Named Entity Recognition Challenge