Showing 1–6 of 6 results for author: Sayeed, K S

  1. arXiv:2407.00416  [pdf, other]

    cs.CL

    Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs

    Authors: Tamzeed Mahfuz, Satak Kumar Dey, Ruwad Naswan, Hasnaen Adil, Khondker Salman Sayeed, Haz Sameen Shahgir

    Abstract: Each new generation of English-oriented Large Language Models (LLMs) exhibits enhanced cross-lingual transfer capabilities and significantly outperforms older LLMs on low-resource languages. This prompts the question: Is there a need for LLMs dedicated to a particular low-resource language? We aim to explore this question for Bengali, a low-to-moderate resource Indo-Aryan language native to the Be…

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2403.15952  [pdf, other]

    cs.CV cs.CL

    IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models

    Authors: Haz Sameen Shahgir, Khondker Salman Sayeed, Abhik Bhattacharjee, Wasi Uddin Ahmad, Yue Dong, Rifat Shahriyar

    Abstract: The advent of Vision Language Models (VLM) has allowed researchers to investigate the visual understanding of a neural network using natural language. Beyond object classification and detection, VLMs are capable of visual comprehension and common-sense reasoning. This naturally led to the question: How do VLMs respond when the image itself is inherently unreasonable? To this end, we present Illusi…

    Submitted 30 March, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

  3. arXiv:2401.12210  [pdf, other]

    cs.CV

    Connecting the Dots: Leveraging Spatio-Temporal Graph Neural Networks for Accurate Bangla Sign Language Recognition

    Authors: Haz Sameen Shahgir, Khondker Salman Sayeed, Md Toki Tahmid, Tanjeem Azwad Zaman, Md. Zarif Ul Alam

    Abstract: Recent advances in Deep Learning and Computer Vision have been successfully leveraged to serve marginalized communities in various contexts. One such area is Sign Language - a primary means of communication for the deaf community. However, so far, the bulk of research efforts and investments have gone into American Sign Language, and research activity into low-resource sign languages - especially…

    Submitted 22 January, 2024; originally announced January 2024.

  4. arXiv:2310.14005  [pdf, ps, other]

    eess.IV cs.CV

    Ophthalmic Biomarker Detection Using Ensembled Vision Transformers -- Winning Solution to IEEE SPS VIP Cup 2023

    Authors: H. A. Z. Sameen Shahgir, Khondker Salman Sayeed, Tanjeem Azwad Zaman, Md. Asif Haider, Sheikh Saifur Rahman Jony, M. Sohel Rahman

    Abstract: This report outlines our approach in the IEEE SPS VIP Cup 2023: Ophthalmic Biomarker Detection competition. Our primary objective in this competition was to identify biomarkers from Optical Coherence Tomography (OCT) images obtained from a diverse range of patients. Using robust augmentations and 5-fold cross-validation, we trained two vision transformer-based models: MaxViT and EVA-02, and ensemb…

    Submitted 21 October, 2023; originally announced October 2023.

  5. arXiv:2303.10612  [pdf, other]

    cs.CL cs.LG

    Bangla Grammatical Error Detection Using T5 Transformer Model

    Authors: H. A. Z. Sameen Shahgir, Khondker Salman Sayeed

    Abstract: This paper presents a method for detecting grammatical errors in Bangla using a Text-to-Text Transfer Transformer (T5) Language Model, using the small variant of BanglaT5, fine-tuned on a corpus of 9385 sentences where errors were bracketed by the dedicated demarcation symbol. The T5 model was primarily designed for translation and is not specifically designed for this task, so extensive post-proc…

    Submitted 19 March, 2023; originally announced March 2023.

  6. arXiv:2209.06581  [pdf, ps, other]

    eess.AS cs.AI cs.LG

    Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset

    Authors: H. A. Z. Sameen Shahgir, Khondker Salman Sayeed, Tanjeem Azwad Zaman

    Abstract: Speech is inherently continuous, where discrete words, phonemes and other units are not clearly segmented, and so speech recognition has been an active research problem for decades. In this work we have fine-tuned wav2vec 2.0 to recognize and transcribe Bengali speech -- training it on the Bengali Common Voice Speech Dataset. After training for 71 epochs, on a training set consisting of 36919 mp3…

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: 5 pages