Skip to main content

Showing 1–3 of 3 results for author: Shahgir, H A Z S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.14005  [pdf, ps, other

    eess.IV cs.CV

    Ophthalmic Biomarker Detection Using Ensembled Vision Transformers -- Winning Solution to IEEE SPS VIP Cup 2023

    Authors: H. A. Z. Sameen Shahgir, Khondker Salman Sayeed, Tanjeem Azwad Zaman, Md. Asif Haider, Sheikh Saifur Rahman Jony, M. Sohel Rahman

    Abstract: This report outlines our approach in the IEEE SPS VIP Cup 2023: Ophthalmic Biomarker Detection competition. Our primary objective in this competition was to identify biomarkers from Optical Coherence Tomography (OCT) images obtained from a diverse range of patients. Using robust augmentations and 5-fold cross-validation, we trained two vision transformer-based models: MaxViT and EVA-02, and ensemb… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  2. arXiv:2303.10612  [pdf, other

    cs.CL cs.LG

    Bangla Grammatical Error Detection Using T5 Transformer Model

    Authors: H. A. Z. Sameen Shahgir, Khondker Salman Sayeed

    Abstract: This paper presents a method for detecting grammatical errors in Bangla using a Text-to-Text Transfer Transformer (T5) Language Model, using the small variant of BanglaT5, fine-tuned on a corpus of 9385 sentences where errors were bracketed by the dedicated demarcation symbol. The T5 model was primarily designed for translation and is not specifically designed for this task, so extensive post-proc… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

  3. arXiv:2209.06581  [pdf, ps, other

    eess.AS cs.AI cs.LG

    Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset

    Authors: H. A. Z. Sameen Shahgir, Khondker Salman Sayeed, Tanjeem Azwad Zaman

    Abstract: Speech is inherently continuous, where discrete words, phonemes and other units are not clearly segmented, and so speech recognition has been an active research problem for decades. In this work we have fine-tuned wav2vec 2.0 to recognize and transcribe Bengali speech -- training it on the Bengali Common Voice Speech Dataset. After training for 71 epochs, on a training set consisting of 36919 mp3… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: 5 pages