Skip to main content

Showing 1–5 of 5 results for author: Samin, A M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.11685  [pdf, other

    cs.CV cs.CL

    ColorFoil: Investigating Color Blindness in Large Vision and Language Models

    Authors: Ahnaf Mozib Samin, M. Firoz Ahmed, Md. Mushtaq Shahriyar Rafee

    Abstract: With the utilization of Transformer architecture, large Vision and Language (V&L) models have shown promising performance in even zero-shot settings. Several studies, however, indicate a lack of robustness of the models when dealing with complex linguistics and visual attributes. In this work, we introduce a novel V&L benchmark - ColorFoil, by creating color-related foils to assess the models' per… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  2. arXiv:2401.15532  [pdf, other

    cs.CL cs.SD eess.AS

    Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition

    Authors: Ahnaf Mozib Samin

    Abstract: Byte pair encoding (BPE) emerges as an effective tokenization method for tackling the out-of-vocabulary (OOV) challenge in various natural language and speech processing tasks. Recent research highlights the dependency of BPE subword tokenization's efficacy on the morphological nature of the language, particularly in languages rich in inflectional morphology, where fewer BPE merges suffice for gen… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Under-review

  3. arXiv:2211.14995  [pdf, other

    cs.CL

    Arguments to Key Points Map** with Prompt-based Learning

    Authors: Ahnaf Mozib Samin, Behrooz Nikandish, **gyan Chen

    Abstract: Handling and digesting a huge amount of information in an efficient manner has been a long-term demand in modern society. Some solutions to map key points (short textual summaries capturing essential information and filtering redundancies) to a large number of arguments/opinions have been provided recently (Bar-Haim et al., 2020). To complement the full picture of the argument-to-keypoint map**… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted at ICNLSP 2022

  4. arXiv:2211.06366  [pdf, other

    cs.CL stat.AP

    Analysis of Male and Female Speakers' Word Choices in Public Speeches

    Authors: Md Zobaer Hossain, Ahnaf Mozib Samin

    Abstract: The extent to which men and women use language differently has been questioned previously. Finding clear and consistent gender differences in language is not conclusive in general, and the research is heavily influenced by the context and method employed to identify the difference. In addition, the majority of the research was conducted in written form, and the sample was collected in writing. The… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  5. arXiv:2210.12921  [pdf

    cs.CL cs.SD eess.AS

    Investigating self-supervised, weakly supervised and fully supervised training approaches for multi-domain automatic speech recognition: a study on Bangladeshi Bangla

    Authors: Ahnaf Mozib Samin, M. Humayon Kobir, Md. Mushtaq Shahriyar Rafee, M. Firoz Ahmed, Mehedi Hasan, Partha Ghosh, Shafkat Kibria, M. Shahidur Rahman

    Abstract: Despite huge improvements in automatic speech recognition (ASR) employing neural networks, ASR systems still suffer from a lack of robustness and generalizability issues due to domain shifting. This is mainly because principal corpus design criteria are often not identified and examined adequately while compiling ASR datasets. In this study, we investigate the robustness of the state-of-the-art tr… ▽ More

    Submitted 10 May, 2023; v1 submitted 23 October, 2022; originally announced October 2022.