Skip to main content

Showing 1–17 of 17 results for author: Raihan, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00581  [pdf, other

    cs.CL

    MasonTigers at SemEval-2024 Task 10: Emotion Discovery and Flip Reasoning in Conversation with Ensemble of Transformers and Prompting

    Authors: Al Nahian Bin Emran, Amrita Ganguly, Sadiya Sayara Chowdhury Puspo, Nishat Raihan, Dhiman Goswami

    Abstract: In this paper, we present MasonTigers' participation in SemEval-2024 Task 10, a shared task aimed at identifying emotions and understanding the rationale behind their flips within monolingual English and Hindi-English code-mixed dialogues. This task comprises three distinct subtasks - emotion recognition in conversation for Hindi-English code-mixed dialogues, emotion flip reasoning for Hindi-Engli… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2405.06922  [pdf, other

    cs.CL

    EmoMix-3L: A Code-Mixed Dataset for Bangla-English-Hindi Emotion Detection

    Authors: Nishat Raihan, Dhiman Goswami, Antara Mahmud, Antonios Anastasopoulos, Marcos Zampieri

    Abstract: Code-mixing is a well-studied linguistic phenomenon that occurs when two or more languages are mixed in text or speech. Several studies have been conducted on building datasets and performing downstream NLP tasks on code-mixed data. Although it is not uncommon to observe code-mixing of three or more languages, most available datasets in this domain contain code-mixed data from only two languages.… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.18387, arXiv:2310.18023

  3. arXiv:2404.02540  [pdf, ps, other

    cs.CL

    CSEPrompts: A Benchmark of Introductory Computer Science Prompts

    Authors: Nishat Raihan, Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Christian Newman, Tharindu Ranasinghe, Marcos Zampieri

    Abstract: Recent advances in AI, machine learning, and NLP have led to the development of a new generation of Large Language Models (LLMs) that are trained on massive amounts of data and often have trillions of parameters. Commercial applications (e.g., ChatGPT) have made this technology available to the general public, thus making it possible to use LLMs to produce high-quality texts for academic and profe… ▽ More

    Submitted 4 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2403.14990  [pdf, other

    cs.CL

    MasonTigers at SemEval-2024 Task 1: An Ensemble Approach for Semantic Textual Relatedness

    Authors: Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Md Nishat Raihan, Al Nahian Bin Emran, Amrita Ganguly, Marcos Zampieri

    Abstract: This paper presents the MasonTigers entry to the SemEval-2024 Task 1 - Semantic Textual Relatedness. The task encompasses supervised (Track A), unsupervised (Track B), and cross-lingual (Track C) approaches across 14 different languages. MasonTigers stands out as one of the two teams who participated in all languages across the three tracks. Our approaches achieved rankings ranging from 11th to 21… ▽ More

    Submitted 5 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  5. arXiv:2403.14989  [pdf, other

    cs.CL

    MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection

    Authors: Sadiya Sayara Chowdhury Puspo, Md Nishat Raihan, Dhiman Goswami, Al Nahian Bin Emran, Amrita Ganguly, Ozlem Uzuner

    Abstract: This paper presents the MasonTigers entry to the SemEval-2024 Task 8 - Multigenerator, Multidomain, and Multilingual Black-Box Machine-Generated Text Detection. The task encompasses Binary Human-Written vs. Machine-Generated Text Classification (Track A), Multi-Way Machine-Generated Text Classification (Track B), and Human-Machine Mixed Text Detection (Track C). Our best performing approaches util… ▽ More

    Submitted 5 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  6. arXiv:2403.14982  [pdf, other

    cs.CL

    MasonTigers at SemEval-2024 Task 9: Solving Puzzles with an Ensemble of Chain-of-Thoughts

    Authors: Md Nishat Raihan, Dhiman Goswami, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Amrita Ganguly, Marcos Zampieri

    Abstract: Our paper presents team MasonTigers submission to the SemEval-2024 Task 9 - which provides a dataset of puzzles for testing natural language understanding. We employ large language models (LLMs) to solve this task through several prompting techniques. Zero-shot and few-shot prompting generate reasonably good results when tested with proprietary LLMs, compared to the open-source models. We obtain f… ▽ More

    Submitted 3 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  7. arXiv:2402.01976  [pdf, other

    cs.CL

    MasonPerplexity at ClimateActivism 2024: Integrating Advanced Ensemble Techniques and Data Augmentation for Climate Activism Stance and Hate Event Identification

    Authors: Al Nahian Bin Emran, Amrita Ganguly, Sadiya Sayara Chowdhury Puspo, Dhiman Goswami, Md Nishat Raihan

    Abstract: The task of identifying public opinions on social media, particularly regarding climate activism and the detection of hate events, has emerged as a critical area of research in our rapidly changing world. With a growing number of people voicing either to support or oppose to climate-related issues - understanding these diverse viewpoints has become increasingly vital. Our team, MasonPerplexity, pa… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  8. arXiv:2402.01967  [pdf, other

    cs.CL

    MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles

    Authors: Amrita Ganguly, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Md Nishat Raihan, Dhiman Goswami, Marcos Zampieri

    Abstract: The automatic identification of offensive language such as hate speech is important to keep discussions civil in online communities. Identifying hate speech in multimodal content is a particularly challenging task because offensiveness can be manifested in either words or images or a juxtaposition of the two. This paper presents the MasonPerplexity submission for the Shared Task on Multimodal Hate… ▽ More

    Submitted 18 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  9. arXiv:2401.14681  [pdf, other

    cs.CL

    MasonTigers@LT-EDI-2024: An Ensemble Approach Towards Detecting Homophobia and Transphobia in Social Media Comments

    Authors: Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Md Nishat Raihan, Al Nahian Bin Emran

    Abstract: In this paper, we describe our approaches and results for Task 2 of the LT-EDI 2024 Workshop, aimed at detecting homophobia and/or transphobia across ten languages. Our methodologies include monolingual transformers and ensemble methods, capitalizing on the strengths of each to enhance the performance of the models. The ensemble models worked well, placing our team, MasonTigers, in the top five fo… ▽ More

    Submitted 15 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  10. arXiv:2311.15032  [pdf, other

    cs.CL

    nlpBDpatriots at BLP-2023 Task 2: A Transfer Learning Approach to Bangla Sentiment Analysis

    Authors: Dhiman Goswami, Md Nishat Raihan, Sadiya Sayara Chowdhury Puspo, Marcos Zampieri

    Abstract: In this paper, we discuss the nlpBDpatriots entry to the shared task on Sentiment Analysis of Bangla Social Media Posts organized at the first workshop on Bangla Language Processing (BLP) co-located with EMNLP. The main objective of this task is to identify the polarity of social media content using a Bangla dataset annotated with positive, neutral, and negative labels provided by the shared task… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  11. arXiv:2311.15029  [pdf, other

    cs.CL

    nlpBDpatriots at BLP-2023 Task 1: A Two-Step Classification for Violence Inciting Text Detection in Bangla

    Authors: Md Nishat Raihan, Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Marcos Zampieri

    Abstract: In this paper, we discuss the nlpBDpatriots entry to the shared task on Violence Inciting Text Detection (VITD) organized as part of the first workshop on Bangla Language Processing (BLP) co-located with EMNLP. The aim of this task is to identify and classify the violent threats, that provoke further unlawful violent acts. Our best-performing approach for the task is two-step classification using… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  12. arXiv:2311.15023  [pdf, other

    cs.CL

    Offensive Language Identification in Transliterated and Code-Mixed Bangla

    Authors: Md Nishat Raihan, Umma Hani Tanmoy, Anika Binte Islam, Kai North, Tharindu Ranasinghe, Antonios Anastasopoulos, Marcos Zampieri

    Abstract: Identifying offensive content in social media is vital for creating safe online communities. Several recent studies have addressed this problem by creating datasets for various languages. In this paper, we explore offensive language identification in texts with transliterations and code-mixing, linguistic phenomena common in multilingual societies, and a known challenge for NLP systems. We introdu… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  13. arXiv:2310.18387  [pdf, other

    cs.CL cs.AI

    OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language Identification

    Authors: Dhiman Goswami, Md Nishat Raihan, Antara Mahmud, Antonios Anastasopoulos, Marcos Zampieri

    Abstract: Code-mixing is a well-studied linguistic phenomenon when two or more languages are mixed in text or speech. Several works have been conducted on building datasets and performing downstream NLP tasks on code-mixed data. Although it is not uncommon to observe code-mixing of three or more languages, most available datasets in this domain contain code-mixed data from only two languages. In this paper,… ▽ More

    Submitted 25 November, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.18023

  14. arXiv:2310.18023  [pdf, other

    cs.CL

    SentMix-3L: A Bangla-English-Hindi Code-Mixed Dataset for Sentiment Analysis

    Authors: Md Nishat Raihan, Dhiman Goswami, Antara Mahmud, Antonios Anastasopoulos, Marcos Zampieri

    Abstract: Code-mixing is a well-studied linguistic phenomenon when two or more languages are mixed in text or speech. Several datasets have been build with the goal of training computational models for code-mixing. Although it is very common to observe code-mixing with multiple languages, most datasets available contain code-mixed between only two languages. In this paper, we introduce SentMix-3L, a novel d… ▽ More

    Submitted 29 November, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

  15. arXiv:2310.00820  [pdf, other

    cs.LG

    Determining the Optimal Number of Clusters for Time Series Datasets with Symbolic Pattern Forest

    Authors: Md Nishat Raihan

    Abstract: Clustering algorithms are among the most widely used data mining methods due to their exploratory power and being an initial preprocessing step that paves the way for other techniques. But the problem of calculating the optimal number of clusters (say k) is one of the significant challenges for such methods. The most widely used clustering algorithms like k-means and k-shape in time series data mi… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  16. arXiv:2309.10272  [pdf, other

    cs.CL

    Mixed-Distil-BERT: Code-mixed Language Modeling for Bangla, English, and Hindi

    Authors: Md Nishat Raihan, Dhiman Goswami, Antara Mahmud

    Abstract: One of the most popular downstream tasks in the field of Natural Language Processing is text classification. Text classification tasks have become more daunting when the texts are code-mixed. Though they are not exposed to such text during pre-training, different BERT models have demonstrated success in tackling Code-Mixed NLP challenges. Again, in order to enhance their performance, Code-Mixed NL… ▽ More

    Submitted 14 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

  17. arXiv:1905.08545  [pdf, ps, other

    cs.CV eess.IV

    Contrast Enhancement of Medical X-Ray Image Using Morphological Operators with Optimal Structuring Element

    Authors: Rafsanjany Kushol, Md. Nishat Raihan, Md Sirajus Salekin, A. B. M. Ashikur Rahman

    Abstract: To guide surgical and medical treatment X-ray images have been used by physicians in every modern healthcare organization and hospitals. Doctor's evaluation process and disease identification in the area of skeletal system can be performed in a faster and efficient way with the help of X-ray imaging technique as they can depict bone structure painlessly. This paper presents an efficient contrast e… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

    Comments: 5 pages, 4 figures, conference paper