Skip to main content

Showing 1–7 of 7 results for author: Rashid, M M O

.
  1. arXiv:2312.05467  [pdf

    cs.CL

    Textual Toxicity in Social Media: Understanding the Bangla Toxic Language Expressed in Facebook Comment

    Authors: Mohammad Mamun Or Rashid

    Abstract: Social Media is a repository of digital literature including user-generated content. The users of social media are expressing their opinion with diverse mediums such as text, emojis, memes, and also through other visual and textual mediums. A major portion of these media elements could be treated as harmful to others and they are known by many words including Cyberbullying and Toxic Language . The… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  2. arXiv:2311.03078  [pdf

    cs.CL

    BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer

    Authors: Sadia Afrin, Md. Shahad Mahmud Chowdhury, Md. Ekramul Islam, Faisal Ahamed Khan, Labib Imam Chowdhury, MD. Motahar Mahtab, Nazifa Nuha Chowdhury, Massud Forkan, Neelima Kundu, Hakim Arif, Mohammad Mamun Or Rashid, Mohammad Ruhul Amin, Nabeel Mohammed

    Abstract: Lemmatization holds significance in both natural language processing (NLP) and linguistics, as it effectively decreases data density and aids in comprehending contextual meaning. However, due to the highly inflected nature and morphological richness, lemmatization in Bangla text poses a complex challenge. In this study, we propose linguistic rules for lemmatization and utilize a dictionary along w… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  3. arXiv:2310.14348  [pdf, other

    cs.MA stat.ML

    DePAint: A Decentralized Safe Multi-Agent Reinforcement Learning Algorithm considering Peak and Average Constraints

    Authors: Raheeb Hassan, K. M. Shadman Wadith, Md. Mamun or Rashid, Md. Mosaddek Khan

    Abstract: The domain of safe multi-agent reinforcement learning (MARL), despite its potential applications in areas ranging from drone delivery and vehicle automation to the development of zero-energy communities, remains relatively unexplored. The primary challenge involves training agents to learn optimal policies that maximize rewards while adhering to stringent safety constraints, all without the oversi… ▽ More

    Submitted 3 April, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: accepted for publication in Springer Applied Intelligence Journal

  4. SentiGOLD: A Large Bangla Gold Standard Multi-Domain Sentiment Analysis Dataset and its Evaluation

    Authors: Md. Ekramul Islam, Labib Chowdhury, Faisal Ahamed Khan, Shazzad Hossain, Sourave Hossain, Mohammad Mamun Or Rashid, Nabeel Mohammed, Mohammad Ruhul Amin

    Abstract: This study introduces SentiGOLD, a Bangla multi-domain sentiment analysis dataset. Comprising 70,000 samples, it was created from diverse sources and annotated by a gender-balanced team of linguists. SentiGOLD adheres to established linguistic conventions agreed upon by the Government of Bangladesh and a Bangla linguistics committee. Unlike English and other languages, Bangla lacks standard sentim… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted in KDD 2023 Applied Data Science Track; 12 pages, 14 figures

  5. arXiv:2306.01743  [pdf

    cs.CL

    Unicode Normalization and Grapheme Parsing of Indic Languages

    Authors: Nazmuddoha Ansary, Quazi Adibur Rahman Adib, Tahsin Reasat, Asif Shahriyar Sushmit, Ahmed Imtiaz Humayun, Sazia Mehnaz, Kanij Fatema, Mohammad Mamun Or Rashid, Farig Sadeque

    Abstract: Writing systems of Indic languages have orthographic syllables, also known as complex graphemes, as unique horizontal units. A prominent feature of these languages is these complex grapheme units that comprise consonants/consonant conjuncts, vowel diacritics, and consonant diacritics, which, together make a unique Language. Unicode-based writing schemes of these languages often disregard this feat… ▽ More

    Submitted 27 May, 2024; v1 submitted 11 May, 2023; originally announced June 2023.

    Comments: Published at LREC-COLING 2024

  6. arXiv:2304.03682  [pdf, other

    cs.CL

    BenCoref: A Multi-Domain Dataset of Nominal Phrases and Pronominal Reference Annotations

    Authors: Shadman Rohan, Mojammel Hossain, Mohammad Mamun Or Rashid, Nabeel Mohammed

    Abstract: Coreference Resolution is a well studied problem in NLP. While widely studied for English and other resource-rich languages, research on coreference resolution in Bengali largely remains unexplored due to the absence of relevant datasets. Bengali, being a low-resource language, exhibits greater morphological richness compared to English. In this article, we introduce a new dataset, BenCoref, compr… ▽ More

    Submitted 3 July, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

  7. arXiv:1605.03283  [pdf

    cs.DC

    Implementation of the open source virtualization technologies in cloud computing

    Authors: Mohammad Mamun Or Rashid, M. Masud Rana, Jugal Krishna Das

    Abstract: The Virtualization and Cloud Computing is a recent buzzword in the digital world. Cloud computing provide IT as a service to the users on demand basis. This service has greater flexibility, availability, reliability and scalability with utility computing model. This new concept of computing has an immense potential in it to be used in the field of e-governance and in the overall IT development per… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: 19 pages