Skip to main content

Showing 1–5 of 5 results for author: Balouchzahi, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.05365  [pdf, other

    cs.CL

    NLP Progress in Indigenous Latin American Languages

    Authors: Atnafu Lambebo Tonja, Fazlourrahman Balouchzahi, Sabur Butt, Olga Kolesnikova, Hector Ceballos, Alexander Gelbukh, Thamar Solorio

    Abstract: The paper focuses on the marginalization of indigenous language communities in the face of rapid technological advancements. We highlight the cultural richness of these languages and the risk they face of being overlooked in the realm of Natural Language Processing (NLP). We aim to bridge the gap between these communities and researchers, emphasizing the need for inclusive technological advancemen… ▽ More

    Submitted 12 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL 2024

  2. arXiv:2401.16541  [pdf, other

    cs.CL cs.AI

    GuReT: Distinguishing Guilt and Regret related Text

    Authors: Sabur Butt, Fazlourrahman Balouchzahi, Abdul Gafar Manuel Meque, Maaz Amjad, Hector G. Ceballos Cancino, Grigori Sidorov, Alexander Gelbukh

    Abstract: The intricate relationship between human decision-making and emotions, particularly guilt and regret, has significant implications on behavior and well-being. Yet, these emotions subtle distinctions and interplay are often overlooked in computational models. This paper introduces a dataset tailored to dissect the relationship between guilt and regret and their unique textual markers, filling a not… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  3. arXiv:2212.07549  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    ReDDIT: Regret Detection and Domain Identification from Text

    Authors: Fazlourrahman Balouchzahi, Sabur Butt, Grigori Sidorov, Alexander Gelbukh

    Abstract: In this paper, we present a study of regret and its expression on social media platforms. Specifically, we present a novel dataset of Reddit texts that have been classified into three classes: Regret by Action, Regret by Inaction, and No Regret. We then use this dataset to investigate the language used to express regret on Reddit and to identify the domains of text that are most commonly associate… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

  4. arXiv:2211.09847  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    CoLI-Machine Learning Approaches for Code-mixed Language Identification at the Word Level in Kannada-English Texts

    Authors: H. L. Shashirekha, F. Balouchzahi, M. D. Anusha, G. Sidorov

    Abstract: The task of automatically identifying a language used in a given text is called Language Identification (LI). India is a multilingual country and many Indians especially youths are comfortable with Hindi and English, in addition to their local languages. Hence, they often use more than one language to post their comments on social media. Texts containing more than one language are called "code-mix… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  5. arXiv:2210.14136  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    PolyHope: Two-Level Hope Speech Detection from Tweets

    Authors: Fazlourrahman Balouchzahi, Grigori Sidorov, Alexander Gelbukh

    Abstract: Hope is characterized as openness of spirit toward the future, a desire, expectation, and wish for something to happen or to be true that remarkably affects human's state of mind, emotions, behaviors, and decisions. Hope is usually associated with concepts of desired expectations and possibility/probability concerning the future. Despite its importance, hope has rarely been studied as a social med… ▽ More

    Submitted 3 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: 20 pages, 9 figures