Skip to main content

Showing 1–50 of 96 results for author: Choudhury, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12702  [pdf, other

    cs.CL

    [WIP] Jailbreak Paradox: The Achilles' Heel of LLMs

    Authors: Abhinav Rao, Monojit Choudhury, Somak Aditya

    Abstract: We introduce two paradoxes concerning jailbreak of foundation models: First, it is impossible to construct a perfect jailbreak classifier, and second, a weaker model cannot consistently detect whether a stronger (in a pareto-dominant sense) model is jailbroken or not. We provide formal proofs for these paradoxes and a short case study on Llama and GPT4-o to demonstrate this. We discuss broader the… ▽ More

    Submitted 20 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.11661  [pdf, other

    cs.CL

    Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting

    Authors: Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury

    Abstract: Socio-demographic prompting is a commonly employed approach to study cultural biases in LLMs as well as for aligning models to certain cultures. In this paper, we systematically probe four LLMs (Llama 3, Mistral v0.2, GPT-3.5 Turbo and GPT-4) with prompts that are conditioned on culturally sensitive and non-sensitive cues, on datasets that are supposed to be culturally sensitive (EtiCor and CALI)… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.00314  [pdf, other

    cs.CL cs.AI cs.LG

    CASE: Efficient Curricular Data Pre-training for Building Assistive Psychology Expert Models

    Authors: Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, TK Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit Sodhi

    Abstract: The limited availability of psychologists necessitates efficient identification of individuals requiring urgent mental healthcare. This study explores the use of Natural Language Processing (NLP) pipelines to analyze text data from online mental health forums used for consultations. By analyzing forum posts, these pipelines can flag users who may require immediate professional attention. A crucial… ▽ More

    Submitted 16 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  4. arXiv:2405.17840  [pdf, other

    cs.CL

    Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents

    Authors: Andrew H. Lee, Sina J. Semnani, Galo Castillo-López, Gäel de Chalendar, Monojit Choudhury, Ashna Dua, Kapil Rajesh Kavitha, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Alexis Lombard, Mehrad Moradshahi, Gihyun Park, Nasredine Semmar, Jiwon Seo, Tianhao Shen, Manish Shrivastava, Deyi Xiong, Monica S. Lam

    Abstract: Creating multilingual task-oriented dialogue (TOD) agents is challenging due to the high cost of training data acquisition. Following the research trend of improving training data efficiency, we show for the first time, that in-context learning is sufficient to tackle multilingual TOD. To handle the challenging dialogue state tracking (DST) subtask, we break it down to simpler steps that are mor… ▽ More

    Submitted 16 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2405.05572  [pdf, other

    cs.CL cs.AI

    From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences

    Authors: Prashant Kodali, Anmol Goel, Likhith Asapu, Vamshi Krishna Bonagiri, Anirudh Govil, Monojit Choudhury, Manish Shrivastava, Ponnurangam Kumaraguru

    Abstract: Current computational approaches for analysing or generating code-mixed sentences do not explicitly model "naturalness" or "acceptability" of code-mixed sentences, but rely on training corpora to reflect distribution of acceptable code-mixed sentences. Modelling human judgement for the acceptability of code-mixed text can help in distinguishing natural code-mixed text and enable quality-controlled… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  6. arXiv:2405.05378  [pdf, other

    cs.CL cs.AI cs.CY cs.HC cs.LG

    "They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations

    Authors: Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanushree Mitra

    Abstract: Large language models (LLMs) have emerged as an integral part of modern societies, powering user-facing applications such as personal assistants and enterprise applications like recruitment tools. Despite their utility, research indicates that LLMs perpetuate systemic biases. Yet, prior works on LLM harms predominantly focus on Western concepts like race and gender, often overlooking cultural conc… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  7. arXiv:2404.18460  [pdf, other

    cs.CL cs.AI

    Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in

    Authors: Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, Monojit Choudhury

    Abstract: Ethical reasoning is a crucial skill for Large Language Models (LLMs). However, moral values are not universal, but rather influenced by language and culture. This paper explores how three prominent LLMs -- GPT-4, ChatGPT, and Llama2-70B-Chat -- perform ethical reasoning in different languages and if their moral judgement depend on the language in which they are prompted. We extend the study of et… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  8. arXiv:2404.14548  [pdf, ps, other

    cs.CY

    Advancing a Consent-Forward Paradigm for Digital Mental Health Data

    Authors: Sachin R. Pendse, Logan Stapleton, Neha Kumar, Munmun De Choudhury, Stevie Chancellor

    Abstract: The field of digital mental health is advancing at a rapid pace. Passively collected data from user engagements with digital tools and services continue to contribute new insights into mental health and illness. As the field of digital mental health grows, a concerning norm has been established -- digital service users are given little say over how their data is collected, shared, or used to gener… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 15 pages with 2 tables

  9. arXiv:2403.15412  [pdf, other

    cs.CY cs.AI cs.CL

    Towards Measuring and Modeling "Culture" in LLMs: A Survey

    Authors: Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury

    Abstract: We present a survey of more than 90 recent papers that aim to study cultural representation and inclusion in large language models (LLMs). We observe that none of the studies explicitly define "culture, which is a complex, multifaceted concept; instead, they probe the models on some specially designed datasets which represent certain aspects of "culture". We call these aspects the proxies of cultu… ▽ More

    Submitted 19 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  10. arXiv:2403.10058  [pdf, other

    cs.CV

    RID-TWIN: An end-to-end pipeline for automatic face de-identification in videos

    Authors: Anirban Mukherjee, Monjoy Narayan Choudhury, Dinesh Babu Jayagopi

    Abstract: Face de-identification in videos is a challenging task in the domain of computer vision, primarily used in privacy-preserving applications. Despite the considerable progress achieved through generative vision models, there remain multiple challenges in the latest approaches. They lack a comprehensive discussion and evaluation of aspects such as realism, temporal coherence, and preservation of non-… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to IEEE ICIP 2024

  11. arXiv:2402.02135  [pdf, other

    cs.CL cs.AI

    Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test

    Authors: Aditi Khandelwal, Utkarsh Agarwal, Kumar Tanmay, Monojit Choudhury

    Abstract: This paper explores the moral judgment and moral reasoning abilities exhibited by Large Language Models (LLMs) across languages through the Defining Issues Test. It is a well known fact that moral judgment depends on the language in which the question is asked. We extend the work of beyond English, to 5 new languages (Chinese, Hindi, Russian, Spanish and Swahili), and probe three LLMs -- ChatGPT,… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024 (main)

  12. arXiv:2401.14362  [pdf, ps, other

    cs.HC cs.AI cs.CY

    The Ty** Cure: Experiences with Large Language Model Chatbots for Mental Health Support

    Authors: Inhwa Song, Sachin R. Pendse, Neha Kumar, Munmun De Choudhury

    Abstract: People experiencing severe distress increasingly use Large Language Model (LLM) chatbots as mental health support tools. Discussions on social media have described how engagements were lifesaving for some, but evidence suggests that general-purpose LLM chatbots also have notable risks that could endanger the welfare of users if not designed responsibly. In this study, we investigate the lived expe… ▽ More

    Submitted 6 March, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: The first two authors contributed equally to this work; typos corrected

  13. arXiv:2312.08800  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Evaluating Large Language Models for Health-related Queries with Presuppositions

    Authors: Navreet Kaur, Monojit Choudhury, Danish Pruthi

    Abstract: As corporations rush to integrate large language models (LLMs) to their search offerings, it is critical that they provide factually accurate information that is robust to any presuppositions that a user may express. In this work, we introduce UPHILL, a dataset consisting of health-related queries with varying degrees of presuppositions. Using UPHILL, we evaluate the factual accuracy and consisten… ▽ More

    Submitted 6 June, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Findings of ACL 2024

  14. arXiv:2311.14693  [pdf, other

    cs.CL cs.AI cs.CY cs.HC

    Benefits and Harms of Large Language Models in Digital Mental Health

    Authors: Munmun De Choudhury, Sachin R. Pendse, Neha Kumar

    Abstract: The past decade has been transformative for mental health research and practice. The ability to harness large repositories of data, whether from electronic health records (EHR), mobile devices, or social media, has revealed a potential for valuable insights into patient experiences, promising early, proactive interventions, as well as personalized treatment plans. Recent developments in generative… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  15. arXiv:2311.04262  [pdf, other

    cs.CV cs.AI cs.DL cs.LG

    ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations

    Authors: Muntabir Hasan Choudhury, Lamia Salsabil, William A. Ingram, Edward A. Fox, Jian Wu

    Abstract: Electronic theses and dissertations (ETDs) have been proposed, advocated, and generated for more than 25 years. Although ETDs are hosted by commercial or institutional digital library repositories, they are still an understudied type of scholarly big data, partially because they are usually longer than conference proceedings and journals. Segmenting ETDs will allow researchers to study sectional c… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures, accepted to Innovative Applications of Artificial Intelligence (IAAI-24)

  16. arXiv:2310.13132  [pdf, other

    cs.CL cs.AI

    Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries

    Authors: Yiqiao **, Mohit Chandra, Gaurav Verma, Yibo Hu, Munmun De Choudhury, Srijan Kumar

    Abstract: Large language models (LLMs) are transforming the ways the general public accesses and consumes information. Their influence is particularly pronounced in pivotal sectors like healthcare, where lay individuals are increasingly appropriating LLMs as conversational agents for everyday queries. While LLMs demonstrate impressive language understanding and generation proficiencies, concerns regarding t… ▽ More

    Submitted 23 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 18 pages, 7 figures

  17. arXiv:2310.10928  [pdf, ps, other

    cs.HC cs.AI cs.LG

    Using Audio Data to Facilitate Depression Risk Assessment in Primary Health Care

    Authors: Adam Valen Levinson, Abhay Goyal, Roger Ho Chun Man, Roy Ka-Wei Lee, Koustuv Saha, Nimay Parekh, Frederick L. Altice, Lam Yin Cheung, Munmun De Choudhury, Navin Kumar

    Abstract: Telehealth is a valuable tool for primary health care (PHC), where depression is a common condition. PHC is the first point of contact for most people with depression, but about 25% of diagnoses made by PHC physicians are inaccurate. Many other barriers also hinder depression detection and treatment in PHC. Artificial intelligence (AI) may help reduce depression misdiagnosis in PHC and improve ove… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  18. arXiv:2310.08483  [pdf, other

    cs.CL cs.SI

    Understanding the Humans Behind Online Misinformation: An Observational Study Through the Lens of the COVID-19 Pandemic

    Authors: Mohit Chandra, Anush Mattapalli, Munmun De Choudhury

    Abstract: The proliferation of online misinformation has emerged as one of the biggest threats to society. Considerable efforts have focused on building misinformation detection models, still the perils of misinformation remain abound. Mitigating online misinformation and its ramifications requires a holistic approach that encompasses not only an understanding of its intricate landscape in relation to the c… ▽ More

    Submitted 18 January, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  19. arXiv:2310.07251  [pdf, other

    cs.CL cs.AI

    Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs

    Authors: Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury

    Abstract: In this position paper, we argue that instead of morally aligning LLMs to specific set of ethical principles, we should infuse generic ethical reasoning capabilities into them so that they can handle value pluralism at a global scale. When provided with an ethical policy, an LLM should be capable of making decisions that are ethically consistent to the policy. We develop a framework that integrate… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  20. arXiv:2309.13356  [pdf, other

    cs.CL cs.AI

    Probing the Moral Development of Large Language Models through Defining Issues Test

    Authors: Kumar Tanmay, Aditi Khandelwal, Utkarsh Agarwal, Monojit Choudhury

    Abstract: In this study, we measure the moral reasoning ability of LLMs using the Defining Issues Test - a psychometric instrument developed for measuring the moral development stage of a person according to the Kohlberg's Cognitive Moral Development Model. DIT uses moral dilemmas followed by a set of ethical considerations that the respondent has to judge for importance in resolving the dilemma, and then r… ▽ More

    Submitted 7 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: First three authors contributed equally

  21. arXiv:2309.07462  [pdf, other

    cs.CL

    Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?

    Authors: Rishav Hada, Varun Gumma, Adrian de Wynter, Harshita Diddee, Mohamed Ahmed, Monojit Choudhury, Kalika Bali, Sunayana Sitaram

    Abstract: Large Language Models (LLMs) excel in various Natural Language Processing (NLP) tasks, yet their evaluation, particularly in languages beyond the top $20$, remains inadequate due to existing benchmarks and metrics limitations. Employing LLMs as evaluators to rank or score other models' outputs emerges as a viable solution, addressing the constraints tied to human annotators and established benchma… ▽ More

    Submitted 13 February, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted to EACL 2024 findings

  22. arXiv:2307.12402  [pdf, ps, other

    cs.CL

    ChatGPT and Bard Responses to Polarizing Questions

    Authors: Abhay Goyal, Muhammad Siddique, Nimay Parekh, Zach Schwitzky, Clara Broekaert, Connor Michelotti, Allie Wong, Lam Yin Cheung, Robin O Hanlon, Lam Yin Cheung, Munmun De Choudhury, Roy Ka-Wei Lee, Navin Kumar

    Abstract: Recent developments in natural language processing have demonstrated the potential of large language models (LLMs) to improve a range of educational and learning outcomes. Of recent chatbots based on LLMs, ChatGPT and Bard have made it clear that artificial intelligence (AI) technology will have significant implications on the way we obtain and search for information. However, these tools sometime… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  23. arXiv:2306.17674  [pdf, other

    cs.CL

    X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents

    Authors: Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaël de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam

    Abstract: Task-oriented dialogue research has mainly focused on a few popular languages like English and Chinese, due to the high dataset creation cost for a new language. To reduce the cost, we apply manual editing to automatically translated data. We create a new multilingual benchmark, X-RiSAWOZ, by translating the Chinese RiSAWOZ to 4 languages: English, French, Hindi, Korean; and a code-mixed English-H… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL 2023 Findings

  24. arXiv:2306.03316  [pdf, other

    cs.CL

    CoSiNES: Contrastive Siamese Network for Entity Standardization

    Authors: Jiaqing Yuan, Michele Merler, Mihir Choudhury, Raju Pavuluri, Munindar P. Singh, Maja Vukovic

    Abstract: Entity standardization maps noisy mentions from free-form text to standard entities in a knowledge base. The unique challenge of this task relative to other entity-related tasks is the lack of surrounding context and numerous variations in the surface form of the mentions, especially when it comes to generalization across domains where labeled data is scarce. Previous research mostly focuses on de… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by Matching Workshop at ACL2023

  25. arXiv:2305.14965  [pdf, other

    cs.CL

    Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks

    Authors: Abhinav Rao, Sachin Vashistha, Atharva Naik, Somak Aditya, Monojit Choudhury

    Abstract: Recent explorations with commercial Large Language Models (LLMs) have shown that non-expert users can jailbreak LLMs by simply manipulating their prompts; resulting in degenerate output behavior, privacy and security breaches, offensive outputs, and violations of content regulator policies. Limited studies have been conducted to formalize and analyze these attacks and their mitigations. We bridge… ▽ More

    Submitted 27 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted at LREC-COLING 2024 - The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation

  26. arXiv:2305.14288  [pdf, other

    cs.CL

    LLM-powered Data Augmentation for Enhanced Cross-lingual Performance

    Authors: Chenxi Whitehouse, Monojit Choudhury, Alham Fikri Aji

    Abstract: This paper explores the potential of leveraging Large Language Models (LLMs) for data augmentation in multilingual commonsense reasoning datasets where the available training data is extremely limited. To achieve this, we utilise several LLMs, namely Dolly-v2, StableVicuna, ChatGPT, and GPT-4, to augment three datasets: XCOPA, XWinograd, and XStoryCloze. Subsequently, we evaluate the effectiveness… ▽ More

    Submitted 22 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Main Conference

  27. arXiv:2305.14218  [pdf, other

    cs.CV cs.AI

    DUBLIN -- Document Understanding By Language-Image Network

    Authors: Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Mohammed Khan, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary

    Abstract: Visual document understanding is a complex task that involves analyzing both the text and the visual elements in document images. Existing models often rely on manual feature engineering or domain-specific pipelines, which limit their generalization ability across different document types and languages. In this paper, we propose DUBLIN, which is pretrained on web pages using three novel objectives… ▽ More

    Submitted 27 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    ACM Class: F.2.2; I.2.7

  28. arXiv:2304.10439  [pdf, other

    cs.CV

    A Study on Reproducibility and Replicability of Table Structure Recognition Methods

    Authors: Kehinde Ajayi, Muntabir Hasan Choudhury, Sarah Rajtmajer, Jian Wu

    Abstract: Concerns about reproducibility in artificial intelligence (AI) have emerged, as researchers have reported unsuccessful attempts to directly reproduce published findings in the field. Replicability, the ability to affirm a finding using the same procedures on new data, has not been well studied. In this paper, we examine both reproducibility and replicability of a corpus of 16 papers on table struc… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 10 pages, 5 figures

  29. arXiv:2304.07417  [pdf, other

    cs.SI cs.HC

    Understanding and Mitigating Mental Health Misinformation on Video Sharing Platforms

    Authors: Viet Cuong Nguyen, Michael Birnbaum, Munmun De Choudhury

    Abstract: Despite the ever-strong demand for mental health care globally, access to traditional mental health services remains severely limited expensive, and stifled by stigma and systemic barriers. Thus, over the last few years, young people are increasingly turning to content on video-sharing platforms (VSPs) like TikTok and YouTube to help them navigate their mental health journey. However, navigating t… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 5 pages, 1 figure

  30. arXiv:2303.17661  [pdf, other

    cs.DL cs.AI cs.LG

    MetaEnhance: Metadata Quality Improvement for Electronic Theses and Dissertations of University Libraries

    Authors: Muntabir Hasan Choudhury, Lamia Salsabil, Himarsha R. Jayanetti, Jian Wu, William A. Ingram, Edward A. Fox

    Abstract: Metadata quality is crucial for digital objects to be discovered through digital library interfaces. However, due to various reasons, the metadata of digital objects often exhibits incomplete, inconsistent, and incorrect values. We investigate methods to automatically detect, correct, and canonicalize scholarly metadata, using seven key fields of electronic theses and dissertations (ETDs) as a cas… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 7 pages, 3 tables, and 1 figure. Accepted by 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL '23) as a short paper

  31. arXiv:2303.02357  [pdf, other

    cs.CL cs.AI

    DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer

    Authors: Shanu Kumar, Abbaraju Soujanya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

    Abstract: Zero-shot cross-lingual transfer is promising, however has been shown to be sub-optimal, with inferior transfer performance across low-resource languages. In this work, we envision languages as domains for improving zero-shot transfer by jointly reducing the feature incongruity between the source and the target language and increasing the generalization capabilities of pre-trained multilingual tra… ▽ More

    Submitted 4 March, 2023; originally announced March 2023.

    Comments: Accepted at EACL 2023

  32. arXiv:2302.12578  [pdf, other

    cs.CL cs.CY cs.LG

    Fairness in Language Models Beyond English: Gaps and Challenges

    Authors: Krithika Ramesh, Sunayana Sitaram, Monojit Choudhury

    Abstract: With language models becoming increasingly ubiquitous, it has become essential to address their inequitable treatment of diverse demographic groups and factors. Most research on evaluating and mitigating fairness harms has been concentrated on English, while multilingual models and non-English languages have received comparatively little attention. This paper presents a survey of fairness in multi… ▽ More

    Submitted 28 February, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted to EACL 2023 (Findings)

  33. arXiv:2302.12388  [pdf, other

    cs.LG

    TrafFormer: A Transformer Model for Predicting Long-term Traffic

    Authors: David Alexander Tedjopurnomo, Farhana M. Choudhury, A. K. Qin

    Abstract: Traffic prediction is a flourishing research field due to its importance in human mobility in the urban space. Despite this, existing studies only focus on short-term prediction of up to few hours in advance, with most being up to one hour only. Long-term traffic prediction can enable more comprehensive, informed, and proactive measures against traffic congestion and is therefore an important task… ▽ More

    Submitted 2 March, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 14 pages, 6 figures

    MSC Class: ACM-class: I.2.1

  34. arXiv:2302.00799  [pdf, other

    cs.HC cs.AI

    Charting the Sociotechnical Gap in Explainable AI: A Framework to Address the Gap in XAI

    Authors: Upol Ehsan, Koustuv Saha, Munmun De Choudhury, Mark O. Riedl

    Abstract: Explainable AI (XAI) systems are sociotechnical in nature; thus, they are subject to the sociotechnical gap--divide between the technical affordances and the social needs. However, charting this gap is challenging. In the context of XAI, we argue that charting the gap improves our problem understanding, which can reflexively provide actionable insights to improve explainability. Utilizing two case… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: Published at ACM CSCW 2023

    Journal ref: ACM CSCW 2023

  35. arXiv:2210.15184  [pdf, other

    cs.CL

    Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Develo** Lightweight Low-Resource MT Models

    Authors: Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali

    Abstract: Leveraging shared learning through Massively Multilingual Models, state-of-the-art machine translation models are often able to adapt to the paucity of data for low-resource languages. However, this performance comes at the cost of significantly bloated models which are not practically deployable. Knowledge Distillation is one popular technique to develop competitive, lightweight models: In this w… ▽ More

    Submitted 9 November, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 16 Pages, 7 Figures, Accepted to WMT 2022 (Research Track)

  36. arXiv:2210.12265  [pdf, other

    cs.CL

    On the Calibration of Massively Multilingual Language Models

    Authors: Kabir Ahuja, Sunayana Sitaram, Sandipan Dandapat, Monojit Choudhury

    Abstract: Massively Multilingual Language Models (MMLMs) have recently gained popularity due to their surprising effectiveness in cross-lingual transfer. While there has been much work in evaluating these models for their performance on a variety of tasks and languages, little attention has been paid on how well calibrated these models are with respect to the confidence in their predictions. We first invest… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  37. arXiv:2208.14641  [pdf, other

    cs.CL cs.AI

    Generating Intermediate Steps for NLI with Next-Step Supervision

    Authors: Deepanway Ghosal, Somak Aditya, Monojit Choudhury

    Abstract: The Natural Language Inference (NLI) task often requires reasoning over multiple steps to reach the conclusion. While the necessity of generating such intermediate steps (instead of a summary explanation) has gained popular support, it is unclear how to generate such steps without complete end-to-end supervision and how such generated steps can be further utilized. In this work, we train a sequenc… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

  38. arXiv:2206.15010  [pdf, other

    cs.CL

    "Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer

    Authors: Shanu Kumar, Sandipan Dandapat, Monojit Choudhury

    Abstract: Few-shot transfer often shows substantial gain over zero-shot transfer~\cite{lauscher2020zero}, which is a practically useful trade-off between fully supervised and unsupervised learning approaches for multilingual pretrained model-based systems. This paper explores various strategies for selecting data for annotation that can result in a better few-shot transfer. The proposed approaches rely on m… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: NAACL 2022

  39. arXiv:2206.10594  [pdf

    cs.SI

    How is Va** Framed on Online Knowledge Dissemination Platforms?

    Authors: Keyu Chen, Yiwen Shi, Jun Luo, Joyce Jiang, Shweta Yadav, Munmun De Choudhury, Ashiqur R. KhudaBukhsh, Marzieh Babaeianjelodar, Frederick Altice, Navin Kumar

    Abstract: We analyze 1,888 articles and 1,119,453 va** posts to study how va** is framed across multiple knowledge dissemination platforms (Wikipedia, Quora, Medium, Reddit, Stack Exchange, wikiHow). We use various NLP techniques to understand these differences. For example, n-grams, emotion recognition, and question answering results indicate that Medium, Quora, and Stack Exchange are appropriate venue… ▽ More

    Submitted 22 July, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2206.07765, arXiv:2206.09024

  40. arXiv:2206.09024  [pdf

    cs.SI

    Partisan US News Media Representations of Syrian Refugees

    Authors: Keyu Chen, Marzieh Babaeianjelodar, Yiwen Shi, Kamila Janmohamed, Rupak Sarkar, Ingmar Weber, Thomas Davidson, Munmun De Choudhury, Jonathan Huang, Shweta Yadav, Ashique Khudabukhsh, Preslav Ivanov Nakov, Chris Bauch, Orestis Papakyriakopoulos, Kaveh Khoshnood, Navin Kumar

    Abstract: We investigate how representations of Syrian refugees (2011-2021) differ across US partisan news outlets. We analyze 47,388 articles from the online US media about Syrian refugees to detail differences in reporting between left- and right-leaning media. We use various NLP techniques to understand these differences. Our polarization and question answering results indicated that left-leaning media t… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  41. arXiv:2206.07765  [pdf

    cs.SI

    US News and Social Media Framing around Va**

    Authors: Keyu Chen, Marzieh Babaeianjelodar, Yiwen Shi, Rohan Aanegola, Lam Yin Cheung, Preslav Ivanov Nakov, Shweta Yadav, Angus Bancroft, Ashiqur R. KhudaBukhsh, Munmun De Choudhury, Frederick L. Altice, Navin Kumar

    Abstract: In this paper, we investigate how va** is framed differently (2008-2021) between US news and social media. We analyze 15,711 news articles and 1,231,379 Facebook posts about va** to study the differences in framing between media varieties. We use word embeddings to provide two-dimensional visualizations of the semantic changes around va** for news and for social media. We detail that news me… ▽ More

    Submitted 22 July, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

  42. arXiv:2205.09744  [pdf, other

    cs.LG cs.CY cs.MM

    Overcoming Language Disparity in Online Content Classification with Multimodal Learning

    Authors: Gaurav Verma, Rohit Mujumdar, Zijie J. Wang, Munmun De Choudhury, Srijan Kumar

    Abstract: Advances in Natural Language Processing (NLP) have revolutionized the way researchers and practitioners address crucial societal problems. Large language models are now the standard to develop state-of-the-art solutions for text detection and classification tasks. However, the development of advanced computational techniques and resources is disproportionately focused on the English language, side… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at ICWSM 2022 as a full paper

  43. arXiv:2205.06356  [pdf, other

    cs.CL

    Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages

    Authors: Kabir Ahuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

    Abstract: Although recent Massively Multilingual Language Models (MMLMs) like mBERT and XLMR support around 100 languages, most existing multilingual NLP benchmarks provide evaluation data in only a handful of these languages with little linguistic diversity. We argue that this makes the existing practices in multilingual evaluation unreliable and does not provide a full picture of the performance of MMLMs… ▽ More

    Submitted 14 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: NLP Power! Workshop, ACL 2022

  44. arXiv:2205.06350  [pdf, other

    cs.CL

    On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data

    Authors: Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat

    Abstract: Borrowing ideas from {\em Production functions} in micro-economics, in this paper we introduce a framework to systematically evaluate the performance and cost trade-offs between machine-translated and manually-created labelled data for task-specific fine-tuning of massively multilingual language models. We illustrate the effectiveness of our framework through a case-study on the TyDIQA-GoldP datas… ▽ More

    Submitted 14 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  45. arXiv:2205.06130  [pdf, other

    cs.CL

    Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models

    Authors: Kabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury

    Abstract: Massively Multilingual Transformer based Language Models have been observed to be surprisingly effective on zero-shot transfer across languages, though the performance varies from language to language depending on the pivot language(s) used for fine-tuning. In this work, we build upon some of the existing techniques for predicting the zero-shot performance on a task, by modeling it as a multi-task… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  46. arXiv:2204.02790  [pdf, other

    cs.CY cs.CL

    Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?

    Authors: Ishani Mondal, Kabir Ahuja, Mohit Jain, Jacki O Neil, Kalika Bali, Monojit Choudhury

    Abstract: The COVID-19 pandemic has brought out both the best and worst of language technology (LT). On one hand, conversational agents for information dissemination and basic diagnosis have seen widespread use, and arguably, had an important role in combating the pandemic. On the other hand, it has also become clear that such technologies are readily available for a handful of languages, and the vast major… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Under Revision

  47. arXiv:2203.12865  [pdf, other

    cs.CL cs.LG

    Multilingual CheckList: Generation and Evaluation

    Authors: Karthikeyan K, Shaily Bhatt, Pankaj Singh, Somak Aditya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

    Abstract: Multilingual evaluation benchmarks usually contain limited high-resource languages and do not test models for specific linguistic capabilities. CheckList is a template-based evaluation approach that tests models for specific capabilities. The CheckList template creation process requires native speakers, posing a challenge in scaling to hundreds of languages. In this work, we explore multiple appro… ▽ More

    Submitted 11 October, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of AACL-IJCNLP 2022

  48. arXiv:2201.08277  [pdf, other

    cs.CL cs.AI

    NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis

    Authors: Shamsuddeen Hassan Muhammad, David Ifeoluwa Adelani, Sebastian Ruder, Ibrahim Said Ahmad, Idris Abdulmumin, Bello Shehu Bello, Monojit Choudhury, Chris Chinenye Emezue, Saheed Salahudeen Abdullahi, Anuoluwapo Aremu, Alipio Jeorge, Pavel Brazdil

    Abstract: Sentiment analysis is one of the most widely studied applications in NLP, but most work focuses on languages with large amounts of data. We introduce the first large-scale human-annotated Twitter sentiment dataset for the four most widely spoken languages in Nigeria (Hausa, Igbo, Nigerian-Pidgin, and Yorùbá ) consisting of around 30,000 annotated tweets per language (and 14,000 for Nigerian-Pidgin… ▽ More

    Submitted 18 June, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Submitted to LREC 2022, 13 pages, 2 figures

  49. arXiv:2201.03074  [pdf, other

    cs.HC

    A Survey of Passive Sensing in the Workplace

    Authors: Subigya Nepal, Gonzalo J. Martinez, Arvind Pillai, Koustuv Saha, Shayan Mirjafari, Vedant Das Swain, Xuhai Xu, Pino G. Audia, Munmun De Choudhury, Anind K. Dey, Aaron Striegel, Andrew T. Campbell

    Abstract: As emerging technologies increasingly integrate into all facets of our lives, the workplace stands at the forefront of potential transformative changes. A notable development in this realm is the advent of passive sensing technology, designed to enhance both cognitive and physical capabilities by monitoring human behavior. This paper reviews current research on the application of passive sensing t… ▽ More

    Submitted 30 March, 2024; v1 submitted 9 January, 2022; originally announced January 2022.

    Comments: Added references and other minor revisions. Also udated to include relevant works published after 2022

    ACM Class: H.5.0

  50. arXiv:2112.14705  [pdf, other

    cs.RO cs.AI cs.LG

    Lane Change Decision-Making through Deep Reinforcement Learning

    Authors: Mukesh Ghimire, Malobika Roy Choudhury, Guna Sekhar Sai Harsha Lagudu

    Abstract: Due to the complexity and volatility of the traffic environment, decision-making in autonomous driving is a significantly hard problem. In this project, we use a Deep Q-Network, along with rule-based constraints to make lane-changing decision. A safe and efficient lane change behavior may be obtained by combining high-level lateral decision-making with low-level rule-based trajectory monitoring. T… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: 6 pages

    MSC Class: 15-04 ACM Class: I.2.6