Skip to main content

Showing 1–50 of 107 results for author: Chadha, A

.
  1. arXiv:2406.14805  [pdf, other

    cs.CL

    How Well Do LLMs Represent Values Across Cultures? Empirical Analysis of LLM Responses Based on Hofstede Cultural Dimensions

    Authors: Julia Kharchenko, Tanya Roosta, Aman Chadha, Chirag Shah

    Abstract: Large Language Models (LLMs) attempt to imitate human behavior by responding to humans in a way that pleases them, including by adhering to their values. However, humans come from diverse cultures with different values. It is critical to understand whether LLMs showcase different values to the user based on the stereotypical values of a user's known country. We prompt different LLMs with a series… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.12644  [pdf, other

    cs.CL cs.AI

    Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models

    Authors: Devichand Budagam, Sankalp KJ, Ashutosh Kumar, Vinija Jain, Aman Chadha

    Abstract: Assessing the effectiveness of large language models (LLMs) in addressing diverse tasks is essential for comprehending their strengths and weaknesses. Conventional evaluation techniques typically apply a single prompting strategy uniformly across datasets, not considering the varying degrees of task complexity. We introduce the Hierarchical Prompting Taxonomy (HPT), a taxonomy that employs a Hiera… ▽ More

    Submitted 27 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.11402  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating Open Language Models Across Task Types, Application Domains, and Reasoning Types: An In-Depth Experimental Analysis

    Authors: Neelabh Sinha, Vinija Jain, Aman Chadha

    Abstract: The rapid rise of Language Models (LMs) has expanded their use in several applications. Yet, due to constraints of model size, associated cost, or proprietary restrictions, utilizing state-of-the-art (SOTA) LLMs is not always feasible. With open, smaller LMs emerging, more applications can leverage their capabilities, but selecting the right LM can be challenging. This work conducts an in-depth ex… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2406.11109  [pdf, other

    cs.CL cs.AI cs.LG

    Investigating Annotator Bias in Large Language Models for Hate Speech Detection

    Authors: Amit Das, Zheng Zhang, Fatemeh Jamshidi, Vinija Jain, Aman Chadha, Nilanjana Raychawdhary, Mary Sandage, Lauramarie Pope, Gerry Dozier, Cheryl Seals

    Abstract: Data annotation, the practice of assigning descriptive labels to raw data, is pivotal in optimizing the performance of machine learning models. However, it is a resource-intensive process susceptible to biases introduced by annotators. The emergence of sophisticated Large Language Models (LLMs), like ChatGPT presents a unique opportunity to modernize and streamline this complex procedure. While ex… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  5. arXiv:2406.09559  [pdf, other

    cs.CL cs.AI cs.LG

    Decoding the Diversity: A Review of the Indic AI Research Landscape

    Authors: Sankalp KJ, Vinija Jain, Sreyoshi Bhaduri, Tamoghna Roy, Aman Chadha

    Abstract: This review paper provides a comprehensive overview of large language model (LLM) research directions within Indic languages. Indic languages are those spoken in the Indian subcontinent, including India, Pakistan, Bangladesh, Sri Lanka, Nepal, and Bhutan, among others. These languages have a rich cultural and linguistic heritage and are spoken by over 1.5 billion people worldwide. With the tremend… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 27 pages, 1 figure

  6. arXiv:2406.08862  [pdf, other

    cs.LG

    Cognitively Inspired Energy-Based World Models

    Authors: Alexi Gladstone, Ganesh Nanduru, Md Mofijul Islam, Aman Chadha, Jundong Li, Tariq Iqbal

    Abstract: One of the predominant methods for training world models is autoregressive prediction in the output space of the next element of a sequence. In Natural Language Processing (NLP), this takes the form of Large Language Models (LLMs) predicting the next token; in Computer Vision (CV), this takes the form of autoregressive models predicting the next frame/token/pixel. However, this approach differs fr… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 23 pages, 6 figures

  7. arXiv:2406.05344  [pdf, other

    cs.CL

    MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention

    Authors: Prince Jha, Raghav Jain, Konika Mandal, Aman Chadha, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: In the digital world, memes present a unique challenge for content moderation due to their potential to spread harmful content. Although detection methods have improved, proactive solutions such as intervention are still limited, with current research focusing mostly on text-based content, neglecting the widespread influence of multimodal content like memes. Addressing this gap, we present \textit… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  8. arXiv:2405.17927  [pdf, other

    cs.AI cs.CL cs.CV cs.LG eess.AS

    The Evolution of Multimodal Model Architectures

    Authors: Shakti N. Wadekar, Abhishek Chaurasia, Aman Chadha, Eugenio Culurciello

    Abstract: This work uniquely identifies and characterizes four prevalent multimodal model architectural patterns in the contemporary multimodal landscape. Systematically categorizing models by architecture type facilitates monitoring of developments in the multimodal domain. Distinct from recent survey papers that present general information on multimodal architectures, this research conducts a comprehensiv… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 30 pages, 6 tables, 7 figures

  9. arXiv:2405.17475  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    How Culturally Aware are Vision-Language Models?

    Authors: Olena Burda-Lassen, Aman Chadha, Shashank Goswami, Vinija Jain

    Abstract: An image is often said to be worth a thousand words, and certain images can tell rich and insightful stories. Can these stories be told via image captioning? Images from folklore genres, such as mythology, folk dance, cultural signs, and symbols, are vital to every culture. Our research compares the performance of four popular vision-language models (GPT-4V, Gemini Pro Vision, LLaVA, and OpenFlami… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  10. arXiv:2405.15766  [pdf, other

    cs.AI cs.CL cs.CV

    Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development

    Authors: Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Aman Chadha, Samrat Mondal

    Abstract: The mining of adverse drug events (ADEs) is pivotal in pharmacovigilance, enhancing patient safety by identifying potential risks associated with medications, facilitating early detection of adverse events, and guiding regulatory decision-making. Traditional ADE detection methods are reliable but slow, not easily adaptable to large-scale operations, and offer limited information. With the exponent… ▽ More

    Submitted 26 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: ACL Findings 2024

  11. arXiv:2405.13019  [pdf, other

    cs.CL cs.AI

    A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models

    Authors: Mahsa Khoshnoodi, Vinija Jain, Mingye Gao, Malavika Srikanth, Aman Chadha

    Abstract: Despite the crucial importance of accelerating text generation in large language models (LLMs) for efficiently producing content, the sequential nature of this process often leads to high inference latency, posing challenges for real-time applications. Various techniques have been proposed and developed to address these challenges and improve efficiency. This paper presents a comprehensive survey… ▽ More

    Submitted 24 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  12. arXiv:2405.09589  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.SD eess.AS

    Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive Survey

    Authors: Pranab Sahoo, Prabhash Meharia, Akash Ghosh, Sriparna Saha, Vinija Jain, Aman Chadha

    Abstract: The rapid advancement of foundation models (FMs) across language, image, audio, and video domains has shown remarkable capabilities in diverse tasks. However, the proliferation of FMs brings forth a critical challenge: the potential to generate hallucinated outputs, particularly in high-stakes applications. The tendency of foundation models to produce hallucinated content arguably represents the b… ▽ More

    Submitted 20 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  13. arXiv:2404.13506  [pdf, other

    cs.LG cs.AI cs.CL

    Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications

    Authors: Charith Chandra Sai Balne, Sreyoshi Bhaduri, Tamoghna Roy, Vinija Jain, Aman Chadha

    Abstract: The rise of deep learning has marked significant progress in fields such as computer vision, natural language processing, and medical imaging, primarily through the adaptation of pre-trained models for specific tasks. Traditional fine-tuning methods, involving adjustments to all parameters, face challenges due to high computational and memory demands. This has led to the development of Parameter E… ▽ More

    Submitted 23 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  14. arXiv:2404.11036  [pdf, other

    cs.LG cs.CL

    Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement

    Authors: Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

    Abstract: Content moderation faces a challenging task as social media's ability to spread hate speech contrasts with its role in promoting global connectivity. With rapidly evolving slang and hate speech, the adaptability of conventional deep learning to the fluid landscape of online dialogue remains limited. In response, causality inspired disentanglement has shown promise by segregating platform specific… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  15. arXiv:2404.07214  [pdf, other

    cs.CV cs.AI cs.CL

    Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions

    Authors: Akash Ghosh, Arkadeep Acharya, Sriparna Saha, Vinija Jain, Aman Chadha

    Abstract: The advent of Large Language Models (LLMs) has significantly reshaped the trajectory of the AI revolution. Nevertheless, these LLMs exhibit a notable limitation, as they are primarily adept at processing textual information. To address this constraint, researchers have endeavored to integrate visual capabilities with LLMs, resulting in the emergence of Vision-Language Models (VLMs). These advanced… ▽ More

    Submitted 12 April, 2024; v1 submitted 20 February, 2024; originally announced April 2024.

    Comments: The most extensive and up to date Survey on Visual Language Models covering 76 Visual Language Models

  16. arXiv:2403.19113  [pdf, other

    cs.CL cs.AI

    FACTOID: FACtual enTailment fOr hallucInation Detection

    Authors: Vipula Rawte, S. M Towhidul Islam Tonmoy, Krishnav Rajbangshi, Shravani Nag, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: The widespread adoption of Large Language Models (LLMs) has facilitated numerous benefits. However, hallucination is a significant concern. In response, Retrieval Augmented Generation (RAG) has emerged as a highly promising paradigm to improve LLM outputs by grounding them in factual information. RAG relies on textual entailment (TE) or similar methods to check if the text produced by LLMs is supp… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  17. arXiv:2403.18976  [pdf, other

    cs.CL cs.AI

    "Sorry, Come Again?" Prompting -- Enhancing Comprehension and Diminishing Hallucination with [PAUSE]-injected Optimal Paraphrasing

    Authors: Vipula Rawte, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Prachi Priya, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: Hallucination has emerged as the most vulnerable aspect of contemporary Large Language Models (LLMs). In this paper, we introduce the Sorry, Come Again (SCA) prompting, aimed to avoid LLM hallucinations by enhancing comprehension through: (i) optimal paraphrasing and (ii) injecting [PAUSE] tokens to delay LLM generation. First, we provide an in-depth analysis of linguistic nuances: formality, read… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  18. arXiv:2403.16422  [pdf, other

    cs.CV cs.AI

    Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation

    Authors: Sanyam Lakhanpal, Shivang Chopra, Vinija Jain, Aman Chadha, Man Luo

    Abstract: Over the past few years, Text-to-Image (T2I) generation approaches based on diffusion models have gained significant attention. However, vanilla diffusion models often suffer from spelling inaccuracies in the text displayed within the generated images. The capability to generate visual text is crucial, offering both academic interest and a wide range of practical applications. To produce accurate… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  19. arXiv:2403.14633  [pdf, other

    cs.CY cs.AI cs.CL

    Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models

    Authors: Smriti Singh, Shuvam Keshari, Vinija Jain, Aman Chadha

    Abstract: Socioeconomic bias in society exacerbates disparities, influencing access to opportunities and resources based on individuals' economic and social backgrounds. This pervasive issue perpetuates systemic inequalities, hindering the pursuit of inclusive progress as a society. In this paper, we investigate the presence of socioeconomic bias, if any, in large language models. To this end, we introduce… ▽ More

    Submitted 16 April, 2024; v1 submitted 16 February, 2024; originally announced March 2024.

  20. arXiv:2403.09724  [pdf, other

    cs.CL cs.CY cs.LG

    ClaimVer: Explainable Claim-Level Verification and Evidence Attribution of Text Through Knowledge Graphs

    Authors: Preetam Prabhu Srikar Dammu, Himanshu Naidu, Mouly Dewan, YoungMin Kim, Tanya Roosta, Aman Chadha, Chirag Shah

    Abstract: In the midst of widespread misinformation and disinformation through social media and the proliferation of AI-generated texts, it has become increasingly difficult for people to validate and trust information they encounter. Many fact-checking approaches and tools have been developed, but they often lack appropriate explainability or granularity to be useful in various contexts. A text validation… ▽ More

    Submitted 23 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  21. arXiv:2403.04786  [pdf, other

    cs.CR cs.CL

    Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models

    Authors: Arijit Ghosh Chowdhury, Md Mofijul Islam, Vaibhav Kumar, Faysal Hossain Shezan, Vaibhav Kumar, Vinija Jain, Aman Chadha

    Abstract: Large Language Models (LLMs) have become a cornerstone in the field of Natural Language Processing (NLP), offering transformative capabilities in understanding and generating human-like text. However, with their rising prominence, the security and vulnerability aspects of these models have garnered significant attention. This paper presents a comprehensive survey of the various forms of attacks ta… ▽ More

    Submitted 23 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  22. arXiv:2403.02472  [pdf, other

    cs.CL

    OffensiveLang: A Community Based Implicit Offensive Language Dataset

    Authors: Amit Das, Mostafa Rahgouy, Dongji Feng, Zheng Zhang, Tathagata Bhattacharya, Nilanjana Raychawdhary, Fatemeh Jamshidi, Vinija Jain, Aman Chadha, Mary Sandage, Lauramarie Pope, Gerry Dozier, Cheryl Seals

    Abstract: The widespread presence of hateful languages on social media has resulted in adverse effects on societal well-being. As a result, addressing this issue with high priority has become very important. Hate speech or offensive languages exist in both explicit and implicit forms, with the latter being more challenging to detect. Current research in this domain encounters several challenges. Firstly, th… ▽ More

    Submitted 17 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  23. arXiv:2403.02246  [pdf

    cs.CL

    PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

    Authors: Fiona Anting Tan, Gerard Christopher Yeo, Fanyou Wu, Weijie Xu, Vinija Jain, Aman Chadha, Kokil Jaidka, Yang Liu, See-Kiong Ng

    Abstract: Recent advances in large language models (LLMs) demonstrate that their capabilities are comparable, or even superior, to humans in many tasks in natural language processing. Despite this progress, LLMs are still inadequate at social-cognitive reasoning, which humans are naturally good at. Drawing inspiration from psychological research on the links between certain personality traits and Theory-of-… ▽ More

    Submitted 18 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  24. arXiv:2403.01152  [pdf, other

    cs.CL cs.AI

    A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization

    Authors: Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, Huan Liu

    Abstract: We have witnessed lately a rapid proliferation of advanced Large Language Models (LLMs) capable of generating high-quality text. While these LLMs have revolutionized text generation across various domains, they also pose significant risks to the information ecosystem, such as the potential for generating convincing propaganda, misinformation, and disinformation at scale. This paper offers a review… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  25. arXiv:2402.18590  [pdf, ps, other

    cs.IR cs.AI

    Exploring the Impact of Large Language Models on Recommender Systems: An Extensive Review

    Authors: Arpita Vats, Vinija Jain, Rahul Raja, Aman Chadha

    Abstract: The paper underscores the significance of Large Language Models (LLMs) in resha** recommender systems, attributing their value to unique reasoning abilities absent in traditional recommenders. Unlike conventional systems lacking direct user interaction data, LLMs exhibit exceptional proficiency in recommending items, showcasing their adeptness in comprehending intricacies of language. This marks… ▽ More

    Submitted 19 March, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  26. arXiv:2402.18139  [pdf, other

    cs.CL cs.AI

    Cause and Effect: Can Large Language Models Truly Understand Causality?

    Authors: Swagata Ashwani, Kshiteesh Hegde, Nishith Reddy Mannuru, Mayank **dal, Dushyant Singh Sengar, Krishna Chaitanya Rao Kathala, Dishant Banga, Vinija Jain, Aman Chadha

    Abstract: With the rise of Large Language Models(LLMs), it has become crucial to understand their capabilities and limitations in deciphering and explaining the complex web of causal relationships that language entails. Current methods use either explicit or implicit causal reasoning, yet there is a strong need for a unified approach combining both to tackle a wide array of causal relationships more effecti… ▽ More

    Submitted 15 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  27. arXiv:2402.14889  [pdf

    cs.CL cs.AI

    COBIAS: Contextual Reliability in Bias Assessment

    Authors: Priyanshul Govil, Hemang Jain, Vamshi Krishna Bonagiri, Aman Chadha, Ponnurangam Kumaraguru, Manas Gaur, Sanorita Dey

    Abstract: Large Language Models (LLMs) are trained on extensive web corpora, which enable them to understand and generate human-like text. However, this training process also results in inherent biases within the models. These biases arise from web data's diverse and often uncurated nature, containing various stereotypes and prejudices. Previous works on debiasing models rely on benchmark datasets to measur… ▽ More

    Submitted 17 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  28. arXiv:2402.11512  [pdf, other

    cs.CL cs.CY

    From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

    Authors: Aishik Rakshit, Smriti Singh, Shuvam Keshari, Arijit Ghosh Chowdhury, Vinija Jain, Aman Chadha

    Abstract: Embeddings play a pivotal role in the efficacy of Large Language Models. They are the bedrock on which these models grasp contextual relationships and foster a more nuanced understanding of language and consequently perform remarkably on a plethora of complex tasks that require a fundamental understanding of human language. Given that these embeddings themselves often reflect or exhibit bias, it s… ▽ More

    Submitted 16 April, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  29. arXiv:2402.09346  [pdf, other

    cs.AI

    LLMAuditor: A Framework for Auditing Large Language Models Using Human-in-the-Loop

    Authors: Maryam Amirizaniani, Jihan Yao, Adrian Lavergne, Elizabeth Snell Okada, Aman Chadha, Tanya Roosta, Chirag Shah

    Abstract: As Large Language Models (LLMs) become more pervasive across various users and scenarios, identifying potential issues when using these models becomes essential. Examples of such issues include: bias, inconsistencies, and hallucination. Although auditing the LLM for these problems is often warranted, such a process is neither easy nor accessible for most. An effective method is to probe the LLM us… ▽ More

    Submitted 22 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  30. arXiv:2402.09334  [pdf, other

    cs.AI

    AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach

    Authors: Maryam Amirizaniani, Elias Martin, Tanya Roosta, Aman Chadha, Chirag Shah

    Abstract: As Large Language Models (LLMs) are integrated into various sectors, ensuring their reliability and safety is crucial. This necessitates rigorous probing and auditing to maintain their effectiveness and trustworthiness in practical applications. Subjecting LLMs to varied iterations of a single query can unveil potential inconsistencies in their knowledge base or functional capacity. However, a too… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  31. arXiv:2402.07927  [pdf, other

    cs.AI cs.CL cs.HC

    A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications

    Authors: Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Vinija Jain, Samrat Mondal, Aman Chadha

    Abstract: Prompt engineering has emerged as an indispensable technique for extending the capabilities of large language models (LLMs) and vision-language models (VLMs). This approach leverages task-specific instructions, known as prompts, to enhance model efficacy without modifying the core model parameters. Rather than updating the model parameters, prompts allow seamless integration of pre-trained models… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 9 pages, 2 figures

  32. arXiv:2402.04929  [pdf, other

    cs.CV cs.AI cs.LG

    Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation

    Authors: Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha

    Abstract: This paper introduces a novel approach to leverage the generalizability of Diffusion Models for Source-Free Domain Adaptation (DM-SFDA). Our proposed DMSFDA method involves fine-tuning a pre-trained text-to-image diffusion model to generate source domain images using features from the target images to guide the diffusion process. Specifically, the pre-trained diffusion model is fine-tuned to gener… ▽ More

    Submitted 26 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.01701

  33. Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models

    Authors: Chenyang Gao, Brecht Desplanques, Chelsea J. -T. Ju, Aman Chadha, Andreas Stolcke

    Abstract: Automated speaker identification (SID) is a crucial step for the personalization of a wide range of speech-enabled services. Typical SID systems use a symmetric enrollment-verification framework with a single model to derive embeddings both offline for voice profiles extracted from enrollment utterances, and online from runtime utterances. Due to the distinct circumstances of enrollment and runtim… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  34. arXiv:2401.11143  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.SD eess.AS eess.SP

    Gaussian Adaptive Attention is All You Need: Robust Contextual Representations Across Multiple Modalities

    Authors: Georgios Ioannides, Aman Chadha, Aaron Elkins

    Abstract: We propose the Multi-Head Gaussian Adaptive Attention Mechanism (GAAM), a novel probabilistic attention framework, and the Gaussian Adaptive Transformer (GAT), designed to enhance information aggregation across multiple modalities, including Speech, Text and Vision. GAAM integrates learnable mean and variance into its attention mechanism, implemented in a Multi-Headed framework enabling it to coll… ▽ More

    Submitted 30 January, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  35. arXiv:2401.07872  [pdf, other

    cs.CL

    The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey

    Authors: Saurav Pawar, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Aman Chadha, Amitava Das

    Abstract: The advent of Large Language Models (LLMs) represents a notable breakthrough in Natural Language Processing (NLP), contributing to substantial progress in both text comprehension and generation. However, amidst these advancements, it is noteworthy that LLMs often face a limitation in terms of context length extrapolation. Understanding and extending the context length for LLMs is crucial in enhanc… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  36. arXiv:2401.06709  [pdf, other

    cs.CL cs.AI

    Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text

    Authors: Muskan Garg, MSVPJ Sathvik, Amrit Chadha, Shaina Raza, Sunghwan Sohn

    Abstract: The social NLP research community witness a recent surge in the computational advancements of mental health analysis to build responsible AI models for a complex interplay between language use and self-perception. Such responsible AI models aid in quantifying the psychological concepts from user-penned texts on social media. On thinking beyond the low-level (classification) task, we advance the ex… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  37. arXiv:2401.03378  [pdf, other

    cs.DC math.NA

    CG-Kit: Code Generation Toolkit for Performant and Maintainable Variants of Source Code Applied to Flash-X Hydrodynamics Simulations

    Authors: Johann Rudi, Youngjun Lee, Aidan H. Chadha, Mohamed Wahib, Klaus Weide, Jared P. O'Neal, Anshu Dubey

    Abstract: CG-Kit is a new code generation toolkit that we propose as a solution for portability and maintainability for scientific computing applications. The development of CG-Kit is rooted in the urgent need created by the shifting landscape of high-performance computing platforms and the algorithmic complexities of a particular large-scale multiphysics application: Flash-X. This combination leads to uniq… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: submitted

  38. arXiv:2401.01596  [pdf, other

    cs.AI cs.CL

    MedSumm: A Multimodal Approach to Summarizing Code-Mixed Hindi-English Clinical Queries

    Authors: Akash Ghosh, Arkadeep Acharya, Prince Jha, Aniket Gaudgaul, Rajdeep Majumdar, Sriparna Saha, Aman Chadha, Raghav Jain, Setu Sinha, Shivani Agarwal

    Abstract: In the healthcare domain, summarizing medical questions posed by patients is critical for improving doctor-patient interactions and medical decision-making. Although medical data has grown in complexity and quantity, the current body of research in this domain has primarily concentrated on text-based methods, overlooking the integration of visual cues. Also prior works in the area of medical quest… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: ECIR 2024

  39. arXiv:2401.01313  [pdf, other

    cs.CL

    A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

    Authors: S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Anku Rani, Vipula Rawte, Aman Chadha, Amitava Das

    Abstract: As Large Language Models (LLMs) continue to advance in their ability to write human-like text, a key challenge remains around their tendency to hallucinate generating content that appears factual but is ungrounded. This issue of hallucination is arguably the biggest hindrance to safely deploying these powerful LLMs into real-world production systems that impact people's lives. The journey toward w… ▽ More

    Submitted 8 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  40. arXiv:2312.11541  [pdf, other

    cs.AI cs.CL

    CLIPSyntel: CLIP and LLM Synergy for Multimodal Question Summarization in Healthcare

    Authors: Akash Ghosh, Arkadeep Acharya, Raghav Jain, Sriparna Saha, Aman Chadha, Setu Sinha

    Abstract: In the era of modern healthcare, swiftly generating medical question summaries is crucial for informed and timely patient care. Despite the increasing complexity and volume of medical data, existing studies have focused solely on text-based summarization, neglecting the integration of visual information. Recognizing the untapped potential of combining textual queries with visual representations of… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  41. arXiv:2312.07028  [pdf, other

    cs.CL cs.AI

    Dynamic Corrective Self-Distillation for Better Fine-Tuning of Pretrained Models

    Authors: Ibtihel Amara, Vinija Jain, Aman Chadha

    Abstract: We tackle the challenging issue of aggressive fine-tuning encountered during the process of transfer learning of pre-trained language models (PLMs) with limited labeled downstream data. This problem primarily results in a decline in performance on the subsequent task. Inspired by the adaptive boosting method in traditional machine learning, we present an effective dynamic corrective self-distillat… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  42. arXiv:2312.00292  [pdf, other

    cs.CL

    SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection

    Authors: Anku Rani, Dwip Dalal, Shreya Gautam, Pankaj Gupta, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: Deception is the intentional practice of twisting information. It is a nuanced societal practice deeply intertwined with human societal evolution, characterized by a multitude of facets. This research explores the problem of deception through the lens of psychology, employing a framework that categorizes deception into three forms: lies of omission, lies of commission, and lies of influence. The p… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  43. arXiv:2310.09680  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring

    Authors: Ankitha Sudarshan, Vinay Samuel, Parth Patwa, Ibtihel Amara, Aman Chadha

    Abstract: Automatic Speech Recognition (ASR) has witnessed a profound research interest. Recent breakthroughs have given ASR systems different prospects such as faithfully transcribing spoken language, which is a pivotal advancement in building conversational agents. However, there is still an imminent challenge of accurately discerning context-dependent words and phrases. In this work, we propose a novel a… ▽ More

    Submitted 3 March, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

  44. arXiv:2310.07818  [pdf, other

    cs.CL cs.AI

    On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models

    Authors: Thilini Wijesiriwardene, Ruwan Wickramarachchi, Aishwarya Naresh Reganti, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: The ability of Large Language Models (LLMs) to encode syntactic and semantic structures of language is well examined in NLP. Additionally, analogy identification, in the form of word analogies are extensively studied in the last decade of language modeling literature. In this work we specifically look at how LLMs' abilities to capture sentence analogies (sentences that convey analogous meaning to… ▽ More

    Submitted 5 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: To appear in Findings of EACL 2024

  45. arXiv:2310.05280  [pdf, other

    cs.CL cs.AI

    Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems

    Authors: Yixin Wan, Jieyu Zhao, Aman Chadha, Nanyun Peng, Kai-Wei Chang

    Abstract: Recent advancements in Large Language Models empower them to follow freeform instructions, including imitating generic or specific demographic personas in conversations. We define generic personas to represent demographic groups, such as "an Asian person", whereas specific personas may take the form of specific popular Asian names like "Yumi". While the adoption of personas enriches user experienc… ▽ More

    Submitted 2 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  46. arXiv:2310.05030  [pdf, other

    cs.CL cs.AI

    Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index

    Authors: Megha Chakraborty, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Krish Sharma, Niyar R Barman, Chandan Gupta, Shreya Gautam, Tanay Kumar, Vinija Jain, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: With the rise of prolific ChatGPT, the risk and consequences of AI-generated text has increased alarmingly. To address the inevitable question of ownership attribution for AI-generated artifacts, the US Copyright Office released a statement stating that 'If a work's traditional elements of authorship were produced by a machine, the work lacks human authorship and the Office will not register it'.… ▽ More

    Submitted 23 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main

  47. arXiv:2310.04988  [pdf, other

    cs.AI

    The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations

    Authors: Vipula Rawte, Swagata Chakraborty, Agnibh Pathak, Anubhav Sarkar, S. M Towhidul Islam Tonmoy, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: The recent advancements in Large Language Models (LLMs) have garnered widespread acclaim for their remarkable emerging capabilities. However, the issue of hallucination has parallelly emerged as a by-product, posing significant concerns. While some recent endeavors have been made to identify and mitigate different types of hallucination, there has been a limited emphasis on the nuanced categorizat… ▽ More

    Submitted 22 October, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

  48. arXiv:2310.01701   

    cs.CV cs.AI

    Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation

    Authors: Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha

    Abstract: Domain Adaptation (DA) is a method for enhancing a model's performance on a target domain with inadequate annotated data by applying the information the model has acquired from a related source domain with sufficient labeled data. The escalating enforcement of data-privacy regulations like HIPAA, COPPA, FERPA, etc. have sparked a heightened interest in adapting models to novel domains while circum… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Revamped the whole paper; new version will be re-submitted

  49. arXiv:2309.12426  [pdf, other

    cs.CL cs.AI

    Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges

    Authors: Vinay Samuel, Houda Aynaou, Arijit Ghosh Chowdhury, Karthik Venkat Ramanan, Aman Chadha

    Abstract: Large Language Models (LLMs) have demonstrated impressive zero shot performance on a wide range of NLP tasks, demonstrating the ability to reason and apply commonsense. A relevant application is to use them for creating high quality synthetic datasets for downstream tasks. In this work, we probe whether GPT-4 can be used to augment existing extractive reading comprehension datasets. Automating dat… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 5 pages, 1 figure, 3 tables

  50. arXiv:2309.06517  [pdf, other

    cs.CL

    Overview of Memotion 3: Sentiment and Emotion Analysis of Codemixed Hinglish Memes

    Authors: Shreyash Mishra, S Suryavardan, Megha Chakraborty, Parth Patwa, Anku Rani, Aman Chadha, Aishwarya Reganti, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

    Abstract: Analyzing memes on the internet has emerged as a crucial endeavor due to the impact this multi-modal form of content wields in sha** online discourse. Memes have become a powerful tool for expressing emotions and sentiments, possibly even spreading hate and misinformation, through humor and sarcasm. In this paper, we present the overview of the Memotion 3 shared task, as part of the DeFactify 2… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Defactify2 @AAAI 2023