Showing 1–50 of 129 results for author: Sheth, A

  1. arXiv:2407.01644  [pdf, other]

    stat.ML cs.LG

    Evaluating the Role of Data Enrichment Approaches Towards Rare Event Analysis in Manufacturing

    Authors: Chathurangi Shyalika, Ruwan Wickramarachchi, Fadi El Kalach, Ramy Harik, Amit Sheth

    Abstract: Rare events are occurrences that take place with a significantly lower frequency than more common regular events. In manufacturing, predicting such events is particularly important, as they lead to unplanned downtime, shortening equipment lifespan, and high energy consumption. The occurrence of events is considered frequently-rare if observed in more than 10% of all instances, very-rare if it is 1…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 27 pages, 11 figures, 16 tables

  2. arXiv:2406.15573  [pdf, other]

    stat.ME stat.CO

    Sparse Bayesian multidimensional scaling(s)

    Authors: Ami Sheth, Aaron Smith, Andrew J. Holbrook

    Abstract: Bayesian multidimensional scaling (BMDS) is a probabilistic dimension reduction tool that allows one to model and visualize data consisting of dissimilarities between pairs of objects. Although BMDS has proven useful within, e.g., Bayesian phylogenetic inference, its likelihood and gradient calculations require a burdensome order of $N^2$ floating-point operations, where $N$ is the number of data…

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.13856  [pdf, other]

    cs.DB

    Kishu: Time-Traveling for Computational Notebooks

    Authors: Zhaoheng Li, Supawit Chockchowwat, Ribhav Sahu, Areet Sheth, Yongjoo Park

    Abstract: Computational notebooks (e.g., Jupyter, Google Colab) are widely used by data scientists. A key feature of notebooks is the interactive computing model of iteratively executing cells (i.e., a set of statements) and observing the result (e.g., model or plot). Unfortunately, existing notebook systems do not offer time-traveling to past states: when the user executes a cell, the notebook session stat…

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.02598  [pdf, other]

    cs.LG cs.AI

    Towards Learning Foundation Models for Heuristic Functions to Solve Pathfinding Problems

    Authors: Vedant Khandelwal, Amit Sheth, Forest Agostinelli

    Abstract: Pathfinding problems are found throughout robotics, computational science, and natural sciences. Traditional methods to solve these require training deep neural networks (DNNs) for each new problem domain, consuming substantial time and resources. This study introduces a novel foundation model, leveraging deep reinforcement learning to train heuristic functions that seamlessly adapt to new domains…

    Submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2405.02327  [pdf, other]

    cs.AI

    CausalDisco: Causal discovery using knowledge graph link prediction

    Authors: Utkarshani Jaimini, Cory Henson, Amit P. Sheth

    Abstract: Causal discovery is a process of discovering new causal relations from observational data. Traditional causal discovery methods often suffer from issues related to missing data. To address these issues, this paper presents a novel approach called CausalDisco that formulates causal discovery as a knowledge graph completion problem. More specifically, the task of discovering causal relations is mappe…

    Submitted 23 April, 2024; originally announced May 2024.

    Comments: 9 pages, 8 figures

  6. arXiv:2405.02228  [pdf, other]

    cs.CL cs.AI cs.IR

    REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs

    Authors: Deepa Tilwani, Yash Saxena, Ali Mohammadi, Edward Raff, Amit Sheth, Srinivasan Parthasarathy, Manas Gaur

    Abstract: Automatic citation generation for sentences in a document or report is paramount for intelligence analysts, cybersecurity, news agencies, and education personnel. In this research, we investigate whether large language models (LLMs) are capable of generating references based on two forms of sentence queries: (a) Direct Queries, LLMs are asked to provide author names of the given research article,…

    Submitted 8 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: Work in progress

  7. arXiv:2405.01512  [pdf, ps, other]

    math.NT

    Euler Products at the Centre and Applications to Chebyshev's Bias

    Authors: Arshay Sheth

    Abstract: Let $π$ be an irreducible cuspidal automorphic representation of $\text{GL}_n(\mathbb A_\mathbb Q)$ with associated $L$-function $L(s, π)$. We study the behaviour of the partial Euler product of $L(s, π)$ at the center of the critical strip. Under the assumption of the Generalized Riemann Hypothesis for $L(s, π)$ and assuming the Ramanujan--Petersson conjecture when necessary, we establish an asym…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 17 pages, comments welcome!

  8. arXiv:2403.19113  [pdf, other]

    cs.CL cs.AI

    FACTOID: FACtual enTailment fOr hallucInation Detection

    Authors: Vipula Rawte, S. M Towhidul Islam Tonmoy, Krishnav Rajbangshi, Shravani Nag, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: The widespread adoption of Large Language Models (LLMs) has facilitated numerous benefits. However, hallucination is a significant concern. In response, Retrieval Augmented Generation (RAG) has emerged as a highly promising paradigm to improve LLM outputs by grounding them in factual information. RAG relies on textual entailment (TE) or similar methods to check if the text produced by LLMs is supp…

    Submitted 27 March, 2024; originally announced March 2024.

  9. arXiv:2403.18976  [pdf, other]

    cs.CL cs.AI

    "Sorry, Come Again?" Prompting -- Enhancing Comprehension and Diminishing Hallucination with [PAUSE]-injected Optimal Paraphrasing

    Authors: Vipula Rawte, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Prachi Priya, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: Hallucination has emerged as the most vulnerable aspect of contemporary Large Language Models (LLMs). In this paper, we introduce the Sorry, Come Again (SCA) prompting, aimed to avoid LLM hallucinations by enhancing comprehension through: (i) optimal paraphrasing and (ii) injecting [PAUSE] tokens to delay LLM generation. First, we provide an in-depth analysis of linguistic nuances: formality, read…

    Submitted 27 March, 2024; originally announced March 2024.

  10. arXiv:2403.17306  [pdf, other]

    cs.AI

    Visual Hallucination: Definition, Quantification, and Prescriptive Remediations

    Authors: Anku Rani, Vipula Rawte, Harshad Sharma, Neeraj Anand, Krishnav Rajbangshi, Amit Sheth, Amitava Das

    Abstract: The troubling rise of hallucination presents perhaps the most significant impediment to the advancement of responsible AI. In recent times, considerable research has focused on detecting and mitigating hallucination in Large Language Models (LLMs). However, it's worth noting that hallucination is also quite prevalent in Vision-Language models (VLMs). In this paper, we offer a fine-grained discours…

    Submitted 30 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  11. arXiv:2403.04738  [pdf, ps, other]

    math.NT

    Control Theorems for Hilbert Modular Varieties

    Authors: Arshay Sheth

    Abstract: We prove an exact control theorem, in the sense of Hida theory, for the ordinary part of the middle degree étale cohomology of certain Hilbert modular varieties, after localizing at a suitable maximal ideal of the Hecke algebra. Our method of proof builds upon the techniques introduced by Loeffler-Rockwood-Zerbes; another important ingredient in our proof is the recent work of Caraiani-Tamiozzo on…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 15 pages, comments welcome!

  12. Grounding from an AI and Cognitive Science Lens

    Authors: Goonmeet Bajaj, Srinivasan Parthasarathy, Valerie L. Shalin, Amit Sheth

    Abstract: Grounding is a challenging problem, requiring a formal definition and different levels of abstraction. This article explores grounding from both cognitive science and machine learning perspectives. It identifies the subtleties of grounding, its significance for collaborative agents, and similarities and differences in grounding approaches in both communities. The article examines the potential of…

    Submitted 19 February, 2024; originally announced February 2024.

    Journal ref: IEEE Intelligent Systems, 2024

  13. arXiv:2401.02500  [pdf, other]

    cs.AI

    On the Prospects of Incorporating Large Language Models (LLMs) in Automated Planning and Scheduling (APS)

    Authors: Vishal Pallagani, Kaushik Roy, Bharath Muppasani, Francesco Fabiano, Andrea Loreggia, Keerthiram Murugesan, Biplav Srivastava, Francesca Rossi, Lior Horesh, Amit Sheth

    Abstract: Automated Planning and Scheduling is among the growing areas in Artificial Intelligence (AI) where mention of LLMs has gained popularity. Based on a comprehensive review of 126 papers, this paper investigates eight categories based on the unique applications of LLMs in addressing various aspects of planning problems: language translation, plan generation, model construction, multi-agent planning,…

    Submitted 20 January, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

  14. arXiv:2312.10084  [pdf, other]

    q-fin.ST

    A Decadal Analysis of the Lead-Lag Effect in the NYSE

    Authors: Aarush Pratik Sheth, Jonah Riley Weinbaum, Kevin Javier Zvonarek

    Abstract: As is widely known, the stock market is a complex system in which a multitude of factors influence the performance of individual stocks and the market as a whole. One method for comprehending -- and potentially predicting -- stock market behavior is through network analysis, which can offer insights into the relationships between stocks and the overall market structure. In this paper, we seek to a…

    Submitted 11 December, 2023; originally announced December 2023.

  15. arXiv:2312.09948  [pdf, other]

    cs.IR cs.DL

    GEAR-Up: Generative AI and External Knowledge-based Retrieval Upgrading Scholarly Article Searches for Systematic Reviews

    Authors: Kaushik Roy, Vedant Khandelwal, Harshul Surana, Valerie Vera, Amit Sheth, Heather Heckman

    Abstract: Systematic reviews (SRs) - the librarian-assisted literature survey of scholarly articles takes time and requires significant human resources. Given the ever-increasing volume of published studies, applying existing computing and informatics technology can decrease this time and resource burden. Due to the revolutionary advances in (1) Generative AI such as ChatGPT, and (2) External knowledge-augm…

    Submitted 15 December, 2023; originally announced December 2023.

  16. arXiv:2312.09932  [pdf, other]

    cs.CL cs.AI

    RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding

    Authors: Yuxin Zi, Hariram Veeramani, Kaushik Roy, Amit Sheth

    Abstract: Natural language understanding (NLU) using neural network pipelines often requires additional context that is not solely present in the input data. Through Prior research, it has been evident that NLU benchmarks are susceptible to manipulation by neural models, wherein these models exploit statistical artifacts within the encoded external knowledge to artificially inflate performance metrics for d…

    Submitted 5 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  17. arXiv:2312.09928  [pdf, other]

    cs.AI

    Neurosymbolic Value-Inspired AI (Why, What, and How)

    Authors: Amit Sheth, Kaushik Roy

    Abstract: The rapid progression of Artificial Intelligence (AI) systems, facilitated by the advent of Large Language Models (LLMs), has resulted in their widespread application to provide human assistance across diverse industries. This trend has sparked significant discourse centered around the ever-increasing need for LLM-based AI systems to function among humans as part of human society, sharing human va…

    Submitted 15 December, 2023; originally announced December 2023.

  18. arXiv:2312.06798  [pdf, other]

    cs.AI cs.CL cs.LG

    Building Trustworthy NeuroSymbolic AI Systems: Consistency, Reliability, Explainability, and Safety

    Authors: Manas Gaur, Amit Sheth

    Abstract: Explainability and Safety engender Trust. These require a model to exhibit consistency and reliability. To achieve these, it is necessary to use and analyze data and knowledge with statistical and symbolic AI methods relevant to the AI application - neither alone will do. Consequently, we argue and seek to demonstrate that the NeuroSymbolic AI approach is better suited for making AI a trusted AI s…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: To Appear in AAAI AI Magazine. 15 pages, 7 figures

    ACM Class: I.2; I.2.7; J.3; H.3.3

  19. arXiv:2312.05236  [pdf, ps, other]

    math.NT

    Euler Product Asymptotics for $L$-functions of Elliptic Curves

    Authors: Arshay Sheth

    Abstract: Let $E/\mathbb Q$ be an elliptic curve and for each prime $p$, let $N_p$ denote the number of points of $E$ modulo $p$. The original version of the Birch and Swinnerton-Dyer conjecture asserts that $\prod \limits _{p \leq x} \frac{N_p}{p} \sim C (\log x) ^{\text{rank}(E(\mathbb Q))}$ as $x \to \infty$. Goldfeld showed that this conjecture implies both the Riemann Hypothesis for $L(E, s)$ and the m…

    Submitted 29 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: We have rewritten the introduction and strengthened the statement of one of our main theorems (Theorem 1.5 from the previous version)

  20. arXiv:2312.00292  [pdf, other]

    cs.CL

    SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection

    Authors: Anku Rani, Dwip Dalal, Shreya Gautam, Pankaj Gupta, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: Deception is the intentional practice of twisting information. It is a nuanced societal practice deeply intertwined with human societal evolution, characterized by a multitude of facets. This research explores the problem of deception through the lens of psychology, employing a framework that categorizes deception into three forms: lies of omission, lies of commission, and lies of influence. The p…

    Submitted 30 November, 2023; originally announced December 2023.

  21. arXiv:2311.13852  [pdf, other]

    cs.AI

    A Cross Attention Approach to Diagnostic Explainability using Clinical Practice Guidelines for Depression

    Authors: Sumit Dalal, Deepa Tilwani, Kaushik Roy, Manas Gaur, Sarika Jain, Valerie Shalin, Amit Sheth

    Abstract: The lack of explainability using relevant clinical knowledge hinders the adoption of Artificial Intelligence-powered analysis of unstructured clinical dialogue. A wealth of relevant, untapped Mental Health (MH) data is available in online communities, providing the opportunity to address the explainability problem with substantial potential impact as a screening tool for both online and offline ap…

    Submitted 28 April, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

  22. arXiv:2311.06493  [pdf, other]

    cs.CL

    L3 Ensembles: Lifelong Learning Approach for Ensemble of Foundational Language Models

    Authors: Aidin Shiri, Kaushik Roy, Amit Sheth, Manas Gaur

    Abstract: Fine-tuning pre-trained foundational language models (FLM) for specific tasks is often impractical, especially for resource-constrained devices. This necessitates the development of a Lifelong Learning (L3) framework that continuously adapts to a stream of Natural Language Processing (NLP) tasks efficiently. We propose an approach that focuses on extracting meaningful representations from unseen d…

    Submitted 11 November, 2023; originally announced November 2023.

  23. arXiv:2310.07818  [pdf, other]

    cs.CL cs.AI

    On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models

    Authors: Thilini Wijesiriwardene, Ruwan Wickramarachchi, Aishwarya Naresh Reganti, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: The ability of Large Language Models (LLMs) to encode syntactic and semantic structures of language is well examined in NLP. Additionally, analogy identification, in the form of word analogies are extensively studied in the last decade of language modeling literature. In this work we specifically look at how LLMs' abilities to capture sentence analogies (sentences that convey analogous meaning to…

    Submitted 5 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: To appear in Findings of EACL 2024

  24. arXiv:2310.05030  [pdf, other]

    cs.CL cs.AI

    Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index

    Authors: Megha Chakraborty, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Krish Sharma, Niyar R Barman, Chandan Gupta, Shreya Gautam, Tanay Kumar, Vinija Jain, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: With the rise of prolific ChatGPT, the risk and consequences of AI-generated text has increased alarmingly. To address the inevitable question of ownership attribution for AI-generated artifacts, the US Copyright Office released a statement stating that 'If a work's traditional elements of authorship were produced by a machine, the work lacks human authorship and the Office will not register it'.…

    Submitted 23 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main

  25. arXiv:2310.04988  [pdf, other]

    cs.AI

    The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations

    Authors: Vipula Rawte, Swagata Chakraborty, Agnibh Pathak, Anubhav Sarkar, S. M Towhidul Islam Tonmoy, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: The recent advancements in Large Language Models (LLMs) have garnered widespread acclaim for their remarkable emerging capabilities. However, the issue of hallucination has parallelly emerged as a by-product, posing significant concerns. While some recent endeavors have been made to identify and mitigate different types of hallucination, there has been a limited emphasis on the nuanced categorizat…

    Submitted 22 October, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

  26. arXiv:2309.11356  [pdf, other]

    cs.AI

    A Comprehensive Survey on Rare Event Prediction

    Authors: Chathurangi Shyalika, Ruwan Wickramarachchi, Amit Sheth

    Abstract: Rare event prediction involves identifying and forecasting events with a low probability using machine learning and data analysis. Due to the imbalanced data distributions, where the frequency of common events vastly outweighs that of rare events, it requires using specialized methods within each step of the machine learning pipeline, i.e., from data processing to algorithms to evaluation protocol…

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 44 pages

  27. arXiv:2309.11064  [pdf, other]

    cs.AI

    Exploring the Relationship between LLM Hallucinations and Prompt Linguistic Nuances: Readability, Formality, and Concreteness

    Authors: Vipula Rawte, Prachi Priya, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Amit Sheth, Amitava Das

    Abstract: As Large Language Models (LLMs) have advanced, they have brought forth new challenges, with one of the prominent issues being LLM hallucination. While various mitigation techniques are emerging to address hallucination, it is equally crucial to delve into its underlying causes. Consequently, in this preliminary exploratory investigation, we examine how linguistic factors in prompts, specifically r…

    Submitted 20 September, 2023; originally announced September 2023.

  28. arXiv:2309.06517  [pdf, other]

    cs.CL

    Overview of Memotion 3: Sentiment and Emotion Analysis of Codemixed Hinglish Memes

    Authors: Shreyash Mishra, S Suryavardan, Megha Chakraborty, Parth Patwa, Anku Rani, Aman Chadha, Aishwarya Reganti, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

    Abstract: Analyzing memes on the internet has emerged as a crucial endeavor due to the impact this multi-modal form of content wields in shaping online discourse. Memes have become a powerful tool for expressing emotions and sentiments, possibly even spreading hate and misinformation, through humor and sarcasm. In this paper, we present the overview of the Memotion 3 shared task, as part of the DeFactify 2…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Defactify2 @AAAI 2023

  29. arXiv:2309.05922  [pdf, other]

    cs.AI cs.CL cs.IR

    A Survey of Hallucination in Large Foundation Models

    Authors: Vipula Rawte, Amit Sheth, Amitava Das

    Abstract: Hallucination in a foundation model (FM) refers to the generation of content that strays from factual reality or includes fabricated information. This survey paper provides an extensive overview of recent efforts that aim to identify, elucidate, and tackle the problem of hallucination, with a particular focus on "Large" Foundation Models (LFMs). The paper classifies various types of hallucinatio…

    Submitted 11 September, 2023; originally announced September 2023.

  30. arXiv:2309.05804  [pdf, other]

    cs.CL

    Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation Metric

    Authors: Abhisek Tiwari, Muhammed Sinan, Kaushik Roy, Amit Sheth, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: Over the past two decades, dialogue modeling has made significant strides, moving from simple rule-based responses to personalized and persuasive response generation. However, despite these advancements, the objective functions and evaluation metrics for dialogue generation have remained stagnant. These lexical-based metrics, e.g., cross-entropy and BLEU, have two key limitations: (a) word-to-word…

    Submitted 29 May, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  31. arXiv:2308.14659  [pdf, other]

    cs.LG

    RESTORE: Graph Embedding Assessment Through Reconstruction

    Authors: Hong Yung Yip, Chidaksh Ravuru, Neelabha Banerjee, Shashwat Jha, Amit Sheth, Aman Chadha, Amitava Das

    Abstract: Following the success of Word2Vec embeddings, graph embeddings (GEs) have gained substantial traction. GEs are commonly generated and evaluated extrinsically on downstream applications, but intrinsic evaluations of the original graph properties in terms of topological structure and semantic information have been lacking. Understanding these will help identify the deficiency of the various families…

    Submitted 5 September, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  32. arXiv:2308.01936  [pdf]

    cs.AI cs.CL

    Why Do We Need Neuro-symbolic AI to Model Pragmatic Analogies?

    Authors: Thilini Wijesiriwardene, Amit Sheth, Valerie L. Shalin, Amitava Das

    Abstract: A hallmark of intelligence is the ability to use a familiar domain to make inferences about a less familiar domain, known as analogical reasoning. In this article, we delve into the performance of Large Language Models (LLMs) in dealing with progressively complex analogies expressed in unstructured text. We discuss analogies at four distinct levels of complexity: lexical analogies, syntactic analo…

    Submitted 12 September, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 12 pages 3 figures

  33. arXiv:2307.10475  [pdf]

    cs.CL cs.CV

    Findings of Factify 2: Multimodal Fake News Detection

    Authors: S Suryavardan, Shreyash Mishra, Megha Chakraborty, Parth Patwa, Anku Rani, Aman Chadha, Aishwarya Reganti, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

    Abstract: With social media usage growing exponentially in the past few years, fake news has also become extremely prevalent. The detrimental impact of fake news emphasizes the need for research focused on automating the detection of false information and verifying its accuracy. In this work, we present the outcome of the Factify 2 shared task, which provides a multi-modal fact verification and satire news…

    Submitted 12 September, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Defactify2 @AAAI 2023

  34. arXiv:2306.13865  [pdf, other]

    cs.CL

    IERL: Interpretable Ensemble Representation Learning -- Combining CrowdSourced Knowledge and Distributed Semantic Representations

    Authors: Yuxin Zi, Kaushik Roy, Vignesh Narayanan, Manas Gaur, Amit Sheth

    Abstract: Large Language Models (LLMs) encode meanings of words in the form of distributed semantics. Distributed semantics capture common statistical patterns among language tokens (words, phrases, and sentences) from large amounts of data. LLMs perform exceedingly well across General Language Understanding Evaluation (GLUE) tasks designed to test a model's understanding of the meanings of the input tokens…

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Accepted for publication at the KDD workshop on Knowledge-infused Machine Learning, 2023

  35. arXiv:2306.13501  [pdf, other]

    cs.CL

    Knowledge-Infused Self Attention Transformers

    Authors: Kaushik Roy, Yuxin Zi, Vignesh Narayanan, Manas Gaur, Amit Sheth

    Abstract: Transformer-based language models have achieved impressive success in various natural language processing tasks due to their ability to capture complex dependencies and contextual information using self-attention mechanisms. However, they are not without limitations. These limitations include hallucinations, where they produce incorrect outputs with high confidence, and alignment issues, where the…

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: Accepted for publication at the Second Workshop on Knowledge Augmented Methods for NLP, colocated with KDD 2023

  36. arXiv:2306.09824  [pdf, other]

    cs.CL cs.AI

    Process Knowledge-infused Learning for Clinician-friendly Explanations

    Authors: Kaushik Roy, Yuxin Zi, Manas Gaur, Jinendra Malekar, Qi Zhang, Vignesh Narayanan, Amit Sheth

    Abstract: Language models have the potential to assess mental health using social media data. By analyzing online posts and conversations, these models can detect patterns indicating mental health conditions like depression, anxiety, or suicidal thoughts. They examine keywords, language markers, and sentiment to gain insights into an individual's mental well-being. This information is crucial for early dete…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted for Publication at AAAI Second Symposium on Human Partnership with Medical Artificial Intelligence (HUMAN.AI Summer 2023): Design, Operationalization, and Ethics. July 17-19, 2023

  37. arXiv:2306.05523  [pdf, other]

    cs.CL cs.AI cs.CV cs.MM

    FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering

    Authors: Megha Chakraborty, Khushbu Pahwa, Anku Rani, Shreyas Chatterjee, Dwip Dalal, Harshit Dave, Ritvik G, Preethi Gurumurthy, Adarsh Mahor, Samahriti Mukherjee, Aditya Pakala, Ishan Paul, Janvita Reddy, Arghya Sarkar, Kinjal Sensharma, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: Combating disinformation is one of the burning societal crises -- about 67% of the American population believes that disinformation produces a lot of uncertainty, and 10% of them knowingly propagate disinformation. Evidence shows that disinformation can manipulate democratic processes and public opinion, causing disruption in the share market, panic and anxiety in society, and even death during cr…

    Submitted 30 October, 2023; v1 submitted 22 May, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.04329

  38. arXiv:2306.01805  [pdf, other]

    cs.CL cs.AI cs.IR

    Cook-Gen: Robust Generative Modeling of Cooking Actions from Recipes

    Authors: Revathy Venkataramanan, Kaushik Roy, Kanak Raj, Renjith Prasad, Yuxin Zi, Vignesh Narayanan, Amit Sheth

    Abstract: As people become more aware of their food choices, food computation models have become increasingly popular in assisting people in maintaining healthy eating habits. For example, food recommendation systems analyze recipe instructions to assess nutritional contents and provide recipe recommendations. The recent and remarkable successes of generative AI methods, such as auto-regressive large langua…

    Submitted 1 June, 2023; originally announced June 2023.

  39. arXiv:2305.10438  [pdf, other]

    cs.CL cs.AI cs.CV cs.MM

    IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images

    Authors: Varuna Krishna, S Suryavardan, Shreyash Mishra, Sathyanarayanan Ramamoorthy, Parth Patwa, Megha Chakraborty, Aman Chadha, Amitava Das, Amit Sheth

    Abstract: Word embeddings, i.e., semantically meaningful vector representation of words, are largely influenced by the distributional hypothesis "You shall know a word by the company it keeps" (Harris, 1954), whereas modern prediction-based neural network embeddings rely on design choices and hyperparameter optimization. Word embeddings like Word2Vec, GloVe etc. well capture the contextuality and real-world…

    Submitted 12 May, 2023; originally announced May 2023.

  40. ProKnow: Process Knowledge for Safety Constrained and Explainable Question Generation for Mental Health Diagnostic Assistance

    Authors: Kaushik Roy, Manas Gaur, Misagh Soltani, Vipula Rawte, Ashwin Kalyan, Amit Sheth

    Abstract: Current Virtual Mental Health Assistants (VMHAs) provide counseling and suggestive care. They refrain from patient diagnostic assistance because they lack training in safety-constrained and specialized clinical process knowledge. In this work, we define Proknow as an ordered set of information that maps to evidence-based guidelines or categories of conceptual understanding to experts in a domain.…

    Submitted 1 June, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

    Journal ref: Front. Big Data, 09 January 2023, Sec. Data Science, Volume 5 - 2022

  41. arXiv:2305.05050  [pdf, other]

    cs.CL cs.AI

    ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models

    Authors: Thilini Wijesiriwardene, Ruwan Wickramarachchi, Bimal G. Gajera, Shreeyash Mukul Gowaikar, Chandan Gupta, Aman Chadha, Aishwarya Naresh Reganti, Amit Sheth, Amitava Das

    Abstract: Over the past decade, analogies, in the form of word-level analogies, have played a significant role as an intrinsic measure of evaluating the quality of word embedding methods such as word2vec. Modern large language models (LLMs), however, are primarily evaluated on extrinsic measures based on benchmarks such as GLUE and SuperGLUE, and there are only a few investigations on whether LLMs can draw…

    Submitted 25 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted as a long paper at Findings of ACL 2023

  42. arXiv:2305.04989  [pdf, other]

    cs.CL cs.AI

    Knowledge Graph Guided Semantic Evaluation of Language Models For User Trust

    Authors: Kaushik Roy, Tarun Garg, Vedant Palit, Yuxin Zi, Vignesh Narayanan, Amit Sheth

    Abstract: A fundamental question in natural language processing is: what kind of language structure and semantics is the language model capturing? Graph formats such as knowledge graphs are easy to evaluate as they explicitly express language semantics and structure. This study evaluates the semantics encoded in the self-attention transformers by leveraging explicit knowledge graph structures. We propose n…

    Submitted 8 May, 2023; originally announced May 2023.

  43. arXiv:2305.04329  [pdf, other]

    cs.CL

    FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering

    Authors: Anku Rani, S. M Towhidul Islam Tonmoy, Dwip Dalal, Shreya Gautam, Megha Chakraborty, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: Automatic fact verification has received significant attention recently. Contemporary automatic fact-checking systems focus on estimating truthfulness using numerical scores which are not human-interpretable. A human fact-checker generally follows several logical steps to verify a verisimilitude claim and conclude whether it's truthful or a mere masquerade. Popular fact-checking websites follow a c…

    Submitted 28 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL main conference 2023

  44. arXiv:2305.00813  [pdf, other]

    cs.AI

    Neurosymbolic AI -- Why, What, and How

    Authors: Amit Sheth, Kaushik Roy, Manas Gaur

    Abstract: Humans interact with the environment using a combination of perception - transforming sensory inputs from their environment into symbols, and cognition - mapping symbols to knowledge about the environment for supporting abstraction, reasoning by analogy, and long-term planning. Human perception-inspired machine perception, in the context of AI, refers to large-scale pattern recognition from raw da…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: To appear in IEEE Intelligent Systems

  45. arXiv:2304.10512  [pdf, other]

    cs.LG cs.CL cs.SI

    "Can We Detect Substance Use Disorder?": Knowledge and Time Aware Classification on Social Media from Darkweb

    Authors: Usha Lokala, Orchid Chetia Phukan, Triyasha Ghosh Dastidar, Francois Lamy, Raminta Daniulaityte, Amit Sheth

    Abstract: Opioid and substance misuse is rampant in the United States today, with the phenomenon known as the "opioid crisis". The relationship between substance use and mental health has been extensively studied, with one possible relationship being: substance misuse causes poor mental health. However, the lack of evidence on the relationship has resulted in opioids being largely inaccessible through legal…

    Submitted 20 April, 2023; originally announced April 2023.

  46. arXiv:2304.03897  [pdf]

    cs.CL cs.CV

    Factify 2: A Multimodal Fake News and Satire News Dataset

    Authors: S Suryavardan, Shreyash Mishra, Parth Patwa, Megha Chakraborty, Anku Rani, Aishwarya Reganti, Aman Chadha, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

    Abstract: The internet gives the world an open platform to express their views and share their stories. While this is very valuable, it makes fake news one of our society's most pressing problems. The manual fact-checking process is time-consuming, which makes it challenging to disprove misleading assertions before they cause significant harm. This is the driving interest in automatic fact or claim verification.…

    Submitted 2 October, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Defactify2 @AAAI2023

  47. arXiv:2304.00025  [pdf, other]

    cs.CL

    Demo Alleviate: Demonstrating Artificial Intelligence Enabled Virtual Assistance for Telehealth: The Mental Health Case

    Authors: Kaushik Roy, Vedant Khandelwal, Raxit Goswami, Nathan Dolbir, Jinendra Malekar, Amit Sheth

    Abstract: After the pandemic, artificial intelligence (AI) powered support for mental health care has become increasingly important. The breadth and complexity of significant challenges required to provide adequate care involve: (a) Personalized patient understanding, (b) Safety-constrained and medically validated chatbot patient interactions, and (c) Support for continued feedback-based refinements in desi…

    Submitted 31 March, 2023; originally announced April 2023.

  48. arXiv:2303.09892  [pdf]

    cs.CL

    Memotion 3: Dataset on Sentiment and Emotion Analysis of Codemixed Hindi-English Memes

    Authors: Shreyash Mishra, S Suryavardan, Parth Patwa, Megha Chakraborty, Anku Rani, Aishwarya Reganti, Aman Chadha, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

    Abstract: Memes are the new-age conveyance mechanism for humor on social media sites. Memes often include an image and some text. Memes can be used to promote disinformation or hatred, so it is crucial to investigate them in detail. We introduce Memotion 3, a new dataset with 10,000 annotated memes. Unlike other prevalent datasets in the domain, including prior iterations of Memotion, Memotion 3 introduces Hi…

    Submitted 2 October, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: Defactify2 @AAAI

  49. arXiv:2210.04307  [pdf, other]

    cs.CL cs.AI

    KSAT: Knowledge-infused Self Attention Transformer -- Integrating Multiple Domain-Specific Contexts

    Authors: Kaushik Roy, Yuxin Zi, Vignesh Narayanan, Manas Gaur, Amit Sheth

    Abstract: Domain-specific language understanding requires integrating multiple pieces of relevant contextual information. For example, we see both suicide and depression-related behavior (multiple contexts) in the text "I have a gun and feel pretty bad about my life, and it wouldn't be the worst thing if I didn't wake up tomorrow". Domain specificity in self-attention architectures is handled by fine-tuni…

    Submitted 24 June, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: Preprint version of paper accepted for publication at KDD workshop on Knowledge Augmented Methods for NLP, 2023

  50. arXiv:2206.13349  [pdf, other]

    cs.AI cs.CL

    Process Knowledge-Infused AI: Towards User-level Explainability, Interpretability, and Safety

    Authors: Amit Sheth, Manas Gaur, Kaushik Roy, Revathy Venkataraman, Vedant Khandelwal

    Abstract: AI systems have been widely adopted across various domains in the real world. However, in high-value, sensitive, or safety-critical applications such as self-management for personalized health or food recommendation with a specific purpose (e.g., allergy-aware recipe recommendations), their adoption is unlikely. Firstly, the AI system needs to follow guidelines or well-defined processes set by exp…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: To appear in IEEE Internet Computing 2022