
Showing 1–9 of 9 results for author: Menon, R R

  1. arXiv:2312.05200  [pdf, other]

    cs.CL

    DelucionQA: Detecting Hallucinations in Domain-specific Question Answering

    Authors: Mobashir Sadat, Zhengyu Zhou, Lukas Lange, Jun Araki, Arsalan Gundroo, Bingqing Wang, Rakesh R Menon, Md Rizwan Parvez, Zhe Feng

    Abstract: Hallucination is a well-known phenomenon in text generated by large language models (LLMs). The existence of hallucinatory responses is found in almost all application scenarios e.g., summarization, question-answering (QA) etc. For applications requiring high reliability (e.g., customer-facing assistants), the potential existence of hallucination in LLM-generated text is a critical problem. The am…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted in EMNLP 2023 (Findings)

  2. arXiv:2311.07538  [pdf, other]

    cs.CL cs.LG

    Leveraging Multiple Teachers for Test-Time Adaptation of Language-Guided Classifiers

    Authors: Kangda Wei, Sayan Ghosh, Rakesh R. Menon, Shashank Srivastava

    Abstract: Recent approaches have explored language-guided classifiers capable of classifying examples from novel tasks when provided with task-specific natural language explanations, instructions or prompts (Sanh et al., 2022; R. Menon et al., 2022). While these classifiers can generalize in zero-shot settings, their task performance often varies substantially between different language explanations in unpr…

    Submitted 13 November, 2023; originally announced November 2023.

  3. arXiv:2311.04659  [pdf, other]

    cs.AI

    Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models

    Authors: Yiyuan Li, Rakesh R. Menon, Sayan Ghosh, Shashank Srivastava

    Abstract: Generalized quantifiers (e.g., few, most) are used to indicate the proportions predicates are satisfied (for example, some apples are red). One way to interpret quantifier semantics is to explicitly bind these satisfactions with percentage scopes (e.g., 30%-40% of apples are red). This approach can be helpful for tasks like logic formalization and surface-form quantitative reasoning (Gordon and Sc…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  4. arXiv:2305.12995  [pdf, other]

    cs.CL cs.AI cs.LG

    MaNtLE: Model-agnostic Natural Language Explainer

    Authors: Rakesh R. Menon, Kerem Zaman, Shashank Srivastava

    Abstract: Understanding the internal reasoning behind the predictions of machine learning systems is increasingly vital, given their rising adoption and acceptance. While previous approaches, such as LIME, generate algorithmic explanations by attributing importance to input features for individual examples, recent research indicates that practitioners prefer examining language explanations that explain sub-…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 17 pages, 13 figures, 6 tables

  5. arXiv:2212.09104  [pdf, other]

    cs.CL

    LaSQuE: Improved Zero-Shot Classification from Explanations Through Quantifier Modeling and Curriculum Learning

    Authors: Sayan Ghosh, Rakesh R Menon, Shashank Srivastava

    Abstract: A hallmark of human intelligence is the ability to learn new concepts purely from language. Several recent approaches have explored training machine learning models via natural language supervision. However, these approaches fall short in leveraging linguistic quantifiers (such as 'always' or 'rarely') and mimicking humans in compositionally learning complex tasks. Here, we present LaSQuE, a metho…

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: Work in progress

  6. arXiv:2204.07142  [pdf, other]

    cs.CL cs.AI cs.LG

    CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations

    Authors: Rakesh R Menon, Sayan Ghosh, Shashank Srivastava

    Abstract: Supervised learning has traditionally focused on inductive learning by observing labeled examples of a task. In contrast, humans have the ability to learn new concepts from language. Here, we explore training zero-shot classifiers for structured data purely from language. For this, we introduce CLUES, a benchmark for Classifier Learning Using natural language ExplanationS, consisting of a range of…

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: ACL 2022 (25 pages, 16 figures)

  7. arXiv:2103.11955  [pdf, other]

    cs.CL cs.AI cs.LG

    Improving and Simplifying Pattern Exploiting Training

    Authors: Derek Tam, Rakesh R Menon, Mohit Bansal, Shashank Srivastava, Colin Raffel

    Abstract: Recently, pre-trained language models (LMs) have achieved strong performance when fine-tuned on difficult benchmarks like SuperGLUE. However, performance can suffer when there are very few labeled examples available for fine-tuning. Pattern Exploiting Training (PET) is a recent approach that leverages patterns for few-shot learning. However, PET uses task-specific unlabeled data. In this paper, we…

    Submitted 28 September, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: EMNLP 2021 (12 pages, 2 figures)

  8. arXiv:1711.10386  [pdf]

    q-bio.OT physics.app-ph

    Screening of Fungi for the Application of Self-Healing Concrete

    Authors: Rakenth R. Menon, Jing Luo, Xiaobo Chen, Hui Zhou, Zhiyong Liu, Guangwen Zhou, Ning Zhang, Congrui Jin

    Abstract: Concrete is susceptible to cracking owing to drying shrinkage, freeze-thaw cycles, delayed ettringite formation, reinforcement corrosion, creep and fatigue, etc. Since maintenance and inspection of concrete infrastructure require onerous labor and high costs, self-healing of harmful cracks without human interference or intervention could be of great attraction. The goal of this study is to explore…

    Submitted 16 July, 2018; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: 21 pages. arXiv admin note: text overlap with arXiv:1708.01337

  9. arXiv:1709.04909  [pdf, other]

    cs.LG cs.AI

    Shared Learning : Enhancing Reinforcement in $Q$-Ensembles

    Authors: Rakesh R Menon, Balaraman Ravindran

    Abstract: Deep Reinforcement Learning has been able to achieve amazing successes in a variety of domains from video games to continuous control by trying to maximize the cumulative reward. However, most of these successes rely on algorithms that require a large amount of data to train in order to obtain results on par with human-level performance. This is not feasible if we are to deploy these systems on re…

    Submitted 14 September, 2017; originally announced September 2017.

    Comments: Submitted to AAAI 2018