Showing 1–8 of 8 results for author: Baheti, A

  1. arXiv:2312.05979  [pdf, other]

    cs.CL

    NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation

    Authors: Peter West, Ronan Le Bras, Taylor Sorensen, Bill Yuchen Lin, Liwei Jiang, Ximing Lu, Khyathi Chandu, Jack Hessel, Ashutosh Baheti, Chandra Bhagavatula, Yejin Choi

    Abstract: We present NovaCOMET, an open commonsense knowledge model, that combines the best aspects of knowledge and general task models. Compared to previous knowledge models, NovaCOMET allows open-format relations enabling direct application to reasoning tasks; compared to general task models like Flan-T5, it explicitly centers knowledge, enabling superior performance for commonsense reasoning. NovaCOME…

    Submitted 10 December, 2023; originally announced December 2023.

  2. arXiv:2305.14718  [pdf, other]

    cs.CL

    Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models

    Authors: Ashutosh Baheti, Ximing Lu, Faeze Brahman, Ronan Le Bras, Maarten Sap, Mark Riedl

    Abstract: Reinforcement Learning with Human Feedback (RLHF) is the most prominent method for Language Model (LM) alignment. However, RLHF is an unstable and data-hungry process that continually requires new high-quality LM-generated data for finetuning. We introduce Advantage-Leftover Lunch RL (A-LoL), a new class of offline policy gradient algorithms that enable RL training on any pre-existing data. By ass…

    Submitted 19 April, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: published at ICLR 2024

  3. arXiv:2210.15954  [pdf, other]

    cs.CL

    Stanceosaurus: Classifying Stance Towards Multilingual Misinformation

    Authors: Jonathan Zheng, Ashutosh Baheti, Tarek Naous, Wei Xu, Alan Ritter

    Abstract: We present Stanceosaurus, a new corpus of 28,033 tweets in English, Hindi, and Arabic annotated with stance towards 251 misinformation claims. As far as we are aware, it is the largest corpus annotated with stance towards misinformation claims. The claims in Stanceosaurus originate from 15 fact-checking sources that cover diverse geographical regions and cultures. Unlike existing stance datasets,…

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022 main conference

  4. arXiv:2108.11830  [pdf, other]

    cs.CL

    Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts

    Authors: Ashutosh Baheti, Maarten Sap, Alan Ritter, Mark Riedl

    Abstract: Dialogue models trained on human conversations inadvertently learn to generate toxic responses. In addition to producing explicitly offensive utterances, these models can also implicitly insult a group or individual by aligning themselves with an offensive statement. To better understand the dynamics of contextually offensive language, we investigate the stance of dialogue model responses in offen…

    Submitted 13 September, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted at EMNLP 2021

  5. arXiv:2006.02567  [pdf, other]

    cs.CL cs.SI

    Extracting a Knowledge Base of COVID-19 Events from Social Media

    Authors: Shi Zong, Ashutosh Baheti, Wei Xu, Alan Ritter

    Abstract: In this paper, we present a manually annotated corpus of 10,000 tweets containing public reports of five COVID-19 events, including positive and negative tests, deaths, denied access to testing, claimed cures and preventions. We designed slot-filling questions for each event type and annotated a total of 31 fine-grained slots, such as the location of events, recent travel, and close contacts. We s…

    Submitted 9 September, 2022; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Accepted at COLING 2022

  6. arXiv:2005.10464  [pdf, other]

    cs.CL

    Fluent Response Generation for Conversational Question Answering

    Authors: Ashutosh Baheti, Alan Ritter, Kevin Small

    Abstract: Question answering (QA) is an important aspect of open-domain conversational agents, garnering specific research focus in the conversational QA (ConvQA) subtask. One notable limitation of recent ConvQA efforts is the response being answer span extraction from the target corpus, thus ignoring the natural language generation (NLG) aspect of high-quality conversational agents. In this work, we propos…

    Submitted 16 December, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: 2020 Annual Conference of the Association for Computational Linguistics

  7. arXiv:1809.01215  [pdf, other]

    cs.CL

    Generating More Interesting Responses in Neural Conversation Models with Distributional Constraints

    Authors: Ashutosh Baheti, Alan Ritter, Jiwei Li, Bill Dolan

    Abstract: Neural conversation models tend to generate safe, generic responses for most inputs. This is due to the limitations of likelihood-based decoding objectives in generation tasks with diverse outputs, such as conversation. To address this challenge, we propose a simple yet effective approach for incorporating side information in the form of distributional constraints over the generated responses. We…

    Submitted 4 September, 2018; originally announced September 2018.

  8. arXiv:1611.07397  [pdf, other]

    cs.NI eess.SY

    Non-linear Barrier Coverage using Mobile Wireless Sensors

    Authors: Ashutosh Baheti, Arobinda Gupta

    Abstract: A belt region is said to be k-barrier covered by a set of sensors if all paths crossing the width of the belt region intersect the sensing regions of at least k sensors. Barrier coverage can be achieved from a random initial deployment of mobile sensors by suitably relocating the sensors to form a barrier. Reducing the movement of the sensors is important in such scenarios due to the energy constr…

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: 6 pages