Skip to main content

Showing 1–4 of 4 results for author: Sawant, S A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.17876  [pdf, other

    cs.CL

    TarGEN: Targeted Data Generation with Large Language Models

    Authors: Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar, Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra

    Abstract: The rapid advancement of large language models (LLMs) has sparked interest in data synthesis techniques, aiming to generate diverse and high-quality synthetic datasets. However, these synthetic datasets often suffer from a lack of diversity and added noise. In this paper, we present TarGEN, a multi-step prompting strategy for generating high-quality synthetic datasets utilizing a LLM. An advantage… ▽ More

    Submitted 30 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 10 pages, 6 tables, 5 figures, 5 pages references, 17 pages appendix

  2. arXiv:2306.05539  [pdf, other

    cs.CL

    Instruction Tuned Models are Quick Learners

    Authors: Himanshu Gupta, Saurabh Arjun Sawant, Swaroop Mishra, Mutsumi Nakamura, Arindam Mitra, Santosh Mashetty, Chitta Baral

    Abstract: Instruction tuning of language models has demonstrated the ability to enhance model generalization to unseen tasks via in-context learning using a few examples. However, typical supervised learning still requires a plethora of downstream training data for finetuning. Often in real-world situations, there is a scarcity of data available for finetuning, falling somewhere between few shot inference a… ▽ More

    Submitted 17 May, 2023; originally announced June 2023.

    Comments: 9 pages, 5 figures, 19 Tables (inclusing appendix), 12 pages of Appendix

  3. arXiv:2302.08624  [pdf, other

    cs.CL cs.LG

    InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis

    Authors: Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra, Chitta Baral

    Abstract: We introduce InstructABSA, an instruction learning paradigm for Aspect-Based Sentiment Analysis (ABSA) subtasks. Our method introduces positive, negative, and neutral examples to each training sample, and instruction tune the model (Tk-Instruct) for ABSA subtasks, yielding significant performance improvements. Experimental results on the Sem Eval 2014, 15, and 16 datasets demonstrate that Instruct… ▽ More

    Submitted 13 November, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: 4 pages, 3 figures, 9 tables, 9 appendix pages

  4. arXiv:2210.07471  [pdf, other

    cs.CL

    "John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility

    Authors: Himanshu Gupta, Neeraj Varshney, Swaroop Mishra, Kuntal Kumar Pal, Saurabh Arjun Sawant, Kevin Scaria, Siddharth Goyal, Chitta Baral

    Abstract: In current NLP research, large-scale language models and their abilities are widely being discussed. Some recent works have also found notable failures of these models. Often these failure examples involve complex reasoning abilities. This work focuses on a simple commonsense ability, reasoning about when an action (or its effect) is feasible. To this end, we introduce FeasibilityQA, a question-an… ▽ More

    Submitted 2 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EACL 2023