Showing 1–10 of 10 results for author: Lal, Y K

  1. arXiv:2406.17104

    cs.CL

    Automated Adversarial Discovery for Safety Classifiers

    Authors: Yash Kumar Lal, Preethi Lahoti, Aradhana Sinha, Yao Qin, Ananth Balashankar

    Abstract: Safety classifiers are critical in mitigating toxicity on online forums such as social media and in chatbots. Still, they continue to be vulnerable to emergent, and often innumerable, adversarial attacks. Traditional automated adversarial data generation methods, however, tend to produce attacks that are not diverse, but variations of previously observed harm types. We formalize the task of automa…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Published at Fourth Workshop on TrustworthyNLP (TrustNLP) at NAACL 2024

  2. arXiv:2406.15823

    cs.CL

    CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans

    Authors: Yash Kumar Lal, Vanya Cohen, Nathanael Chambers, Niranjan Balasubramanian, Raymond Mooney

    Abstract: Understanding the abilities of LLMs to reason about natural language plans, such as instructional text and recipes, is critical to reliably using them in decision-making systems. A fundamental aspect of plans is the temporal order in which their steps need to be executed, which reflects the underlying causal dependencies between them. We introduce CaT-Bench, a benchmark of Step Order Prediction q…

    Submitted 22 June, 2024; originally announced June 2024.

  3. arXiv:2402.01980

    cs.CL

    SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

    Authors: Gourab Dey, Adithya V Ganesan, Yash Kumar Lal, Manal Shah, Shreyashee Sinha, Matthew Matero, Salvatore Giorgi, Vivek Kulkarni, H. Andrew Schwartz

    Abstract: Social science NLP tasks, such as emotion or humor detection, are required to capture the semantics along with the implicit pragmatics from text, often with limited amounts of training data. Instruction tuning has been shown to improve the many capabilities of large language models (LLMs) such as commonsense reasoning, reading comprehension, and computer programming. However, little is known about…

    Submitted 14 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Short paper accepted to EACL 2024. 4 pgs, 2 tables

  4. arXiv:2311.09510

    cs.CL

    Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization

    Authors: Yash Kumar Lal, Li Zhang, Faeze Brahman, Bodhisattwa Prasad Majumder, Peter Clark, Niket Tandon

    Abstract: How-to procedures, such as how to plant a garden, are now used by millions of users, but sometimes need customizing to meet a user's specific needs, e.g., planting a garden without pesticides. Our goal is to measure and improve an LLM's ability to perform such customization. Our approach is to test several simple multi-LLM-agent architectures for customization, as well as an end-to-end LLM, using…

    Submitted 30 May, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Camera ready version accepted to Findings of ACL 2024

  5. arXiv:2306.16722

    cs.CL cs.AI

    Evaluating Paraphrastic Robustness in Textual Entailment Models

    Authors: Dhruv Verma, Yash Kumar Lal, Shreyashee Sinha, Benjamin Van Durme, Adam Poliak

    Abstract: We present PaRTE, a collection of 1,126 pairs of Recognizing Textual Entailment (RTE) examples to evaluate whether models are robust to paraphrasing. We posit that if RTE models understand language, their predictions should be consistent across inputs that share the same meaning. We use the evaluation set to determine if RTE models' predictions change when examples are paraphrased. In our experime…

    Submitted 29 June, 2023; originally announced June 2023.

  6. arXiv:2306.01183

    cs.CL

    Systematic Evaluation of GPT-3 for Zero-Shot Personality Estimation

    Authors: Adithya V Ganesan, Yash Kumar Lal, August Håkan Nilsson, H. Andrew Schwartz

    Abstract: Very large language models (LLMs) perform extremely well on a spectrum of NLP tasks in a zero-shot setting. However, little is known about their performance on human-level NLP problems which rely on understanding psychological concepts, such as assessing personality traits. In this work, we investigate the zero-shot ability of GPT-3 to estimate the Big 5 personality traits from users' social media…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Short Paper (5 pages), Accepted to (WASSA) 13th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis at ACL 2023

    MSC Class: 68T50

    ACM Class: J.4; I.2; I.7

  7. TellMeWhy: A Dataset for Answering Why-Questions in Narratives

    Authors: Yash Kumar Lal, Nathanael Chambers, Raymond Mooney, Niranjan Balasubramanian

    Abstract: Answering questions about why characters perform certain actions is central to understanding and reasoning about narratives. Despite recent progress in QA, it is not clear if existing models have the ability to answer "why" questions that may require commonsense knowledge external to the input narrative. In this work, we introduce TellMeWhy, a new crowd-sourced dataset that consists of more than 3…

    Submitted 17 August, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted to Findings of ACL, 2021. Data and evaluation suite available at http://lunr.cs.stonybrook.edu/tellmewhy

  8. arXiv:2106.01199

    cs.CL

    IrEne: Interpretable Energy Prediction for Transformers

    Authors: Qingqing Cao, Yash Kumar Lal, Harsh Trivedi, Aruna Balasubramanian, Niranjan Balasubramanian

    Abstract: Existing software-based energy measurements of NLP models are not accurate because they do not consider the complex interactions between energy consumption and model execution. We present IrEne, an interpretable and extensible energy prediction system that accurately predicts the inference energy consumption of a wide range of Transformer-based NLP models. IrEne constructs a model tree graph that…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: ACL 2021 camera ready

  9. arXiv:1808.00957

    cs.IR cs.CL

    SWDE : A Sub-Word And Document Embedding Based Engine for Clickbait Detection

    Authors: Vaibhav Kumar, Mrinal Dhar, Dhruv Khattar, Yash Kumar Lal, Abhimanshu Mishra, Manish Shrivastava, Vasudeva Varma

    Abstract: In order to expand their reach and increase website ad revenue, media outlets have started using clickbait techniques to lure readers to click on articles on their digital platform. Having successfully enticed the user to open the article, the article fails to satiate his curiosity serving only to boost click-through rates. Initial methods for this task were dependent on feature engineering, which…

    Submitted 2 August, 2018; originally announced August 2018.

    Comments: Accepted at SIGIR 2018 as Computational Surprise in Information Retrieval (CompS) Workshop Paper. arXiv admin note: substantial text overlap with arXiv:1710.01507

    Journal ref: "SWDE : A Sub-Word And Document Embedding Based Engine for Clickbait Detection". In Proceedings of SIGIR 2018 Workshop on Computational Surprise in Information Retrieval, Ann Arbor, MI, USA, July 8-12 (CompS'18, SIGIR), 4 pages

  10. arXiv:1710.01507

    cs.IR cs.CL cs.CY cs.SI

    Identifying Clickbait: A Multi-Strategy Approach Using Neural Networks

    Authors: Vaibhav Kumar, Dhruv Khattar, Siddhartha Gairola, Yash Kumar Lal, Vasudeva Varma

    Abstract: Online media outlets, in a bid to expand their reach and subsequently increase revenue through ad monetisation, have begun adopting clickbait techniques to lure readers to click on articles. The article fails to fulfill the promise made by the headline. Traditional methods for clickbait detection have relied heavily on feature engineering which, in turn, is dependent on the dataset it is built for…

    Submitted 1 August, 2018; v1 submitted 4 October, 2017; originally announced October 2017.

    Comments: Accepted at SIGIR 2018 as Short Paper

    Journal ref: "Identifying Clickbait: A Multi-Strategy Approach Using Neural Networks". In Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval 2018. Pages: 1225-1228