Skip to main content

Showing 1–3 of 3 results for author: Dolin, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04046  [pdf, other

    cs.CC cs.AI

    ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints

    Authors: Divij Handa, Pavel Dolin, Shrinidhi Kumbhar, Chitta Baral, Tran Cao Son

    Abstract: Reasoning about actions and change (RAC) has historically driven the development of many early AI challenges, such as the frame problem, and many AI disciplines, including non-monotonic and commonsense reasoning. The role of RAC remains important even now, particularly for tasks involving dynamic environments, interactive scenarios, and commonsense reasoning. Despite the progress of Large Language… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 54 pages, 11 figures

  2. arXiv:2401.00287  [pdf, other

    cs.CL

    The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness

    Authors: Neeraj Varshney, Pavel Dolin, Agastya Seth, Chitta Baral

    Abstract: As Large Language Models (LLMs) play an increasingly pivotal role in natural language processing applications, their safety concerns become critical areas of NLP research. This paper presents Safety and Over-Defensiveness Evaluation (SODE) benchmark: a collection of diverse safe and unsafe prompts with carefully designed evaluation methods that facilitate systematic evaluation, comparison, and ana… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  3. arXiv:2108.08411  [pdf, other

    cs.CL cs.LG

    FeelsGoodMan: Inferring Semantics of Twitch Neologisms

    Authors: Pavel Dolin, Luc d'Hauthuille, Andrea Vattani

    Abstract: Twitch chats pose a unique problem in natural language understanding due to a large presence of neologisms, specifically emotes. There are a total of 8.06 million emotes, over 400k of which were used in the week studied. There is virtually no information on the meaning or sentiment of emotes, and with a constant influx of new emotes and drift in their frequencies, it becomes impossible to maintain… ▽ More

    Submitted 17 November, 2021; v1 submitted 18 August, 2021; originally announced August 2021.