Skip to main content

Showing 1–7 of 7 results for author: Subbiah, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.01061  [pdf, other

    cs.CL

    Reading Subtext: Evaluating Large Language Models on Short Story Summarization with Writers

    Authors: Melanie Subbiah, Sean Zhang, Lydia B. Chilton, Kathleen McKeown

    Abstract: We evaluate recent Large language Models (LLMs) on the challenging task of summarizing short stories, which can be lengthy, and include nuanced subtext or scrambled timelines. Importantly, we work directly with authors to ensure that the stories have not been shared online (and therefore are unseen by the models), and to obtain informed evaluations of summary quality using judgments from the autho… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  2. arXiv:2305.18265  [pdf, other

    cs.CL cs.AI cs.CY

    Check-COVID: Fact-Checking COVID-19 News Claims with Scientific Evidence

    Authors: Gengyu Wang, Kate Harwood, Lawrence Chillrud, Amith Ananthram, Melanie Subbiah, Kathleen McKeown

    Abstract: We present a new fact-checking benchmark, Check-COVID, that requires systems to verify claims about COVID-19 from news using evidence from scientific articles. This approach to fact-checking is particularly challenging as it requires checking internet text written in everyday language against evidence from journal articles written in formal academic language. Check-COVID contains 1, 504 expert-ann… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted as ACL 2023 Findings

  3. arXiv:2305.17534  [pdf, other

    cs.CL cs.AI cs.LG

    Unsupervised Selective Rationalization with Noise Injection

    Authors: Adam Storek, Melanie Subbiah, Kathleen McKeown

    Abstract: A major issue with using deep learning models in sensitive applications is that they provide no explanation for their output. To address this problem, unsupervised selective rationalization produces rationales alongside predictions by chaining two jointly-trained components, a rationale generator and a predictor. Although this architecture guarantees that the prediction relies solely on the ration… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  4. arXiv:2302.00102  [pdf, other

    cs.CL cs.LG

    Towards Detecting Harmful Agendas in News Articles

    Authors: Melanie Subbiah, Amrita Bhattacharjee, Yilun Hua, Tharindu Kumarage, Huan Liu, Kathleen McKeown

    Abstract: Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread. We argue that while misinformation and disinformation detection have been studied, there has been a lack of investment in the important open challenge of detecting harmful agendas in news articles; identifying harmful agendas is critical to flag news campaigns with the greatest poten… ▽ More

    Submitted 2 August, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Camera-ready for ACL-WASSA 2023. First two authors contributed equally

  5. arXiv:2210.10045  [pdf, other

    cs.CL cs.AI

    SafeText: A Benchmark for Exploring Physical Safety in Language Models

    Authors: Sharon Levy, Emily Allaway, Melanie Subbiah, Lydia Chilton, Desmond Patton, Kathleen McKeown, William Yang Wang

    Abstract: Understanding what constitutes safe text is an important issue in natural language processing and can often prevent the deployment of models deemed harmful and unsafe. One such type of safety that has been scarcely studied is commonsense physical safety, i.e. text that is not explicitly violent and requires additional commonsense knowledge to comprehend that it leads to physical harm. We create th… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  6. arXiv:2210.09306  [pdf, other

    cs.AI cs.CL cs.LG

    Mitigating Covertly Unsafe Text within Natural Language Systems

    Authors: Alex Mei, Anisha Kabir, Sharon Levy, Melanie Subbiah, Emily Allaway, John Judge, Desmond Patton, Bruce Bimber, Kathleen McKeown, William Yang Wang

    Abstract: An increasingly prevalent problem for intelligent technologies is text safety, as uncontrolled systems may generate recommendations to their users that lead to injury or life-threatening consequences. However, the degree of explicitness of a generated statement that can cause physical harm varies. In this paper, we distinguish types of text that can lead to physical harm and establish one particul… ▽ More

    Submitted 20 March, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: In Findings of the 2022 Conference on Empirical Methods in Natural Language Processing

  7. arXiv:2005.14165  [pdf, other

    cs.CL

    Language Models are Few-Shot Learners

    Authors: Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess , et al. (6 additional authors not shown)

    Abstract: Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few… ▽ More

    Submitted 22 July, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 40+32 pages