Skip to main content

Showing 1–7 of 7 results for author: Meade, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16020  [pdf, other

    cs.CL

    Universal Adversarial Triggers Are Not Universal

    Authors: Nicholas Meade, Arkil Patel, Siva Reddy

    Abstract: Recent work has developed optimization procedures to find token sequences, called adversarial triggers, which can elicit unsafe responses from aligned language models. These triggers are believed to be universally transferable, i.e., a trigger optimized on one model can jailbreak other models. In this paper, we concretely show that such adversarial triggers are not universal. We extensively invest… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  2. arXiv:2307.16877  [pdf, other

    cs.CL cs.AI

    Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering

    Authors: Vaibhav Adlakha, Parishad BehnamGhader, Xing Han Lu, Nicholas Meade, Siva Reddy

    Abstract: Retriever-augmented instruction-following models are attractive alternatives to fine-tuned approaches for information-seeking tasks such as question answering (QA). By simply prepending retrieved documents in its input along with an instruction, these models can be adapted to various information domains and tasks without additional fine-tuning. While the model responses tend to be natural and flue… ▽ More

    Submitted 17 April, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: accepted at TACL

  3. arXiv:2305.06161  [pdf, other

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu , et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle… ▽ More

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  4. arXiv:2302.00871  [pdf, other

    cs.CL

    Using In-Context Learning to Improve Dialogue Safety

    Authors: Nicholas Meade, Spandana Gella, Devamanyu Hazarika, Prakhar Gupta, Di **, Siva Reddy, Yang Liu, Dilek Hakkani-Tür

    Abstract: While large neural-based conversational models have become increasingly proficient dialogue agents, recent work has highlighted safety issues with these systems. For example, these systems can be goaded into generating toxic content, which often perpetuates social biases or stereotypes. We investigate a retrieval-based method for reducing bias and toxicity in responses from chatbots. It uses in-co… ▽ More

    Submitted 22 October, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Findings of EMNLP 2023

  5. arXiv:2110.08527  [pdf, other

    cs.CL cs.LG

    An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models

    Authors: Nicholas Meade, Elinor Poole-Dayan, Siva Reddy

    Abstract: Recent work has shown pre-trained language models capture social biases from the large amounts of text they are trained on. This has attracted attention to develo** techniques that mitigate such biases. In this work, we perform an empirical survey of five recently proposed bias mitigation techniques: Counterfactual Data Augmentation (CDA), Dropout, Iterative Nullspace Projection, Self-Debias, an… ▽ More

    Submitted 2 April, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: ACL 2022

  6. arXiv:2110.08412  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining

    Authors: Andreas Madsen, Nicholas Meade, Vaibhav Adlakha, Siva Reddy

    Abstract: To explain NLP models a popular approach is to use importance measures, such as attention, which inform input tokens are important for making a prediction. However, an open question is how well these explanations accurately reflect a model's logic, a property called faithfulness. To answer this question, we propose Recursive ROAR, a new faithfulness metric. This works by recursively masking alle… ▽ More

    Submitted 31 October, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  7. arXiv:1907.04352  [pdf, other

    cs.SD cs.LG eess.AS

    Exploring Conditioning for Generative Music Systems with Human-Interpretable Controls

    Authors: Nicholas Meade, Nicholas Barreyre, Scott C. Lowe, Sageev Oore

    Abstract: Performance RNN is a machine-learning system designed primarily for the generation of solo piano performances using an event-based (rather than audio) representation. More specifically, Performance RNN is a long short-term memory (LSTM) based recurrent neural network that models polyphonic music with expressive timing and dynamics (Oore et al., 2018). The neural network uses a simple language mode… ▽ More

    Submitted 3 August, 2019; v1 submitted 9 July, 2019; originally announced July 2019.

    Journal ref: International Conference on Computational Creativity, 2019