Skip to main content

Showing 1–5 of 5 results for author: Rumshisky, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:1910.10488  [pdf, other

    cs.LG cs.CL stat.ML

    Injecting Hierarchy with U-Net Transformers

    Authors: David Donahue, Vladislav Lialin, Anna Rumshisky

    Abstract: The Transformer architecture has become increasingly popular over the past two years, owing to its impressive performance on a number of natural language processing (NLP) tasks. However, all Transformer computations occur at the level of word representations and therefore, it may be argued that Transformer models do not explicitly attempt to learn hierarchical structure which is widely assumed to… ▽ More

    Submitted 1 April, 2021; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: 10 pages

  2. arXiv:1910.10487  [pdf, other

    cs.CL cs.LG stat.ML

    Memory-Augmented Recurrent Networks for Dialogue Coherence

    Authors: David Donahue, Yuanliang Meng, Anna Rumshisky

    Abstract: Recent dialogue approaches operate by reading each word in a conversation history, and aggregating accrued dialogue information into a single state. This fixed-size vector is not expandable and must maintain a consistent format over time. Other recent approaches exploit an attention mechanism to extract useful information from past conversational utterances, but this introduces an increased comput… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: Honors project, 12 pages

  3. arXiv:1908.08593  [pdf, other

    cs.CL cs.LG stat.ML

    Revealing the Dark Secrets of BERT

    Authors: Olga Kovaleva, Alexey Romanov, Anna Rogers, Anna Rumshisky

    Abstract: BERT-based architectures currently give state-of-the-art performance on many NLP tasks, but little is known about the exact mechanisms that contribute to its success. In the current work, we focus on the interpretation of self-attention, which is one of the fundamental underlying components of BERT. Using a subset of GLUE tasks and a set of handcrafted features-of-interest, we propose the methodol… ▽ More

    Submitted 11 September, 2019; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: Accepted to EMNLP 2019

  4. arXiv:1904.05233  [pdf, other

    cs.LG cs.CL stat.ML

    What's in a Name? Reducing Bias in Bios without Access to Protected Attributes

    Authors: Alexey Romanov, Maria De-Arteaga, Hanna Wallach, Jennifer Chayes, Christian Borgs, Alexandra Chouldechova, Sahin Geyik, Krishnaram Kenthapadi, Anna Rumshisky, Adam Tauman Kalai

    Abstract: There is a growing body of work that proposes methods for mitigating bias in machine learning systems. These methods typically rely on access to protected attributes such as race, gender, or age. However, this raises two significant challenges: (1) protected attributes may not be available or it may not be legal to use them, and (2) it is often desirable to simultaneously consider multiple protect… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted at NAACL 2019; Best Thematic Paper

  5. arXiv:1810.06640  [pdf, other

    cs.CL cs.LG stat.ML

    Adversarial Text Generation Without Reinforcement Learning

    Authors: David Donahue, Anna Rumshisky

    Abstract: Generative Adversarial Networks (GANs) have experienced a recent surge in popularity, performing competitively in a variety of tasks, especially in computer vision. However, GAN training has shown limited success in natural language processing. This is largely because sequences of text are discrete, and thus gradients cannot propagate from the discriminator to the generator. Recent solutions use r… ▽ More

    Submitted 1 January, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: Four pages without references. ACL latex style. Four figures