Skip to main content

Showing 1–12 of 12 results for author: Angluin, D

.
  1. arXiv:2404.02040  [pdf, other

    cs.FL cs.LG

    Transformers as Transducers

    Authors: Lena Strobl, Dana Angluin, David Chiang, Jonathan Rawski, Ashish Sabharwal

    Abstract: We study the sequence-to-sequence map** capacity of transformers by relating them to finite transducers, and find that they can express surprisingly large classes of transductions. We do so using variants of RASP, a programming language designed to help people "think like transformers," as an intermediate representation. We extend the existing Boolean variant B-RASP to sequence-to-sequence funct… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  2. arXiv:2311.00208  [pdf, other

    cs.LG cs.CL cs.FL cs.LO

    What Formal Languages Can Transformers Express? A Survey

    Authors: Lena Strobl, William Merrill, Gail Weiss, David Chiang, Dana Angluin

    Abstract: As transformers have gained prominence in natural language processing, some researchers have investigated theoretically what problems they can and cannot solve, by treating problems as formal languages. Exploring such questions can help clarify the power of transformers relative to other models of computation, their fundamental capabilities and limits, and the impact of architectural choices. Work… ▽ More

    Submitted 6 May, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

  3. arXiv:2310.13897  [pdf, other

    cs.FL cs.LG cs.LO

    Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages

    Authors: Andy Yang, David Chiang, Dana Angluin

    Abstract: The expressive power of transformers over inputs of unbounded size can be studied through their ability to recognize classes of formal languages. We consider transformer encoders with hard attention (in which all attention is focused on exactly one position) and strict future masking (in which each position only attends to positions strictly to its left), and prove that they are equivalent to line… ▽ More

    Submitted 22 May, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  4. arXiv:2209.09336  [pdf, ps, other

    cs.FL

    Constructing Concise Characteristic Samples for Acceptors of Omega Regular Languages

    Authors: Dana Angluin, Dana Fisman

    Abstract: A characteristic sample for a language $L$ and a learning algorithm $\textbf{L}$ is a finite sample of words $T_L$ labeled by their membership in $L$ such that for any sample $T \supseteq T_L$ consistent with $L$, on input $T$ the learning algorithm $\textbf{L}$ returns a hypothesis equivalent to $L$. Which omega automata have characteristic sets of polynomial size, and can these sets be construct… ▽ More

    Submitted 22 April, 2024; v1 submitted 19 September, 2022; originally announced September 2022.

    ACM Class: F.4.3

  5. arXiv:2204.06618  [pdf, ps, other

    cs.CC cs.AI cs.CL cs.FL cs.LG

    Formal Language Recognition by Hard Attention Transformers: Perspectives from Circuit Complexity

    Authors: Yiding Hao, Dana Angluin, Robert Frank

    Abstract: This paper analyzes three formal models of Transformer encoders that differ in the form of their self-attention mechanism: unique hard attention (UHAT); generalized unique hard attention (GUHAT), which generalizes UHAT; and averaging hard attention (AHAT). We show that UHAT and GUHAT Transformers, viewed as string acceptors, can only recognize formal languages in the complexity class AC$^0$, the c… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: To appear in Transactions of the Association for Computational Linguistics

  6. arXiv:2002.03191  [pdf, ps, other

    cs.FL

    Polynomial time algorithms for inclusion and equivalence of deterministic omega acceptors

    Authors: Dana Angluin, Dana Fisman

    Abstract: The class of omega languages recognized by deterministic parity acceptors (DPAs) or deterministic Muller acceptors (DMAs) is exactly the regular omega languages. The inclusion problem is the following: given two acceptors A1 and A2, determine whether the language recognized by A1 is a subset of the language recognized by A2, and if not, return an ultimately periodic omega word accepted by A1 but n… ▽ More

    Submitted 9 May, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

    ACM Class: F.4.3

  7. Regular omega-Languages with an Informative Right Congruence

    Authors: Dana Angluin, Dana Fisman

    Abstract: A regular language is almost fully characterized by its right congruence relation. Indeed, a regular language can always be recognized by a DFA isomorphic to the automaton corresponding to its right congruence, henceforth the Rightcon automaton. The same does not hold for regular omega-languages. The right congruence of a regular omega-language is not informative enough; many regular omega-langua… ▽ More

    Submitted 9 September, 2018; originally announced September 2018.

    Comments: In Proceedings GandALF 2018, arXiv:1809.02416

    Journal ref: EPTCS 277, 2018, pp. 265-279

  8. arXiv:1809.02836  [pdf, other

    cs.NE cs.CL cs.LG

    Context-Free Transductions with Neural Stacks

    Authors: Yiding Hao, William Merrill, Dana Angluin, Robert Frank, Noah Amsel, Andrew Benz, Simon Mendelsohn

    Abstract: This paper analyzes the behavior of stack-augmented recurrent neural network (RNN) models. Due to the architectural similarity between stack RNNs and pushdown transducers, we train stack RNN models on a number of tasks, including string reversal, context-free language modelling, and cumulative XOR evaluation. Examining the behavior of our networks, we show that stack-augmented RNNs can discover in… ▽ More

    Submitted 8 September, 2018; originally announced September 2018.

    Comments: To appear in the proceedings of the Analyzing and Interpreting Neural Networks for NLP workshop at EMNLP 2018

  9. Query learning of derived $ω$-tree languages in polynomial time

    Authors: Dana Angluin, Timos Antonopoulos, Dana Fisman

    Abstract: We present the first polynomial time algorithm to learn nontrivial classes of languages of infinite trees. Specifically, our algorithm uses membership and equivalence queries to learn classes of $ω$-tree languages derived from weak regular $ω$-word languages in polynomial time. The method is a general polynomial time reduction of learning a class of derived $ω$-tree languages to learning the under… ▽ More

    Submitted 26 August, 2019; v1 submitted 13 February, 2018; originally announced February 2018.

    Journal ref: Logical Methods in Computer Science, Volume 15, Issue 3 (August 27, 2019) lmcs:4311

  10. Families of DFAs as Acceptors of $ω$-Regular Languages

    Authors: Dana Angluin, Udi Boker, Dana Fisman

    Abstract: Families of DFAs (FDFAs) provide an alternative formalism for recognizing $ω$-regular languages. The motivation for introducing them was a desired correlation between the automaton states and right congruence relations, in a manner similar to the Myhill-Nerode theorem for regular languages. This correlation is beneficial for learning algorithms, and indeed it was recently shown that $ω$-regular la… ▽ More

    Submitted 13 February, 2018; v1 submitted 24 December, 2016; originally announced December 2016.

    ACM Class: F.1.1; D.2.4

    Journal ref: Logical Methods in Computer Science, Volume 14, Issue 1 (February 14, 2018) lmcs:2624

  11. arXiv:1304.4303  [pdf, other

    cs.DB

    Learning and Verifying Quantified Boolean Queries by Example

    Authors: Azza Abouzied, Dana Angluin, Christos Papadimitriou, Joseph M. Hellerstein, Avi Silberschatz

    Abstract: To help a user specify and verify quantified queries --- a class of database queries known to be very challenging for all but the most expert users --- one can question the user on whether certain data objects are answers or non-answers to her intended query. In this paper, we analyze the number of questions needed to learn or verify qhorn queries, a special class of Boolean quantified queries who… ▽ More

    Submitted 15 April, 2013; originally announced April 2013.

    Comments: Extended Version of PODS 2013 paper

  12. arXiv:cs/0608084  [pdf, ps, other

    cs.CC cs.DC

    The computational power of population protocols

    Authors: Dana Angluin, James Aspnes, David Eisenstat, Eric Ruppert

    Abstract: We consider the model of population protocols introduced by Angluin et al., in which anonymous finite-state agents stably compute a predicate of the multiset of their inputs via two-way interactions in the all-pairs family of communication networks. We prove that all predicates stably computable in this model (and certain generalizations of it) are semilinear, answering a central open question a… ▽ More

    Submitted 21 August, 2006; originally announced August 2006.

    Comments: Combined version of OPODIS 2005 and PODC 2006 papers; submitted to Distributed Computing