Skip to main content

Showing 1–5 of 5 results for author: Soulos, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.00751  [pdf, other

    cs.CL cs.LG

    Differentiable Tree Operations Promote Compositional Generalization

    Authors: Paul Soulos, Edward Hu, Kate McCurdy, Yunmo Chen, Roland Fernandez, Paul Smolensky, Jianfeng Gao

    Abstract: In the context of structure-to-structure transformation tasks, learning sequences of discrete symbolic operations poses significant challenges due to their non-differentiability. To facilitate the learning of these symbolic sequences, we introduce a differentiable tree interpreter that compiles high-level symbolic tree operations into subsymbolic matrix operations on tensors. We present a novel Di… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023. Code available at https://github.com/psoulos/dtm

  2. arXiv:2208.06061  [pdf, other

    cs.CL

    Structural Biases for Improving Transformers on Translation into Morphologically Rich Languages

    Authors: Paul Soulos, Sudha Rao, Caitlin Smith, Eric Rosen, Asli Celikyilmaz, R. Thomas McCoy, Yichen Jiang, Coleman Haley, Roland Fernandez, Hamid Palangi, Jianfeng Gao, Paul Smolensky

    Abstract: Machine translation has seen rapid progress with the advent of Transformer-based models. These models have no explicit linguistic structure built into them, yet they may still implicitly learn structured relationships by attending to relevant tokens. We hypothesize that this structural learning could be made more robust by explicitly endowing Transformers with a structural bias, and we investigate… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: Revised edition to 4th Workshop on Technologies for MT of Low Resource Languages

    Journal ref: Proceedings of the 4th Workshop on Technologies for MT of Low Resource Languages (LoResMT2021)

  3. arXiv:2106.01317  [pdf, other

    cs.CL cs.AI cs.LG

    Enriching Transformers with Structured Tensor-Product Representations for Abstractive Summarization

    Authors: Yichen Jiang, Asli Celikyilmaz, Paul Smolensky, Paul Soulos, Sudha Rao, Hamid Palangi, Roland Fernandez, Caitlin Smith, Mohit Bansal, Jianfeng Gao

    Abstract: Abstractive summarization, the task of generating a concise summary of input documents, requires: (1) reasoning over the source document to determine the salient pieces of information scattered across the long document, and (2) composing a cohesive text by reconstructing these salient facts into a shorter summary that faithfully reflects the complex relations connecting these facts. In this paper,… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: NAACL 2021 (14 pages)

  4. Discovering the Compositional Structure of Vector Representations with Role Learning Networks

    Authors: Paul Soulos, Tom McCoy, Tal Linzen, Paul Smolensky

    Abstract: How can neural networks perform so well on compositional tasks even though they lack explicit compositional representations? We use a novel analysis technique called ROLE to show that recurrent neural networks perform well on such tasks by converging to solutions which implicitly represent symbolic structure. This method uncovers a symbolic structure which, when properly embedded in vector space,… ▽ More

    Submitted 16 November, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

  5. arXiv:1805.07647  [pdf, other

    cs.CV

    Learning Hierarchical Visual Representations in Deep Neural Networks Using Hierarchical Linguistic Labels

    Authors: Joshua C. Peterson, Paul Soulos, Aida Nematzadeh, Thomas L. Griffiths

    Abstract: Modern convolutional neural networks (CNNs) are able to achieve human-level object classification accuracy on specific tasks, and currently outperform competing models in explaining complex human visual representations. However, the categorization problem is posed differently for these networks than for humans: the accuracy of these networks is evaluated by their ability to identify single labels… ▽ More

    Submitted 19 May, 2018; originally announced May 2018.

    Comments: 6 pages, 4 figures, 1 table. Accepted as a paper to the 40th Annual Meeting of the Cognitive Science Society (CogSci 2018)