Showing 1–13 of 13 results for author: Downey, C

Searching in archive cs.
  1. arXiv:2405.12413

    cs.CL

    Targeted Multilingual Adaptation for Low-resource Language Families

    Authors: C. M. Downey, Terra Blevins, Dhwani Serai, Dwija Parikh, Shane Steinert-Threlkeld

    Abstract: The "massively-multilingual" training of multilingual models is known to limit their utility in any one language, and they perform particularly poorly on low-resource languages. However, there is evidence that low-resource languages can benefit from targeted multilinguality, where the model is trained on closely related languages. To test this approach more rigorously, we systematically study best…

    Submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2309.04679

    cs.CL

    Embedding structure matters: Comparing methods to adapt multilingual vocabularies to new languages

    Authors: C. M. Downey, Terra Blevins, Nora Goldfine, Shane Steinert-Threlkeld

    Abstract: Pre-trained multilingual language models underpin a large portion of modern NLP tools outside of English. A strong baseline for specializing these models for specific languages is Language-Adaptive Pre-Training (LAPT). However, retaining a large cross-lingual vocabulary and embedding matrix comes at considerable excess computational cost during adaptation. In this study, we propose several simple…

    Submitted 26 October, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Camera-ready for Proceedings of the 3rd Workshop on Multilingual Representation Learning

  3. arXiv:2212.08619

    cs.CL cs.CR

    Planting and Mitigating Memorized Content in Predictive-Text Language Models

    Authors: C. M. Downey, Wei Dai, Huseyin A. Inan, Kim Laine, Saurabh Naik, Tomasz Religa

    Abstract: Language models are widely deployed to provide automatic text completion services in user products. However, recent research has revealed that language models (especially large ones) bear considerable risk of memorizing private training data, which is then vulnerable to leakage and extraction by adversaries. In this study, we test the efficacy of a range of privacy-preserving techniques to mitigat…

    Submitted 16 December, 2022; originally announced December 2022.

  4. arXiv:2207.07025

    cs.CL cs.AI

    Learning to translate by learning to communicate

    Authors: C. M. Downey, Xuhui Zhou, Leo Z. Liu, Shane Steinert-Threlkeld

    Abstract: We formulate and test a technique to use Emergent Communication (EC) with a pre-trained multilingual model to improve on modern Unsupervised NMT systems, especially for low-resource languages. It has been argued that the current dominant paradigm in NLP of pre-training on text-only corpora will not yield robust natural language understanding systems, and the need for grounded, goal-oriented, and i…

    Submitted 19 October, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Camera-ready for 3rd Multilingual Representation Learning Workshop (MRL 2023)

  5. arXiv:2206.04176

    cs.CV cs.LG cs.RO

    VN-Transformer: Rotation-Equivariant Attention for Vector Neurons

    Authors: Serge Assaad, Carlton Downey, Rami Al-Rfou, Nigamaa Nayakanti, Ben Sapp

    Abstract: Rotation equivariance is a desirable property in many practical applications such as motion forecasting and 3D perception, where it can offer benefits like sample efficiency, better generalization, and robustness to input perturbations. Vector Neurons (VN) is a recently developed framework offering a simple yet effective approach for deriving rotation-equivariant analogs of standard machine learni…

    Submitted 24 January, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR), 2023; Previous version appeared in Workshop on Machine Learning for Autonomous Driving, Conference on Neural Information Processing Systems (NeurIPS), 2022

  6. arXiv:2110.08415

    cs.CL

    Multilingual unsupervised sequence segmentation transfers to extremely low-resource languages

    Authors: C. M. Downey, Shannon Drizin, Levon Haroutunian, Shivin Thukral

    Abstract: We show that unsupervised sequence-segmentation performance can be transferred to extremely low-resource languages by pre-training a Masked Segmental Language Model (Downey et al., 2021) multilingually. Further, we show that this transfer can be achieved by training over a collection of low-resource languages that are typologically similar (but phylogenetically unrelated) to the target language. I…

    Submitted 14 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ACL 2022 camera-ready

  7. arXiv:2104.09959

    cs.RO

    Identifying Driver Interactions via Conditional Behavior Prediction

    Authors: Ekaterina Tolstaya, Reza Mahjourian, Carlton Downey, Balakrishnan Varadarajan, Benjamin Sapp, Dragomir Anguelov

    Abstract: Interactive driving scenarios, such as lane changes, merges and unprotected turns, are some of the most challenging situations for autonomous driving. Planning in interactive scenarios requires accurately modeling the reactions of other agents to different future actions of the ego agent. We develop end-to-end models for conditional behavior prediction (CBP) that take as an input a query future tr…

    Submitted 1 June, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

  8. arXiv:2104.07829

    cs.CL

    A Masked Segmental Language Model for Unsupervised Natural Language Segmentation

    Authors: C. M. Downey, Fei Xia, Gina-Anne Levow, Shane Steinert-Threlkeld

    Abstract: Segmentation remains an important preprocessing step both in languages where "words" or other important syntactic/semantic units (like morphemes) are not clearly delineated by white space, as well as when dealing with continuous speech data, where there is often no meaningful pause between words. Near-perfect supervised methods have been developed for use in resource-rich languages such as Chinese…

    Submitted 3 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

  9. arXiv:1810.12369

    stat.ML cs.LG quant-ph

    Learning and Inference in Hilbert Space with Quantum Graphical Models

    Authors: Siddarth Srinivasan, Carlton Downey, Byron Boots

    Abstract: Quantum Graphical Models (QGMs) generalize classical graphical models by adopting the formalism for reasoning about uncertainty from quantum mechanics. Unlike classical graphical models, QGMs represent uncertainty with density matrices in complex Hilbert spaces. Hilbert space embeddings (HSEs) also generalize Bayesian inference in Hilbert spaces. We investigate the link between QGMs and HSEs and s…

    Submitted 29 October, 2018; originally announced October 2018.

    Comments: 13 pages total, 9 pages content, 3 pages appendix; NIPS 2018

  10. arXiv:1801.10123

    stat.ML cs.LG

    Links: A High-Dimensional Online Clustering Method

    Authors: Philip Andrew Mansfield, Quan Wang, Carlton Downey, Li Wan, Ignacio Lopez Moreno

    Abstract: We present a novel algorithm, called Links, designed to perform online clustering on unit vectors in a high-dimensional Euclidean space. The algorithm is appropriate when it is necessary to cluster data efficiently as it streams in, and is to be contrasted with traditional batch clustering algorithms that have access to all data at once. For example, Links has been successfully applied to embeddin…

    Submitted 30 January, 2018; originally announced January 2018.

  11. arXiv:1710.10468

    eess.AS cs.LG cs.SD stat.ML

    Speaker Diarization with LSTM

    Authors: Quan Wang, Carlton Downey, Li Wan, Philip Andrew Mansfield, Ignacio Lopez Moreno

    Abstract: For many years, i-vector based audio embedding techniques were the dominant approach for speaker verification and speaker diarization applications. However, mirroring the rise of deep learning in various domains, neural network based audio embeddings, also known as d-vectors, have consistently demonstrated superior speaker verification performance. In this paper, we build on the success of d-vecto…

    Submitted 23 January, 2022; v1 submitted 28 October, 2017; originally announced October 2017.

    Comments: Published at ICASSP 2018

  12. arXiv:1702.04121

    stat.ML cs.LG

    Practical Learning of Predictive State Representations

    Authors: Carlton Downey, Ahmed Hefny, Geoffrey Gordon

    Abstract: Over the past decade there has been considerable interest in spectral algorithms for learning Predictive State Representations (PSRs). Spectral algorithms have appealing theoretical guarantees; however, the resulting models do not always perform well on inference tasks in practice. One reason for this behavior is the mismatch between the intended task (accurate filtering or prediction) and the los…

    Submitted 14 February, 2017; originally announced February 2017.

  13. arXiv:1505.05310

    stat.ML cs.LG

    Supervised Learning for Dynamical System Learning

    Authors: Ahmed Hefny, Carlton Downey, Geoffrey Gordon

    Abstract: Recently there has been substantial interest in spectral methods for learning dynamical systems. These methods are popular since they often offer a good tradeoff between computational and statistical efficiency. Unfortunately, they can be difficult to use and extend in practice: e.g., they can make it difficult to incorporate prior information such as sparsity or structure. To address this problem…

    Submitted 4 November, 2015; v1 submitted 20 May, 2015; originally announced May 2015.