Skip to main content

Showing 1–4 of 4 results for author: Ndirango, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00057  [pdf, other

    cs.CL cs.LG

    Toward Conversational Agents with Context and Time Sensitive Long-term Memory

    Authors: Nick Alonso, Tomás Figliolia, Anthony Ndirango, Beren Millidge

    Abstract: There has recently been growing interest in conversational agents with long-term memory which has led to the rapid development of language models that use retrieval-augmented generation (RAG). Until recently, most work on RAG has focused on information retrieval from large databases of texts, like Wikipedia, rather than information from long-form conversations. In this paper, we argue that effecti… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 May, 2024; originally announced June 2024.

  2. arXiv:2108.12001  [pdf, other

    cs.LG cs.AI

    Understanding the Logit Distributions of Adversarially-Trained Deep Neural Networks

    Authors: Landan Seguin, Anthony Ndirango, Neeli Mishra, SueYeon Chung, Tyler Lee

    Abstract: Adversarial defenses train deep neural networks to be invariant to the input perturbations from adversarial attacks. Almost all defense strategies achieve this invariance through adversarial training i.e. training on inputs with adversarial perturbations. Although adversarial training is successful at mitigating adversarial attacks, the behavioral differences between adversarially-trained (AT) mod… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 29 pages (13 main, 16 supplemental), 22 figures (5 main, 17 supplemental)

  3. arXiv:1910.13593  [pdf, other

    cs.LG stat.ML

    Generalization in multitask deep neural classifiers: a statistical physics approach

    Authors: Tyler Lee, Anthony Ndirango

    Abstract: A proper understanding of the striking generalization abilities of deep neural networks presents an enduring puzzle. Recently, there has been a growing body of numerically-grounded theoretical work that has contributed important insights to the theory of learning in deep neural nets. There has also been a recent interest in extending these analyses to understanding how multitask learning can furth… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted to NeurIPS 2019

  4. arXiv:1910.12587  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Label-efficient audio classification through multitask learning and self-supervision

    Authors: Tyler Lee, Ting Gong, Suchismita Padhy, Andrew Rouditchenko, Anthony Ndirango

    Abstract: While deep learning has been incredibly successful in modeling tasks with large, carefully curated labeled datasets, its application to problems with limited labeled data remains a challenge. The aim of the present work is to improve the label efficiency of large neural networks operating on audio data through a combination of multitask learning and self-supervised learning on unlabeled data. We t… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: Presented at ICLR 2019 Limited Labeled Data (LLD) Workshop