Skip to main content

Showing 1–5 of 5 results for author: Ponnusamy, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17844  [pdf, other

    cs.LG

    Mechanistic Design and Scaling of Hybrid Architectures

    Authors: Michael Poli, Armin W Thomas, Eric Nguyen, Pragaash Ponnusamy, Björn Deiseroth, Kristian Kersting, Taiji Suzuki, Brian Hie, Stefano Ermon, Christopher Ré, Ce Zhang, Stefano Massaroli

    Abstract: The development of deep learning architectures is a resource-demanding process, due to a vast design space, long prototy** times, and high compute costs associated with at-scale model training and evaluation. We set out to simplify this process by grounding it in an end-to-end mechanistic architecture design (MAD) pipeline, encompassing small-scale capability unit tests predictive of scaling law… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  2. arXiv:2205.00029  [pdf, other

    cs.CL cs.AI cs.LG

    Self-Aware Feedback-Based Self-Learning in Large-Scale Conversational AI

    Authors: Pragaash Ponnusamy, Clint Solomon Mathialagan, Gustavo Aguilar, Chengyuan Ma, Chenlei Guo

    Abstract: Self-learning paradigms in large-scale conversational AI agents tend to leverage user feedback in bridging between what they say and what they mean. However, such learning, particularly in Markov-based query rewriting systems have far from addressed the impact of these models on future training where successive feedback is inevitably contingent on the rewrite itself, especially in a continually up… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

  3. arXiv:2204.10815  [pdf

    cs.CL cs.AI

    A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning

    Authors: Md Mofijul Islam, Gustavo Aguilar, Pragaash Ponnusamy, Clint Solomon Mathialagan, Chengyuan Ma, Chenlei Guo

    Abstract: Subword tokenization is a commonly used input pre-processing step in most recent NLP models. However, it limits the models' ability to leverage end-to-end task learning. Its frequency-based vocabulary creation compromises tokenization in low-resource languages, leading models to produce suboptimal representations. Additionally, the dependency on a fixed vocabulary limits the subword models' adapta… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Journal ref: ACL 2022 Workshop on Representation Learning for NLP

  4. arXiv:2011.04748  [pdf, other

    cs.AI cs.CL cs.LG

    Personalized Query Rewriting in Conversational AI Agents

    Authors: Alireza Roshan-Ghias, Clint Solomon Mathialagan, Pragaash Ponnusamy, Lambert Mathias, Chenlei Guo

    Abstract: Spoken language understanding (SLU) systems in conversational AI agents often experience errors in the form of misrecognitions by automatic speech recognition (ASR) or semantic gaps in natural language understanding (NLU). These errors easily translate to user frustrations, particularly so in recurrent events e.g. regularly toggling an appliance, calling a frequent contact, etc. In this work, we p… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 5 pages, 3 figures

  5. arXiv:1911.02557  [pdf, other

    cs.LG cs.AI

    Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

    Authors: Pragaash Ponnusamy, Alireza Roshan Ghias, Chenlei Guo, Ruhi Sarikaya

    Abstract: Today, most large-scale conversational AI agents (e.g. Alexa, Siri, or Google Assistant) are built using manually annotated data to train the different components of the system. Typically, the accuracy of the ML models in these components are improved by manually transcribing and annotating data. As the scope of these systems increase to cover more scenarios and domains, manual annotation to impro… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: 8 pages, 2 figures