Skip to main content

Showing 1–10 of 10 results for author: Anantha, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.20060  [pdf, other

    cs.CL

    Applying RLAIF for Code Generation with API-usage in Lightweight LLMs

    Authors: Sujan Dutta, Sayantan Mahinder, Raviteja Anantha, Bortik Bandyopadhyay

    Abstract: Reinforcement Learning from AI Feedback (RLAIF) has demonstrated significant potential across various domains, including mitigating harm in LLM outputs, enhancing text summarization, and mathematical reasoning. This paper introduces an RLAIF framework for improving the code generation abilities of lightweight (<1B parameters) LLMs. We specifically focus on code generation tasks that require writin… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2312.10332  [pdf, other

    cs.IR cs.AI cs.LG

    ProTIP: Progressive Tool Retrieval Improves Planning

    Authors: Raviteja Anantha, Bortik Bandyopadhyay, Anirudh Kashi, Sayantan Mahinder, Andrew W Hill, Srinivas Chappidi

    Abstract: Large language models (LLMs) are increasingly employed for complex multi-step planning tasks, where the tool retrieval (TR) step is crucial for achieving successful outcomes. Two prevalent approaches for TR are single-step retrieval, which utilizes the complete query, and sequential retrieval using task decomposition (TD), where a full query is segmented into discrete atomic subtasks. While single… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: preprint version

  3. arXiv:2312.05708  [pdf, other

    cs.IR cs.AI cs.LG

    Context Tuning for Retrieval Augmented Generation

    Authors: Raviteja Anantha, Tharun Bethi, Danil Vodianik, Srinivas Chappidi

    Abstract: Large language models (LLMs) have the remarkable ability to solve new tasks with just a few examples, but they need access to the right tools. Retrieval Augmented Generation (RAG) addresses this problem by retrieving a list of relevant tools for a given task. However, RAG's tool retrieval step requires all the required information to be explicitly present in the query. This is a limitation, as sem… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: preprint version

  4. arXiv:2303.00171  [pdf, other

    cs.LG cs.AI eess.AS

    DTW-SiameseNet: Dynamic Time Warped Siamese Network for Mispronunciation Detection and Correction

    Authors: Raviteja Anantha, Kriti Bhasin, Daniela de la Parra Aguilar, Prabal Vashisht, Becci Williamson, Srinivas Chappidi

    Abstract: Personal Digital Assistants (PDAs) - such as Siri, Alexa and Google Assistant, to name a few - play an increasingly important role to access information and complete tasks spanning multiple domains, and by diverse groups of users. A text-to-speech (TTS) module allows PDAs to interact in a natural, human-like manner, and play a vital role when the interaction involves people with visual impairments… ▽ More

    Submitted 28 February, 2023; originally announced March 2023.

    Comments: Preprint version

  5. arXiv:2108.06329  [pdf, other

    cs.CL cs.LG

    Low-Resource Adaptation of Open-Domain Generative Chatbots

    Authors: Greyson Gerhard-Young, Raviteja Anantha, Srinivas Chappidi, Björn Hoffmeister

    Abstract: Recent work building open-domain chatbots has demonstrated that increasing model size improves performance. On the other hand, latency and connectivity considerations dictate the move of digital assistants on the device. Giving a digital assistant like Siri, Alexa, or Google Assistant the ability to discuss just about anything leads to the need for reducing the chatbot model size such that it fits… ▽ More

    Submitted 8 April, 2022; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: Accepted at ACL DialDoc 2022

  6. arXiv:2010.06835  [pdf, other

    cs.CL

    A Wrong Answer or a Wrong Question? An Intricate Relationship between Question Reformulation and Answer Selection in Conversational Question Answering

    Authors: Svitlana Vakulenko, Shayne Longpre, Zhucheng Tu, Raviteja Anantha

    Abstract: The dependency between an adequate question formulation and correct answer selection is a very intriguing but still underexplored area. In this paper, we show that question rewriting (QR) of the conversational context allows to shed more light on this phenomenon and also use it to evaluate robustness of different answer selection approaches. We introduce a simple framework that enables an automate… ▽ More

    Submitted 3 February, 2022; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: Accepted at the Workshop on Search-Oriented Conversational AI (SCAI) 2020. Code for error analysis: https://github.com/svakulenk0/QRQA. arXiv admin note: text overlap with arXiv:2004.14652

  7. arXiv:2010.04898  [pdf, other

    cs.IR cs.CL

    Open-Domain Question Answering Goes Conversational via Question Rewriting

    Authors: Raviteja Anantha, Svitlana Vakulenko, Zhucheng Tu, Shayne Longpre, Stephen Pulman, Srinivas Chappidi

    Abstract: We introduce a new dataset for Question Rewriting in Conversational Context (QReCC), which contains 14K conversations with 80K question-answer pairs. The task in QReCC is to find answers to conversational questions within a collection of 10M web pages (split into 54M passages). Answers to questions in the same conversation may be distributed across several web pages. QReCC provides annotations tha… ▽ More

    Submitted 14 April, 2021; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: 15 pages, 10 tables, 3 figures, accepted at NAACL 2021

  8. arXiv:2005.01246  [pdf, other

    cs.LG cs.AI stat.ML

    Generalized Reinforcement Meta Learning for Few-Shot Optimization

    Authors: Raviteja Anantha, Stephen Pulman, Srinivas Chappidi

    Abstract: We present a generic and flexible Reinforcement Learning (RL) based meta-learning framework for the problem of few-shot learning. During training, it learns the best optimization algorithm to produce a learner (ranker/classifier, etc) by exploiting stable patterns in loss surfaces. Our method implicitly estimates the gradients of a scaled loss function while retaining the general properties intact… ▽ More

    Submitted 3 May, 2020; originally announced May 2020.

    Comments: 10 pages, 4 figures, 4 tables, 2 algorithms, ICML conference

  9. arXiv:2005.00119  [pdf, other

    cs.LG cs.CL cs.IR

    Learning to Rank Intents in Voice Assistants

    Authors: Raviteja Anantha, Srinivas Chappidi, William Dawoodi

    Abstract: Voice Assistants aim to fulfill user requests by choosing the best intent from multiple options generated by its Automated Speech Recognition and Natural Language Understanding sub-systems. However, voice assistants do not always produce the expected results. This can happen because voice assistants choose from ambiguous intents - user-specific or domain-specific contextual information reduces the… ▽ More

    Submitted 3 May, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: 11 pages, 7 figures, 2 tables, accepted at IWSDS 2020 conference

  10. arXiv:2004.14652  [pdf, other

    cs.IR cs.LG

    Question Rewriting for Conversational Question Answering

    Authors: Svitlana Vakulenko, Shayne Longpre, Zhucheng Tu, Raviteja Anantha

    Abstract: Conversational question answering (QA) requires the ability to correctly interpret a question in the context of previous conversation turns. We address the conversational QA task by decomposing it into question rewriting and question answering subtasks. The question rewriting (QR) subtask is specifically designed to reformulate ambiguous questions, which depend on the conversational context, into… ▽ More

    Submitted 23 October, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: Version accepted to WSDM 2021