Showing 1–3 of 3 results for author: Mahinder, S

Searching in archive cs.
  1. arXiv:2406.20060  [pdf, other]

    cs.CL

    Applying RLAIF for Code Generation with API-usage in Lightweight LLMs

    Authors: Sujan Dutta, Sayantan Mahinder, Raviteja Anantha, Bortik Bandyopadhyay

    Abstract: Reinforcement Learning from AI Feedback (RLAIF) has demonstrated significant potential across various domains, including mitigating harm in LLM outputs, enhancing text summarization, and mathematical reasoning. This paper introduces an RLAIF framework for improving the code generation abilities of lightweight (<1B parameters) LLMs. We specifically focus on code generation tasks that require writin…

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2312.10332  [pdf, other]

    cs.IR cs.AI cs.LG

    ProTIP: Progressive Tool Retrieval Improves Planning

    Authors: Raviteja Anantha, Bortik Bandyopadhyay, Anirudh Kashi, Sayantan Mahinder, Andrew W Hill, Srinivas Chappidi

    Abstract: Large language models (LLMs) are increasingly employed for complex multi-step planning tasks, where the tool retrieval (TR) step is crucial for achieving successful outcomes. Two prevalent approaches for TR are single-step retrieval, which utilizes the complete query, and sequential retrieval using task decomposition (TD), where a full query is segmented into discrete atomic subtasks. While single…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: preprint version

  3. arXiv:2001.01697  [pdf, other]

    cs.CY cs.LG

    Social Media Attributions in the Context of Water Crisis

    Authors: Rupak Sarkar, Hirak Sarkar, Sayantan Mahinder, Ashiqur R. KhudaBukhsh

    Abstract: Attribution of natural disasters/collective misfortune is a widely-studied political science problem. However, such studies are typically survey-centric or rely on a handful of experts to weigh in on the matter. In this paper, we explore how can we use social media data and an AI-driven approach to complement traditional surveys and automatically extract attribution factors. We focus on the most-r…

    Submitted 6 January, 2020; originally announced January 2020.