Showing 1–8 of 8 results for author: Nandwani, Y

  1. arXiv:2403.04890

    cs.CL

    Few shot chain-of-thought driven reasoning to prompt LLMs for open ended medical question answering

    Authors: Ojas Gramopadhye, Saeel Sandeep Nachane, Prateek Chanda, Ganesh Ramakrishnan, Kshitij Sharad Jadhav, Yatin Nandwani, Dinesh Raghu, Sachindra Joshi

    Abstract: Large Language models (LLMs) have demonstrated significant potential in transforming healthcare by automating tasks such as clinical documentation, information retrieval, and decision support. In this aspect, carefully engineered prompts have emerged as a powerful tool for using LLMs for medical scenarios, e.g., patient clinical scenarios. In this paper, we propose a modified version of the MedQA-…

    Submitted 7 March, 2024; originally announced March 2024.

  2. arXiv:2402.02479

    cs.LG cs.AI cs.CL cs.HC

    BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback

    Authors: Gaurav Pandey, Yatin Nandwani, Tahira Naseem, Mayank Mishra, Guangxuan Xu, Dinesh Raghu, Sachindra Joshi, Asim Munawar, Ramón Fernandez Astudillo

    Abstract: Distribution matching methods for language model alignment such as Generation with Distributional Control (GDC) and Distributional Policy Gradient (DPG) have not received the same level of attention in reinforcement learning from human feedback (RLHF) as contrastive methods such as Sequence Likelihood Calibration (SLiC), Direct Preference Optimization (DPO) and its variants. We identify high varia…

    Submitted 10 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024 (main conference)

  3. arXiv:2305.12191

    cs.CL

    Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs

    Authors: Yatin Nandwani, Vineet Kumar, Dinesh Raghu, Sachindra Joshi, Luis A. Lastras

    Abstract: A major concern in using deep learning based generative models for document-grounded dialogs is the potential generation of responses that are not faithful to the underlying document. Existing automated metrics used for evaluating the faithfulness of response with respect to the grounding document measure the degree of similarity between the generated response and the document's content.…

    Submitted 1 December, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  4. arXiv:2210.09082

    cs.LG

    A Solver-Free Framework for Scalable Learning in Neural ILP Architectures

    Authors: Yatin Nandwani, Rishabh Ranjan, Mausam, Parag Singla

    Abstract: There is a recent focus on designing architectures that have an Integer Linear Programming (ILP) layer within a neural model (referred to as Neural ILP in this paper). Neural ILP architectures are suitable for pure reasoning tasks that require data-driven constraint learning or for tasks requiring both perception (neural) and reasoning (ILP). A recent SOTA approach for end-to-end training of Neura…

    Submitted 13 January, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  5. arXiv:2202.12273

    cs.AI cs.IR

    Matching Papers and Reviewers at Large Conferences

    Authors: Kevin Leyton-Brown, Mausam, Yatin Nandwani, Hedayat Zarkoob, Chris Cameron, Neil Newman, Dinesh Raghu

    Abstract: Peer-reviewed conferences, the main publication venues in CS, rely critically on matching highly qualified reviewers for each paper. Because of the growing scale of these conferences, the tight timelines on which they operate, and a recent surge in explicitly dishonest behavior, there is now no alternative to performing this matching in an automated way. This paper studies a novel reviewer-paper m…

    Submitted 5 August, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

  6. arXiv:2202.03229

    cs.LG cs.AI

    Neural Models for Output-Space Invariance in Combinatorial Problems

    Authors: Yatin Nandwani, Vidit Jain, Mausam, Parag Singla

    Abstract: Recently many neural models have been proposed to solve combinatorial puzzles by implicitly learning underlying constraints using their solved instances, such as sudoku or graph coloring (GCP). One drawback of the proposed architectures, which are often based on Graph Neural Networks (GNN), is that they cannot generalize across the size of the output space from which variables are assigned a value…

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: Published as a conference paper at ICLR 2022

  7. arXiv:2104.08741

    cs.CL

    CEAR: Cross-Entity Aware Reranker for Knowledge Base Completion

    Authors: Keshav Kolluru, Mayank Singh Chauhan, Yatin Nandwani, Parag Singla, Mausam

    Abstract: Pre-trained language models (LMs) like BERT have been shown to store factual knowledge about the world. This knowledge can be used to augment the information present in Knowledge Bases, which tend to be incomplete. However, prior attempts at using BERT for the task of Knowledge Base Completion (KBC) resulted in performance worse than embedding based techniques that rely only on the graph structure. In this…

    Submitted 27 January, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: We found a bug in the code that invalidates the reported results for FB15k-237 and WN18RR. The results for OLPBench still hold. We are in the process of updating the paper.

  8. arXiv:2008.11990

    cs.LG stat.ML

    Neural Learning of One-of-Many Solutions for Combinatorial Problems in Structured Output Spaces

    Authors: Yatin Nandwani, Deepanshu Jindal, Mausam, Parag Singla

    Abstract: Recent research has proposed neural architectures for solving combinatorial problems in structured output spaces. In many such problems, there may exist multiple solutions for a given input, e.g. a partially filled Sudoku puzzle may have many completions satisfying all constraints. Further, we are often interested in finding any one of the possible solutions, without any preference between them. E…

    Submitted 4 April, 2021; v1 submitted 27 August, 2020; originally announced August 2020.