Skip to main content

Showing 1–10 of 10 results for author: Kant, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.13313  [pdf, other

    cs.AI cs.CL

    Polaris: A Safety-focused LLM Constellation Architecture for Healthcare

    Authors: Subhabrata Mukherjee, Paul Gamble, Markel Sanz Ausin, Neel Kant, Kriti Aggarwal, Neha Manjunath, Debajyoti Datta, Zhengliang Liu, Jiayuan Ding, Sophia Busacca, Cezanne Bianco, Swapnil Sharma, Rae Lasko, Michelle Voisard, Sanchay Harneja, Darya Filippova, Gerry Meixiong, Kevin Cha, Amir Youssefi, Meyhaa Buvanesh, Howard Weingram, Sebastian Bierman-Lytle, Harpreet Singh Mangat, Kim Parikh, Saad Godil , et al. (1 additional authors not shown)

    Abstract: We develop Polaris, the first safety-focused LLM constellation for real-time patient-AI healthcare conversations. Unlike prior LLM works in healthcare focusing on tasks like question answering, our work specifically focuses on long multi-turn voice conversations. Our one-trillion parameter constellation system is composed of several multibillion parameter LLMs as co-operative agents: a stateful pr… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2311.09528  [pdf, other

    cs.CL cs.AI cs.LG

    HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

    Authors: Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant, Aidan Swope, Oleksii Kuchaiev

    Abstract: Existing open-source helpfulness preference datasets do not specify what makes some responses more helpful and others less so. Models trained on these datasets can incidentally learn to model dataset artifacts (e.g. preferring longer but unhelpful responses only due to their length). To alleviate this problem, we collect HelpSteer, a multi-attribute helpfulness dataset annotated for the various as… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  3. arXiv:2210.14767  [pdf, ps, other

    eess.SY cs.RO

    Stabilization of Energy-Conserving Gaits for Point-Foot Planar Bipeds

    Authors: Aakash Khandelwal, Nilay Kant, Ranjan Mukherjee

    Abstract: The problem of designing and stabilizing impact-free, energy-conserving gaits is considered for underactuated, point-foot planar bipeds. Virtual holonomic constraints are used to design energy-conserving gaits. A desired gait corresponds to a periodic hybrid orbit and is stabilized using the Impulse Controlled Poincaré Map approach. Numerical simulations for the case of a five-link biped demonstra… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 6 pages, 6 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  4. PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning

    Authors: Rajarshi Roy, Jonathan Raiman, Neel Kant, Ilyas Elkin, Robert Kirby, Michael Siu, Stuart Oberman, Saad Godil, Bryan Catanzaro

    Abstract: In this work, we present a reinforcement learning (RL) based approach to designing parallel prefix circuits such as adders or priority encoders that are fundamental to high-performance digital design. Unlike prior methods, our approach designs solutions tabula rasa purely through learning with synthesis in the loop. We design a grid-based state-action representation and an RL environment for const… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Copyright 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: ACM/IEEE Design Automation Conference (DAC), 2021, pp. 853-858

  5. arXiv:2202.05819  [pdf, ps, other

    eess.SY cs.RO

    Nonprehensile Manipulation of a Stick Using Impulsive Forces

    Authors: Aakash Khandelwal, Nilay Kant, Ranjan Mukherjee

    Abstract: The problem of nonprehensile manipulation of a stick in three-dimensional space using intermittent impulsive forces is considered. The objective is to juggle the stick between a sequence of configurations that are rotationally symmetric about the vertical axis. The dynamics of the stick is described by five generalized coordinates and three control inputs. Between two consecutive configurations wh… ▽ More

    Submitted 6 July, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: This work has been submitted for possible publication. This version submitted to Nonlinear Dynamics on 28 Jun 2022

  6. arXiv:2101.00408  [pdf, other

    cs.CL cs.AI

    End-to-End Training of Neural Retrievers for Open-Domain Question Answering

    Authors: Devendra Singh Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei **, William L Hamilton, Bryan Catanzaro

    Abstract: Recent work on training neural retrievers for open-domain question answering (OpenQA) has employed both supervised and unsupervised approaches. However, it remains unclear how unsupervised and supervised methods can be used most effectively for neural retrievers. In this work, we systematically study retriever pre-training. We first propose an approach of unsupervised pre-training with the Inverse… ▽ More

    Submitted 1 June, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: ACL 2021

  7. arXiv:1912.12345  [pdf, other

    cs.LG cs.AI cs.PL stat.ML

    Synthetic Datasets for Neural Program Synthesis

    Authors: Richard Shin, Neel Kant, Kavi Gupta, Christopher Bender, Brandon Trabucco, Rishabh Singh, Dawn Song

    Abstract: The goal of program synthesis is to automatically generate programs in a particular language from corresponding specifications, e.g. input-output behavior. Many current approaches achieve impressive results after training on randomly generated I/O examples in limited domain-specific languages (DSLs), as with string transformations in RobustFill. However, we empirically discover that applying test… ▽ More

    Submitted 27 December, 2019; originally announced December 2019.

    Comments: ICLR 2019

  8. arXiv:1905.10615  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Adversarial Policies: Attacking Deep Reinforcement Learning

    Authors: Adam Gleave, Michael Dennis, Cody Wild, Neel Kant, Sergey Levine, Stuart Russell

    Abstract: Deep reinforcement learning (RL) policies are known to be vulnerable to adversarial perturbations to their observations, similar to adversarial examples for classifiers. However, an attacker is not usually able to directly modify another agent's observations. This might lead one to wonder: is it possible to attack an RL agent simply by choosing an adversarial policy acting in a multi-agent environ… ▽ More

    Submitted 17 January, 2021; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: Presented at ICLR 2020

    ACM Class: I.2.6

  9. arXiv:1812.01207  [pdf, other

    cs.CL

    Practical Text Classification With Large Pre-Trained Language Models

    Authors: Neel Kant, Raul Puri, Nikolai Yakovenko, Bryan Catanzaro

    Abstract: Multi-emotion sentiment classification is a natural language processing (NLP) problem with valuable use cases on real-world data. We demonstrate that large-scale unsupervised language modeling combined with finetuning offers a practical solution to this task on difficult datasets, including those with label class imbalance and domain-specific context. By training an attention-based Transformer net… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: 8 pages, submitted to AAAI 2019

  10. arXiv:1802.02353  [pdf, other

    cs.AI cs.PL

    Recent Advances in Neural Program Synthesis

    Authors: Neel Kant

    Abstract: In recent years, deep learning has made tremendous progress in a number of fields that were previously out of reach for artificial intelligence. The successes in these problems has led researchers to consider the possibilities for intelligent systems to tackle a problem that humans have only recently themselves considered: program synthesis. This challenge is unlike others such as object recogniti… ▽ More

    Submitted 7 February, 2018; originally announced February 2018.

    Comments: 16 pages (without citations); Literature Review