Skip to main content

Showing 1–6 of 6 results for author: Ashok, D

.
  1. arXiv:2405.01490  [pdf, other

    cs.CL cs.AI

    Controllable Text Generation in the Instruction-Tuning Era

    Authors: Dhananjay Ashok, Barnabas Poczos

    Abstract: While most research on controllable text generation has focused on steering base Language Models, the emerging instruction-tuning and prompting paradigm offers an alternate approach to controllability. We compile and release ConGenBench, a testbed of 17 different controllable generation tasks, using a subset of it to benchmark the performance of 9 different baselines and methods on Instruction-tun… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2307.15475  [pdf, other

    cs.HC cs.AI cs.LG

    FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning Pipelines

    Authors: Matthew Barker, Emma Kallina, Dhananjay Ashok, Katherine M. Collins, Ashley Casovan, Adrian Weller, Ameet Talwalkar, Valerie Chen, Umang Bhatt

    Abstract: Even though machine learning (ML) pipelines affect an increasing array of stakeholders, there is little work on how input from stakeholders is recorded and incorporated. We propose FeedbackLogs, addenda to existing documentation of ML pipelines, to track the input of multiple stakeholders. Each log records important details about the feedback collection process, the feedback itself, and how the fe… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  3. arXiv:2305.15444  [pdf, other

    cs.CL cs.AI cs.LG

    PromptNER: Prompting For Named Entity Recognition

    Authors: Dhananjay Ashok, Zachary C. Lipton

    Abstract: In a surprising turn, Large Language Models (LLMs) together with a growing arsenal of prompt-based heuristics now offer powerful off-the-shelf approaches providing few-shot solutions to myriad classic NLP problems. However, despite promising early results, these LLM-based few-shot methods remain far from the state of the art in Named Entity Recognition (NER), where prevailing methods include learn… ▽ More

    Submitted 20 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  4. arXiv:2305.14707  [pdf, other

    cs.CL cs.AI cs.LG

    SciFix: Outperforming GPT3 on Scientific Factual Error Correction

    Authors: Dhananjay Ashok, Atharva Kulkarni, Hai Pham, Barnabás Póczos

    Abstract: Due to the prohibitively high cost of creating error correction datasets, most Factual Claim Correction methods rely on a powerful verification model to guide the correction process. This leads to a significant drop in performance in domains like scientific claims, where good verification models do not always exist. In this work, we introduce SciFix, a scientific claim correction system that does… ▽ More

    Submitted 12 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: To appear in proceedings of EMNLP2023 (findings)

  5. arXiv:2207.03264  [pdf, other

    cs.LG cs.LO

    A Solver + Gradient Descent Training Algorithm for Deep Neural Networks

    Authors: Dhananjay Ashok, Vineel Nagisetty, Christopher Srinivasa, Vijay Ganesh

    Abstract: We present a novel hybrid algorithm for training Deep Neural Networks that combines the state-of-the-art Gradient Descent (GD) method with a Mixed Integer Linear Programming (MILP) solver, outperforming GD and variants in terms of accuracy, as well as resource and data efficiency for both regression and classification tasks. Our GD+Solver hybrid algorithm, called GDSolver, works as follows: given… ▽ More

    Submitted 25 July, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

  6. arXiv:2010.11328  [pdf, other

    cs.NE cs.AI cs.LG cs.SC

    Logic Guided Genetic Algorithms

    Authors: Dhananjay Ashok, Joseph Scott, Sebastian Wetzel, Maysum Panju, Vijay Ganesh

    Abstract: We present a novel Auxiliary Truth enhanced Genetic Algorithm (GA) that uses logical or mathematical constraints as a means of data augmentation as well as to compute loss (in conjunction with the traditional MSE), with the aim of increasing both data efficiency and accuracy of symbolic regression (SR) algorithms. Our method, logic-guided genetic algorithm (LGGA), takes as input a set of labelled… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.