Skip to main content

Showing 1–28 of 28 results for author: Ramachandran, D

.
  1. arXiv:2406.16807  [pdf, other

    cs.LG cs.CL cs.CV

    Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

    Authors: Katherine M. Collins, Najoung Kim, Yonatan Bitton, Verena Rieser, Shayegan Omidshafiei, Yushi Hu, Sherol Chen, Senjuti Dutta, Minsuk Chang, Kimin Lee, Youwei Liang, Georgina Evans, Sahil Singla, Gang Li, Adrian Weller, Junfeng He, Deepak Ramachandran, Krishnamurthy Dj Dvijotham

    Abstract: Human feedback plays a critical role in learning and refining reward models for text-to-image generation, but the optimal form the feedback should take for learning an accurate reward function has not been conclusively established. This paper investigates the effectiveness of fine-grained feedback which captures nuanced distinctions in image quality and prompt-alignment, compared to traditional co… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.09574  [pdf, other

    cs.LG

    Online Bandit Learning with Offline Preference Data

    Authors: Akhil Agnihotri, Rahul Jain, Deepak Ramachandran, Zheng Wen

    Abstract: Reinforcement Learning with Human Feedback (RLHF) is at the core of fine-tuning methods for generative AI models for language and images. Such feedback is often sought as rank or preference feedback from human raters, as opposed to eliciting scores since the latter tends to be very noisy. On the other hand, RL theory and algorithms predominantly assume that a reward feedback is available. In parti… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.09563  [pdf, other

    cs.LG

    e-COP : Episodic Constrained Optimization of Policies

    Authors: Akhil Agnihotri, Rahul Jain, Deepak Ramachandran, Sahil Singla

    Abstract: In this paper, we present the $\texttt{e-COP}$ algorithm, the first policy optimization algorithm for constrained Reinforcement Learning (RL) in episodic (finite horizon) settings. Such formulations are applicable when there are separate sets of optimization criteria and constraints on a system's behavior. We approach this problem by first establishing a policy difference lemma for the episodic se… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2403.17853  [pdf, other

    cs.CL cs.LG

    Using Domain Knowledge to Guide Dialog Structure Induction via Neural Probabilistic Soft Logic

    Authors: Connor Pryor, Quan Yuan, Jeremiah Liu, Mehran Kazemi, Deepak Ramachandran, Tania Bedrax-Weiss, Lise Getoor

    Abstract: Dialog Structure Induction (DSI) is the task of inferring the latent dialog structure (i.e., a set of dialog states and their temporal transitions) of a given goal-oriented dialog. It is a critical component for modern dialog system design and discourse analysis. Existing DSI approaches are often purely data-driven, deploy models that infer latent states without access to domain knowledge, underpe… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  5. arXiv:2403.05576  [pdf

    cs.HC cs.AI

    Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction

    Authors: Senjuti Dutta, Sherol Chen, Sunny Mak, Amnah Ahmad, Katherine Collins, Alena Butryna, Deepak Ramachandran, Krishnamurthy Dvijotham, Ellie Pavlick, Ravi Rajakumar

    Abstract: Image generation models are poised to become ubiquitous in a range of applications. These models are often fine-tuned and evaluated using human quality judgments that assume a universal standard, failing to consider the subjectivity of such tasks. To investigate how to quantify subjectivity, and the scale of its impact, we measure how assessments differ among human annotators across different use… ▽ More

    Submitted 26 February, 2024; originally announced March 2024.

  6. arXiv:2312.16720  [pdf, other

    cs.CV

    Prompt Expansion for Adaptive Text-to-Image Generation

    Authors: Siddhartha Datta, Alexander Ku, Deepak Ramachandran, Peter Anderson

    Abstract: Text-to-image generation models are powerful but difficult to use. Users craft specific prompts to get better images, though the images can be repetitive. This paper proposes a Prompt Expansion framework that helps users generate high-quality, diverse images with less effort. The Prompt Expansion model takes a text query as input and outputs a set of expanded text prompts that are optimized such t… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  7. arXiv:2312.10240  [pdf, other

    cs.CV

    Rich Human Feedback for Text-to-Image Generation

    Authors: Youwei Liang, Junfeng He, Gang Li, Peizhao Li, Arseniy Klimovskiy, Nicholas Carolan, Jiao Sun, Jordi Pont-Tuset, Sarah Young, Feng Yang, Junjie Ke, Krishnamurthy Dj Dvijotham, Katie Collins, Yiwen Luo, Yang Li, Kai J Kohlhoff, Deepak Ramachandran, Vidhya Navalpakkam

    Abstract: Recent Text-to-Image (T2I) generation models such as Stable Diffusion and Imagen have made significant progress in generating high-resolution images based on text descriptions. However, many generated images still suffer from issues such as artifacts/implausibility, misalignment with text descriptions, and low aesthetic quality. Inspired by the success of Reinforcement Learning with Human Feedback… ▽ More

    Submitted 8 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: CVPR'24

  8. arXiv:2312.09244  [pdf, other

    cs.LG

    Hel** or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

    Authors: Jacob Eisenstein, Chirag Nagpal, Alekh Agarwal, Ahmad Beirami, Alex D'Amour, DJ Dvijotham, Adam Fisch, Katherine Heller, Stephen Pfohl, Deepak Ramachandran, Peter Shaw, Jonathan Berant

    Abstract: Reward models play a key role in aligning language model applications towards human preferences. However, this setup creates an incentive for the language model to exploit errors in the reward model to achieve high estimated reward, a phenomenon often termed \emph{reward hacking}. A natural mitigation is to train an ensemble of reward models, aggregating over model outputs to obtain a more robust… ▽ More

    Submitted 20 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

  9. arXiv:2311.00203  [pdf, other

    cs.AI

    Modeling subjectivity (by Mimicking Annotator Annotation) in toxic comment identification across diverse communities

    Authors: Senjuti Dutta, Sid Mittal, Sherol Chen, Deepak Ramachandran, Ravi Rajakumar, Ian Kivlichan, Sunny Mak, Alena Butryna, Praveen Paritosh

    Abstract: The prevalence and impact of toxic discussions online have made content moderation crucial.Automated systems can play a vital role in identifying toxicity, and reducing the reliance on human moderation.Nevertheless, identifying toxic comments for diverse communities continues to present challenges that are addressed in this paper.The two-part goal of this study is to(1)identify intuitive variances… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  10. arXiv:2310.04475  [pdf, other

    cs.CL cs.AI cs.LG

    Demystifying Embedding Spaces using Large Language Models

    Authors: Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Jihwan Jeong, Lior Shani, Azamat Tulepbergenov, Deepak Ramachandran, Martin Mladenov, Craig Boutilier

    Abstract: Embeddings have become a pivotal means to represent complex, multi-faceted information about entities, concepts, and relationships in a condensed and useful format. Nevertheless, they often preclude direct interpretation. While downstream tasks make use of these compressed representations, meaningful interpretation usually requires visualization using dimensionality reduction or specialized machin… ▽ More

    Submitted 13 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  11. arXiv:2308.15299  [pdf, other

    cs.CL

    TaskLAMA: Probing the Complex Task Understanding of Language Models

    Authors: Quan Yuan, Mehran Kazemi, Xin Xu, Isaac Noble, Vaiva Imbrasaite, Deepak Ramachandran

    Abstract: Structured Complex Task Decomposition (SCTD) is the problem of breaking down a complex real-world task (such as planning a wedding) into a directed acyclic graph over individual steps that contribute to achieving the task, with edges specifying temporal dependencies between them. SCTD is an important component of assistive planning tools, and a challenge for commonsense reasoning systems. We probe… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  12. arXiv:2306.07934  [pdf, other

    cs.CL cs.AI cs.LG

    BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information

    Authors: Mehran Kazemi, Quan Yuan, Deepti Bhatia, Najoung Kim, Xin Xu, Vaiva Imbrasaite, Deepak Ramachandran

    Abstract: Automated reasoning with unstructured natural text is a key requirement for many potential applications of NLP and for develo** robust AI systems. Recently, Language Models (LMs) have demonstrated complex reasoning capacities even without any finetuning. However, existing evaluation for automated reasoning assumes access to a consistent and coherent set of information over which models reason. W… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  13. arXiv:2304.10331  [pdf, other

    physics.atom-ph hep-ex hep-ph

    Nuclear T-violation search using octupole-deformed nuclei in a crystal

    Authors: Harish D. Ramachandran, Amar C. Vutha

    Abstract: Precision measurements with atoms and molecules can search for subtle violations of time-reversal symmetry (T) in nuclei, and thereby probe a variety of new physics models. We present a detailed scheme for a nuclear T-violation search experiment using $^{153}$Eu$^{3+}$ ions doped in non-centrosymmetric sites within a Y$_2$SiO$_5$ crystal. The ions in this solid contain nuclei that are highly sensi… ▽ More

    Submitted 7 July, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 10 pages, 3 figures, 5 tables

    Journal ref: Phys. Rev. A 108, 012819 (2023)

  14. arXiv:2304.06189  [pdf, other

    physics.atom-ph

    Coherent quantum beats: spectroscopy of energy differences masked by inhomogeneous broadening

    Authors: Harish D. Ramachandran, Julia E. Ford, Amar C. Vutha

    Abstract: Precision spectroscopy of solid-state systems is challenging due to inhomogeneous broadening. We describe a technique -- coherent quantum beats -- that enables the measurement of small frequency shifts within an inhomogeneously broadened distribution while addressing the full ensemble. We show that the technique can be used to obtain improvements in signal size and spectral resolution, offering ad… ▽ More

    Submitted 30 June, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: 8 pages, 8 figures

  15. arXiv:2302.05807  [pdf, other

    cs.LG stat.ML

    Pushing the Accuracy-Group Robustness Frontier with Introspective Self-play

    Authors: Jeremiah Zhe Liu, Krishnamurthy Dj Dvijotham, Jihyeon Lee, Quan Yuan, Martin Strobel, Balaji Lakshminarayanan, Deepak Ramachandran

    Abstract: Standard empirical risk minimization (ERM) training can produce deep neural network (DNN) models that are accurate on average but under-perform in under-represented population subgroups, especially when there are imbalanced group distributions in the long-tailed training data. Therefore, approaches that improve the accuracy-group robustness trade-off frontier of a DNN model (i.e. improving worst-g… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted to ICLR 2023. Included additional contribution from Martin Strobel

  16. arXiv:2301.11293  [pdf, other

    cs.CL cs.LG

    Understanding Finetuning for Factual Knowledge Extraction from Language Models

    Authors: Mehran Kazemi, Sid Mittal, Deepak Ramachandran

    Abstract: Language models (LMs) pretrained on large corpora of text from the web have been observed to contain large amounts of various types of knowledge about the world. This observation has led to a new and exciting paradigm in knowledge graph construction where, instead of manual curation or text mining, one extracts knowledge from the parameters of an LM. Recently, it has been shown that finetuning LMs… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  17. arXiv:2212.13894  [pdf, other

    cs.AI cs.LG

    LAMBADA: Backward Chaining for Automated Reasoning in Natural Language

    Authors: Mehran Kazemi, Najoung Kim, Deepti Bhatia, Xin Xu, Deepak Ramachandran

    Abstract: Remarkable progress has been made on automated reasoning with natural text, by using Language Models (LMs) and methods such as Chain-of-Thought and Selection-Inference. These techniques search for proofs in the forward direction from axioms to the conclusion, which suffers from a combinatorial explosion of the search space, and thus high failure rates for problems requiring longer chains of reason… ▽ More

    Submitted 29 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted at ACL 2023

  18. arXiv:2211.06309  [pdf, other

    quant-ph

    A Riemannian Genuine Measure of Entanglement for Pure States

    Authors: Dharmaraj Ramachandran, Radhika Vathsan

    Abstract: While several measures exist for entanglement of multipartite pure states, a true entanglement measure for mixed states still eludes us. A deeper study of the geometry of quantum states may be the way to address this issue, on which context we come up with a measure for pure states based on a geodesic distance on the space of quantum states. Our measure satisfies all the desirable properties of a… ▽ More

    Submitted 13 January, 2024; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: Revised version, submitted to QIP

  19. BaF molecules in neon ice: trap**, spectroscopy and optical control of electron spins

    Authors: Samuel J. Li, Harish D. Ramachandran, Rhys Anderson, Amar C. Vutha

    Abstract: We have trapped BaF molecules in neon ice, and used laser-induced fluorescence spectroscopy to map out optical transitions in the trapped molecules. Our measurements show that the neon lattice does not significantly perturb certain optical transitions in the trapped molecules. We used one of these transitions to polarize the electron spins, detect spin flips and measure hyperfine transitions in th… ▽ More

    Submitted 25 January, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Journal ref: New J. Phys. 25, 082001 (2023)

  20. arXiv:2205.10403  [pdf, other

    cs.LG cs.CC

    Tackling Provably Hard Representative Selection via Graph Neural Networks

    Authors: Mehran Kazemi, Anton Tsitsulin, Hossein Esfandiari, MohammadHossein Bateni, Deepak Ramachandran, Bryan Perozzi, Vahab Mirrokni

    Abstract: Representative Selection (RS) is the problem of finding a small subset of exemplars from a dataset that is representative of the dataset. In this paper, we study RS for attributed graphs, and focus on finding representative nodes that optimize the accuracy of a model trained on the selected representatives. Theoretically, we establish a new hardness result forRS (in the absence of a graph structur… ▽ More

    Submitted 19 July, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted at the Transactions of Machine Learning Research (TMLR) Journal

  21. arXiv:2205.06262  [pdf, other

    cs.CL

    FETA: A Benchmark for Few-Sample Task Transfer in Open-Domain Dialogue

    Authors: Alon Albalak, Yi-Lin Tuan, Pegah Jandaghi, Connor Pryor, Luke Yoffe, Deepak Ramachandran, Lise Getoor, Jay Pujara, William Yang Wang

    Abstract: Task transfer, transferring knowledge contained in related tasks, holds the promise of reducing the quantity of labeled data required to fine-tune language models. Dialogue understanding encompasses many diverse tasks, yet task transfer has not been thoroughly studied in conversational AI. This work explores conversational task transfer by introducing FETA: a benchmark for few-sample task transfer… ▽ More

    Submitted 13 October, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: EMNLP 2022. benchmark available at https://alon-albalak.github.io/feta-website

  22. arXiv:2202.02830  [pdf, other

    cs.IR cs.AI cs.LG

    Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors

    Authors: Christina Göpfert, Alex Haig, Yinlam Chow, Chih-wei Hsu, Ivan Vendrov, Tyler Lu, Deepak Ramachandran, Hubert Pham, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Interactive recommender systems have emerged as a promising paradigm to overcome the limitations of the primitive user feedback used by traditional recommender systems (e.g., clicks, item consumption, ratings). They allow users to express intent, preferences, constraints, and contexts in a richer fashion, often using natural language (including faceted search and dialogue). Yet more research is ne… ▽ More

    Submitted 2 June, 2023; v1 submitted 6 February, 2022; originally announced February 2022.

  23. arXiv:2101.00391  [pdf, other

    cs.CL

    Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering

    Authors: Najoung Kim, Ellie Pavlick, Burcu Karagol Ayan, Deepak Ramachandran

    Abstract: Many Question-Answering (QA) datasets contain unanswerable questions, but their treatment in QA systems remains primitive. Our analysis of the Natural Questions (Kwiatkowski et al. 2019) dataset reveals that a substantial portion of unanswerable questions ($\sim$21%) can be explained based on the presence of unverifiable presuppositions. We discuss the shortcomings of current models in handling su… ▽ More

    Submitted 3 September, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: ACL 2021 Camera-ready

  24. arXiv:2010.05345  [pdf, other

    cs.CL

    Do Language Embeddings Capture Scales?

    Authors: Xikun Zhang, Deepak Ramachandran, Ian Tenney, Yanai Elazar, Dan Roth

    Abstract: Pretrained Language Models (LMs) have been shown to possess significant linguistic, common sense, and factual knowledge. One form of knowledge that has not been studied yet in this context is information about the scalar magnitudes of objects. We show that pretrained language models capture a significant amount of this information but are short of the capability required for general common-sense r… ▽ More

    Submitted 24 November, 2020; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted at EMNLP Findings 2020 and EMNLP BlackboxNLP workshop 2020; 8 pages, 2 figures; Minor changes to the acknowledgment section

    ACM Class: I.2.7

  25. arXiv:1906.01327  [pdf, other

    cs.CL

    How Large Are Lions? Inducing Distributions over Quantitative Attributes

    Authors: Yanai Elazar, Abhijit Mahabal, Deepak Ramachandran, Tania Bedrax-Weiss, Dan Roth

    Abstract: Most current NLP systems have little knowledge about quantitative attributes of objects and events. We propose an unsupervised method for collecting quantitative information from large amounts of web data, and use it to create a new, very large resource consisting of distributions over physical quantities associated with objects, adjectives, and verbs which we call Distributions over Quantitative… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

  26. arXiv:1906.00589  [pdf

    math.CO

    An upper bound for the clique number using clique ceiling numbers

    Authors: R. Dharmarajan, D. Ramachandran

    Abstract: In this article we present the idea of clique ceiling numbers of the vertices of a given graph that has a universal vertex. We follow up with a polynomial-time algorithm to compute an upper bound for the clique number of such a graph using clique ceiling numbers. We compare this algorithm with some upper bound formulas for the clique number.

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: 09 pages

    MSC Class: 05C07; 05C69

  27. arXiv:1903.10700  [pdf

    cs.DS

    On the tractability of the maximum clique problem

    Authors: R. Dharmarajan, D. Ramachandran

    Abstract: The maximum clique problem is a classical NP-complete problem in graph theory and has important applications in many domains. In this paper we show, in a partially non-constructive way, the existence of an exact polynomial-time algorithm for this problem. We outline the algorithm in pseudo-code style. Then we prove its exactness and efficiency by analysis.

    Submitted 17 May, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    Comments: 15 (fifteen) pages

    MSC Class: 05C69

  28. arXiv:1901.00626  [pdf

    math.CO cs.DM cs.DS

    A modified greedy algorithm to improve bounds for the vertex cover number

    Authors: R. Dharmarajan, D. Ramachandran

    Abstract: In any attempt at designing an efficient algorithm for the minimum vertex cover problem, obtaining good upper and lower bounds for the vertex cover number could be crucial. In this article we present a modified greedy algorithm of worst-case time complexity O(n3) to obtain bounds for the vertex cover number of an input graph of order n. Using simple facts, the proposed algorithm computes a lower b… ▽ More

    Submitted 3 January, 2019; originally announced January 2019.

    Comments: 13 pages

    MSC Class: 05C69; 05C70