
Showing 1–11 of 11 results for author: Chaganty, A T

  1. Beyond Single Items: Exploring User Preferences in Item Sets with the Conversational Playlist Curation Dataset

    Authors: Arun Tejasvi Chaganty, Megan Leszczynski, Shu Zhang, Ravi Ganti, Krisztian Balog, Filip Radlinski

    Abstract: Users in consumption domains, like music, are often able to more efficiently provide preferences over a set of items (e.g. a playlist or radio) than over single items (e.g. songs). Unfortunately, this is an underexplored area of research, with most existing recommendation systems limited to understanding preferences over single items. Curating an item set exponentiates the search space that recomm…

    Submitted 5 May, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: Appearing in Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

  2. arXiv:2301.11489  [pdf, other]

    cs.IR cs.CL

    Talk the Walk: Synthetic Data Generation for Conversational Music Recommendation

    Authors: Megan Leszczynski, Shu Zhang, Ravi Ganti, Krisztian Balog, Filip Radlinski, Fernando Pereira, Arun Tejasvi Chaganty

    Abstract: Recommender systems are ubiquitous yet often difficult for users to control, and adjust if recommendation quality is poor. This has motivated conversational recommender systems (CRSs), with control provided through natural language feedback. However, as with most application domains, building robust CRSs requires training data that reflects system usage, here conversations with user…

    Submitted 17 November, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  3. arXiv:2210.08726  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    RARR: Researching and Revising What Language Models Say, Using Language Models

    Authors: Luyu Gao, Zhuyun Dai, Panupong Pasupat, Anthony Chen, Arun Tejasvi Chaganty, Yicheng Fan, Vincent Y. Zhao, Ni Lao, Hongrae Lee, Da-Cheng Juan, Kelvin Guu

    Abstract: Language models (LMs) now excel at many tasks such as few-shot learning, question answering, reasoning, and dialog. However, they sometimes generate unsupported or misleading content. A user cannot easily determine whether their outputs are trustworthy or not, because most LMs do not have any built-in mechanism for attribution to external evidence. To enable attribution while still preserving all…

    Submitted 31 May, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: ACL 2023

  4. arXiv:2205.09073  [pdf, other]

    cs.CL cs.AI

    Dialog Inpainting: Turning Documents into Dialogs

    Authors: Zhuyun Dai, Arun Tejasvi Chaganty, Vincent Zhao, Aida Amini, Qazi Mamunur Rashid, Mike Green, Kelvin Guu

    Abstract: Many important questions (e.g. "How to eat healthier?") require conversation to establish context and explore in depth. However, conversational question answering (ConvQA) systems have long been stymied by scarce training data that is expensive to collect. To address this problem, we propose a new technique for synthetically generating diverse and high-quality dialog data: dialog inpainting. Our a…

    Submitted 31 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

  5. arXiv:2010.04842  [pdf, other]

    cs.LG cs.CL

    Conformal retrofitting via Riemannian manifolds: distilling task-specific graphs into pretrained embeddings

    Authors: Justin Dieter, Arun Tejasvi Chaganty

    Abstract: Pretrained (language) embeddings are versatile, task-agnostic feature representations of entities, like words, that are central to many machine learning applications. These representations can be enriched through retrofitting, a class of methods that incorporate task-specific domain knowledge encoded as a graph over a subset of these entities. However, existing retrofitting algorithms face two lim…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: 14 pages, 5 figures

    ACM Class: I.2.6; I.2.7

  6. arXiv:1809.02700  [pdf, other]

    cs.CL

    Textual Analogy Parsing: What's Shared and What's Compared among Analogous Facts

    Authors: Matthew Lamm, Arun Tejasvi Chaganty, Christopher D. Manning, Dan Jurafsky, Percy Liang

    Abstract: To understand a sentence like "whereas only 10% of White Americans live at or below the poverty line, 28% of African Americans do" it is important not only to identify individual facts, e.g., poverty rates of distinct demographic groups, but also the higher-order relations between them, e.g., the disparity between them. In this paper, we propose the task of Textual Analogy Parsing (TAP) to model t…

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: 12 pages including appendix and references. To be presented at EMNLP 2018

  7. arXiv:1807.02202  [pdf, other]

    cs.CL

    The price of debiasing automatic metrics in natural language evaluation

    Authors: Arun Tejasvi Chaganty, Stephen Mussman, Percy Liang

    Abstract: For evaluating generation systems, automatic metrics such as BLEU cost nothing to run but have been shown to correlate poorly with human judgment, leading to systematic bias against certain model improvements. On the other hand, averaging human judgments, the unbiased gold standard, is often too expensive. In this paper, we use control variates to combine automatic metrics with human evaluation to…

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: To appear at ACL 2018

  8. How Much is 131 Million Dollars? Putting Numbers in Perspective with Compositional Descriptions

    Authors: Arun Tejasvi Chaganty, Percy Liang

    Abstract: How much is 131 million US dollars? To help readers put such numbers in context, we propose a new task of automatically generating short descriptions known as perspectives, e.g. "$131 million is about the cost to employ everyone in Texas over a lunch period". First, we collect a dataset of numeric mentions in news articles, where each mention is labeled with a set of rated perspectives. We then pr…

    Submitted 31 August, 2016; originally announced September 2016.

    Journal ref: ACL (2016), 578-587

  9. arXiv:1603.08482  [pdf, other]

    stat.ML cs.LG

    Estimating Mixture Models via Mixtures of Polynomials

    Authors: Sida I. Wang, Arun Tejasvi Chaganty, Percy Liang

    Abstract: Mixture modeling is a general technique for making any simple model more expressive through weighted combination. This generality and simplicity in part explains the success of the Expectation Maximization (EM) algorithm, in which updates are easy to derive for a wide class of mixture models. However, the likelihood of a mixture model is non-convex, so EM has no known global convergence guarantees…

    Submitted 28 March, 2016; originally announced March 2016.

    Comments: NIPS 2015

  10. arXiv:1501.07320  [pdf, other]

    cs.LG stat.ML

    Tensor Factorization via Matrix Factorization

    Authors: Volodymyr Kuleshov, Arun Tejasvi Chaganty, Percy Liang

    Abstract: Tensor factorization arises in many machine learning applications, such as knowledge base modeling and parameter estimation in latent variable models. However, numerical methods for tensor factorization have not reached the level of maturity of matrix factorization methods. In this paper, we propose a new method for CP tensor factorization that uses random projections to reduce the problem to simulta…

    Submitted 18 May, 2015; v1 submitted 28 January, 2015; originally announced January 2015.

    Comments: Appearing in Proceedings of the 18th International Conference on Artificial Intelligence and Statistics (AISTATS) 2015, San Diego, CA, USA. JMLR: W&CP volume 38

  11. arXiv:1306.3729  [pdf, other]

    cs.LG stat.ML

    Spectral Experts for Estimating Mixtures of Linear Regressions

    Authors: Arun Tejasvi Chaganty, Percy Liang

    Abstract: Discriminative latent-variable models are typically learned using EM or gradient-based optimization, which suffer from local optima. In this paper, we develop a new computationally efficient and provably consistent estimator for a mixture of linear regressions, a simple instance of a discriminative latent-variable model. Our approach relies on a low-rank linear regression to recover a symmetric te…

    Submitted 16 June, 2013; originally announced June 2013.

    Comments: Accepted at ICML 2013. Includes supplementary material