Skip to main content

Showing 1–23 of 23 results for author: Jhamtani, H

.
  1. arXiv:2406.14739  [pdf, other

    cs.CL

    Learning to Retrieve Iteratively for In-Context Learning

    Authors: Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme

    Abstract: We introduce iterative retrieval, a novel framework that empowers retrievers to make iterative decisions through policy optimization. Finding an optimal portfolio of retrieved items is a combinatorial optimization problem, generally considered NP-hard. This approach provides a learned approximation to such a solution, meeting specific task requirements under a given family of large language models… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2311.09796  [pdf, other

    cs.CL cs.AI

    Interpreting User Requests in the Context of Natural Language Standing Instructions

    Authors: Nikita Moghe, Patrick Xia, Jacob Andreas, Jason Eisner, Benjamin Van Durme, Harsh Jhamtani

    Abstract: Users of natural language interfaces, generally powered by Large Language Models (LLMs),often must repeat their preferences each time they make a similar request. We describe an approach to LLM-based dialogue modeling in which persistent user constraints and preferences -- collectively termed standing instructions -- as additional context for such interfaces. For example, when a user states "I'm h… ▽ More

    Submitted 7 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Updated with results from LLaMA-2

  3. arXiv:2309.13075  [pdf, other

    cs.AI cs.CL cs.LG

    SCREWS: A Modular Framework for Reasoning with Revisions

    Authors: Kumar Shridhar, Harsh Jhamtani, Hao Fang, Benjamin Van Durme, Jason Eisner, Patrick Xia

    Abstract: Large language models (LLMs) can improve their accuracy on various tasks through iteratively refining and revising their output based on feedback. We observe that these revisions can introduce errors, in which case it is better to roll back to a previous result. Further, revisions are typically homogeneous: they use the same reasoning method that produced the initial answer, which may not correct… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  4. arXiv:2305.08677  [pdf, other

    cs.CL

    Natural Language Decomposition and Interpretation of Complex Utterances

    Authors: Harsh Jhamtani, Hao Fang, Patrick Xia, Eran Levy, Jacob Andreas, Ben Van Durme

    Abstract: Designing natural language interfaces has historically required collecting supervised data to translate user requests into carefully designed intent representations. This requires enumerating and labeling a long tail of user requests, which is challenging. At the same time, large language models (LLMs) encode knowledge about goals and plans that can help conversational assistants interpret user re… ▽ More

    Submitted 8 January, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

  5. arXiv:2212.10618  [pdf, ps, other

    cs.CL

    Ontologically Faithful Generation of Non-Player Character Dialogues

    Authors: Nathaniel Weir, Ryan Thomas, Randolph D'Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani

    Abstract: We introduce a language generation task grounded in a popular video game environment. KNUDGE (KNowledge Constrained User-NPC Dialogue GEneration) requires models to produce trees of dialogue between video game characters that accurately reflect quest and entity specifications stated in natural language. KNUDGE is constructed from side quest dialogues drawn directly from game data of Obsidian Enter… ▽ More

    Submitted 13 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  6. arXiv:2209.07800  [pdf, other

    cs.CL

    The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction and Constrained Decoding

    Authors: Hao Fang, Anusha Balakrishnan, Harsh Jhamtani, John Bufe, Jean Crawford, Jayant Krishnamurthy, Adam Pauls, Jason Eisner, Jacob Andreas, Dan Klein

    Abstract: In a real-world dialogue system, generated text must be truthful and informative while remaining fluent and adhering to a prescribed style. Satisfying these constraints simultaneously is difficult for the two predominant paradigms in language generation: neural language modeling and rule-based generation. We describe a hybrid architecture for dialogue response generation that combines the strength… ▽ More

    Submitted 26 May, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: Findings of ACL 2023

  7. arXiv:2209.07752  [pdf, other

    cs.CL cs.AI cs.LG

    PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation

    Authors: Sedrick Scott Keh, Kevin Lu, Varun Gangal, Steven Y. Feng, Harsh Jhamtani, Malihe Alikhani, Eduard Hovy

    Abstract: A personification is a figure of speech that endows inanimate entities with properties and actions typically seen as requiring animacy. In this paper, we explore the task of personification generation. To this end, we propose PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation. We curate a corpus of personifications called Personif… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022; official Github repo at https://github.com/sedrickkeh/PINEAPPLE

  8. arXiv:2205.09314  [pdf, other

    cs.CL

    Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation

    Authors: Prakhar Gupta, Harsh Jhamtani, Jeffrey P. Bigham

    Abstract: Target-guided response generation enables dialogue systems to smoothly transition a conversation from a dialogue context toward a target sentence. Such control is useful for designing dialogue systems that direct a conversation toward specific goals, such as creating non-obtrusive recommendations or introducing new topics in the conversation. In this paper, we introduce a new technique for target-… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted at NAACL 2022 (Findings)

  9. arXiv:2203.11399  [pdf, other

    cs.CL

    Achieving Conversational Goals with Unsupervised Post-hoc Knowledge Injection

    Authors: Bodhisattwa Prasad Majumder, Harsh Jhamtani, Taylor Berg-Kirkpatrick, Julian McAuley

    Abstract: A limitation of current neural dialog models is that they tend to suffer from a lack of specificity and informativeness in generated responses, primarily due to dependence on training data that covers a limited variety of scenarios and conveys limited knowledge. One way to alleviate this issue is to extract relevant knowledge from external sources at decoding time and incorporate it into the dialo… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 main conference

  10. arXiv:2110.01839  [pdf, other

    cs.CL cs.LG

    Truth-Conditional Captioning of Time Series Data

    Authors: Harsh Jhamtani, Taylor Berg-Kirkpatrick

    Abstract: In this paper, we explore the task of automatically generating natural language descriptions of salient patterns in a time series, such as stock prices of a company over a week. A model for this task should be able to extract high-level patterns such as presence of a peak or a dip. While typical contemporary neural models with attention mechanisms can generate fluent output descriptions for this t… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021

  11. arXiv:2110.00687  [pdf, other

    cs.CL

    Investigating Robustness of Dialog Models to Popular Figurative Language Constructs

    Authors: Harsh Jhamtani, Varun Gangal, Eduard Hovy, Taylor Berg-Kirkpatrick

    Abstract: Humans often employ figurative language use in communication, including during interactions with dialog systems. Thus, it is important for real-world dialog systems to be able to handle popular figurative language constructs like metaphor and simile. In this work, we analyze the performance of existing dialog models in situations where the input dialog context exhibits use of figurative language.… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 Short Paper

  12. arXiv:2106.08364  [pdf, other

    cs.CL

    Unsupervised Enrichment of Persona-grounded Dialog with Background Stories

    Authors: Bodhisattwa Prasad Majumder, Taylor Berg-Kirkpatrick, Julian McAuley, Harsh Jhamtani

    Abstract: Humans often refer to personal narratives, life experiences, and events to make a conversation more engaging and rich. While persona-grounded dialog models are able to generate responses that follow a given persona, they often miss out on stating detailed experiences or events related to a persona, often leaving conversations shallow and dull. In this work, we equip dialog models with 'background… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted at ACL 2021 for oral presentation

  13. arXiv:2106.02833  [pdf, other

    cs.CL

    Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation

    Authors: Varun Gangal, Harsh Jhamtani, Eduard Hovy, Taylor Berg-Kirkpatrick

    Abstract: Multiple different responses are often plausible for a given open domain dialog context. Prior work has shown the importance of having multiple valid reference responses for meaningful and robust automated evaluations. In such cases, common practice has been to collect more human written references. However, such collection can be expensive, time consuming, and not easily scalable. Instead, we pro… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021

  14. arXiv:2102.01672  [pdf, other

    cs.CL cs.AI cs.LG

    The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

    Authors: Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D. Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak , et al. (31 additional authors not shown)

    Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it… ▽ More

    Submitted 1 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

  15. arXiv:2010.03274  [pdf, other

    cs.CL cs.AI

    Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multihop Question-Answering

    Authors: Harsh Jhamtani, Peter Clark

    Abstract: Despite the rapid progress in multihop question-answering (QA), models still have trouble explaining why an answer is correct, with limited explanation training data available to learn from. To address this, we introduce three explanation datasets in which explanations formed from corpus facts are annotated. Our first dataset, eQASC, contains over 98K explanation annotations for the multihop quest… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  16. arXiv:2010.03272  [pdf, other

    cs.CL cs.AI

    Narrative Text Generation with a Latent Discrete Plan

    Authors: Harsh Jhamtani, Taylor Berg-Kirkpatrick

    Abstract: Past work on story generation has demonstrated the usefulness of conditioning on a generation plan to generate coherent stories. However, these approaches have used heuristics or off-the-shelf models to first tag training stories with the desired type of plan, and then train generation models in a supervised fashion. In this paper, we propose a deep latent variable model that first samples a seque… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020

  17. arXiv:2010.03205  [pdf, other

    cs.CL cs.AI

    Like hiking? You probably enjoy nature: Persona-grounded Dialog with Commonsense Expansions

    Authors: Bodhisattwa Prasad Majumder, Harsh Jhamtani, Taylor Berg-Kirkpatrick, Julian McAuley

    Abstract: Existing persona-grounded dialog models often fail to capture simple implications of given persona descriptions, something which humans are able to do seamlessly. For example, state-of-the-art models cannot infer that interest in hiking might imply love for nature or longing for a break. In this paper, we propose to expand available persona sentences using existing commonsense knowledge bases and… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted in EMNLP 2020

  18. arXiv:1909.06743  [pdf, other

    cs.CL cs.LG

    Learning Rhyming Constraints using Structured Adversaries

    Authors: Harsh Jhamtani, Sanket Vaibhav Mehta, Jaime Carbonell, Taylor Berg-Kirkpatrick

    Abstract: Existing recurrent neural language models often fail to capture higher-level structure present in text: for example, rhyming patterns present in poetry. Much prior work on poetry generation uses manually defined constraints which are satisfied during decoding using either specialized decoding procedures or rejection sampling. The rhyming constraints themselves are typically not learned by the gene… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

    Comments: EMNLP-IJCNLP 2019 Short Paper

  19. arXiv:1808.10584  [pdf, other

    cs.CL cs.CV

    Learning to Describe Differences Between Pairs of Similar Images

    Authors: Harsh Jhamtani, Taylor Berg-Kirkpatrick

    Abstract: In this paper, we introduce the task of automatically generating text to describe the differences between two similar images. We collect a new dataset by crowd-sourcing difference descriptions for pairs of image frames extracted from video-surveillance footage. Annotators were asked to succinctly describe all the differences in a short paragraph. As a result, our novel dataset provides an opportun… ▽ More

    Submitted 30 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018

  20. arXiv:1711.08792  [pdf, other

    cs.CL

    SPINE: SParse Interpretable Neural Embeddings

    Authors: Anant Subramanian, Danish Pruthi, Harsh Jhamtani, Taylor Berg-Kirkpatrick, Eduard Hovy

    Abstract: Prediction without justification has limited utility. Much of the success of neural models can be attributed to their ability to learn rich, dense and expressive representations. While these representations capture the underlying complexity and latent trends in the data, they are far from being interpretable. We propose a novel variant of denoising k-sparse autoencoders that generates highly effic… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

    Comments: AAAI 2018

  21. arXiv:1707.01176  [pdf, other

    cs.CL

    CharManteau: Character Embedding Models For Portmanteau Creation

    Authors: Varun Gangal, Harsh Jhamtani, Graham Neubig, Eduard Hovy, Eric Nyberg

    Abstract: Portmanteaus are a word formation phenomenon where two words are combined to form a new word. We propose character-level neural sequence-to-sequence (S2S) methods for the task of portmanteau generation that are end-to-end-trainable, language independent, and do not explicitly use additional phonetic information. We propose a noisy-channel-style model, which allows for the incorporation of unsuperv… ▽ More

    Submitted 24 July, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

    Comments: Accepted for publication in EMNLP 2017

  22. arXiv:1707.01161  [pdf, other

    cs.CL

    Shakespearizing Modern Language Using Copy-Enriched Sequence-to-Sequence Models

    Authors: Harsh Jhamtani, Varun Gangal, Eduard Hovy, Eric Nyberg

    Abstract: Variations in writing styles are commonly used to adapt the content to a specific context, audience, or purpose. However, applying stylistic variations is still by and large a manual process, and there have been little efforts towards automating it. In this paper we explore automated methods to transform text from modern English to Shakespearean English using an end to end trainable neural model w… ▽ More

    Submitted 20 July, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

    Comments: Accepted at EMNLP 2017 Workshop on Stylistic Variation

  23. arXiv:1706.09335  [pdf, ps, other

    cs.CL

    Generating Appealing Brand Names

    Authors: Gaurush Hiranandani, Pranav Maneriker, Harsh Jhamtani

    Abstract: Providing appealing brand names to newly launched products, newly formed companies or for renaming existing companies is highly important as it can play a crucial role in deciding its success or failure. In this work, we propose a computational method to generate appealing brand names based on the description of such entities. We use quantitative scores for readability, pronounceability, memorabil… ▽ More

    Submitted 28 June, 2017; originally announced June 2017.

    Comments: Has been accepted to and presented in CICLING 2017