Skip to main content

Showing 1–34 of 34 results for author: Pasunuru, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.17464  [pdf, other

    cs.CL

    Efficient Tool Use with Chain-of-Abstraction Reasoning

    Authors: Silin Gao, Jane Dwivedi-Yu, ** Yu, Xiaoqing Ellen Tan, Ramakanth Pasunuru, Olga Golovneva, Koustuv Sinha, Asli Celikyilmaz, Antoine Bosselut, Tianlu Wang

    Abstract: To achieve faithful reasoning that aligns with human expectations, large language models (LLMs) need to ground their reasoning to real-world knowledge (e.g., web facts, math and physical rules). Tools help LLMs access this external knowledge, but there remains challenges for fine-tuning LLM agents (e.g., Toolformer) to invoke tools in multi-step reasoning problems, where inter-connected tool calls… ▽ More

    Submitted 26 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  2. arXiv:2312.05180  [pdf, other

    cs.CL

    PathFinder: Guided Search over Multi-Step Reasoning Paths

    Authors: Olga Golovneva, Sean O'Brien, Ramakanth Pasunuru, Tianlu Wang, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz

    Abstract: With recent advancements in large language models, methods like chain-of-thought prompting to elicit reasoning chains have been shown to improve results on reasoning tasks. However, tasks that require multiple steps of reasoning still pose significant challenges to state-of-the-art models. Drawing inspiration from the beam search algorithm, we propose PathFinder, a tree-search-based reasoning path… ▽ More

    Submitted 12 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 R0-FoMo Workshop

  3. arXiv:2311.07961  [pdf, other

    cs.CL

    The ART of LLM Refinement: Ask, Refine, and Trust

    Authors: Kumar Shridhar, Koustuv Sinha, Andrew Cohen, Tianlu Wang, ** Yu, Ram Pasunuru, Mrinmaya Sachan, Jason Weston, Asli Celikyilmaz

    Abstract: In recent years, Large Language Models (LLMs) have demonstrated remarkable generative abilities, but can they judge the quality of their own generations? A popular concept, referred to as self-refinement, postulates that LLMs can detect and correct the errors in their generations when asked to do so. However, recent empirical evidence points in the opposite direction, suggesting that LLMs often st… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  4. arXiv:2310.05029  [pdf, other

    cs.CL

    Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

    Authors: Howard Chen, Ramakanth Pasunuru, Jason Weston, Asli Celikyilmaz

    Abstract: Large language models (LLMs) have advanced in large strides due to the effectiveness of the self-attention mechanism that processes and compares all tokens at once. However, this mechanism comes with a fundamental issue -- the predetermined context window is bound to be limited. Despite attempts to extend the context window through methods like extrapolating the positional embedding, using recurre… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

  5. arXiv:2310.04921  [pdf, other

    cs.AI cs.CL cs.LG

    Crystal: Introspective Reasoners Reinforced with Self-Feedback

    Authors: Jiacheng Liu, Ramakanth Pasunuru, Hannaneh Hajishirzi, Ye** Choi, Asli Celikyilmaz

    Abstract: Extensive work has shown that the performance and interpretability of commonsense reasoning can be improved via knowledge-augmented reasoning methods, where the knowledge that underpins the reasoning process is explicitly verbalized and utilized. However, existing implementations, including "chain-of-thought" and its variants, fall short in capturing the introspective nature of knowledge required… ▽ More

    Submitted 18 October, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 main conference

  6. arXiv:2309.15028  [pdf, other

    cs.CL cs.AI cs.LG

    Don't throw away your value model! Generating more preferable text with Value-Guided Monte-Carlo Tree Search decoding

    Authors: Jiacheng Liu, Andrew Cohen, Ramakanth Pasunuru, Ye** Choi, Hannaneh Hajishirzi, Asli Celikyilmaz

    Abstract: Inference-time search algorithms such as Monte-Carlo Tree Search (MCTS) may seem unnecessary when generating natural language text based on state-of-the-art reinforcement learning such as Proximal Policy Optimization (PPO). In this paper, we demonstrate that it is possible to get extra mileage out of PPO by integrating MCTS on top. The key idea is not to throw out the value network, a byproduct of… ▽ More

    Submitted 2 April, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  7. arXiv:2309.02591  [pdf, other

    cs.LG cs.CL cs.CV

    Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

    Authors: Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz , et al. (2 additional authors not shown)

    Abstract: We present CM3Leon (pronounced "Chameleon"), a retrieval-augmented, token-based, decoder-only multi-modal language model capable of generating and infilling both text and images. CM3Leon uses the CM3 multi-modal architecture but additionally shows the extreme benefits of scaling up and tuning on more diverse instruction-style data. It is the first multi-modal model trained with a recipe adapted fr… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  8. arXiv:2308.04592  [pdf, other

    cs.CL cs.AI

    Shepherd: A Critic for Language Model Generation

    Authors: Tianlu Wang, ** Yu, Xiaoqing Ellen Tan, Sean O'Brien, Ramakanth Pasunuru, Jane Dwivedi-Yu, Olga Golovneva, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz

    Abstract: As large language models improve, there is increasing interest in techniques that leverage these models' capabilities to refine their own outputs. In this work, we introduce Shepherd, a language model specifically tuned to critique responses and suggest refinements, extending beyond the capabilities of an untuned model to identify diverse errors and provide suggestions to remedy them. At the core… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 7 figures, 7 tables

  9. arXiv:2302.07842  [pdf, ps, other

    cs.CL

    Augmented Language Models: a Survey

    Authors: Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-Yu, Asli Celikyilmaz, Edouard Grave, Yann LeCun, Thomas Scialom

    Abstract: This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools. The former is defined as decomposing a potentially complex task into simpler subtasks while the latter consists in calling external modules such as a code interpreter. LMs can leverage these augmentations separately or in combination via heuristics, or learn to do so from demo… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  10. arXiv:2212.12017  [pdf, other

    cs.CL

    OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization

    Authors: Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, ** Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'Horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov

    Abstract: Recent work has shown that fine-tuning large pre-trained language models on a collection of tasks described via instructions, a.k.a. instruction-tuning, improves their zero and few-shot generalization to unseen tasks. However, there is a limited understanding of the performance trade-offs of different decisions made during the instruction-tuning process. These decisions include the scale and diver… ▽ More

    Submitted 30 January, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: 56 pages. v2->v3: fix OPT-30B evaluation results across benchmarks (previously we reported lower performance of this model due to an evaluation pipeline bug)

  11. arXiv:2212.09803  [pdf, other

    cs.CL cs.AI cs.LG

    Training Trajectories of Language Models Across Scales

    Authors: Mengzhou Xia, Mikel Artetxe, Chunting Zhou, Xi Victoria Lin, Ramakanth Pasunuru, Danqi Chen, Luke Zettlemoyer, Ves Stoyanov

    Abstract: Scaling up language models has led to unprecedented performance gains, but little is understood about how the training dynamics change as models get larger. How do language models of different sizes learn during pre-training? Why do larger language models demonstrate more desirable behaviors? In this paper, we analyze the intermediate training checkpoints of differently sized OPT models (Zhang et… ▽ More

    Submitted 29 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted to ACL 2023; The code and analysis results are available at https://github.com/xiamengzhou/training_trajectory_analysis

  12. arXiv:2212.08607  [pdf, other

    cs.CL cs.AI cs.LG

    MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation

    Authors: Swarnadeep Saha, Xinyan Velocity Yu, Mohit Bansal, Ramakanth Pasunuru, Asli Celikyilmaz

    Abstract: Prompting large language models has enabled significant recent progress in multi-step reasoning over text. However, when applied to text generation from semi-structured data (e.g., graphs or tables), these methods typically suffer from low semantic coverage, hallucination, and logical inconsistency. We propose MURMUR, a neuro-symbolic modular approach to text generation from semi-structured data w… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: 22 pages (9 figures, 18 tables)

  13. arXiv:2211.13892  [pdf, other

    cs.CL

    Complementary Explanations for Effective In-Context Learning

    Authors: Xi Ye, Srinivasan Iyer, Asli Celikyilmaz, Ves Stoyanov, Greg Durrett, Ramakanth Pasunuru

    Abstract: Large language models (LLMs) have exhibited remarkable capabilities in learning from explanations in prompts, but there has been limited understanding of exactly how these explanations function or why they are effective. This work aims to better understand the mechanisms by which explanations are used for in-context learning. We first study the impact of two different factors on the performance of… ▽ More

    Submitted 12 June, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: ACL Findings 2023 Camera-Ready

  14. arXiv:2205.01703  [pdf, other

    cs.CL

    Improving In-Context Few-Shot Learning via Self-Supervised Training

    Authors: Mingda Chen, **gfei Du, Ramakanth Pasunuru, Todor Mihaylov, Srini Iyer, Veselin Stoyanov, Zornitsa Kozareva

    Abstract: Self-supervised pretraining has made few-shot learning possible for many NLP tasks. But the pretraining objectives are not typically adapted specifically for in-context few-shot learning. In this paper, we propose to use self-supervision in an intermediate training stage between pretraining and downstream few-shot usage with the goal to teach the model to perform in-context few shot learning. We p… ▽ More

    Submitted 6 June, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  15. arXiv:2112.10684  [pdf, other

    cs.CL cs.AI cs.LG

    Efficient Large Scale Language Modeling with Mixtures of Experts

    Authors: Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, **gfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Mona Diab, Zornitsa Kozareva, Ves Stoyanov

    Abstract: Mixture of Experts layers (MoEs) enable efficient scaling of language models through conditional computation. This paper presents a detailed empirical study of how autoregressive MoE language models scale in comparison with dense models in a wide range of settings: in- and out-of-domain language modeling, zero- and few-shot priming, and full-shot fine-tuning. With the exception of fine-tuning, we… ▽ More

    Submitted 26 October, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022

  16. arXiv:2112.10668  [pdf, other

    cs.CL cs.AI

    Few-shot Learning with Multilingual Language Models

    Authors: Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, **gfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona Diab, Veselin Stoyanov, Xian Li

    Abstract: Large-scale generative language models such as GPT-3 are competitive few-shot learners. While these models are known to be able to jointly represent many different languages, their training data is dominated by English, potentially limiting their cross-lingual generalization. In this work, we train multilingual generative language models on a corpus covering a diverse set of languages, and study t… ▽ More

    Submitted 10 November, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: Accepted to EMNLP 2022; 34 pages

  17. arXiv:2112.08770  [pdf, other

    cs.CL cs.LG

    Proposition-Level Clustering for Multi-Document Summarization

    Authors: Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, Ido Dagan

    Abstract: Text clustering methods were traditionally incorporated into multi-document summarization (MDS) as a means for co** with considerable information repetition. Particularly, clusters were leveraged to indicate information saliency as well as to avoid redundancy. Such prior methods focused on clustering sentences, even though closely related sentences usually contain also non-aligned parts. In this… ▽ More

    Submitted 19 May, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: NAACl 2022

  18. arXiv:2110.01073  [pdf, other

    cs.CL

    Multi-Document Keyphrase Extraction: Dataset, Baselines and Review

    Authors: Ori Shapira, Ramakanth Pasunuru, Ido Dagan, Yael Amsterdamer

    Abstract: Keyphrase extraction has been extensively researched within the single-document setting, with an abundance of methods, datasets and applications. In contrast, multi-document keyphrase extraction has been infrequently studied, despite its utility for describing sets of documents, and its use in summarization. Moreover, no prior dataset exists for multi-document keyphrase extraction, hindering the p… ▽ More

    Submitted 1 July, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

  19. arXiv:2109.11621  [pdf, other

    cs.CL

    iFacetSum: Coreference-based Interactive Faceted Summarization for Multi-Document Exploration

    Authors: Eran Hirsch, Alon Eirew, Ori Shapira, Avi Caciularu, Arie Cattan, Ori Ernst, Ramakanth Pasunuru, Hadar Ronen, Mohit Bansal, Ido Dagan

    Abstract: We introduce iFacetSum, a web application for exploring topical document sets. iFacetSum integrates interactive summarization together with faceted search, by providing a novel faceted navigation scheme that yields abstractive summaries for the user's selections. This approach offers both a comprehensive overview as well as concise details regarding subtopics of choice. Fine-grained facets are aut… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: Proceedings of EMNLP 2021, System Demonstrations. 7 pages and an appendix

  20. arXiv:2103.01867  [pdf, other

    cs.CL cs.AI cs.CV

    Dual Reinforcement-Based Specification Generation for Image De-Rendering

    Authors: Ramakanth Pasunuru, David Rosenberg, Gideon Mann, Mohit Bansal

    Abstract: Advances in deep learning have led to promising progress in inferring graphics programs by de-rendering computer-generated images. However, current methods do not explore which decoding methods lead to better inductive bias for inferring graphics programs. In our work, we first explore the effectiveness of LSTM-RNN versus Transformer networks as decoders for order-independent graphics programs. Si… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: AAAI 2021 Scientific Document Understanding Workshop (9 pages)

  21. arXiv:2103.01863  [pdf, other

    cs.CL cs.AI

    Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

    Authors: Ramakanth Pasunuru, Asli Celikyilmaz, Michel Galley, Chenyan Xiong, Yizhe Zhang, Mohit Bansal, Jianfeng Gao

    Abstract: The progress in Query-focused Multi-Document Summarization (QMDS) has been limited by the lack of sufficient largescale high-quality training datasets. We present two QMDS training datasets, which we construct using two data augmentation methods: (1) transferring the commonly used single-document CNN/Daily Mail summarization dataset to create the QMDSCNN dataset, and (2) mining search-query logs t… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: AAAI 2021 (13 pages)

  22. arXiv:2011.07635  [pdf, other

    cs.CL cs.AI cs.LG

    DORB: Dynamically Optimizing Multiple Rewards with Bandits

    Authors: Ramakanth Pasunuru, Han Guo, Mohit Bansal

    Abstract: Policy gradients-based reinforcement learning has proven to be a promising approach for directly optimizing non-differentiable evaluation metrics for language generation tasks. However, optimizing for a specific metric reward leads to improvements in mostly that metric only, suggesting that the model is gaming the formulation of that metric in a particular way without often achieving real qualitat… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: EMNLP 2020 (15 pages)

  23. arXiv:2009.08380  [pdf, other

    cs.CL

    Evaluating Interactive Summarization: an Expansion-Based Framework

    Authors: Ori Shapira, Ramakanth Pasunuru, Hadar Ronen, Mohit Bansal, Yael Amsterdamer, Ido Dagan

    Abstract: Allowing users to interact with multi-document summarizers is a promising direction towards improving and customizing summary results. Different ideas for interactive summarization have been proposed in previous work but these solutions are highly divergent and incomparable. In this paper, we develop an end-to-end evaluation framework for expansion-based interactive summarization, which considers… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

  24. arXiv:2009.00590  [pdf, other

    cs.CL

    Summary-Source Proposition-level Alignment: Task, Datasets and Supervised Baseline

    Authors: Ori Ernst, Ori Shapira, Ramakanth Pasunuru, Michael Lepioshkin, Jacob Goldberger, Mohit Bansal, Ido Dagan

    Abstract: Aligning sentences in a reference summary with their counterparts in source documents was shown as a useful auxiliary summarization task, notably for generating training data for salience detection. Despite its assessed utility, the alignment step was mostly approached with heuristic unsupervised methods, typically ROUGE-based, and was never independently optimized or evaluated. In this paper, we… ▽ More

    Submitted 22 September, 2021; v1 submitted 1 September, 2020; originally announced September 2020.

    Comments: CoNLL 2021

  25. arXiv:2001.04362  [pdf, other

    cs.CL cs.LG stat.ML

    Multi-Source Domain Adaptation for Text Classification via DistanceNet-Bandits

    Authors: Han Guo, Ramakanth Pasunuru, Mohit Bansal

    Abstract: Domain adaptation performance of a learning algorithm on a target domain is a function of its source domain error and a divergence measure between the data distribution of these two domains. We present a study of various distance-based measures in the context of NLP tasks, that characterize the dissimilarity between domains based on sample estimates. We first conduct analysis experiments to show w… ▽ More

    Submitted 3 March, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: AAAI 2020 (10 pages)

  26. arXiv:1906.05226  [pdf, other

    cs.CL cs.CV cs.LG

    Continual and Multi-Task Architecture Search

    Authors: Ramakanth Pasunuru, Mohit Bansal

    Abstract: Architecture search is the process of automatically learning the neural model or cell structure that best suits the given task. Recently, this approach has shown promising performance improvements (on language modeling and image classification) with reasonable training speed, using a weight sharing strategy called Efficient Neural Architecture Search (ENAS). In our work, we first introduce a novel… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: ACL 2019 (12 pages)

  27. arXiv:1904.05929  [pdf, ps, other

    cs.CL

    Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation

    Authors: Ori Shapira, David Gabay, Yang Gao, Hadar Ronen, Ramakanth Pasunuru, Mohit Bansal, Yael Amsterdamer, Ido Dagan

    Abstract: Conducting a manual evaluation is considered an essential part of summary evaluation methodology. Traditionally, the Pyramid protocol, which exhaustively compares system summaries to references, has been perceived as very reliable, providing objective scores. Yet, due to the high cost of the Pyramid method and the required expertise, researchers resorted to cheaper and less thorough manual evaluat… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: 5 pages, 2 graphs, 1 table. Published in NAACL 2019

  28. arXiv:1904.04153  [pdf, other

    cs.CL cs.LG stat.ML

    AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning

    Authors: Han Guo, Ramakanth Pasunuru, Mohit Bansal

    Abstract: Multi-task learning (MTL) has achieved success over a wide range of problems, where the goal is to improve the performance of a primary task using a set of relevant auxiliary tasks. However, when the usefulness of the auxiliary tasks w.r.t. the primary task is not known a priori, the success of MTL models depends on the correct choice of these auxiliary tasks and also a balanced mixing ratio of th… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: NAACL 2019 (12 pages)

  29. arXiv:1809.04560  [pdf, other

    cs.CL cs.AI cs.CV

    Game-Based Video-Context Dialogue

    Authors: Ramakanth Pasunuru, Mohit Bansal

    Abstract: Current dialogue systems focus more on textual and speech context knowledge and are usually based on two speakers. Some recent work has investigated static image-based dialogue. However, several real-world human interactions also involve dynamic visual context (similar to videos) as well as dialogue exchanges among multiple speakers. To move closer towards such multimodal conversational skills and… ▽ More

    Submitted 17 October, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018 (14 pages) (fixed Table5 typo in v2)

  30. arXiv:1806.07304  [pdf, other

    cs.CL cs.AI cs.LG

    Dynamic Multi-Level Multi-Task Learning for Sentence Simplification

    Authors: Han Guo, Ramakanth Pasunuru, Mohit Bansal

    Abstract: Sentence simplification aims to improve readability and understandability, based on several operations such as splitting, deletion, and paraphrasing. However, a valid simplified sentence should also be logically entailed by its input sentence. In this work, we first present a strong pointer-copy mechanism based sequence-to-sequence sentence simplification model, and then improve its entailment and… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: COLING 2018 (15 pages)

  31. arXiv:1805.11004  [pdf, other

    cs.CL cs.AI cs.LG

    Soft Layer-Specific Multi-Task Summarization with Entailment and Question Generation

    Authors: Han Guo, Ramakanth Pasunuru, Mohit Bansal

    Abstract: An accurate abstractive summary of a document should contain all its salient information and should be logically entailed by the input document. We improve these important aspects of abstractive summarization via multi-task learning with the auxiliary tasks of question generation and entailment generation, where the former teaches the summarization model how to look for salient questioning-worthy… ▽ More

    Submitted 28 May, 2018; originally announced May 2018.

    Comments: ACL 2018 (16 pages)

  32. arXiv:1804.06451  [pdf, other

    cs.CL cs.AI cs.LG

    Multi-Reward Reinforced Summarization with Saliency and Entailment

    Authors: Ramakanth Pasunuru, Mohit Bansal

    Abstract: Abstractive text summarization is the task of compressing and rewriting a long document into a short summary while maintaining saliency, directed logical entailment, and non-redundancy. In this work, we address these three important aspects of a good summary via a reinforcement learning approach with two novel reward functions: ROUGESal and Entail, on top of a coverage-based baseline. The ROUGESal… ▽ More

    Submitted 29 May, 2018; v1 submitted 17 April, 2018; originally announced April 2018.

    Comments: NAACL 2018 (9 pages; added human evaluation and more analysis)

  33. arXiv:1708.02300  [pdf, ps, other

    cs.CL cs.AI cs.CV cs.LG

    Reinforced Video Captioning with Entailment Rewards

    Authors: Ramakanth Pasunuru, Mohit Bansal

    Abstract: Sequence-to-sequence models have shown promising improvements on the temporal task of video captioning, but they optimize word-level cross-entropy loss during training. First, using policy gradient and mixed-loss methods for reinforcement learning, we directly optimize sentence-level task-based metrics (as rewards), achieving significant improvements over the baseline, based on both automatic metr… ▽ More

    Submitted 7 August, 2017; originally announced August 2017.

    Comments: EMNLP 2017 (9 pages)

  34. arXiv:1704.07489  [pdf, ps, other

    cs.CL cs.AI cs.CV

    Multi-Task Video Captioning with Video and Entailment Generation

    Authors: Ramakanth Pasunuru, Mohit Bansal

    Abstract: Video captioning, the task of describing the content of a video, has seen some promising improvements in recent years with sequence-to-sequence models, but accurately learning the temporal and logical dynamics involved in the task still remains a challenge, especially given the lack of sufficient annotated data. We improve video captioning by sharing knowledge with two related directed-generation… ▽ More

    Submitted 8 August, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: ACL 2017 (14 pages w/ supplementary)