Skip to main content

Showing 1–14 of 14 results for author: Tomar, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.02077  [pdf, other

    cs.CL

    Investigating Content Planning for Navigating Trade-offs in Knowledge-Grounded Dialogue

    Authors: Kushal Chawla, Hannah Rashkin, Gaurav Singh Tomar, David Reitter

    Abstract: Knowledge-grounded dialogue generation is a challenging task because it requires satisfying two fundamental yet often competing constraints: being responsive in a manner that is specific to what the conversation partner has said while also being attributable to an underlying source document. In this work, we bring this trade-off between these two objectives (specificity and attribution) to light a… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted at EACL 2024 Main Conference (Long)

  2. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence

    Authors: Chris Callison-Burch, Gaurav Singh Tomar, Lara J. Martin, Daphne Ippolito, Suma Bailis, David Reitter

    Abstract: AI researchers have posited Dungeons and Dragons (D&D) as a challenge problem to test systems on various language-related capabilities. In this paper, we frame D&D specifically as a dialogue system challenge, where the tasks are to both generate the next conversational turn in the game and predict the state of the game given the dialogue history. We create a gameplay dataset consisting of nearly 9… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022

    Journal ref: Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 9379-9393, Dec. 2022

  4. arXiv:2112.12870  [pdf, other

    cs.CL

    Measuring Attribution in Natural Language Generation Models

    Authors: Hannah Rashkin, Vitaly Nikolaev, Matthew Lamm, Lora Aroyo, Michael Collins, Dipanjan Das, Slav Petrov, Gaurav Singh Tomar, Iulia Turc, David Reitter

    Abstract: With recent improvements in natural language generation (NLG) models for various applications, it has become imperative to have the means to identify and evaluate whether NLG output is only sharing verifiable information about the external world. In this work, we present a new evaluation framework entitled Attributable to Identified Sources (AIS) for assessing the output of natural language genera… ▽ More

    Submitted 2 August, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

  5. arXiv:2112.08558  [pdf, other

    cs.CL

    CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning

    Authors: Zeqiu Wu, Yi Luan, Hannah Rashkin, David Reitter, Hannaneh Hajishirzi, Mari Ostendorf, Gaurav Singh Tomar

    Abstract: Compared to standard retrieval tasks, passage retrieval for conversational question answering (CQA) poses new challenges in understanding the current user question, as each question needs to be interpreted within the dialogue context. Moreover, it can be expensive to re-train well-established retrievers such as search engines that are originally developed for non-conversational queries. To facilit… ▽ More

    Submitted 28 October, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022 camera-ready

  6. arXiv:2107.06963  [pdf, other

    cs.CL

    Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features

    Authors: Hannah Rashkin, David Reitter, Gaurav Singh Tomar, Dipanjan Das

    Abstract: Knowledge-grounded dialogue systems are intended to convey information that is based on evidence provided in a given source text. We discuss the challenges of training a generative neural dialogue model for such systems that is controlled to stay faithful to the evidence. Existing datasets contain a mix of conversational responses that are faithful to selected evidence as well as more subjective o… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: ACL 2021

  7. arXiv:1910.12366  [pdf, other

    cs.CL cs.CR cs.LG

    Thieves on Sesame Street! Model Extraction of BERT-based APIs

    Authors: Kalpesh Krishna, Gaurav Singh Tomar, Ankur P. Parikh, Nicolas Papernot, Mohit Iyyer

    Abstract: We study the problem of model extraction in natural language processing, in which an adversary with only query access to a victim model attempts to reconstruct a local copy of that model. Assuming that both the adversary and victim model fine-tune a large pretrained language model such as BERT (Devlin et al. 2019), we show that the adversary does not need any real training data to successfully mou… ▽ More

    Submitted 12 October, 2020; v1 submitted 27 October, 2019; originally announced October 2019.

    Comments: ICLR 2020 Camera Ready (19 pages)

  8. arXiv:1909.12764  [pdf, other

    cs.CL

    Improving Semantic Parsing with Neural Generator-Reranker Architecture

    Authors: Huseyin A. Inan, Gaurav Singh Tomar, Huapu Pan

    Abstract: Semantic parsing is the problem of deriving machine interpretable meaning representations from natural language utterances. Neural models with encoder-decoder architectures have recently achieved substantial improvements over traditional methods. Although neural semantic parsers appear to have relatively high recall using large beam sizes, there is room for improvement with respect to one-best pre… ▽ More

    Submitted 27 September, 2019; originally announced September 2019.

  9. arXiv:1909.11218  [pdf, other

    cs.CL cs.LG

    Attention Interpretability Across NLP Tasks

    Authors: Shikhar Vashishth, Shyam Upadhyay, Gaurav Singh Tomar, Manaal Faruqui

    Abstract: The attention layer in a neural network model provides insights into the model's reasoning behind its prediction, which are usually criticized for being opaque. Recently, seemingly contradictory viewpoints have emerged about the interpretability of attention weights (Jain & Wallace, 2019; Vig & Belinkov, 2019). Amid such confusion arises the need to understand attention mechanism more systematical… ▽ More

    Submitted 24 September, 2019; originally announced September 2019.

    Report number: 2019

  10. arXiv:1811.08008  [pdf, ps, other

    cs.IR cs.CL cs.LG

    End-to-End Retrieval in Continuous Space

    Authors: Daniel Gillick, Alessandro Presta, Gaurav Singh Tomar

    Abstract: Most text-based information retrieval (IR) systems index objects by words or phrases. These discrete systems have been augmented by models that use embeddings to measure similarity in continuous space. But continuous-space models are typically used just to re-rank the top candidates. We consider the problem of end-to-end continuous retrieval, where standard approximate nearest neighbor (ANN) searc… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

  11. arXiv:1809.04177  [pdf, other

    cs.HC

    Time Series Analysis of Clickstream Logs from Online Courses

    Authors: Yohan Jo, Keith Maki, Gaurav Tomar

    Abstract: Due to the rapidly rising popularity of Massive Open Online Courses (MOOCs), there is a growing demand for scalable automated support technologies for student learning. Transferring traditional educational resources to online contexts has become an increasingly relevant problem in recent years. For learning science theories to be applicable, educators need a way to identify learning behaviors of s… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: Technical Report

  12. arXiv:1704.05543  [pdf

    cs.CY cs.AI cs.CL cs.HC

    Coordinating Collaborative Chat in Massive Open Online Courses

    Authors: Gaurav Singh Tomar, Sreecharan Sankaranarayanan, Xu Wang, Carolyn Penstein Rosé

    Abstract: An earlier study of a collaborative chat intervention in a Massive Open Online Course (MOOC) identified negative effects on attrition stemming from a requirement for students to be matched with exactly one partner prior to beginning the activity. That study raised questions about how to orchestrate a collaborative chat intervention in a MOOC context in order to provide the benefit of synchronous s… ▽ More

    Submitted 18 April, 2017; originally announced April 2017.

    Comments: 8 pages

    Journal ref: Proceedings of the International Conference of the Learning Sciences 2016, Volume 1, pp 607-614

  13. arXiv:1704.04565  [pdf, other

    cs.CL

    Neural Paraphrase Identification of Questions with Noisy Pretraining

    Authors: Gaurav Singh Tomar, Thyago Duque, Oscar Täckström, Jakob Uszkoreit, Dipanjan Das

    Abstract: We present a solution to the problem of paraphrase identification of questions. We focus on a recent dataset of question pairs annotated with binary paraphrase labels and show that a variant of the decomposable attention model (Parikh et al., 2016) results in accurate performance on this task, while being far simpler than many competing neural architectures. Furthermore, when the model is pretrain… ▽ More

    Submitted 19 August, 2017; v1 submitted 14 April, 2017; originally announced April 2017.

  14. arXiv:1605.02836  [pdf, other

    cs.CY

    Expediting Support for Social Learning with Behavior Modeling

    Authors: Yohan Jo, Gaurav Tomar, Oliver Ferschke, Carolyn P. Rose, Dragan Gasevic

    Abstract: An important research problem for Educational Data Mining is to expedite the cycle of data leading to the analysis of student learning processes and the improvement of support for those processes. For this goal in the context of social interaction in learning, we propose a three-part pipeline that includes data infrastructure, learning process analysis with behavior modeling, and intervention for… ▽ More

    Submitted 13 July, 2016; v1 submitted 9 May, 2016; originally announced May 2016.

    Comments: in The 9th International Conference on Educational Data Mining, 2016