Skip to main content

Showing 1–10 of 10 results for author: Adolphs, L

.
  1. arXiv:2211.05826  [pdf, other

    cs.CL cs.AI

    The CRINGE Loss: Learning what language not to model

    Authors: Leonard Adolphs, Tianyu Gao, **g Xu, Kurt Shuster, Sainbayar Sukhbaatar, Jason Weston

    Abstract: Standard language model training employs gold human documents or human-human interaction data, and treats all training data as positive examples. Growing evidence shows that even with very large amounts of positive training data, issues remain that can be alleviated with relatively small amounts of negative data -- examples of what the model should not do. In this work, we propose a novel procedur… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  2. arXiv:2210.12084  [pdf, other

    cs.CL cs.AI cs.LG

    Decoding a Neural Retriever's Latent Space for Query Suggestion

    Authors: Leonard Adolphs, Michelle Chen Huebscher, Christian Buck, Sertan Girgin, Olivier Bachem, Massimiliano Ciaramita, Thomas Hofmann

    Abstract: Neural retrieval models have superseded classic bag-of-words methods such as BM25 as the retrieval framework of choice. However, neural systems lack the interpretability of bag-of-words models; it is not trivial to connect a query change to a change in the latent space that ultimately determines the retrieval results. To shed light on this embedding space, we learn a "query decoder" that, given a… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  3. arXiv:2203.13224  [pdf, other

    cs.CL cs.AI

    Language Models that Seek for Knowledge: Modular Search & Generation for Dialogue and Prompt Completion

    Authors: Kurt Shuster, Mojtaba Komeili, Leonard Adolphs, Stephen Roller, Arthur Szlam, Jason Weston

    Abstract: Language models (LMs) have recently been shown to generate more factual responses by employing modularity (Zhou et al., 2021) in combination with retrieval (Adolphs et al., 2021). We extend the recent approach of Adolphs et al. (2021) to include internet search as a module. Our SeeKeR (Search engine->Knowledge->Response) method thus applies a single LM to three modular tasks in succession: search,… ▽ More

    Submitted 29 March, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

  4. arXiv:2203.10623  [pdf, other

    cs.CL cs.LG

    Calibration of Machine Reading Systems at Scale

    Authors: Shehzaad Dhuliawala, Leonard Adolphs, Rajarshi Das, Mrinmaya Sachan

    Abstract: In typical machine learning systems, an estimate of the probability of the prediction is used to assess the system's confidence in the prediction. This confidence measure is usually uncalibrated; i.e.\ the system's confidence in the prediction does not match the true probability of the predicted output. In this paper, we present an investigation into calibrating open setting machine reading system… ▽ More

    Submitted 23 May, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 Findings

    MSC Class: 68T50 (Primary) 68T07 (Secondary)

  5. arXiv:2111.05204  [pdf, other

    cs.CL cs.AI cs.LG

    Reason first, then respond: Modular Generation for Knowledge-infused Dialogue

    Authors: Leonard Adolphs, Kurt Shuster, Jack Urbanek, Arthur Szlam, Jason Weston

    Abstract: Large language models can produce fluent dialogue but often hallucinate factual inaccuracies. While retrieval-augmented models help alleviate this issue, they still face a difficult challenge of both reasoning to provide correct knowledge and generating conversation simultaneously. In this work, we propose a modular model, Knowledge to Response (K2R), for incorporating knowledge into conversationa… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

  6. arXiv:2109.00527  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Boosting Search Engines with Interactive Agents

    Authors: Leonard Adolphs, Benjamin Boerschinger, Christian Buck, Michelle Chen Huebscher, Massimiliano Ciaramita, Lasse Espeholt, Thomas Hofmann, Yannic Kilcher, Sascha Rothe, Pier Giuseppe Sessa, Lierni Sestorain Saralegui

    Abstract: This paper presents first successful steps in designing search agents that learn meta-strategies for iterative query refinement in information-seeking tasks. Our approach uses machine reading to guide the selection of refinement terms from aggregated search results. Agents are then empowered with simple but effective search operators to exert fine-grained and transparent control over queries and s… ▽ More

    Submitted 7 June, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: Published in Transactions on Machine Learning Research (06/2022)

  7. arXiv:2108.01928  [pdf, other

    cs.CL cs.IR cs.LG

    How to Query Language Models?

    Authors: Leonard Adolphs, Shehzaad Dhuliawala, Thomas Hofmann

    Abstract: Large pre-trained language models (LMs) are capable of not only recovering linguistic but also factual and commonsense knowledge. To access the knowledge stored in mask-based LMs, we can use cloze-style questions and let the model fill in the blank. The flexibility advantage over structured knowledge bases comes with the drawback of finding the right query for a certain information need. Inspired… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

  8. arXiv:1909.01646  [pdf, other

    cs.LG cs.AI stat.ML

    LeDeepChef: Deep Reinforcement Learning Agent for Families of Text-Based Games

    Authors: Leonard Adolphs, Thomas Hofmann

    Abstract: While Reinforcement Learning (RL) approaches lead to significant achievements in a variety of areas in recent history, natural language tasks remained mostly unaffected, due to the compositional and combinatorial nature that makes them notoriously hard to optimize. With the emerging field of Text-Based Games (TBGs), researchers try to bridge this gap. Inspired by the success of RL algorithms on At… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

  9. arXiv:1905.09201  [pdf, other

    cs.LG stat.ML

    Adaptive norms for deep learning with regularized Newton methods

    Authors: Jonas Kohler, Leonard Adolphs, Aurelien Lucchi

    Abstract: We investigate the use of regularized Newton methods with adaptive norms for optimizing neural networks. This approach can be seen as a second-order counterpart of adaptive gradient methods, which we here show to be interpretable as first-order trust region methods with ellipsoidal constraints. In particular, we prove that the preconditioning matrix used in RMSProp and Adam satisfies the necessary… ▽ More

    Submitted 28 September, 2020; v1 submitted 22 May, 2019; originally announced May 2019.

  10. arXiv:1805.05751  [pdf, other

    cs.LG math.OC stat.ML

    Local Saddle Point Optimization: A Curvature Exploitation Approach

    Authors: Leonard Adolphs, Hadi Daneshmand, Aurelien Lucchi, Thomas Hofmann

    Abstract: Gradient-based optimization methods are the most popular choice for finding local optima for classical minimization and saddle point problems. Here, we highlight a systemic issue of gradient dynamics that arise for saddle point problems, namely the presence of undesired stable stationary points that are no local optima. We propose a novel optimization approach that exploits curvature information i… ▽ More

    Submitted 14 February, 2019; v1 submitted 15 May, 2018; originally announced May 2018.