Showing 1–32 of 32 results for author: Lamprier, S

  1. arXiv:2403.00420  [pdf, other]

    cs.LG cs.AI

    Robust Deep Reinforcement Learning Through Adversarial Attacks and Training : A Survey

    Authors: Lucas Schott, Josephine Delas, Hatem Hajri, Elies Gherbi, Reda Yaich, Nora Boulahia-Cuppens, Frederic Cuppens, Sylvain Lamprier

    Abstract: Deep Reinforcement Learning (DRL) is an approach for training autonomous agents across various complex environments. Despite its significant performance in well known environments, it remains susceptible to minor conditions variations, raising concerns about its reliability in real-world applications. To improve usability, DRL must demonstrate trustworthiness and robustness. A way to improve robus…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 57 pages, 16 figures, 2 tables

  2. arXiv:2402.13288  [pdf, other]

    cs.DB cs.AI

    Training Table Question Answering via SQL Query Decomposition

    Authors: Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier

    Abstract: Table Question-Answering involves both understanding the natural language query and grounding it in the context of the input table to extract the relevant information. In this context, many methods have highlighted the benefits of intermediate pre-training from SQL queries. However, while most approaches aim at generating final answers from inputs directly, we claim that there is better to do with…

    Submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2402.09166  [pdf, other]

    cs.LG

    Deinterleaving of Discrete Renewal Process Mixtures with Application to Electronic Support Measures

    Authors: Jean Pinsolle, Olivier Goudet, Cyrille Enderli, Sylvain Lamprier, Jin-Kao Hao

    Abstract: In this paper, we propose a new deinterleaving method for mixtures of discrete renewal Markov chains. This method relies on the maximization of a penalized likelihood score. It exploits all available information about both the sequence of the different symbols and their arrival times. A theoretical analysis is carried out to prove that minimizing this score allows to recover the true partition of…

    Submitted 14 February, 2024; originally announced February 2024.

  4. arXiv:2310.18413  [pdf, other]

    cs.LG cs.AI stat.ML

    On the Fairness ROAD: Robust Optimization for Adversarial Debiasing

    Authors: Vincent Grari, Thibault Laugel, Tatsunori Hashimoto, Sylvain Lamprier, Marcin Detyniecki

    Abstract: In the field of algorithmic fairness, significant attention has been put on group fairness criteria, such as Demographic Parity and Equalized Odds. Nevertheless, these objectives, measured as global averages, have raised concerns about persistent local disparities between sensitive groups. In this work, we address the problem of local fairness, which ensures that the predictor is unbiased not only…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 23 pages, 10 figures

  5. arXiv:2302.11223  [pdf, other]

    cs.LG

    Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search

    Authors: Pierre-Alexandre Kamienny, Guillaume Lample, Sylvain Lamprier, Marco Virgolin

    Abstract: Symbolic regression (SR) is the problem of learning a symbolic expression from numerical data. Recently, deep neural models trained on procedurally-generated synthetic datasets showed competitive performance compared to more classical Genetic Programming (GP) algorithms. Unlike their GP counterparts, these neural approaches are trained to generate expressions from datasets given as context. This a…

    Submitted 10 May, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

  6. arXiv:2302.02662  [pdf, other]

    cs.LG

    Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

    Authors: Thomas Carta, Clément Romac, Thomas Wolf, Sylvain Lamprier, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Recent works successfully leveraged Large Language Models' (LLM) abilities to capture abstract knowledge about world's physics to solve decision-making problems. Yet, the alignment between LLMs' knowledge and the environment can be wrong and limit functional competence due to lack of grounding. In this paper, we study an approach (named GLAM) to achieve this alignment through functional grounding:…

    Submitted 6 September, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Journal ref: PMLR 202 (2023):3676-3713

  7. arXiv:2206.09674  [pdf, other]

    cs.CL cs.AI cs.LG

    EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL

    Authors: Thomas Carta, Pierre-Yves Oudeyer, Olivier Sigaud, Sylvain Lamprier

    Abstract: Reinforcement learning (RL) in long horizon and sparse reward tasks is notoriously difficult and requires a lot of training steps. A standard solution to speed up the process is to leverage additional reward signals, shaping it to better guide the learning process. In the context of language-conditioned RL, the abstraction and generalisation properties of the language input provide opportunities f…

    Submitted 13 October, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: 24 pages, 16 figures, 5 tables

  8. arXiv:2206.06719  [pdf, other]

    cs.LG

    Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning

    Authors: Nicolas Castanet, Sylvain Lamprier, Olivier Sigaud

    Abstract: In multi-goal Reinforcement Learning, an agent can share experience between related training tasks, resulting in better generalization for new tasks at test time. However, when the goal space has discontinuities and the reward is sparse, a majority of goals are difficult to reach. In this context, a curriculum over goals helps agents learn by adapting training tasks to their current capabilities.…

    Submitted 2 May, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

  9. Which Discriminator for Cooperative Text Generation?

    Authors: Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, Ewa Kijak, Vincent Claveau

    Abstract: Language models generate texts by successively predicting probability distributions for next tokens given past ones. A growing field of interest tries to leverage external information in the decoding process so that the generated texts have desired properties, such as being more natural, non toxic, faithful, or having a specific writing style. A solution is to use a classifier at each generation s…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 6 pages, 2 figures, accepted to SIGIR 2022

  10. arXiv:2201.12320  [pdf, other]

    cs.LG cs.CL

    Generative Cooperative Networks for Natural Language Generation

    Authors: Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, Benjamin Piwowarski

    Abstract: Generative Adversarial Networks (GANs) have known a tremendous success for many continuous generation tasks, especially in the field of image generation. However, for discrete outputs such as language, optimizing GANs remains an open problem with many instabilities, as no gradient can be properly back-propagated from the discriminator output to the generator parameters. An alternative is to learn…

    Submitted 28 January, 2022; originally announced January 2022.

  11. arXiv:2110.14457  [pdf, other]

    cs.LG

    Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching

    Authors: Pierre-Alexandre Kamienny, Jean Tarbouriech, Sylvain Lamprier, Alessandro Lazaric, Ludovic Denoyer

    Abstract: Learning meaningful behaviors in the absence of reward is a difficult problem in reinforcement learning. A desirable and challenging unsupervised objective is to learn a set of diverse skills that provide a thorough coverage of the state space while being directed, i.e., reliably reaching distinct regions of the environment. In this paper, we build on the mutual information framework for skill dis…

    Submitted 30 April, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: ICLR 2022

  12. arXiv:2109.04999  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    Fairness without the sensitive attribute via Causal Variational Autoencoder

    Authors: Vincent Grari, Sylvain Lamprier, Marcin Detyniecki

    Abstract: In recent years, most fairness strategies in machine learning models focus on mitigating unwanted biases by assuming that the sensitive information is observed. However this is not always possible in practice. Due to privacy purposes and various regulations such as RGPD in EU, many personal sensitive attributes are frequently not collected. We notice a lack of approaches for mitigating bias in su…

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: 8 pages, 9 figures

    Journal ref: IJCAI 2022

  13. arXiv:2106.06363  [pdf, other]

    cs.CL cs.AI

    To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Due to the discrete nature of words, language GANs require to be optimized from rewards provided by discriminator networks, via reinforcement learning methods. This is a much harder setting than for continuous tasks, which enjoy gradient flows from discriminators to generators, usually leading to dramatic learning instabilities. However, we claim that this can be solved by making discriminator and…

    Submitted 11 June, 2021; originally announced June 2021.

  14. arXiv:2106.05566  [pdf, other]

    cs.LG cs.NE stat.ML

    A Neural Tangent Kernel Perspective of GANs

    Authors: Jean-Yves Franceschi, Emmanuel de Bézenac, Ibrahim Ayed, Mickaël Chen, Sylvain Lamprier, Patrick Gallinari

    Abstract: We propose a novel theoretical framework of analysis for Generative Adversarial Networks (GANs). We reveal a fundamental flaw of previous analyses which, by incorrectly modeling GANs' training scheme, are subject to ill-defined discriminator gradients. We overcome this issue which impedes a principled study of GAN training, solving it within our framework by taking into account the discriminator's…

    Submitted 7 November, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

    Journal ref: 39th International Conference on Machine Learning, International Machine Learning Society, Jul 2022, Baltimore, MD, United States. pp.6660-6704

  15. arXiv:2104.07555  [pdf, other]

    cs.CL

    Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic Evaluation

    Authors: Clément Rebuffel, Thomas Scialom, Laure Soulier, Benjamin Piwowarski, Sylvain Lamprier, Jacopo Staiano, Geoffrey Scoutheeten, Patrick Gallinari

    Abstract: QuestEval is a reference-less metric used in text-to-text tasks, that compares the generated summaries directly to the source text, by automatically asking and answering questions. Its adaptation to Data-to-Text tasks is not straightforward, as it requires multimodal Question Generation and Answering systems on the considered tasks, which are seldom available. To this purpose, we propose a method…

    Submitted 7 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP 2021

  16. Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network

    Authors: Lucas Schott, Hatem Hajri, Sylvain Lamprier

    Abstract: To improve policy robustness of deep reinforcement learning agents, a line of recent works focus on producing disturbances of the environment. Existing approaches of the literature to generate meaningful disturbances of the environment are adversarial reinforcement learning methods. These methods set the problem as a two-player game between the protagonist agent, which learns to perform a task in…

    Submitted 3 October, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: 8 pages, 8 figures

    Journal ref: 2022 International Joint Conference on Neural Networks (IJCNN), 2022, pp. 1-8

  17. arXiv:2103.12693  [pdf, other]

    cs.CL

    QuestEval: Summarization Asks for Fact-based Evaluation

    Authors: Thomas Scialom, Paul-Alexis Dray, Patrick Gallinari, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano, Alex Wang

    Abstract: Summarization evaluation remains an open research problem: current metrics such as ROUGE are known to be limited and to correlate poorly with human judgments. To alleviate this issue, recent work has proposed evaluation metrics which rely on question answering models to assess whether a summary contains all the relevant information in its source document. Though promising, the proposed approaches…

    Submitted 9 April, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: project page: https://github.com/recitalAI/QuestEval

  18. arXiv:2011.12423  [pdf, ps, other]

    cs.LG cs.CR cs.CV

    Stochastic sparse adversarial attacks

    Authors: Manon Césaire, Lucas Schott, Hatem Hajri, Sylvain Lamprier, Patrick Gallinari

    Abstract: This paper introduces stochastic sparse adversarial attacks (SSAA), standing as simple, fast and purely noise-based targeted and untargeted attacks of neural network classifiers (NNC). SSAA offer new examples of sparse (or $L_0$) attacks for which only few methods have been proposed previously. These attacks are devised by exploiting a small-time expansion idea widely used for Markov processes. Ex…

    Submitted 19 February, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Final version published at the ICTAI 2021 conference with a best student paper award. Codes are available through the link: https://github.com/hhajri/stochastic-sparse-adv-attacks

  19. arXiv:2009.03183  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    Learning Unbiased Representations via Rényi Minimization

    Authors: Vincent Grari, Oualid El Hajouji, Sylvain Lamprier, Marcin Detyniecki

    Abstract: In recent years, significant work has been done to include fairness constraints in the training objective of machine learning algorithms. Many state-of-the-art algorithms tackle this challenge by learning a fair representation which captures all the relevant information to predict the output Y while not containing any information about a sensitive attribute S. In this paper, we propose an adversar…

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: 23 pages, 4 figures

  20. arXiv:2008.13122  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    Adversarial Learning for Counterfactual Fairness

    Authors: Vincent Grari, Sylvain Lamprier, Marcin Detyniecki

    Abstract: In recent years, fairness has become an important topic in the machine learning research community. In particular, counterfactual fairness aims at building prediction models which ensure fairness at the most individual level. Rather than globally considering equity over the entire population, the idea is to imagine what any individual would look like with a variation of a given attribute of intere…

    Submitted 30 August, 2020; originally announced August 2020.

    Comments: 11 pages, 5 figures

  21. arXiv:2008.01352  [pdf, other]

    cs.LG cs.NE stat.ML

    PDE-Driven Spatiotemporal Disentanglement

    Authors: Jérémie Donà, Jean-Yves Franceschi, Sylvain Lamprier, Patrick Gallinari

    Abstract: A recent line of work in the machine learning community addresses the problem of predicting high-dimensional spatiotemporal phenomena by leveraging specific tools from the differential equations theory. Following this direction, we propose in this article a novel and general paradigm for this task based on a resolution method for partial differential equations: the separation of variables. This in…

    Submitted 23 March, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

    Journal ref: The Ninth International Conference on Learning Representations, International Conference on Representation Learning, May 2021, Vienne, Austria

  22. arXiv:2006.04643  [pdf, other]

    cs.CL cs.LG

    ColdGANs: Taming Language GANs with Cautious Sampling Strategies

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Training regimes based on Maximum Likelihood Estimation (MLE) suffer from known limitations, often leading to poorly generated text sequences. At the root of these limitations is the mismatch between training and inference, i.e. the so-called exposure bias, exacerbated by considering only the reference texts as correct, while in practice several alternative formulations could be as good. Generativ…

    Submitted 8 June, 2020; originally announced June 2020.

  23. arXiv:2004.14900  [pdf, other]

    cs.CL

    MLSUM: The Multilingual Summarization Corpus

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: We present MLSUM, the first large-scale MultiLingual SUMmarization dataset. Obtained from online newspapers, it contains 1.5M+ article/summary pairs in five different languages -- namely, French, German, Spanish, Russian, Turkish. Together with English newspapers from the popular CNN/Daily mail dataset, the collected data form a large scale multilingual dataset which can enable new research direct…

    Submitted 30 April, 2020; originally announced April 2020.

  24. arXiv:2002.10375  [pdf, other]

    cs.CL cs.LG

    Discriminative Adversarial Search for Abstractive Summarization

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: We introduce a novel approach for sequence decoding, Discriminative Adversarial Search (DAS), which has the desirable properties of alleviating the effects of exposure bias without requiring external metrics. Inspired by Generative Adversarial Networks (GANs), wherein a discriminator is used to improve the generator, our method differs from GANs in that the generator parameters are not updated at…

    Submitted 30 August, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  25. arXiv:2002.09219  [pdf, other]

    cs.CV cs.LG stat.ML

    Stochastic Latent Residual Video Prediction

    Authors: Jean-Yves Franceschi, Edouard Delasalles, Mickaël Chen, Sylvain Lamprier, Patrick Gallinari

    Abstract: Designing video prediction models that account for the inherent uncertainty of the future is challenging. Most works in the literature are based on stochastic image-autoregressive recurrent networks, which raises several performance and applicability issues. An alternative is to use fully latent temporal models which untie frame synthesis and temporal dynamics. However, no such model for stochasti…

    Submitted 7 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Journal ref: Thirty-seventh International Conference on Machine Learning, International Machine Learning Society, Jul 2020, Vienne, Austria. pp.89--102

  26. arXiv:1911.05369  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    Fair Adversarial Gradient Tree Boosting

    Authors: Vincent Grari, Boris Ruf, Sylvain Lamprier, Marcin Detyniecki

    Abstract: Fair classification has become an important topic in machine learning research. While most bias mitigation strategies focus on neural networks, we noticed a lack of work on fair classifiers based on decision trees even though they have proven very efficient. In an up-to-date comparison of state-of-the-art classification algorithms in tabular data, tree boosting outperforms deep learning. For this…

    Submitted 18 November, 2019; v1 submitted 13 November, 2019; originally announced November 2019.

  27. arXiv:1911.04929  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    Fairness-Aware Neural Réyni Minimization for Continuous Features

    Authors: Vincent Grari, Boris Ruf, Sylvain Lamprier, Marcin Detyniecki

    Abstract: The past few years have seen a dramatic rise of academic and societal interest in fair machine learning. While plenty of fair algorithms have been proposed recently to tackle this challenge for discrete variables, only a few ideas exist for continuous ones. The objective in this paper is to ensure some independence level between the outputs of regression models and any given continuous sensitive v…

    Submitted 12 November, 2019; originally announced November 2019.

  28. arXiv:1909.04985  [pdf, other]

    cs.CL cs.LG stat.ML

    Learning Dynamic Author Representations with Temporal Language Models

    Authors: Edouard Delasalles, Sylvain Lamprier, Ludovic Denoyer

    Abstract: Language models are at the heart of numerous works, notably in the text mining and information retrieval communities. These statistical models aim at extracting word distributions, from simple unigram models to recurrent approaches with latent variables that capture subtle dependencies in texts. However, those models are learned from word sequences only, and authors' identities, as well as publica…

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: International Conference on Data Mining, ICDM 2019

  29. arXiv:1909.01610  [pdf, ps, other]

    cs.CL cs.AI cs.IR

    Answers Unite! Unsupervised Metrics for Reinforced Summarization Models

    Authors: Thomas Scialom, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Abstractive summarization approaches based on Reinforcement Learning (RL) have recently been proposed to overcome classical likelihood maximization. RL enables to consider complex, possibly non-differentiable, metrics that globally assess the quality and relevance of the generated outputs. ROUGE, the most used summarization metric, is known to suffer from bias towards lexical similarity as well as…

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted at EMNLP 2019

  30. arXiv:1812.10962  [pdf, ps, other]

    cs.SI cs.LG stat.ML

    A Variational Topological Neural Model for Cascade-based Diffusion in Networks

    Authors: Sylvain Lamprier

    Abstract: Many works have been proposed in the literature to capture the dynamics of diffusion in networks. While some of them define graphical markovian models to extract temporal relationships between node infections in networks, others consider diffusion episodes as sequences of infections via recurrent neural models. In this paper we propose a model at the crossroads of these two extremes, which embeds…

    Submitted 28 December, 2018; originally announced December 2018.

  31. arXiv:1510.01562  [pdf, other]

    cs.IR cs.CL

    Parameterized Neural Network Language Models for Information Retrieval

    Authors: Benjamin Piwowarski, Sylvain Lamprier, Nicolas Despres

    Abstract: Information Retrieval (IR) models need to deal with two difficult issues, vocabulary mismatch and term dependencies. Vocabulary mismatch corresponds to the difficulty of retrieving relevant documents that do not contain exact query terms but semantically related terms. Term dependencies refers to the need of considering the relationship between the words of the query when estimating the relevance…

    Submitted 6 October, 2015; originally announced October 2015.

    ACM Class: H.3.3; I.2.6

  32. arXiv:1312.6169  [pdf, other]

    cs.LG cs.SI physics.soc-ph

    Learning Information Spread in Content Networks

    Authors: Cédric Lagnier, Simon Bourigault, Sylvain Lamprier, Ludovic Denoyer, Patrick Gallinari

    Abstract: We introduce a model for predicting the diffusion of content information on social media. When propagation is usually modeled on discrete graph structures, we introduce here a continuous diffusion model, where nodes in a diffusion cascade are projected onto a latent space with the property that their proximity in this space reflects the temporal diffusion process. We focus on the task of predictin…

    Submitted 2 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: 4 pages