Skip to main content

Showing 1–42 of 42 results for author: Preux, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14956  [pdf, other

    cs.AI cs.LG

    Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning

    Authors: Hector Kohler, Quentin Delfosse, Riad Akrour, Kristian Kersting, Philippe Preux

    Abstract: Deep reinforcement learning agents are prone to goal misalignments. The black-box nature of their policies hinders the detection and correction of such misalignments, and the trust necessary for real-world deployment. So far, solutions learning interpretable policies are inefficient or require many human priors. We propose INTERPRETER, a fast distillation method producing INTerpretable Editable tR… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2404.10906  [pdf, other

    cs.AI cs.HC cs.LG cs.SC

    Towards a Research Community in Interpretable Reinforcement Learning: the InterpPol Workshop

    Authors: Hector Kohler, Quentin Delfosse, Paul Festor, Philippe Preux

    Abstract: Embracing the pursuit of intrinsically explainable reinforcement learning raises crucial questions: what distinguishes explainability from interpretability? Should explainable and interpretable agents be developed outside of domains where transparency is imperative? What advantages do interpretable policies offer over neural networks? How can we rigorously define and measure interpretability in po… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  3. arXiv:2402.16608  [pdf, other

    cs.CL cs.IR

    PAQA: Toward ProActive Open-Retrieval Question Answering

    Authors: Pierre Erbacher, Jian-Yun Nie, Philippe Preux, Laure Soulier

    Abstract: Conversational systems have made significant progress in generating natural language responses. However, their potential as conversational search systems is currently limited due to their passive role in the information-seeking process. One major limitation is the scarcity of datasets that provide labelled ambiguous questions along with a supporting corpus of documents and relevant clarifying ques… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  4. arXiv:2402.03337  [pdf, other

    cs.RO cs.AI cs.LG

    Reinforcement-learning robotic sailboats: simulator and preliminary results

    Authors: Eduardo Charles Vasconcellos, Ronald M Sampaio, André P D Araújo, Esteban Walter Gonzales Clua, Philippe Preux, Raphael Guerra, Luiz M G Gonçalves, Luis Martí, Hernan Lira, Nayat Sanchez-Pi

    Abstract: This work focuses on the main challenges and problems in develo** a virtual oceanic environment reproducing real experiments using Unmanned Surface Vehicles (USV) digital twins. We introduce the key features for building virtual worlds, considering using Reinforcement Learning (RL) agents for autonomous navigation and control. With this in mind, the main problems concern the definition of the si… ▽ More

    Submitted 16 January, 2024; originally announced February 2024.

    Journal ref: NeurIPS 2023 Workshop on Robot Learning Workshop: Pretraining, Fine-Tuning, and Generalization with Large Scale Models, Dec 2023, New Orelans, United States

  5. arXiv:2311.06119  [pdf, other

    cs.IR

    Augmenting Ad-Hoc IR Dataset for Interactive Conversational Search

    Authors: Pierre Erbacher, Jian-Yun Nie, Philippe Preux, Laure Soulier

    Abstract: A peculiarity of conversational search systems is that they involve mixed-initiatives such as system-generated query clarifying questions. Evaluating those systems at a large scale on the end task of IR is very challenging, requiring adequate datasets containing such interactions. However, current datasets only focus on either traditional ad-hoc IR tasks or query clarification tasks, the latter be… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  6. arXiv:2309.13365  [pdf, other

    cs.LG cs.AI

    Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs

    Authors: Hector Kohler, Riad Akrour, Philippe Preux

    Abstract: Interpretability of AI models allows for user safety checks to build trust in such AIs. In particular, Decision Trees (DTs) provide a global look at the learned model and transparently reveal which features of the input are critical for making a decision. However, interpretability is hindered if the DT is too large. To learn compact trees, a recent Reinforcement Learning (RL) framework has been pr… ▽ More

    Submitted 21 January, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: To be included in an other submission. arXiv admin note: text overlap with arXiv:2304.05839

  7. arXiv:2309.12701  [pdf, other

    cs.LG

    Interpretable Decision Tree Search as a Markov Decision Process

    Authors: Hector Kohler, Riad Akrour, Philippe Preux

    Abstract: Finding an optimal decision tree for a supervised learning task is a challenging combinatorial problem to solve at scale. It was recently proposed to frame the problem as a Markov Decision Problem (MDP) and use deep reinforcement learning to tackle scaling. Unfortunately, these methods are not competitive with the current branch-and-bound state-of-the-art. We propose instead to scale the resolutio… ▽ More

    Submitted 13 June, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

  8. Development and validation of an interpretable machine learning-based calculator for predicting 5-year weight trajectories after bariatric surgery: a multinational retrospective cohort SOPHIA study

    Authors: Patrick Saux, Pierre Bauvin, Violeta Raverdy, Julien Teigny, Hélène Verkindt, Tomy Soumphonphakdy, Maxence Debert, Anne Jacobs, Daan Jacobs, Valerie Monpellier, Phong Ching Lee, Chin Hong Lim, Johanna C Andersson-Assarsson, Lena Carlsson, Per-Arne Svensson, Florence Galtier, Guelareh Dezfoulian, Mihaela Moldovanu, Severine Andrieux, Julien Couster, Marie Lepage, Erminia Lembo, Ornella Verrastro, Maud Robert, Paulina Salminen , et al. (9 additional authors not shown)

    Abstract: Background Weight loss trajectories after bariatric surgery vary widely between individuals, and predicting weight loss before the operation remains challenging. We aimed to develop a model using machine learning to provide individual preoperative prediction of 5-year weight loss trajectories after surgery. Methods In this multinational retrospective observational study we enrolled adult participa… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: The Lancet Digital Health, 2023

  9. arXiv:2306.10882  [pdf, other

    cs.LG stat.ML

    AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents

    Authors: Timothée Mathieu, Riccardo Della Vecchia, Alena Shilova, Matheus Medeiros Centa, Hector Kohler, Odalric-Ambrym Maillard, Philippe Preux

    Abstract: Recently, the scientific community has questioned the statistical reproducibility of many empirical results, especially in the field of machine learning. To solve this reproducibility crisis, we propose a theoretically sound methodology to compare the overall performance of multiple algorithms with stochastic returns. We exemplify our methodology in Deep RL. Indeed, the performance of one executio… ▽ More

    Submitted 27 January, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

  10. arXiv:2304.05839  [pdf, other

    cs.LG cs.AI

    Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning

    Authors: Hector Kohler, Riad Akrour, Philippe Preux

    Abstract: Interpretability of AI models allows for user safety checks to build trust in these models. In particular, decision trees (DTs) provide a global view on the learned model and clearly outlines the role of the features that are critical to classify a given data. However, interpretability is hindered if the DT is too large. To learn compact trees, a Reinforcement Learning (RL) framework has been rece… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  11. arXiv:2210.08503  [pdf, other

    cs.LG

    Entropy Regularized Reinforcement Learning with Cascading Networks

    Authors: Riccardo Della Vecchia, Alena Shilova, Philippe Preux, Riad Akrour

    Abstract: Deep Reinforcement Learning (Deep RL) has had incredible achievements on high dimensional problems, yet its learning process remains unstable even on the simplest tasks. Deep RL uses neural networks as function approximators. These neural models are largely inspired by developments in the (un)supervised machine learning community. Compared to these learning frameworks, one of the major difficultie… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

  12. arXiv:2209.09882  [pdf, other

    cs.LG cs.AI

    Soft Action Priors: Towards Robust Policy Transfer

    Authors: Matheus Centa, Philippe Preux

    Abstract: Despite success in many challenging problems, reinforcement learning (RL) is still confronted with sample inefficiency, which can be mitigated by introducing prior knowledge to agents. However, many transfer techniques in reinforcement learning make the limiting assumption that the teacher is an expert. In this paper, we use the action prior from the Reinforcement Learning as Inference framework -… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Preprint

  13. arXiv:2207.03270  [pdf, other

    cs.AI

    gym-DSSAT: a crop model turned into a Reinforcement Learning environment

    Authors: Romain Gautron, Emilio J. Padrón, Philippe Preux, Julien Bigot, Odalric-Ambrym Maillard, David Emukpere

    Abstract: Addressing a real world sequential decision problem with Reinforcement Learning (RL) usually starts with the use of a simulated environment that mimics real conditions. We present a novel open source RL environment for realistic crop management tasks. gym-DSSAT is a gym interface to the Decision Support System for Agrotechnology Transfer (DSSAT), a high fidelity crop simulator. DSSAT has been deve… ▽ More

    Submitted 27 September, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Report number: Report-no: Inria RR-9460

  14. arXiv:2110.10632  [pdf, other

    cs.LG cs.AI

    More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences

    Authors: Toby Johnstone, Nathan Grinsztajn, Johan Ferret, Philippe Preux

    Abstract: Incorporating prior knowledge in reinforcement learning algorithms is mainly an open question. Even when insights about the environment dynamics are available, reinforcement learning is traditionally used in a tabula rasa setting and must explore and learn everything from scratch. In this paper, we consider the problem of exploiting priors about action sequence equivalence: that is, when different… ▽ More

    Submitted 7 November, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

  15. arXiv:2106.07360  [pdf, other

    cs.SI cs.LG

    Low-Rank Projections of GCNs Laplacian

    Authors: Nathan Grinsztajn, Philippe Preux, Edouard Oyallon

    Abstract: In this work, we study the behavior of standard models for community detection under spectral manipulations. Through various ablation experiments, we evaluate the impact of bandpass filtering on the performance of a GCN: we empirically show that most of the necessary and used information for nodes classification is contained in the low-frequency domain, and thus contrary to images, high frequencie… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Journal ref: ICLR 2021 Workshop GTRL, 2021, Online, France

  16. arXiv:2106.05875  [pdf, other

    cs.LG

    Interferometric Graph Transform for Community Labeling

    Authors: Nathan Grinsztajn, Louis Leconte, Philippe Preux, Edouard Oyallon

    Abstract: We present a new approach for learning unsupervised node representations in community graphs. We significantly extend the Interferometric Graph Transform (IGT) to community labeling: this non-linear operator iteratively extracts features that take advantage of the graph topology through demodulation operations. An unsupervised feature extraction step cascades modulus non-linearity with linear oper… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  17. arXiv:2106.04480  [pdf, other

    cs.LG cs.AI

    There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning

    Authors: Nathan Grinsztajn, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist

    Abstract: We propose to learn to distinguish reversible from irreversible actions for better informed decision-making in Reinforcement Learning (RL). From theoretical considerations, we show that approximate reversibility can be learned through a simple surrogate task: ranking randomly sampled trajectory events in chronological order. Intuitively, pairs of events that are always observed in the same order a… ▽ More

    Submitted 29 October, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

  18. arXiv:2105.09992  [pdf, other

    cs.LG

    Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness

    Authors: Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin

    Abstract: Sparse rewards are double-edged training signals in reinforcement learning: easy to design but hard to optimize. Intrinsic motivation guidances have thus been developed toward alleviating the resulting exploration problem. They usually incentivize agents to look for new states through novelty signals. Yet, such methods encourage exhaustive exploration of the state space rather than focusing on the… ▽ More

    Submitted 31 May, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: Accepted at Internationnal Joint Conference on Artificial Intelligence (IJCAI'21) and Self-Supervision for Reinforcement Learning Workshop (SSL-RL @ICLR'21)

  19. arXiv:2102.04376  [pdf, other

    cs.LG cs.AI stat.ML

    Adversarially Guided Actor-Critic

    Authors: Yannis Flet-Berliac, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist

    Abstract: Despite definite success in deep reinforcement learning problems, actor-critic algorithms are still confronted with sample inefficiency in complex environments, particularly in tasks where efficient exploration is a bottleneck. These methods consider a policy (the actor) and a value function (the critic) whose respective losses are built using different motivations and approaches. This paper intro… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted at ICLR 2021

  20. arXiv:2011.04333  [pdf, other

    cs.AI

    Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling

    Authors: Nathan Grinsztajn, Olivier Beaumont, Emmanuel Jeannot, Philippe Preux

    Abstract: In practice, it is quite common to face combinatorial optimization problems which contain uncertainty along with non-determinism and dynamicity. These three properties call for appropriate algorithms; reinforcement learning (RL) is dealing with them in a very natural way. Today, despite some efforts, most real-life combinatorial optimization problems remain out of the reach of reinforcement learni… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

  21. arXiv:2010.04440  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Learning Value Functions in Deep Policy Gradients using Residual Variance

    Authors: Yannis Flet-Berliac, Reda Ouhamma, Odalric-Ambrym Maillard, Philippe Preux

    Abstract: Policy gradient algorithms have proven to be successful in diverse decision making and control tasks. However, these methods suffer from high sample complexity and instability issues. In this paper, we address these challenges by providing a different approach for training the critic in the actor-critic framework. Our work builds on recent studies indicating that traditional actor-critic algorithm… ▽ More

    Submitted 15 March, 2021; v1 submitted 9 October, 2020; originally announced October 2020.

    Comments: Accepted at ICLR 2021

  22. arXiv:2008.03127  [pdf, other

    eess.AS cs.LG cs.SD

    A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning

    Authors: Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin

    Abstract: Speaker recognition is a well known and studied task in the speech processing domain. It has many applications, either for security or speaker adaptation of personal devices. In this paper, we present a new paradigm for automatic speaker recognition that we call Interactive Speaker Recognition (ISR). In this paradigm, the recognition system aims to incrementally build a representation of the speak… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

  23. arXiv:1910.02078  [pdf, other

    cs.LG stat.ML

    I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action

    Authors: Mathieu Seurin, Philippe Preux, Olivier Pietquin

    Abstract: The use of Reinforcement Learning (RL) is still restricted to simulation or to enhance human-operated systems through recommendations. Real-world environments (e.g. industrial robots or power grids) are generally designed with safety constraints in mind implemented in the shape of valid actions masks or contingency controllers. For example, the range of motion and the angles of the motors of a rob… ▽ More

    Submitted 13 August, 2020; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: Accepted at Internationnal Joint Conference on Neural Networks (IJCNN'2020)

  24. arXiv:1909.11939  [pdf, other

    cs.LG cs.AI stat.ML

    MERL: Multi-Head Reinforcement Learning

    Authors: Yannis Flet-Berliac, Philippe Preux

    Abstract: A common challenge in reinforcement learning is how to convert the agent's interactions with an environment into fast and robust learning. For instance, earlier work makes use of domain knowledge to improve existing reinforcement learning algorithms in complex tasks. While promising, previously acquired knowledge is often costly and challenging to scale up. Instead, we decide to consider problem k… ▽ More

    Submitted 31 March, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Deep Reinforcement Learning Workshop, NeurIPS 2019

  25. Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL

    Authors: Yannis Flet-Berliac, Philippe Preux

    Abstract: In reinforcement learning, policy gradient algorithms optimize the policy directly and rely on sampling efficiently an environment. Nevertheless, while most sampling procedures are based on direct policy sampling, self-performance measures could be used to improve such sampling prior to each policy update. Following this line of thought, we introduce SAUNA, a method where non-informative transitio… ▽ More

    Submitted 20 November, 2020; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: Accepted at IJCAI 2020

  26. A Large-Scale Study of Call Graph-based Impact Prediction using Mutation Testing

    Authors: Vincenzo Musco, Martin Monperrus, Philippe Preux

    Abstract: In software engineering, impact analysis involves predicting the software elements (e.g., modules, classes, methods) potentially impacted by a change in the source code. Impact analysis is required to optimize the testing effort. In this paper, we propose an evaluation technique to predict impact propagation. Based on 10 open-source Java projects and 5 classical mutation operators, we create 17,00… ▽ More

    Submitted 15 December, 2018; originally announced December 2018.

    Journal ref: Software Quality Journal, Volume 25, Issue 3, pp 921-950, 2017

  27. arXiv:1808.04446  [pdf, other

    cs.CV cs.CL cs.LG stat.ML

    Visual Reasoning with Multi-hop Feature Modulation

    Authors: Florian Strub, Mathieu Seurin, Ethan Perez, Harm de Vries, Jérémie Mary, Philippe Preux, Aaron Courville, Olivier Pietquin

    Abstract: Recent breakthroughs in computer vision and natural language processing have spurred interest in challenging multi-modal tasks such as visual question-answering and visual dialogue. For such tasks, one successful approach is to condition image-based convolutional network computation on language via Feature-wise Linear Modulation (FiLM) layers, i.e., per-channel scaling and shifting. We propose to… ▽ More

    Submitted 12 October, 2018; v1 submitted 3 August, 2018; originally announced August 2018.

    Comments: In Proc of ECCV 2018

  28. arXiv:1807.09142  [pdf, other

    cs.IR cs.LG stat.ML

    Recurrent Neural Networks for Long and Short-Term Sequential Recommendation

    Authors: Kiewan Villatel, Elena Smirnova, Jérémie Mary, Philippe Preux

    Abstract: Recommender systems objectives can be broadly characterized as modeling user preferences over short-or long-term time horizon. A large body of previous research studied long-term recommendation through dimensionality reduction techniques applied to the historical user-item interactions. A recently introduced session-based recommendation setting highlighted the importance of modeling short-term use… ▽ More

    Submitted 23 July, 2018; originally announced July 2018.

  29. A generative model for sparse, evolving digraphs

    Authors: Georgios Papoudakis, Philippe Preux, Martin Monperrus

    Abstract: Generating graphs that are similar to real ones is an open problem, while the similarity notion is quite elusive and hard to formalize. In this paper, we focus on sparse digraphs and propose SDG, an algorithm that aims at generating graphs similar to real ones. Since real graphs are evolving and this evolution is important to study in order to understand the underlying dynamical system, we tackle… ▽ More

    Submitted 17 October, 2017; originally announced October 2017.

    Journal ref: 6th International Conference on Complex Networks and their applications, Nov 2017, Lyon, France

  30. Correctness Attraction: A Study of Stability of Software Behavior Under Runtime Perturbation

    Authors: Benjamin Danglot, Philippe Preux, Benoit Baudry, Martin Monperrus

    Abstract: Can the execution of a software be perturbed without breaking the correctness of the output? In this paper, we devise a novel protocol to answer this rarely investigated question. In an experimental study, we observe that many perturbations do not break the correctness in ten subject programs. We call this phenomenon ``correctness attraction''. The uniqueness of this protocol is that it considers… ▽ More

    Submitted 30 May, 2017; v1 submitted 28 November, 2016; originally announced November 2016.

    Journal ref: Empirical Software Engineering, Springer Verlag, 2017

  31. A Learning Algorithm for Change Impact Prediction

    Authors: Vincenzo Musco, Antonin Carette, Martin Monperrus, Philippe Preux

    Abstract: Change impact analysis consists in predicting the impact of a code change in a software application. In this paper, we take a learning perspective on change impact analysis and consider the problem formulated as follows. The artifacts that are considered are methods of object-oriented software, the change under study is a change in the code of the method, the impact is the test methods that fail b… ▽ More

    Submitted 6 May, 2018; v1 submitted 23 December, 2015; originally announced December 2015.

    Comments: 5th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, 2016

    Journal ref: Proceedings of the 5th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, 2016

  32. arXiv:1510.08231  [pdf, other

    cs.LG stat.ML

    Operator-valued Kernels for Learning from Functional Response Data

    Authors: Hachem Kadri, Emmanuel Duflos, Philippe Preux, Stéphane Canu, Alain Rakotomamonjy, Julien Audiffren

    Abstract: In this paper we consider the problems of supervised classification and regression in the case where attributes and labels are functions: a data is represented by a set of functions, and the label is also a function. We focus on the use of reproducing kernel Hilbert space theory to learn from such functional data. Basic concepts and properties of kernel-based learning are extended to include the e… ▽ More

    Submitted 2 November, 2016; v1 submitted 28 October, 2015; originally announced October 2015.

    Comments: in Journal of Machine Learning Research (JMLR), 2016

    Journal ref: Journal of Machine Learning Research 17 (2016) 1-54

  33. arXiv:1410.7921  [pdf, other

    cs.SE

    A Generative Model of Software Dependency Graphs to Better Understand Software Evolution

    Authors: Vincenzo Musco, Martin Monperrus, Philippe Preux

    Abstract: Software systems are composed of many interacting elements. A natural way to abstract over software systems is to model them as graphs. In this paper we consider software dependency graphs of object-oriented software and we study one topological property: the degree distribution. Based on the analysis of ten software systems written in Java, we show that there exists completely different systems t… ▽ More

    Submitted 10 April, 2017; v1 submitted 29 October, 2014; originally announced October 2014.

  34. arXiv:1405.7544  [pdf, other

    cs.IR

    Cold-start Problems in Recommendation Systems via Contextual-bandit Algorithms

    Authors: Hai Thanh Nguyen, Jérémie Mary, Philippe Preux

    Abstract: In this paper, we study a cold-start problem in recommendation systems where we have completely new users entered the systems. There is not any interaction or feedback of the new users with the systems previoustly, thus no ratings are available. Trivial approaches are to select ramdom items or the most popular ones to recommend to the new users. However, these methods perform poorly in many case.… ▽ More

    Submitted 29 May, 2014; originally announced May 2014.

  35. arXiv:1405.3536  [pdf, other

    stat.ML cs.LG

    Improving offline evaluation of contextual bandit algorithms via bootstrap** techniques

    Authors: Olivier Nicol, Jérémie Mary, Philippe Preux

    Abstract: In many recommendation applications such as news recommendation, the items that can be rec- ommended come and go at a very fast pace. This is a challenge for recommender systems (RS) to face this setting. Online learning algorithms seem to be the most straight forward solution. The contextual bandit framework was introduced for that very purpose. In general the evaluation of a RS is a critical iss… ▽ More

    Submitted 14 May, 2014; originally announced May 2014.

    Journal ref: International Conference on Machine Learning 32 (2014)

  36. arXiv:1301.2656  [pdf, ps, other

    stat.ML cs.LG

    Multiple functional regression with both discrete and continuous covariates

    Authors: Hachem Kadri, Philippe Preux, Emmanuel Duflos, Stéphane Canu

    Abstract: In this paper we present a nonparametric method for extending functional regression methodology to the situation where more than one functional covariate is used to predict a functional response. Borrowing the idea from Kadri et al. (2010a), the method, which support mixed discrete and continuous explanatory variables, is based on estimating a function-valued function in reproducing kernel Hilbert… ▽ More

    Submitted 12 January, 2013; originally announced January 2013.

    Journal ref: 2nd International Workshop on Functional and Operatorial Statistics (IWFOS), Santander : Spain (2011)

  37. arXiv:1301.2655  [pdf, other

    cs.LG stat.ML

    Functional Regularized Least Squares Classi cation with Operator-valued Kernels

    Authors: Hachem Kadri, Asma Rabaoui, Philippe Preux, Emmanuel Duflos, Alain Rakotomamonjy

    Abstract: Although operator-valued kernels have recently received increasing interest in various machine learning and functional data analysis problems such as multi-task learning or functional regression, little attention has been paid to the understanding of their associated feature spaces. In this paper, we explore the potential of adopting an operator-valued kernel feature space perspective for the anal… ▽ More

    Submitted 12 January, 2013; originally announced January 2013.

    Journal ref: 28th International Conference on Machine Learning (ICML), Seattle : United States (2011)

  38. arXiv:1205.2171  [pdf, other

    stat.ML cs.LG

    A Generalized Kernel Approach to Structured Output Learning

    Authors: Hachem Kadri, Mohammad Ghavamzadeh, Philippe Preux

    Abstract: We study the problem of structured output learning from a regression perspective. We first provide a general formulation of the kernel dependency estimation (KDE) problem using operator-valued kernels. We show that some of the existing formulations of this problem are special cases of our framework. We then propose a covariance-based operator-valued kernel that allows us to take into account the s… ▽ More

    Submitted 15 July, 2015; v1 submitted 10 May, 2012; originally announced May 2012.

    Comments: in International Conference on Machine Learning (ICML), Jun 2013, Atlanta, United States. 2013

    Report number: RR-7956

  39. arXiv:1203.1596  [pdf, ps, other

    stat.ML cs.LG

    Multiple Operator-valued Kernel Learning

    Authors: Hachem Kadri, Alain Rakotomamonjy, Francis Bach, Philippe Preux

    Abstract: Positive definite operator-valued kernels generalize the well-known notion of reproducing kernels, and are naturally adapted to multi-output learning situations. This paper addresses the problem of learning a finite linear combination of infinite-dimensional operator-valued kernels which are suitable for extending functional data analysis methods to nonlinear contexts. We study this problem in the… ▽ More

    Submitted 14 June, 2012; v1 submitted 7 March, 2012; originally announced March 2012.

    Comments: No. RR-7900 (2012)

  40. arXiv:1203.0203  [pdf, other

    cs.LG stat.ML

    Fast Reinforcement Learning with Large Action Sets using Error-Correcting Output Codes for MDP Factorization

    Authors: Gabriel Dulac-Arnold, Ludovic Denoyer, Philippe Preux, Patrick Gallinari

    Abstract: The use of Reinforcement Learning in real-world scenarios is strongly limited by issues of scale. Most RL learning algorithms are unable to deal with problems composed of hundreds or sometimes even dozens of possible actions, and therefore cannot be applied to many real-world problems. We consider the RL problem in the supervised classification framework where the optimal policy is obtained throug… ▽ More

    Submitted 29 February, 2012; originally announced March 2012.

    MSC Class: 68T05

  41. Datum-Wise Classification: A Sequential Approach to Sparsity

    Authors: Gabriel Dulac-Arnold, Ludovic Denoyer, Philippe Preux, Patrick Gallinari

    Abstract: We propose a novel classification technique whose aim is to select an appropriate representation for each datapoint, in contrast to the usual approach of selecting a representation encompassing the whole dataset. This datum-wise representation is found by using a sparsity inducing empirical risk, which is a relaxation of the standard L 0 regularized risk. The classification problem is modeled as a… ▽ More

    Submitted 29 August, 2011; originally announced August 2011.

    Comments: ECML2011

    Journal ref: Lecture Notes in Computer Science, 2011, Volume 6911/2011, 375-390

  42. arXiv:cs/0611145  [pdf, ps, other

    cs.LG

    A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

    Authors: Manuel Loth, Philippe Preux

    Abstract: This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(lambda), LSTD(lambda), iLSTD, residual-gradient TD. It is asserted that they all consist in minimizing a gradient function and differ by the form of this function and their means of minimizing it. Two new schemes are introduced… ▽ More

    Submitted 28 November, 2006; originally announced November 2006.

    Journal ref: Dans European Symposium on Artificial Neural Networks (2006)