Skip to main content

Showing 1–8 of 8 results for author: Bolland, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19825  [pdf, other

    cs.LG

    Reinforcement Learning for Efficient Design and Control Co-optimisation of Energy Systems

    Authors: Marine Cauz, Adrien Bolland, Nicolas Wyrsch, Christophe Ballif

    Abstract: The ongoing energy transition drives the development of decentralised renewable energy sources, which are heterogeneous and weather-dependent, complicating their integration into energy systems. This study tackles this issue by introducing a novel reinforcement learning (RL) framework tailored for the co-optimisation of design and control in energy systems. Traditionally, the integration of renewa… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Journal ref: AI for Science workshop at ICML 2024

  2. arXiv:2402.00162  [pdf, other

    cs.LG stat.ML

    Behind the Myth of Exploration in Policy Gradients

    Authors: Adrien Bolland, Gaspard Lambrechts, Damien Ernst

    Abstract: Policy-gradient algorithms are effective reinforcement learning methods for solving control problems with continuous state and action spaces. To compute near-optimal policies, it is essential in practice to include exploration terms in the learning objective. Although the effectiveness of these terms is usually justified by an intrinsic need to explore environments, we propose a novel analysis and… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  3. arXiv:2306.11488  [pdf, other

    cs.LG

    Informed POMDP: Leveraging Additional Information in Model-Based RL

    Authors: Gaspard Lambrechts, Adrien Bolland, Damien Ernst

    Abstract: In this work, we generalize the problem of learning through interaction in a POMDP by accounting for eventual additional information available at training time. First, we introduce the informed POMDP, a new learning paradigm offering a clear distinction between the information at training and the observation at execution. Next, we propose an objective that leverages this information for learning a… ▽ More

    Submitted 12 June, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: In Reinforcement Learning Conference, 2024. 10 pages, 22 pages total, 10 figures

  4. arXiv:2305.06851  [pdf, other

    cs.LG math.OC stat.ML

    Policy Gradient Algorithms Implicitly Optimize by Continuation

    Authors: Adrien Bolland, Gilles Louppe, Damien Ernst

    Abstract: Direct policy optimization in reinforcement learning is usually solved with policy-gradient algorithms, which optimize policy parameters via stochastic gradient ascent. This paper provides a new theoretical interpretation and justification of these algorithms. First, we formulate direct policy optimization in the optimization by continuation framework. The latter is a framework for optimizing nonc… ▽ More

    Submitted 21 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: In Transactions on Machine Learning Research (2023)

  5. arXiv:2208.03520  [pdf, other

    cs.LG stat.ML

    Recurrent networks, hidden states and beliefs in partially observable environments

    Authors: Gaspard Lambrechts, Adrien Bolland, Damien Ernst

    Abstract: Reinforcement learning aims to learn optimal policies from interaction with environments whose dynamics are unknown. Many methods rely on the approximation of a value function to derive near-optimal policies. In partially observable environments, these functions depend on the complete sequence of observations and past actions, called the history. In this work, we show empirically that recurrent ne… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 12 pages, 28 pages total, 20 figures. Transactions on Machine Learning Research (2022)

    Journal ref: Transactions on Machine Learning Research, 2022

  6. Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks

    Authors: Thibaut Théate, Antoine Wehenkel, Adrien Bolland, Gilles Louppe, Damien Ernst

    Abstract: The distributional reinforcement learning (RL) approach advocates for representing the complete probability distribution of the random return instead of only modelling its expectation. A distributional RL algorithm may be characterised by two main components, namely the representation of the distribution together with its parameterisation and the probability metric defining the loss. The present r… ▽ More

    Submitted 17 March, 2023; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Research paper accepted for publication in the peer-reviewed Neurocomputing journal edited by Elsevier

  7. arXiv:2006.01738  [pdf, other

    cs.LG stat.ML

    Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent

    Authors: Adrien Bolland, Ioannis Boukas, Mathias Berger, Damien Ernst

    Abstract: We consider the joint design and control of discrete-time stochastic dynamical systems over a finite time horizon. We formulate the problem as a multi-step optimization problem under uncertainty seeking to identify a system design and a control policy that jointly maximize the expected sum of rewards collected over the time horizon considered. The transition function, the reward function and the p… ▽ More

    Submitted 6 January, 2022; v1 submitted 2 June, 2020; originally announced June 2020.

    Journal ref: Journal of Artificial Intelligence Research 73 (2022) 117-171

  8. arXiv:2004.05940  [pdf, other

    q-fin.TR cs.AI cs.LG

    A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding

    Authors: Ioannis Boukas, Damien Ernst, Thibaut Théate, Adrien Bolland, Alexandre Huynen, Martin Buchwald, Christelle Wynants, Bertrand Cornélusse

    Abstract: The large integration of variable energy resources is expected to shift a large part of the energy exchanges closer to real-time, where more accurate forecasts are available. In this context, the short-term electricity markets and in particular the intraday market are considered a suitable trading floor for these exchanges to occur. A key component for the successful renewable energy sources integ… ▽ More

    Submitted 13 April, 2020; originally announced April 2020.