Skip to main content

Showing 1–3 of 3 results for author: Winqvist, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.01701  [pdf, ps, other

    cs.LG eess.SY

    Optimal Transport for Correctional Learning

    Authors: Rebecka Winqvist, Inês Lourenco, Francesco Quinzan, Cristian R. Rojas, Bo Wahlberg

    Abstract: The contribution of this paper is a generalized formulation of correctional learning using optimal transport, which is about how to optimally transport one mass distribution to another. Correctional learning is a framework developed to enhance the accuracy of parameter estimation processes by means of a teacher-student approach. In this framework, an expert agent, referred to as the teacher, modif… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  2. arXiv:2111.07818  [pdf, ps, other

    cs.LG eess.SY

    A Teacher-Student Markov Decision Process-based Framework for Online Correctional Learning

    Authors: Inês Lourenço, Rebecka Winqvist, Cristian R. Rojas, Bo Wahlberg

    Abstract: A classical learning setting typically concerns an agent/student who collects data, or observations, from a system in order to estimate a certain property of interest. Correctional learning is a type of cooperative teacher-student framework where a teacher, who has partial knowledge about the system, has the ability to observe and alter (correct) the observations received by the student in order t… ▽ More

    Submitted 29 March, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: 8 pages, 7 figures

  3. arXiv:2005.04112  [pdf, other

    stat.ML cs.LG eess.SY

    On Training and Evaluation of Neural Network Approaches for Model Predictive Control

    Authors: Rebecka Winqvist, Arun Venkitaraman, Bo Wahlberg

    Abstract: The contribution of this paper is a framework for training and evaluation of Model Predictive Control (MPC) implemented using constrained neural networks. Recent studies have proposed to use neural networks with differentiable convex optimization layers to implement model predictive controllers. The motivation is to replace real-time optimization in safety critical feedback control systems with le… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.