Showing 1–3 of 3 results for author: Massiani, P

  1. arXiv:2406.06101  [pdf, ps, other]

    cs.LG stat.ML

    On the Consistency of Kernel Methods with Dependent Observations

    Authors: Pierre-François Massiani, Sebastian Trimpe, Friedrich Solowjow

    Abstract: The consistency of a learning method is usually established under the assumption that the observations are a realization of an independent and identically distributed (i.i.d.) or mixing process. Yet, kernel methods such as support vector machines (SVMs), Gaussian processes, or conditional kernel mean embeddings (CKMEs) all give excellent performance under sampling schemes that are obviously non-i.…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 26 pages, 1 figure

  2. Data-Driven Observability Analysis for Nonlinear Stochastic Systems

    Authors: Pierre-François Massiani, Mona Buisson-Fenet, Friedrich Solowjow, Florent Di Meglio, Sebastian Trimpe

    Abstract: Distinguishability and, by extension, observability are key properties of dynamical systems. Establishing these properties is challenging, especially when no analytical model is available and they are to be inferred directly from measurement data. The presence of noise further complicates this analysis, as standard notions of distinguishability are tailored to deterministic systems. We build on di…

    Submitted 7 June, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 9 pages, 3 figures

    Journal ref: IEEE Transactions on Automatic Control 69 (2023) 4042 -- 4049

  3. arXiv:2105.12204  [pdf, other]

    eess.SY cs.LG cs.RO

    Safe Value Functions

    Authors: Pierre-François Massiani, Steve Heim, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Safety constraints and optimality are important, but sometimes conflicting criteria for controllers. Although these criteria are often solved separately with different tools to maintain formal guarantees, it is also common practice in reinforcement learning to simply modify reward functions by penalizing failures, with the penalty treated as a mere heuristic. We rigorously examine the relationship…

    Submitted 1 December, 2022; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: 16 pages, 6 figures. Accepted for publication in: Transactions on Automatic Control, special issue on Learning and Control

    Journal ref: IEEE Transactions on Automatic Control 68, Issue 5 (2023) 2743 -- 2757