Skip to main content

Showing 1–13 of 13 results for author: Ziemann, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09030  [pdf, other

    eess.SY cs.LG

    Active Learning for Control-Oriented Identification of Nonlinear Systems

    Authors: Bruce D. Lee, Ingvar Ziemann, George J. Pappas, Nikolai Matni

    Abstract: Model-based reinforcement learning is an effective approach for controlling an unknown system. It is based on a longstanding pipeline familiar to the control community in which one performs experiments on the environment to collect a dataset, uses the resulting dataset to identify a model of the system, and finally performs control synthesis using the identified model. As interacting with the syst… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  2. arXiv:2404.07937  [pdf, ps, other

    math.ST cs.LG eess.SY stat.ML

    Rate-Optimal Non-Asymptotics for the Quadratic Prediction Error Method

    Authors: Charis Stamouli, Ingvar Ziemann, George J. Pappas

    Abstract: We study the quadratic prediction error method -- i.e., nonlinear least squares -- for a class of time-varying parametric predictor models satisfying a certain identifiability condition. While this method is known to asymptotically achieve the optimal rate for a wide range of problems, there have been no non-asymptotic results matching these optimal rates outside of a select few, typically linear,… ▽ More

    Submitted 15 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 38 pages, added acknowledgements

  3. arXiv:2402.05928  [pdf, ps, other

    cs.LG stat.ML

    Sharp Rates in Dependent Learning Theory: Avoiding Sample Size Deflation for the Square Loss

    Authors: Ingvar Ziemann, Stephen Tu, George J. Pappas, Nikolai Matni

    Abstract: In this work, we study statistical learning with dependent ($β$-mixing) data and square loss in a hypothesis class $\mathscr{F}\subset L_{Ψ_p}$ where $Ψ_p$ is the norm $\|f\|_{Ψ_p} \triangleq \sup_{m\geq 1} m^{-1/p} \|f\|_{L^m} $ for some $p\in [2,\infty]$. Our inquiry is motivated by the search for a sharp noise interaction term, or variance proxy, in learning with dependent data. Absent any real… ▽ More

    Submitted 12 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2309.03873  [pdf, ps, other

    eess.SY cs.LG stat.ML

    A Tutorial on the Non-Asymptotic Theory of System Identification

    Authors: Ingvar Ziemann, Anastasios Tsiamis, Bruce Lee, Yassir Jedra, Nikolai Matni, George J. Pappas

    Abstract: This tutorial serves as an introduction to recently developed non-asymptotic methods in the theory of -- mainly linear -- system identification. We emphasize tools we deem particularly useful for a range of problems in this domain, such as the covering technique, the Hanson-Wright Inequality and the method of self-normalized martingales. We then employ these tools to give streamlined proofs of the… ▽ More

    Submitted 16 June, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

  5. arXiv:2305.11165  [pdf, ps, other

    cs.LG math.ST stat.ML

    The noise level in linear regression with dependent data

    Authors: Ingvar Ziemann, Stephen Tu, George J. Pappas, Nikolai Matni

    Abstract: We derive upper bounds for random design linear regression with dependent ($β$-mixing) data absent any realizability assumptions. In contrast to the strictly realizable martingale noise regime, no sharp instance-optimal non-asymptotics are available in the literature. Up to constant factors, our analysis correctly recovers the variance term predicted by the Central Limit Theorem -- the noise level… ▽ More

    Submitted 27 October, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  6. arXiv:2212.09508  [pdf, ps, other

    eess.SY cs.LG math.PR

    A note on the smallest eigenvalue of the empirical covariance of causal Gaussian processes

    Authors: Ingvar Ziemann

    Abstract: We present a simple proof for bounding the smallest eigenvalue of the empirical covariance in a causal Gaussian process. Along the way, we establish a one-sided tail inequality for Gaussian quadratic forms using a causal decomposition. Our proof only uses elementary facts about the Gaussian distribution and the union bound. We conclude with an example in which we provide a performance guarantee fo… ▽ More

    Submitted 27 October, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

  7. arXiv:2209.05423  [pdf, other

    eess.SY cs.LG math.OC stat.ML

    Statistical Learning Theory for Control: A Finite Sample Perspective

    Authors: Anastasios Tsiamis, Ingvar Ziemann, Nikolai Matni, George J. Pappas

    Abstract: This tutorial survey provides an overview of recent non-asymptotic advances in statistical learning theory as relevant to control and system identification. While there has been substantial progress across all areas of control, the theory is most well-developed when it comes to linear system identification and learning for the linear quadratic regulator, which are the focus of this manuscript. Fro… ▽ More

    Submitted 27 April, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: Survey Paper, Submitted to Control Systems Magazine. Second version contains additional motivation for finite sample statistics and more detailed comparison with classical literature

  8. arXiv:2206.08269  [pdf, other

    cs.LG stat.ML

    Learning with little mixing

    Authors: Ingvar Ziemann, Stephen Tu

    Abstract: We study square loss in a realizable time-series framework with martingale difference noise. Our main result is a fast rate excess risk bound which shows that whenever a trajectory hypercontractivity condition holds, the risk of the least-squares estimator on dependent data matches the iid rate order-wise after a burn-in time. In comparison, many existing results in learning from dependent data ha… ▽ More

    Submitted 13 June, 2024; v1 submitted 16 June, 2022; originally announced June 2022.

  9. arXiv:2206.06863  [pdf, other

    math.OC cs.LG

    How are policy gradient methods affected by the limits of control?

    Authors: Ingvar Ziemann, Anastasios Tsiamis, Henrik Sandberg, Nikolai Matni

    Abstract: We study stochastic policy gradient methods from the perspective of control-theoretic limitations. Our main result is that ill-conditioned linear systems in the sense of Doyle inevitably lead to noisy gradient estimates. We also give an example of a class of stable systems in which policy gradient methods suffer from the curse of dimensionality. Our results apply to both state feedback and partial… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  10. arXiv:2205.14035  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    Learning to Control Linear Systems can be Hard

    Authors: Anastasios Tsiamis, Ingvar Ziemann, Manfred Morari, Nikolai Matni, George J. Pappas

    Abstract: In this paper, we study the statistical difficulty of learning to control linear systems. We focus on two standard benchmarks, the sample complexity of stabilization, and the regret of the online learning of the Linear Quadratic Regulator (LQR). Prior results state that the statistical difficulty for both benchmarks scales polynomially with the system state dimension up to system-theoretic quantit… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to COLT 2022

  11. arXiv:2202.08311  [pdf, other

    cs.LG math.OC stat.ML

    Single Trajectory Nonparametric Learning of Nonlinear Dynamics

    Authors: Ingvar Ziemann, Henrik Sandberg, Nikolai Matni

    Abstract: Given a single trajectory of a dynamical system, we analyze the performance of the nonparametric least squares estimator (LSE). More precisely, we give nonasymptotic expected $l^2$-distance bounds between the LSE and the true regression function, where expectation is evaluated on a fresh, counterfactual, trajectory. We leverage recently developed information-theoretic methods to establish the opti… ▽ More

    Submitted 19 February, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  12. arXiv:2201.01680  [pdf, other

    cs.LG math.OC stat.ML

    Regret Lower Bounds for Learning Linear Quadratic Gaussian Systems

    Authors: Ingvar Ziemann, Henrik Sandberg

    Abstract: TWe establish regret lower bounds for adaptively controlling an unknown linear Gaussian system with quadratic costs. We combine ideas from experiment design, estimation theory and a perturbation bound of certain information matrices to derive regret lower bounds exhibiting scaling on the order of magnitude $\sqrt{T}$ in the time horizon $T$. Our bounds accurately capture the role of control-theore… ▽ More

    Submitted 12 June, 2024; v1 submitted 5 January, 2022; originally announced January 2022.

  13. arXiv:2011.09288  [pdf, ps, other

    math.OC cs.LG stat.ML

    On Uninformative Optimal Policies in Adaptive LQR with Unknown B-Matrix

    Authors: Ingvar Ziemann, Henrik Sandberg

    Abstract: This paper presents local asymptotic minimax regret lower bounds for adaptive Linear Quadratic Regulators (LQR). We consider affinely parametrized $B$-matrices and known $A$-matrices and aim to understand when logarithmic regret is impossible even in the presence of structural side information. After defining the intrinsic notion of an uninformative optimal policy in terms of a singularity conditi… ▽ More

    Submitted 30 April, 2021; v1 submitted 18 November, 2020; originally announced November 2020.