Skip to main content

Showing 1–50 of 159 results for author: Schön, T

.
  1. arXiv:2406.19175  [pdf, other

    cs.LG cs.CV

    Towards Reducing Data Acquisition and Labeling for Defect Detection using Simulated Data

    Authors: Lukas Malte Kemeter, Rasmus Hvingelby, Paulina Sierak, Tobias Schön, Bishwajit Gosswam

    Abstract: In many manufacturing settings, annotating data for machine learning and computer vision is costly, but synthetic data can be generated at significantly lower cost. Substituting the real-world data with synthetic data is therefore appealing for many machine learning applications that require large amounts of training data. However, relying solely on synthetic data is frequently inadequate for effe… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2405.13794  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Conditioning diffusion models by explicit forward-backward bridging

    Authors: Adrien Corenflos, Zheng Zhao, Simo Särkkä, Jens Sjölund, Thomas B. Schön

    Abstract: Given an unconditional diffusion model $π(x, y)$, using it to perform conditional simulation $π(x \mid y)$ is still largely an open question and is typically achieved by learning conditional drifts to the denoising SDE after the fact. In this work, we express conditional simulation as an inference problem on an augmented space corresponding to a partial SDE bridge. This perspective allows us to im… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 24 pages, 12 figures

  3. arXiv:2404.09732  [pdf, other

    cs.CV

    Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models

    Authors: Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

    Abstract: Though diffusion models have been successfully applied to various image restoration (IR) tasks, their performance is sensitive to the choice of training datasets. Typically, diffusion models trained in specific datasets fail to recover images that have out-of-distribution degradations. To address this problem, this work leverages a capable vision-language model and a synthetic degradation pipeline… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: CVPRW 2024; Code: https://github.com/Algolzw/daclip-uir

  4. On the equivalence of direct and indirect data-driven predictive control approaches

    Authors: Per Mattsson, Fabio Bonassi, Valentina Breschi, Thomas B. Schön

    Abstract: Recently, several direct Data-Driven Predictive Control (DDPC) methods have been proposed, advocating the possibility of designing predictive controllers from historical input-output trajectories without the need to identify a model. In this work, we show that these approaches are equivalent to an indirect approach. Reformulating the direct methods in terms of estimated parameters and covariance m… ▽ More

    Submitted 20 May, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: \c{opyright} 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  5. arXiv:2402.04080  [pdf, other

    cs.LG eess.SY

    Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning

    Authors: Ruoqi Zhang, Ziwei Luo, Jens Sjölund, Thomas B. Schön, Per Mattsson

    Abstract: This paper presents advanced techniques of training diffusion policies for offline reinforcement learning (RL). At the core is a mean-reverting stochastic differential equation (SDE) that transfers a complex action distribution into a standard Gaussian and then samples actions conditioned on the environment state with a corresponding reverse-time SDE, like a typical diffusion policy. We show that… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  6. arXiv:2401.14325  [pdf, other

    cs.CV

    Unlocking Past Information: Temporal Embeddings in Cooperative Bird's Eye View Prediction

    Authors: Dominik Rößle, Jeremias Gerner, Klaus Bogenberger, Daniel Cremers, Stefanie Schmidtner, Torsten Schön

    Abstract: Accurate and comprehensive semantic segmentation of Bird's Eye View (BEV) is essential for ensuring safe and proactive navigation in autonomous driving. Although cooperative perception has exceeded the detection capabilities of single-agent systems, prevalent camera-based algorithms in cooperative perception neglect valuable information derived from historical observations. This limitation becomes… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  7. arXiv:2401.05876  [pdf, other

    cs.LG cs.RO

    Safe reinforcement learning in uncertain contexts

    Authors: Dominik Baumann, Thomas B. Schön

    Abstract: When deploying machine learning algorithms in the real world, guaranteeing safety is an essential asset. Existing safe learning approaches typically consider continuous variables, i.e., regression tasks. However, in practice, robotic systems are also subject to discrete, external environmental changes, e.g., having to carry objects of certain weights or operating on frozen, wet, or dry surfaces. S… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted final version to appear in the IEEE Transactions on Robotics

  8. arXiv:2312.06211  [pdf, other

    eess.SY cs.LG

    Structured state-space models are deep Wiener models

    Authors: Fabio Bonassi, Carl Andersson, Per Mattsson, Thomas B. Schön

    Abstract: The goal of this paper is to provide a system identification-friendly introduction to the Structured State-space Models (SSMs). These models have become recently popular in the machine learning community since, owing to their parallelizability, they can be efficiently and scalably trained to tackle extremely-long sequence classification and regression problems. Interestingly, SSMs appear as an eff… ▽ More

    Submitted 20 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: \c{opyright} 2024 the authors. This work has been accepted to IFAC for publication under a Creative Commons Licence CC-BY-NC-ND

  9. arXiv:2311.12566  [pdf, other

    cs.LG stat.ML

    Variational Elliptical Processes

    Authors: Maria Bånkestad, Jens Sjölund, Jalil Taghia, Thomas B. Schöon

    Abstract: We present elliptical processes, a family of non-parametric probabilistic models that subsume Gaussian processes and Student's t processes. This generalization includes a range of new heavy-tailed behaviors while retaining computational tractability. Elliptical processes are based on a representation of elliptical distributions as a continuous mixture of Gaussian distributions. We parameterize thi… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 14 pages, 15 figures, appendix 9 pages

    Journal ref: Transactions on Machine Learning Research, September 2023

  10. arXiv:2311.04125   

    math.CO math.NT

    New bounds in the Bogolyubov-Ruzsa lemma

    Authors: Tomasz Kosciuszko, Tomasz Schoen

    Abstract: We establish new bounds in the Bogolyubov-Ruzsa lemma, demonstrating that if A is a subset of a finite abelian group with density alpha, then 3A-3A contains a Bohr set of rank O(log^2 (2/alpha)) and radius Omega(log^{-2} (2/alpha)). The Bogolyubov-Ruzsa lemma is one of the deepest results in additive combinatorics, with a plethora of important consequences. In particular, we obtain new results tow… ▽ More

    Submitted 9 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Mistake in Lemma 5.1

  11. arXiv:2310.19608  [pdf, other

    cs.LG stat.ML

    On Feynman--Kac training of partial Bayesian neural networks

    Authors: Zheng Zhao, Sebastian Mair, Thomas B. Schön, Jens Sjölund

    Abstract: Recently, partial Bayesian neural networks (pBNNs), which only consider a subset of the parameters to be stochastic, were shown to perform competitively with full Bayesian neural networks. However, pBNNs are often multi-modal in the latent variable space and thus challenging to approximate with parametric models. To address this problem, we propose an efficient sampling-based training strategy, wh… ▽ More

    Submitted 27 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: In AISTATS 2024

  12. arXiv:2310.11335  [pdf, other

    cs.LG

    Non-ergodicity in reinforcement learning: robustness via ergodicity transformations

    Authors: Dominik Baumann, Erfaun Noorani, James Price, Ole Peters, Colm Connaughton, Thomas B. Schön

    Abstract: Envisioned application areas for reinforcement learning (RL) include autonomous driving, precision agriculture, and finance, which all require RL agents to make decisions in the real world. A significant challenge hindering the adoption of RL methods in these domains is the non-robustness of conventional algorithms. In this paper, we argue that a fundamental issue contributing to this lack of robu… ▽ More

    Submitted 10 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  13. arXiv:2310.10807  [pdf, other

    stat.ML cs.CR cs.LG math.OC

    Regularization properties of adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Francis Bach, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against it. Formulated as a min-max problem, it searches for the best solution when the training data were corrupted by the worst-case attacks. Linear models are among the simple models where vulnerabilities can be… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted (spotlight) NeurIPS 2023; A preliminary version of this work titled: "Surprises in adversarially-trained linear regression" was made available under a different identifier: arXiv:2205.12695

  14. arXiv:2310.09584  [pdf, ps, other

    math.CO

    On convex equations

    Authors: Tomasz Schoen

    Abstract: We prove that every subset of $\{1,\dots, N\}$ which does not contain any solutions to the equation $x+y+z=3w$ has at most $\exp(-c(\log N)^{1/5+o(1)})N$ elements, for some $c>0$. This theorem improves upon previous estimates. Additionally, our method has the potential to yield an optimal estimate for this problem that matches the known Behrend's lower estimate. Our approach relies on a new result… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    MSC Class: 11B30; 11B25

  15. arXiv:2310.01018  [pdf, other

    cs.CV

    Controlling Vision-Language Models for Multi-Task Image Restoration

    Authors: Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

    Abstract: Vision-language models such as CLIP have shown great impact on diverse downstream tasks for zero-shot or label-free predictions. However, when it comes to low-level vision such as image restoration their performance deteriorates dramatically due to corrupted inputs. In this paper, we present a degradation-aware vision-language model (DA-CLIP) to better transfer pretrained vision-language models to… ▽ More

    Submitted 28 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024. Project page: https://algolzw.github.io/daclip-uir/index.html

  16. arXiv:2309.16335  [pdf, other

    cs.LG cs.AI q-bio.QM stat.AP

    End-to-end Risk Prediction of Atrial Fibrillation from the 12-Lead ECG by Deep Neural Networks

    Authors: Theogene Habineza, Antônio H. Ribeiro, Daniel Gedon, Joachim A. Behar, Antonio Luiz P. Ribeiro, Thomas B. Schön

    Abstract: Background: Atrial fibrillation (AF) is one of the most common cardiac arrhythmias that affects millions of people each year worldwide and it is closely linked to increased risk of cardiovascular diseases such as stroke and heart failure. Machine learning methods have shown promising results in evaluating the risk of develo** atrial fibrillation from the electrocardiogram. We aim to develop and… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 16 pages with 7 figures

    Journal ref: @article{HABINEZA2023193, journal = {Journal of Electrocardiology}, volume = {81}, pages = {193-200}, year = {2023}, issn = {0022-0736}}

  17. arXiv:2308.10245  [pdf, other

    math.CO math.NT

    Note on the Theorem of Balog, Szemerédi, and Gowers

    Authors: Christian Reiher, Tomasz Schoen

    Abstract: We prove that every additive set $A$ with energy $E(A)\ge |A|^3/K$ has a subset $A'\subseteq A$ of size $|A'|\ge (1-\varepsilon)K^{-1/2}|A|$ such that $|A'-A'|\le O_\varepsilon(K^{4}|A'|)$. This is, essentially, the largest structured set one can get in the Balog-Szemerédi-Gowers theorem.

    Submitted 18 February, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: second version addresses referee reports

  18. arXiv:2308.02632  [pdf, other

    cs.CV cs.AI cs.LG

    Generation of Realistic Synthetic Raw Radar Data for Automated Driving Applications using Generative Adversarial Networks

    Authors: Eduardo C. Fidelis, Fabio Reway, Herick Y. S. Ribeiro, Pietro L. Campos, Werner Huber, Christian Icking, Lester A. Faria, Torsten Schön

    Abstract: The main approaches for simulating FMCW radar are based on ray tracing, which is usually computationally intensive and do not account for background noise. This work proposes a faster method for FMCW radar simulation capable of generating synthetic raw radar data using generative adversarial networks (GAN). The code and pre-trained weights are open-source and available on GitHub. This method gener… ▽ More

    Submitted 8 August, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

  19. arXiv:2306.16042  [pdf, other

    math.OC eess.SY

    Guarantees for data-driven control of nonlinear systems using semidefinite programming: A survey

    Authors: Tim Martin, Thomas B. Schön, Frank Allgöwer

    Abstract: This survey presents recent research on determining control-theoretic properties and designing controllers with rigorous guarantees using semidefinite programming and for nonlinear systems for which no mathematical models but measured trajectories are available. Data-driven control techniques have been developed to circumvent a time-consuming modelling by first principles and because of the increa… ▽ More

    Submitted 3 November, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  20. arXiv:2306.03953  [pdf, other

    cs.RO eess.SP eess.SY

    Rao-Blackwellized Particle Smoothing for Simultaneous Localization and Map**

    Authors: Manon Kok, Arno Solin, Thomas B. Schön

    Abstract: Simultaneous localization and map** (SLAM) is the task of building a map representation of an unknown environment while at the same time using it for positioning. A probabilistic interpretation of the SLAM task allows for incorporating prior knowledge and for operation under uncertainty. Contrary to the common practice of computing point estimates of the system states, we capture the full poster… ▽ More

    Submitted 5 June, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 23 pages, 7 figures

    Journal ref: Data-Centric Engineering. 2024;5:e15

  21. arXiv:2304.08291  [pdf, other

    cs.CV

    Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models

    Authors: Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

    Abstract: This work aims to improve the applicability of diffusion models in realistic image restoration. Specifically, we enhance the diffusion model in several aspects such as network architecture, noise level, denoising steps, training image size, and optimizer/scheduler. We show that tuning these hyperparameters allows us to achieve better performance on both distortion and perceptual scores. We also pr… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: CVPRW 2023. Runner-up method in NTIRE 2023 Image Shadow Removal Challenge. Code is available at https://github.com/Algolzw/image-restoration-sde

  22. arXiv:2304.00559  [pdf, ps, other

    eess.SY cs.MA

    On the trade-off between event-based and periodic state estimation under bandwidth constraints

    Authors: Dominik Baumann, Thomas B. Schön

    Abstract: Event-based methods carefully select when to transmit information to enable high-performance control and estimation over resource-constrained communication networks. However, they come at a cost. For instance, event-based communication induces a higher computational load and increases the complexity of the scheduling problem. Thus, in some cases, allocating available slots to agents periodically i… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: 6 pages

  23. Invertible Kernel PCA with Random Fourier Features

    Authors: Daniel Gedon, Antôni H. Ribeiro, Niklas Wahlström, Thomas B. Schön

    Abstract: Kernel principal component analysis (kPCA) is a widely studied method to construct a low-dimensional data representation after a nonlinear transformation. The prevailing method to reconstruct the original input signal from kPCA -- an important task for denoising -- requires us to solve a supervised learning problem. In this paper, we present an alternative method where the reconstruction follows n… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  24. arXiv:2302.03679  [pdf, other

    cs.LG cs.CV

    How Reliable is Your Regression Model's Uncertainty Under Real-World Distribution Shifts?

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: Many important computer vision applications are naturally formulated as regression problems. Within medical imaging, accurate regression models have the potential to automate various tasks, hel** to lower costs and improve patient outcomes. Such safety-critical deployment does however require reliable estimation of model uncertainty, also under the wide variety of distribution shifts that might… ▽ More

    Submitted 7 November, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: TMLR, 2023. Code is available at https://github.com/fregu856/regression_uncertainty

  25. arXiv:2301.12832  [pdf, other

    cs.LG eess.SY

    Deep networks for system identification: a Survey

    Authors: Gianluigi Pillonetto, Aleksandr Aravkin, Daniel Gedon, Lennart Ljung, Antônio H. Ribeiro, Thomas B. Schön

    Abstract: Deep learning is a topic of considerable current interest. The availability of massive data collections and powerful software resources has led to an impressive amount of results in many application areas that reveal essential but hidden properties of the observations. System identification learns mathematical descriptions of dynamic systems from input-output data and can thus benefit from the adv… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  26. arXiv:2301.11699  [pdf, other

    cs.LG cs.CV

    Image Restoration with Mean-Reverting Stochastic Differential Equations

    Authors: Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

    Abstract: This paper presents a stochastic differential equation (SDE) approach for general-purpose image restoration. The key construction consists in a mean-reverting SDE that transforms a high-quality image into a degraded counterpart as a mean state with fixed Gaussian noise. Then, by simulating the corresponding reverse-time SDE, we are able to restore the origin of the low-quality image without relyin… ▽ More

    Submitted 31 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted by ICML 2023; Project page: https://algolzw.github.io/ir-sde/index.html

  27. arXiv:2212.13890  [pdf, other

    eess.SP cs.CV cs.LG

    ECG-Based Electrolyte Prediction: Evaluating Regression and Probabilistic Methods

    Authors: Philipp Von Bachmann, Daniel Gedon, Fredrik K. Gustafsson, Antônio H. Ribeiro, Erik Lampa, Stefan Gustafsson, Johan Sundström, Thomas B. Schön

    Abstract: Objective: Imbalances of the electrolyte concentration levels in the body can lead to catastrophic consequences, but accurate and accessible measurements could improve patient outcomes. While blood tests provide accurate measurements, they are invasive and the laboratory analysis can be slow or inaccessible. In contrast, an electrocardiogram (ECG) is a widely adopted tool which is quick and simple… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Code and trained models are available at https://github.com/philippvb/ecg-electrolyte-regression

  28. arXiv:2211.05639  [pdf, other

    math.OC eess.SY

    Gaussian inference for data-driven state-feedback design of nonlinear systems

    Authors: Tim Martin, Thomas B. Schön, Frank Allgöwer

    Abstract: Data-driven control of nonlinear systems with rigorous guarantees is a challenging problem as it usually calls for nonconvex optimization and requires often knowledge of the true basis functions of the system dynamics. To tackle these drawbacks, this work is based on a data-driven polynomial representation of general nonlinear systems exploiting Taylor polynomials. Thereby, we design state-feedbac… ▽ More

    Submitted 24 March, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Final version, accepted for presentation at the 22nd IFAC World Congress, 2023

  29. arXiv:2210.14684  [pdf, other

    stat.CO stat.AP stat.ME

    Nonlinear System Identification: Learning while respecting physical models using a sequential Monte Carlo method

    Authors: Anna Wigren, Johan Wågberg, Fredrik Lindsten, Adrian Wills, Thomas B. Schön

    Abstract: Identification of nonlinear systems is a challenging problem. Physical knowledge of the system can be used in the identification process to significantly improve the predictive performance by restricting the space of possible map**s from the input to the output. Typically, the physical models contain unknown parameters that must be learned from data. Classical methods often restrict the possible… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 52 pages, 13 figures

    Journal ref: IEEE Control Systems Magazine, Volume 42, Issue 1, pages 75 - 102, February 2022

  30. arXiv:2205.13629  [pdf, other

    cs.CV cs.AI cs.RO

    Deep Sensor Fusion with Pyramid Fusion Networks for 3D Semantic Segmentation

    Authors: Hannah Schieber, Fabian Duerr, Torsten Schoen, Jürgen Beyerer

    Abstract: Robust environment perception for autonomous vehicles is a tremendous challenge, which makes a diverse sensor set with e.g. camera, lidar and radar crucial. In the process of understanding the recorded sensor data, 3D semantic segmentation plays an important role. Therefore, this work presents a pyramid-based deep fusion architecture for lidar and camera to improve 3D semantic segmentation of traf… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: conditionally accepted at IEEE IV 2022, 7 pages, 4 figures, 5 tables

  31. arXiv:2205.12695  [pdf, other

    stat.ML cs.CR cs.LG eess.SP math.ST

    Surprises in adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against such examples. It is formulated as a min-max problem, searching for the best solution when the training data was corrupted by the worst-case attacks. For linear regression problems, adversarial training can… ▽ More

    Submitted 20 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  32. arXiv:2205.06306  [pdf, other

    stat.ML eess.SP stat.AP

    Probabilistic Estimation of Instantaneous Frequencies of Chirp Signals

    Authors: Zheng Zhao, Simo Särkkä, Jens Sjölund, Thomas B. Schön

    Abstract: We present a continuous-time probabilistic approach for estimating the chirp signal and its instantaneous frequency function when the true forms of these functions are not accessible. Our model represents these functions by non-linearly cascaded Gaussian processes represented as non-linear stochastic differential equations. The posterior distribution of the functions is then estimated with stochas… ▽ More

    Submitted 13 February, 2023; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in IEEE Transactions on Signal Processing

  33. arXiv:2204.06274  [pdf, other

    stat.ML cs.CR cs.LG eess.SP math.ST

    Overparameterized Linear Regression under Adversarial Attacks

    Authors: Antônio H. Ribeiro, Thomas B. Schön

    Abstract: We study the error of linear regression in the face of adversarial attacks. In this framework, an adversary changes the input to the regression model in order to maximize the prediction error. We provide bounds on the prediction error in the presence of an adversary as a function of the parameter norm and the error in the absence of such an adversary. We show how these bounds make it possible to s… ▽ More

    Submitted 27 January, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

  34. arXiv:2202.01793  [pdf, other

    stat.ML cs.LG

    Incorporating Sum Constraints into Multitask Gaussian Processes

    Authors: Philipp Pilar, Carl Jidling, Thomas B. Schön, Niklas Wahlström

    Abstract: Machine learning models can be improved by adapting them to respect existing background knowledge. In this paper we consider multitask Gaussian processes, with background knowledge in the form of constraints that require a specific sum of the outputs to be constant. This is achieved by conditioning the prior distribution on the constraint fulfillment. The approach allows for both linear and nonlin… ▽ More

    Submitted 1 February, 2023; v1 submitted 3 February, 2022; originally announced February 2022.

    Journal ref: Transactions on Machine Learning Research, 2022

  35. Efficient Learning of the Parameters of Non-Linear Models using Differentiable Resampling in Particle Filters

    Authors: Conor Rosato, Vincent Beraud, Paul Horridge, Thomas B. Schön, Simon Maskell

    Abstract: It has been widely documented that the sampling and resampling steps in particle filters cannot be differentiated. The {\itshape reparameterisation trick} was introduced to allow the sampling step to be reformulated into a differentiable function. We extend the {\itshape reparameterisation trick} to include the stochastic input to resampling therefore limiting the discontinuities in the gradient c… ▽ More

    Submitted 27 April, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 35 pages, 10 figures

  36. arXiv:2110.11948  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Proposals for Practical Energy-Based Regression

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression. However, energy-based regression requires a proposal distribution to be manually designed for training, and an initial estimate has to be provided at test-time. We address both of these issues by introducing a conceptually simple metho… ▽ More

    Submitted 7 November, 2023; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: AISTATS 2022. Code is available at https://github.com/fregu856/ebms_proposals

  37. arXiv:2107.02259  [pdf, other

    cs.CV cs.AI cs.LG

    VolNet: Estimating Human Body Part Volumes from a Single RGB Image

    Authors: Fabian Leinen, Vittorio Cozzolino, Torsten Schön

    Abstract: Human body volume estimation from a single RGB image is a challenging problem despite minimal attention from the research community. However VolNet, an architecture leveraging 2D and 3D pose estimation, body part segmentation and volume regression extracted from a single 2D RGB image combined with the subject's body height can be used to estimate the total body volume. VolNet is designed to predic… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

  38. arXiv:2106.02328  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Temporally coherent video anonymization through GAN inpainting

    Authors: Thangapavithraa Balaji, Patrick Blies, Georg Göri, Raphael Mitsch, Marcel Wasserer, Torsten Schön

    Abstract: This work tackles the problem of temporally coherent face anonymization in natural video streams.We propose JaGAN, a two-stage system starting with detecting and masking out faces with black image patches in all individual frames of the video. The second stage leverages a privacy-preserving Video Generative Adversarial Network designed to inpaint the missing image patches with artificially generat… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: Preprint of our FG2021 submission

  39. arXiv:2104.13853  [pdf, other

    cs.LG eess.SY

    Learning deep autoregressive models for hierarchical data

    Authors: Carl R. Andersson, Niklas Wahlström, Thomas B. Schön

    Abstract: We propose a model for hierarchical structured data as an extension to the stochastic temporal convolutional network. The proposed model combines an autoregressive model with a hierarchical variational autoencoder and downsampling to achieve superior computational complexity. We evaluate the proposed model on two different types of sequential data: speech and handwritten text. The results are prom… ▽ More

    Submitted 1 July, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

  40. Data to Controller for Nonlinear Systems: An Approximate Solution

    Authors: Johannes N. Hendriks, James R. Z. Holdsworth, Adrian G. Wills, Thomas B. Schon, Brett Ninness

    Abstract: This paper considers the problem of determining an optimal control action based on observed data. We formulate the problem assuming that the system can be modelled by a nonlinear state-space model, but where the model parameters, state and future disturbances are not known and are treated as random variables. Central to our formulation is that the joint distribution of these unknown objects is con… ▽ More

    Submitted 30 June, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: in IEEE Control Systems Letters, 2021

  41. arXiv:2103.04727  [pdf, other

    cs.LG cs.CV cs.RO

    Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement Learning

    Authors: Patrick Wenzel, Torsten Schön, Laura Leal-Taixé, Daniel Cremers

    Abstract: Obstacle avoidance is a fundamental and challenging problem for autonomous navigation of mobile robots. In this paper, we consider the problem of obstacle avoidance in simple 3D environments where the robot has to solely rely on a single monocular camera. In particular, we are interested in solving this problem without relying on localization, map**, or planning techniques. Most of the existing… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted at 2021 IEEE International Conference on Robotics and Automation (ICRA)

  42. arXiv:2103.00930  [pdf, other

    physics.med-ph cs.CV cs.LG eess.IV eess.SY

    Unsupervised dynamic modeling of medical image transformation

    Authors: Niklas Gunnarsson, Peter Kimstrand, Jens Sjölund, Thomas B. Schön

    Abstract: Spatiotemporal imaging has applications in e.g. cardiac diagnostics, surgical guidance, and radiotherapy monitoring, In this paper, we explain the temporal motion by identifying the underlying dynamics, only based on the sequential images. Our dynamical model maps the inputs of observed high-dimensional sequential images to a low-dimensional latent space wherein a linear relationship between a hid… ▽ More

    Submitted 7 November, 2022; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: published in 2022 25th International Conference on Information Fusion (FUSION)

  43. arXiv:2102.10880  [pdf, other

    cs.LG

    A Probabilistically Motivated Learning Rate Adaptation for Stochastic Optimization

    Authors: Filip de Roos, Carl Jidling, Adrian Wills, Thomas Schön, Philipp Hennig

    Abstract: Machine learning practitioners invest significant manual and computational resources in finding suitable learning rates for optimization algorithms. We provide a probabilistic motivation, in terms of Gaussian inference, for popular stochastic first-order methods. As an important special case, it recovers the Polyak step with a general metric. The inference allows us to relate the learning rate to… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

  44. arXiv:2102.07757  [pdf, other

    eess.IV cs.LG eess.SP

    How Convolutional Neural Networks Deal with Aliasing

    Authors: Antônio H. Ribeiro, Thomas B. Schön

    Abstract: The convolutional neural network (CNN) remains an essential tool in solving computer vision problems. Standard convolutional architectures consist of stacked layers of operations that progressively downscale the image. Aliasing is a well-known side-effect of downsampling that may take place: it causes high-frequency components of the original signal to become indistinguishable from its low-frequen… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: To appear in the 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  45. arXiv:2012.07269  [pdf, ps, other

    stat.ML cs.LG

    Variational State and Parameter Estimation

    Authors: Jarrad Courts, Johannes Hendriks, Adrian Wills, Thomas Schön, Brett Ninness

    Abstract: This paper considers the problem of computing Bayesian estimates of both states and model parameters for nonlinear state-space models. Generally, this problem does not have a tractable solution and approximations must be utilised. In this work, a variational approach is used to provide an assumed density which approximates the desired, intractable, distribution. The approach is deterministic and r… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

  46. arXiv:2012.06341  [pdf, other

    cs.LG eess.SY stat.ML

    Beyond Occam's Razor in System Identification: Double-Descent when Modeling Dynamics

    Authors: Antônio H. Ribeiro, Johannes N. Hendriks, Adrian G. Wills, Thomas B. Schön

    Abstract: System identification aims to build models of dynamical systems from data. Traditionally, choosing the model requires the designer to balance between two goals of conflicting nature; the model must be rich enough to capture the system dynamics, but not so flexible that it learns spurious random effects from the dataset. It is typically observed that the model validation performance follows a U-sha… ▽ More

    Submitted 6 August, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: To appear in the Proceedings of the 19th IFAC Symposium in System Identification (2021)

  47. arXiv:2012.05072  [pdf, ps, other

    stat.ML cs.LG eess.SY stat.ME

    Variational System Identification for Nonlinear State-Space Models

    Authors: Jarrad Courts, Adrian Wills, Thomas Schön, Brett Ninness

    Abstract: This paper considers parameter estimation for nonlinear state-space models, which is an important but challenging problem. We address this challenge by employing a variational inference (VI) approach, which is a principled method that has deep connections to maximum likelihood estimation. This VI approach ultimately provides estimates of the model as solutions to an optimisation problem, which is… ▽ More

    Submitted 14 September, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

  48. arXiv:2012.04634  [pdf, other

    cs.CV cs.LG cs.RO stat.ML

    Accurate 3D Object Detection using Energy-Based Models

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: Accurate 3D object detection (3DOD) is crucial for safe navigation of complex environments by autonomous robots. Regressing accurate 3D bounding boxes in cluttered environments based on sparse LiDAR data is however a highly challenging problem. We address this task by exploring recent advances in conditional energy-based models (EBMs) for probabilistic regression. While methods employing EBMs for… ▽ More

    Submitted 7 November, 2023; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: CVPR Workshops 2021. Code is available at https://github.com/fregu856/ebms_3dod

  49. arXiv:2012.04136  [pdf, other

    cs.LG eess.SY

    Deep Energy-Based NARX Models

    Authors: Johannes N. Hendriks, Fredrik K. Gustafsson, Antônio H. Ribeiro, Adrian G. Wills, Thomas B. Schön

    Abstract: This paper is directed towards the problem of learning nonlinear ARX models based on system input--output data. In particular, our interest is in learning a conditional distribution of the current output based on a finite window of past inputs and outputs. To achieve this, we consider the use of so-called energy-based models, which have been developed in allied fields for learning unknown distribu… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  50. arXiv:2006.02877  [pdf, ps, other

    math.CO math.NT

    A subexponential upper bound for van der Waerden numbers W(3,k)

    Authors: Tomasz Schoen

    Abstract: We show an improved upper estimate for van der Waerden number $W(3,k):$ there is an absolute constant $c>0$ such that if $\{1,\dots,N\}=X\cup Y$ is a partition such that $X$ does not contain any arithmetic progression of length $3$ and $Y$ does not contain any arithmetic progression of length $k$ then $$N\le \exp(O(k^{1-c}))\,.$$

    Submitted 4 June, 2020; originally announced June 2020.

    MSC Class: 05D10; 11B25