Skip to main content

Showing 1–50 of 86 results for author: Schön, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19175  [pdf, other

    cs.LG cs.CV

    Towards Reducing Data Acquisition and Labeling for Defect Detection using Simulated Data

    Authors: Lukas Malte Kemeter, Rasmus Hvingelby, Paulina Sierak, Tobias Schön, Bishwajit Gosswam

    Abstract: In many manufacturing settings, annotating data for machine learning and computer vision is costly, but synthetic data can be generated at significantly lower cost. Substituting the real-world data with synthetic data is therefore appealing for many machine learning applications that require large amounts of training data. However, relying solely on synthetic data is frequently inadequate for effe… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2405.13794  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Conditioning diffusion models by explicit forward-backward bridging

    Authors: Adrien Corenflos, Zheng Zhao, Simo Särkkä, Jens Sjölund, Thomas B. Schön

    Abstract: Given an unconditional diffusion model $π(x, y)$, using it to perform conditional simulation $π(x \mid y)$ is still largely an open question and is typically achieved by learning conditional drifts to the denoising SDE after the fact. In this work, we express conditional simulation as an inference problem on an augmented space corresponding to a partial SDE bridge. This perspective allows us to im… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 24 pages, 12 figures

  3. arXiv:2404.09732  [pdf, other

    cs.CV

    Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models

    Authors: Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

    Abstract: Though diffusion models have been successfully applied to various image restoration (IR) tasks, their performance is sensitive to the choice of training datasets. Typically, diffusion models trained in specific datasets fail to recover images that have out-of-distribution degradations. To address this problem, this work leverages a capable vision-language model and a synthetic degradation pipeline… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: CVPRW 2024; Code: https://github.com/Algolzw/daclip-uir

  4. arXiv:2402.04080  [pdf, other

    cs.LG eess.SY

    Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning

    Authors: Ruoqi Zhang, Ziwei Luo, Jens Sjölund, Thomas B. Schön, Per Mattsson

    Abstract: This paper presents advanced techniques of training diffusion policies for offline reinforcement learning (RL). At the core is a mean-reverting stochastic differential equation (SDE) that transfers a complex action distribution into a standard Gaussian and then samples actions conditioned on the environment state with a corresponding reverse-time SDE, like a typical diffusion policy. We show that… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  5. arXiv:2401.14325  [pdf, other

    cs.CV

    Unlocking Past Information: Temporal Embeddings in Cooperative Bird's Eye View Prediction

    Authors: Dominik Rößle, Jeremias Gerner, Klaus Bogenberger, Daniel Cremers, Stefanie Schmidtner, Torsten Schön

    Abstract: Accurate and comprehensive semantic segmentation of Bird's Eye View (BEV) is essential for ensuring safe and proactive navigation in autonomous driving. Although cooperative perception has exceeded the detection capabilities of single-agent systems, prevalent camera-based algorithms in cooperative perception neglect valuable information derived from historical observations. This limitation becomes… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  6. arXiv:2401.05876  [pdf, other

    cs.LG cs.RO

    Safe reinforcement learning in uncertain contexts

    Authors: Dominik Baumann, Thomas B. Schön

    Abstract: When deploying machine learning algorithms in the real world, guaranteeing safety is an essential asset. Existing safe learning approaches typically consider continuous variables, i.e., regression tasks. However, in practice, robotic systems are also subject to discrete, external environmental changes, e.g., having to carry objects of certain weights or operating on frozen, wet, or dry surfaces. S… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted final version to appear in the IEEE Transactions on Robotics

  7. arXiv:2312.06211  [pdf, other

    eess.SY cs.LG

    Structured state-space models are deep Wiener models

    Authors: Fabio Bonassi, Carl Andersson, Per Mattsson, Thomas B. Schön

    Abstract: The goal of this paper is to provide a system identification-friendly introduction to the Structured State-space Models (SSMs). These models have become recently popular in the machine learning community since, owing to their parallelizability, they can be efficiently and scalably trained to tackle extremely-long sequence classification and regression problems. Interestingly, SSMs appear as an eff… ▽ More

    Submitted 20 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: \c{opyright} 2024 the authors. This work has been accepted to IFAC for publication under a Creative Commons Licence CC-BY-NC-ND

  8. arXiv:2311.12566  [pdf, other

    cs.LG stat.ML

    Variational Elliptical Processes

    Authors: Maria Bånkestad, Jens Sjölund, Jalil Taghia, Thomas B. Schöon

    Abstract: We present elliptical processes, a family of non-parametric probabilistic models that subsume Gaussian processes and Student's t processes. This generalization includes a range of new heavy-tailed behaviors while retaining computational tractability. Elliptical processes are based on a representation of elliptical distributions as a continuous mixture of Gaussian distributions. We parameterize thi… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 14 pages, 15 figures, appendix 9 pages

    Journal ref: Transactions on Machine Learning Research, September 2023

  9. arXiv:2310.19608  [pdf, other

    cs.LG stat.ML

    On Feynman--Kac training of partial Bayesian neural networks

    Authors: Zheng Zhao, Sebastian Mair, Thomas B. Schön, Jens Sjölund

    Abstract: Recently, partial Bayesian neural networks (pBNNs), which only consider a subset of the parameters to be stochastic, were shown to perform competitively with full Bayesian neural networks. However, pBNNs are often multi-modal in the latent variable space and thus challenging to approximate with parametric models. To address this problem, we propose an efficient sampling-based training strategy, wh… ▽ More

    Submitted 27 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: In AISTATS 2024

  10. arXiv:2310.11335  [pdf, other

    cs.LG

    Non-ergodicity in reinforcement learning: robustness via ergodicity transformations

    Authors: Dominik Baumann, Erfaun Noorani, James Price, Ole Peters, Colm Connaughton, Thomas B. Schön

    Abstract: Envisioned application areas for reinforcement learning (RL) include autonomous driving, precision agriculture, and finance, which all require RL agents to make decisions in the real world. A significant challenge hindering the adoption of RL methods in these domains is the non-robustness of conventional algorithms. In this paper, we argue that a fundamental issue contributing to this lack of robu… ▽ More

    Submitted 10 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  11. arXiv:2310.10807  [pdf, other

    stat.ML cs.CR cs.LG math.OC

    Regularization properties of adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Francis Bach, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against it. Formulated as a min-max problem, it searches for the best solution when the training data were corrupted by the worst-case attacks. Linear models are among the simple models where vulnerabilities can be… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted (spotlight) NeurIPS 2023; A preliminary version of this work titled: "Surprises in adversarially-trained linear regression" was made available under a different identifier: arXiv:2205.12695

  12. arXiv:2310.01018  [pdf, other

    cs.CV

    Controlling Vision-Language Models for Multi-Task Image Restoration

    Authors: Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

    Abstract: Vision-language models such as CLIP have shown great impact on diverse downstream tasks for zero-shot or label-free predictions. However, when it comes to low-level vision such as image restoration their performance deteriorates dramatically due to corrupted inputs. In this paper, we present a degradation-aware vision-language model (DA-CLIP) to better transfer pretrained vision-language models to… ▽ More

    Submitted 28 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024. Project page: https://algolzw.github.io/daclip-uir/index.html

  13. arXiv:2309.16335  [pdf, other

    cs.LG cs.AI q-bio.QM stat.AP

    End-to-end Risk Prediction of Atrial Fibrillation from the 12-Lead ECG by Deep Neural Networks

    Authors: Theogene Habineza, Antônio H. Ribeiro, Daniel Gedon, Joachim A. Behar, Antonio Luiz P. Ribeiro, Thomas B. Schön

    Abstract: Background: Atrial fibrillation (AF) is one of the most common cardiac arrhythmias that affects millions of people each year worldwide and it is closely linked to increased risk of cardiovascular diseases such as stroke and heart failure. Machine learning methods have shown promising results in evaluating the risk of develo** atrial fibrillation from the electrocardiogram. We aim to develop and… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 16 pages with 7 figures

    Journal ref: @article{HABINEZA2023193, journal = {Journal of Electrocardiology}, volume = {81}, pages = {193-200}, year = {2023}, issn = {0022-0736}}

  14. arXiv:2308.02632  [pdf, other

    cs.CV cs.AI cs.LG

    Generation of Realistic Synthetic Raw Radar Data for Automated Driving Applications using Generative Adversarial Networks

    Authors: Eduardo C. Fidelis, Fabio Reway, Herick Y. S. Ribeiro, Pietro L. Campos, Werner Huber, Christian Icking, Lester A. Faria, Torsten Schön

    Abstract: The main approaches for simulating FMCW radar are based on ray tracing, which is usually computationally intensive and do not account for background noise. This work proposes a faster method for FMCW radar simulation capable of generating synthetic raw radar data using generative adversarial networks (GAN). The code and pre-trained weights are open-source and available on GitHub. This method gener… ▽ More

    Submitted 8 August, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

  15. arXiv:2306.03953  [pdf, other

    cs.RO eess.SP eess.SY

    Rao-Blackwellized Particle Smoothing for Simultaneous Localization and Map**

    Authors: Manon Kok, Arno Solin, Thomas B. Schön

    Abstract: Simultaneous localization and map** (SLAM) is the task of building a map representation of an unknown environment while at the same time using it for positioning. A probabilistic interpretation of the SLAM task allows for incorporating prior knowledge and for operation under uncertainty. Contrary to the common practice of computing point estimates of the system states, we capture the full poster… ▽ More

    Submitted 5 June, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 23 pages, 7 figures

    Journal ref: Data-Centric Engineering. 2024;5:e15

  16. arXiv:2304.08291  [pdf, other

    cs.CV

    Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models

    Authors: Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

    Abstract: This work aims to improve the applicability of diffusion models in realistic image restoration. Specifically, we enhance the diffusion model in several aspects such as network architecture, noise level, denoising steps, training image size, and optimizer/scheduler. We show that tuning these hyperparameters allows us to achieve better performance on both distortion and perceptual scores. We also pr… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: CVPRW 2023. Runner-up method in NTIRE 2023 Image Shadow Removal Challenge. Code is available at https://github.com/Algolzw/image-restoration-sde

  17. arXiv:2304.00559  [pdf, ps, other

    eess.SY cs.MA

    On the trade-off between event-based and periodic state estimation under bandwidth constraints

    Authors: Dominik Baumann, Thomas B. Schön

    Abstract: Event-based methods carefully select when to transmit information to enable high-performance control and estimation over resource-constrained communication networks. However, they come at a cost. For instance, event-based communication induces a higher computational load and increases the complexity of the scheduling problem. Thus, in some cases, allocating available slots to agents periodically i… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: 6 pages

  18. Invertible Kernel PCA with Random Fourier Features

    Authors: Daniel Gedon, Antôni H. Ribeiro, Niklas Wahlström, Thomas B. Schön

    Abstract: Kernel principal component analysis (kPCA) is a widely studied method to construct a low-dimensional data representation after a nonlinear transformation. The prevailing method to reconstruct the original input signal from kPCA -- an important task for denoising -- requires us to solve a supervised learning problem. In this paper, we present an alternative method where the reconstruction follows n… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  19. arXiv:2302.03679  [pdf, other

    cs.LG cs.CV

    How Reliable is Your Regression Model's Uncertainty Under Real-World Distribution Shifts?

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: Many important computer vision applications are naturally formulated as regression problems. Within medical imaging, accurate regression models have the potential to automate various tasks, hel** to lower costs and improve patient outcomes. Such safety-critical deployment does however require reliable estimation of model uncertainty, also under the wide variety of distribution shifts that might… ▽ More

    Submitted 7 November, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: TMLR, 2023. Code is available at https://github.com/fregu856/regression_uncertainty

  20. arXiv:2301.12832  [pdf, other

    cs.LG eess.SY

    Deep networks for system identification: a Survey

    Authors: Gianluigi Pillonetto, Aleksandr Aravkin, Daniel Gedon, Lennart Ljung, Antônio H. Ribeiro, Thomas B. Schön

    Abstract: Deep learning is a topic of considerable current interest. The availability of massive data collections and powerful software resources has led to an impressive amount of results in many application areas that reveal essential but hidden properties of the observations. System identification learns mathematical descriptions of dynamic systems from input-output data and can thus benefit from the adv… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  21. arXiv:2301.11699  [pdf, other

    cs.LG cs.CV

    Image Restoration with Mean-Reverting Stochastic Differential Equations

    Authors: Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön

    Abstract: This paper presents a stochastic differential equation (SDE) approach for general-purpose image restoration. The key construction consists in a mean-reverting SDE that transforms a high-quality image into a degraded counterpart as a mean state with fixed Gaussian noise. Then, by simulating the corresponding reverse-time SDE, we are able to restore the origin of the low-quality image without relyin… ▽ More

    Submitted 31 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted by ICML 2023; Project page: https://algolzw.github.io/ir-sde/index.html

  22. arXiv:2212.13890  [pdf, other

    eess.SP cs.CV cs.LG

    ECG-Based Electrolyte Prediction: Evaluating Regression and Probabilistic Methods

    Authors: Philipp Von Bachmann, Daniel Gedon, Fredrik K. Gustafsson, Antônio H. Ribeiro, Erik Lampa, Stefan Gustafsson, Johan Sundström, Thomas B. Schön

    Abstract: Objective: Imbalances of the electrolyte concentration levels in the body can lead to catastrophic consequences, but accurate and accessible measurements could improve patient outcomes. While blood tests provide accurate measurements, they are invasive and the laboratory analysis can be slow or inaccessible. In contrast, an electrocardiogram (ECG) is a widely adopted tool which is quick and simple… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Code and trained models are available at https://github.com/philippvb/ecg-electrolyte-regression

  23. arXiv:2205.13629  [pdf, other

    cs.CV cs.AI cs.RO

    Deep Sensor Fusion with Pyramid Fusion Networks for 3D Semantic Segmentation

    Authors: Hannah Schieber, Fabian Duerr, Torsten Schoen, Jürgen Beyerer

    Abstract: Robust environment perception for autonomous vehicles is a tremendous challenge, which makes a diverse sensor set with e.g. camera, lidar and radar crucial. In the process of understanding the recorded sensor data, 3D semantic segmentation plays an important role. Therefore, this work presents a pyramid-based deep fusion architecture for lidar and camera to improve 3D semantic segmentation of traf… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: conditionally accepted at IEEE IV 2022, 7 pages, 4 figures, 5 tables

  24. arXiv:2205.12695  [pdf, other

    stat.ML cs.CR cs.LG eess.SP math.ST

    Surprises in adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against such examples. It is formulated as a min-max problem, searching for the best solution when the training data was corrupted by the worst-case attacks. For linear regression problems, adversarial training can… ▽ More

    Submitted 20 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  25. arXiv:2204.06274  [pdf, other

    stat.ML cs.CR cs.LG eess.SP math.ST

    Overparameterized Linear Regression under Adversarial Attacks

    Authors: Antônio H. Ribeiro, Thomas B. Schön

    Abstract: We study the error of linear regression in the face of adversarial attacks. In this framework, an adversary changes the input to the regression model in order to maximize the prediction error. We provide bounds on the prediction error in the presence of an adversary as a function of the parameter norm and the error in the absence of such an adversary. We show how these bounds make it possible to s… ▽ More

    Submitted 27 January, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

  26. arXiv:2202.01793  [pdf, other

    stat.ML cs.LG

    Incorporating Sum Constraints into Multitask Gaussian Processes

    Authors: Philipp Pilar, Carl Jidling, Thomas B. Schön, Niklas Wahlström

    Abstract: Machine learning models can be improved by adapting them to respect existing background knowledge. In this paper we consider multitask Gaussian processes, with background knowledge in the form of constraints that require a specific sum of the outputs to be constant. This is achieved by conditioning the prior distribution on the constraint fulfillment. The approach allows for both linear and nonlin… ▽ More

    Submitted 1 February, 2023; v1 submitted 3 February, 2022; originally announced February 2022.

    Journal ref: Transactions on Machine Learning Research, 2022

  27. Efficient Learning of the Parameters of Non-Linear Models using Differentiable Resampling in Particle Filters

    Authors: Conor Rosato, Vincent Beraud, Paul Horridge, Thomas B. Schön, Simon Maskell

    Abstract: It has been widely documented that the sampling and resampling steps in particle filters cannot be differentiated. The {\itshape reparameterisation trick} was introduced to allow the sampling step to be reformulated into a differentiable function. We extend the {\itshape reparameterisation trick} to include the stochastic input to resampling therefore limiting the discontinuities in the gradient c… ▽ More

    Submitted 27 April, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 35 pages, 10 figures

  28. arXiv:2110.11948  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Proposals for Practical Energy-Based Regression

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression. However, energy-based regression requires a proposal distribution to be manually designed for training, and an initial estimate has to be provided at test-time. We address both of these issues by introducing a conceptually simple metho… ▽ More

    Submitted 7 November, 2023; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: AISTATS 2022. Code is available at https://github.com/fregu856/ebms_proposals

  29. arXiv:2107.02259  [pdf, other

    cs.CV cs.AI cs.LG

    VolNet: Estimating Human Body Part Volumes from a Single RGB Image

    Authors: Fabian Leinen, Vittorio Cozzolino, Torsten Schön

    Abstract: Human body volume estimation from a single RGB image is a challenging problem despite minimal attention from the research community. However VolNet, an architecture leveraging 2D and 3D pose estimation, body part segmentation and volume regression extracted from a single 2D RGB image combined with the subject's body height can be used to estimate the total body volume. VolNet is designed to predic… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

  30. arXiv:2106.02328  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Temporally coherent video anonymization through GAN inpainting

    Authors: Thangapavithraa Balaji, Patrick Blies, Georg Göri, Raphael Mitsch, Marcel Wasserer, Torsten Schön

    Abstract: This work tackles the problem of temporally coherent face anonymization in natural video streams.We propose JaGAN, a two-stage system starting with detecting and masking out faces with black image patches in all individual frames of the video. The second stage leverages a privacy-preserving Video Generative Adversarial Network designed to inpaint the missing image patches with artificially generat… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: Preprint of our FG2021 submission

  31. arXiv:2104.13853  [pdf, other

    cs.LG eess.SY

    Learning deep autoregressive models for hierarchical data

    Authors: Carl R. Andersson, Niklas Wahlström, Thomas B. Schön

    Abstract: We propose a model for hierarchical structured data as an extension to the stochastic temporal convolutional network. The proposed model combines an autoregressive model with a hierarchical variational autoencoder and downsampling to achieve superior computational complexity. We evaluate the proposed model on two different types of sequential data: speech and handwritten text. The results are prom… ▽ More

    Submitted 1 July, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

  32. arXiv:2103.04727  [pdf, other

    cs.LG cs.CV cs.RO

    Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement Learning

    Authors: Patrick Wenzel, Torsten Schön, Laura Leal-Taixé, Daniel Cremers

    Abstract: Obstacle avoidance is a fundamental and challenging problem for autonomous navigation of mobile robots. In this paper, we consider the problem of obstacle avoidance in simple 3D environments where the robot has to solely rely on a single monocular camera. In particular, we are interested in solving this problem without relying on localization, map**, or planning techniques. Most of the existing… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted at 2021 IEEE International Conference on Robotics and Automation (ICRA)

  33. arXiv:2103.00930  [pdf, other

    physics.med-ph cs.CV cs.LG eess.IV eess.SY

    Unsupervised dynamic modeling of medical image transformation

    Authors: Niklas Gunnarsson, Peter Kimstrand, Jens Sjölund, Thomas B. Schön

    Abstract: Spatiotemporal imaging has applications in e.g. cardiac diagnostics, surgical guidance, and radiotherapy monitoring, In this paper, we explain the temporal motion by identifying the underlying dynamics, only based on the sequential images. Our dynamical model maps the inputs of observed high-dimensional sequential images to a low-dimensional latent space wherein a linear relationship between a hid… ▽ More

    Submitted 7 November, 2022; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: published in 2022 25th International Conference on Information Fusion (FUSION)

  34. arXiv:2102.10880  [pdf, other

    cs.LG

    A Probabilistically Motivated Learning Rate Adaptation for Stochastic Optimization

    Authors: Filip de Roos, Carl Jidling, Adrian Wills, Thomas Schön, Philipp Hennig

    Abstract: Machine learning practitioners invest significant manual and computational resources in finding suitable learning rates for optimization algorithms. We provide a probabilistic motivation, in terms of Gaussian inference, for popular stochastic first-order methods. As an important special case, it recovers the Polyak step with a general metric. The inference allows us to relate the learning rate to… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

  35. arXiv:2102.07757  [pdf, other

    eess.IV cs.LG eess.SP

    How Convolutional Neural Networks Deal with Aliasing

    Authors: Antônio H. Ribeiro, Thomas B. Schön

    Abstract: The convolutional neural network (CNN) remains an essential tool in solving computer vision problems. Standard convolutional architectures consist of stacked layers of operations that progressively downscale the image. Aliasing is a well-known side-effect of downsampling that may take place: it causes high-frequency components of the original signal to become indistinguishable from its low-frequen… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: To appear in the 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  36. arXiv:2012.07269  [pdf, ps, other

    stat.ML cs.LG

    Variational State and Parameter Estimation

    Authors: Jarrad Courts, Johannes Hendriks, Adrian Wills, Thomas Schön, Brett Ninness

    Abstract: This paper considers the problem of computing Bayesian estimates of both states and model parameters for nonlinear state-space models. Generally, this problem does not have a tractable solution and approximations must be utilised. In this work, a variational approach is used to provide an assumed density which approximates the desired, intractable, distribution. The approach is deterministic and r… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

  37. arXiv:2012.06341  [pdf, other

    cs.LG eess.SY stat.ML

    Beyond Occam's Razor in System Identification: Double-Descent when Modeling Dynamics

    Authors: Antônio H. Ribeiro, Johannes N. Hendriks, Adrian G. Wills, Thomas B. Schön

    Abstract: System identification aims to build models of dynamical systems from data. Traditionally, choosing the model requires the designer to balance between two goals of conflicting nature; the model must be rich enough to capture the system dynamics, but not so flexible that it learns spurious random effects from the dataset. It is typically observed that the model validation performance follows a U-sha… ▽ More

    Submitted 6 August, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: To appear in the Proceedings of the 19th IFAC Symposium in System Identification (2021)

  38. arXiv:2012.05072  [pdf, ps, other

    stat.ML cs.LG eess.SY stat.ME

    Variational System Identification for Nonlinear State-Space Models

    Authors: Jarrad Courts, Adrian Wills, Thomas Schön, Brett Ninness

    Abstract: This paper considers parameter estimation for nonlinear state-space models, which is an important but challenging problem. We address this challenge by employing a variational inference (VI) approach, which is a principled method that has deep connections to maximum likelihood estimation. This VI approach ultimately provides estimates of the model as solutions to an optimisation problem, which is… ▽ More

    Submitted 14 September, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

  39. arXiv:2012.04634  [pdf, other

    cs.CV cs.LG cs.RO stat.ML

    Accurate 3D Object Detection using Energy-Based Models

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: Accurate 3D object detection (3DOD) is crucial for safe navigation of complex environments by autonomous robots. Regressing accurate 3D bounding boxes in cluttered environments based on sparse LiDAR data is however a highly challenging problem. We address this task by exploring recent advances in conditional energy-based models (EBMs) for probabilistic regression. While methods employing EBMs for… ▽ More

    Submitted 7 November, 2023; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: CVPR Workshops 2021. Code is available at https://github.com/fregu856/ebms_3dod

  40. arXiv:2012.04136  [pdf, other

    cs.LG eess.SY

    Deep Energy-Based NARX Models

    Authors: Johannes N. Hendriks, Fredrik K. Gustafsson, Antônio H. Ribeiro, Adrian G. Wills, Thomas B. Schön

    Abstract: This paper is directed towards the problem of learning nonlinear ARX models based on system input--output data. In particular, our interest is in learning a conditional distribution of the current output based on a finite window of past inputs and outputs. To achieve this, we consider the use of so-called energy-based models, which have been developed in allied fields for learning unknown distribu… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  41. arXiv:2005.01698  [pdf, other

    cs.CV cs.LG cs.RO stat.ML

    How to Train Your Energy-Based Model for Regression

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Radu Timofte, Thomas B. Schön

    Abstract: Energy-based models (EBMs) have become increasingly popular within computer vision in recent years. While they are commonly employed for generative image modeling, recent work has applied EBMs also for regression tasks, achieving state-of-the-art performance on object detection and visual tracking. Training EBMs is however known to be challenging. While a variety of different techniques have been… ▽ More

    Submitted 14 August, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: BMVC 2020. Code is available at https://github.com/fregu856/ebms_regression

  42. arXiv:2003.14162  [pdf, other

    eess.SY cs.LG stat.ML

    Deep State Space Models for Nonlinear System Identification

    Authors: Daniel Gedon, Niklas Wahlström, Thomas B. Schön, Lennart Ljung

    Abstract: Deep state space models (SSMs) are an actively researched model class for temporal models developed in the deep learning community which have a close connection to classic SSMs. The use of deep SSMs as a black-box identification model can describe a wide range of dynamics due to the flexibility of deep neural networks. Additionally, the probabilistic nature of the model class allows the uncertaint… ▽ More

    Submitted 18 June, 2021; v1 submitted 31 March, 2020; originally announced March 2020.

  43. arXiv:2003.10819  [pdf, ps, other

    physics.med-ph cs.CV cs.LG eess.IV

    Registration by tracking for sequential 2D MRI

    Authors: Niklas Gunnarsson, Jens Sjölund, Thomas B. Schön

    Abstract: Our anatomy is in constant motion. With modern MR imaging it is possible to record this motion in real-time during an ongoing radiation therapy session. In this paper we present an image registration method that exploits the sequential nature of 2D MR images to estimate the corresponding displacement field. The method employs several discriminative correlation filters that independently track spec… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: Currently under review for a conference

  44. arXiv:2003.07201  [pdf, ps, other

    stat.ME cs.LG stat.ML

    The Elliptical Processes: a Family of Fat-tailed Stochastic Processes

    Authors: Maria Bånkestad, Jens Sjölund, Jalil Taghia, Thomas Schön

    Abstract: We present the elliptical processes -- a family of non-parametric probabilistic models that subsumes the Gaussian process and the Student-t process. This generalization includes a range of new fat-tailed behaviors yet retains computational tractability. We base the elliptical processes on a representation of elliptical distributions as a continuous mixture of Gaussian distributions and derive clos… ▽ More

    Submitted 2 December, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

  45. Gaussian Variational State Estimation for Nonlinear State-Space Models

    Authors: Jarrad Courts, Adrian Wills, Thomas B. Schön

    Abstract: In this paper, the problem of state estimation, in the context of both filtering and smoothing, for nonlinear state-space models is considered. Due to the nonlinear nature of the models, the state estimation problem is generally intractable as it involves integrals of general nonlinear functions and the filtered and smoothed state distributions lack closed-form solutions. As such, it is common to… ▽ More

    Submitted 1 October, 2021; v1 submitted 6 February, 2020; originally announced February 2020.

  46. arXiv:2002.01600  [pdf, other

    stat.ML cs.LG physics.comp-ph

    Linearly Constrained Neural Networks

    Authors: Johannes Hendriks, Carl Jidling, Adrian Wills, Thomas Schön

    Abstract: We present a novel approach to modelling and learning vector fields from physical systems using neural networks that explicitly satisfy known linear operator constraints. To achieve this, the target function is modelled as a linear transformation of an underlying potential field, which is in turn modelled by a neural network. This transformation is chosen such that any prediction of the target fun… ▽ More

    Submitted 27 April, 2021; v1 submitted 4 February, 2020; originally announced February 2020.

  47. arXiv:1912.13143  [pdf, other

    math.OC cs.LG

    Optimistic robust linear quadratic dual control

    Authors: Jack Umenberger, Thomas B. Schon

    Abstract: Recent work by Mania et al. has proved that certainty equivalent control achieves nearly optimal regret for linear systems with quadratic costs. However, when parameter uncertainty is large, certainty equivalence cannot be relied upon to stabilize the true, unknown system. In this paper, we present a dual control strategy that attempts to combine the performance of certainty equivalence, with the… ▽ More

    Submitted 30 December, 2019; originally announced December 2019.

    Comments: Preprint submitted to L4DC 2020. 11 pages. 1 figure

  48. arXiv:1910.00463  [pdf, other

    eess.SP cs.RO eess.SY

    A Fast and Robust Algorithm for Orientation Estimation using Inertial Sensors

    Authors: Manon Kok, Thomas B. Schön

    Abstract: We present a novel algorithm for online, real-time orientation estimation. Our algorithm integrates gyroscope data and corrects the resulting orientation estimate for integration drift using accelerometer and magnetometer data. This correction is computed, at each time instance, using a single gradient descent step with fixed step length. This fixed step length results in robustness against model… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: 9 pages, 2 figures

    Journal ref: IEEE Signal Processing Letters, 2019

  49. arXiv:1909.12297  [pdf, other

    cs.LG cs.CV stat.ML

    Energy-Based Models for Deep Probabilistic Regression

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Goutam Bhat, Thomas B. Schön

    Abstract: While deep learning-based classification is generally tackled using standardized approaches, a wide variety of techniques are employed for regression. In computer vision, one particularly popular such technique is that of confidence-based regression, which entails predicting a confidence value for each input-target pair (x,y). While this approach has demonstrated impressive results, it requires im… ▽ More

    Submitted 19 July, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: ECCV 2020. Code is available at https://github.com/fregu856/ebms_regression

  50. arXiv:1909.01844  [pdf, other

    stat.ML cs.LG

    Deep kernel learning for integral measurements

    Authors: Carl Jidling, Johannes Hendriks, Thomas B. Schön, Adrian Wills

    Abstract: Deep kernel learning refers to a Gaussian process that incorporates neural networks to improve the modelling of complex functions. We present a method that makes this approach feasible for problems where the data consists of line integral measurements of the target function. The performance is illustrated on computed tomography reconstruction examples.

    Submitted 4 September, 2019; originally announced September 2019.