-
Utilizing U-Net Architectures with Auxiliary Information for Scatter Correction in CBCT Across Different Field-of-View Settings
Authors:
Harshit Agrawal,
Ari Hietanen,
Simo Särkkä
Abstract:
Cone-beam computed tomography (CBCT) has become a vital imaging technique in various medical fields but scatter artifacts are a major limitation in CBCT scanning. This challenge is exacerbated by the use of large flat panel 2D detectors. The scatter-to-primary ratio increases significantly with the increase in the size of FOV being scanned. Several deep learning methods, particularly U-Net architectures, have shown promising capabilities in estimating the scatter directly from the CBCT projections. However, the influence of varying FOV sizes on these deep learning models remains unexplored. Having a single neural network for the scatter estimation of varying FOV projections can be of significant importance towards real clinical applications. This study aims to train and evaluate the performance of a U-Net network on a simulated dataset with varying FOV sizes. We further propose a new method (Aux-Net) by providing auxiliary information, such as FOV size, to the U-Net encoder. We validate our method on 30 different FOV sizes and compare it with the U-Net. Our study demonstrates that providing auxiliary information to the network enhances the generalization capability of the U-Net. Our findings suggest that this novel approach outperforms the baseline U-Net, offering a significant step towards practical application in real clinical settings where CBCT systems are employed to scan a wide range of FOVs.
Submitted 27 February, 2024;
originally announced February 2024.
-
Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing
Authors:
Hany Abdulsamad,
Sahel Iqbal,
Adrien Corenflos,
Simo Särkkä
Abstract:
Stochastic optimal control of dynamical systems is a crucial challenge in sequential decision-making. Recently, control-as-inference approaches have had considerable success, providing a viable risk-sensitive framework to address the exploration-exploitation dilemma. Nonetheless, a majority of these techniques only invoke the inference-control duality to derive a modified risk objective that is then addressed within a reinforcement learning framework. This paper introduces a novel perspective by framing risk-sensitive stochastic control as Markovian score climbing under samples drawn from a conditional particle filter. Our approach, while purely inference-centric, provides asymptotically unbiased estimates for gradient-based policy optimization with optimal importance weighting and no explicit value function learning. To validate our methodology, we apply it to the task of learning neural non-Gaussian feedback policies, showcasing its efficacy on numerical benchmarks of stochastic dynamical systems.
Submitted 21 December, 2023;
originally announced December 2023.
-
A Recursive Newton Method for Smoothing in Nonlinear State Space Models
Authors:
Fatemeh Yaghoobi,
Hany Abdulsamad,
Simo Särkkä
Abstract:
In this paper, we use the optimization formulation of nonlinear Kalman filtering and smoothing problems to develop second-order variants of iterated Kalman smoother (IKS) methods. We show that Newton's method corresponds to a recursion over affine smoothing problems on a modified state-space model augmented by a pseudo measurement. The first and second derivatives required in this approach can be efficiently computed with widely available automatic differentiation tools. Furthermore, we show how to incorporate line-search and trust-region strategies into the proposed second-order IKS algorithm in order to regularize updates between iterations. Finally, we provide numerical examples to demonstrate the method's efficiency in terms of runtime compared to its batch counterpart.
Submitted 15 June, 2023;
originally announced June 2023.
-
Deep learning based projection domain metal segmentation for metal artifact reduction in cone beam computed tomography
Authors:
Harshit Agrawal,
Ari Hietanen,
Simo Särkkä
Abstract:
Metal artifact correction is a challenging problem in cone beam computed tomography (CBCT) scanning. Metal implants inserted into the anatomy cause severe artifacts in reconstructed images. Widely used inpainting-based metal artifact reduction (MAR) methods require segmentation of metal traces in the projections as a first step, which is a challenging task. One approach is to use a deep learning method to segment metals in the projections. However, the success of deep learning methods is limited by the availability of realistic training data. It is laborious and time consuming to get reliable ground truth annotations due to unclear implant boundaries and large numbers of projections. We propose to use X-ray simulations to generate a synthetic metal segmentation training dataset from clinical CBCT scans. We compare the effect of simulations with different numbers of photons and also compare several training strategies to augment the available data. We compare our model's performance on real clinical scans with conventional region growing threshold-based MAR, moving metal artifact reduction method, and a recent deep learning method. We show that simulations with a relatively small number of photons are suitable for the metal segmentation task and that training the deep learning model with full size and cropped projections together improves the robustness of the model. We show substantial improvement in the image quality affected by severe motion, voxel size under-sampling, and out-of-FOV metals. Our method can be easily integrated into the existing projection-based MAR pipeline to get improved image quality. This method can provide a novel paradigm to accurately segment metals in CBCT projections.
Submitted 9 October, 2023; v1 submitted 17 August, 2022;
originally announced August 2022.
-
Probabilistic Estimation of Instantaneous Frequencies of Chirp Signals
Authors:
Zheng Zhao,
Simo Särkkä,
Jens Sjölund,
Thomas B. Schön
Abstract:
We present a continuous-time probabilistic approach for estimating the chirp signal and its instantaneous frequency function when the true forms of these functions are not accessible. Our model represents these functions by non-linearly cascaded Gaussian processes represented as non-linear stochastic differential equations. The posterior distribution of the functions is then estimated with stochastic filters and smoothers. We compute a (posterior) Cramér--Rao lower bound for the Gaussian process model, and derive a theoretical upper bound for the estimation error in the mean squared sense. The experiments show that the proposed method outperforms a number of state-of-the-art methods on synthetic data. We also show that the method works out-of-the-box for two real-world datasets.
Submitted 13 February, 2023; v1 submitted 12 May, 2022;
originally announced May 2022.
-
Fourier--Hermite Dynamic Programming for Optimal Control
Authors:
Sakira Hassan,
Simo Särkkä
Abstract:
In this paper, we propose a novel computational method for solving non-linear optimal control problems. The method is based on the use of Fourier--Hermite series for approximating the action-value function arising in dynamic programming instead of the conventional Taylor series expansion used in differential dynamic programming (DDP). The coefficients of the Fourier--Hermite series can be numerically computed by using sigma-point methods, which leads to a novel class of sigma-point based dynamic programming methods. We also prove the quadratic convergence of the method and experimentally test its performance against other methods.
Submitted 26 November, 2022; v1 submitted 27 February, 2022;
originally announced February 2022.
-
Temporal Parallelisation of Dynamic Programming and Linear Quadratic Control
Authors:
Simo Särkkä,
Ángel F. García-Fernández
Abstract:
This paper proposes a general formulation for temporal parallelisation of dynamic programming for optimal control problems. We derive the elements and associative operators to be able to use parallel scans to solve these problems with logarithmic time complexity rather than linear time complexity. We apply this methodology to problems with finite state and control spaces, linear quadratic tracking control problems, and to a class of nonlinear control problems. The computational benefits of the parallel methods are demonstrated via numerical simulations run on a graphics processing unit.
Submitted 24 January, 2022; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Enhancing Industrial X-ray Tomography by Data-Centric Statistical Methods
Authors:
Jarkko Suuronen,
Muhammad Emzir,
Sari Lasanen,
Simo Särkkä,
Lassi Roininen
Abstract:
X-ray tomography has applications in various industrial fields such as sawmill industry, oil and gas industry, chemical engineering, and geotechnical engineering. In this article, we study Bayesian methods for the X-ray tomography reconstruction. In Bayesian methods, the inverse problem of tomographic reconstruction is solved with help of a statistical prior distribution which encodes the possible internal structures by assigning probabilities for smoothness and edge distribution of the object. We compare Gaussian random field priors, that favour smoothness, to non-Gaussian total variation, Besov, and Cauchy priors which promote sharp edges and high-contrast and low-contrast areas in the object. We also present computational schemes for solving the resulting high-dimensional Bayesian inverse problem with 100,000-1,000,000 unknowns. In particular, we study the applicability of a no-U-turn variant of Hamiltonian Monte Carlo methods and of a more classical adaptive Metropolis-within-Gibbs algorithm for this purpose. These methods also enable full uncertainty quantification of the reconstructions. For faster computations, we use maximum a posteriori estimates with limited-memory BFGS optimisation algorithm. As the first industrial application, we consider sawmill industry X-ray log tomography. The logs have knots, rotten parts, and even possibly metallic pieces, making them good examples for non-Gaussian priors. Secondly, we study drill-core rock sample tomography, an example from oil and gas industry. We show that Cauchy priors produce smaller number of artefacts than other choices, especially with sparse high-noise measurements, and choosing Hamiltonian Monte Carlo enables systematic uncertainty quantification.
Submitted 8 March, 2020;
originally announced March 2020.
-
Automated Polysomnography Analysis for Detection of Non-Apneic and Non-Hypopneic Arousals using Feature Engineering and a Bidirectional LSTM Network
Authors:
Ali Bahrami Rad,
Morteza Zabihi,
Zheng Zhao,
Moncef Gabbouj,
Aggelos K. Katsaggelos,
Simo Särkkä
Abstract:
Objective: The aim of this study is to develop an automated classification algorithm for polysomnography (PSG) recordings to detect non-apneic and non-hypopneic arousals. Our particular focus is on detecting the respiratory effort-related arousals (RERAs) which are very subtle respiratory events that do not meet the criteria for apnea or hypopnea, and are more challenging to detect. Methods: The proposed algorithm is based on a bidirectional long short-term memory (BiLSTM) classifier and 465 multi-domain features, extracted from multimodal clinical time series. The features consist of a set of physiology-inspired features (n = 75), obtained by multiple steps of feature selection and expert analysis, and a set of physiology-agnostic features (n = 390), derived from scattering transform. Results: The proposed algorithm is validated on the 2018 PhysioNet challenge dataset. The overall performance in terms of the area under the precision-recall curve (AUPRC) is 0.50 on the hidden test dataset. This result is tied for the second-best score during the follow-up and official phases of the 2018 PhysioNet challenge. Conclusions: The results demonstrate that it is possible to automatically detect subtle non-apneic/non-hypopneic arousal events from PSG recordings. Significance: Automatic detection of subtle respiratory events such as RERAs together with other non-apneic/non-hypopneic arousals will allow detailed annotations of large PSG databases. This contributes to a better retrospective analysis of sleep data, which may also improve the quality of treatment.
Submitted 6 September, 2019;
originally announced September 2019.
-
The Use of Gaussian Processes in System Identification
Authors:
Simo Särkkä
Abstract:
Gaussian processes are used in machine learning to learn input-output mappings from observed data. Gaussian process regression is based on imposing a Gaussian process prior on the unknown regressor function and statistically conditioning it on the observed data. In system identification, Gaussian processes are used to form time series prediction models such as non-linear finite-impulse response (NFIR) models as well as non-linear autoregressive (NARX) models. Gaussian process state-space models (GPSS) can be used to learn the dynamic and measurement models for a state-space representation of the input-output data. Temporal and spatio-temporal Gaussian processes can be directly used to form regressor on the data in the time domain. The aim of this article is to briefly outline the main directions in system identification methods using Gaussian processes.
Submitted 13 July, 2019;
originally announced July 2019.
-
CT Data of a Pen-Spring: Application to Under-Sampled Dynamic X-ray Tomography
Authors:
Juliaana Juurakko,
Zenith Purisha,
Simo Särkkä
Abstract:
This is the documentation of Computed Tomography (CT) data of a pen-spring. The open data set is available at https://zenodo.org/record/3266936#.XRyMdCZS9oA and can be freely used for scientific purposes with appropriate references to the data and to this document in arxiv.org. The provided data set includes the X-ray sinograms ({\tt finalSino}) of a single 2D slice from a different height of the spring. The {\tt finalSino} was obtained from a measured 10-projection or 100-projection {\tt sinogram} using fan-beam geometry by down-sampling and taking logarithms. The data set also includes those original measured {\tt sinogram}s and corresponding measurement matrices.
Submitted 3 July, 2019;
originally announced July 2019.
-
1D Convolutional Neural Network Models for Sleep Arousal Detection
Authors:
Morteza Zabihi,
Ali Bahrami Rad,
Serkan Kiranyaz,
Simo Särkkä,
Moncef Gabbouj
Abstract:
Sleep arousals transition the depth of sleep to a more superficial stage. The occurrence of such events is often considered as a protective mechanism to alert the body of harmful stimuli. Thus, accurate sleep arousal detection can lead to an enhanced understanding of the underlying causes and influencing the assessment of sleep quality. Previous studies and guidelines have suggested that sleep arousals are linked mainly to abrupt frequency shifts in EEG signals, but the proposed rules are shown to be insufficient for a comprehensive characterization of arousals. This study investigates the application of five recent convolutional neural networks (CNNs) for sleep arousal detection and performs comparative evaluations to determine the best model for this task. The investigated state-of-the-art CNN models have originally been designed for image or speech processing. A detailed set of evaluations is performed on the benchmark dataset provided by PhysioNet/Computing in Cardiology Challenge 2018, and the results show that the best 1D CNN model has achieved an average of 0.31 and 0.84 for the area under the precision-recall and area under the ROC curves, respectively.
Submitted 1 March, 2019;
originally announced March 2019.
-
Kalman-based Spectro-Temporal ECG Analysis using Deep Convolutional Networks for Atrial Fibrillation Detection
Authors:
Zheng Zhao,
Simo Särkkä,
Ali Bahrami Rad
Abstract:
In this article, we propose a novel ECG classification framework for atrial fibrillation (AF) detection using spectro-temporal representation (i.e., time varying spectrum) and deep convolutional networks. In the first step we use a Bayesian spectro-temporal representation based on the estimation of time-varying coefficients of Fourier series using Kalman filter and smoother. Next, we derive an alternative model based on a stochastic oscillator differential equation to accelerate the estimation of the spectro-temporal representation in lengthy signals. Finally, after comparative evaluations of different convolutional architectures, we propose an efficient deep convolutional neural network to classify the 2D spectro-temporal ECG data.
The ECG spectro-temporal data are classified into four different classes: AF, non-AF normal rhythm (Normal), non-AF abnormal rhythm (Other), and noisy segments (Noisy). The performance of the proposed methods is evaluated and scored with the PhysioNet/Computing in Cardiology (CinC) 2017 dataset. The experimental results show that the proposed method achieves the overall F1 score of 80.2%, which is in line with the state-of-the-art algorithms.
Submitted 12 December, 2018;
originally announced December 2018.
-
Improved Calibration of Numerical Integration Error in Sigma-Point Filters
Authors:
Jakub Prüher,
Toni Karvonen,
Chris J. Oates,
Ondřej Straka,
Simo Särkkä
Abstract:
The sigma-point filters, such as the UKF, which exploit numerical quadrature to obtain an additional order of accuracy in the moment transformation step, are popular alternatives to the ubiquitous EKF. The classical quadrature rules used in the sigma-point filters are motivated via polynomial approximation of the integrand, however in the applied context these assumptions cannot always be justified. As a result, quadrature error can introduce bias into estimated moments, for which there is no compensatory mechanism in the classical sigma-point filters. This can lead in turn to estimates and predictions that are poorly calibrated. In this article, we investigate the Bayes-Sard quadrature method in the context of sigma-point filters, which enables uncertainty due to quadrature error to be formalised within a probabilistic model. Our first contribution is to derive the well-known classical quadratures as special cases of the Bayes-Sard quadrature method. Then a general-purpose moment transform is developed and utilised in the design of novel sigma-point filters, so that uncertainty due to quadrature error is explicitly quantified. Numerical experiments on a challenging tracking example with misspecified initial conditions show that the additional uncertainty quantification built into our method leads to better-calibrated state estimates with improved RMSE.
Submitted 22 February, 2020; v1 submitted 28 November, 2018;
originally announced November 2018.
-
RSS Models for Respiration Rate Monitoring
Authors:
Hüseyin Yiğitler,
Ossi Kaltiokallio,
Roland Hostettler,
Riku Jäntti,
Neal Patwari,
Simo Särkkä
Abstract:
Received signal strength based respiration rate monitoring is emerging as an alternative non-contact technology. These systems make use of the radio measurements of short-range commodity wireless devices, which vary due to the inhalation and exhalation motion of a person. The success of respiration rate estimation using such measurements depends on the signal-to-noise ratio, which alters with properties of the person and with the measurement system. To date, no model has been presented that allows evaluation of different deployments or system configurations for successful breathing rate estimation. In this paper, a received signal strength model for respiration rate monitoring is introduced. It is shown that measurements in linear and logarithmic scale have the same functional form, and the same estimation techniques can be used in both cases. The implications of the model are validated under varying signal-to-noise ratio conditions using the performances of three estimators: batch frequency estimator, recursive Bayesian estimator, and model-based estimator. The results are in coherence with the findings, and they imply that different estimators are advantageous in different signal-to-noise ratio regimes.
Submitted 26 November, 2017;
originally announced November 2017.
-
Gaussian Process Latent Force Models for Learning and Stochastic Control of Physical Systems
Authors:
Simo Särkkä,
Mauricio A. Álvarez,
Neil D. Lawrence
Abstract:
This article is concerned with learning and stochastic control in physical systems which contain unknown input signals. These unknown signals are modeled as Gaussian processes (GP) with certain parametrized covariance structures. The resulting latent force models (LFMs) can be seen as hybrid models that contain a first-principles physical model part and a non-parametric GP model part. We briefly review the statistical inference and learning methods for this kind of models, introduce stochastic control methodology for the models, and provide new theoretical observability and controllability results for them.
Submitted 13 August, 2018; v1 submitted 15 September, 2017;
originally announced September 2017.
-
Nonlinear State Space Model Identification Using a Regularized Basis Function Expansion
Authors:
Andreas Svensson,
Thomas B. Schön,
Arno Solin,
Simo Särkkä
Abstract:
This paper is concerned with black-box identification of nonlinear state space models. By using a basis function expansion within the state space model, we obtain a flexible structure. The model is identified using an expectation maximization approach, where the states and the parameters are updated iteratively in such a way that a maximum likelihood estimate is obtained. We use recent particle methods with sound theoretical properties to infer the states, whereas the model parameters can be updated using closed-form expressions by exploiting the fact that our model is linear in the parameters. Not to over-fit the flexible model to the data, we also propose a regularization scheme without increasing the computational burden. Importantly, this opens up for systematic use of regularization in nonlinear state space models. We conclude by evaluating our proposed approach on one simulation example and two real-data problems.
Submitted 2 October, 2015;
originally announced October 2015.