Search | arXiv e-print repository

Non-rigid Medical Image Registration using Physics-informed Neural Networks

Authors: Zhe Min, Zachary M. C. Baum, Shaheer U. Saeed, Mark Emberton, Dean C. Barratt, Zeike A. Taylor, Yipeng Hu

Abstract: Biomechanical modelling of soft tissue provides a non-data-driven method for constraining medical image registration, such that the estimated spatial transformation is considered biophysically plausible. This has not only been adopted in real-world clinical applications, such as the MR-to-ultrasound registration for prostate intervention of interest in this work, but also provides an explainable m… ▽ More Biomechanical modelling of soft tissue provides a non-data-driven method for constraining medical image registration, such that the estimated spatial transformation is considered biophysically plausible. This has not only been adopted in real-world clinical applications, such as the MR-to-ultrasound registration for prostate intervention of interest in this work, but also provides an explainable means of understanding the organ motion and spatial correspondence establishment. This work instantiates the recently-proposed physics-informed neural networks (PINNs) to a 3D linear elastic model for modelling prostate motion commonly encountered during transrectal ultrasound guided procedures. To overcome a widely-recognised challenge in generalising PINNs to different subjects, we propose to use PointNet as the nodal-permutation-invariant feature extractor, together with a registration algorithm that aligns point sets and simultaneously takes into account the PINN-imposed biomechanics. The proposed method has been both developed and validated in both patient-specific and multi-patient manner. △ Less

Submitted 20 February, 2023; originally announced February 2023.

Comments: IPMI 2023

arXiv:2211.15330 [pdf, other]

UAS in the Airspace: A Review on Integration, Simulation, Optimization, and Open Challenges

Authors: Euclides Carlos Pinto Neto, Derick Moreira Baum, Jorge Rady de Almeida Jr., Joao Batista Camargo Jr., Paulo Sergio Cugnasca

Abstract: Air transportation is essential for society, and it is increasing gradually due to its importance. To improve the airspace operation, new technologies are under development, such as Unmanned Aircraft Systems (UAS). In fact, in the past few years, there has been a growth in UAS numbers in segregated airspace. However, there is an interest in integrating these aircraft into the National Airspace Sys… ▽ More Air transportation is essential for society, and it is increasing gradually due to its importance. To improve the airspace operation, new technologies are under development, such as Unmanned Aircraft Systems (UAS). In fact, in the past few years, there has been a growth in UAS numbers in segregated airspace. However, there is an interest in integrating these aircraft into the National Airspace System (NAS). The UAS is vital to different industries due to its advantages brought to the airspace (e.g., efficiency). Conversely, the relationship between UAS and Air Traffic Control (ATC) needs to be well-defined due to the impacts on ATC capacity these aircraft may present. Throughout the years, this impact may be lower than it is nowadays because the current lack of familiarity in this relationship contributes to higher workload levels. Thereupon, the primary goal of this research is to present a comprehensive review of the advancements in the integration of UAS in the National Airspace System (NAS) from different perspectives. We consider the challenges regarding simulation, final approach, and optimization of problems related to the interoperability of such systems in the airspace. Finally, we identify several open challenges in the field based on the existing state-of-the-art proposals. △ Less

Submitted 24 November, 2022; originally announced November 2022.

arXiv:2210.15371 [pdf]

Meta-Learning Initializations for Interactive Medical Image Registration

Authors: Zachary M. C. Baum, Yipeng Hu, Dean Barratt

Abstract: We present a meta-learning framework for interactive medical image registration. Our proposed framework comprises three components: a learning-based medical image registration algorithm, a form of user interaction that refines registration at inference, and a meta-learning protocol that learns a rapidly adaptable network initialization. This paper describes a specific algorithm that implements the… ▽ More We present a meta-learning framework for interactive medical image registration. Our proposed framework comprises three components: a learning-based medical image registration algorithm, a form of user interaction that refines registration at inference, and a meta-learning protocol that learns a rapidly adaptable network initialization. This paper describes a specific algorithm that implements the registration, interaction and meta-learning protocol for our exemplar clinical application: registration of magnetic resonance (MR) imaging to interactively acquired, sparsely-sampled transrectal ultrasound (TRUS) images. Our approach obtains comparable registration error (4.26 mm) to the best-performing non-interactive learning-based 3D-to-3D method (3.97 mm) while requiring only a fraction of the data, and occurring in real-time during acquisition. Applying sparsely sampled data to non-interactive methods yields higher registration errors (6.26 mm), demonstrating the effectiveness of interactive MR-TRUS registration, which may be applied intraoperatively given the real-time nature of the adaptation process. △ Less

Submitted 27 October, 2022; originally announced October 2022.

Comments: 11 pages, 10 figures. Paper accepted to IEEE Transactions on Medical Imaging (October 26 2022)

arXiv:2209.07196 [pdf, other]

doi 10.1109/WIFS55849.2022.9975411

Environment Classification via Blind Roomprints Estimation

Authors: Malte Baum, Luca Cuccovillo, Artem Yaroshchuk, Patrick Aichroth

Abstract: In this paper we present a novel approach for environment classification for speech recordings, which does not require the selection of decaying reverberation tails. It is based on a multi-band RT60 analysis of blind channel estimates and achieves an accuracy of up to 93.6% on test recordings derived from the ACE corpus. In this paper we present a novel approach for environment classification for speech recordings, which does not require the selection of decaying reverberation tails. It is based on a multi-band RT60 analysis of blind channel estimates and achieves an accuracy of up to 93.6% on test recordings derived from the ACE corpus. △ Less

Submitted 26 January, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

Journal ref: in IEEE International Workshop on Information Forensics and Security (WIFS), December 12-16, 2022, Shanghai, China, pp.1-6

arXiv:2207.10998 [pdf]

Rapid Lung Ultrasound COVID-19 Severity Scoring with Resource-Efficient Deep Feature Extraction

Authors: Pierre Raillard, Lorenzo Cristoni, Andrew Walden, Roberto Lazzari, Thomas Pulimood, Louis Grandjean, Claudia AM Gandini Wheeler-Kingshott, Yipeng Hu, Zachary MC Baum

Abstract: Artificial intelligence-based analysis of lung ultrasound imaging has been demonstrated as an effective technique for rapid diagnostic decision support throughout the COVID-19 pandemic. However, such techniques can require days- or weeks-long training processes and hyper-parameter tuning to develop intelligent deep learning image analysis models. This work focuses on leveraging 'off-the-shelf' pre… ▽ More Artificial intelligence-based analysis of lung ultrasound imaging has been demonstrated as an effective technique for rapid diagnostic decision support throughout the COVID-19 pandemic. However, such techniques can require days- or weeks-long training processes and hyper-parameter tuning to develop intelligent deep learning image analysis models. This work focuses on leveraging 'off-the-shelf' pre-trained models as deep feature extractors for scoring disease severity with minimal training time. We propose using pre-trained initializations of existing methods ahead of simple and compact neural networks to reduce reliance on computational capacity. This reduction of computational capacity is of critical importance in time-limited or resource-constrained circumstances, such as the early stages of a pandemic. On a dataset of 49 patients, comprising over 20,000 images, we demonstrate that the use of existing methods as feature extractors results in the effective classification of COVID-19-related pneumonia severity while requiring only minutes of training time. Our methods can achieve an accuracy of over 0.93 on a 4-level severity score scale and provides comparable per-patient region and global scores compared to expert annotated ground truths. These results demonstrate the capability for rapid deployment and use of such minimally-adapted methods for progress monitoring, patient stratification and management in clinical practice for COVID-19 patients, and potentially in other respiratory diseases. △ Less

Submitted 22 July, 2022; originally announced July 2022.

Comments: Accepted to ASMUS 2022 Workshop at MICCAI

arXiv:2203.14258 [pdf, other]

doi 10.1016/j.media.2022.102427

Image quality assessment for machine learning tasks using meta-reinforcement learning

Authors: Shaheer U. Saeed, Yunguan Fu, Vasilis Stavrinides, Zachary M. C. Baum, Qianye Yang, Mirabela Rusu, Richard E. Fan, Geoffrey A. Sonn, J. Alison Noble, Dean C. Barratt, Yipeng Hu

Abstract: In this paper, we consider image quality assessment (IQA) as a measure of how images are amenable with respect to a given downstream task, or task amenability. When the task is performed using machine learning algorithms, such as a neural-network-based task predictor for image classification or segmentation, the performance of the task predictor provides an objective estimate of task amenability.… ▽ More In this paper, we consider image quality assessment (IQA) as a measure of how images are amenable with respect to a given downstream task, or task amenability. When the task is performed using machine learning algorithms, such as a neural-network-based task predictor for image classification or segmentation, the performance of the task predictor provides an objective estimate of task amenability. In this work, we use an IQA controller to predict the task amenability which, itself being parameterised by neural networks, can be trained simultaneously with the task predictor. We further develop a meta-reinforcement learning framework to improve the adaptability for both IQA controllers and task predictors, such that they can be fine-tuned efficiently on new datasets or meta-tasks. We demonstrate the efficacy of the proposed task-specific, adaptable IQA approach, using two clinical applications for ultrasound-guided prostate intervention and pneumonia detection on X-ray images. △ Less

Submitted 27 March, 2022; originally announced March 2022.

Comments: Accepted to Medical Image Analysis; Final published version available at: https://doi.org/10.1016/j.media.2022.102427

Journal ref: Medical Image Analysis, Volume 78, 2022, 102427, ISSN 1361-8415

arXiv:2202.09798 [pdf, other]

doi 10.59275/j.melba.2022-a1cc

Image quality assessment by overlap** task-specific and task-agnostic measures: application to prostate multiparametric MR images for cancer segmentation

Authors: Shaheer U. Saeed, Wen Yan, Yunguan Fu, Francesco Giganti, Qianye Yang, Zachary M. C. Baum, Mirabela Rusu, Richard E. Fan, Geoffrey A. Sonn, Mark Emberton, Dean C. Barratt, Yipeng Hu

Abstract: Image quality assessment (IQA) in medical imaging can be used to ensure that downstream clinical tasks can be reliably performed. Quantifying the impact of an image on the specific target tasks, also named as task amenability, is needed. A task-specific IQA has recently been proposed to learn an image-amenability-predicting controller simultaneously with a target task predictor. This allows for th… ▽ More Image quality assessment (IQA) in medical imaging can be used to ensure that downstream clinical tasks can be reliably performed. Quantifying the impact of an image on the specific target tasks, also named as task amenability, is needed. A task-specific IQA has recently been proposed to learn an image-amenability-predicting controller simultaneously with a target task predictor. This allows for the trained IQA controller to measure the impact an image has on the target task performance, when this task is performed using the predictor, e.g. segmentation and classification neural networks in modern clinical applications. In this work, we propose an extension to this task-specific IQA approach, by adding a task-agnostic IQA based on auto-encoding as the target task. Analysing the intersection between low-quality images, deemed by both the task-specific and task-agnostic IQA, may help to differentiate the underpinning factors that caused the poor target task performance. For example, common imaging artefacts may not adversely affect the target task, which would lead to a low task-agnostic quality and a high task-specific quality, whilst individual cases considered clinically challenging, which can not be improved by better imaging equipment or protocols, is likely to result in a high task-agnostic quality but a low task-specific quality. We first describe a flexible reward sha** strategy which allows for the adjustment of weighting between task-agnostic and task-specific quality scoring. Furthermore, we evaluate the proposed algorithm using a clinically challenging target task of prostate tumour segmentation on multiparametric magnetic resonance (mpMR) images, from 850 patients. The proposed reward sha** strategy, with appropriately weighted task-specific and task-agnostic qualities, successfully identified samples that need re-acquisition due to defected imaging process. △ Less

Submitted 20 February, 2022; originally announced February 2022.

Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://www.melba-journal.org

arXiv:2109.05023 [pdf]

Real-time multimodal image registration with partial intraoperative point-set data

Authors: Zachary M C Baum, Yipeng Hu, Dean C Barratt

Abstract: We present Free Point Transformer (FPT) - a deep neural network architecture for non-rigid point-set registration. Consisting of two modules, a global feature extraction module and a point transformation module, FPT does not assume explicit constraints based on point vicinity, thereby overcoming a common requirement of previous learning-based point-set registration methods. FPT is designed to acce… ▽ More We present Free Point Transformer (FPT) - a deep neural network architecture for non-rigid point-set registration. Consisting of two modules, a global feature extraction module and a point transformation module, FPT does not assume explicit constraints based on point vicinity, thereby overcoming a common requirement of previous learning-based point-set registration methods. FPT is designed to accept unordered and unstructured point-sets with a variable number of points and uses a "model-free" approach without heuristic constraints. Training FPT is flexible and involves minimizing an intuitive unsupervised loss function, but supervised, semi-supervised, and partially- or weakly-supervised training are also supported. This flexibility makes FPT amenable to multimodal image registration problems where the ground-truth deformations are difficult or impossible to measure. In this paper, we demonstrate the application of FPT to non-rigid registration of prostate magnetic resonance (MR) imaging and sparsely-sampled transrectal ultrasound (TRUS) images. The registration errors were 4.71 mm and 4.81 mm for complete TRUS imaging and sparsely-sampled TRUS imaging, respectively. The results indicate superior accuracy to the alternative rigid and non-rigid registration algorithms tested and substantially lower computation time. The rapid inference possible with FPT makes it particularly suitable for applications where real-time registration is beneficial. △ Less

Submitted 20 September, 2021; v1 submitted 10 September, 2021; originally announced September 2021.

Comments: Accepted manuscript in Medical Image Analysis

arXiv:2108.03138 [pdf]

Lung Ultrasound Segmentation and Adaptation between COVID-19 and Community-Acquired Pneumonia

Authors: Harry Mason, Lorenzo Cristoni, Andrew Walden, Roberto Lazzari, Thomas Pulimood, Louis Grandjean, Claudia AM Gandini Wheeler-Kingshott, Yipeng Hu, Zachary MC Baum

Abstract: Lung ultrasound imaging has been shown effective in detecting typical patterns for interstitial pneumonia, as a point-of-care tool for both patients with COVID-19 and other community-acquired pneumonia (CAP). In this work, we focus on the hyperechoic B-line segmentation task. Using deep neural networks, we automatically outline the regions that are indicative of pathology-sensitive artifacts and t… ▽ More Lung ultrasound imaging has been shown effective in detecting typical patterns for interstitial pneumonia, as a point-of-care tool for both patients with COVID-19 and other community-acquired pneumonia (CAP). In this work, we focus on the hyperechoic B-line segmentation task. Using deep neural networks, we automatically outline the regions that are indicative of pathology-sensitive artifacts and their associated sonographic patterns. With a real-world data-scarce scenario, we investigate approaches to utilize both COVID-19 and CAP lung ultrasound data to train the networks; comparing fine-tuning and unsupervised domain adaptation. Segmenting either type of lung condition at inference may support a range of clinical applications during evolving epidemic stages, but also demonstrates value in resource-constrained clinical scenarios. Adapting real clinical data acquired from COVID-19 patients to those from CAP patients significantly improved Dice scores from 0.60 to 0.87 (p < 0.001) and from 0.43 to 0.71 (p < 0.001), on independent COVID-19 and CAP test cases, respectively. It is of practical value that the improvement was demonstrated with only a small amount of data in both training and adaptation data sets, a common constraint for deploying machine learning models in clinical practice. Interestingly, we also report that the inverse adaptation, from labelled CAP data to unlabeled COVID-19 data, did not demonstrate an improvement when tested on either condition. Furthermore, we offer a possible explanation that correlates the segmentation performance to label consistency and data domain diversity in this point-of-care lung ultrasound application. △ Less

Submitted 6 August, 2021; originally announced August 2021.

Comments: Accepted to MICCAI ASMUS Workshop

arXiv:2106.06430 [pdf, ps, other]

Continuous Herded Gibbs Sampling

Authors: Laura M. Wolf, Marcus Baum

Abstract: Herding is a technique to sequentially generate deterministic samples from a probability distribution. In this work, we propose a continuous herded Gibbs sampler that combines kernel herding on continuous densities with the Gibbs sampling idea. Our algorithm allows for deterministically sampling from high-dimensional multivariate probability densities, without directly sampling from the joint dens… ▽ More Herding is a technique to sequentially generate deterministic samples from a probability distribution. In this work, we propose a continuous herded Gibbs sampler that combines kernel herding on continuous densities with the Gibbs sampling idea. Our algorithm allows for deterministically sampling from high-dimensional multivariate probability densities, without directly sampling from the joint density. Experiments with Gaussian mixture densities indicate that the L2 error decreases similarly to kernel herding, while the computation time is significantly lower, i.e., linear in the number of dimensions. △ Less

Submitted 13 January, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

Comments: 6 pages, 7 figures submitted to 2021 IEEE 24th International Conference on Information Fusion (FUSION)

arXiv:2011.02580 [pdf, ps, other]

doi 10.21105/joss.02705

DeepReg: a deep learning toolkit for medical image registration

Authors: Yunguan Fu, Nina Montaña Brown, Shaheer U. Saeed, Adrià Casamitjana, Zachary M. C. Baum, Rémi Delaunay, Qianye Yang, Alexander Grimwood, Zhe Min, Stefano B. Blumberg, Juan Eugenio Iglesias, Dean C. Barratt, Ester Bonmati, Daniel C. Alexander, Matthew J. Clarkson, Tom Vercauteren, Yipeng Hu

Abstract: DeepReg (https://github.com/DeepRegNet/DeepReg) is a community-supported open-source toolkit for research and education in medical image registration using deep learning. DeepReg (https://github.com/DeepRegNet/DeepReg) is a community-supported open-source toolkit for research and education in medical image registration using deep learning. △ Less

Submitted 4 November, 2020; originally announced November 2020.

Comments: Accepted in The Journal of Open Source Software (JOSS)

arXiv:2009.01924 [pdf, other]

Introduction to Medical Image Registration with DeepReg, Between Old and New

Authors: N. Montana Brown, Y. Fu, S. U. Saeed, A. Casamitjana, Z. M. C. Baum, R. Delaunay, Q. Yang, A. Grimwood, Z. Min, E. Bonmati, T. Vercauteren, M. J. Clarkson, Y. Hu

Abstract: This document outlines a tutorial to get started with medical image registration using the open-source package DeepReg. The basic concepts of medical image registration are discussed, linking classical methods to newer methods using deep learning. Two iterative, classical algorithms using optimisation and one learning-based algorithm using deep learning are coded step-by-step using DeepReg utiliti… ▽ More This document outlines a tutorial to get started with medical image registration using the open-source package DeepReg. The basic concepts of medical image registration are discussed, linking classical methods to newer methods using deep learning. Two iterative, classical algorithms using optimisation and one learning-based algorithm using deep learning are coded step-by-step using DeepReg utilities, all with real, open-accessible, medical data. △ Less

Submitted 7 September, 2020; v1 submitted 29 August, 2020; originally announced September 2020.

Comments: Submitted to MICCAI Educational Challenge 2020

arXiv:2008.08840 [pdf]

Image quality assessment for closed-loop computer-assisted lung ultrasound

Authors: Zachary M C Baum, Ester Bonmati, Lorenzo Cristoni, Andrew Walden, Ferran Prados, Baris Kanber, Dean C Barratt, David J Hawkes, Geoffrey J M Parker, Claudia A M Gandini Wheeler-Kingshott, Yipeng Hu

Abstract: We describe a novel, two-stage computer assistance system for lung anomaly detection using ultrasound imaging in the intensive care setting to improve operator performance and patient stratification during coronavirus pandemics. The proposed system consists of two deep-learning-based models: a quality assessment module that automates predictions of image quality, and a diagnosis assistance module… ▽ More We describe a novel, two-stage computer assistance system for lung anomaly detection using ultrasound imaging in the intensive care setting to improve operator performance and patient stratification during coronavirus pandemics. The proposed system consists of two deep-learning-based models: a quality assessment module that automates predictions of image quality, and a diagnosis assistance module that determines the likelihood-oh-anomaly in ultrasound images of sufficient quality. Our two-stage strategy uses a novelty detection algorithm to address the lack of control cases available for training the quality assessment classifier. The diagnosis assistance module can then be trained with data that are deemed of sufficient quality, guaranteed by the closed-loop feedback mechanism from the quality assessment module. Using more than 25000 ultrasound images from 37 COVID-19-positive patients scanned at two hospitals, plus 12 control cases, this study demonstrates the feasibility of using the proposed machine learning approach. We report an accuracy of 86% when classifying between sufficient and insufficient quality images by the quality assessment module. For data of sufficient quality - as determined by the quality assessment module - the mean classification accuracy, sensitivity, and specificity in detecting COVID-19-positive cases were 0.95, 0.91, and 0.97, respectively, across five holdout test data sets unseen during the training of any networks within the proposed system. Overall, the integration of the two modules yields accurate, fast, and practical acquisition guidance and diagnostic assistance for patients with suspected respiratory conditions at point-of-care. △ Less

Submitted 18 January, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

Comments: 7 pages, 3 figures - Accepted to SPIE Medical Imaging 2021

arXiv:2008.01885 [pdf]

Multimodality Biomedical Image Registration using Free Point Transformer Networks

Authors: Zachary M. C. Baum, Yipeng Hu, Dean C. Barratt

Abstract: We describe a point-set registration algorithm based on a novel free point transformer (FPT) network, designed for points extracted from multimodal biomedical images for registration tasks, such as those frequently encountered in ultrasound-guided interventional procedures. FPT is constructed with a global feature extractor which accepts unordered source and target point-sets of variable size. The… ▽ More We describe a point-set registration algorithm based on a novel free point transformer (FPT) network, designed for points extracted from multimodal biomedical images for registration tasks, such as those frequently encountered in ultrasound-guided interventional procedures. FPT is constructed with a global feature extractor which accepts unordered source and target point-sets of variable size. The extracted features are conditioned by a shared multilayer perceptron point transformer module to predict a displacement vector for each source point, transforming it into the target space. The point transformer module assumes no vicinity or smoothness in predicting spatial transformation and, together with the global feature extractor, is trained in a data-driven fashion with an unsupervised loss function. In a multimodal registration task using prostate MR and sparsely acquired ultrasound images, FPT yields comparable or improved results over other rigid and non-rigid registration methods. This demonstrates the versatility of FPT to learn registration directly from real, clinical training data and to generalize to a challenging task, such as the interventional application presented. △ Less

Submitted 4 August, 2020; originally announced August 2020.

Comments: 10 pages, 4 figures. Accepted for publication at International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) workshop on Advances in Simplifying Medical UltraSound (ASMUS) 2020

ACM Class: I.2.6

arXiv:2006.14873 [pdf, other]

Simulation-based Analysis of Multipath Delay Distributions in Urban Canyons

Authors: Simon Ollander, Friedrich-Wilhelm Bode, Marcus Baum

Abstract: Global navigation satellite systems provide accurate positioning nearly worldwide. However, in the urban canyons of dense cities, buildings block and reflect the signals, causing multipath errors. To mitigate multipath errors, knowledge of the distribution of the reflection delays is important. Measurements of this distribution have been done in several dense cities, but it is unknown how the dela… ▽ More Global navigation satellite systems provide accurate positioning nearly worldwide. However, in the urban canyons of dense cities, buildings block and reflect the signals, causing multipath errors. To mitigate multipath errors, knowledge of the distribution of the reflection delays is important. Measurements of this distribution have been done in several dense cities, but it is unknown how the delay distribution depends on the depth of the urban canyon. To fill this gap, we simulated reflection scenarios in 12 different environments: from suburban to deep urban canyon. Subsequently, we analyzed the resulting delay distributions. This paper presents these distributions, and a method to estimate them using the number of received satellites. According to our simulation, the multipath delays follow gamma distributions, whose shape parameters decrease when the urban canyon depth increases. A quadratic model can estimate the shape parameter using the number of received satellites. Consequently, depending on the number of received satellites, the distribution of the reflection delays can be estimated. This information can be combined with prior knowledge from other methods for improved multipath delay estimation. In the future, for more realistic results, the effects on signals that are reflected multiple times and environments other than urban canyons should be simulated. △ Less

Submitted 5 October, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

Comments: 10 pages, 10 figures, to be published in 2020 European Navigation Conference

arXiv:1904.00708 [pdf, other]

Optimal Fusion of Elliptic Extended Target Estimates based on the Wasserstein Distance

Authors: Kolja Thormann, Marcus Baum

Abstract: This paper considers the fusion of multiple estimates of a spatially extended object, where the object extent is modeled as an ellipse parameterized by the orientation and semiaxes lengths. For this purpose, we propose a novel systematic approach that employs a distance measure for ellipses, i.e., the Gaussian Wasserstein distance, as a cost function. We derive an explicit approximate expression f… ▽ More This paper considers the fusion of multiple estimates of a spatially extended object, where the object extent is modeled as an ellipse parameterized by the orientation and semiaxes lengths. For this purpose, we propose a novel systematic approach that employs a distance measure for ellipses, i.e., the Gaussian Wasserstein distance, as a cost function. We derive an explicit approximate expression for the Minimum Mean Gaussian Wasserstein distance (MMGW) estimate. Based on the concept of a MMGW estimator, we develop efficient methods for the fusion of extended target estimates. The proposed fusion methods are evaluated in a simulated experiment and the benefits of the novel methods are discussed. △ Less

Submitted 8 October, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

arXiv:1805.03276 [pdf, other]

doi 10.1109/TSP.2019.2929462

Tracking the Orientation and Axes Lengths of an Elliptical Extended Object

Authors: Shishan Yang, Marcus Baum

Abstract: Extended object tracking considers the simultaneous estimation of the kinematic state and the shape parameters of a moving object based on a varying number of noisy detections. A main challenge in extended object tracking is the nonlinearity and high-dimensionality of the estimation problem. This work presents compact closed-form expressions for a recursive Kalman filter that explicitly estimates… ▽ More Extended object tracking considers the simultaneous estimation of the kinematic state and the shape parameters of a moving object based on a varying number of noisy detections. A main challenge in extended object tracking is the nonlinearity and high-dimensionality of the estimation problem. This work presents compact closed-form expressions for a recursive Kalman filter that explicitly estimates the orientation and axes lengths of an extended object based on detections that are scattered over the object surface (according to a Gaussian distribution). Existing approaches are either based on Monte Carlo approximations or do not allow for explicitly maintaining all ellipse parameters. The performance of the novel approach is demonstrated with respect to the state-of-the-art by means of simulations. △ Less

Submitted 3 September, 2019; v1 submitted 8 May, 2018; originally announced May 2018.

Comments: This is the accepted version (not the IEEEpublished version). \c{opyright} 20XX IEEE

Journal ref: IEEE Transactions on Signal Processing, Issue:18 (2019) 4720-4729

arXiv:1604.00970 [pdf, other]

Extended Object Tracking: Introduction, Overview and Applications

Authors: Karl Granstrom, Marcus Baum, Stephan Reuter

Abstract: This article provides an elaborate overview of current research in extended object tracking. We provide a clear definition of the extended object tracking problem and discuss its delimitation to other types of object tracking. Next, different aspects of extended object modelling are extensively discussed. Subsequently, we give a tutorial introduction to two basic and well used extended object trac… ▽ More This article provides an elaborate overview of current research in extended object tracking. We provide a clear definition of the extended object tracking problem and discuss its delimitation to other types of object tracking. Next, different aspects of extended object modelling are extensively discussed. Subsequently, we give a tutorial introduction to two basic and well used extended object tracking approaches - the random matrix approach and the Kalman filter-based approach for star-convex shapes. The next part treats the tracking of multiple extended objects and elaborates how the large number of feasible association hypotheses can be tackled using both Random Finite Set (RFS) and Non-RFS multi-object trackers. The article concludes with a summary of current applications, where four example applications involving camera, X-band radar, light detection and ranging (lidar), red-green-blue-depth (RGB-D) sensors are highlighted. △ Less

Submitted 21 February, 2017; v1 submitted 14 March, 2016; originally announced April 2016.

Comments: 30 pages, 19 figures

Journal ref: Journal of Advances in Information Fusion, Volume 12, Number 2, Pages 139-174, December 2016, ISSN 1557-6418

arXiv:1604.00219 [pdf, other]

Second-Order Extended Kalman Filter for Extended Object and Group Tracking

Authors: Shishan Yang, Marcus Baum

Abstract: In this paper, we propose a novel method for estimating an elliptic shape approximation of a moving extended object that gives rise to multiple scattered measurements per frame. For this purpose, we parameterize the elliptic shape with its orientation and the lengths of the semi-axes. We relate an individual measurement with the ellipse parameters by means of a multiplicative noise model and deriv… ▽ More In this paper, we propose a novel method for estimating an elliptic shape approximation of a moving extended object that gives rise to multiple scattered measurements per frame. For this purpose, we parameterize the elliptic shape with its orientation and the lengths of the semi-axes. We relate an individual measurement with the ellipse parameters by means of a multiplicative noise model and derive a second-order extended Kalman filter for a closed-form recursive measurement update. The benefits of the new method are discussed by means of Monte Carlo simulations for both static and dynamic scenarios. △ Less

Submitted 25 May, 2018; v1 submitted 1 April, 2016; originally announced April 2016.

arXiv:1304.5084 [pdf, other]

Extended Object Tracking with Random Hypersurface Models

Authors: Marcus Baum, Uwe D. Hanebeck

Abstract: The Random Hypersurface Model (RHM) is introduced that allows for estimating a shape approximation of an extended object in addition to its kinematic state. An RHM represents the spatial extent by means of randomly scaled versions of the shape boundary. In doing so, the shape parameters and the measurements are related via a measurement equation that serves as the basis for a Gaussian state estima… ▽ More The Random Hypersurface Model (RHM) is introduced that allows for estimating a shape approximation of an extended object in addition to its kinematic state. An RHM represents the spatial extent by means of randomly scaled versions of the shape boundary. In doing so, the shape parameters and the measurements are related via a measurement equation that serves as the basis for a Gaussian state estimator. Specific estimators are derived for elliptic and star-convex shapes. △ Less

Submitted 18 April, 2013; originally announced April 2013.

Comments: Draft accepted for publication in IEEE Transactions on Aerospace and Electronic Systems

arXiv:1212.5882 [pdf, other]

The Kernel-SME Filter for Multiple Target Tracking

Authors: Marcus Baum, Uwe D. Hanebeck

Abstract: We present a novel method called Kernel-SME filter for tracking multiple targets when the association of the measurements to the targets is unknown. The method is a further development of the Symmetric Measurement Equation (SME) filter, which removes the data association uncertainty of the original measurement equation with the help of a symmetric transformation. The underlying idea of the Kernel-… ▽ More We present a novel method called Kernel-SME filter for tracking multiple targets when the association of the measurements to the targets is unknown. The method is a further development of the Symmetric Measurement Equation (SME) filter, which removes the data association uncertainty of the original measurement equation with the help of a symmetric transformation. The underlying idea of the Kernel-SME filter is to construct a symmetric transformation by means of map** the measurements to a Gaussian mixture. This transformation is scalable to a large number of targets and allows for deriving a Gaussian state estimator that has a cubic time complexity in the number of targets. △ Less

Submitted 24 December, 2012; originally announced December 2012.

Showing 1–21 of 21 results for author: Baum, M