-
Equivariant Multiscale Learned Invertible Reconstruction for Cone Beam CT
Authors:
Nikita Moriakov,
Jan-Jakob Sonke,
Jonas Teuwen
Abstract:
Cone Beam CT (CBCT) is an essential imaging modality nowadays, but the image quality of CBCT still lags behind the high quality standards established by the conventional Computed Tomography. We propose LIRE+, a learned iterative scheme for fast and memory-efficient CBCT reconstruction, which is a substantially faster and more parameter-efficient alternative to the recently proposed LIRE method. LIRE+ is a rotationally-equivariant multiscale learned invertible primal-dual iterative scheme for CBCT reconstruction. Memory usage is optimized by relying on simple reversible residual networks in primal/dual cells and patch-wise computations inside the cells during forward and backward passes, while increased inference speed is achieved by making the primal-dual scheme multiscale so that the reconstruction process starts at low resolution and with low resolution primal/dual latent vectors. A LIRE+ model was trained and validated on a set of 260 + 22 thorax CT scans and tested using a set of 142 thorax CT scans with additional evaluation with and without finetuning on an out-of-distribution set of 79 Head and Neck (HN) CT scans. Our method surpasses classical and deep learning baselines, including LIRE, on the thorax test set. For a similar inference time and with only 37 % of the parameter budget, LIRE+ achieves a +0.2 dB PSNR improvement over LIRE, while being able to match the performance of LIRE in 45 % less inference time and with 28 % of the parameter budget. Rotational equivariance ensures robustness of LIRE+ to patient orientation, while LIRE and other deep learning baselines suffer from substantial performance degradation when patient orientation is unusual. On the HN dataset in the absence of finetuning, LIRE+ is generally comparable to LIRE in performance apart from a few outlier cases, whereas after identical finetuning LIRE+ demonstrates a +1.02 dB PSNR improvement over LIRE.
Submitted 20 January, 2024;
originally announced January 2024.
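The abstract above attributes the memory savings of LIRE+ to reversible residual networks. The following is a minimal NumPy sketch of a generic additive-coupling reversible block, not the LIRE+ architecture itself: the class name `ReversibleBlock`, the helper `make_residual_fn`, and the random `tanh` branches are all illustrative assumptions. It shows why such blocks save memory: the inputs can be recomputed exactly from the outputs, so intermediate activations need not be stored for the backward pass.

```python
import numpy as np


def make_residual_fn(rng, channels):
    """Build a small random nonlinear map (hypothetical stand-in for a learned branch)."""
    weight = rng.standard_normal((channels, channels)) * 0.1

    def residual_fn(x):
        return np.tanh(x @ weight)

    return residual_fn


class ReversibleBlock:
    """Additive coupling block; invertible for any choice of branches f and g."""

    def __init__(self, f, g):
        self.f = f
        self.g = g

    def forward(self, x1, x2):
        # Each half is updated using only the other half.
        y1 = x1 + self.f(x2)
        y2 = x2 + self.g(y1)
        return y1, y2

    def inverse(self, y1, y2):
        # Undo the two updates in reverse order.
        x2 = y2 - self.g(y1)
        x1 = y1 - self.f(x2)
        return x1, x2


rng = np.random.default_rng(0)
block = ReversibleBlock(make_residual_fn(rng, 4), make_residual_fn(rng, 4))
x1 = rng.standard_normal((2, 4))
x2 = rng.standard_normal((2, 4))
y1, y2 = block.forward(x1, x2)
r1, r2 = block.inverse(y1, y2)
# Largest absolute difference between the original and recovered inputs.
max_error = max(np.abs(r1 - x1).max(), np.abs(r2 - x2).max())
```

In a training framework, the same idea lets the backward pass call `inverse` to rebuild activations on the fly instead of caching them.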
-
JSSL: Joint Supervised and Self-supervised Learning for MRI Reconstruction
Authors:
George Yiasemis,
Nikita Moriakov,
Clara I. Sánchez,
Jan-Jakob Sonke,
Jonas Teuwen
Abstract:
Magnetic Resonance Imaging represents an important diagnostic modality; however, its inherently slow acquisition process poses challenges in obtaining fully sampled k-space data under motion in clinical scenarios such as abdominal, cardiac, and prostate imaging. In the absence of fully sampled acquisitions, which can serve as ground truth data, training deep learning algorithms in a supervised manner to predict the underlying ground truth image becomes an impossible task. To address this limitation, self-supervised methods have emerged as a viable alternative, leveraging available subsampled k-space data to train deep learning networks for MRI reconstruction. Nevertheless, these self-supervised approaches often fall short when compared to supervised methodologies. In this paper, we introduce JSSL (Joint Supervised and Self-supervised Learning), a novel training approach for deep learning-based MRI reconstruction algorithms aimed at enhancing reconstruction quality in scenarios where target dataset(s) containing fully sampled k-space measurements are unavailable. Our proposed method operates by simultaneously training a model in a self-supervised learning setting, using subsampled data from the target dataset(s), and in a supervised learning manner, utilizing data from other datasets, referred to as proxy datasets, where fully sampled k-space data is accessible. To demonstrate the efficacy of JSSL, we utilized subsampled prostate parallel MRI measurements as the target dataset, while employing fully sampled brain and knee k-space acquisitions as proxy datasets. Our results showcase a substantial improvement over conventional self-supervised training methods, thereby underscoring the effectiveness of our joint approach. We provide a theoretical motivation for JSSL and establish a practical "rule-of-thumb" for selecting the most appropriate training approach for deep MRI reconstruction.
Submitted 27 November, 2023;
originally announced November 2023.
-
Deep Cardiac MRI Reconstruction with ADMM
Authors:
George Yiasemis,
Nikita Moriakov,
Jan-Jakob Sonke,
Jonas Teuwen
Abstract:
Cardiac magnetic resonance imaging is a valuable non-invasive tool for identifying cardiovascular diseases. For instance, Cine MRI is the benchmark modality for assessing the cardiac function and anatomy. On the other hand, multi-contrast (T1 and T2) mapping has the potential to assess pathologies and abnormalities in the myocardium and interstitium. However, voluntary breath-holding and often arrhythmia, in combination with MRI's slow imaging speed, can lead to motion artifacts, hindering real-time acquisition image quality. Although performing accelerated acquisitions can facilitate dynamic imaging, it induces aliasing, causing low reconstructed image quality in Cine MRI and inaccurate T1 and T2 mapping estimation. In this work, inspired by related work in accelerated MRI reconstruction, we present a deep learning (DL)-based method for accelerated cine and multi-contrast reconstruction in the context of dynamic cardiac imaging. We formulate the reconstruction problem as a least squares regularized optimization task, and employ vSHARP, a state-of-the-art DL-based inverse problem solver, which incorporates half-quadratic variable splitting and the alternating direction method of multipliers with neural networks. We treat the problem in two setups: a 2D reconstruction and a 2D dynamic reconstruction task, and employ 2D and 3D deep learning networks, respectively. Our method optimizes in both the image and k-space domains, allowing for high reconstruction fidelity. Although the target data is undersampled with a Cartesian equispaced scheme, we train our model using both Cartesian and simulated non-Cartesian undersampling schemes to enhance generalization of the model to unseen data. Furthermore, our model adopts a deep neural network to learn and refine the sensitivity maps of multi-coil k-space data. Lastly, our method is jointly trained on both undersampled cine and multi-contrast data.
Submitted 10 October, 2023;
originally announced October 2023.
-
vSHARP: variable Splitting Half-quadratic ADMM algorithm for Reconstruction of inverse-Problems
Authors:
George Yiasemis,
Nikita Moriakov,
Jan-Jakob Sonke,
Jonas Teuwen
Abstract:
Medical Imaging (MI) tasks, such as accelerated Parallel Magnetic Resonance Imaging (MRI), often involve reconstructing an image from noisy or incomplete measurements. This amounts to solving ill-posed inverse problems, where a satisfactory closed-form analytical solution is not available. Traditional methods such as Compressed Sensing (CS) in MRI reconstruction can be time-consuming or prone to obtaining low-fidelity images. Recently, a plethora of supervised and self-supervised Deep Learning (DL) approaches have demonstrated superior performance in inverse-problem solving, surpassing conventional methods. In this study, we propose vSHARP (variable Splitting Half-quadratic ADMM algorithm for Reconstruction of inverse Problems), a novel DL-based method for solving ill-posed inverse problems arising in MI. vSHARP utilizes the Half-Quadratic Variable Splitting method and employs the Alternating Direction Method of Multipliers (ADMM) to unroll the optimization process. For data consistency, vSHARP unrolls a differentiable gradient descent process in the image domain, while a DL-based denoiser, such as a U-Net architecture, is applied to enhance image quality. vSHARP also employs a dilated-convolution DL-based model to predict the Lagrange multipliers for the ADMM initialization. We evaluate the proposed model by applying it to the task of accelerated Parallel MRI Reconstruction on two distinct datasets. We present a comparative analysis of our experimental results with state-of-the-art approaches, highlighting the superior performance of vSHARP.
Submitted 18 September, 2023;
originally announced September 2023.
-
Improving Lesion Volume Measurements on Digital Mammograms
Authors:
Nikita Moriakov,
Jim Peters,
Ritse Mann,
Nico Karssemeijer,
Jos van Dijck,
Mireille Broeders,
Jonas Teuwen
Abstract:
Lesion volume is an important predictor for prognosis in breast cancer. We make a step towards a more accurate lesion volume measurement on digital mammograms by developing a model that estimates lesion volumes on processed mammograms, which are the images routinely used by radiologists in clinical practice as well as in breast cancer screening and are available in medical centers. Processed mammograms are obtained from raw mammograms, which are the X-ray data coming directly from the scanner, by applying certain vendor-specific non-linear transformations. At the core of our volume estimation method is a physics-based algorithm for measuring lesion volumes on raw mammograms. We subsequently extend this algorithm to processed mammograms via a deep learning image-to-image translation model that produces synthetic raw mammograms from processed mammograms in a multi-vendor setting. We assess the reliability and validity of our method using a dataset of 1778 mammograms with an annotated mass. Firstly, we investigate the correlations between lesion volumes computed from mediolateral oblique and craniocaudal views, with a resulting Pearson correlation of 0.93 [95% confidence interval (CI) 0.92 - 0.93]. Secondly, we compare the resulting lesion volumes from true and synthetic raw data, with a resulting Pearson correlation of 0.998 [95% CI 0.998 - 0.998]. Finally, for a subset of 100 mammograms with a malignant mass and concurrent MRI examination available, we analyze the agreement between lesion volume on mammography and MRI, resulting in an intraclass correlation coefficient of 0.81 [95% CI 0.73 - 0.87] for consistency and 0.78 [95% CI 0.66 - 0.86] for absolute agreement. In conclusion, we developed an algorithm to measure mammographic lesion volume that reached excellent reliability and good validity when using MRI as ground truth.
Submitted 28 August, 2023;
originally announced August 2023.
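The abstract above reports Pearson correlations with 95% confidence intervals. As a rough illustration of how such a figure can be computed, here is a sketch that uses the Fisher z-transform for the interval. The abstract does not say which interval method the authors used, so this choice is an assumption, and the data below are synthetic placeholders (the names `vol_mlo` and `vol_cc` are hypothetical), not values from the study.

```python
import numpy as np


def pearson_with_ci(x, y, z_crit=1.96):
    """Pearson r with an approximate 95% CI from the Fisher z-transform."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    r = np.corrcoef(x, y)[0, 1]
    z = np.arctanh(r)                    # Fisher transform of r
    se = 1.0 / np.sqrt(len(x) - 3)       # approximate standard error of z
    return r, np.tanh(z - z_crit * se), np.tanh(z + z_crit * se)


rng = np.random.default_rng(0)
# Synthetic "volumes" from two views with multiplicative noise (illustration only).
vol_mlo = rng.uniform(0.5, 10.0, size=200)
vol_cc = vol_mlo * (1.0 + 0.1 * rng.standard_normal(200))
r, ci_low, ci_high = pearson_with_ci(vol_mlo, vol_cc)
```

With larger samples the interval narrows, which is consistent with the very tight intervals quoted for the 1778-mammogram dataset.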
-
Neural Modulation Fields for Conditional Cone Beam Neural Tomography
Authors:
Samuele Papa,
David M. Knigge,
Riccardo Valperga,
Nikita Moriakov,
Miltos Kofinas,
Jan-Jakob Sonke,
Efstratios Gavves
Abstract:
Conventional Computed Tomography (CT) methods require large numbers of noise-free projections for accurate density reconstructions, limiting their applicability to the more complex class of Cone Beam Geometry CT (CBCT) reconstruction. Recently, deep learning methods have been proposed to overcome these limitations, with methods based on neural fields (NF) showing strong performance, by approximating the reconstructed density through a continuous-in-space coordinate based neural network. Our focus is on improving such methods, however, unlike previous work, which requires training an NF from scratch for each new set of projections, we instead propose to leverage anatomical consistencies over different scans by training a single conditional NF on a dataset of projections. We propose a novel conditioning method where local modulations are modeled per patient as a field over the input domain through a Neural Modulation Field (NMF). The resulting Conditional Cone Beam Neural Tomography (CondCBNT) shows improved performance for both high and low numbers of available projections on noise-free and noisy data.
Submitted 17 July, 2023;
originally announced July 2023.
-
Joint machine learning and analytic track reconstruction for X-ray polarimetry with gas pixel detectors
Authors:
Nicoló Cibrario,
Michela Negro,
Nikita Moriakov,
Raffaella Bonino,
Luca Baldini,
Niccoló Di Lalla,
Luca Latronico,
Simone Maldera,
Alberto Manfreda,
Nicola Omodei,
Carmelo Sgró,
Stefano Tugliani
Abstract:
We present our study on the reconstruction of photoelectron tracks in gas pixel detectors used for astrophysical X-ray polarimetry. Our work aims to maximize the performance of convolutional neural networks (CNNs) to predict the impact point of incoming X-rays from the image of the photoelectron track. A very high precision in the reconstruction of the impact point position is achieved thanks to the introduction of an artificial sharpening process of the images. We find that providing the CNN-predicted impact point as input to the state-of-the-art analytic analysis improves the modulation factor ($\sim 1 \%$ at 3 keV and $\sim 6 \%$ at 6 keV) and naturally mitigates a subtle effect appearing in polarization measurements of bright extended sources known as "polarization leakage".
Submitted 27 April, 2023;
originally announced April 2023.
-
End-to-end Memory-Efficient Reconstruction for Cone Beam CT
Authors:
Nikita Moriakov,
Jan-Jakob Sonke,
Jonas Teuwen
Abstract:
Cone Beam CT plays an important role in many medical fields nowadays, but the potential of this imaging modality is hampered by lower image quality compared to the conventional CT. A lot of recent research has been directed towards reconstruction methods relying on deep learning. However, practical application of deep learning to CBCT reconstruction is complicated by several issues, such as exceedingly high memory costs of deep learning methods for fully 3D data. In this work, we address these limitations and propose LIRE: a learned invertible primal-dual iterative scheme for Cone Beam CT reconstruction. Memory requirements of the network are substantially reduced while preserving its expressive power, enabling us to train on data with isotropic 2 mm voxel spacing, clinically-relevant projection count and detector panel resolution on current hardware with 24 GB VRAM. Two LIRE models for small and for large Field-of-View setting were trained and validated on a set of 260 + 22 thorax CT scans and tested using a set of 142 thorax CT scans plus an out-of-distribution dataset of 79 head & neck CT scans. For both settings, our method surpasses the classical methods and the deep learning baselines on both test sets. On the thorax CT set, our method achieves PSNR of 33.84 $\pm$ 2.28 for the small FoV setting and 35.14 $\pm$ 2.69 for the large FoV setting; U-Net baseline achieves PSNR of 33.08 $\pm$ 1.75 and 34.29 $\pm$ 2.71 respectively. On the head & neck CT set, our method achieves PSNR of 39.35 $\pm$ 1.75 for the small FoV setting and 41.21 $\pm$ 1.41 for the large FoV setting; U-Net baseline achieves PSNR of 33.08 $\pm$ 1.75 and 34.29 $\pm$ 2.71 respectively. Additionally, we demonstrate that LIRE can be finetuned to reconstruct high-resolution CBCT data with the same geometry but 1 mm voxel spacing and higher detector panel resolution, where it outperforms the U-Net baseline as well.
Submitted 31 October, 2023; v1 submitted 15 May, 2022;
originally announced May 2022.
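The results in the abstract above are reported as PSNR values. For readers unfamiliar with the metric, here is a minimal sketch of the standard definition, PSNR = 10 log10(R^2 / MSE), where R is the data range. The abstract does not state how the data range is chosen in the paper, so taking it from the reference volume is an assumption, and the toy volumes are made up for illustration.

```python
import numpy as np


def psnr(reference, reconstruction, data_range=None):
    """Peak signal-to-noise ratio in dB between a reference and a reconstruction."""
    reference = np.asarray(reference, dtype=float)
    reconstruction = np.asarray(reconstruction, dtype=float)
    if data_range is None:
        # Assumption: use the dynamic range of the reference volume.
        data_range = reference.max() - reference.min()
    mse = np.mean((reference - reconstruction) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(data_range ** 2 / mse)


# Toy 3D volume: a unit-intensity cube on a zero background.
reference = np.zeros((4, 4, 4))
reference[1:3, 1:3, 1:3] = 1.0
# A reconstruction with a constant offset of 0.1 (MSE = 0.01, range = 1).
reconstruction = reference + 0.1
value = psnr(reference, reconstruction)  # approximately 20 dB
```

Because PSNR is logarithmic, a gain of about 1 dB corresponds to roughly a 20% reduction in mean squared error at a fixed data range.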
-
Subpixel object segmentation using wavelets and multi resolution analysis
Authors:
Ray Sheombarsing,
Nikita Moriakov,
Jan-Jakob Sonke,
Jonas Teuwen
Abstract:
We propose a novel deep learning framework for fast prediction of boundaries of two-dimensional simply connected domains using wavelets and Multi Resolution Analysis (MRA). The boundaries are modelled as (piecewise) smooth closed curves using wavelets and the so-called Pyramid Algorithm. Our network architecture is a hybrid analog of the U-Net, where the down-sampling path is a two-dimensional encoder with learnable filters, and the upsampling path is a one-dimensional decoder, which builds curves up from low to high resolution levels. Any wavelet basis induced by an MRA can be used. This flexibility allows for incorporation of priors on the smoothness of curves. The effectiveness of the proposed method is demonstrated by delineating boundaries of simply connected domains (organs) in medical images using Daubechies wavelets and comparing performance with a U-Net baseline. Our model demonstrates up to 5x faster inference speed compared to the U-Net, while maintaining similar performance in terms of Dice score and Hausdorff distance.
Submitted 28 October, 2021;
originally announced October 2021.
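The comparison in the abstract above relies on the Dice score. A minimal sketch of the standard definition for binary masks, Dice = 2|A ∩ B| / (|A| + |B|), is given below. The masks are toy examples, and the convention of returning 1.0 for two empty masks is an assumption of this sketch rather than something stated in the abstract.

```python
import numpy as np


def dice_score(mask_a, mask_b):
    """Dice overlap between two binary masks of the same shape."""
    a = np.asarray(mask_a, dtype=bool)
    b = np.asarray(mask_b, dtype=bool)
    total = a.sum() + b.sum()
    if total == 0:
        # Assumed convention: two empty masks agree perfectly.
        return 1.0
    return 2.0 * np.logical_and(a, b).sum() / total


# Two 4x4 squares (16 pixels each), shifted by one row: 12 pixels overlap.
mask_a = np.zeros((8, 8), dtype=bool)
mask_a[2:6, 2:6] = True
mask_b = np.zeros((8, 8), dtype=bool)
mask_b[3:7, 2:6] = True
score = dice_score(mask_a, mask_b)  # 2 * 12 / 32 = 0.75
```

For a curve-based method such as the one described, the predicted boundary would first have to be rasterized into a mask before this score can be computed.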
-
Multi-Coil MRI Reconstruction Challenge -- Assessing Brain MRI Reconstruction Models and their Generalizability to Varying Coil Configurations
Authors:
Youssef Beauferris,
Jonas Teuwen,
Dimitrios Karkalousos,
Nikita Moriakov,
Mattha Caan,
George Yiasemis,
Lívia Rodrigues,
Alexandre Lopes,
Hélio Pedrini,
Letícia Rittner,
Maik Dannecker,
Viktor Studenyak,
Fabian Gröger,
Devendra Vyas,
Shahrooz Faghih-Roohi,
Amrit Kumar Jethi,
Jaya Chandra Raju,
Mohanasankar Sivaprakasam,
Mike Lasby,
Nikita Nogovitsyn,
Wallace Loos,
Richard Frayne,
Roberto Souza
Abstract:
Deep-learning-based brain magnetic resonance imaging (MRI) reconstruction methods have the potential to accelerate the MRI acquisition process. Nevertheless, the scientific community lacks appropriate benchmarks to assess MRI reconstruction quality of high-resolution brain images, and evaluate how these proposed algorithms will behave in the presence of small, but expected data distribution shifts. The Multi-Coil Magnetic Resonance Image (MC-MRI) Reconstruction Challenge provides a benchmark that aims at addressing these issues, using a large dataset of high-resolution, three-dimensional, T1-weighted MRI scans. The challenge has two primary goals: 1) to compare different MRI reconstruction models on this dataset and 2) to assess the generalizability of these models to data acquired with a different number of receiver coils. In this paper, we describe the challenge experimental design, and summarize the results of a set of baseline and state of the art brain MRI reconstruction models. We provide relevant comparative information on the current MRI reconstruction state-of-the-art and highlight the challenges of obtaining generalizable models that are required prior to broader clinical adoption. The MC-MRI benchmark data, evaluation code and current challenge leaderboard are publicly available. They provide an objective performance assessment for future developments in the field of brain MRI reconstruction.
Submitted 21 December, 2021; v1 submitted 9 November, 2020;
originally announced November 2020.
-
Deep Learning-based Initialization of Iterative Reconstruction for Breast Tomosynthesis
Authors:
Koen Michielsen,
Nikita Moriakov,
Jonas Teuwen,
Ioannis Sechopoulos
Abstract:
Reconstruction of digital breast tomosynthesis is a challenging problem due to the limited angle data available in such systems. Deep learning-based methods can help improve these reconstructions but, due to memory limitations, cannot (yet) attain sufficiently high resolution. In addition to this practical issue, questions remain on the possibility of such models introducing 'ghost' information from the training data that is not compatible with the projection data. To take advantage of some of the benefits of deep learning-based reconstructions while avoiding these limitations, we propose to use the low resolution deep learning-based reconstruction as an initialization of a regular high resolution iterative method.
The network was trained using digital phantoms, some based on a mathematical model and some derived from patient dedicated breast CT scans. The output of this network was then used as initialization for 10 000 iterations of MLTR for nine patient based phantoms that were not included in the training. The same nine cases were also reconstructed without any initialization for comparison.
The reconstructions including initialization were found to reach a lower mean squared error than those without, and visual inspection found much improved retrieval of the breast outline and depiction of the skin, confirming that adding the deep learning-based initialization adds valuable information to the reconstruction.
Submitted 3 September, 2020;
originally announced September 2020.
-
Deep learning reconstruction of digital breast tomosynthesis images for accurate breast density and patient-specific radiation dose estimation
Authors:
Jonas Teuwen,
Nikita Moriakov,
Christian Fedon,
Marco Caballo,
Ingrid Reiser,
Pedrag Bakic,
Eloy García,
Oliver Diaz,
Koen Michielsen,
Ioannis Sechopoulos
Abstract:
The two-dimensional nature of mammography makes estimation of the overall breast density challenging, and estimation of the true patient-specific radiation dose impossible. Digital breast tomosynthesis (DBT), a pseudo-3D technique, is now commonly used in breast cancer screening and diagnostics. Still, the severely limited 3rd dimension information in DBT has not been used, until now, to estimate the true breast density or the patient-specific dose. This study proposes a reconstruction algorithm for DBT based on deep learning specifically optimized for these tasks. The algorithm, which we name DBToR, is based on unrolling a proximal-dual optimization method. The proximal operators are replaced with convolutional neural networks and prior knowledge is included in the model. This extends previous work on a deep learning-based reconstruction model by providing both the primal and the dual blocks with breast thickness information, which is available in DBT. Training and testing of the model were performed using virtual patient phantoms from two different sources. Reconstruction performance, and accuracy in estimation of breast density and radiation dose, were estimated, showing high accuracy (density <+/-3%; dose <+/-20%) without bias, significantly improving on the current state-of-the-art. This work also lays the groundwork for develo** a deep learning-based reconstruction algorithm for the task of image interpretation by radiologists.
Submitted 29 March, 2021; v1 submitted 11 June, 2020;
originally announced June 2020.
-
Inferring astrophysical X-ray polarization with deep learning
Authors:
Nikita Moriakov,
Ashwin Samudre,
Michela Negro,
Fabian Gieseke,
Sydney Otten,
Luc Hendriks
Abstract:
We investigate the use of deep learning in the context of X-ray polarization detection from astrophysical sources as will be observed by the Imaging X-ray Polarimetry Explorer (IXPE), a future NASA-selected space-based mission expected to be operative in 2021. In particular, we propose two models that can be used to estimate the impact point as well as the polarization direction of the incoming radiation. The results obtained show that data-driven approaches represent a promising alternative to the existing analytical approaches. We also discuss problems and challenges to be addressed in the near future.
Submitted 16 May, 2020;
originally announced May 2020.
-
Kernel of CycleGAN as a Principle homogeneous space
Authors:
Nikita Moriakov,
Jonas Adler,
Jonas Teuwen
Abstract:
Unpaired image-to-image translation has attracted significant interest due to the invention of CycleGAN, a method which utilizes a combination of adversarial and cycle consistency losses to avoid the need for paired data. It is known that the CycleGAN problem might admit multiple solutions, and our goal in this paper is to analyze the space of exact solutions and to give perturbation bounds for approximate solutions. We show theoretically that the exact solution space is invariant with respect to automorphisms of the underlying probability spaces, and, furthermore, that the group of automorphisms acts freely and transitively on the space of exact solutions. We examine the case of zero `pure' CycleGAN loss first in its generality, and, subsequently, expand our analysis to approximate solutions for `extended' CycleGAN loss where identity loss term is included. In order to demonstrate that these results are applicable, we show that under mild conditions nontrivial smooth automorphisms exist. Furthermore, we provide empirical evidence that neural networks can learn these automorphisms with unexpected and unwanted results. We conclude that finding optimal solutions to the CycleGAN loss does not necessarily lead to the envisioned result in image-to-image translation tasks and that underlying hidden symmetries can render the result utterly useless.
Submitted 24 January, 2020;
originally announced January 2020.
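The invariance claim in the abstract above can be illustrated on a toy discrete problem. This sketch is a simplification, not the paper's construction: both domains are finite sets with the uniform measure, the translators `G` and `F` are permutations, and the automorphism `T` is another permutation (all names are hypothetical). Composing an exact solution with `T` gives a different pair of maps that still has zero cycle-consistency error, and since every map is a bijection, the uniform distribution is still pushed forward to the uniform distribution.

```python
import numpy as np

n = 6
rng = np.random.default_rng(0)

# An exact solution: G maps X -> Y bijectively and F is its inverse.
G = rng.permutation(n)      # G[x] is the image of x
F = np.argsort(G)           # F[G[x]] == x

# A measure-preserving automorphism of Y (any permutation preserves uniform mass).
T = rng.permutation(n)
T_inv = np.argsort(T)

# Transformed pair: G' = T o G and F' = F o T^{-1}.
G_new = T[G]
F_new = F[T_inv]

points = np.arange(n)
cycle_x = F_new[G_new[points]]   # F'(G'(x)) for every x in X
cycle_y = G_new[F_new[points]]   # G'(F'(y)) for every y in Y

# Both cycles return every point to itself, so the cycle loss stays zero.
cycle_loss = np.abs(cycle_x - points).sum() + np.abs(cycle_y - points).sum()
```

Unless `T` happens to be the identity, `G_new` pairs inputs with different outputs than `G` does, which is the kind of hidden symmetry the abstract warns about.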
-
Learned SIRT for Cone Beam Computed Tomography Reconstruction
Authors:
Roeland J. Dilz,
Lukas Schröder,
Nikita Moriakov,
Jan-Jakob Sonke,
Jonas Teuwen
Abstract:
We introduce the learned simultaneous iterative reconstruction technique (SIRT) for tomographic reconstruction. The learned SIRT algorithm is a deep learning based reconstruction method combining model knowledge with a learned component. The algorithm is trained by mapping raw measured data to the reconstruction results over several iterations. The Learned SIRT algorithm is applied to a cone beam geometry on a circular orbit, a challenging problem for learned methods due to its 3D geometry and its inherent inability to completely capture the patient anatomy. A comparison of 2D reconstructions is shown, where the learned SIRT approach produces reconstructions with superior peak signal to noise ratio (PSNR) and structural similarity (SSIM), compared to FBP, SIRT and U-net post-processing and similar PSNR and SSIM compared to the learned primal dual algorithm. Similar results are shown for cone beam geometry reconstructions of a 3D Shepp Logan phantom, where we obtain between 9.9 and 28.1 dB improvement over FBP with a substantial improvement in SSIM. Finally we show that our algorithm scales to clinically relevant problems, and performs well when applied to measurements of a physical phantom.
Submitted 28 August, 2019;
originally announced August 2019.
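For context on the abstract above, the classical (non-learned) SIRT update that the learned variant builds on can be sketched as x ← x + C Aᵀ R (b − A x), where R and C hold the inverse row and column sums of the system matrix A. The code below implements only this classical iteration on a tiny made-up linear system; the learned component described in the abstract is not reproduced, and the matrix `A` is a hypothetical stand-in for a real projection operator.

```python
import numpy as np


def sirt(A, b, n_iter=200):
    """Classical SIRT: x <- x + C A^T R (b - A x), starting from zero."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    row_sums = np.abs(A).sum(axis=1)
    col_sums = np.abs(A).sum(axis=0)
    # Inverse row/column sums, with empty rows or columns mapped to zero weight.
    R = np.where(row_sums > 0, 1.0 / np.maximum(row_sums, 1e-12), 0.0)
    C = np.where(col_sums > 0, 1.0 / np.maximum(col_sums, 1e-12), 0.0)
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        residual = b - A @ x                  # mismatch in projection space
        x = x + C * (A.T @ (R * residual))    # weighted backprojection update
    return x


# Toy "projection" matrix: 5 rays through 3 unknowns (illustration only).
A = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 1.0, 0.0],
    [0.0, 1.0, 1.0],
])
x_true = np.array([1.0, 2.0, 3.0])
b = A @ x_true          # noise-free, consistent measurements
x_est = sirt(A, b)
```

A learned variant would typically replace or augment parts of this fixed update with trained components, in line with the abstract's description of combining model knowledge with a learned component.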
-
Vendor-independent soft tissue lesion detection using weakly supervised and unsupervised adversarial domain adaptation
Authors:
Joris van Vugt,
Elena Marchiori,
Ritse Mann,
Albert Gubern-Mérida,
Nikita Moriakov,
Jonas Teuwen
Abstract:
Computer-aided detection aims to improve breast cancer screening programs by helping radiologists to evaluate digital mammography (DM) exams. DM exams are generated by devices from different vendors, with diverse characteristics between and even within vendors. Physical properties of these devices and postprocessing of the images can greatly influence the resulting mammogram. This results in the fact that a deep learning model trained on data from one vendor cannot readily be applied to data from another vendor. This paper investigates the use of tailored transfer learning methods based on adversarial learning to tackle this problem. We consider a database of DM exams (mostly bilateral and two views) generated by Hologic and Siemens vendors. We analyze two transfer learning settings: 1) unsupervised transfer, where Hologic data with soft lesion annotation at pixel level and Siemens unlabelled data are used to annotate images in the latter data; 2) weakly supervised transfer, where exam level labels for images from the Siemens mammograph are available. We propose tailored variants of recent state-of-the-art methods for transfer learning which take into account the class imbalance and incorporate knowledge provided by the annotations at exam level. Results of experiments indicate the beneficial effect of transfer learning in both transfer settings. Notably, at 0.02 false positives per image, we achieve a sensitivity of 0.37, compared to 0.30 of a baseline with no transfer. Results indicate that using exam level annotations gives an additional increase in sensitivity.
Submitted 14 August, 2018;
originally announced August 2018.
-
Deep Learning Framework for Digital Breast Tomosynthesis Reconstruction
Authors:
Nikita Moriakov,
Koen Michielsen,
Jonas Adler,
Ritse Mann,
Ioannis Sechopoulos,
Jonas Teuwen
Abstract:
Digital breast tomosynthesis is rapidly replacing digital mammography as the basic x-ray technique for evaluation of the breasts. However, the sparse sampling and limited angular range give rise to different artifacts, which manufacturers try to solve in several ways. In this study we propose an extension of the Learned Primal-Dual algorithm for digital breast tomosynthesis. The Learned Primal-Dual algorithm is a deep neural network consisting of several `reconstruction blocks', which take in raw sinogram data as the initial input, perform a forward and a backward pass by taking projections and back-projections, and use a convolutional neural network to produce an intermediate reconstruction result which is then improved further by the successive reconstruction block. We extend the architecture by providing breast thickness measurements as a mask to the neural network and allow it to learn how to use this thickness mask. We have trained the algorithm on digital phantoms and the corresponding noise-free/noisy projections, and then tested the algorithm on digital phantoms for varying levels of noise. Reconstruction performance of the algorithms was compared visually, using MSE loss and Structural Similarity Index. Results indicate that the proposed algorithm outperforms the baseline iterative reconstruction algorithm in terms of reconstruction quality for both breast edges and internal structures and is robust to noise.
Submitted 14 August, 2018;
originally announced August 2018.
-
On effective Birkhoff's ergodic theorem for computable actions of amenable groups
Authors:
Nikita Moriakov
Abstract:
We introduce computable actions of computable groups and prove the following versions of effective Birkhoff's ergodic theorem. Let $Γ$ be a computable amenable group, then there always exists a canonically computable tempered two-sided Følner sequence $(F_n)_{n \geq 1}$ in $Γ$. For a computable, measure-preserving, ergodic action of $Γ$ on a Cantor space $\{0,1\}^{\mathbb N}$ endowed with a computable probability measure $μ$, it is shown that for every bounded lower semicomputable function $f$ on $\{0,1\}^{\mathbb N}$ and for every Martin-Löf random $ω\in \{0,1\}^{\mathbb N}$ the equality \[ \lim\limits_{n \to \infty} \frac{1}{|F_n|} \sum\limits_{g \in F_n} f(g \cdot ω) = \int\limits f d μ\] holds, where the averages are taken with respect to a canonically computable tempered two-sided Følner sequence $(F_n)_{n \geq 1}$. We also prove the same identity for all lower semicomputable $f$'s in the special case when $Γ$ is a computable group of polynomial growth and $F_n:=\mathrm{B}(n)$ is the Følner sequence of balls around the neutral element of $Γ$.
Submitted 23 January, 2017;
originally announced January 2017.
-
Hochman's upcrossing theorem for groups of polynomial growth
Authors:
Nikita Moriakov
Abstract:
Consider a stochastic process $(S_{[a_i,b_i]})_{[a_i,b_i] \subset \mathbb{N}}$, which is indexed by the collection of all nonempty intervals $[a_i,b_i] \subset \mathbb{N}$ and which is stationary under translations of the intervals. It was shown by M. Hochman that, for any $k \geq 1$ and any interval $(α,β) \subset \mathbb{R}$, one can give an `almost-exponential' bound on the size of the set where the associated process $(S_{[1,n]})_{n \geq 1}$ has at least $k$ fluctuations over $(α,β)$. It was also noticed that similar techniques can be applied in the $\mathbb{Z}^d$ case. In this article we extend Hochman's upcrossing theorem to groups of polynomial growth.
Submitted 15 December, 2016;
originally announced December 2016.
-
Fluctuations of Ergodic Averages for Actions of Groups of Polynomial Growth
Authors:
Nikita Moriakov
Abstract:
It was shown by S. Kalikow and B. Weiss that, given a measure-preserving action of $\mathbb{Z}^d$ on a probability space $X$ and a nonnegative measurable function $f$ on $X$, the probability that the sequence of ergodic averages $$ \frac 1 {(2k+1)^d} \sum\limits_{g \in [-k,\dots,k]^d} f(g \cdot x) $$ has at least $n$ fluctuations across an interval $(α,β)$ can be bounded from above by $c_1 c_2^n$ for some universal constants $c_1 \in \mathbb{R}$ and $c_2 \in (0,1)$, which depend only on $d,α,β$. The purpose of this article is to generalize this result to measure-preserving actions of groups of polynomial growth. As the main tool we develop a generalization of the effective Vitali covering theorem for groups of polynomial growth.
Submitted 19 August, 2016; v1 submitted 17 August, 2016;
originally announced August 2016.
-
Computable Følner monotilings and a theorem of Brudno II
Authors:
Nikita Moriakov
Abstract:
A theorem of A.A. Brudno says that the Kolmogorov-Sinai entropy of a subshift X over $\mathbb{N}$ with respect to an ergodic measure $μ$ equals the asymptotic Kolmogorov complexity of almost every word $ω$ in X. The purpose of this article is to extend this result to subshifts over computable groups that admit computable regular symmetric Følner monotilings, which we introduce in this work. These monotilings are a special type of computable Følner monotilings, which we defined earlier in order to extend the initial results of Brudno. For every $d \in \mathbb{N}$, the groups $\mathbb{Z}^d$ and the groups of unipotent upper-triangular matrices of dimension $d+1$ with integer entries admit particularly nice computable regular symmetric Følner monotilings for which we can provide the required computing algorithms `explicitly'.
Submitted 14 December, 2015; v1 submitted 13 October, 2015;
originally announced October 2015.
-
On systems with quasi-discrete spectrum
Authors:
Markus Haase,
Nikita Moriakov
Abstract:
In this paper we re-examine the theory of systems with quasi-discrete spectrum initiated in the 1960's by Abramov, Hahn, and Parry. In the first part, we give a simpler proof of the Hahn--Parry theorem stating that each minimal topological system with quasi-discrete spectrum is isomorphic to a certain affine automorphism system on some compact Abelian group. Next, we show that a suitable application of Gelfand's theorem renders Abramov's theorem --- the analogue of the Hahn-Parry theorem for measure-preserving systems --- a straightforward corollary of the Hahn-Parry result.
In the second part, independent of the first, we present a shortened proof of the fact that each factor of a totally ergodic system with quasi-discrete spectrum (a "QDS-system") has again quasi-discrete spectrum and that such systems have zero entropy. Moreover, we obtain a complete algebraic classification of the factors of a QDS-system.
In the third part, we apply the results of the second to the (still open) question whether a Markov quasi-factor of a QDS-system is already a factor of it. We show that this is true when the system satisfies some algebraic constraint on the group of quasi-eigenvalues, which is satisfied, e.g., in the case of the skew shift.
Submitted 1 June, 2017; v1 submitted 29 September, 2015;
originally announced September 2015.
-
Computable Følner monotilings and a theorem of Brudno I
Authors:
Nikita Moriakov
Abstract:
The purpose of this article is to extend the earliest results of A.A. Brudno, connecting topological entropy of a subshift X over $\mathbb{N}$ to the Kolmogorov complexity of words in X, to subshifts over computable groups that possess computable Følner monotilings, which we introduce in this work. The classical examples of such groups are the groups $\mathbb{Z}^d$ and the groups of upper-triangular matrices with integer entries. Following the work of B. Weiss we show that the class of such groups is closed under group extensions.
Submitted 13 October, 2015; v1 submitted 25 September, 2015;
originally announced September 2015.
-
Categories of measurement functors. Entropy of discrete amenable group representations on abstract categories. Entropy as a bifunctor into $[0,\infty]$
Authors:
Nikita Moriakov
Abstract:
The main purpose of this article is to provide a common generalization of the notions of a topological and Kolmogorov-Sinai entropy for arbitrary representations of discrete amenable groups on objects of (abstract) categories. This is performed by introducing the notion of a measurement functor from the category of representations of a fixed amenable group $Γ$ on objects of an abstract category C to the category of representations of $Γ$ on distributive lattices with localization. We develop the entropy theory of representations of $Γ$ on these lattices, and then define the entropy of a representation of $Γ$ on objects of the category C with respect to a given measurement functor. For a fixed measurement functor, this entropy decreases along arrows of the category of representations. For a fixed category, entropies defined via different measurement functors decrease pointwise along natural transformations of measurement functors. We conclude that entropy is a bifunctor to the poset of extended positive reals. As an application of the theory, we show that both topological and Kolmogorov-Sinai entropies are instances of entropies arising from certain measurement functors.
Submitted 13 October, 2015; v1 submitted 25 September, 2015;
originally announced September 2015.