-
Benchmarking Deep Learning-Based Low Dose CT Image Denoising Algorithms
Authors:
Elias Eulig,
Björn Ommer,
Marc Kachelrieß
Abstract:
Long lasting efforts have been made to reduce radiation dose and thus the potential radiation risk to the patient for computed tomography acquisitions without severe deterioration of image quality. To this end, numerous reconstruction and noise reduction algorithms have been developed, many of which are based on iterative reconstruction techniques, incorporating prior knowledge in the projection o…
▽ More
Long lasting efforts have been made to reduce radiation dose and thus the potential radiation risk to the patient for computed tomography acquisitions without severe deterioration of image quality. To this end, numerous reconstruction and noise reduction algorithms have been developed, many of which are based on iterative reconstruction techniques, incorporating prior knowledge in the projection or image domain. Recently, deep learning-based methods became increasingly popular and a multitude of papers claim ever improving performance both quantitatively and qualitatively. In this work, we find that the lack of a common benchmark setup and flaws in the experimental setup of many publications hinder verifiability of those claims. We propose a benchmark setup to overcome those flaws and improve reproducibility and verifiability of experimental results in the field. In a comprehensive and fair evaluation of several deep learning-based low dose CT denoising algorithms, we find that most methods perform statistically similar and improvements over the past six years have been marginal at best.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
Assumption violations in causal discovery and the robustness of score matching
Authors:
Francesco Montagna,
Atalanti A. Mastakouri,
Elias Eulig,
Nicoletta Noceti,
Lorenzo Rosasco,
Dominik Janzing,
Bryon Aragam,
Francesco Locatello
Abstract:
When domain knowledge is limited and experimentation is restricted by ethical, financial, or time constraints, practitioners turn to observational causal discovery methods to recover the causal structure, exploiting the statistical properties of their data. Because causal discovery without further assumptions is an ill-posed problem, each algorithm comes with its own set of usually untestable assu…
▽ More
When domain knowledge is limited and experimentation is restricted by ethical, financial, or time constraints, practitioners turn to observational causal discovery methods to recover the causal structure, exploiting the statistical properties of their data. Because causal discovery without further assumptions is an ill-posed problem, each algorithm comes with its own set of usually untestable assumptions, some of which are hard to meet in real datasets. Motivated by these considerations, this paper extensively benchmarks the empirical performance of recent causal discovery methods on observational i.i.d. data generated under different background conditions, allowing for violations of the critical assumptions required by each selected approach. Our experimental findings show that score matching-based methods demonstrate surprising performance in the false positive and false negative rate of the inferred graph in these challenging scenarios, and we provide theoretical insights into their performance. This work is also the first effort to benchmark the stability of causal discovery algorithms with respect to the values of their hyperparameters. Finally, we hope this paper will set a new standard for the evaluation of causal discovery methods and can serve as an accessible entry point for practitioners interested in the field, highlighting the empirical implications of different algorithm choices.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Toward Falsifying Causal Graphs Using a Permutation-Based Test
Authors:
Elias Eulig,
Atalanti A. Mastakouri,
Patrick Blöbaum,
Michaela Hardt,
Dominik Janzing
Abstract:
Understanding the causal relationships among the variables of a system is paramount to explain and control its behaviour. Inferring the causal graph from observational data without interventions, however, requires a lot of strong assumptions that are not always realistic. Even for domain experts it can be challenging to express the causal graph. Therefore, metrics that quantitatively assess the go…
▽ More
Understanding the causal relationships among the variables of a system is paramount to explain and control its behaviour. Inferring the causal graph from observational data without interventions, however, requires a lot of strong assumptions that are not always realistic. Even for domain experts it can be challenging to express the causal graph. Therefore, metrics that quantitatively assess the goodness of a causal graph provide helpful checks before using it in downstream tasks. Existing metrics provide an absolute number of inconsistencies between the graph and the observed data, and without a baseline, practitioners are left to answer the hard question of how many such inconsistencies are acceptable or expected. Here, we propose a novel consistency metric by constructing a surrogate baseline through node permutations. By comparing the number of inconsistencies with those on the surrogate baseline, we derive an interpretable metric that captures whether the DAG fits significantly better than random. Evaluating on both simulated and real data sets from various domains, including biology and cloud monitoring, we demonstrate that the true DAG is not falsified by our metric, whereas the wrong graphs given by a hypothetical user are likely to be falsified.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities
Authors:
Elias Eulig,
Piyapat Saranrittichai,
Chaithanya Kumar Mummadi,
Kilian Rambach,
William Beluch,
Xiahan Shi,
Volker Fischer
Abstract:
Common deep neural networks (DNNs) for image classification have been shown to rely on shortcut opportunities (SO) in the form of predictive and easy-to-represent visual factors. This is known as shortcut learning and leads to impaired generalization. In this work, we show that common DNNs also suffer from shortcut learning when predicting only basic visual object factors of variation (FoV) such a…
▽ More
Common deep neural networks (DNNs) for image classification have been shown to rely on shortcut opportunities (SO) in the form of predictive and easy-to-represent visual factors. This is known as shortcut learning and leads to impaired generalization. In this work, we show that common DNNs also suffer from shortcut learning when predicting only basic visual object factors of variation (FoV) such as shape, color, or texture. We argue that besides shortcut opportunities, generalization opportunities (GO) are also an inherent part of real-world vision data and arise from partial independence between predicted classes and FoVs. We also argue that it is necessary for DNNs to exploit GO to overcome shortcut learning. Our core contribution is to introduce the Diagnostic Vision Benchmark suite DiagViB-6, which includes datasets and metrics to study a network's shortcut vulnerability and generalization capability for six independent FoV. In particular, DiagViB-6 allows controlling the type and degree of SO and GO in a dataset. We benchmark a wide range of popular vision architectures and show that they can exploit GO only to a limited extent.
△ Less
Submitted 8 October, 2021; v1 submitted 12 August, 2021;
originally announced August 2021.
-
Deep Learning-Based Reconstruction of Interventional Tools from Four X-Ray Projections for Tomographic Interventional Guidance
Authors:
Elias Eulig,
Joscha Maier,
Michael Knaup,
N. Robert Bennett,
Klaus Hörndler,
Adam S. Wang,
Marc Kachelrieß
Abstract:
Image guidance for minimally invasive interventions is usually performed by acquiring fluoroscopic images using a C-arm system. However, the projective data provide only limited information about the spatial structure and position of interventional tools such as stents, guide wires or coils. In this work we propose a deep learning-based pipeline for real-time tomographic (four-dimensional) interve…
▽ More
Image guidance for minimally invasive interventions is usually performed by acquiring fluoroscopic images using a C-arm system. However, the projective data provide only limited information about the spatial structure and position of interventional tools such as stents, guide wires or coils. In this work we propose a deep learning-based pipeline for real-time tomographic (four-dimensional) interventional guidance at acceptable dose levels. In the first step, interventional tools are extracted from four cone-beam CT projections using a deep convolutional neural network (CNN). These projections are then reconstructed and fed into a second CNN, which maps this highly undersampled reconstruction to a segmentation of the interventional tools. Our pipeline is capable of reconstructing interventional tools from only four x-ray projections without the need for a patient prior with very high accuracy. Therefore, the proposed approach is capable of overcoming the drawbacks of today's interventional guidance and could enable the development of new minimally invasive radiological interventions by providing full spatiotemporal information about the interventional tools.
△ Less
Submitted 24 November, 2021; v1 submitted 23 September, 2020;
originally announced September 2020.