-
Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition
Authors:
Tobias Weber,
Jakob Dexl,
David Rügamer,
Michael Ingrisch
Abstract:
We address the computational barrier of deploying advanced deep learning segmentation models in clinical settings by studying the efficacy of network compression through tensor decomposition. We propose a post-training Tucker factorization that enables the decomposition of pre-existing models to reduce computational requirements without impeding segmentation accuracy. We applied Tucker decompositi…
▽ More
We address the computational barrier of deploying advanced deep learning segmentation models in clinical settings by studying the efficacy of network compression through tensor decomposition. We propose a post-training Tucker factorization that enables the decomposition of pre-existing models to reduce computational requirements without impeding segmentation accuracy. We applied Tucker decomposition to the convolutional kernels of the TotalSegmentator (TS) model, an nnU-Net model trained on a comprehensive dataset for automatic segmentation of 117 anatomical structures. Our approach reduced the floating-point operations (FLOPs) and memory required during inference, offering an adjustable trade-off between computational efficiency and segmentation quality. This study utilized the publicly available TS dataset, employing various downsampling factors to explore the relationship between model size, inference speed, and segmentation performance. The application of Tucker decomposition to the TS model substantially reduced the model parameters and FLOPs across various compression rates, with limited loss in segmentation accuracy. We removed up to 88% of the model's parameters with no significant performance changes in the majority of classes after fine-tuning. Practical benefits varied across different graphics processing unit (GPU) architectures, with more distinct speed-ups on less powerful hardware. Post-hoc network compression via Tucker decomposition presents a viable strategy for reducing the computational demand of medical image segmentation models without substantially sacrificing accuracy. This approach enables the broader adoption of advanced deep learning technologies in clinical practice, offering a way to navigate the constraints of hardware capabilities.
△ Less
Submitted 18 April, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction
Authors:
Tobias Weber,
Michael Ingrisch,
Bernd Bischl,
David Rügamer
Abstract:
Undersampling is a common method in Magnetic Resonance Imaging (MRI) to subsample the number of data points in k-space, reducing acquisition times at the cost of decreased image quality. A popular approach is to employ undersampling patterns following various strategies, e.g., variable density sampling or radial trajectories. In this work, we propose a method that directly learns the undersampling…
▽ More
Undersampling is a common method in Magnetic Resonance Imaging (MRI) to subsample the number of data points in k-space, reducing acquisition times at the cost of decreased image quality. A popular approach is to employ undersampling patterns following various strategies, e.g., variable density sampling or radial trajectories. In this work, we propose a method that directly learns the undersampling masks from data points, thereby also providing task- and domain-specific patterns. To solve the resulting discrete optimization problem, we propose a general optimization routine called ProM: A fully probabilistic, differentiable, versatile, and model-free framework for mask optimization that enforces acceleration factors through a convex constraint. Analyzing knee, brain, and cardiac MRI datasets with our method, we discover that different anatomic regions reveal distinct optimal undersampling masks, demonstrating the benefits of using custom masks, tailored for a downstream task. For example, ProM can create undersampling masks that maximize performance in downstream tasks like segmentation with networks trained on fully-sampled MRIs. Even with extreme acceleration factors, ProM yields reasonable performance while being more versatile than existing methods, paving the way for data-driven all-purpose mask generation.
△ Less
Submitted 22 August, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis
Authors:
Tobias Weber,
Michael Ingrisch,
Bernd Bischl,
David Rügamer
Abstract:
While recent advances in large-scale foundational models show promising results, their application to the medical domain has not yet been explored in detail. In this paper, we progress into the realms of large-scale modeling in medical synthesis by proposing Cheff - a foundational cascaded latent diffusion model, which generates highly-realistic chest radiographs providing state-of-the-art quality…
▽ More
While recent advances in large-scale foundational models show promising results, their application to the medical domain has not yet been explored in detail. In this paper, we progress into the realms of large-scale modeling in medical synthesis by proposing Cheff - a foundational cascaded latent diffusion model, which generates highly-realistic chest radiographs providing state-of-the-art quality on a 1-megapixel scale. We further propose MaCheX, which is a unified interface for public chest datasets and forms the largest open collection of chest X-rays up to date. With Cheff conditioned on radiological reports, we further guide the synthesis process over text prompts and unveil the research area of report-to-chest-X-ray generation.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
Ultrasound differential phase contrast using backscattering and the memory effect
Authors:
Timothy D. Weber,
Nikunj Khetan,
Ruohui Yang,
Jerome Mertz
Abstract:
We describe a simple and fast technique to perform ultrasound differential phase contrast (DPC) imaging in arbitrarily thick scattering media. Though configured in a reflection geometry, DPC is based on transmission imaging and is a direct analogue of optical differential interference contrast (DIC). DPC exploits the memory effect and works in combination with standard pulse-echo imaging, with no…
▽ More
We describe a simple and fast technique to perform ultrasound differential phase contrast (DPC) imaging in arbitrarily thick scattering media. Though configured in a reflection geometry, DPC is based on transmission imaging and is a direct analogue of optical differential interference contrast (DIC). DPC exploits the memory effect and works in combination with standard pulse-echo imaging, with no additional hardware or data requirements, enabling complementary phase contrast (in the transverse direction) without any need for intensive numerical computation. We experimentally demonstrate the principle of DPC using tissue phantoms with calibrated speed-of-sound inclusions.
△ Less
Submitted 14 March, 2021;
originally announced March 2021.
-
Encoder-decoder semantic segmentation models for electroluminescence images of thin-film photovoltaic modules
Authors:
Evgenii Sovetkin,
Elbert Jan Achterberg,
Thomas Weber,
Bart E. Pieters
Abstract:
We consider a series of image segmentation methods based on the deep neural networks in order to perform semantic segmentation of electroluminescence (EL) images of thin-film modules. We utilize the encoder-decoder deep neural network architecture. The framework is general such that it can easily be extended to other types of images (e.g. thermography) or solar cell technologies (e.g. crystalline…
▽ More
We consider a series of image segmentation methods based on the deep neural networks in order to perform semantic segmentation of electroluminescence (EL) images of thin-film modules. We utilize the encoder-decoder deep neural network architecture. The framework is general such that it can easily be extended to other types of images (e.g. thermography) or solar cell technologies (e.g. crystalline silicon modules). The networks are trained and tested on a sample of images from a database with 6000 EL images of Copper Indium Gallium Diselenide (CIGS) thin film modules. We selected two types of features to extract, shunts and so called "droplets". The latter feature is often observed in the set of images. Several models are tested using various combinations of encoder-decoder layers, and a procedure is proposed to select the best model. We show exemplary results with the best selected model. Furthermore, we applied the best model to the full set of 6000 images and demonstrate that the automated segmentation of EL images can reveal many subtle features which cannot be inferred from studying a small sample of images. We believe these features can contribute to process optimization and quality control.
△ Less
Submitted 15 October, 2020;
originally announced October 2020.
-
Speed-of-sound imaging by differential phase contrast with angular compounding
Authors:
Nikunj Khetan,
Timothy Weber,
Jerome Mertz
Abstract:
We describe a technique to reveal speed-of-sound (SoS) variations within an echogenic sample. The technique uses the same receive data as standard pulse-echo imaging based on plane-wave compounding, and can be operated in parallel. Point-like scatterers randomly distributed throughout the sample serve as local probes of the downstream transmit-beam phase shifts caused by aberrating structures with…
▽ More
We describe a technique to reveal speed-of-sound (SoS) variations within an echogenic sample. The technique uses the same receive data as standard pulse-echo imaging based on plane-wave compounding, and can be operated in parallel. Point-like scatterers randomly distributed throughout the sample serve as local probes of the downstream transmit-beam phase shifts caused by aberrating structures within the sample. Phase shifts are monitored in a differential manner, providing signatures of transverse gradients of the local sample SoS. The contrast of the signatures is augmented by a method of angular compounding, which provides ``focus" control of the image sharpness, which, in turn, enables a visual localization of aberrating inclusions within the sample on the fly. The localization can be performed in 2D when operated with standard B-mode imaging, or in 3D when operated with C-mode imaging. Finally, we present a wave-acoustic forward model that provides insight into the principle of differential phase contrast (DPC) imaging, and roughly recapitulates experimental results obtained with an elastography phantom. In particular, we demonstrate that our technique easily reveals relative SoS variations as small as 0.5\% in real time. Such imaging may ultimately be useful for clinical diagnosis of pathologies in soft tissue.
△ Less
Submitted 6 July, 2020;
originally announced July 2020.
-
Interactive Image Restoration
Authors:
Zhiwei Han,
Thomas Weber,
Stefan Matthes,
Yuanting Liu,
Hao Shen
Abstract:
Machine learning and many of its applications are considered hard to approach due to their complexity and lack of transparency. One mission of human-centric machine learning is to improve algorithm transparency and user satisfaction while ensuring an acceptable task accuracy. In this work, we present an interactive image restoration framework, which exploits both image prior and human painting kno…
▽ More
Machine learning and many of its applications are considered hard to approach due to their complexity and lack of transparency. One mission of human-centric machine learning is to improve algorithm transparency and user satisfaction while ensuring an acceptable task accuracy. In this work, we present an interactive image restoration framework, which exploits both image prior and human painting knowledge in an iterative manner such that they can boost on each other. Additionally, in this system users can repeatedly get feedback of their interactions from the restoration progress. This informs the users about their impact on the restoration results, which leads to better sense of control, which can lead to greater trust and approachability. The positive results of both objective and subjective evaluation indicate that, our interactive approach positively contributes to the approachability of restoration algorithms in terms of algorithm performance and user experience.
△ Less
Submitted 24 October, 2019;
originally announced October 2019.
-
Implementation of Nonlinear Model Predictive Path-Following Control for an Industrial Robot
Authors:
Timm Faulwasser,
Tobias Weber,
Juan Pablo Zometa,
Rolf Findeisen
Abstract:
Many robotic applications, such as milling, gluing, or high precision measurements, require the exact following of a pre-defined geometric path. In this paper, we investigate the real-time feasible implementation of model predictive path-following control for an industrial robot. We consider constrained output path following with and without reference speed assignment. We present results from an i…
▽ More
Many robotic applications, such as milling, gluing, or high precision measurements, require the exact following of a pre-defined geometric path. In this paper, we investigate the real-time feasible implementation of model predictive path-following control for an industrial robot. We consider constrained output path following with and without reference speed assignment. We present results from an implementation of the proposed model predictive path-following controller on a KUKA LWR IV robot.
△ Less
Submitted 18 August, 2016; v1 submitted 30 June, 2015;
originally announced June 2015.