Search | arXiv e-print repository

arXiv:2404.09683 [pdf, other]

Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition

Authors: Tobias Weber, Jakob Dexl, David Rügamer, Michael Ingrisch

Abstract: We address the computational barrier of deploying advanced deep learning segmentation models in clinical settings by studying the efficacy of network compression through tensor decomposition. We propose a post-training Tucker factorization that enables the decomposition of pre-existing models to reduce computational requirements without impeding segmentation accuracy. We applied Tucker decompositi… ▽ More We address the computational barrier of deploying advanced deep learning segmentation models in clinical settings by studying the efficacy of network compression through tensor decomposition. We propose a post-training Tucker factorization that enables the decomposition of pre-existing models to reduce computational requirements without impeding segmentation accuracy. We applied Tucker decomposition to the convolutional kernels of the TotalSegmentator (TS) model, an nnU-Net model trained on a comprehensive dataset for automatic segmentation of 117 anatomical structures. Our approach reduced the floating-point operations (FLOPs) and memory required during inference, offering an adjustable trade-off between computational efficiency and segmentation quality. This study utilized the publicly available TS dataset, employing various downsampling factors to explore the relationship between model size, inference speed, and segmentation performance. The application of Tucker decomposition to the TS model substantially reduced the model parameters and FLOPs across various compression rates, with limited loss in segmentation accuracy. We removed up to 88% of the model's parameters with no significant performance changes in the majority of classes after fine-tuning. Practical benefits varied across different graphics processing unit (GPU) architectures, with more distinct speed-ups on less powerful hardware. Post-hoc network compression via Tucker decomposition presents a viable strategy for reducing the computational demand of medical image segmentation models without substantially sacrificing accuracy. This approach enables the broader adoption of advanced deep learning technologies in clinical practice, offering a way to navigate the constraints of hardware capabilities. △ Less

Submitted 18 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

arXiv:2305.16376 [pdf, other]

Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction

Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

Abstract: Undersampling is a common method in Magnetic Resonance Imaging (MRI) to subsample the number of data points in k-space, reducing acquisition times at the cost of decreased image quality. A popular approach is to employ undersampling patterns following various strategies, e.g., variable density sampling or radial trajectories. In this work, we propose a method that directly learns the undersampling… ▽ More Undersampling is a common method in Magnetic Resonance Imaging (MRI) to subsample the number of data points in k-space, reducing acquisition times at the cost of decreased image quality. A popular approach is to employ undersampling patterns following various strategies, e.g., variable density sampling or radial trajectories. In this work, we propose a method that directly learns the undersampling masks from data points, thereby also providing task- and domain-specific patterns. To solve the resulting discrete optimization problem, we propose a general optimization routine called ProM: A fully probabilistic, differentiable, versatile, and model-free framework for mask optimization that enforces acceleration factors through a convex constraint. Analyzing knee, brain, and cardiac MRI datasets with our method, we discover that different anatomic regions reveal distinct optimal undersampling masks, demonstrating the benefits of using custom masks, tailored for a downstream task. For example, ProM can create undersampling masks that maximize performance in downstream tasks like segmentation with networks trained on fully-sampled MRIs. Even with extreme acceleration factors, ProM yields reasonable performance while being more versatile than existing methods, paving the way for data-driven all-purpose mask generation. △ Less

Submitted 22 August, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

Comments: accepted at WACV 2024

arXiv:2303.11224 [pdf, other]

Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis

Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

Abstract: While recent advances in large-scale foundational models show promising results, their application to the medical domain has not yet been explored in detail. In this paper, we progress into the realms of large-scale modeling in medical synthesis by proposing Cheff - a foundational cascaded latent diffusion model, which generates highly-realistic chest radiographs providing state-of-the-art quality… ▽ More While recent advances in large-scale foundational models show promising results, their application to the medical domain has not yet been explored in detail. In this paper, we progress into the realms of large-scale modeling in medical synthesis by proposing Cheff - a foundational cascaded latent diffusion model, which generates highly-realistic chest radiographs providing state-of-the-art quality on a 1-megapixel scale. We further propose MaCheX, which is a unified interface for public chest datasets and forms the largest open collection of chest X-rays up to date. With Cheff conditioned on radiological reports, we further guide the synthesis process over text prompts and unveil the research area of report-to-chest-X-ray generation. △ Less

Submitted 20 March, 2023; originally announced March 2023.

Comments: accepted at PAKDD 2023

arXiv:2103.07949 [pdf, other]

doi 10.1063/5.0048071

Ultrasound differential phase contrast using backscattering and the memory effect

Authors: Timothy D. Weber, Nikunj Khetan, Ruohui Yang, Jerome Mertz

Abstract: We describe a simple and fast technique to perform ultrasound differential phase contrast (DPC) imaging in arbitrarily thick scattering media. Though configured in a reflection geometry, DPC is based on transmission imaging and is a direct analogue of optical differential interference contrast (DIC). DPC exploits the memory effect and works in combination with standard pulse-echo imaging, with no… ▽ More We describe a simple and fast technique to perform ultrasound differential phase contrast (DPC) imaging in arbitrarily thick scattering media. Though configured in a reflection geometry, DPC is based on transmission imaging and is a direct analogue of optical differential interference contrast (DIC). DPC exploits the memory effect and works in combination with standard pulse-echo imaging, with no additional hardware or data requirements, enabling complementary phase contrast (in the transverse direction) without any need for intensive numerical computation. We experimentally demonstrate the principle of DPC using tissue phantoms with calibrated speed-of-sound inclusions. △ Less

Submitted 14 March, 2021; originally announced March 2021.

Comments: 5 pages, 5 figures. Accepted for publication in Applied Physics Letters

arXiv:2010.07556 [pdf, other]

Encoder-decoder semantic segmentation models for electroluminescence images of thin-film photovoltaic modules

Authors: Evgenii Sovetkin, Elbert Jan Achterberg, Thomas Weber, Bart E. Pieters

Abstract: We consider a series of image segmentation methods based on the deep neural networks in order to perform semantic segmentation of electroluminescence (EL) images of thin-film modules. We utilize the encoder-decoder deep neural network architecture. The framework is general such that it can easily be extended to other types of images (e.g. thermography) or solar cell technologies (e.g. crystalline… ▽ More We consider a series of image segmentation methods based on the deep neural networks in order to perform semantic segmentation of electroluminescence (EL) images of thin-film modules. We utilize the encoder-decoder deep neural network architecture. The framework is general such that it can easily be extended to other types of images (e.g. thermography) or solar cell technologies (e.g. crystalline silicon modules). The networks are trained and tested on a sample of images from a database with 6000 EL images of Copper Indium Gallium Diselenide (CIGS) thin film modules. We selected two types of features to extract, shunts and so called "droplets". The latter feature is often observed in the set of images. Several models are tested using various combinations of encoder-decoder layers, and a procedure is proposed to select the best model. We show exemplary results with the best selected model. Furthermore, we applied the best model to the full set of 6000 images and demonstrate that the automated segmentation of EL images can reveal many subtle features which cannot be inferred from studying a small sample of images. We believe these features can contribute to process optimization and quality control. △ Less

Submitted 15 October, 2020; originally announced October 2020.

arXiv:2007.03156 [pdf, other]

Speed-of-sound imaging by differential phase contrast with angular compounding

Authors: Nikunj Khetan, Timothy Weber, Jerome Mertz

Abstract: We describe a technique to reveal speed-of-sound (SoS) variations within an echogenic sample. The technique uses the same receive data as standard pulse-echo imaging based on plane-wave compounding, and can be operated in parallel. Point-like scatterers randomly distributed throughout the sample serve as local probes of the downstream transmit-beam phase shifts caused by aberrating structures with… ▽ More We describe a technique to reveal speed-of-sound (SoS) variations within an echogenic sample. The technique uses the same receive data as standard pulse-echo imaging based on plane-wave compounding, and can be operated in parallel. Point-like scatterers randomly distributed throughout the sample serve as local probes of the downstream transmit-beam phase shifts caused by aberrating structures within the sample. Phase shifts are monitored in a differential manner, providing signatures of transverse gradients of the local sample SoS. The contrast of the signatures is augmented by a method of angular compounding, which provides ``focus" control of the image sharpness, which, in turn, enables a visual localization of aberrating inclusions within the sample on the fly. The localization can be performed in 2D when operated with standard B-mode imaging, or in 3D when operated with C-mode imaging. Finally, we present a wave-acoustic forward model that provides insight into the principle of differential phase contrast (DPC) imaging, and roughly recapitulates experimental results obtained with an elastography phantom. In particular, we demonstrate that our technique easily reveals relative SoS variations as small as 0.5\% in real time. Such imaging may ultimately be useful for clinical diagnosis of pathologies in soft tissue. △ Less

Submitted 6 July, 2020; originally announced July 2020.

Comments: 9 pages, 10 figures

arXiv:1910.11059 [pdf, other]

Interactive Image Restoration

Authors: Zhiwei Han, Thomas Weber, Stefan Matthes, Yuanting Liu, Hao Shen

Abstract: Machine learning and many of its applications are considered hard to approach due to their complexity and lack of transparency. One mission of human-centric machine learning is to improve algorithm transparency and user satisfaction while ensuring an acceptable task accuracy. In this work, we present an interactive image restoration framework, which exploits both image prior and human painting kno… ▽ More Machine learning and many of its applications are considered hard to approach due to their complexity and lack of transparency. One mission of human-centric machine learning is to improve algorithm transparency and user satisfaction while ensuring an acceptable task accuracy. In this work, we present an interactive image restoration framework, which exploits both image prior and human painting knowledge in an iterative manner such that they can boost on each other. Additionally, in this system users can repeatedly get feedback of their interactions from the restoration progress. This informs the users about their impact on the restoration results, which leads to better sense of control, which can lead to greater trust and approachability. The positive results of both objective and subjective evaluation indicate that, our interactive approach positively contributes to the approachability of restoration algorithms in terms of algorithm performance and user experience. △ Less

Submitted 24 October, 2019; originally announced October 2019.

Comments: Human-centric Machine Learning Workshop, NeurIPS 2019

arXiv:1506.09084 [pdf, other]

doi 10.1109/TCST.2016.2601624

Implementation of Nonlinear Model Predictive Path-Following Control for an Industrial Robot

Authors: Timm Faulwasser, Tobias Weber, Juan Pablo Zometa, Rolf Findeisen

Abstract: Many robotic applications, such as milling, gluing, or high precision measurements, require the exact following of a pre-defined geometric path. In this paper, we investigate the real-time feasible implementation of model predictive path-following control for an industrial robot. We consider constrained output path following with and without reference speed assignment. We present results from an i… ▽ More Many robotic applications, such as milling, gluing, or high precision measurements, require the exact following of a pre-defined geometric path. In this paper, we investigate the real-time feasible implementation of model predictive path-following control for an industrial robot. We consider constrained output path following with and without reference speed assignment. We present results from an implementation of the proposed model predictive path-following controller on a KUKA LWR IV robot. △ Less

Submitted 18 August, 2016; v1 submitted 30 June, 2015; originally announced June 2015.

Comments: 8 pages, 3 figures; final revised version

MSC Class: 93C83; 70Q05

Journal ref: IEEE Transactions on Control System Technology, 2017 25(4), 1505-1511

Showing 1–8 of 8 results for author: Weber, T