Showing 1–50 of 160 results for author: Maier, A

Searching in archive eess.
  1. arXiv:2406.16659  [pdf, other]

    cs.LG eess.SP

    Data-driven Modeling in Metrology -- A Short Introduction, Current Developments and Future Perspectives

    Authors: Linda-Sophie Schneider, Patrick Krauss, Nadine Schiering, Christopher Syben, Richard Schielein, Andreas Maier

    Abstract: Mathematical models are vital to the field of metrology, playing a key role in the derivation of measurement results and the calculation of uncertainties from measurement data, informed by an understanding of the measurement process. These models generally represent the correlation between the quantity being measured and all other pertinent quantities. Such relationships are used to construct meas…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 31 pages, Preprint

  2. arXiv:2406.14576  [pdf, other]

    eess.AS

    Towards Intelligent Speech Assistants in Operating Rooms: A Multimodal Model for Surgical Workflow Analysis

    Authors: Kubilay Can Demir, Belen Lojo Rodriguez, Tobias Weise, Andreas Maier, Seung Hee Yang

    Abstract: To develop intelligent speech assistants and integrate them seamlessly with intra-operative decision-support frameworks, accurate and efficient surgical phase recognition is a prerequisite. In this study, we propose a multimodal framework based on Gated Multimodal Units (GMU) and Multi-Stage Temporal Convolutional Networks (MS-TCN) to recognize surgical phases of port-catheter placement operations…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 5 Pages, Interspeech 2024

    MSC Class: 00b20

  3. arXiv:2405.19079  [pdf, other]

    eess.IV cs.CV

    On the Influence of Smoothness Constraints in Computed Tomography Motion Compensation

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Siyuan Mei, Mingxuan Gu, Laura Pfaff, Nastassia Vysotskaya, Haijun Yu, Andreas Maier

    Abstract: Computed tomography (CT) relies on precise patient immobilization during image acquisition. Nevertheless, motion artifacts in the reconstructed images can persist. Motion compensation methods aim to correct such artifacts post-acquisition, often incorporating temporal smoothness constraints on the estimated motion patterns. This study analyzes the influence of a spline-based motion model within an…

    Submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2404.08064  [pdf]

    eess.AS cs.AI cs.CR cs.LG

    The Impact of Speech Anonymization on Pathology and Its Limits

    Authors: Soroosh Tayebi Arasteh, Tomas Arias-Vergara, Paula Andrea Perez-Toro, Tobias Weise, Kai Packhaeuser, Maria Schuster, Elmar Noeth, Andreas Maier, Seung Hee Yang

    Abstract: Integration of speech into healthcare has intensified privacy concerns due to its potential as a non-invasive biomarker containing individual biometric information. In response, speaker anonymization aims to conceal personally identifiable information while retaining crucial linguistic content. However, the application of anonymization techniques to pathological speech, a critical area where priva…

    Submitted 22 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  5. arXiv:2404.03541  [pdf, other]

    eess.IV cs.CV

    Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models

    Authors: Siyuan Mei, Fuxin Fan, Fabian Wagner, Mareike Thies, Mingxuan Gu, Yipeng Sun, Andreas Maier

    Abstract: Deep learning-based medical image processing algorithms require representative data during development. In particular, surgical data might be difficult to obtain, and high-quality public datasets are limited. To overcome this limitation and augment datasets, a widely adopted solution is the generation of synthetic images. In this work, we employ conditional diffusion models to generate knee radiog…

    Submitted 4 April, 2024; originally announced April 2024.

  6. arXiv:2403.14440  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Analysing Diffusion Segmentation for Medical Images

    Authors: Mathias Öttl, Siyuan Mei, Frauke Wilm, Jana Steenpass, Matthias Rübner, Arndt Hartmann, Matthias Beckmann, Peter Fasching, Andreas Maier, Ramona Erber, Katharina Breininger

    Abstract: Denoising Diffusion Probabilistic models have become increasingly popular due to their ability to offer probabilistic modeling and generate diverse outputs. This versatility inspired their adaptation for image segmentation, where multiple predictions of the model can produce segmentation results that not only achieve high quality but also capture the uncertainty inherent in the model. Here, powerf…

    Submitted 21 March, 2024; originally announced March 2024.

  7. arXiv:2403.10695  [pdf, other]

    eess.IV cs.CV

    EAGLE: An Edge-Aware Gradient Localization Enhanced Loss for CT Image Reconstruction

    Authors: Yipeng Sun, Yixing Huang, Linda-Sophie Schneider, Mareike Thies, Mingxuan Gu, Siyuan Mei, Siming Bayer, Andreas Maier

    Abstract: Computed Tomography (CT) image reconstruction is crucial for accurate diagnosis and deep learning approaches have demonstrated significant potential in improving reconstruction quality. However, the choice of loss function profoundly affects the reconstructed images. Traditional mean squared error loss often produces blurry images lacking fine details, while alternatives designed to improve may in…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Preprint

  8. arXiv:2403.03326  [pdf, other]

    eess.IV cs.CV

    AnatoMix: Anatomy-aware Data Augmentation for Multi-organ Segmentation

    Authors: Chang Liu, Fuxin Fan, Annette Schwarz, Andreas Maier

    Abstract: Multi-organ segmentation in medical images is a widely researched task and can save clinicians much manual effort in daily routines. Automating the organ segmentation process using deep learning (DL) is a promising solution and state-of-the-art segmentation models are achieving promising accuracy. In this work, we proposed a novel data augmentation strategy for increasing the generalizability…

    Submitted 5 March, 2024; originally announced March 2024.

  9. A Spatiotemporal Illumination Model for 3D Image Fusion in Optical Coherence Tomography

    Authors: Stefan Ploner, Jungeun Won, Julia Schottenhamml, Jessica Girgis, Kenneth Lam, Nadia Waheed, James Fujimoto, Andreas Maier

    Abstract: Optical coherence tomography (OCT) is a non-invasive, micrometer-scale imaging modality that has become a clinical standard in ophthalmology. By raster-scanning the retina, sequential cross-sectional image slices are acquired to generate volumetric data. In-vivo imaging suffers from discontinuities between slices that show up as motion and illumination artifacts. We present a new illumination mode…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Presented orally & as poster on 20th April 2023 at the IEEE International Symposium on Biomedical Imaging (ISBI) in Cartagena, Colombia. 6 pages, 3 figures. You can find the official version with broken equations and bad contrast figures under https://ieeexplore.ieee.org/document/10230526

  10. arXiv:2401.16104  [pdf, other]

    cs.CV eess.IV

    A 2D Sinogram-Based Approach to Defect Localization in Computed Tomography

    Authors: Yuzhong Zhou, Linda-Sophie Schneider, Fuxin Fan, Andreas Maier

    Abstract: The rise of deep learning has introduced a transformative era in the field of image processing, particularly in the context of computed tomography. Deep learning has made a significant contribution to the field of industrial Computed Tomography. However, many defect detection algorithms are applied directly to the reconstructed domain, often disregarding the raw sensor data. This paper shifts the…

    Submitted 29 January, 2024; originally announced January 2024.

  11. arXiv:2401.16039  [pdf, other]

    eess.IV cs.CV cs.LG

    Data-Driven Filter Design in FBP: Transforming CT Reconstruction with Trainable Fourier Series

    Authors: Yipeng Sun, Linda-Sophie Schneider, Fuxin Fan, Mareike Thies, Mingxuan Gu, Siyuan Mei, Yuzhong Zhou, Siming Bayer, Andreas Maier

    Abstract: In this study, we introduce a Fourier series-based trainable filter for computed tomography (CT) reconstruction within the filtered backprojection (FBP) framework. This method overcomes the limitation in noise reduction, inherent in conventional FBP methods, by optimizing Fourier series coefficients to construct the filter. This method enables robust performance across different resolution scales…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Preprint

  12. arXiv:2401.12725  [pdf, other]

    eess.IV cs.CV

    Two-View Topogram-Based Anatomy-Guided CT Reconstruction for Prospective Risk Minimization

    Authors: Chang Liu, Laura Klein, Yixing Huang, Edith Baader, Michael Lell, Marc Kachelrieß, Andreas Maier

    Abstract: To facilitate a prospective estimation of CT effective dose and risk minimization process, a prospective spatial dose estimation and the known anatomical structures are expected. To this end, a CT reconstruction method is required to reconstruct CT volumes from as few projections as possible, i.e. by using the topograms, with anatomical structures as correct as possible. In this work, an optimized…

    Submitted 23 January, 2024; originally announced January 2024.

  13. arXiv:2401.09283  [pdf, other]

    eess.IV cs.CV

    A gradient-based approach to fast and accurate head motion compensation in cone-beam CT

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Haijun Yu, Manuela Meier, Linda-Sophie Schneider, Mingxuan Gu, Siyuan Mei, Lukas Folle, Andreas Maier

    Abstract: Cone-beam computed tomography (CBCT) systems, with their portability, present a promising avenue for direct point-of-care medical imaging, particularly in critical scenarios such as acute stroke assessment. However, the integration of CBCT into clinical workflows faces challenges, primarily linked to long scan duration resulting in patient motion during scanning and leading to image quality degrad…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  14. arXiv:2401.03912  [pdf, other]

    eess.IV cs.CV cs.LG

    Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification

    Authors: Adarsh Bhandary Panambur, Hui Yu, Sheethal Bhat, Prathmesh Madhu, Siming Bayer, Andreas Maier

    Abstract: The assessment of breast density is crucial in the context of breast cancer screening, especially in populations with a higher percentage of dense breast tissues. This study introduces a novel data augmentation technique termed Attention-Guided Erasing (AGE), devised to enhance the downstream classification of four distinct breast density categories in mammography following the BI-RADS recommendat…

    Submitted 8 January, 2024; originally announced January 2024.

  15. arXiv:2312.13744  [pdf]

    eess.SY

    Modelling of Networked Measuring Systems -- From White-Box Models to Data Based Approaches

    Authors: Klaus-Dieter Sommer, Peter Harris, Sascha Eichstädt, Roland Füssl, Tanja Dorst, Andreas Schütze, Michael Heizmann, Nadine Schiering, Andreas Maier, Yuhui Luo, Christos Tachtatzis, Ivan Andonovic, Gordon Gourlay

    Abstract: Mathematical modelling is at the core of metrology as it transforms raw measured data into useful measurement results. A model captures the relationship between the measurand and all relevant quantities on which the measurand depends, and is used to design measuring systems, analyse measured data, make inferences and predictions, and is the basis for evaluating measurement uncertainties. Tradition…

    Submitted 21 December, 2023; originally announced December 2023.

  16. arXiv:2312.08255  [pdf, other]

    eess.IV cs.CV cs.LG

    OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods

    Authors: Mikhail Kulyabin, Aleksei Zhdanov, Anastasia Nikiforova, Andrey Stepichev, Anna Kuznetsova, Mikhail Ronkin, Vasilii Borisov, Alexander Bogachev, Sergey Korotkich, Paul A Constable, Andreas Maier

    Abstract: Optical coherence tomography (OCT) is a non-invasive imaging technique with extensive clinical applications in ophthalmology. OCT enables the visualization of the retinal layers, playing a vital role in the early detection and monitoring of retinal diseases. OCT uses the principle of light wave interference to create detailed images of the retinal microstructures, making it a valuable tool for dia…

    Submitted 31 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  17. arXiv:2306.14596  [pdf, other]

    eess.IV cs.CV

    Deep Learning for Cancer Prognosis Prediction Using Portrait Photos by StyleGAN Embedding

    Authors: Amr Hagag, Ahmed Gomaa, Dominik Kornek, Andreas Maier, Rainer Fietkau, Christoph Bert, Florian Putz, Yixing Huang

    Abstract: Survival prediction for cancer patients is critical for optimal treatment selection and patient management. Current patient survival prediction methods typically extract survival information from patients' clinical record data or biological and imaging data. In practice, experienced clinicians can have a preliminary assessment of patients' health status based on patients' observable physical appea…

    Submitted 28 June, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

  18. A Vessel-Segmentation-Based CycleGAN for Unpaired Multi-modal Retinal Image Synthesis

    Authors: Aline Sindel, Andreas Maier, Vincent Christlein

    Abstract: Unpaired image-to-image translation of retinal images can efficiently increase the training dataset for deep-learning-based multi-modal retinal registration methods. Our method integrates a vessel segmentation network into the image-to-image translation task by extending the CycleGAN framework. The segmentation network is inserted prior to a UNet vision transformer generator network and serves as…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted to BVM 2023

    Journal ref: BVM 2023

  19. arXiv:2306.01752  [pdf, other]

    eess.IV cs.CV cs.LG

    Handling Label Uncertainty on the Example of Automatic Detection of Shepherd's Crook RCA in Coronary CT Angiography

    Authors: Felix Denzinger, Michael Wels, Oliver Taubmann, Florian Kordon, Fabian Wagner, Stephanie Mehltretter, Mehmet A. Gülsün, Max Schöbinger, Florian André, Sebastian Buss, Johannes Görich, Michael Sühling, Andreas Maier

    Abstract: Coronary artery disease (CAD) is often treated minimally invasively with a catheter being inserted into the diseased coronary vessel. If a patient exhibits a Shepherd's Crook (SC) Right Coronary Artery (RCA) - an anatomical norm variant of the coronary vasculature - the complexity of this procedure is increased. Automated reporting of this variant from coronary CT angiography screening would ease…

    Submitted 22 May, 2023; originally announced June 2023.

    Comments: Accepted at ISBI 2023

  20. Federated learning for secure development of AI models for Parkinson's disease detection using speech from different languages

    Authors: Soroosh Tayebi Arasteh, Cristian David Rios-Urrego, Elmar Noeth, Andreas Maier, Seung Hee Yang, Jan Rusz, Juan Rafael Orozco-Arroyave

    Abstract: Parkinson's disease (PD) is a neurological disorder impacting a person's speech. Among automatic PD assessment methods, deep learning models have gained particular interest. Recently, the community has explored cross-pathology and cross-language models which can improve diagnostic accuracy even further. However, strict patient data privacy regulations largely prevent institutions from sharing pati…

    Submitted 21 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023, pp. 5003--5007, Dublin, Ireland

    Journal ref: INTERSPEECH 2023

  21. arXiv:2305.08227  [pdf, other]

    eess.AS cs.CL cs.SD

    DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement

    Authors: Hendrik Schröter, Tobias Rosenkranz, Alberto N. Escalante-B., Andreas Maier

    Abstract: Multi-frame algorithms for single-channel speech enhancement are able to take advantage of short-time correlations within the speech signal. Deep Filtering (DF) was proposed to directly estimate a complex filter in frequency domain to take advantage of these correlations. In this work, we present a real-time speech enhancement demo using DeepFilterNet. DeepFilterNet's efficiency is enabled by ex…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted as show and tell demo to Interspeech 2023

  22. arXiv:2305.08225  [pdf, other]

    eess.AS

    Deep Multi-Frame Filtering for Hearing Aids

    Authors: Hendrik Schröter, Tobias Rosenkranz, Alberto N. Escalante-B., Andreas Maier

    Abstract: Multi-frame algorithms for single-channel speech enhancement are able to take advantage of short-time correlations within the speech signal. Deep filtering (DF) recently demonstrated its capabilities for low-latency scenarios like hearing aids with its complex multi-frame (MF) filter. Alternatively, the complex filter can be estimated via an MF minimum variance distortionless response (MVDR), or…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: Submitted to Interspeech 2023

  23. arXiv:2303.14711  [pdf, other]

    eess.IV cs.CV

    Unsupervised detection of small hyperreflective features in ultrahigh resolution optical coherence tomography

    Authors: Marcel Reimann, Jungeun Won, Hiroyuki Takahashi, Antonio Yaghy, Yunchan Hwang, Stefan Ploner, Junhong Lin, Jessica Girgis, Kenneth Lam, Siyu Chen, Nadia K. Waheed, Andreas Maier, James G. Fujimoto

    Abstract: Recent advances in optical coherence tomography such as the development of high speed ultrahigh resolution scanners and corresponding signal processing techniques may reveal new potential biomarkers in retinal diseases. Newly visible features are, for example, small hyperreflective specks in age-related macular degeneration. Identifying these new markers is crucial to investigate potential associa…

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted as poster at BVM workshop 2023 (https://www.bvm-workshop.org/). The arXiv version provides full quality figures. 6 pages content (2 figures)

  24. arXiv:2303.11724  [pdf, other]

    cs.CV cs.LG eess.IV

    Task-based Generation of Optimized Projection Sets using Differentiable Ranking

    Authors: Linda-Sophie Schneider, Mareike Thies, Christopher Syben, Richard Schielein, Mathias Unberath, Andreas Maier

    Abstract: We present a method for selecting valuable projections in computed tomography (CT) scans to enhance image reconstruction and diagnosis. The approach integrates two important factors, projection-based detectability and data completeness, into a single feed-forward neural network. The network evaluates the value of projections, processes them through a differentiable ranking function and makes the f…

    Submitted 21 March, 2023; originally announced March 2023.

  25. arXiv:2303.00500  [pdf, other]

    cs.CV cs.LG eess.IV

    Inherently Interpretable Multi-Label Classification Using Class-Specific Counterfactuals

    Authors: Susu Sun, Stefano Woerner, Andreas Maier, Lisa M. Koch, Christian F. Baumgartner

    Abstract: Interpretability is essential for machine learning algorithms in high-stakes application fields such as medical image analysis. However, high-performing black-box neural networks do not provide explanations for their predictions, which can lead to mistrust and suboptimal human-ML collaboration. Post-hoc explanation techniques, which are widely used in practice, have been shown to suffer from sever…

    Submitted 8 August, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted to MIDL 2023

  26. arXiv:2302.11612  [pdf]

    eess.IV

    Retinal blood flow speed quantification at the capillary level using temporal autocorrelation fitting OCTA

    Authors: Yunchan Hwang, Jungeun Won, Antonio Yaghy, Hiroyuki Takahashi, Jessica M. Girgis, Kenneth Lam, Siyu Chen, Eric M. Moult, Stefan B. Ploner, Andreas Maier, Nadia K. Waheed, James G. Fujimoto

    Abstract: Optical coherence tomography angiography (OCTA) can visualize vasculature structures, but provides limited information about the blood flow speeds. Here, we present a second generation variable interscan time analysis (VISTA) OCTA, which evaluates a quantitative surrogate marker for blood flow speed in vasculature. At the capillary level, spatially compiled OCTA and a simple temporal autocorrelati…

    Submitted 22 February, 2023; originally announced February 2023.

  27. arXiv:2302.06251  [pdf, other]

    eess.IV cs.CV

    Optimizing CT Scan Geometries With and Without Gradients

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Laura Pfaff, Linda-Sophie Schneider, Christopher Syben, Andreas Maier

    Abstract: In computed tomography (CT), the projection geometry used for data acquisition needs to be known precisely to obtain a clear reconstructed image. Rigid patient motion is a cause for misalignment between measured data and employed geometry. Commonly, such motion is compensated by solving an optimization problem that, e.g., maximizes the quality of the reconstructed image with respect to the project…

    Submitted 13 February, 2023; originally announced February 2023.

  28. arXiv:2301.09282  [pdf, other]

    eess.IV cs.CV cs.LG

    Classification of Luminal Subtypes in Full Mammogram Images Using Transfer Learning

    Authors: Adarsh Bhandary Panambur, Prathmesh Madhu, Andreas Maier

    Abstract: Automatic identification of patients with luminal and non-luminal subtypes during a routine mammography screening can support clinicians in streamlining breast cancer therapy planning. Recent machine learning techniques have shown promising results in molecular subtype classification in mammography; however, they are highly dependent on pixel-level annotations, handcrafted, and radiomic features.…

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: Submitted to IEEE ISBI 2023

  29. arXiv:2301.04423  [pdf, other]

    eess.IV cs.CV

    Multi-Scanner Canine Cutaneous Squamous Cell Carcinoma Histopathology Dataset

    Authors: Frauke Wilm, Marco Fragoso, Christof A. Bertram, Nikolas Stathonikos, Mathias Öttl, Jingna Qiu, Robert Klopfleisch, Andreas Maier, Katharina Breininger, Marc Aubreville

    Abstract: In histopathology, scanner-induced domain shifts are known to impede the performance of trained neural networks when tested on unseen data. Multi-domain pre-training or dedicated domain-generalization techniques can help to develop domain-agnostic algorithms. For this, multi-scanner datasets with a high variety of slide scanning systems are highly desirable. We present a publicly available multi-s…

    Submitted 27 February, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: 6 pages, 3 figures, 1 table, accepted at BVM workshop 2023

  30. arXiv:2212.04832  [pdf, other]

    eess.IV cs.CV

    Noise2Contrast: Multi-Contrast Fusion Enables Self-Supervised Tomographic Image Denoising

    Authors: Fabian Wagner, Mareike Thies, Laura Pfaff, Noah Maul, Sabrina Pechmann, Mingxuan Gu, Jonas Utz, Oliver Aust, Daniela Weidner, Georgiana Neag, Stefan Uderhardt, Jang-Hwan Choi, Andreas Maier

    Abstract: Self-supervised image denoising techniques emerged as convenient methods that allow training denoising models without requiring ground-truth noise-free data. Existing methods usually optimize loss metrics that are calculated from multiple noisy realizations of similar images, e.g., from neighboring tomographic slices. However, those approaches fail to utilize the multiple contrasts that are routin…

    Submitted 9 December, 2022; originally announced December 2022.

  31. Gradient-Based Geometry Learning for Fan-Beam CT Reconstruction

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Lukas Folle, Manuela Meier, Maximilian Rohleder, Linda-Sophie Schneider, Laura Pfaff, Mingxuan Gu, Jonas Utz, Felix Denzinger, Michael Manhart, Andreas Maier

    Abstract: Incorporating computed tomography (CT) reconstruction operators into differentiable pipelines has proven beneficial in many applications. Such approaches usually focus on the projection data and keep the acquisition geometry fixed. However, precise knowledge of the acquisition geometry is essential for high quality reconstruction results. In this paper, the differentiable formulation of fan-beam C…

    Submitted 5 December, 2022; originally announced December 2022.

  32. arXiv:2211.16219  [pdf, other]

    cs.CV eess.IV

    Metal-conscious Embedding for CBCT Projection Inpainting

    Authors: Fuxin Fan, Yangkong Wang, Ludwig Ritschl, Ramyar Biniazan, Marcel Beister, Björn Kreher, Yixing Huang, Steffen Kappler, Andreas Maier

    Abstract: The existence of metallic implants in projection images for cone-beam computed tomography (CBCT) introduces undesired artifacts which degrade the quality of reconstructed images. In order to reduce metal artifacts, projection inpainting is an essential step in many metal artifact reduction algorithms. In this work, a hybrid network combining the shift window (Swin) vision transformer (ViT) and a c…

    Submitted 29 November, 2022; originally announced November 2022.

  33. arXiv:2211.16141  [pdf, other]

    eess.IV cs.CV

    Mind the Gap: Scanner-induced domain shifts pose challenges for representation learning in histopathology

    Authors: Frauke Wilm, Marco Fragoso, Christof A. Bertram, Nikolas Stathonikos, Mathias Öttl, Jingna Qiu, Robert Klopfleisch, Andreas Maier, Marc Aubreville, Katharina Breininger

    Abstract: Computer-aided systems in histopathology are often challenged by various sources of domain shift that impact the performance of these algorithms considerably. We investigated the potential of using self-supervised pre-training to overcome scanner-induced domain shifts for the downstream task of tumor segmentation. For this, we present the Barlow Triplets to learn scanner-invariant representations…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 5 pages, 4 figures, 1 table. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  34. Improved HER2 Tumor Segmentation with Subtype Balancing using Deep Generative Networks

    Authors: Mathias Öttl, Jana Mönius, Matthias Rübner, Carol I. Geppert, Jingna Qiu, Frauke Wilm, Arndt Hartmann, Matthias W. Beckmann, Peter A. Fasching, Andreas Maier, Ramona Erber, Katharina Breininger

    Abstract: Tumor segmentation in histopathology images is often complicated by its composition of different histological subtypes and class imbalance. Oversampling subtypes with low prevalence features is not a satisfactory solution since it eventually leads to overfitting. We propose to create synthetic images with semantically-conditioned deep generative networks and to combine subtype-balanced synthetic i…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 5 pages, 6 figures

  35. arXiv:2211.06146  [pdf, other]

    eess.IV cs.CV

    An unobtrusive quality supervision approach for medical image annotation

    Authors: Sonja Kunzmann, Mathias Öttl, Prathmesh Madhu, Felix Denzinger, Andreas Maier

    Abstract: Image annotation is one essential prior step to enable data-driven algorithms. In medical imaging, having large and reliably annotated data sets is crucial to recognize various diseases robustly. However, annotator performance varies immensely, thus impacts model training. Therefore, often multiple annotators should be employed, which is however expensive and resource-intensive. Hence, it is desir…

    Submitted 22 November, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 4 pages, 4 figures

  36. arXiv:2211.01323  [pdf, other]

    eess.IV cs.CV cs.LG

    Generation of Anonymous Chest Radiographs Using Latent Diffusion Models for Training Thoracic Abnormality Classification Systems

    Authors: Kai Packhäuser, Lukas Folle, Florian Thamm, Andreas Maier

    Abstract: The availability of large-scale chest X-ray datasets is a requirement for developing well-performing deep learning-based algorithms in thoracic abnormality detection and classification. However, biometric identifiers in chest radiographs hinder the public sharing of such data for research purposes due to the risk of patient re-identification. To counteract this issue, synthetic data generation off…

    Submitted 4 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  37. arXiv:2211.01111  [pdf, other]

    eess.IV cs.CV

    On the Benefit of Dual-domain Denoising in a Self-supervised Low-dose CT Setting

    Authors: Fabian Wagner, Mareike Thies, Laura Pfaff, Oliver Aust, Sabrina Pechmann, Daniela Weidner, Noah Maul, Maximilian Rohleder, Mingxuan Gu, Jonas Utz, Felix Denzinger, Andreas Maier

    Abstract: Computed tomography (CT) is routinely used for three-dimensional non-invasive imaging. Numerous data-driven image denoising algorithms were proposed to restore image quality in low-dose acquisitions. However, considerably less research investigates methods already intervening in the raw detector data due to limited access to suitable projection data or correct reconstruction algorithms. In this wo…

    Submitted 3 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  38. arXiv:2210.07611  [pdf, other]

    eess.IV cs.CV cs.LG

    Self-Supervised 2D/3D Registration for X-Ray to CT Image Fusion

    Authors: Srikrishna Jaganathan, Maximilian Kukla, Jian Wang, Karthik Shetty, Andreas Maier

    Abstract: Deep Learning-based 2D/3D registration enables fast, robust, and accurate X-ray to CT image fusion when large annotated paired datasets are available for training. However, the need for paired CT volume and X-ray images with ground truth registration limits the applicability in interventional scenarios. An alternative is to use simulated X-ray projections from CT volumes, thus removing the need fo…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted at WACV 2023

  39. 3D Rendering Framework for Data Augmentation in Optical Character Recognition

    Authors: Andreas Spruck, Maximiliane Hawesch, Anatol Maier, Christian Riess, Jürgen Seiler, André Kaup

    Abstract: In this paper, we propose a data augmentation framework for Optical Character Recognition (OCR). The proposed framework is able to synthesize new viewing angles and illumination scenarios, effectively enriching any available OCR dataset. Its modular structure allows to be modified to match individual user requirements. The framework enables to comfortably scale the enlargement factor of the availa…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: IEEE International Symposium on Signals, Circuits and Systems (ISSCS), 1-4, July 2021

  40. arXiv:2209.14448  [pdf, other]

    cs.CV eess.IV

    Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition

    Authors: Andreas Spruck, Maximilane Gruber, Anatol Maier, Denise Moussa, Jürgen Seiler, Christian Riess, André Kaup

    Abstract: An insufficient number of training samples is a common problem in neural network applications. While data augmentation methods require at least a minimum number of samples, we propose a novel, rendering-based pipeline for synthesizing annotated data sets. Our method does not modify existing samples but synthesizes entirely new samples. The proposed rendering-based pipeline is capable of generating…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: submitted to IEEE Transactions on Intelligent Transportation Systems

  41. arXiv:2209.11531  [pdf, other]

    eess.IV cs.CR cs.CV cs.LG

    Deep Learning-based Anonymization of Chest Radiographs: A Utility-preserving Measure for Patient Privacy

    Authors: Kai Packhäuser, Sebastian Gündel, Florian Thamm, Felix Denzinger, Andreas Maier

    Abstract: Robust and reliable anonymization of chest radiographs constitutes an essential step before publishing large datasets of such for research purposes. The conventional anonymization process is carried out by obscuring personal information in the images with black boxes and removing or replacing meta-information. However, such simple measures retain biometric information in the chest radiographs, all…

    Submitted 24 July, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: Accepted at MICCAI 2023

  42. arXiv:2209.09733  [pdf, other]

    eess.IV cs.CV

    Metal Inpainting in CBCT Projections Using Score-based Generative Model

    Authors: Siyuan Mei, Fuxin Fan, Andreas Maier

    Abstract: During orthopaedic surgery, the insertion of metallic implants or screws is often performed under mobile C-arm systems. Due to the high attenuation of metals, severe metal artifacts occur in 3D reconstructions, which greatly degrade the image quality. To reduce the artifacts, many metal artifact reduction algorithms have been developed, and metal inpainting in the projection domain is an essential ste…

    Submitted 20 September, 2022; originally announced September 2022.

  43. arXiv:2209.07232  [pdf, other]

    eess.IV cs.CV

    A Spatiotemporal Model for Precise and Efficient Fully-automatic 3D Motion Correction in OCT

    Authors: Stefan Ploner, Siyu Chen, Jungeun Won, Lennart Husvogt, Katharina Breininger, Julia Schottenhamml, James Fujimoto, Andreas Maier

    Abstract: Optical coherence tomography (OCT) is a micrometer-scale, volumetric imaging modality that has become a clinical standard in ophthalmology. OCT instruments image by raster-scanning a focused light spot across the retina, acquiring sequential cross-sectional images to generate volumetric data. Patient eye motion during the acquisition poses unique challenges: Non-rigid, discontinuous distortions ca…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Presented at MICCAI 2022 (main conference). The arXiv version provides full quality figures. 9 pages content (5 figures) + 2 pages references + 2 pages supplementary material (2 figures)

  44. Deep learning for automatic head and neck lymph node level delineation provides expert-level accuracy

    Authors: Thomas Weissmann, Yixing Huang, Stefan Fischer, Johannes Roesch, Sina Mansoorian, Horacio Ayala Gaona, Antoniu-Oreste Gostian, Markus Hecht, Sebastian Lettmaier, Lisa Deloch, Benjamin Frey, Udo S. Gaipl, Luitpold V. Distel, Andreas Maier, Heinrich Iro, Sabine Semrau, Christoph Bert, Rainer Fietkau, Florian Putz

    Abstract: Background: Deep learning (DL)-based head and neck lymph node level (HN_LNL) autodelineation is of high relevance to radiotherapy research and clinical treatment planning but still underinvestigated in academic literature. Methods: An expert-delineated cohort of 35 planning CTs was used for training of an nnU-net 3D-fullres/2D-ensemble model for autosegmentation of 20 different HN_LNL. A second co…

    Submitted 1 March, 2023; v1 submitted 28 August, 2022; originally announced August 2022.

    Comments: 14 pages, 6 figures, published in Frontiers in Oncology

    Journal ref: Front. Oncol. 13:1115258

  45. arXiv:2207.14650  [pdf]

    eess.IV cs.CV cs.LG

    SYNTA: A novel approach for deep learning-based image analysis in muscle histopathology using photo-realistic synthetic data

    Authors: Leonid Mill, Oliver Aust, Jochen A. Ackermann, Philipp Burger, Monica Pascual, Katrin Palumbo-Zerr, Gerhard Krönke, Stefan Uderhardt, Georg Schett, Christoph S. Clemen, Rolf Schröder, Christian Holtzhausen, Samir Jabari, Andreas Maier, Anika Grüneboom

    Abstract: Artificial intelligence (AI), machine learning, and deep learning (DL) methods are becoming increasingly important in the field of biomedical image analysis. However, to exploit the full potential of such methods, a representative number of experimentally acquired images containing a significant number of manually annotated objects is needed as training data. Here we introduce SYNTA (synthetic dat…

    Submitted 3 January, 2024; v1 submitted 29 July, 2022; originally announced July 2022.

  46. Trainable Joint Bilateral Filters for Enhanced Prediction Stability in Low-dose CT

    Authors: Fabian Wagner, Mareike Thies, Felix Denzinger, Mingxuan Gu, Mayank Patwari, Stefan Ploner, Noah Maul, Laura Pfaff, Yixing Huang, Andreas Maier

    Abstract: Low-dose computed tomography (CT) denoising algorithms aim to enable reduced patient dose in routine CT acquisitions while maintaining high image quality. Recently, deep learning (DL)-based methods were introduced, outperforming conventional denoising algorithms on this task due to their high model capacity. However, for the transition of DL-based denoising to clinical practice, these data-driven…

    Submitted 15 July, 2022; originally announced July 2022.

    Journal ref: Sci.Rep. 12 (2022) 17540

  47. arXiv:2207.02392  [pdf, other]

    eess.IV cs.CV cs.LG

    AutoSpeed: A Linked Autoencoder Approach for Pulse-Echo Speed-of-Sound Imaging for Medical Ultrasound

    Authors: Farnaz Khun Jush, Markus Biele, Peter M. Dueppenbecker, Andreas Maier

    Abstract: Quantitative ultrasound, e.g., speed-of-sound (SoS) in tissues, provides information about tissue properties that have diagnostic value. Recent studies showed the possibility of extracting SoS information from pulse-echo ultrasound raw data (a.k.a. RF data) using deep neural networks that are fully trained on simulated data. These methods take sensor domain data, i.e., RF data, as input and train…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 12 pages, 7 figures, submitted to Medical Image Analysis

  48. arXiv:2206.12320  [pdf, other]

    cs.SD cs.AI eess.AS

    PoCaP Corpus: A Multimodal Dataset for Smart Operating Room Speech Assistant using Interventional Radiology Workflow Analysis

    Authors: Kubilay Can Demir, Matthias May, Axel Schmid, Michael Uder, Katharina Breininger, Tobias Weise, Andreas Maier, Seung Hee Yang

    Abstract: This paper presents a new multimodal interventional radiology dataset, called PoCaP (Port Catheter Placement) Corpus. This corpus consists of speech and audio signals in German, X-ray images, and system commands collected from 31 PoCaP interventions by six surgeons with an average duration of 81.4 $\pm$ 41.0 minutes. The corpus aims to provide a resource for developing a smart speech assistant in ope…

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 8 pages, 4 figures, Text, Speech and Dialogue 2022 Conference

    MSC Class: 00b20

  49. arXiv:2205.13297  [pdf, other]

    cs.CV cs.LG eess.IV

    DeepTechnome: Mitigating Unknown Bias in Deep Learning Based Assessment of CT Images

    Authors: Simon Langer, Oliver Taubmann, Felix Denzinger, Andreas Maier, Alexander Mühlberg

    Abstract: Reliably detecting diseases using relevant biological information is crucial for real-world applicability of deep learning techniques in medical imaging. We debias deep learning models during training against unknown bias - without preprocessing/filtering the input beforehand or assuming specific knowledge about its distribution or precise nature in the dataset. We use control regions as surrogate…

    Submitted 26 May, 2022; originally announced May 2022.

    ACM Class: I.2.6; I.4; I.5

  50. arXiv:2205.05474  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band Audio

    Authors: Hendrik Schröter, Alberto N. Escalante-B., Tobias Rosenkranz, Andreas Maier

    Abstract: Deep learning-based speech enhancement has seen huge improvements and recently also expanded to full-band audio (48 kHz). However, many approaches have a rather high computational complexity and require big temporal buffers for real-time usage, e.g., due to temporal convolutions or attention. Both make those approaches infeasible on embedded devices. This work further extends DeepFilterNet, which…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Submitted to IWAENC 2022