Skip to main content

Showing 1–50 of 306 results for author: Maier, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16659  [pdf, other

    cs.LG eess.SP

    Data-driven Modeling in Metrology -- A Short Introduction, Current Developments and Future Perspectives

    Authors: Linda-Sophie Schneider, Patrick Krauss, Nadine Schiering, Christopher Syben, Richard Schielein, Andreas Maier

    Abstract: Mathematical models are vital to the field of metrology, playing a key role in the derivation of measurement results and the calculation of uncertainties from measurement data, informed by an understanding of the measurement process. These models generally represent the correlation between the quantity being measured and all other pertinent quantities. Such relationships are used to construct meas… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 31 pages, Preprint

  2. arXiv:2406.16143  [pdf, other

    cs.CV

    Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain

    Authors: Maged Badawi, Mohammedyahia Abushanab, Sheethal Bhat, Andreas Maier

    Abstract: In this paper, different techniques of few-shot, zero-shot, and regular object detection have been investigated. The need for few-shot learning and zero-shot learning techniques is crucial and arises from the limitations and challenges in traditional machine learning, deep learning, and computer vision methods where they require large amounts of data, plus the poor generalization of those traditio… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  3. arXiv:2406.05477  [pdf, other

    cs.CV cs.LG

    Attri-Net: A Globally and Locally Inherently Interpretable Model for Multi-Label Classification Using Class-Specific Counterfactuals

    Authors: Susu Sun, Stefano Woerner, Andreas Maier, Lisa M. Koch, Christian F. Baumgartner

    Abstract: Interpretability is crucial for machine learning algorithms in high-stakes medical applications. However, high-performing neural networks typically cannot explain their predictions. Post-hoc explanation methods provide a way to understand neural networks but have been shown to suffer from conceptual problems. Moreover, current research largely focuses on providing local explanations for individual… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: Extension of paper: Inherently Interpretable Multi-Label Classification Using Class-Specific Counterfactuals (Sun et al., MIDL 2023)

  4. arXiv:2405.19079  [pdf, other

    eess.IV cs.CV

    On the Influence of Smoothness Constraints in Computed Tomography Motion Compensation

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Siyuan Mei, Mingxuan Gu, Laura Pfaff, Nastassia Vysotskaya, Haijun Yu, Andreas Maier

    Abstract: Computed tomography (CT) relies on precise patient immobilization during image acquisition. Nevertheless, motion artifacts in the reconstructed images can persist. Motion compensation methods aim to correct such artifacts post-acquisition, often incorporating temporal smoothness constraints on the estimated motion patterns. This study analyzes the influence of a spline-based motion model within an… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.16115  [pdf, other

    cs.CL cs.LG

    SNOBERT: A Benchmark for clinical notes entity linking in the SNOMED CT clinical terminology

    Authors: Mikhail Kulyabin, Gleb Sokolov, Aleksandr Galaida, Andreas Maier, Tomas Arias-Vergara

    Abstract: The extraction and analysis of insights from medical data, primarily stored in free-text formats by healthcare workers, presents significant challenges due to its unstructured nature. Medical coding, a crucial process in healthcare, remains minimally automated due to the complexity of medical ontologies and restricted access to medical texts for training Natural Language Processing models. In this… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  6. arXiv:2405.09333  [pdf, other

    cs.CV

    Application of Gated Recurrent Units for CT Trajectory Optimization

    Authors: Yuedong Yuan, Linda-Sophie Schneider, Andreas Maier

    Abstract: Recent advances in computed tomography (CT) imaging, especially with dual-robot systems, have introduced new challenges for scan trajectory optimization. This paper presents a novel approach using Gated Recurrent Units (GRUs) to optimize CT scan trajectories. Our approach exploits the flexibility of robotic CT systems to select projections that enhance image quality by improving resolution and con… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 4 pages, 6 figures

  7. arXiv:2405.02024  [pdf, other

    cs.CL cs.AI

    Analyzing Narrative Processing in Large Language Models (LLMs): Using GPT4 to test BERT

    Authors: Patrick Krauss, Jannik Hösch, Claus Metzner, Andreas Maier, Peter Uhrig, Achim Schilling

    Abstract: The ability to transmit and receive complex information via language is unique to humans and is the basis of traditions, culture and versatile social interactions. Through the disruptive introduction of transformer based large language models (LLMs) humans are not the only entity to "understand" and produce language any more. In the present study, we have performed the first steps to use LLMs as a… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  8. arXiv:2405.01156  [pdf, other

    cs.CV cs.AI

    Self-Supervised Learning for Interventional Image Analytics: Towards Robust Device Trackers

    Authors: Saahil Islam, Venkatesh N. Murthy, Dominik Neumann, Badhan Kumar Das, Puneet Sharma, Andreas Maier, Dorin Comaniciu, Florin C. Ghesu

    Abstract: An accurate detection and tracking of devices such as guiding catheters in live X-ray image acquisitions is an essential prerequisite for endovascular cardiac interventions. This information is leveraged for procedural guidance, e.g., directing stent placements. To ensure procedural safety and efficacy, there is a need for high robustness no failures during tracking. To achieve that, one needs to… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  9. arXiv:2404.14807  [pdf, other

    cs.CV

    Reference-Free Multi-Modality Volume Registration of X-Ray Microscopy and Light-Sheet Fluorescence Microscopy

    Authors: Siyuan Mei, Fuxin Fan, Mareike Thies, Mingxuan Gu, Fabian Wagner, Oliver Aust, Ina Erceg, Zeynab Mirzaei, Georgiana Neag, Yipeng Sun, Yixing Huang, Andreas Maier

    Abstract: Recently, X-ray microscopy (XRM) and light-sheet fluorescence microscopy (LSFM) have emerged as two pivotal imaging tools in preclinical research on bone remodeling diseases, offering micrometer-level resolution. Integrating these complementary modalities provides a holistic view of bone microstructures, facilitating function-oriented volume analysis across different disease cycles. However, regis… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  10. arXiv:2404.14747  [pdf, other

    cs.CV

    Differentiable Score-Based Likelihoods: Learning CT Motion Compensation From Clean Images

    Authors: Mareike Thies, Noah Maul, Siyuan Mei, Laura Pfaff, Nastassia Vysotskaya, Mingxuan Gu, Jonas Utz, Dennis Possart, Lukas Folle, Fabian Wagner, Andreas Maier

    Abstract: Motion artifacts can compromise the diagnostic value of computed tomography (CT) images. Motion correction approaches require a per-scan estimation of patient-specific motion patterns. In this work, we train a score-based model to act as a probability density estimator for clean head CT images. Given the trained model, we quantify the deviation of a given motion-affected CT image from the ideal di… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  11. arXiv:2404.08064  [pdf

    eess.AS cs.AI cs.CR cs.LG

    The Impact of Speech Anonymization on Pathology and Its Limits

    Authors: Soroosh Tayebi Arasteh, Tomas Arias-Vergara, Paula Andrea Perez-Toro, Tobias Weise, Kai Packhaeuser, Maria Schuster, Elmar Noeth, Andreas Maier, Seung Hee Yang

    Abstract: Integration of speech into healthcare has intensified privacy concerns due to its potential as a non-invasive biomarker containing individual biometric information. In response, speaker anonymization aims to conceal personally identifiable information while retaining crucial linguistic content. However, the application of anonymization techniques to pathological speech, a critical area where priva… ▽ More

    Submitted 22 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  12. arXiv:2404.06314  [pdf, ps, other

    quant-ph cs.LG cs.SE

    Qiskit-Torch-Module: Fast Prototy** of Quantum Neural Networks

    Authors: Nico Meyer, Christian Ufrecht, Maniraman Periyasamy, Axel Plinge, Christopher Mutschler, Daniel D. Scherer, Andreas Maier

    Abstract: Quantum computer simulation software is an integral tool for the research efforts in the quantum computing community. An important aspect is the efficiency of respective frameworks, especially for training variational quantum algorithms. Focusing on the widely used Qiskit software environment, we develop the qiskit-torch-module. It improves runtime performance by two orders of magnitude over compa… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. 7 pages, 4 figures, 3 tables

  13. arXiv:2404.03541  [pdf, other

    eess.IV cs.CV

    Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models

    Authors: Siyuan Mei, Fuxin Fan, Fabian Wagner, Mareike Thies, Mingxuan Gu, Yipeng Sun, Andreas Maier

    Abstract: Deep learning-based medical image processing algorithms require representative data during development. In particular, surgical data might be difficult to obtain, and high-quality public datasets are limited. To overcome this limitation and augment datasets, a widely adopted solution is the generation of synthetic images. In this work, we employ conditional diffusion models to generate knee radiog… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  14. arXiv:2403.18343  [pdf, other

    cs.LG

    The Artificial Neural Twin -- Process Optimization and Continual Learning in Distributed Process Chains

    Authors: Johannes Emmert, Ronald Mendez, Houman Mirzaalian Dastjerdi, Christopher Syben, Andreas Maier

    Abstract: Industrial process optimization and control is crucial to increase economic and ecologic efficiency. However, data sovereignty, differing goals, or the required expert knowledge for implementation impede holistic implementation. Further, the increasing use of data-driven AI-methods in process models and industrial sensory often requires regular fine-tuning to accommodate distribution drifts. We pr… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 20 pages, 11 figures

    ACM Class: I.2.11; J.2; F.2.2

  15. arXiv:2403.14440  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Analysing Diffusion Segmentation for Medical Images

    Authors: Mathias Öttl, Siyuan Mei, Frauke Wilm, Jana Steenpass, Matthias Rübner, Arndt Hartmann, Matthias Beckmann, Peter Fasching, Andreas Maier, Ramona Erber, Katharina Breininger

    Abstract: Denoising Diffusion Probabilistic models have become increasingly popular due to their ability to offer probabilistic modeling and generate diverse outputs. This versatility inspired their adaptation for image segmentation, where multiple predictions of the model can produce segmentation results that not only achieve high quality but also capture the uncertainty inherent in the model. Here, powerf… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  16. arXiv:2403.14429  [pdf, other

    cs.CV cs.AI cs.LG

    Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation

    Authors: Mathias Öttl, Frauke Wilm, Jana Steenpass, **gna Qiu, Matthias Rübner, Arndt Hartmann, Matthias Beckmann, Peter Fasching, Andreas Maier, Ramona Erber, Bernhard Kainz, Katharina Breininger

    Abstract: Deep learning-based image generation has seen significant advancements with diffusion models, notably improving the quality of generated images. Despite these developments, generating images with unseen characteristics beneficial for downstream tasks has received limited attention. To bridge this gap, we propose Style-Extracting Diffusion Models, featuring two conditioning mechanisms. Specifically… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  17. arXiv:2403.10695  [pdf, other

    eess.IV cs.CV

    EAGLE: An Edge-Aware Gradient Localization Enhanced Loss for CT Image Reconstruction

    Authors: Yipeng Sun, Yixing Huang, Linda-Sophie Schneider, Mareike Thies, Mingxuan Gu, Siyuan Mei, Siming Bayer, Andreas Maier

    Abstract: Computed Tomography (CT) image reconstruction is crucial for accurate diagnosis and deep learning approaches have demonstrated significant potential in improving reconstruction quality. However, the choice of loss function profoundly affects the reconstructed images. Traditional mean squared error loss often produces blurry images lacking fine details, while alternatives designed to improve may in… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Preprint

  18. arXiv:2403.03326  [pdf, other

    eess.IV cs.CV

    AnatoMix: Anatomy-aware Data Augmentation for Multi-organ Segmentation

    Authors: Chang Liu, Fuxin Fan, Annette Schwarz, Andreas Maier

    Abstract: Multi-organ segmentation in medical images is a widely researched task and can save much manual efforts of clinicians in daily routines. Automating the organ segmentation process using deep learning (DL) is a promising solution and state-of-the-art segmentation models are achieving promising accuracy. In this work, We proposed a novel data augmentation strategy for increasing the generalizibility… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  19. arXiv:2403.01993  [pdf, other

    cs.CV

    Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction

    Authors: Noah Maul, Annette Birkhold, Fabian Wagner, Mareike Thies, Maximilian Rohleder, Philipp Berg, Markus Kowarschik, Andreas Maier

    Abstract: Three-dimensional Digital Subtraction Angiography (3D-DSA) is a well-established X-ray-based technique for visualizing vascular anatomy. Recently, four-dimensional DSA (4D-DSA) reconstruction algorithms have been developed to enable the visualization of volumetric contrast flow dynamics through time-series of volumes. . This reconstruction problem is ill-posed mainly due to vessel overlap in the p… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  20. arXiv:2403.00426  [pdf, other

    cs.CV

    Deep Learning Computed Tomography based on the Defrise and Clack Algorithm

    Authors: Chengze Ye, Linda-Sophie Schneider, Yipeng Sun, Andreas Maier

    Abstract: This study presents a novel approach for reconstructing cone beam computed tomography (CBCT) for specific orbits using known operator learning. Unlike traditional methods, this technique employs a filtered backprojection type (FBP-type) algorithm, which integrates a unique, adaptive filtering process. This process involves a series of operations, including weightings, differentiations, the 2D Rado… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  21. Offline Writer Identification Using Convolutional Neural Network Activation Features

    Authors: Vincent Christlein, David Bernecker, Andreas Maier, Elli Angelopoulou

    Abstract: Convolutional neural networks (CNNs) have recently become the state-of-the-art tool for large-scale image classification. In this work we propose the use of activation features from CNNs as local descriptors for writer identification. A global descriptor is then formed by means of GMM supervector encoding, which is further improved by normalization with the KL-Kernel. We evaluate our method on two… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: fixed tab 1b

    Journal ref: Pattern Recognition. DAGM 2015. Lecture Notes in Computer Science(), vol 9358. Springer, Cham

  22. ARIN: Adaptive Resampling and Instance Normalization for Robust Blind Inpainting of Dunhuang Cave Paintings

    Authors: Alexander Schmidt, Prathmesh Madhu, Andreas Maier, Vincent Christlein, Ronak Kosti

    Abstract: Image enhancement algorithms are very useful for real world computer vision tasks where image resolution is often physically limited by the sensor size. While state-of-the-art deep neural networks show impressive results for image enhancement, they often struggle to enhance real-world images. In this work, we tackle a real-world setting: inpainting of images from Dunhuang caves. The Dunhuang datas… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Journal ref: 2022 Eleventh International Conference on Image Processing Theory, Tools and Applications (IPTA), Salzburg, Austria, 2022, pp. 1-6

  23. A Spatiotemporal Illumination Model for 3D Image Fusion in Optical Coherence Tomography

    Authors: Stefan Ploner, Jungeun Won, Julia Schottenhamml, Jessica Girgis, Kenneth Lam, Nadia Waheed, James Fujimoto, Andreas Maier

    Abstract: Optical coherence tomography (OCT) is a non-invasive, micrometer-scale imaging modality that has become a clinical standard in ophthalmology. By raster-scanning the retina, sequential cross-sectional image slices are acquired to generate volumetric data. In-vivo imaging suffers from discontinuities between slices that show up as motion and illumination artifacts. We present a new illumination mode… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Presented orally & as poster on 20th April 2023 at the IEEE International Symposium on Biomedical Imaging (ISBI) in Cartagena, Colombia. 6 pages, 3 figures. You can find the official version with broken equations and bad contrast figures under https://ieeexplore.ieee.org/document/10230526

  24. arXiv:2402.10223  [pdf, other

    cs.RO cs.CV math.OC

    Integer Optimization of CT Trajectories using a Discrete Data Completeness Formulation

    Authors: Linda-Sophie Schneider, Gabriel Herl, Andreas Maier

    Abstract: X-ray computed tomography (CT) plays a key role in digitizing three-dimensional structures for a wide range of medical and industrial applications. Traditional CT systems often rely on standard circular and helical scan trajectories, which may not be optimal for challenging scenarios involving large objects, complex structures, or resource constraints. In response to these challenges, we are explo… ▽ More

    Submitted 29 January, 2024; originally announced February 2024.

    Comments: Preprint

  25. arXiv:2402.07922  [pdf, other

    cs.HC cs.AI cs.DB

    Towards the Human Digital Twin: Definition and Design -- A survey

    Authors: Martin Wolfgang Lauer-Schmaltz, Philip Cash, John Paulin Hansen, Anja Maier

    Abstract: Human Digital Twins (HDTs) are a fast-emerging technology with significant potential in fields ranging from healthcare to sports. HDTs extend the traditional understanding of Digital Twins by representing humans as the underlying physical entity. This has introduced several significant challenges, including ambiguity in the definition of HDTs and a lack of guidance for their design. This survey br… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: This paper is an extension of the following paper: Lauer-Schmaltz MW, Cash P, Hansen JP, Maier A. Designing Human Digital Twins for Behaviour-Changing Therapy and Rehabilitation: A Systematic Review. Proceedings of the Design Society. 2022;2:1303-1312. doi:10.1017/pds.2022.132

  26. arXiv:2401.16104  [pdf, other

    cs.CV eess.IV

    A 2D Sinogram-Based Approach to Defect Localization in Computed Tomography

    Authors: Yuzhong Zhou, Linda-Sophie Schneider, Fuxin Fan, Andreas Maier

    Abstract: The rise of deep learning has introduced a transformative era in the field of image processing, particularly in the context of computed tomography. Deep learning has made a significant contribution to the field of industrial Computed Tomography. However, many defect detection algorithms are applied directly to the reconstructed domain, often disregarding the raw sensor data. This paper shifts the… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  27. arXiv:2401.16039  [pdf, other

    eess.IV cs.CV cs.LG

    Data-Driven Filter Design in FBP: Transforming CT Reconstruction with Trainable Fourier Series

    Authors: Yipeng Sun, Linda-Sophie Schneider, Fuxin Fan, Mareike Thies, Mingxuan Gu, Siyuan Mei, Yuzhong Zhou, Siming Bayer, Andreas Maier

    Abstract: In this study, we introduce a Fourier series-based trainable filter for computed tomography (CT) reconstruction within the filtered backprojection (FBP) framework. This method overcomes the limitation in noise reduction, inherent in conventional FBP methods, by optimizing Fourier series coefficients to construct the filter. This method enables robust performance across different resolution scales… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Preprint

  28. arXiv:2401.12725  [pdf, other

    eess.IV cs.CV

    Two-View Topogram-Based Anatomy-Guided CT Reconstruction for Prospective Risk Minimization

    Authors: Chang Liu, Laura Klein, Yixing Huang, Edith Baader, Michael Lell, Marc Kachelrieß, Andreas Maier

    Abstract: To facilitate a prospective estimation of CT effective dose and risk minimization process, a prospective spatial dose estimation and the known anatomical structures are expected. To this end, a CT reconstruction method is required to reconstruct CT volumes from as few projections as possible, i.e. by using the topograms, with anatomical structures as correct as possible. In this work, an optimized… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  29. arXiv:2401.09283  [pdf, other

    eess.IV cs.CV

    A gradient-based approach to fast and accurate head motion compensation in cone-beam CT

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Haijun Yu, Manuela Meier, Linda-Sophie Schneider, Mingxuan Gu, Siyuan Mei, Lukas Folle, Andreas Maier

    Abstract: Cone-beam computed tomography (CBCT) systems, with their portability, present a promising avenue for direct point-of-care medical imaging, particularly in critical scenarios such as acute stroke assessment. However, the integration of CBCT into clinical workflows faces challenges, primarily linked to long scan duration resulting in patient motion during scanning and leading to image quality degrad… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  30. arXiv:2401.03912  [pdf, other

    eess.IV cs.CV cs.LG

    Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification

    Authors: Adarsh Bhandary Panambur, Hui Yu, Sheethal Bhat, Prathmesh Madhu, Siming Bayer, Andreas Maier

    Abstract: The assessment of breast density is crucial in the context of breast cancer screening, especially in populations with a higher percentage of dense breast tissues. This study introduces a novel data augmentation technique termed Attention-Guided Erasing (AGE), devised to enhance the downstream classification of four distinct breast density categories in mammography following the BI-RADS recommendat… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  31. arXiv:2401.01364  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE

    Multi-Modal Cognitive Maps based on Neural Networks trained on Successor Representations

    Authors: Paul Stoewer, Achim Schilling, Andreas Maier, Patrick Krauss

    Abstract: Cognitive maps are a proposed concept on how the brain efficiently organizes memories and retrieves context out of them. The entorhinal-hippocampal complex is heavily involved in episodic and relational memory processing, as well as spatial navigation and is thought to built cognitive maps via place and grid cells. To make use of the promising properties of cognitive maps, we set up a multi-modal… ▽ More

    Submitted 22 December, 2023; originally announced January 2024.

  32. arXiv:2312.15825  [pdf, ps, other

    cs.CV cs.CE cs.LG

    Comparative Analysis of Radiomic Features and Gene Expression Profiles in Histopathology Data Using Graph Neural Networks

    Authors: Luis Carlos Rivera Monroy, Leonhard Rist, Martin Eberhardt, Christian Ostalecki, Andreas Bauer, Julio Vera, Katharina Breininger, Andreas Maier

    Abstract: This study leverages graph neural networks to integrate MELC data with Radiomic-extracted features for melanoma classification, focusing on cell-wise analysis. It assesses the effectiveness of gene expression profiles and Radiomic features, revealing that Radiomic features, particularly when combined with UMAP for dimensionality reduction, significantly enhance classification performance. Notably,… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: Paper accepted at the German Conference on Medical Image Computing 2024

  33. arXiv:2312.08255  [pdf, other

    eess.IV cs.CV cs.LG

    OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods

    Authors: Mikhail Kulyabin, Aleksei Zhdanov, Anastasia Nikiforova, Andrey Stepichev, Anna Kuznetsova, Mikhail Ronkin, Vasilii Borisov, Alexander Bogachev, Sergey Korotkich, Paul A Constable, Andreas Maier

    Abstract: Optical coherence tomography (OCT) is a non-invasive imaging technique with extensive clinical applications in ophthalmology. OCT enables the visualization of the retinal layers, playing a vital role in the early detection and monitoring of retinal diseases. OCT uses the principle of light wave interference to create detailed images of the retinal microstructures, making it a valuable tool for dia… ▽ More

    Submitted 31 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  34. SniffyArt: The Dataset of Smelling Persons

    Authors: Mathias Zinnen, Azhar Hussian, Hang Tran, Prathmesh Madhu, Andreas Maier, Vincent Christlein

    Abstract: Smell gestures play a crucial role in the investigation of past smells in the visual arts yet their automated recognition poses significant challenges. This paper introduces the SniffyArt dataset, consisting of 1941 individuals represented in 441 historical artworks. Each person is annotated with a tightly fitting bounding box, 17 pose keypoints, and a gesture label. By integrating these annotatio… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 8 figures

    Journal ref: Proceedings of the 5th Workshop on analySis, Understanding and proMotion of heritAge Contents. 2023. S. 49-58

  35. arXiv:2309.17192  [pdf, other

    cs.LG cs.CV

    A Survey of Incremental Transfer Learning: Combining Peer-to-Peer Federated Learning and Domain Incremental Learning for Multicenter Collaboration

    Authors: Yixing Huang, Christoph Bert, Ahmed Gomaa, Rainer Fietkau, Andreas Maier, Florian Putz

    Abstract: Due to data privacy constraints, data sharing among multiple clinical centers is restricted, which impedes the development of high performance deep learning models from multicenter collaboration. Naive weight transfer methods share intermediate model weights without raw data and hence can bypass data privacy restrictions. However, performance drops are typically observed when the model is transfer… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  36. arXiv:2307.01577  [pdf, other

    cs.AI q-bio.NC

    Conceptual Cognitive Maps Formation with Neural Successor Networks and Word Embeddings

    Authors: Paul Stoewer, Achim Schilling, Andreas Maier, Patrick Krauss

    Abstract: The human brain possesses the extraordinary capability to contextualize the information it receives from our environment. The entorhinal-hippocampal plays a critical role in this function, as it is deeply engaged in memory processing and constructing cognitive maps using place and grid cells. Comprehending and leveraging this ability could significantly augment the field of artificial intelligence… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  37. arXiv:2306.14596  [pdf, other

    eess.IV cs.CV

    Deep Learning for Cancer Prognosis Prediction Using Portrait Photos by StyleGAN Embedding

    Authors: Amr Hagag, Ahmed Gomaa, Dominik Kornek, Andreas Maier, Rainer Fietkau, Christoph Bert, Florian Putz, Yixing Huang

    Abstract: Survival prediction for cancer patients is critical for optimal treatment selection and patient management. Current patient survival prediction methods typically extract survival information from patients' clinical record data or biological and imaging data. In practice, experienced clinicians can have a preliminary assessment of patients' health status based on patients' observable physical appea… ▽ More

    Submitted 28 June, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

  38. A Vessel-Segmentation-Based CycleGAN for Unpaired Multi-modal Retinal Image Synthesis

    Authors: Aline Sindel, Andreas Maier, Vincent Christlein

    Abstract: Unpaired image-to-image translation of retinal images can efficiently increase the training dataset for deep-learning-based multi-modal retinal registration methods. Our method integrates a vessel segmentation network into the image-to-image translation task by extending the CycleGAN framework. The segmentation network is inserted prior to a UNet vision transformer generator network and serves as… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted to BVM 2023

    Journal ref: BVM 2023

  39. arXiv:2306.01752  [pdf, other

    eess.IV cs.CV cs.LG

    Handling Label Uncertainty on the Example of Automatic Detection of Shepherd's Crook RCA in Coronary CT Angiography

    Authors: Felix Denzinger, Michael Wels, Oliver Taubmann, Florian Kordon, Fabian Wagner, Stephanie Mehltretter, Mehmet A. Gülsün, Max Schöbinger, Florian André, Sebastian Buss, Johannes Görich, Michael Sühling, Andreas Maier

    Abstract: Coronary artery disease (CAD) is often treated minimally invasively with a catheter being inserted into the diseased coronary vessel. If a patient exhibits a Shepherd's Crook (SC) Right Coronary Artery (RCA) - an anatomical norm variant of the coronary vasculature - the complexity of this procedure is increased. Automated reporting of this variant from coronary CT angiography screening would ease… ▽ More

    Submitted 22 May, 2023; originally announced June 2023.

    Comments: Accepted at ISBI 2023

  40. PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images

    Authors: Kubilay Can Demir, Tobias Weise, Matthias May, Axel Schmid, Andreas Maier, Seung Hee Yang

    Abstract: Surgical phase recognition is a challenging and necessary task for the development of context-aware intelligent systems that can support medical personnel for better patient care and effective operating room management. In this paper, we present a surgical phase recognition framework that employs a Multi-Stage Temporal Convolution Network using speech and X-Ray images for the first time. We evalua… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 5 Pages, 3 figures, INTERSPEECH 2023

    MSC Class: 00b20

  41. Federated learning for secure development of AI models for Parkinson's disease detection using speech from different languages

    Authors: Soroosh Tayebi Arasteh, Cristian David Rios-Urrego, Elmar Noeth, Andreas Maier, Seung Hee Yang, Jan Rusz, Juan Rafael Orozco-Arroyave

    Abstract: Parkinson's disease (PD) is a neurological disorder impacting a person's speech. Among automatic PD assessment methods, deep learning models have gained particular interest. Recently, the community has explored cross-pathology and cross-language models which can improve diagnostic accuracy even further. However, strict patient data privacy regulations largely prevent institutions from sharing pati… ▽ More

    Submitted 21 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023, pp. 5003--5007, Dublin, Ireland

    Journal ref: INTERSPEECH 2023

  42. arXiv:2305.08227  [pdf, other

    eess.AS cs.CL cs.SD

    DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement

    Authors: Hendrik Schröter, Tobias Rosenkranz, Alberto N. Escalante-B., Andreas Maier

    Abstract: Multi-frame algorithms for single-channel speech enhancement are able to take advantage from short-time correlations within the speech signal. Deep Filtering (DF) was proposed to directly estimate a complex filter in frequency domain to take advantage of these correlations. In this work, we present a real-time speech enhancement demo using DeepFilterNet. DeepFilterNet's efficiency is enabled by ex… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted as show and tell demo to interspeech 2023

  43. arXiv:2305.07524  [pdf

    physics.med-ph cs.AI

    Joint MR sequence optimization beats pure neural network approaches for spin-echo MRI super-resolution

    Authors: Hoai Nam Dang, Vladimir Golkov, Thomas Wimmer, Daniel Cremers, Andreas Maier, Moritz Zaiss

    Abstract: Current MRI super-resolution (SR) methods only use existing contrasts acquired from typical clinical sequences as input for the neural network (NN). In turbo spin echo sequences (TSE) the sequence parameters can have a strong influence on the actual resolution of the acquired image and have consequently a considera-ble impact on the performance of the NN. We propose a known-operator learning appro… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 13 pages, 4 figures, 3 tables, submitted to MICCAI 2023 for review

  44. arXiv:2305.00446  [pdf, other

    cs.CL

    Building a Non-native Speech Corpus Featuring Chinese-English Bilingual Children: Compilation and Rationale

    Authors: Hiuchung Hung, Andreas Maier, Thorsten Piske

    Abstract: This paper introduces a non-native speech corpus consisting of narratives from fifty 5- to 6-year-old Chinese-English children. Transcripts totaling 6.5 hours of children taking a narrative comprehension test in English (L2) are presented, along with human-rated scores and annotations of grammatical and pronunciation errors. The children also completed the parallel MAIN tests in Chinese (L1) for r… ▽ More

    Submitted 7 January, 2024; v1 submitted 30 April, 2023; originally announced May 2023.

  45. arXiv:2304.11957  [pdf, other

    physics.med-ph cs.CL

    Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation Oncology

    Authors: Yixing Huang, Ahmed Gomaa, Sabine Semrau, Marlen Haderlein, Sebastian Lettmaier, Thomas Weissmann, Johanna Grigo, Hassen Ben Tkhayat, Benjamin Frey, Udo S. Gaipl, Luitpold V. Distel, Andreas Maier, Rainer Fietkau, Christoph Bert, Florian Putz

    Abstract: The potential of large language models in medicine for education and decision making purposes has been demonstrated as they achieve decent scores on medical exams such as the United States Medical Licensing Exam (USMLE) and the MedQA exam. In this work, we evaluate the performance of ChatGPT-4 in the specialized field of radiation oncology using the 38th American College of Radiology (ACR) radiati… ▽ More

    Submitted 21 August, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

  46. arXiv:2304.05864  [pdf, other

    cs.CV cs.LG

    Scale-Equivariant Deep Learning for 3D Data

    Authors: Thomas Wimmer, Vladimir Golkov, Hoai Nam Dang, Moritz Zaiss, Andreas Maier, Daniel Cremers

    Abstract: The ability of convolutional neural networks (CNNs) to recognize objects regardless of their position in the image is due to the translation-equivariance of the convolutional operation. Group-equivariant CNNs transfer this equivariance to other transformations of the input. Dealing appropriately with objects and object parts of different scale is challenging, and scale can vary for multiple reason… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: 12 pages, 4 figures

  47. arXiv:2303.14711  [pdf, other

    eess.IV cs.CV

    Unsupervised detection of small hyperreflective features in ultrahigh resolution optical coherence tomography

    Authors: Marcel Reimann, Jungeun Won, Hiroyuki Takahashi, Antonio Yaghy, Yunchan Hwang, Stefan Ploner, Junhong Lin, Jessica Girgis, Kenneth Lam, Siyu Chen, Nadia K. Waheed, Andreas Maier, James G. Fujimoto

    Abstract: Recent advances in optical coherence tomography such as the development of high speed ultrahigh resolution scanners and corresponding signal processing techniques may reveal new potential biomarkers in retinal diseases. Newly visible features are, for example, small hyperreflective specks in age-related macular degeneration. Identifying these new markers is crucial to investigate potential associa… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted as poster at BVM workshop 2023 (https://www.bvm-workshop.org/). The arXiv version provides full quality figures. 6 pages content (2 figures)

  48. arXiv:2303.11724  [pdf, other

    cs.CV cs.LG eess.IV

    Task-based Generation of Optimized Projection Sets using Differentiable Ranking

    Authors: Linda-Sophie Schneider, Mareike Thies, Christopher Syben, Richard Schielein, Mathias Unberath, Andreas Maier

    Abstract: We present a method for selecting valuable projections in computed tomography (CT) scans to enhance image reconstruction and diagnosis. The approach integrates two important factors, projection-based detectability and data completeness, into a single feed-forward neural network. The network evaluates the value of projections, processes them through a differentiable ranking function and makes the f… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  49. arXiv:2303.04923  [pdf, other

    cs.CV

    BOSS: Bones, Organs and Skin Shape Model

    Authors: Karthik Shetty, Annette Birkhold, Srikrishna Jaganathan, Norbert Strobel, Bernhard Egger, Markus Kowarschik, Andreas Maier

    Abstract: Objective: A digital twin of a patient can be a valuable tool for enhancing clinical tasks such as workflow automation, patient-specific X-ray dose optimization, markerless tracking, positioning, and navigation assistance in image-guided interventions. However, it is crucial that the patient's surface and internal organs are of high quality for any pose and shape estimates. At present, the majorit… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  50. arXiv:2303.00500  [pdf, other

    cs.CV cs.LG eess.IV

    Inherently Interpretable Multi-Label Classification Using Class-Specific Counterfactuals

    Authors: Susu Sun, Stefano Woerner, Andreas Maier, Lisa M. Koch, Christian F. Baumgartner

    Abstract: Interpretability is essential for machine learning algorithms in high-stakes application fields such as medical image analysis. However, high-performing black-box neural networks do not provide explanations for their predictions, which can lead to mistrust and suboptimal human-ML collaboration. Post-hoc explanation techniques, which are widely used in practice, have been shown to suffer from sever… ▽ More

    Submitted 8 August, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted to MIDL 2023