Search | arXiv e-print repository

SALT: Introducing a Framework for Hierarchical Segmentations in Medical Imaging using Softmax for Arbitrary Label Trees

Authors: Sven Koitka, Giulia Baldini, Cynthia S. Schmidt, Olivia B. Pollok, Obioma Pelka, Judith Kohnke, Katarzyna Borys, Christoph M. Friedrich, Benedikt M. Schaarschmidt, Michael Forsting, Lale Umutlu, Johannes Haubold, Felix Nensa, René Hosch

Abstract: Traditional segmentation networks approach anatomical structures as standalone elements, overlooking the intrinsic hierarchical connections among them. This study introduces Softmax for Arbitrary Label Trees (SALT), a novel approach designed to leverage the hierarchical relationships between labels, improving the efficiency and interpretability of the segmentations. This study introduces a novel… ▽ More Traditional segmentation networks approach anatomical structures as standalone elements, overlooking the intrinsic hierarchical connections among them. This study introduces Softmax for Arbitrary Label Trees (SALT), a novel approach designed to leverage the hierarchical relationships between labels, improving the efficiency and interpretability of the segmentations. This study introduces a novel segmentation technique for CT imaging, which leverages conditional probabilities to map the hierarchical structure of anatomical landmarks, such as the spine's division into lumbar, thoracic, and cervical regions and further into individual vertebrae. The model was developed using the SAROS dataset from The Cancer Imaging Archive (TCIA), comprising 900 body region segmentations from 883 patients. The dataset was further enhanced by generating additional segmentations with the TotalSegmentator, for a total of 113 labels. The model was trained on 600 scans, while validation and testing were conducted on 150 CT scans. Performance was assessed using the Dice score across various datasets, including SAROS, CT-ORG, FLARE22, LCTSC, LUNA16, and WORD. Among the evaluated datasets, SALT achieved its best results on the LUNA16 and SAROS datasets, with Dice scores of 0.93 and 0.929 respectively. The model demonstrated reliable accuracy across other datasets, scoring 0.891 on CT-ORG and 0.849 on FLARE22. The LCTSC dataset showed a score of 0.908 and the WORD dataset also showed good performance with a score of 0.844. SALT used the hierarchical structures inherent in the human body to achieve whole-body segmentations with an average of 35 seconds for 100 slices. This rapid processing underscores its potential for integration into clinical workflows, facilitating the automatic and efficient computation of full-body segmentations with each CT scan, thus enhancing diagnostic processes and patient care. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2406.05192 [pdf, other]

Revealing faint compact radio jets at redshifts above 5 with very long baseline interferometry

Authors: Máté Krezinger, Giovanni Baldini, Marcello Giroletti, Tullia Sbarrato, Gabriele Ghisellini, Gabriele Giovannini, Tao An, Krisztina É. Gabányi, Sándor Frey

Abstract: Over the past two decades, our knowledge of the high-redshift (z > 5) radio quasars has expanded, thanks to dedicated high-resolution very long baseline interferometry (VLBI) observations. Distant quasars provide unique information about the formation and evolution of the first galaxies and supermassive black holes in the Universe. Powerful relativistic jets are likely to have played an essential… ▽ More Over the past two decades, our knowledge of the high-redshift (z > 5) radio quasars has expanded, thanks to dedicated high-resolution very long baseline interferometry (VLBI) observations. Distant quasars provide unique information about the formation and evolution of the first galaxies and supermassive black holes in the Universe. Powerful relativistic jets are likely to have played an essential role in these processes. However, the sample of VLBI-observed radio quasars is still too small to allow meaningful statistical conclusions. We extend the list of the VLBI observed radio quasars to investigate how the source structure and physical parameters are related to radio loudness. We assembled a sample of 10 faint radio quasars located at 5 < z < 6 with their radio-loudness indices spanning between 0.9-76. We observed the selected targets with the European VLBI Network (EVN) at 1.7 GHz. The milliarcsecond-scale resolution of VLBI at this frequency allows us to probe the compact innermost parts of radio-emitting relativistic jets. In addition to the single-band VLBI observations, we collected single-dish and low-resolution radio interferometric data to investigate the spectral properties and variability of our sources. The detection rate of this high-redshift, low-flux-density sample is 90%, with only one target (J0306+1853) remaining undetected. The other 9 sources appear core-dominated and show a single, faint and compact radio core on this angular scale. The derived radio powers are typical of FRII radio galaxies and quasars. By extending our sample with other VLBI-detected z > 5 sources from the literature, we found that the core brightness temperatures and monochromatic radio powers tend to increase with radio loudness. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: Submitted to Astronomy & Astrophysics

arXiv:2404.05694 [pdf, other]

Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding

Authors: Ahmad Idrissi-Yaghir, Amin Dada, Henning Schäfer, Kamyar Arzideh, Giulia Baldini, Jan Trienes, Max Hasin, Jeanette Bewersdorff, Cynthia S. Schmidt, Marie Bauer, Kaleb E. Smith, Jiang Bian, Yonghui Wu, Jörg Schlötterer, Torsten Zesch, Peter A. Horn, Christin Seifert, Felix Nensa, Jens Kleesiek, Christoph M. Friedrich

Abstract: Recent advances in natural language processing (NLP) can be largely attributed to the advent of pre-trained language models such as BERT and RoBERTa. While these models demonstrate remarkable performance on general datasets, they can struggle in specialized domains such as medicine, where unique domain-specific terminologies, domain-specific abbreviations, and varying document structures are commo… ▽ More Recent advances in natural language processing (NLP) can be largely attributed to the advent of pre-trained language models such as BERT and RoBERTa. While these models demonstrate remarkable performance on general datasets, they can struggle in specialized domains such as medicine, where unique domain-specific terminologies, domain-specific abbreviations, and varying document structures are common. This paper explores strategies for adapting these models to domain-specific requirements, primarily through continuous pre-training on domain-specific data. We pre-trained several German medical language models on 2.4B tokens derived from translated public English medical data and 3B tokens of German clinical data. The resulting models were evaluated on various German downstream tasks, including named entity recognition (NER), multi-label classification, and extractive question answering. Our results suggest that models augmented by clinical and translation-based pre-training typically outperform general domain models in medical contexts. We conclude that continuous pre-training has demonstrated the ability to match or even exceed the performance of clinical models trained from scratch. Furthermore, pre-training on clinical data or leveraging translated texts have proven to be reliable methods for domain adaptation in medical NLP tasks. △ Less

Submitted 8 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

Comments: Accepted at LREC-COLING 2024

arXiv:2312.05176 [pdf, other]

MRI Scan Synthesis Methods based on Clustering and Pix2Pix

Authors: Giulia Baldini, Melanie Schmidt, Charlotte Zäske, Liliana L. Caldeira

Abstract: We consider a missing data problem in the context of automatic segmentation methods for Magnetic Resonance Imaging (MRI) brain scans. Usually, automated MRI scan segmentation is based on multiple scans (e.g., T1-weighted, T2-weighted, T1CE, FLAIR). However, quite often a scan is blurry, missing or otherwise unusable. We investigate the question whether a missing scan can be synthesized. We exempli… ▽ More We consider a missing data problem in the context of automatic segmentation methods for Magnetic Resonance Imaging (MRI) brain scans. Usually, automated MRI scan segmentation is based on multiple scans (e.g., T1-weighted, T2-weighted, T1CE, FLAIR). However, quite often a scan is blurry, missing or otherwise unusable. We investigate the question whether a missing scan can be synthesized. We exemplify that this is in principle possible by synthesizing a T2-weighted scan from a given T1-weighted scan. Our first aim is to compute a picture that resembles the missing scan closely, measured by average mean squared error (MSE). We develop/use several methods for this, including a random baseline approach, a clustering-based method and pixel-to-pixel translation method by Isola et al. (Pix2Pix) which is based on conditional GANs. The lowest MSE is achieved by our clustering-based method. Our second aim is to compare the methods with respect to the effect that using the synthesized scan has on the segmentation process. For this, we use a DeepMedic model trained with the four input scan modalities named above. We replace the T2-weighted scan by the synthesized picture and evaluate the segmentations with respect to the tumor identification, using Dice scores as numerical evaluation. The evaluation shows that the segmentation works well with synthesized scans (in particular, with Pix2Pix methods) in many cases. △ Less

Submitted 3 May, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

Comments: Accepted at AIME 2024

arXiv:2306.15350 [pdf, other]

CellViT: Vision Transformers for Precise Cell Segmentation and Classification

Authors: Fabian Hörst, Moritz Rempe, Lukas Heine, Constantin Seibold, Julius Keyl, Giulia Baldini, Selma Ugurel, Jens Siveke, Barbara Grünwald, Jan Egger, Jens Kleesiek

Abstract: Nuclei detection and segmentation in hematoxylin and eosin-stained (H&E) tissue images are important clinical tasks and crucial for a wide range of applications. However, it is a challenging task due to nuclei variances in staining and size, overlap** boundaries, and nuclei clustering. While convolutional neural networks have been extensively used for this task, we explore the potential of Trans… ▽ More Nuclei detection and segmentation in hematoxylin and eosin-stained (H&E) tissue images are important clinical tasks and crucial for a wide range of applications. However, it is a challenging task due to nuclei variances in staining and size, overlap** boundaries, and nuclei clustering. While convolutional neural networks have been extensively used for this task, we explore the potential of Transformer-based networks in this domain. Therefore, we introduce a new method for automated instance segmentation of cell nuclei in digitized tissue samples using a deep learning architecture based on Vision Transformer called CellViT. CellViT is trained and evaluated on the PanNuke dataset, which is one of the most challenging nuclei instance segmentation datasets, consisting of nearly 200,000 annotated Nuclei into 5 clinically important classes in 19 tissue types. We demonstrate the superiority of large-scale in-domain and out-of-domain pre-trained Vision Transformers by leveraging the recently published Segment Anything Model and a ViT-encoder pre-trained on 104 million histological image patches - achieving state-of-the-art nuclei detection and instance segmentation performance on the PanNuke dataset with a mean panoptic quality of 0.50 and an F1-detection score of 0.83. The code is publicly available at https://github.com/TIO-IKIM/CellViT △ Less

Submitted 6 October, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

Comments: 18 pages, 5 figures, appendix included

arXiv:2108.00974 [pdf, other]

Evaluating Federated Learning for Intrusion Detection in Internet of Things: Review and Challenges

Authors: Enrique Mármol Campos, Pablo Fernández Saura, Aurora González-Vidal, José L. Hernández-Ramos, Jorge Bernal Bernabe, Gianmarco Baldini, Antonio Skarmeta

Abstract: The application of Machine Learning (ML) techniques to the well-known intrusion detection systems (IDS) is key to cope with increasingly sophisticated cybersecurity attacks through an effective and efficient detection process. In the context of the Internet of Things (IoT), most ML-enabled IDS approaches use centralized approaches where IoT devices share their data with data centers for further an… ▽ More The application of Machine Learning (ML) techniques to the well-known intrusion detection systems (IDS) is key to cope with increasingly sophisticated cybersecurity attacks through an effective and efficient detection process. In the context of the Internet of Things (IoT), most ML-enabled IDS approaches use centralized approaches where IoT devices share their data with data centers for further analysis. To mitigate privacy concerns associated with centralized approaches, in recent years the use of Federated Learning (FL) has attracted a significant interest in different sectors, including healthcare and transport systems. However, the development of FL-enabled IDS for IoT is in its infancy, and still requires research efforts from various areas, in order to identify the main challenges for the deployment in real-world scenarios. In this direction, our work evaluates a FL-enabled IDS approach based on a multiclass classifier considering different data distributions for the detection of different attacks in an IoT scenario. In particular, we use three different settings that are obtained by partitioning the recent ToN\_IoT dataset according to IoT devices' IP address and types of attack. Furthermore, we evaluate the impact of different aggregation functions according to such setting by using the recent IBMFL framework as FL implementation. Additionally, we identify a set of challenges and future directions based on the existing literature and the analysis of our evaluation results. △ Less

Submitted 2 August, 2021; originally announced August 2021.

Comments: 41 pages, 11 figures, 4 tables

arXiv:1909.07039 [pdf, other]

Toward a Blockchain-based Platform to Manage Cybersecurity Certification of IoT devices

Authors: Ricardo Neisse, José L. Hernández-Ramos, Sara N. Matheu, Gianmarco Baldini, Antonio Skarmeta

Abstract: The goal of this paper is to propose a blockchain-based platform to enhance transparency and traceability of cybersecurity certification information motivated by the recently adopted EU Cybersecurity Act. The proposed platform is generic and intended to support the trusted exchange of cybersecurity certification information for any electronic product, service, or process. However, for the purposes… ▽ More The goal of this paper is to propose a blockchain-based platform to enhance transparency and traceability of cybersecurity certification information motivated by the recently adopted EU Cybersecurity Act. The proposed platform is generic and intended to support the trusted exchange of cybersecurity certification information for any electronic product, service, or process. However, for the purposes of this paper, we focus on the case study of the cybersecurity certification of IoT devices, which are explicitly referenced in the recently adopted Cybersecurity Act as one of the main domains where it is highlighted the need for an increased level of trust. △ Less

Submitted 16 September, 2019; originally announced September 2019.

Comments: 8 pages

arXiv:1701.07676 [pdf, other]

Mobile phone identification through the built-in magnetometers

Authors: Gianmarco Baldini, Gary Steri, Raimondo Giuliani, Vladimir Kyovtorov

Abstract: Mobile phones identification through their built in components has been demonstrated in literature for various types of sensors including the camera, microphones and accelerometers. The identification is performed by the exploitation of the small but significant differences in the electronic circuits generated during the production process. Thus, these differences become an intrinsic property of t… ▽ More Mobile phones identification through their built in components has been demonstrated in literature for various types of sensors including the camera, microphones and accelerometers. The identification is performed by the exploitation of the small but significant differences in the electronic circuits generated during the production process. Thus, these differences become an intrinsic property of the electronic components, which can be detected and become an unique fingerprint of the component and of the mobile phone. In this paper, we investigate the identification of mobile phones through their builtin magnetometers, which has not been reported in literature yet. Magnetometers are stimulated with different waveforms using a solenoid connected to a computer s audio board. The identification is performed analyzing the digital output of the magnetometer through the use of statistical features and the Support Vector Machine (SVM) machine learning algorithm. We prove that this technique can distinguish different models and brands with very high accuracy but it can only distinguish phones of the same model with limited accuracy. △ Less

Submitted 26 January, 2017; originally announced January 2017.

arXiv:1508.05932 [pdf]

A novel variable-distance antenna test range and high spatial resolution corroboration of the inverse square law for 433.5 MHz radiation

Authors: Christoph de Haën, Giancarlo Baldini, Matthias Erhardt

Abstract: A novel, low-budget, open-air, slant-geometry antenna test range for UHF radiation is presented. It was designed primarily to facilitate variation of the distance between emitter and receiver antennas, but has also the potential for adaptation to simultaneous variation of distance and receiver antenna orientation. In support of the validity of the range the inverse square law for 433.5 MHz radiati… ▽ More A novel, low-budget, open-air, slant-geometry antenna test range for UHF radiation is presented. It was designed primarily to facilitate variation of the distance between emitter and receiver antennas, but has also the potential for adaptation to simultaneous variation of distance and receiver antenna orientation. In support of the validity of the range the inverse square law for 433.5 MHz radiation between two naked half-wave dipole antennas was tested with high spatial resolution from close to the far field limit outward to 46 wavelengths. The ratio of sine-amplitude input voltages to the receiver antenna at two distances between the antennas diminished in proportion to the corresponding inverse distance ratio to the power 0.9970 +/- 0.0051 (R^2 = 0.992). This value is indistinguishable from the theoretical value of 1 and confirms the proportionality of the electric field strength to the inverse distance from the radiation source. Given the known proportionality of irradiance to the square of the electric field strength, the result corroborates the inverse square law for irradiance at the lowest frequency for which thus far data have been published. △ Less

Submitted 28 October, 2015; v1 submitted 23 August, 2015; originally announced August 2015.

Comments: Compliance with SI terminology. Language improvements. Results unchanged

Showing 1–9 of 9 results for author: Baldini, G