Search | arXiv e-print repository

Multi-Modal Dataset Creation for Federated~Learning with DICOM Structured Reports

Authors: Malte Tölle, Lukas Burger, Halvar Kelm, Florian André, Peter Bannas, Gerhard Diller, Norbert Frey, Philipp Garthe, Stefan Groß, Anja Hennemuth, Lars Kaderali, Nina Krüger, Andreas Leha, Simon Martin, Alexander Meyer, Eike Nagel, Stefan Orwat, Clemens Scherer, Moritz Seiffert, Jan Moritz Seliger, Stefan Simm, Tim Friede, Tim Seidler, Sandy Engelhardt

Abstract: Purpose: Federated training is often hindered by heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in the emerging multi-modal learning paradigms, where dataset harmonization including a uniform data representation and filtering options are of paramount importance.… ▽ More Purpose: Federated training is often hindered by heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in the emerging multi-modal learning paradigms, where dataset harmonization including a uniform data representation and filtering options are of paramount importance. Methods: DICOM structured reports enable the standardized linkage of arbitrary information beyond the imaging domain and can be used within Python deep learning pipelines with highdicom. Building on this, we developed an open platform for data integration and interactive filtering capabilities that simplifies the process of assembling multi-modal datasets. Results: In this study, we extend our prior work by showing its applicability to more and divergent data types, as well as streamlining datasets for federated training within an established consortium of eight university hospitals in Germany. We prove its concurrent filtering ability by creating harmonized multi-modal datasets across all locations for predicting the outcome after minimally invasive heart valve replacement. The data includes DICOM data (i.e. computed tomography images, electrocardiography scans) as well as annotations (i.e. calcification segmentations, pointsets and pacemaker dependency), and metadata (i.e. prosthesis and diagnoses). Conclusion: Structured reports bridge the traditional gap between imaging systems and information systems. Utilizing the inherent DICOM reference system arbitrary data types can be queried concurrently to create meaningful cohorts for clinical studies. The graphical interface as well as example structured report templates will be made publicly available. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.07557 [pdf, other]

Federated Foundation Model for Cardiac CT Imaging

Authors: Malte Tölle, Philipp Garthe, Clemens Scherer, Jan Moritz Seliger, Andreas Leha, Nina Krüger, Stefan Simm, Simon Martin, Sebastian Eble, Halvar Kelm, Moritz Bednorz, Florian André, Peter Bannas, Gerhard Diller, Norbert Frey, Stefan Groß, Anja Hennemuth, Lars Kaderali, Alexander Meyer, Eike Nagel, Stefan Orwat, Moritz Seiffert, Tim Friede, Tim Seidler, Sandy Engelhardt

Abstract: Federated learning (FL) is a renowned technique for utilizing decentralized data while preserving privacy. However, real-world applications often involve inherent challenges such as partially labeled datasets, where not all clients possess expert annotations of all labels of interest, leaving large portions of unlabeled data unused. In this study, we conduct the largest federated cardiac CT imagin… ▽ More Federated learning (FL) is a renowned technique for utilizing decentralized data while preserving privacy. However, real-world applications often involve inherent challenges such as partially labeled datasets, where not all clients possess expert annotations of all labels of interest, leaving large portions of unlabeled data unused. In this study, we conduct the largest federated cardiac CT imaging analysis to date, focusing on partially labeled datasets ($n=8,124$) of Transcatheter Aortic Valve Implantation (TAVI) patients over eight hospital clients. Transformer architectures, which are the major building blocks of current foundation models, have shown superior performance when trained on larger cohorts than traditional CNNs. However, when trained on small task-specific labeled sample sizes, it is currently not feasible to exploit their underlying attention mechanism for improved performance. Therefore, we developed a two-stage semi-supervised learning strategy that distills knowledge from several task-specific CNNs (landmark detection and segmentation of calcification) into a single transformer model by utilizing large amounts of unlabeled data typically residing unused in hospitals to mitigate these issues. This method not only improves the predictive accuracy and generalizability of transformer-based architectures but also facilitates the simultaneous learning of all partial labels within a single transformer model across the federation. Additionally, we show that our transformer-based model extracts more meaningful features for further downstream tasks than the UNet-based one by only training the last layer to also solve segmentation of coronary arteries. We make the code and weights of the final model openly available, which can serve as a foundation model for further research in cardiac CT imaging. △ Less

Submitted 10 July, 2024; originally announced July 2024.

arXiv:2311.17755 [pdf]

doi 10.1007/s10439-023-03353-4

Experimental and Theoretical Brownian Dynamics Analysis of Ion Transport During Cellular Electroporation of E. coli Bacteria

Authors: Juan González-Cuevas, Ricardo Argüello, Marcos Florentin, Franck M. André, Lluis Mir

Abstract: Escherichia coli bacterium is a rod-shaped organism composed of a complex double membrane structure. Knowledge of electric field driven ion transport through both membranes and the evolution of their induced permeabilization has important applications in biomedical engineering, delivery of genes and antibacterial agents. However, few studies have been conducted on Gram-negative bacteria in this re… ▽ More Escherichia coli bacterium is a rod-shaped organism composed of a complex double membrane structure. Knowledge of electric field driven ion transport through both membranes and the evolution of their induced permeabilization has important applications in biomedical engineering, delivery of genes and antibacterial agents. However, few studies have been conducted on Gram-negative bacteria in this regard considering the contribution of all ion types. To address this gap in knowledge, we have developed a deterministic and stochastic Brownian dynamics model to simulate in 3D space the motion of ions through pores formed in the plasma membranes of E. coli cells during electroporation. The diffusion coefficient, mobility, and translation time of Ca$^{2+}$, Mg$^{2+}$, Na$^+$, K$^+$, and Cl$^-$ ions within the pore region are estimated from the numerical model. Calculations of pore's conductance have been validated with experiments conducted at Gustave Roussy. From the simulations, it was found that the main driving force of ionic uptake during the pulse is the one due to the externally applied electric field. The results from this work provide a better understanding of ion transport during electroporation, aiding in the design of electrical pulses for maximizing ion throughput, primarily for application in cancer treatment. △ Less

Submitted 29 November, 2023; originally announced November 2023.

Comments: Annals of Biomedical Engineering, 2023

arXiv:2310.14472 [pdf, ps, other]

The dressing field method for diffeomorphisms: a relational framework

Authors: Jordan T. Francois Andre

Abstract: The dressing field method is a tool to reduce gauge symmetries. Here we extend it to cover the case of diffeomorphisms. The resulting framework is a systematic scheme to produce Diff(M)-invariant objects, which has a natural relational interpretation. Its precise formulation relies on a clear understanding of the bundle geometry of field space. By detailing it, among other things we stress the g… ▽ More The dressing field method is a tool to reduce gauge symmetries. Here we extend it to cover the case of diffeomorphisms. The resulting framework is a systematic scheme to produce Diff(M)-invariant objects, which has a natural relational interpretation. Its precise formulation relies on a clear understanding of the bundle geometry of field space. By detailing it, among other things we stress the geometric nature of field-independent and field-dependent diffeomorphisms, and highlight that the heuristic "extended bracket" for field-dependent vector fields often featuring in the covariant phase space literature can be understood as arising from the Frölicher-Nijenhuis bracket. Furthermore, by articulating this bundle geometry with the covariant phase space approach, we give a streamlined account of the elementary objects of the (pre)symplectic structure of a Diff(M)-theory: Noether charges and their bracket, as induced by the standard prescription for the presymplectic potential and 2-form. We give conceptually transparent expressions allowing to read the integrability conditions and the circumstances under which the bracket of charge is Lie, and the resulting Poisson algebras of charges are central extensions of the Lie algebras of field-independent ($\mathfrak{diff}(M)$) and field-dependent vector fields. We show that, applying the dressing field method, one obtains a Diff(M)-invariant and manifestly relational formulation of a general relativistic field theory. Relying on results just mentioned, we easily derive the "dressed" (relational) presymplectic structure of the theory. This reproduces or extends results from the gravitational edge mode and gravitational dressing literature. In addition to simplified technical derivations, the conceptual clarity of the framework supplies several insights and allows us to dispel misconceptions. △ Less

Submitted 25 February, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

Comments: 62 pages. Update: short descriptions of the action Lie groupoid and algebroid of field space are added

arXiv:2310.10282 [pdf, other]

Helical coil design with controlled dispersion for bunching enhancement of the TNSA protons

Authors: A Hirsch-Passicos, C L C Lacoste, F André, Y Elskens, E d'Humières, V Tikhonchuk, M Bardon

Abstract: The quality of the proton beam produced by Target Normal Sheath Acceleration (TNSA) with high power lasers can be significantly improved with the use of helical coils. While they showed promising results in terms of focusing, their performances in terms of the of cutoff energy and bunching stay limited due to the dispersive nature of helical coils. A new scheme of helical coil with a tube surround… ▽ More The quality of the proton beam produced by Target Normal Sheath Acceleration (TNSA) with high power lasers can be significantly improved with the use of helical coils. While they showed promising results in terms of focusing, their performances in terms of the of cutoff energy and bunching stay limited due to the dispersive nature of helical coils. A new scheme of helical coil with a tube surrounding the helix is introduced, and the first numerical simulations and an analytical model show a possibility of a drastic reduction of the current pulse dispersion for the parameters of high power laser facilities. The helical coils with tube strongly increase bunching, creating two collimated narrow-band proton beams from a broad and divergent TNSA distribution. The analytical model provides scaling of proton parameters as a function of laser facility features. △ Less

Submitted 16 October, 2023; originally announced October 2023.

arXiv:2309.05528 [pdf, other]

On the detection of Out-Of-Distribution samples in Multiple Instance Learning

Authors: Loïc Le Bescond, Maria Vakalopoulou, Stergios Christodoulidis, Fabrice André, Hugues Talbot

Abstract: The deployment of machine learning solutions in real-world scenarios often involves addressing the challenge of out-of-distribution (OOD) detection. While significant efforts have been devoted to OOD detection in classical supervised settings, the context of weakly supervised learning, particularly the Multiple Instance Learning (MIL) framework, remains under-explored. In this study, we tackle thi… ▽ More The deployment of machine learning solutions in real-world scenarios often involves addressing the challenge of out-of-distribution (OOD) detection. While significant efforts have been devoted to OOD detection in classical supervised settings, the context of weakly supervised learning, particularly the Multiple Instance Learning (MIL) framework, remains under-explored. In this study, we tackle this challenge by adapting post-hoc OOD detection methods to the MIL setting while introducing a novel benchmark specifically designed to assess OOD detection performance in weakly supervised scenarios. Across extensive experiments based on diverse public datasets, KNN emerges as the best-performing method overall. However, it exhibits significant shortcomings on some datasets, emphasizing the complexity of this under-explored and challenging topic. Our findings shed light on the complex nature of OOD detection under the MIL framework, emphasizing the importance of develo** novel, robust, and reliable methods that can generalize effectively in a weakly supervised context. The code for the paper is available here: https://github.com/loic-lb/OOD_MIL. △ Less

Submitted 9 November, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

arXiv:2306.01752 [pdf, other]

Handling Label Uncertainty on the Example of Automatic Detection of Shepherd's Crook RCA in Coronary CT Angiography

Authors: Felix Denzinger, Michael Wels, Oliver Taubmann, Florian Kordon, Fabian Wagner, Stephanie Mehltretter, Mehmet A. Gülsün, Max Schöbinger, Florian André, Sebastian Buss, Johannes Görich, Michael Sühling, Andreas Maier

Abstract: Coronary artery disease (CAD) is often treated minimally invasively with a catheter being inserted into the diseased coronary vessel. If a patient exhibits a Shepherd's Crook (SC) Right Coronary Artery (RCA) - an anatomical norm variant of the coronary vasculature - the complexity of this procedure is increased. Automated reporting of this variant from coronary CT angiography screening would ease… ▽ More Coronary artery disease (CAD) is often treated minimally invasively with a catheter being inserted into the diseased coronary vessel. If a patient exhibits a Shepherd's Crook (SC) Right Coronary Artery (RCA) - an anatomical norm variant of the coronary vasculature - the complexity of this procedure is increased. Automated reporting of this variant from coronary CT angiography screening would ease prior risk assessment. We propose a 1D convolutional neural network which leverages a sequence of residual dilated convolutions to automatically determine this norm variant from a prior extracted vessel centerline. As the SC RCA is not clearly defined with respect to concrete measurements, labeling also includes qualitative aspects. Therefore, 4.23% samples in our dataset of 519 RCA centerlines were labeled as unsure SC RCAs, with 5.97% being labeled as sure SC RCAs. We explore measures to handle this label uncertainty, namely global/model-wise random assignment, exclusion, and soft label assignment. Furthermore, we evaluate how this uncertainty can be leveraged for the determination of a rejection class. With our best configuration, we reach an area under the receiver operating characteristic curve (AUC) of 0.938 on confident labels. Moreover, we observe an increase of up to 0.020 AUC when rejecting 10% of the data and leveraging the labeling uncertainty information in the exclusion process. △ Less

Submitted 22 May, 2023; originally announced June 2023.

Comments: Accepted at ISBI 2023

arXiv:2208.05745 [pdf, other]

doi 10.1093/bib/bbad260

A biology-driven deep generative model for cell-type annotation in cytometry

Authors: Quentin Blampey, Nadège Bercovici, Charles-Antoine Dutertre, Isabelle Pic, Fabrice André, Joana Mourato Ribeiro, Paul-Henry Cournède

Abstract: Cytometry enables precise single-cell phenoty** within heterogeneous populations. These cell types are traditionally annotated via manual gating, but this method suffers from a lack of reproducibility and sensitivity to batch-effect. Also, the most recent cytometers - spectral flow or mass cytometers - create rich and high-dimensional data whose analysis via manual gating becomes challenging and… ▽ More Cytometry enables precise single-cell phenoty** within heterogeneous populations. These cell types are traditionally annotated via manual gating, but this method suffers from a lack of reproducibility and sensitivity to batch-effect. Also, the most recent cytometers - spectral flow or mass cytometers - create rich and high-dimensional data whose analysis via manual gating becomes challenging and time-consuming. To tackle these limitations, we introduce Scyan (https://github.com/MICS-Lab/scyan), a Single-cell Cytometry Annotation Network that automatically annotates cell types using only prior expert knowledge about the cytometry panel. We demonstrate that Scyan significantly outperforms the related state-of-the-art models on multiple public datasets while being faster and interpretable. In addition, Scyan overcomes several complementary tasks such as batch-effect removal, debarcoding, and population discovery. Overall, this model accelerates and eases cell population characterisation, quantification, and discovery in cytometry. △ Less

Submitted 21 April, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

arXiv:2207.14625 [pdf, other]

Content-Aware Differential Privacy with Conditional Invertible Neural Networks

Authors: Malte Tölle, Ullrich Köthe, Florian André, Benjamin Meder, Sandy Engelhardt

Abstract: Differential privacy (DP) has arisen as the gold standard in protecting an individual's privacy in datasets by adding calibrated noise to each data sample. While the application to categorical data is straightforward, its usability in the context of images has been limited. Contrary to categorical data the meaning of an image is inherent in the spatial correlation of neighboring pixels making the… ▽ More Differential privacy (DP) has arisen as the gold standard in protecting an individual's privacy in datasets by adding calibrated noise to each data sample. While the application to categorical data is straightforward, its usability in the context of images has been limited. Contrary to categorical data the meaning of an image is inherent in the spatial correlation of neighboring pixels making the simple application of noise infeasible. Invertible Neural Networks (INN) have shown excellent generative performance while still providing the ability to quantify the exact likelihood. Their principle is based on transforming a complicated distribution into a simple one e.g. an image into a spherical Gaussian. We hypothesize that adding noise to the latent space of an INN can enable differentially private image modification. Manipulation of the latent space leads to a modified image while preserving important details. Further, by conditioning the INN on meta-data provided with the dataset we aim at leaving dimensions important for downstream tasks like classification untouched while altering other parts that potentially contain identifying information. We term our method content-aware differential privacy (CADP). We conduct experiments on publicly available benchmarking datasets as well as dedicated medical ones. In addition, we show the generalizability of our method to categorical data. The source code is publicly available at https://github.com/Cardio-AI/CADP. △ Less

Submitted 29 July, 2022; originally announced July 2022.

Comments: Accepted at 3rd DeCaF Workshop (MICCAI22)

MSC Class: J.3 I.4.0 J.3 I.2.6

arXiv:2202.03671 [pdf, other]

CAD-RADS Scoring using Deep Learning and Task-Specific Centerline Labeling

Authors: Felix Denzinger, Michael Wels, Oliver Taubmann, Mehmet A. Gülsün, Max Schöbinger, Florian André, Sebastian J. Buss, Johannes Görich, Michael Sühling, Andreas Maier, Katharina Breininger

Abstract: With coronary artery disease (CAD) persisting to be one of the leading causes of death worldwide, interest in supporting physicians with algorithms to speed up and improve diagnosis is high. In clinical practice, the severeness of CAD is often assessed with a coronary CT angiography (CCTA) scan and manually graded with the CAD-Reporting and Data System (CAD-RADS) score. The clinical questions this… ▽ More With coronary artery disease (CAD) persisting to be one of the leading causes of death worldwide, interest in supporting physicians with algorithms to speed up and improve diagnosis is high. In clinical practice, the severeness of CAD is often assessed with a coronary CT angiography (CCTA) scan and manually graded with the CAD-Reporting and Data System (CAD-RADS) score. The clinical questions this score assesses are whether patients have CAD or not (rule-out) and whether they have severe CAD or not (hold-out). In this work, we reach new state-of-the-art performance for automatic CAD-RADS scoring. We propose using severity-based label encoding, test time augmentation (TTA) and model ensembling for a task-specific deep learning architecture. Furthermore, we introduce a novel task- and model-specific, heuristic coronary segment labeling, which subdivides coronary trees into consistent parts across patients. It is fast, robust, and easy to implement. We were able to raise the previously reported area under the receiver operating characteristic curve (AUC) from 0.914 to 0.942 in the rule-out and from 0.921 to 0.950 in the hold-out task respectively. △ Less

Submitted 8 February, 2022; originally announced February 2022.

Comments: Under review MIDL 2020

arXiv:2201.10410 [pdf, other]

Comparison of Evaluation Metrics for Landmark Detection in CMR Images

Authors: Sven Koehler, Lalith Sharan, Julian Kuhm, Arman Ghanaat, Jelizaveta Gordejeva, Nike K. Simon, Niko M. Grell, Florian André, Sandy Engelhardt

Abstract: Cardiac Magnetic Resonance (CMR) images are widely used for cardiac diagnosis and ventricular assessment. Extracting specific landmarks like the right ventricular insertion points is of importance for spatial alignment and 3D modeling. The automatic detection of such landmarks has been tackled by multiple groups using Deep Learning, but relatively little attention has been paid to the failure case… ▽ More Cardiac Magnetic Resonance (CMR) images are widely used for cardiac diagnosis and ventricular assessment. Extracting specific landmarks like the right ventricular insertion points is of importance for spatial alignment and 3D modeling. The automatic detection of such landmarks has been tackled by multiple groups using Deep Learning, but relatively little attention has been paid to the failure cases of evaluation metrics in this field. In this work, we extended the public ACDC dataset with additional labels of the right ventricular insertion points and compare different variants of a heatmap-based landmark detection pipeline. In this comparison, we demonstrate very likely pitfalls of apparently simple detection and localisation metrics which highlights the importance of a clear detection strategy and the definition of an upper limit for localisation-based metrics. Our preliminary results indicate that a combination of different metrics is necessary, as they yield different winners for method comparison. Additionally, they highlight the need of a comprehensive metric description and evaluation standardisation, especially for the error cases where no metrics could be computed or where no lower/upper boundary of a metric exists. Code and labels: https://github.com/Cardio-AI/rvip_landmark_detection △ Less

Submitted 28 January, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

Comments: Accepted at Bildverarbeitung für die Medizin (BVM), Informatik aktuell. Springer Vieweg, Wiesbaden 2022

arXiv:2109.06937 [pdf, other]

doi 10.1063/5.0059349

Time simulation of the nonlinear wave-particle interaction in meters long traveling-wave tubes

Authors: Damien F. G. Minenna, Khalil Aliane, Yves Elskens, Alexandre Poyé, Frédéric André, Jérôme Puech, Fabrice Doveil

Abstract: We propose a multi-particle self-consistent Hamiltonian (derived from an N-body description) that is applicable for periodic structures such as traveling-wave tubes (TWTs), gyrotrons, free-electron lasers, or particle accelerators. We build a 1D symplectic multi-particle algorithm to simulate the nonlinear wave-particle interaction in the time domain occurring in an experimental 3-meters long heli… ▽ More We propose a multi-particle self-consistent Hamiltonian (derived from an N-body description) that is applicable for periodic structures such as traveling-wave tubes (TWTs), gyrotrons, free-electron lasers, or particle accelerators. We build a 1D symplectic multi-particle algorithm to simulate the nonlinear wave-particle interaction in the time domain occurring in an experimental 3-meters long helix TWT. Our algorithm is efficient thanks to a drastic reduction model. A 3D helix version of our reduction model is provided. Finally, we establish an explicit expression of the electromagnetic power in the time domain and in non-monochromatic (non-"continuous waveform") regime. △ Less

Submitted 14 September, 2021; originally announced September 2021.

Comments: 17 pages, 11 figures

MSC Class: 78A50

Journal ref: Phys. Plasmas, 28, 092110 (2021)

arXiv:2010.01963 [pdf, other]

doi 10.1007/978-3-030-59725-2

Automatic CAD-RADS Scoring Using Deep Learning

Authors: Felix Denzinger, Michael Wels, Katharina Breininger, Mehmet A. Gülsün, Max Schöbinger, Florian André, Sebastian Buß, Johannes Görich, Michael Sühling, Andreas Maier

Abstract: Coronary CT angiography (CCTA) has established its role as a non-invasive modality for the diagnosis of coronary artery disease (CAD). The CAD-Reporting and Data System (CAD-RADS) has been developed to standardize communication and aid in decision making based on CCTA findings. The CAD-RADS score is determined by manual assessment of all coronary vessels and the grading of lesions within the coron… ▽ More Coronary CT angiography (CCTA) has established its role as a non-invasive modality for the diagnosis of coronary artery disease (CAD). The CAD-Reporting and Data System (CAD-RADS) has been developed to standardize communication and aid in decision making based on CCTA findings. The CAD-RADS score is determined by manual assessment of all coronary vessels and the grading of lesions within the coronary artery tree. We propose a bottom-up approach for fully-automated prediction of this score using deep-learning operating on a segment-wise representation of the coronary arteries. The method relies solely on a prior fully-automated centerline extraction and segment labeling and predicts the segment-wise stenosis degree and the overall calcification grade as auxiliary tasks in a multi-task learning setup. We evaluate our approach on a data collection consisting of 2,867 patients. On the task of identifying patients with a CAD-RADS score indicating the need for further invasive investigation our approach reaches an area under curve (AUC) of 0.923 and an AUC of 0.914 for determining whether the patient suffers from CAD. This level of performance enables our approach to be used in a fully-automated screening setup or to assist diagnostic CCTA reading, especially due to its neural architecture design -- which allows comprehensive predictions. △ Less

Submitted 5 October, 2020; originally announced October 2020.

Comments: Published at MICCAI 2020

arXiv:2007.08373 [pdf, other]

Self-Supervised Nuclei Segmentation in Histopathological Images Using Attention

Authors: Mihir Sahasrabudhe, Stergios Christodoulidis, Roberto Salgado, Stefan Michiels, Sherene Loi, Fabrice André, Nikos Paragios, Maria Vakalopoulou

Abstract: Segmentation and accurate localization of nuclei in histopathological images is a very challenging problem, with most existing approaches adopting a supervised strategy. These methods usually rely on manual annotations that require a lot of time and effort from medical experts. In this study, we present a self-supervised approach for segmentation of nuclei for whole slide histopathology images. Ou… ▽ More Segmentation and accurate localization of nuclei in histopathological images is a very challenging problem, with most existing approaches adopting a supervised strategy. These methods usually rely on manual annotations that require a lot of time and effort from medical experts. In this study, we present a self-supervised approach for segmentation of nuclei for whole slide histopathology images. Our method works on the assumption that the size and texture of nuclei can determine the magnification at which a patch is extracted. We show that the identification of the magnification level for tiles can generate a preliminary self-supervision signal to locate nuclei. We further show that by appropriately constraining our model it is possible to retrieve meaningful segmentation maps as an auxiliary output to the primary magnification identification task. Our experiments show that with standard post-processing, our method can outperform other unsupervised nuclei segmentation approaches and report similar performance with supervised ones on the publicly available MoNuSeg dataset. Our code and models are available online to facilitate further research. △ Less

Submitted 16 July, 2020; originally announced July 2020.

Comments: 10 pages. Code available online at https://github.com/msahasrabudhe/miccai2020_self_sup_nuclei_seg

arXiv:2004.12852 [pdf, other]

AI-Driven CT-based quantification, staging and short-term outcome prediction of COVID-19 pneumonia

Authors: Guillaume Chassagnon, Maria Vakalopoulou, Enzo Battistella, Stergios Christodoulidis, Trieu-Nghi Hoang-Thi, Severine Dangeard, Eric Deutsch, Fabrice Andre, Enora Guillo, Nara Halm, Stefany El Hajj, Florian Bompard, Sophie Neveu, Chahinez Hani, Ines Saab, Alienor Campredon, Hasmik Koulakian, Souhail Bennani, Gael Freche, Aurelien Lombard, Laure Fournier, Hippolyte Monnier, Teodor Grand, Jules Gregory, Antoine Khalil , et al. (6 additional authors not shown)

Abstract: Chest computed tomography (CT) is widely used for the management of Coronavirus disease 2019 (COVID-19) pneumonia because of its availability and rapidity. The standard of reference for confirming COVID-19 relies on microbiological tests but these tests might not be available in an emergency setting and their results are not immediately available, contrary to CT. In addition to its role for early… ▽ More Chest computed tomography (CT) is widely used for the management of Coronavirus disease 2019 (COVID-19) pneumonia because of its availability and rapidity. The standard of reference for confirming COVID-19 relies on microbiological tests but these tests might not be available in an emergency setting and their results are not immediately available, contrary to CT. In addition to its role for early diagnosis, CT has a prognostic role by allowing visually evaluating the extent of COVID-19 lung abnormalities. The objective of this study is to address prediction of short-term outcomes, especially need for mechanical ventilation. In this multi-centric study, we propose an end-to-end artificial intelligence solution for automatic quantification and prognosis assessment by combining automatic CT delineation of lung disease meeting performance of experts and data-driven identification of biomarkers for its prognosis. AI-driven combination of variables with CT-based biomarkers offers perspectives for optimal patient management given the shortage of intensive care beds and ventilators. △ Less

Submitted 20 April, 2020; originally announced April 2020.

arXiv:1909.13704 [pdf, other]

doi 10.1109/TED.2019.2928450

DIMOHA: A Time-Domain Algorithm for Traveling-Wave Tube Simulations

Authors: Damien Minenna, Yves Elskens, Frédéric André, Alexandre Poyé, Jérôme Puech, Fabrice Doveil

Abstract: To simulate traveling-wave tubes (TWTs) in time domain and more generally the wave-particle interaction in vacuum devices, we developed the DIscrete MOdel with HAmiltonian approach (dimoha) as an alternative to current particle-in-cell (PIC) and frequency approaches. Indeed, it is based on a longitudinal N-body Hamiltonian approach satisfying Maxwell's equations. Advantages of dimoha comprise: (i)… ▽ More To simulate traveling-wave tubes (TWTs) in time domain and more generally the wave-particle interaction in vacuum devices, we developed the DIscrete MOdel with HAmiltonian approach (dimoha) as an alternative to current particle-in-cell (PIC) and frequency approaches. Indeed, it is based on a longitudinal N-body Hamiltonian approach satisfying Maxwell's equations. Advantages of dimoha comprise: (i) it allows arbitrary waveform (not just field envelope), including continuous waveform (CW), multiple carriers or digital modulations (shift keying); (ii) the algorithm is much faster than PIC codes thanks to a field discretization allowing a drastic degree-of-freedom reduction, along with a robust symplectic integrator; (iii) it supports any periodic slow-wave structure design such as helix or folded waveguides; (iv) it reproduces harmonic generation, reflection, oscillation and distortion phenomena; (v) it handles nonlinear dynamics, including intermodulations, trap** and chaos. dimoha accuracy is assessed by comparing it against measurements from a commercial Ku-band tapered helix TWT and against simulations from a sub-THz folded waveguide TWT with a staggered double-grating slow-wave structure. The algorithm is also tested for multiple-carriers simulations with success. △ Less

Submitted 27 September, 2019; originally announced September 2019.

Journal ref: IEEE Transactions on Electron Devices, Institute of Electrical and Electronics Engineers, 2019, 66 (9), pp.4042-4047

arXiv:1905.06900 [pdf, other]

Derived Codebooks for High-Accuracy Nearest Neighbor Search

Authors: Fabien André, Anne-Marie Kermarrec, Nicolas Le Scouarnec

Abstract: High-dimensional Nearest Neighbor (NN) search is central in multimedia search systems. Product Quantization (PQ) is a widespread NN search technique which has a high performance and good scalability. PQ compresses high-dimensional vectors into compact codes thanks to a combination of quantizers. Large databases can, therefore, be stored entirely in RAM, enabling fast responses to NN queries. In al… ▽ More High-dimensional Nearest Neighbor (NN) search is central in multimedia search systems. Product Quantization (PQ) is a widespread NN search technique which has a high performance and good scalability. PQ compresses high-dimensional vectors into compact codes thanks to a combination of quantizers. Large databases can, therefore, be stored entirely in RAM, enabling fast responses to NN queries. In almost all cases, PQ uses 8-bit quantizers as they offer low response times. In this paper, we advocate the use of 16-bit quantizers. Compared to 8-bit quantizers, 16-bit quantizers boost accuracy but they increase response time by a factor of 3 to 10. We propose a novel approach that allows 16-bit quantizers to offer the same response time as 8-bit quantizers, while still providing a boost of accuracy. Our approach builds on two key ideas: (i) the construction of derived codebooks that allow a fast and approximate distance evaluation, and (ii) a two-pass NN search procedure which builds a candidate set using the derived codebooks, and then refines it using 16-bit quantizers. On 1 billion SIFT vectors, with an inverted index, our approach offers a Recall@100 of 0.85 in 5.2 ms. By contrast, 16-bit quantizers alone offer a Recall@100 of 0.85 in 39 ms, and 8-bit quantizers a Recall@100 of 0.82 in 3.8 ms. △ Less

Submitted 16 May, 2019; originally announced May 2019.

arXiv:1902.06431 [pdf, other]

Universality of the Abraham-Minkowski dilemma for photon momenta beyond dielectric materials

Authors: Damien Minenna, Yves Elskens, Fabrice Doveil, Frédéric André

Abstract: The authors) Whenever light is slowed down, for any cause, two different formulas give its momentum. For dielectrics, the coexistence of those momenta was the heart of the century-old Abraham-Minkowski dilemma, recently resolved. We demonstrate that this framework extends to momentum exchange in wave-particle interaction; in particular to Langmuir waves for Landau dam** and to vacuum waveguides… ▽ More The authors) Whenever light is slowed down, for any cause, two different formulas give its momentum. For dielectrics, the coexistence of those momenta was the heart of the century-old Abraham-Minkowski dilemma, recently resolved. We demonstrate that this framework extends to momentum exchange in wave-particle interaction; in particular to Langmuir waves for Landau dam** and to vacuum waveguides of electron tubes (metallic slow-wave structures). Focussing on the latter, we show that the dilemma resolution is not limited to discriminating between kinematic and canonical momenta but also involves a non-negligible momentum flux from Maxwell's electromagnetic stress. The existence of two momenta in materials, plasmas, and waveguides, for which light velocity modification has entirely different origin, points to the universality of the Abraham-Minkowski dilemma. △ Less

Submitted 18 February, 2019; originally announced February 2019.

arXiv:1812.09162 [pdf, other]

doi 10.1109/TPAMI.2019.2952606

Quicker ADC : Unlocking the hidden potential of Product Quantization with SIMD

Authors: Fabien André, Anne-Marie Kermarrec, Nicolas Le Scouarnec

Abstract: Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. A common approach is to rely on Product Quantization, which allows the storage of large vector databases in memory and efficient distance computations. Yet, implementations of nearest neighbor search with Product Quantization have their performance limited by the many memory acce… ▽ More Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. A common approach is to rely on Product Quantization, which allows the storage of large vector databases in memory and efficient distance computations. Yet, implementations of nearest neighbor search with Product Quantization have their performance limited by the many memory accesses they perform. Following this observation, André et al. proposed Quick ADC with up to $6\times$ faster implementations of $m\times{}4$ product quantizers (PQ) leveraging specific SIMD instructions. Quicker ADC is a generalization of Quick ADC not limited to $m\times{}4$ codes and supporting AVX-512, the latest revision of SIMD instruction set. In doing so, Quicker ADC faces the challenge of using efficiently 5,6 and 7-bit shuffles that do not align to computer bytes or words. To this end, we introduce (i) irregular product quantizers combining sub-quantizers of different granularity and (ii) split tables allowing lookup tables larger than registers. We evaluate Quicker ADC with multiple indexes including Inverted Multi-Indexes and IVF HNSW and show that it outperforms the reference optimized implementations (i.e., FAISS and polysemous codes) for numerous configurations. Finally, we release an open-source fork of FAISS enhanced with Quicker ADC at http://github.com/nlescoua/faiss-quickeradc. △ Less

Submitted 14 November, 2019; v1 submitted 21 December, 2018; originally announced December 2018.

Comments: Open-source implementation at http://github.com/nlescoua/faiss-quickeradc

Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019 Early Access

arXiv:1803.11498 [pdf, other]

doi 10.1209/0295-5075/122/44002

Electromagnetic power and momentum in N-body hamiltonian approach to wave-particle dynamics in a periodic structure

Authors: Damien Minenna, Yves Elskens, Frédéric André, Fabrice Doveil

Abstract: To model momentum exchange in nonlinear wave-particle interaction, as in amplification devices like traveling-wave tubes, we use an $N$-body self-consistent hamiltonian description based on Kuznetsov's discrete model, and we provide new formulations for the electromagnetic power and the conserved momentum. This approach leads to fast and accurate numerical simulations in time domain and in one dim… ▽ More To model momentum exchange in nonlinear wave-particle interaction, as in amplification devices like traveling-wave tubes, we use an $N$-body self-consistent hamiltonian description based on Kuznetsov's discrete model, and we provide new formulations for the electromagnetic power and the conserved momentum. This approach leads to fast and accurate numerical simulations in time domain and in one dimensional space. △ Less

Submitted 30 March, 2018; originally announced March 2018.

arXiv:1803.11497 [pdf, other]

doi 10.1140/epjh/e2018-90023-1

The Traveling-Wave Tube in the History of Telecommunication

Authors: Damien Minenna, Frédéric André, Yves Elskens, Jean-François Auboin, Fabrice Doveil, Jérôme Puech, Élise Duverdier

Abstract: The traveling-wave tube is a critical subsystem for satellite data transmission. Its role in the history of wireless communications and in the space conquest is significant, but largely ignored, even though the device remains widely used nowadays. This paper present, albeit non-exhaustively, circumstances and contexts that led to its invention, and its part in the worldwide (in particular in Europ… ▽ More The traveling-wave tube is a critical subsystem for satellite data transmission. Its role in the history of wireless communications and in the space conquest is significant, but largely ignored, even though the device remains widely used nowadays. This paper present, albeit non-exhaustively, circumstances and contexts that led to its invention, and its part in the worldwide (in particular in Europe) expansion of TV broadcasting via microwave radio-relays and satellites. We also discuss its actual contribution to space applications and its conception. The originality of this paper comes from the wide period covered (from first slow-wave structures in 1889 to present space projects) and from connection points made between this device and commercial exploitations. The appendix deals with an intuitive pedagogical description of the wave-particle interaction. △ Less

Submitted 30 March, 2018; originally announced March 2018.

arXiv:1712.02912 [pdf, other]

Exploiting Modern Hardware for High-Dimensional Nearest Neighbor Search

Authors: Fabien André

Abstract: Many multimedia information retrieval or machine learning problems require efficient high-dimensional nearest neighbor search techniques. For instance, multimedia objects (images, music or videos) can be represented by high-dimensional feature vectors. Finding two similar multimedia objects then comes down to finding two objects that have similar feature vectors. In the current context of mass use… ▽ More Many multimedia information retrieval or machine learning problems require efficient high-dimensional nearest neighbor search techniques. For instance, multimedia objects (images, music or videos) can be represented by high-dimensional feature vectors. Finding two similar multimedia objects then comes down to finding two objects that have similar feature vectors. In the current context of mass use of social networks, large scale multimedia databases or large scale machine learning applications are more and more common, calling for efficient nearest neighbor search approaches. This thesis builds on product quantization, an efficient nearest neighbor search technique that compresses high-dimensional vectors into short codes. This makes it possible to store very large databases entirely in RAM, enabling low response times. We propose several contributions that exploit the capabilities of modern CPUs, especially SIMD and the cache hierarchy, to further decrease response times offered by product quantization. △ Less

Submitted 7 December, 2017; originally announced December 2017.

Comments: PhD Thesis, 123 Pages

arXiv:1711.04510 [pdf, other]

Comparison Between Pierce Equivalent Circuit and Recent Discrete Model for Traveling-Wave Tubes

Authors: Damien Minenna, Artem Terentyuk, Frédéric André, Yves Elskens, Nikita Ryskin

Abstract: To perform accurate numerical simulations of the traveling-wave tube in time domain, a new approach using field decomposition with large reduction of degrees-of-freedom has been proposed: the discrete model. To assess its validity, we compare it with the well-established Pierce equivalent circuit model in small signal regime. We also discuss associated beam, circuit-beam, and circuit impedances. W… ▽ More To perform accurate numerical simulations of the traveling-wave tube in time domain, a new approach using field decomposition with large reduction of degrees-of-freedom has been proposed: the discrete model. To assess its validity, we compare it with the well-established Pierce equivalent circuit model in small signal regime. We also discuss associated beam, circuit-beam, and circuit impedances. We demonstrate analytically and with a numerical example that the newly developed discrete model is very close to the Pierce model. Interestingly, small deviations do exist at the edges of the amplification band. We speculate that the deviation from reality is on the Pierce model side, while the discrete model would be more accurate. △ Less

Submitted 13 November, 2017; originally announced November 2017.

Comments: arXiv admin note: substantial text overlap with arXiv:1702.04976

arXiv:1704.07355 [pdf, ps, other]

doi 10.1145/3078971.3078992

Accelerated Nearest Neighbor Search with Quick ADC

Authors: Fabien André, Anne-Marie Kermarrec, Nicolas Le Scouarnec

Abstract: Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. Because it offers low responses times, Product Quantization (PQ) is a popular solution. PQ compresses high-dimensional vectors into short codes using several sub-quantizers, which enables in-RAM storage of large databases. This allows fast answers to NN queries, without accessing… ▽ More Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. Because it offers low responses times, Product Quantization (PQ) is a popular solution. PQ compresses high-dimensional vectors into short codes using several sub-quantizers, which enables in-RAM storage of large databases. This allows fast answers to NN queries, without accessing the SSD or HDD. The key feature of PQ is that it can compute distances between short codes and high-dimensional vectors using cache-resident lookup tables. The efficiency of this technique, named Asymmetric Distance Computation (ADC), remains limited because it performs many cache accesses. In this paper, we introduce Quick ADC, a novel technique that achieves a 3 to 6 times speedup over ADC by exploiting Single Instruction Multiple Data (SIMD) units available in current CPUs. Efficiently exploiting SIMD requires algorithmic changes to the ADC procedure. Namely, Quick ADC relies on two key modifications of ADC: (i) the use 4-bit sub-quantizers instead of the standard 8-bit sub-quantizers and (ii) the quantization of floating-point distances. This allows Quick ADC to exceed the performance of state-of-the-art systems, e.g., it achieves a Recall@100 of 0.94 in 3.4 ms on 1 billion SIFT descriptors (128-bit codes). △ Less

Submitted 24 April, 2017; originally announced April 2017.

Comments: 8 pages, 5 figures, published in Proceedings of ICMR'17, Bucharest, Romania, June 06-09, 2017

ACM Class: H.5.1; H.2.4; H.2.8

arXiv:1607.00779 [pdf, other]

On frequency and time domain models of traveling wave tubes

Authors: Stéphane Théveny, Frédéric André, Yves Elskens

Abstract: We discuss the envelope modulation assumption of frequency-domain models of traveling wave tubes (TWTs) and test its consistency with the Maxwell equations. We compare the predictions of usual frequency-domain models with those of a new time domain model of the TWT. We discuss the envelope modulation assumption of frequency-domain models of traveling wave tubes (TWTs) and test its consistency with the Maxwell equations. We compare the predictions of usual frequency-domain models with those of a new time domain model of the TWT. △ Less

Submitted 4 July, 2016; originally announced July 2016.

arXiv:1303.6143 [pdf, ps, other]

doi 10.1209/0295-5075/103/28004

Hamiltonian description of self-consistent wave-particle dynamics in a periodic structure

Authors: Frédéric André, Pierre Bernardi, Nikita M. Ryskin, Fabrice Doveil, Yves Elskens

Abstract: Conservation of energy and momentum in the classical theory of radiating electrons has been a challenging problem since its inception. We propose a formulation of classical electrodynamics in Hamiltonian form that satisfies the Maxwell equations and the Lorentz force. The radiated field is represented with eigenfunctions using the Gel'fand $β$-transform. The electron Hamiltonian is the standard on… ▽ More Conservation of energy and momentum in the classical theory of radiating electrons has been a challenging problem since its inception. We propose a formulation of classical electrodynamics in Hamiltonian form that satisfies the Maxwell equations and the Lorentz force. The radiated field is represented with eigenfunctions using the Gel'fand $β$-transform. The electron Hamiltonian is the standard one coupling the particles with the propagating fields. The dynamics conserves energy and excludes self-acceleration. A complete Hamiltonian formulation results from adding electrostatic action-at-a-distance coupling between electrons. △ Less

Submitted 18 July, 2013; v1 submitted 25 March, 2013; originally announced March 2013.

Comments: 5 pages

Journal ref: Europhysics Letters 103 (2013) 28004

Showing 1–26 of 26 results for author: Andre, F