-
Multi-Modal Dataset Creation for Federated~Learning with DICOM Structured Reports
Authors:
Malte Tölle,
Lukas Burger,
Halvar Kelm,
Florian André,
Peter Bannas,
Gerhard Diller,
Norbert Frey,
Philipp Garthe,
Stefan Groß,
Anja Hennemuth,
Lars Kaderali,
Nina Krüger,
Andreas Leha,
Simon Martin,
Alexander Meyer,
Eike Nagel,
Stefan Orwat,
Clemens Scherer,
Moritz Seiffert,
Jan Moritz Seliger,
Stefan Simm,
Tim Friede,
Tim Seidler,
Sandy Engelhardt
Abstract:
Purpose: Federated training is often hindered by heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in the emerging multi-modal learning paradigms, where dataset harmonization including a uniform data representation and filtering options are of paramount importance.…
▽ More
Purpose: Federated training is often hindered by heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in the emerging multi-modal learning paradigms, where dataset harmonization including a uniform data representation and filtering options are of paramount importance.
Methods: DICOM structured reports enable the standardized linkage of arbitrary information beyond the imaging domain and can be used within Python deep learning pipelines with highdicom. Building on this, we developed an open platform for data integration and interactive filtering capabilities that simplifies the process of assembling multi-modal datasets.
Results: In this study, we extend our prior work by showing its applicability to more and divergent data types, as well as streamlining datasets for federated training within an established consortium of eight university hospitals in Germany. We prove its concurrent filtering ability by creating harmonized multi-modal datasets across all locations for predicting the outcome after minimally invasive heart valve replacement. The data includes DICOM data (i.e. computed tomography images, electrocardiography scans) as well as annotations (i.e. calcification segmentations, pointsets and pacemaker dependency), and metadata (i.e. prosthesis and diagnoses).
Conclusion: Structured reports bridge the traditional gap between imaging systems and information systems. Utilizing the inherent DICOM reference system arbitrary data types can be queried concurrently to create meaningful cohorts for clinical studies. The graphical interface as well as example structured report templates will be made publicly available.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Federated Foundation Model for Cardiac CT Imaging
Authors:
Malte Tölle,
Philipp Garthe,
Clemens Scherer,
Jan Moritz Seliger,
Andreas Leha,
Nina Krüger,
Stefan Simm,
Simon Martin,
Sebastian Eble,
Halvar Kelm,
Moritz Bednorz,
Florian André,
Peter Bannas,
Gerhard Diller,
Norbert Frey,
Stefan Groß,
Anja Hennemuth,
Lars Kaderali,
Alexander Meyer,
Eike Nagel,
Stefan Orwat,
Moritz Seiffert,
Tim Friede,
Tim Seidler,
Sandy Engelhardt
Abstract:
Federated learning (FL) is a renowned technique for utilizing decentralized data while preserving privacy. However, real-world applications often involve inherent challenges such as partially labeled datasets, where not all clients possess expert annotations of all labels of interest, leaving large portions of unlabeled data unused. In this study, we conduct the largest federated cardiac CT imagin…
▽ More
Federated learning (FL) is a renowned technique for utilizing decentralized data while preserving privacy. However, real-world applications often involve inherent challenges such as partially labeled datasets, where not all clients possess expert annotations of all labels of interest, leaving large portions of unlabeled data unused. In this study, we conduct the largest federated cardiac CT imaging analysis to date, focusing on partially labeled datasets ($n=8,124$) of Transcatheter Aortic Valve Implantation (TAVI) patients over eight hospital clients. Transformer architectures, which are the major building blocks of current foundation models, have shown superior performance when trained on larger cohorts than traditional CNNs. However, when trained on small task-specific labeled sample sizes, it is currently not feasible to exploit their underlying attention mechanism for improved performance. Therefore, we developed a two-stage semi-supervised learning strategy that distills knowledge from several task-specific CNNs (landmark detection and segmentation of calcification) into a single transformer model by utilizing large amounts of unlabeled data typically residing unused in hospitals to mitigate these issues. This method not only improves the predictive accuracy and generalizability of transformer-based architectures but also facilitates the simultaneous learning of all partial labels within a single transformer model across the federation. Additionally, we show that our transformer-based model extracts more meaningful features for further downstream tasks than the UNet-based one by only training the last layer to also solve segmentation of coronary arteries. We make the code and weights of the final model openly available, which can serve as a foundation model for further research in cardiac CT imaging.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Experimental and Theoretical Brownian Dynamics Analysis of Ion Transport During Cellular Electroporation of E. coli Bacteria
Authors:
Juan González-Cuevas,
Ricardo Argüello,
Marcos Florentin,
Franck M. André,
Lluis Mir
Abstract:
Escherichia coli bacterium is a rod-shaped organism composed of a complex double membrane structure. Knowledge of electric field driven ion transport through both membranes and the evolution of their induced permeabilization has important applications in biomedical engineering, delivery of genes and antibacterial agents. However, few studies have been conducted on Gram-negative bacteria in this re…
▽ More
Escherichia coli bacterium is a rod-shaped organism composed of a complex double membrane structure. Knowledge of electric field driven ion transport through both membranes and the evolution of their induced permeabilization has important applications in biomedical engineering, delivery of genes and antibacterial agents. However, few studies have been conducted on Gram-negative bacteria in this regard considering the contribution of all ion types. To address this gap in knowledge, we have developed a deterministic and stochastic Brownian dynamics model to simulate in 3D space the motion of ions through pores formed in the plasma membranes of E. coli cells during electroporation. The diffusion coefficient, mobility, and translation time of Ca$^{2+}$, Mg$^{2+}$, Na$^+$, K$^+$, and Cl$^-$ ions within the pore region are estimated from the numerical model. Calculations of pore's conductance have been validated with experiments conducted at Gustave Roussy. From the simulations, it was found that the main driving force of ionic uptake during the pulse is the one due to the externally applied electric field. The results from this work provide a better understanding of ion transport during electroporation, aiding in the design of electrical pulses for maximizing ion throughput, primarily for application in cancer treatment.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
The dressing field method for diffeomorphisms: a relational framework
Authors:
Jordan T. Francois Andre
Abstract:
The dressing field method is a tool to reduce gauge symmetries. Here we extend it to cover the case of diffeomorphisms. The resulting framework is a systematic scheme to produce Diff(M)-invariant objects, which has a natural relational interpretation.
Its precise formulation relies on a clear understanding of the bundle geometry of field space. By detailing it, among other things we stress the g…
▽ More
The dressing field method is a tool to reduce gauge symmetries. Here we extend it to cover the case of diffeomorphisms. The resulting framework is a systematic scheme to produce Diff(M)-invariant objects, which has a natural relational interpretation.
Its precise formulation relies on a clear understanding of the bundle geometry of field space. By detailing it, among other things we stress the geometric nature of field-independent and field-dependent diffeomorphisms, and highlight that the heuristic "extended bracket" for field-dependent vector fields often featuring in the covariant phase space literature can be understood as arising from the Frölicher-Nijenhuis bracket. Furthermore, by articulating this bundle geometry with the covariant phase space approach, we give a streamlined account of the elementary objects of the (pre)symplectic structure of a Diff(M)-theory: Noether charges and their bracket, as induced by the standard prescription for the presymplectic potential and 2-form. We give conceptually transparent expressions allowing to read the integrability conditions and the circumstances under which the bracket of charge is Lie, and the resulting Poisson algebras of charges are central extensions of the Lie algebras of field-independent ($\mathfrak{diff}(M)$) and field-dependent vector fields.
We show that, applying the dressing field method, one obtains a Diff(M)-invariant and manifestly relational formulation of a general relativistic field theory. Relying on results just mentioned, we easily derive the "dressed" (relational) presymplectic structure of the theory. This reproduces or extends results from the gravitational edge mode and gravitational dressing literature. In addition to simplified technical derivations, the conceptual clarity of the framework supplies several insights and allows us to dispel misconceptions.
△ Less
Submitted 25 February, 2024; v1 submitted 22 October, 2023;
originally announced October 2023.
-
Helical coil design with controlled dispersion for bunching enhancement of the TNSA protons
Authors:
A Hirsch-Passicos,
C L C Lacoste,
F André,
Y Elskens,
E d'Humières,
V Tikhonchuk,
M Bardon
Abstract:
The quality of the proton beam produced by Target Normal Sheath Acceleration (TNSA) with high power lasers can be significantly improved with the use of helical coils. While they showed promising results in terms of focusing, their performances in terms of the of cutoff energy and bunching stay limited due to the dispersive nature of helical coils. A new scheme of helical coil with a tube surround…
▽ More
The quality of the proton beam produced by Target Normal Sheath Acceleration (TNSA) with high power lasers can be significantly improved with the use of helical coils. While they showed promising results in terms of focusing, their performances in terms of the of cutoff energy and bunching stay limited due to the dispersive nature of helical coils. A new scheme of helical coil with a tube surrounding the helix is introduced, and the first numerical simulations and an analytical model show a possibility of a drastic reduction of the current pulse dispersion for the parameters of high power laser facilities. The helical coils with tube strongly increase bunching, creating two collimated narrow-band proton beams from a broad and divergent TNSA distribution. The analytical model provides scaling of proton parameters as a function of laser facility features.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
On the detection of Out-Of-Distribution samples in Multiple Instance Learning
Authors:
Loïc Le Bescond,
Maria Vakalopoulou,
Stergios Christodoulidis,
Fabrice André,
Hugues Talbot
Abstract:
The deployment of machine learning solutions in real-world scenarios often involves addressing the challenge of out-of-distribution (OOD) detection. While significant efforts have been devoted to OOD detection in classical supervised settings, the context of weakly supervised learning, particularly the Multiple Instance Learning (MIL) framework, remains under-explored. In this study, we tackle thi…
▽ More
The deployment of machine learning solutions in real-world scenarios often involves addressing the challenge of out-of-distribution (OOD) detection. While significant efforts have been devoted to OOD detection in classical supervised settings, the context of weakly supervised learning, particularly the Multiple Instance Learning (MIL) framework, remains under-explored. In this study, we tackle this challenge by adapting post-hoc OOD detection methods to the MIL setting while introducing a novel benchmark specifically designed to assess OOD detection performance in weakly supervised scenarios. Across extensive experiments based on diverse public datasets, KNN emerges as the best-performing method overall. However, it exhibits significant shortcomings on some datasets, emphasizing the complexity of this under-explored and challenging topic. Our findings shed light on the complex nature of OOD detection under the MIL framework, emphasizing the importance of develo** novel, robust, and reliable methods that can generalize effectively in a weakly supervised context. The code for the paper is available here: https://github.com/loic-lb/OOD_MIL.
△ Less
Submitted 9 November, 2023; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Handling Label Uncertainty on the Example of Automatic Detection of Shepherd's Crook RCA in Coronary CT Angiography
Authors:
Felix Denzinger,
Michael Wels,
Oliver Taubmann,
Florian Kordon,
Fabian Wagner,
Stephanie Mehltretter,
Mehmet A. Gülsün,
Max Schöbinger,
Florian André,
Sebastian Buss,
Johannes Görich,
Michael Sühling,
Andreas Maier
Abstract:
Coronary artery disease (CAD) is often treated minimally invasively with a catheter being inserted into the diseased coronary vessel. If a patient exhibits a Shepherd's Crook (SC) Right Coronary Artery (RCA) - an anatomical norm variant of the coronary vasculature - the complexity of this procedure is increased. Automated reporting of this variant from coronary CT angiography screening would ease…
▽ More
Coronary artery disease (CAD) is often treated minimally invasively with a catheter being inserted into the diseased coronary vessel. If a patient exhibits a Shepherd's Crook (SC) Right Coronary Artery (RCA) - an anatomical norm variant of the coronary vasculature - the complexity of this procedure is increased. Automated reporting of this variant from coronary CT angiography screening would ease prior risk assessment. We propose a 1D convolutional neural network which leverages a sequence of residual dilated convolutions to automatically determine this norm variant from a prior extracted vessel centerline. As the SC RCA is not clearly defined with respect to concrete measurements, labeling also includes qualitative aspects. Therefore, 4.23% samples in our dataset of 519 RCA centerlines were labeled as unsure SC RCAs, with 5.97% being labeled as sure SC RCAs. We explore measures to handle this label uncertainty, namely global/model-wise random assignment, exclusion, and soft label assignment. Furthermore, we evaluate how this uncertainty can be leveraged for the determination of a rejection class. With our best configuration, we reach an area under the receiver operating characteristic curve (AUC) of 0.938 on confident labels. Moreover, we observe an increase of up to 0.020 AUC when rejecting 10% of the data and leveraging the labeling uncertainty information in the exclusion process.
△ Less
Submitted 22 May, 2023;
originally announced June 2023.
-
A biology-driven deep generative model for cell-type annotation in cytometry
Authors:
Quentin Blampey,
Nadège Bercovici,
Charles-Antoine Dutertre,
Isabelle Pic,
Fabrice André,
Joana Mourato Ribeiro,
Paul-Henry Cournède
Abstract:
Cytometry enables precise single-cell phenoty** within heterogeneous populations. These cell types are traditionally annotated via manual gating, but this method suffers from a lack of reproducibility and sensitivity to batch-effect. Also, the most recent cytometers - spectral flow or mass cytometers - create rich and high-dimensional data whose analysis via manual gating becomes challenging and…
▽ More
Cytometry enables precise single-cell phenoty** within heterogeneous populations. These cell types are traditionally annotated via manual gating, but this method suffers from a lack of reproducibility and sensitivity to batch-effect. Also, the most recent cytometers - spectral flow or mass cytometers - create rich and high-dimensional data whose analysis via manual gating becomes challenging and time-consuming. To tackle these limitations, we introduce Scyan (https://github.com/MICS-Lab/scyan), a Single-cell Cytometry Annotation Network that automatically annotates cell types using only prior expert knowledge about the cytometry panel. We demonstrate that Scyan significantly outperforms the related state-of-the-art models on multiple public datasets while being faster and interpretable. In addition, Scyan overcomes several complementary tasks such as batch-effect removal, debarcoding, and population discovery. Overall, this model accelerates and eases cell population characterisation, quantification, and discovery in cytometry.
△ Less
Submitted 21 April, 2023; v1 submitted 11 August, 2022;
originally announced August 2022.
-
Content-Aware Differential Privacy with Conditional Invertible Neural Networks
Authors:
Malte Tölle,
Ullrich Köthe,
Florian André,
Benjamin Meder,
Sandy Engelhardt
Abstract:
Differential privacy (DP) has arisen as the gold standard in protecting an individual's privacy in datasets by adding calibrated noise to each data sample. While the application to categorical data is straightforward, its usability in the context of images has been limited. Contrary to categorical data the meaning of an image is inherent in the spatial correlation of neighboring pixels making the…
▽ More
Differential privacy (DP) has arisen as the gold standard in protecting an individual's privacy in datasets by adding calibrated noise to each data sample. While the application to categorical data is straightforward, its usability in the context of images has been limited. Contrary to categorical data the meaning of an image is inherent in the spatial correlation of neighboring pixels making the simple application of noise infeasible. Invertible Neural Networks (INN) have shown excellent generative performance while still providing the ability to quantify the exact likelihood. Their principle is based on transforming a complicated distribution into a simple one e.g. an image into a spherical Gaussian. We hypothesize that adding noise to the latent space of an INN can enable differentially private image modification. Manipulation of the latent space leads to a modified image while preserving important details. Further, by conditioning the INN on meta-data provided with the dataset we aim at leaving dimensions important for downstream tasks like classification untouched while altering other parts that potentially contain identifying information. We term our method content-aware differential privacy (CADP). We conduct experiments on publicly available benchmarking datasets as well as dedicated medical ones. In addition, we show the generalizability of our method to categorical data. The source code is publicly available at https://github.com/Cardio-AI/CADP.
△ Less
Submitted 29 July, 2022;
originally announced July 2022.
-
CAD-RADS Scoring using Deep Learning and Task-Specific Centerline Labeling
Authors:
Felix Denzinger,
Michael Wels,
Oliver Taubmann,
Mehmet A. Gülsün,
Max Schöbinger,
Florian André,
Sebastian J. Buss,
Johannes Görich,
Michael Sühling,
Andreas Maier,
Katharina Breininger
Abstract:
With coronary artery disease (CAD) persisting to be one of the leading causes of death worldwide, interest in supporting physicians with algorithms to speed up and improve diagnosis is high. In clinical practice, the severeness of CAD is often assessed with a coronary CT angiography (CCTA) scan and manually graded with the CAD-Reporting and Data System (CAD-RADS) score. The clinical questions this…
▽ More
With coronary artery disease (CAD) persisting to be one of the leading causes of death worldwide, interest in supporting physicians with algorithms to speed up and improve diagnosis is high. In clinical practice, the severeness of CAD is often assessed with a coronary CT angiography (CCTA) scan and manually graded with the CAD-Reporting and Data System (CAD-RADS) score. The clinical questions this score assesses are whether patients have CAD or not (rule-out) and whether they have severe CAD or not (hold-out). In this work, we reach new state-of-the-art performance for automatic CAD-RADS scoring. We propose using severity-based label encoding, test time augmentation (TTA) and model ensembling for a task-specific deep learning architecture. Furthermore, we introduce a novel task- and model-specific, heuristic coronary segment labeling, which subdivides coronary trees into consistent parts across patients. It is fast, robust, and easy to implement. We were able to raise the previously reported area under the receiver operating characteristic curve (AUC) from 0.914 to 0.942 in the rule-out and from 0.921 to 0.950 in the hold-out task respectively.
△ Less
Submitted 8 February, 2022;
originally announced February 2022.
-
Comparison of Evaluation Metrics for Landmark Detection in CMR Images
Authors:
Sven Koehler,
Lalith Sharan,
Julian Kuhm,
Arman Ghanaat,
Jelizaveta Gordejeva,
Nike K. Simon,
Niko M. Grell,
Florian André,
Sandy Engelhardt
Abstract:
Cardiac Magnetic Resonance (CMR) images are widely used for cardiac diagnosis and ventricular assessment. Extracting specific landmarks like the right ventricular insertion points is of importance for spatial alignment and 3D modeling. The automatic detection of such landmarks has been tackled by multiple groups using Deep Learning, but relatively little attention has been paid to the failure case…
▽ More
Cardiac Magnetic Resonance (CMR) images are widely used for cardiac diagnosis and ventricular assessment. Extracting specific landmarks like the right ventricular insertion points is of importance for spatial alignment and 3D modeling. The automatic detection of such landmarks has been tackled by multiple groups using Deep Learning, but relatively little attention has been paid to the failure cases of evaluation metrics in this field. In this work, we extended the public ACDC dataset with additional labels of the right ventricular insertion points and compare different variants of a heatmap-based landmark detection pipeline. In this comparison, we demonstrate very likely pitfalls of apparently simple detection and localisation metrics which highlights the importance of a clear detection strategy and the definition of an upper limit for localisation-based metrics. Our preliminary results indicate that a combination of different metrics is necessary, as they yield different winners for method comparison. Additionally, they highlight the need of a comprehensive metric description and evaluation standardisation, especially for the error cases where no metrics could be computed or where no lower/upper boundary of a metric exists. Code and labels: https://github.com/Cardio-AI/rvip_landmark_detection
△ Less
Submitted 28 January, 2022; v1 submitted 25 January, 2022;
originally announced January 2022.
-
Time simulation of the nonlinear wave-particle interaction in meters long traveling-wave tubes
Authors:
Damien F. G. Minenna,
Khalil Aliane,
Yves Elskens,
Alexandre Poyé,
Frédéric André,
Jérôme Puech,
Fabrice Doveil
Abstract:
We propose a multi-particle self-consistent Hamiltonian (derived from an N-body description) that is applicable for periodic structures such as traveling-wave tubes (TWTs), gyrotrons, free-electron lasers, or particle accelerators. We build a 1D symplectic multi-particle algorithm to simulate the nonlinear wave-particle interaction in the time domain occurring in an experimental 3-meters long heli…
▽ More
We propose a multi-particle self-consistent Hamiltonian (derived from an N-body description) that is applicable for periodic structures such as traveling-wave tubes (TWTs), gyrotrons, free-electron lasers, or particle accelerators. We build a 1D symplectic multi-particle algorithm to simulate the nonlinear wave-particle interaction in the time domain occurring in an experimental 3-meters long helix TWT. Our algorithm is efficient thanks to a drastic reduction model. A 3D helix version of our reduction model is provided. Finally, we establish an explicit expression of the electromagnetic power in the time domain and in non-monochromatic (non-"continuous waveform") regime.
△ Less
Submitted 14 September, 2021;
originally announced September 2021.
-
Automatic CAD-RADS Scoring Using Deep Learning
Authors:
Felix Denzinger,
Michael Wels,
Katharina Breininger,
Mehmet A. Gülsün,
Max Schöbinger,
Florian André,
Sebastian Buß,
Johannes Görich,
Michael Sühling,
Andreas Maier
Abstract:
Coronary CT angiography (CCTA) has established its role as a non-invasive modality for the diagnosis of coronary artery disease (CAD). The CAD-Reporting and Data System (CAD-RADS) has been developed to standardize communication and aid in decision making based on CCTA findings. The CAD-RADS score is determined by manual assessment of all coronary vessels and the grading of lesions within the coron…
▽ More
Coronary CT angiography (CCTA) has established its role as a non-invasive modality for the diagnosis of coronary artery disease (CAD). The CAD-Reporting and Data System (CAD-RADS) has been developed to standardize communication and aid in decision making based on CCTA findings. The CAD-RADS score is determined by manual assessment of all coronary vessels and the grading of lesions within the coronary artery tree.
We propose a bottom-up approach for fully-automated prediction of this score using deep-learning operating on a segment-wise representation of the coronary arteries. The method relies solely on a prior fully-automated centerline extraction and segment labeling and predicts the segment-wise stenosis degree and the overall calcification grade as auxiliary tasks in a multi-task learning setup.
We evaluate our approach on a data collection consisting of 2,867 patients. On the task of identifying patients with a CAD-RADS score indicating the need for further invasive investigation our approach reaches an area under curve (AUC) of 0.923 and an AUC of 0.914 for determining whether the patient suffers from CAD. This level of performance enables our approach to be used in a fully-automated screening setup or to assist diagnostic CCTA reading, especially due to its neural architecture design -- which allows comprehensive predictions.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Self-Supervised Nuclei Segmentation in Histopathological Images Using Attention
Authors:
Mihir Sahasrabudhe,
Stergios Christodoulidis,
Roberto Salgado,
Stefan Michiels,
Sherene Loi,
Fabrice André,
Nikos Paragios,
Maria Vakalopoulou
Abstract:
Segmentation and accurate localization of nuclei in histopathological images is a very challenging problem, with most existing approaches adopting a supervised strategy. These methods usually rely on manual annotations that require a lot of time and effort from medical experts. In this study, we present a self-supervised approach for segmentation of nuclei for whole slide histopathology images. Ou…
▽ More
Segmentation and accurate localization of nuclei in histopathological images is a very challenging problem, with most existing approaches adopting a supervised strategy. These methods usually rely on manual annotations that require a lot of time and effort from medical experts. In this study, we present a self-supervised approach for segmentation of nuclei for whole slide histopathology images. Our method works on the assumption that the size and texture of nuclei can determine the magnification at which a patch is extracted. We show that the identification of the magnification level for tiles can generate a preliminary self-supervision signal to locate nuclei. We further show that by appropriately constraining our model it is possible to retrieve meaningful segmentation maps as an auxiliary output to the primary magnification identification task. Our experiments show that with standard post-processing, our method can outperform other unsupervised nuclei segmentation approaches and report similar performance with supervised ones on the publicly available MoNuSeg dataset. Our code and models are available online to facilitate further research.
△ Less
Submitted 16 July, 2020;
originally announced July 2020.
-
AI-Driven CT-based quantification, staging and short-term outcome prediction of COVID-19 pneumonia
Authors:
Guillaume Chassagnon,
Maria Vakalopoulou,
Enzo Battistella,
Stergios Christodoulidis,
Trieu-Nghi Hoang-Thi,
Severine Dangeard,
Eric Deutsch,
Fabrice Andre,
Enora Guillo,
Nara Halm,
Stefany El Hajj,
Florian Bompard,
Sophie Neveu,
Chahinez Hani,
Ines Saab,
Alienor Campredon,
Hasmik Koulakian,
Souhail Bennani,
Gael Freche,
Aurelien Lombard,
Laure Fournier,
Hippolyte Monnier,
Teodor Grand,
Jules Gregory,
Antoine Khalil
, et al. (6 additional authors not shown)
Abstract:
Chest computed tomography (CT) is widely used for the management of Coronavirus disease 2019 (COVID-19) pneumonia because of its availability and rapidity. The standard of reference for confirming COVID-19 relies on microbiological tests but these tests might not be available in an emergency setting and their results are not immediately available, contrary to CT. In addition to its role for early…
▽ More
Chest computed tomography (CT) is widely used for the management of Coronavirus disease 2019 (COVID-19) pneumonia because of its availability and rapidity. The standard of reference for confirming COVID-19 relies on microbiological tests but these tests might not be available in an emergency setting and their results are not immediately available, contrary to CT. In addition to its role for early diagnosis, CT has a prognostic role by allowing visually evaluating the extent of COVID-19 lung abnormalities. The objective of this study is to address prediction of short-term outcomes, especially need for mechanical ventilation. In this multi-centric study, we propose an end-to-end artificial intelligence solution for automatic quantification and prognosis assessment by combining automatic CT delineation of lung disease meeting performance of experts and data-driven identification of biomarkers for its prognosis. AI-driven combination of variables with CT-based biomarkers offers perspectives for optimal patient management given the shortage of intensive care beds and ventilators.
△ Less
Submitted 20 April, 2020;
originally announced April 2020.
-
DIMOHA: A Time-Domain Algorithm for Traveling-Wave Tube Simulations
Authors:
Damien Minenna,
Yves Elskens,
Frédéric André,
Alexandre Poyé,
Jérôme Puech,
Fabrice Doveil
Abstract:
To simulate traveling-wave tubes (TWTs) in time domain and more generally the wave-particle interaction in vacuum devices, we developed the DIscrete MOdel with HAmiltonian approach (dimoha) as an alternative to current particle-in-cell (PIC) and frequency approaches. Indeed, it is based on a longitudinal N-body Hamiltonian approach satisfying Maxwell's equations. Advantages of dimoha comprise: (i)…
▽ More
To simulate traveling-wave tubes (TWTs) in time domain and more generally the wave-particle interaction in vacuum devices, we developed the DIscrete MOdel with HAmiltonian approach (dimoha) as an alternative to current particle-in-cell (PIC) and frequency approaches. Indeed, it is based on a longitudinal N-body Hamiltonian approach satisfying Maxwell's equations. Advantages of dimoha comprise: (i) it allows arbitrary waveform (not just field envelope), including continuous waveform (CW), multiple carriers or digital modulations (shift keying); (ii) the algorithm is much faster than PIC codes thanks to a field discretization allowing a drastic degree-of-freedom reduction, along with a robust symplectic integrator; (iii) it supports any periodic slow-wave structure design such as helix or folded waveguides; (iv) it reproduces harmonic generation, reflection, oscillation and distortion phenomena; (v) it handles nonlinear dynamics, including intermodulations, trap** and chaos. dimoha accuracy is assessed by comparing it against measurements from a commercial Ku-band tapered helix TWT and against simulations from a sub-THz folded waveguide TWT with a staggered double-grating slow-wave structure. The algorithm is also tested for multiple-carriers simulations with success.
△ Less
Submitted 27 September, 2019;
originally announced September 2019.
-
Derived Codebooks for High-Accuracy Nearest Neighbor Search
Authors:
Fabien André,
Anne-Marie Kermarrec,
Nicolas Le Scouarnec
Abstract:
High-dimensional Nearest Neighbor (NN) search is central in multimedia search systems. Product Quantization (PQ) is a widespread NN search technique which has a high performance and good scalability. PQ compresses high-dimensional vectors into compact codes thanks to a combination of quantizers. Large databases can, therefore, be stored entirely in RAM, enabling fast responses to NN queries. In al…
▽ More
High-dimensional Nearest Neighbor (NN) search is central in multimedia search systems. Product Quantization (PQ) is a widespread NN search technique which has a high performance and good scalability. PQ compresses high-dimensional vectors into compact codes thanks to a combination of quantizers. Large databases can, therefore, be stored entirely in RAM, enabling fast responses to NN queries. In almost all cases, PQ uses 8-bit quantizers as they offer low response times. In this paper, we advocate the use of 16-bit quantizers. Compared to 8-bit quantizers, 16-bit quantizers boost accuracy but they increase response time by a factor of 3 to 10. We propose a novel approach that allows 16-bit quantizers to offer the same response time as 8-bit quantizers, while still providing a boost of accuracy. Our approach builds on two key ideas: (i) the construction of derived codebooks that allow a fast and approximate distance evaluation, and (ii) a two-pass NN search procedure which builds a candidate set using the derived codebooks, and then refines it using 16-bit quantizers. On 1 billion SIFT vectors, with an inverted index, our approach offers a Recall@100 of 0.85 in 5.2 ms. By contrast, 16-bit quantizers alone offer a Recall@100 of 0.85 in 39 ms, and 8-bit quantizers a Recall@100 of 0.82 in 3.8 ms.
△ Less
Submitted 16 May, 2019;
originally announced May 2019.
-
Universality of the Abraham-Minkowski dilemma for photon momenta beyond dielectric materials
Authors:
Damien Minenna,
Yves Elskens,
Fabrice Doveil,
Frédéric André
Abstract:
The authors) Whenever light is slowed down, for any cause, two different formulas give its momentum. For dielectrics, the coexistence of those momenta was the heart of the century-old Abraham-Minkowski dilemma, recently resolved. We demonstrate that this framework extends to momentum exchange in wave-particle interaction; in particular to Langmuir waves for Landau dam** and to vacuum waveguides…
▽ More
The authors) Whenever light is slowed down, for any cause, two different formulas give its momentum. For dielectrics, the coexistence of those momenta was the heart of the century-old Abraham-Minkowski dilemma, recently resolved. We demonstrate that this framework extends to momentum exchange in wave-particle interaction; in particular to Langmuir waves for Landau dam** and to vacuum waveguides of electron tubes (metallic slow-wave structures). Focussing on the latter, we show that the dilemma resolution is not limited to discriminating between kinematic and canonical momenta but also involves a non-negligible momentum flux from Maxwell's electromagnetic stress. The existence of two momenta in materials, plasmas, and waveguides, for which light velocity modification has entirely different origin, points to the universality of the Abraham-Minkowski dilemma.
△ Less
Submitted 18 February, 2019;
originally announced February 2019.
-
Quicker ADC : Unlocking the hidden potential of Product Quantization with SIMD
Authors:
Fabien André,
Anne-Marie Kermarrec,
Nicolas Le Scouarnec
Abstract:
Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. A common approach is to rely on Product Quantization, which allows the storage of large vector databases in memory and efficient distance computations. Yet, implementations of nearest neighbor search with Product Quantization have their performance limited by the many memory acce…
▽ More
Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. A common approach is to rely on Product Quantization, which allows the storage of large vector databases in memory and efficient distance computations. Yet, implementations of nearest neighbor search with Product Quantization have their performance limited by the many memory accesses they perform. Following this observation, André et al. proposed Quick ADC with up to $6\times$ faster implementations of $m\times{}4$ product quantizers (PQ) leveraging specific SIMD instructions. Quicker ADC is a generalization of Quick ADC not limited to $m\times{}4$ codes and supporting AVX-512, the latest revision of SIMD instruction set. In doing so, Quicker ADC faces the challenge of using efficiently 5,6 and 7-bit shuffles that do not align to computer bytes or words. To this end, we introduce (i) irregular product quantizers combining sub-quantizers of different granularity and (ii) split tables allowing lookup tables larger than registers. We evaluate Quicker ADC with multiple indexes including Inverted Multi-Indexes and IVF HNSW and show that it outperforms the reference optimized implementations (i.e., FAISS and polysemous codes) for numerous configurations. Finally, we release an open-source fork of FAISS enhanced with Quicker ADC at http://github.com/nlescoua/faiss-quickeradc.
△ Less
Submitted 14 November, 2019; v1 submitted 21 December, 2018;
originally announced December 2018.
-
Electromagnetic power and momentum in N-body hamiltonian approach to wave-particle dynamics in a periodic structure
Authors:
Damien Minenna,
Yves Elskens,
Frédéric André,
Fabrice Doveil
Abstract:
To model momentum exchange in nonlinear wave-particle interaction, as in amplification devices like traveling-wave tubes, we use an $N$-body self-consistent hamiltonian description based on Kuznetsov's discrete model, and we provide new formulations for the electromagnetic power and the conserved momentum. This approach leads to fast and accurate numerical simulations in time domain and in one dim…
▽ More
To model momentum exchange in nonlinear wave-particle interaction, as in amplification devices like traveling-wave tubes, we use an $N$-body self-consistent hamiltonian description based on Kuznetsov's discrete model, and we provide new formulations for the electromagnetic power and the conserved momentum. This approach leads to fast and accurate numerical simulations in time domain and in one dimensional space.
△ Less
Submitted 30 March, 2018;
originally announced March 2018.
-
The Traveling-Wave Tube in the History of Telecommunication
Authors:
Damien Minenna,
Frédéric André,
Yves Elskens,
Jean-François Auboin,
Fabrice Doveil,
Jérôme Puech,
Élise Duverdier
Abstract:
The traveling-wave tube is a critical subsystem for satellite data transmission. Its role in the history of wireless communications and in the space conquest is significant, but largely ignored, even though the device remains widely used nowadays. This paper present, albeit non-exhaustively, circumstances and contexts that led to its invention, and its part in the worldwide (in particular in Europ…
▽ More
The traveling-wave tube is a critical subsystem for satellite data transmission. Its role in the history of wireless communications and in the space conquest is significant, but largely ignored, even though the device remains widely used nowadays. This paper present, albeit non-exhaustively, circumstances and contexts that led to its invention, and its part in the worldwide (in particular in Europe) expansion of TV broadcasting via microwave radio-relays and satellites. We also discuss its actual contribution to space applications and its conception. The originality of this paper comes from the wide period covered (from first slow-wave structures in 1889 to present space projects) and from connection points made between this device and commercial exploitations. The appendix deals with an intuitive pedagogical description of the wave-particle interaction.
△ Less
Submitted 30 March, 2018;
originally announced March 2018.
-
Exploiting Modern Hardware for High-Dimensional Nearest Neighbor Search
Authors:
Fabien André
Abstract:
Many multimedia information retrieval or machine learning problems require efficient high-dimensional nearest neighbor search techniques. For instance, multimedia objects (images, music or videos) can be represented by high-dimensional feature vectors. Finding two similar multimedia objects then comes down to finding two objects that have similar feature vectors. In the current context of mass use…
▽ More
Many multimedia information retrieval or machine learning problems require efficient high-dimensional nearest neighbor search techniques. For instance, multimedia objects (images, music or videos) can be represented by high-dimensional feature vectors. Finding two similar multimedia objects then comes down to finding two objects that have similar feature vectors. In the current context of mass use of social networks, large scale multimedia databases or large scale machine learning applications are more and more common, calling for efficient nearest neighbor search approaches.
This thesis builds on product quantization, an efficient nearest neighbor search technique that compresses high-dimensional vectors into short codes. This makes it possible to store very large databases entirely in RAM, enabling low response times. We propose several contributions that exploit the capabilities of modern CPUs, especially SIMD and the cache hierarchy, to further decrease response times offered by product quantization.
△ Less
Submitted 7 December, 2017;
originally announced December 2017.
-
Comparison Between Pierce Equivalent Circuit and Recent Discrete Model for Traveling-Wave Tubes
Authors:
Damien Minenna,
Artem Terentyuk,
Frédéric André,
Yves Elskens,
Nikita Ryskin
Abstract:
To perform accurate numerical simulations of the traveling-wave tube in time domain, a new approach using field decomposition with large reduction of degrees-of-freedom has been proposed: the discrete model. To assess its validity, we compare it with the well-established Pierce equivalent circuit model in small signal regime. We also discuss associated beam, circuit-beam, and circuit impedances. W…
▽ More
To perform accurate numerical simulations of the traveling-wave tube in time domain, a new approach using field decomposition with large reduction of degrees-of-freedom has been proposed: the discrete model. To assess its validity, we compare it with the well-established Pierce equivalent circuit model in small signal regime. We also discuss associated beam, circuit-beam, and circuit impedances. We demonstrate analytically and with a numerical example that the newly developed discrete model is very close to the Pierce model. Interestingly, small deviations do exist at the edges of the amplification band. We speculate that the deviation from reality is on the Pierce model side, while the discrete model would be more accurate.
△ Less
Submitted 13 November, 2017;
originally announced November 2017.
-
Accelerated Nearest Neighbor Search with Quick ADC
Authors:
Fabien André,
Anne-Marie Kermarrec,
Nicolas Le Scouarnec
Abstract:
Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. Because it offers low responses times, Product Quantization (PQ) is a popular solution. PQ compresses high-dimensional vectors into short codes using several sub-quantizers, which enables in-RAM storage of large databases. This allows fast answers to NN queries, without accessing…
▽ More
Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. Because it offers low responses times, Product Quantization (PQ) is a popular solution. PQ compresses high-dimensional vectors into short codes using several sub-quantizers, which enables in-RAM storage of large databases. This allows fast answers to NN queries, without accessing the SSD or HDD. The key feature of PQ is that it can compute distances between short codes and high-dimensional vectors using cache-resident lookup tables. The efficiency of this technique, named Asymmetric Distance Computation (ADC), remains limited because it performs many cache accesses.
In this paper, we introduce Quick ADC, a novel technique that achieves a 3 to 6 times speedup over ADC by exploiting Single Instruction Multiple Data (SIMD) units available in current CPUs. Efficiently exploiting SIMD requires algorithmic changes to the ADC procedure. Namely, Quick ADC relies on two key modifications of ADC: (i) the use 4-bit sub-quantizers instead of the standard 8-bit sub-quantizers and (ii) the quantization of floating-point distances. This allows Quick ADC to exceed the performance of state-of-the-art systems, e.g., it achieves a Recall@100 of 0.94 in 3.4 ms on 1 billion SIFT descriptors (128-bit codes).
△ Less
Submitted 24 April, 2017;
originally announced April 2017.
-
On frequency and time domain models of traveling wave tubes
Authors:
Stéphane Théveny,
Frédéric André,
Yves Elskens
Abstract:
We discuss the envelope modulation assumption of frequency-domain models of traveling wave tubes (TWTs) and test its consistency with the Maxwell equations. We compare the predictions of usual frequency-domain models with those of a new time domain model of the TWT.
We discuss the envelope modulation assumption of frequency-domain models of traveling wave tubes (TWTs) and test its consistency with the Maxwell equations. We compare the predictions of usual frequency-domain models with those of a new time domain model of the TWT.
△ Less
Submitted 4 July, 2016;
originally announced July 2016.
-
Hamiltonian description of self-consistent wave-particle dynamics in a periodic structure
Authors:
Frédéric André,
Pierre Bernardi,
Nikita M. Ryskin,
Fabrice Doveil,
Yves Elskens
Abstract:
Conservation of energy and momentum in the classical theory of radiating electrons has been a challenging problem since its inception. We propose a formulation of classical electrodynamics in Hamiltonian form that satisfies the Maxwell equations and the Lorentz force. The radiated field is represented with eigenfunctions using the Gel'fand $β$-transform. The electron Hamiltonian is the standard on…
▽ More
Conservation of energy and momentum in the classical theory of radiating electrons has been a challenging problem since its inception. We propose a formulation of classical electrodynamics in Hamiltonian form that satisfies the Maxwell equations and the Lorentz force. The radiated field is represented with eigenfunctions using the Gel'fand $β$-transform. The electron Hamiltonian is the standard one coupling the particles with the propagating fields. The dynamics conserves energy and excludes self-acceleration. A complete Hamiltonian formulation results from adding electrostatic action-at-a-distance coupling between electrons.
△ Less
Submitted 18 July, 2013; v1 submitted 25 March, 2013;
originally announced March 2013.