
FedPIDAvg: A PID controller inspired aggregation method for Federated Learning

Authors: Leon Mächler, Ivan Ezhov, Suprosanna Shit, Johannes C. Paetzold

Abstract: This paper presents FedPIDAvg, the winning submission to the Federated Tumor Segmentation Challenge 2022 (FETS22). Inspired by FedCostWAvg, our winning contribution to FETS21, we contribute an improved aggregation strategy for federated and collaborative learning. FedCostWAvg is a weighted averaging method that not only considers the number of training samples of each cluster but also the size of the drop of the respective cost function in the last federated round. This can be interpreted as the derivative part of a PID controller (proportional-integral-derivative controller). In FedPIDAvg, we further add the missing integral term. Another key challenge was the vastly varying size of data samples per center. We addressed this by modeling the data center sizes as following a Poisson distribution and choosing the training iterations per center accordingly. Our method outperformed all other submissions.

Submitted 24 April, 2023; originally announced April 2023.

arXiv:2212.08568 [pdf, other]

Biomedical image analysis competitions: The state of current participation practice

Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, et al. (331 additional authors not shown)

Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

arXiv:2205.04550 [pdf, other]

A for-loop is all you need. For solving the inverse problem in the case of personalized tumor growth modeling

Authors: Ivan Ezhov, Marcel Rosier, Lucas Zimmer, Florian Kofler, Suprosanna Shit, Johannes Paetzold, Kevin Scibilia, Leon Maechler, Katharina Franitza, Tamaz Amiranashvili, Martin J. Menten, Marie Metz, Sailesh Conjeti, Benedikt Wiestler, Bjoern Menze

Abstract: Solving the inverse problem is the key step in evaluating the capacity of a physical model to describe real phenomena. In medical image computing, it aligns with the classical theme of image-based model personalization. Traditionally, a solution to the problem is obtained by performing either sampling or variational inference based methods. Both approaches aim to identify a set of free physical model parameters that results in a simulation best matching an empirical observation. When applied to brain tumor modeling, one of the instances of image-based model personalization in medical image computing, the overarching drawback of the methods is the time complexity for finding such a set. In a clinical setting with limited time between imaging and diagnosis or even intervention, this time complexity may prove critical. As the history of quantitative science is the history of compression, we align in this paper with the historical tendency and propose a method compressing complex traditional strategies for solving an inverse problem into a simple database query task. We evaluated different ways of performing the database query task assessing the trade-off between accuracy and execution time. On the exemplary task of brain tumor growth modeling, we prove that the proposed method achieves one order speed-up compared to existing approaches for solving the inverse problem. The resulting compute time offers critical means for relying on more complex and, hence, realistic models, for integrating image preprocessing and inverse modeling even deeper, or for implementing the current model into a clinical workflow.

Submitted 11 July, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

arXiv:2111.08649 [pdf, other]

FedCostWAvg: A new averaging for better Federated Learning

Authors: Leon Mächler, Ivan Ezhov, Florian Kofler, Suprosanna Shit, Johannes C. Paetzold, Timo Loehr, Benedikt Wiestler, Bjoern Menze

Abstract: We propose a simple new aggregation strategy for federated learning that won the MICCAI Federated Tumor Segmentation Challenge 2021 (FETS), the first ever challenge on Federated Learning in the Machine Learning community. Our method addresses the problem of how to aggregate multiple models that were trained on different data sets. Conceptually, we propose a new way to choose the weights when averaging the different models, thereby extending the current state of the art (FedAvg). Empirical validation demonstrates that our approach reaches a notable improvement in segmentation performance compared to FedAvg.

Submitted 16 November, 2021; originally announced November 2021.

arXiv:2109.01467 [pdf, other]

Semi-Implicit Neural Solver for Time-dependent Partial Differential Equations

Authors: Suprosanna Shit, Ivan Ezhov, Leon Mächler, Abinav R., Jana Lipkova, Johannes C. Paetzold, Florian Kofler, Marie Piraud, Bjoern H. Menze

Abstract: Fast and accurate solutions of time-dependent partial differential equations (PDEs) are of pivotal interest to many research fields, including physics, engineering, and biology. Generally, implicit/semi-implicit schemes are preferred over explicit ones to improve stability and correctness. However, existing semi-implicit methods are usually iterative and employ a general-purpose solver, which may be sub-optimal for a specific class of PDEs. In this paper, we propose a neural solver to learn an optimal iterative scheme in a data-driven fashion for any class of PDEs. Specifically, we modify a single iteration of a semi-implicit solver using a deep neural network. We provide theoretical guarantees for the correctness and convergence of neural solvers analogous to conventional iterative solvers. In addition to the commonly used Dirichlet boundary condition, we adopt a diffuse domain approach to incorporate a diverse type of boundary conditions, e.g., Neumann. We show that the proposed neural solver can go beyond linear PDEs and applies to a class of non-linear PDEs, where the non-linear component is non-stiff. We demonstrate the efficacy of our method on 2D and 3D scenarios. To this end, we show how our model generalizes to parameter settings, which are different from training; and achieves faster convergence than semi-implicit schemes.

Submitted 3 September, 2021; originally announced September 2021.

arXiv:2104.09982 [pdf, other]

Explaining the Entombed Algorithm

Authors: Leon Mächler, David Naccache

Abstract: In \cite{entombed}, John Aycock and Tara Copplestone pose an open question, namely the explanation of the mysterious lookup table used in the Entombed Game's Algorithm for two dimensional maze generation. The question attracted media attention (BBC etc) and was open until today. This paper answers this question, explains the algorithm and even extends it to three dimensions.

Submitted 21 April, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

arXiv:1312.3725 [pdf, ps, other]

doi 10.1103/PhysRevB.89.024501

Magnetic field-dependence of the basal-plane superconducting anisotropy in YBa2Cu4O8 from small-angle neutron scattering measurements of the vortex lattice

Authors: Jonathan S. White, Charlotte J. Bowell, Alistair S. Cameron, Richard W. Heslop, Joël Mesot, Jorge L. Gavilano, Simon Strässle, Lars Mächler, Rustem Khasanov, Charles D. Dewhurst, Janusz Karpinski, Edward M. Forgan

Abstract: We report a study of the basal-plane anisotropy of the superfluid density in underdoped YBa$_{2}$Cu$_{4}$O$_{8}$ (Y124), showing the effects of both the CuO$_{2}$ planes and the fully occupied CuO chains. From small-angle neutron scattering measurements of the vortex lattice, we can infer the superconducting (SC) properties for a temperature ($T$) range $T=$ 1.5 K to $T_{\rm c}$ and magnetic induction $B$ from 0.1 to 6 T. We find that the superfluid density along \textbf{a} has a simple $d$-wave T-dependence. However, along \textbf{b} (the chain direction) the superfluid density falls much more rapidly with $T$ and also with increasing field. This strongly suggests the suppression of proximity-effect induced superconductivity in the CuO chains. In addition, our results do not support a common framework for the low field in-plane SC response in Y124 and related YBa$_{2}$Cu$_{3}$O$_{7}$, and also indicate that any magnetic field-induced charge-density-wave order in Y124 exists only for fields above 6 T.

Submitted 13 December, 2013; originally announced December 2013.

Comments: 8 pages, accepted in Phys. Rev. B

Journal ref: Phys. Rev. B 89, 024501 (2014)

arXiv:1204.2152 [pdf, ps, other]

doi 10.1103/PhysRevB.85.134520

Spin density wave induced disordering of the vortex lattice in superconducting La$_{2-x}$Sr$_x$CuO$_4$

Authors: J. Chang, J. S. White, M. Laver, C. J. Bowell, S. P. Brown, A. T. Holmes, L. Maechler, S. Strassle, R. Gilardi, S. Gerber, T. Kurosawa, N. Momono, M. Oda, M. Ido, O. J. Lipscombe, S. M. Hayden, C. D. Dewhurst, R. Vavrin, J. Gavilano, J. Kohlbrecher, E. M. Forgan, J. Mesot

Abstract: We use small angle neutron scattering to study the superconducting vortex lattice in La$_{2-x}$Sr$_x$CuO$_4$ as a function of doping and magnetic field. We show that near optimally doping the vortex lattice coordination and the superconducting coherence length $ξ$ are controlled by a van-Hove singularity crossing the Fermi level near the Brillouin zone boundary. The vortex lattice properties change dramatically as a spin-density-wave instability is approached upon underdoping. The Bragg glass paradigm provides a good description of this regime and suggests that SDW order acts as a novel source of disorder on the vortex lattice.

Submitted 10 April, 2012; originally announced April 2012.

Comments: Accepted in Phys. Rev. B

Journal ref: Phys. Rev. B 85, 134520 (2012)

Showing 1–8 of 8 results for author: Maechler, L