Showing 1–13 of 13 results for author: Gossmann, A

  1. arXiv:2403.14356  [pdf, other]

    cs.LG cs.SE

    DomainLab: A modular Python package for domain generalization in deep learning

    Authors: Xudong Sun, Carla Feistner, Alexej Gossmann, George Schwarz, Rao Muhammad Umer, Lisa Beer, Patrick Rockenschaub, Rahul Babu Shrestha, Armin Gruber, Nutan Chen, Sayedali Shetab Boushehri, Florian Buettner, Carsten Marr

    Abstract: Poor generalization performance caused by distribution shifts in unseen domains often hinders the trustworthy deployment of deep neural networks. Many domain generalization techniques address this problem by adding domain-invariant regularization loss terms during training. However, there is a lack of modular software that allows users to combine the advantages of different methods with minimal…

    Submitted 21 March, 2024; originally announced March 2024.

  2. arXiv:2403.13728  [pdf, ps, other]

    cs.LG cs.AI

    M-HOF-Opt: Multi-Objective Hierarchical Output Feedback Optimization via Multiplier Induced Loss Landscape Scheduling

    Authors: Xudong Sun, Nutan Chen, Alexej Gossmann, Yu Xing, Carla Feistner, Emilio Dorigatti, Felix Drost, Daniele Scarcella, Lisa Beer, Carsten Marr

    Abstract: We address the online combinatorial choice of weight multipliers for multi-objective optimization of many loss terms parameterized by neural networks via a probabilistic graphical model (PGM) for the joint model parameter and multiplier evolution process, with a hypervolume-based likelihood promoting multi-objective descent. The corresponding parameter and multiplier estimation as a sequential decisi…

    Submitted 10 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  3. arXiv:2402.14254  [pdf, other]

    cs.LG stat.ML

    A hierarchical decomposition for explaining ML performance discrepancies

    Authors: Jean Feng, Harvineet Singh, Fan Xia, Adarsh Subbaswamy, Alexej Gossmann

    Abstract: Machine learning (ML) algorithms can often differ in performance across domains. Understanding $\textit{why}$ their performance differs is crucial for determining what types of interventions (e.g., algorithmic or operational) are most effective at closing the performance gaps. Existing methods focus on $\textit{aggregate decompositions}$ of the total performance gap into the impact of a shift in t…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 11 pages, 5 figures in main body; 14 pages and 2 figures in appendices

  4. arXiv:2311.11463  [pdf, other]

    cs.LG stat.ML

    Designing monitoring strategies for deployed machine learning algorithms: navigating performativity through a causal lens

    Authors: Jean Feng, Adarsh Subbaswamy, Alexej Gossmann, Harvineet Singh, Berkman Sahiner, Mi-Ok Kim, Gene Pennello, Nicholas Petrick, Romain Pirracchio, Fan Xia

    Abstract: After a machine learning (ML)-based system is deployed, monitoring its performance is important to ensure the safety and effectiveness of the algorithm over time. When an ML algorithm interacts with its environment, the algorithm can affect the data-generating mechanism and be a major source of bias when evaluating its standalone performance, an issue known as performativity. Although prior work h…

    Submitted 26 February, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  5. arXiv:2307.15247  [pdf, other]

    cs.LG stat.ME stat.ML

    Is this model reliable for everyone? Testing for strong calibration

    Authors: Jean Feng, Alexej Gossmann, Romain Pirracchio, Nicholas Petrick, Gene Pennello, Berkman Sahiner

    Abstract: In a well-calibrated risk prediction model, the average predicted probability is close to the true event rate for any given subgroup. Such models are reliable across heterogeneous populations and satisfy strong notions of algorithmic fairness. However, the task of auditing a model for strong calibration is well-known to be difficult -- particularly for machine learning (ML) algorithms -- due to th…

    Submitted 27 July, 2023; originally announced July 2023.

  6. arXiv:2211.09781  [pdf, other]

    stat.ML cs.CY cs.LG

    Monitoring machine learning (ML)-based risk prediction algorithms in the presence of confounding medical interventions

    Authors: Jean Feng, Alexej Gossmann, Gene Pennello, Nicholas Petrick, Berkman Sahiner, Romain Pirracchio

    Abstract: Performance monitoring of machine learning (ML)-based risk prediction models in healthcare is complicated by the issue of confounding medical interventions (CMI): when an algorithm predicts a patient to be at high risk for an adverse event, clinicians are more likely to administer prophylactic treatment and alter the very target that the algorithm aims to predict. A simple approach is to ignore CM…

    Submitted 14 April, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

  7. arXiv:2203.11377  [pdf, other]

    stat.ML cs.LG stat.ME

    Sequential algorithmic modification with test data reuse

    Authors: Jean Feng, Gene Pennello, Nicholas Petrick, Berkman Sahiner, Romain Pirracchio, Alexej Gossmann

    Abstract: After initial release of a machine learning algorithm, the model can be fine-tuned by retraining on subsequently gathered data, adding newly discovered features, or more. Each modification introduces a risk of deteriorating performance and must be validated on a test dataset. It may not always be practical to assemble a new dataset for testing each modification, especially when most modifications…

    Submitted 21 March, 2022; originally announced March 2022.

  8. arXiv:2110.06866  [pdf, other]

    stat.ML cs.LG stat.AP

    Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees

    Authors: Jean Feng, Alexej Gossmann, Berkman Sahiner, Romain Pirracchio

    Abstract: After deploying a clinical prediction model, subsequently collected data can be used to fine-tune its predictions and adapt to temporal shifts. Because model updating carries risks of over-updating/fitting, we study online methods with performance guarantees. We introduce two procedures for continual recalibration or revision of an underlying prediction model: Bayesian logistic regression (BLR) an…

    Submitted 13 October, 2021; originally announced October 2021.

  9. arXiv:2007.00479  [pdf, ps, other]

    stat.ML cs.LG math.NA

    The Restricted Isometry of ReLU Networks: Generalization through Norm Concentration

    Authors: Alex Goeßmann, Gitta Kutyniok

    Abstract: While regression tasks aim at interpolating a relation on the entire input space, they often have to be solved with a limited amount of training data. Still, if the hypothesis functions can be sketched well with the data, one can hope for identifying a generalizing model. In this work, we introduce with the Neural Restricted Isometry Property (NeuRIP) a uniform concentration event, in which all…

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: 27 pages, 5 figures

    MSC Class: G.3 ACM Class: F.2; G.3

  10. arXiv:2003.12081  [pdf, other]

    physics.comp-ph cond-mat.mtrl-sci cs.LG physics.chem-ph

    Representations of molecules and materials for interpolation of quantum-mechanical simulations via machine learning

    Authors: Marcel F. Langer, Alex Goeßmann, Matthias Rupp

    Abstract: Computational study of molecules and materials from first principles is a cornerstone of physics, chemistry, and materials science, but limited by the cost of accurate and precise simulations. In settings involving many simulations, machine learning can reduce these costs, often by orders of magnitude, by interpolating between reference simulations. This requires representations that describe any…

    Submitted 9 February, 2021; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: 20 pages, 6 figures, excluding supplement (19 pages, 5 figures); v2: extended review and discussion, more representations covered, edited for clarity. For additional information, including datasets, results, and software see https://marcel.science/repbench

  11. arXiv:2002.12388  [pdf, other]

    math.NA cs.LG math.DS quant-ph stat.ML

    Tensor network approaches for learning non-linear dynamical laws

    Authors: A. Goeßmann, M. Götte, I. Roth, R. Sweke, G. Kutyniok, J. Eisert

    Abstract: Given observations of a physical system, identifying the underlying non-linear governing equation is a fundamental task, necessary both for gaining understanding and generating deterministic future predictions. Of most practical relevance are automated approaches to theory building that scale efficiently for complex systems with many degrees of freedom. To date, available scalable methods aim at a…

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 17 pages, 8 figures

  12. arXiv:1906.02972  [pdf, other]

    cs.LG stat.ML

    Variational Resampling Based Assessment of Deep Neural Networks under Distribution Shift

    Authors: Xudong Sun, Alexej Gossmann, Yu Wang, Bernd Bischl

    Abstract: A novel variational inference based resampling framework is proposed to evaluate the robustness and generalization capability of deep learning models with respect to distribution shift. We use Auto Encoding Variational Bayes to find a latent representation of the data, on which a Variational Gaussian Mixture Model is applied to deliberately create distribution shift by dividing the dataset into di…

    Submitted 27 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  13. arXiv:1904.01070  [pdf]

    cs.LG cs.NE q-bio.NC stat.ML

    Multimodal Sparse Classifier for Adolescent Brain Age Prediction

    Authors: Peyman Hosseinzadeh Kassani, Alexej Gossmann, Yu-Ping Wang

    Abstract: The study of healthy brain development helps to better understand the brain transformation and brain connectivity patterns which happen during childhood to adulthood. This study presents a sparse machine learning solution across whole-brain functional connectivity (FC) measures of three sets of data, derived from resting state functional magnetic resonance imaging (rs-fMRI) and task fMRI data, inc…

    Submitted 1 April, 2019; originally announced April 2019.