Skip to main content

Showing 1–12 of 12 results for author: Sonabend, R

.
  1. arXiv:2407.00345  [pdf

    cs.SE

    Unicorns Do Not Exist: Employing and Appreciating Community Managers in Open Source

    Authors: Raphael Sonabend, Anna Carnegie, Anne Lee Steele, Marie Nugent, Malvika Sharan

    Abstract: Open-source software is released under an open-source licence, which means the software can be shared, adapted, and reshared without prejudice. In the context of open-source software, community managers manage the communities that contribute to the development and upkeep of open-source tools. Despite playing a crucial role in maintaining open-source software, community managers are often overlooke… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2406.04098  [pdf, other

    stat.ML cs.LG

    A Large-Scale Neutral Comparison Study of Survival Models on Low-Dimensional Data

    Authors: Lukas Burk, John Zobolas, Bernd Bischl, Andreas Bender, Marvin N. Wright, Raphael Sonabend

    Abstract: This work presents the first large-scale neutral benchmark experiment focused on single-event, right-censored, low-dimensional survival data. Benchmark experiments are essential in methodological research to scientifically compare new and existing model classes through proper empirical evaluation. Existing benchmarks in the survival literature are often narrow in scope, focusing, for example, on h… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 42 pages, 28 figures

  3. arXiv:2403.13150  [pdf, other

    cs.LG cs.AI stat.CO stat.ML

    Training Survival Models using Scoring Rules

    Authors: Philipp Kopper, David Rügamer, Raphael Sonabend, Bernd Bischl, Andreas Bender

    Abstract: Survival Analysis provides critical insights for partially incomplete time-to-event data in various domains. It is also an important example of probabilistic machine learning. The probabilistic nature of the predictions can be exploited by using (proper) scoring rules in the model fitting process instead of likelihood-based optimization. Our proposal does so in a generic manner and can be used for… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2402.02824  [pdf

    cs.SE

    FAIR-USE4OS: Guidelines for Creating Impactful Open-Source Software

    Authors: Raphael Sonabend, Hugo Gruson, Leo Wolansky, Agnes Kiragga, Daniel S. Katz

    Abstract: This paper extends the FAIR (Findable, Accessible, Interoperable, Reusable) guidelines to provide criteria for assessing if software conforms to best practices in open source. By adding 'USE' (User-Centered, Sustainable, Equitable), software development can adhere to open source best practice by incorporating user-input early on, ensuring front-end designs are accessible to all possible stakeholde… ▽ More

    Submitted 3 April, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  5. Deep Learning for Survival Analysis: A Review

    Authors: Simon Wiegrebe, Philipp Kopper, Raphael Sonabend, Bernd Bischl, Andreas Bender

    Abstract: The influx of deep learning (DL) techniques into the field of survival analysis in recent years has led to substantial methodological progress; for instance, learning from unstructured or high-dimensional data such as images, text or omics data. In this work, we conduct a comprehensive systematic review of DL-based methods for time-to-event analysis, characterizing them according to both survival-… ▽ More

    Submitted 22 February, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 29 pages, 7 figures, 2 tables, 1 interactive table

    Journal ref: Artif Intell Rev 57, 65 (2024)

  6. arXiv:2212.05260  [pdf, other

    math.ST cs.LG stat.AP

    Examining properness in the external validation of survival models with squared and logarithmic losses

    Authors: Raphael Sonabend, John Zobolas, Philipp Kopper, Lukas Burk, Andreas Bender

    Abstract: Scoring rules promote rational and honest decision-making, which is becoming increasingly important for automated procedures in `auto-ML'. In this paper we survey common squared and logarithmic scoring rules for survival analysis and determine which losses are proper and improper. We prove that commonly utilised squared and logarithmic scoring rules that are claimed to be proper are in fact improp… ▽ More

    Submitted 3 June, 2024; v1 submitted 10 December, 2022; originally announced December 2022.

  7. arXiv:2206.03256  [pdf, other

    cs.CY cs.LG stat.AP stat.ME

    Flexible Group Fairness Metrics for Survival Analysis

    Authors: Raphael Sonabend, Florian Pfisterer, Alan Mishler, Moritz Schauer, Lukas Burk, Sumantrak Mukherjee, Sebastian Vollmer

    Abstract: Algorithmic fairness is an increasingly important field concerned with detecting and mitigating biases in machine learning models. There has been a wealth of literature for algorithmic fairness in regression and classification however there has been little exploration of the field for survival analysis. Survival analysis is the prediction task in which one attempts to predict the probability of an… ▽ More

    Submitted 22 July, 2022; v1 submitted 26 May, 2022; originally announced June 2022.

    Comments: Accepted in DSHealth 2022 (Workshop on Applied Data Science for Healthcare)

  8. arXiv:2112.04828  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Avoiding C-hacking when evaluating survival distribution predictions with discrimination measures

    Authors: Raphael Sonabend, Andreas Bender, Sebastian Vollmer

    Abstract: In this paper we consider how to evaluate survival distribution predictions with measures of discrimination. This is a non-trivial problem as discrimination measures are the most commonly used in survival analysis and yet there is no clear method to derive a risk prediction from a distribution prediction. We survey methods proposed in literature and software and consider their respective advantage… ▽ More

    Submitted 9 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

  9. arXiv:2101.04938  [pdf, other

    cs.SE cs.LG

    Designing Machine Learning Toolboxes: Concepts, Principles and Patterns

    Authors: Franz J. Király, Markus Löning, Anthony Blaom, Ahmed Guecioueur, Raphael Sonabend

    Abstract: Machine learning (ML) and AI toolboxes such as scikit-learn or Weka are workhorses of contemporary data scientific practice -- their central role being enabled by usable yet powerful designs that allow to easily specify, train and validate complex modeling pipelines. However, despite their universal success, the key design principles in their construction have never been fully analyzed. In this pa… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  10. arXiv:2009.02993  [pdf, other

    cs.SE cs.MS stat.AP stat.CO

    distr6: R6 Object-Oriented Probability Distributions Interface in R

    Authors: Raphael Sonabend, Franz Kiraly

    Abstract: distr6 is an object-oriented (OO) probability distributions interface leveraging the extensibility and scalability of R6, and the speed and efficiency of Rcpp. Over 50 probability distributions are currently implemented in the package with `core' methods including density, distribution, and generating functions, and more `exotic' ones including hazards and distribution function anti-derivatives. I… ▽ More

    Submitted 20 March, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: Accepted in The R Journal

  11. arXiv:2008.08080  [pdf, ps, other

    stat.CO cs.LG stat.ML

    mlr3proba: An R Package for Machine Learning in Survival Analysis

    Authors: Raphael Sonabend, Franz J. Király, Andreas Bender, Bernd Bischl, Michel Lang

    Abstract: As machine learning has become increasingly popular over the last few decades, so too has the number of machine learning interfaces for implementing these models. Whilst many R libraries exist for machine learning, very few offer extended support for survival analysis. This is problematic considering its importance in fields like medicine, bioinformatics, economics, engineering, and more. mlr3prob… ▽ More

    Submitted 14 December, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: Submitted to Bioinformatics

  12. arXiv:1812.07519  [pdf, other

    cs.LG cs.AI stat.ML

    NIPS - Not Even Wrong? A Systematic Review of Empirically Complete Demonstrations of Algorithmic Effectiveness in the Machine Learning and Artificial Intelligence Literature

    Authors: Franz J Király, Bilal Mateen, Raphael Sonabend

    Abstract: Objective: To determine the completeness of argumentative steps necessary to conclude effectiveness of an algorithm in a sample of current ML/AI supervised learning literature. Data Sources: Papers published in the Neural Information Processing Systems (NeurIPS, née NIPS) journal where the official record showed a 2017 year of publication. Eligibility Criteria: Studies reporting a (semi-)super… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.