Showing 1–11 of 11 results for author: Kern, C

  1. arXiv:2405.18206  [pdf, other]

    cs.LG stat.ME stat.ML

    Multi-CATE: Multi-Accurate Conditional Average Treatment Effect Estimation Robust to Unknown Covariate Shifts

    Authors: Christoph Kern, Michael Kim, Angela Zhou

    Abstract: Estimating heterogeneous treatment effects is important to tailor treatments to those individuals who would most likely benefit. However, conditional average treatment effect predictors may often be trained on one population but possibly deployed on different, possibly unknown populations. We use methodology for learning multi-accurate predictors to post-process CATE T-learners (differenced regres…

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2404.17293  [pdf, other]

    cs.LG cs.CY stat.AP stat.ML

    Lazy Data Practices Harm Fairness Research

    Authors: Jan Simson, Alessandro Fabris, Christoph Kern

    Abstract: Data practices shape research and practice on fairness in machine learning (fair ML). Critical data studies offer important reflections and critiques for the responsible advancement of the field by highlighting shortcomings and proposing recommendations for improvement. In this work, we present a comprehensive analysis of fair ML datasets, demonstrating how unreflective yet common practices hinder…

    Submitted 18 June, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Journal ref: FAccT '24: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (2024) 642-659

  3. arXiv:2402.09328  [pdf, other]

    stat.ML cs.LG stat.ME

    Connecting Algorithmic Fairness to Quality Dimensions in Machine Learning in Official Statistics and Survey Production

    Authors: Patrick Oliver Schenk, Christoph Kern

    Abstract: National Statistical Organizations (NSOs) increasingly draw on Machine Learning (ML) to improve the timeliness and cost-effectiveness of their products. When introducing ML solutions, NSOs must ensure that high standards with respect to robustness, reproducibility, and accuracy are upheld as codified, e.g., in the Quality Framework for Statistical Algorithms (QF4SA; Yung et al. 2022). At the same…

    Submitted 14 February, 2024; originally announced February 2024.

  4. arXiv:2311.14212  [pdf, other]

    stat.ML cs.CL cs.LG stat.ME

    Annotation Sensitivity: Training Data Collection Methods Affect Model Performance

    Authors: Christoph Kern, Stephanie Eckman, Jacob Beck, Rob Chew, Bolei Ma, Frauke Kreuter

    Abstract: When training data are collected from human annotators, the design of the annotation instrument, the instructions given to annotators, the characteristics of the annotators, and their interactions can impact training data. This study demonstrates that design choices made when creating an annotation instrument also impact the models trained on the resulting annotations. We introduce the term annota…

    Submitted 22 January, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 Findings: https://aclanthology.org/2023.findings-emnlp.992/

  5. arXiv:2310.19091  [pdf, other]

    cs.LG cs.CY cs.HC stat.ME

    Bridging the Gap: Towards an Expanded Toolkit for ML-Supported Decision-Making in the Public Sector

    Authors: Unai Fischer-Abaigar, Christoph Kern, Noam Barda, Frauke Kreuter

    Abstract: Machine Learning (ML) systems are becoming instrumental in the public sector, with applications spanning areas like criminal justice, social welfare, financial fraud detection, and public health. While these systems offer great potential benefits to institutional decision-making processes, such as improved efficiency and reliability, they still face the challenge of aligning nuanced policy objecti…

    Submitted 26 April, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

  6. One Model Many Scores: Using Multiverse Analysis to Prevent Fairness Hacking and Evaluate the Influence of Model Design Decisions

    Authors: Jan Simson, Florian Pfisterer, Christoph Kern

    Abstract: A vast number of systems across the world use algorithmic decision making (ADM) to (partially) automate decisions that have previously been made by humans. The downstream effects of ADM systems critically depend on the decisions made during a systems' design, implementation, and evaluation, as biases in data can be mitigated or reinforced along the modeling pipeline. Many of these decisions are ma…

    Submitted 18 June, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Journal ref: FAccT '24: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (2024) 1305-1320

  7. arXiv:2211.02730  [pdf, other]

    stat.ML cs.LG

    Uncertainty-aware predictive modeling for fair data-driven decisions

    Authors: Patrick Kaiser, Christoph Kern, David Rügamer

    Abstract: Both industry and academia have made considerable progress in developing trustworthy and responsible machine learning (ML) systems. While critical concepts like fairness and explainability are often addressed, the safety of systems is typically not sufficiently taken into account. By viewing data-driven decision systems as socio-technical systems, we draw on the uncertainty in ML literature to sho…

    Submitted 4 November, 2022; originally announced November 2022.

  8. arXiv:2108.04134  [pdf, other]

    cs.CY cs.LG stat.AP

    Fairness in Algorithmic Profiling: A German Case Study

    Authors: Christoph Kern, Ruben L. Bach, Hannah Mautner, Frauke Kreuter

    Abstract: Algorithmic profiling is increasingly used in the public sector as a means to allocate limited public resources effectively and objectively. One example is the prediction-based statistical profiling of job seekers to guide the allocation of support measures by public employment services. However, empirical evaluations of potential side-effects such as unintended discrimination and fairness concern…

    Submitted 4 August, 2021; originally announced August 2021.

  9. arXiv:2105.01441  [pdf, other]

    stat.ML cs.LG

    Distributive Justice and Fairness Metrics in Automated Decision-making: How Much Overlap Is There?

    Authors: Matthias Kuppler, Christoph Kern, Ruben L. Bach, Frauke Kreuter

    Abstract: The advent of powerful prediction algorithms led to increased automation of high-stake decisions regarding the allocation of scarce resources such as government spending and welfare support. This automation bears the risk of perpetuating unwanted discrimination against vulnerable and historically disadvantaged groups. Research on algorithmic discrimination in computer science and other disciplines…

    Submitted 6 May, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

  10. arXiv:2012.11678  [pdf]

    stat.AP

    Global Trends and Predictors of Face Mask Usage During the COVID-19 Pandemic

    Authors: Elena Badillo-Goicoechea, Ting-Hsuan Chang, Esther Kim, Sarah LaRocca, Katherine Morris, Xiaoyi Deng, Samantha Chiu, Adrianne Bradford, Andres Garcia, Christoph Kern, Curtiss Cobb, Frauke Kreuter, Elizabeth A. Stuart

    Abstract: Background: Guidelines and recommendations from public health authorities related to face masks have been essential in containing the COVID-19 pandemic. We assessed the prevalence and correlates of mask usage during the pandemic. Methods: We examined a total of 13,723,810 responses to a daily cross-sectional representative online survey in 38 countries who completed from April 23, 2020 to Octobe…

    Submitted 8 January, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: 39 pages, 2 main figures, Appendix

  11. arXiv:1909.13361  [pdf, other]

    stat.ME cs.LG stat.AP

    A Longitudinal Framework for Predicting Nonresponse in Panel Surveys

    Authors: Christoph Kern, Bernd Weiss, Jan-Philipp Kolb

    Abstract: Nonresponse in panel studies can lead to a substantial loss in data quality due to its potential to introduce bias and distort survey estimates. Recent work investigates the usage of machine learning to predict nonresponse in advance, such that predicted nonresponse propensities can be used to inform the data collection process. However, predicting nonresponse in panel studies requires accounting…

    Submitted 2 November, 2019; v1 submitted 29 September, 2019; originally announced September 2019.