Skip to main content

Showing 1–9 of 9 results for author: Baum, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03391  [pdf, other

    cs.CR cs.AI cs.CL

    Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning

    Authors: Simon Ostermann, Kevin Baum, Christoph Endres, Julia Masloh, Patrick Schramowski

    Abstract: Prompt injection (both direct and indirect) and jailbreaking are now recognized as significant issues for large language models (LLMs), particularly due to their potential for harm in application-integrated contexts. This extended abstract explores a novel approach to protecting LLMs from such attacks, termed "soft begging." This method involves training soft prompts to counteract the effects of c… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. On the Quest for Effectiveness in Human Oversight: Interdisciplinary Perspectives

    Authors: Sarah Sterz, Kevin Baum, Sebastian Biewer, Holger Hermanns, Anne Lauber-Rönsberg, Philip Meinel, Markus Langer

    Abstract: Human oversight is currently discussed as a potential safeguard to counter some of the negative aspects of high-risk AI applications. This prompts a critical examination of the role and conditions necessary for what is prominently termed effective or meaningful human oversight of these systems. This paper investigates effective human oversight by synthesizing insights from psychological, legal, ph… ▽ More

    Submitted 7 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: 13 pages, 1 figure, 1 table; ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT) 2024

  3. arXiv:2312.07252  [pdf, other

    cs.LG stat.ML

    Identifying Drivers of Predictive Aleatoric Uncertainty

    Authors: Pascal Iversen, Simon Witzke, Katharina Baum, Bernhard Y. Renard

    Abstract: Explainability and uncertainty quantification are two pillars of trustable artificial intelligence. However, the reasoning behind uncertainty estimates is generally left unexplained. Identifying the drivers of uncertainty complements explanations of point predictions in recognizing model limitations and enhances trust in decisions and their communication. So far, explanations of uncertainties have… ▽ More

    Submitted 30 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Simon Witzke and Pascal Iversen contributed equally

  4. arXiv:2308.06186  [pdf, other

    cs.CY cs.AI cs.LO

    Software Do** Analysis for Human Oversight

    Authors: Sebastian Biewer, Kevin Baum, Sarah Sterz, Holger Hermanns, Sven Hetmank, Markus Langer, Anne Lauber-Rönsberg, Franz Lehr

    Abstract: This article introduces a framework that is meant to assist in mitigating societal risks that software can pose. Concretely, this encompasses facets of software do** as well as unfairness and discrimination in high-risk decision-making systems. The term software do** refers to software that contains surreptitiously added functionality that is against the interest of the user. A prominent examp… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Submitted to Formal Methods in System Design, special issue for FASE 2022

  5. arXiv:2304.04000  [pdf, other

    cs.LG

    SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data

    Authors: Maximilian Kleissl, Lukas Drews, Benedict B. Heyder, Julian Zabbarov, Pascal Iversen, Simon Witzke, Bernhard Y. Renard, Katharina Baum

    Abstract: Training sophisticated machine learning (ML) models requires large datasets that are difficult or expensive to collect for many applications. If prior knowledge about system dynamics is available, mechanistic representations can be used to supplement real-world data. We present SimbaML (Simulation-Based ML), an open-source tool that unifies realistic synthetic dataset generation from ordinary diff… ▽ More

    Submitted 9 July, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

    Comments: 6 pages, 1 figure

  6. arXiv:2210.03211  [pdf, other

    cs.SI cs.DS

    LazyFox: Fast and parallelized overlap** community detection in large graphs

    Authors: Tim Garrels, Athar Khodabakhsh, Bernhard Y. Renard, Katharina Baum

    Abstract: The detection of communities in graph datasets provides insight about a graph's underlying structure and is an important tool for various domains such as social sciences, marketing, traffic forecast, and drug discovery. While most existing algorithms provide fast approaches for community detection, their results usually contain strictly separated communities. However, most datasets would semantica… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: 17 pages, 5 figures

  7. arXiv:2108.07711  [pdf, ps, other

    cs.CY

    Explainability Auditing for Intelligent Systems: A Rationale for Multi-Disciplinary Perspectives

    Authors: Markus Langer, Kevin Baum, Kathrin Hartmann, Stefan Hessel, Timo Speith, Jonas Wahl

    Abstract: National and international guidelines for trustworthy artificial intelligence (AI) consider explainability to be a central facet of trustworthy systems. This paper outlines a multi-disciplinary rationale for explainability auditing. Specifically, we propose that explainability auditing can ensure the quality of explainability of systems in applied contexts and can be the basis for certification as… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: Accepted at the First International Workshop on Requirements Engineering for Explainable Systems (RE4ES) co-located with the 29th IEEE International Requirements Engineering Conference (RE'21)

  8. What Do We Want From Explainable Artificial Intelligence (XAI)? -- A Stakeholder Perspective on XAI and a Conceptual Model Guiding Interdisciplinary XAI Research

    Authors: Markus Langer, Daniel Oster, Timo Speith, Holger Hermanns, Lena Kästner, Eva Schmidt, Andreas Sesing, Kevin Baum

    Abstract: Previous research in Explainable Artificial Intelligence (XAI) suggests that a main aim of explainability approaches is to satisfy specific interests, goals, expectations, needs, and demands regarding artificial systems (we call these stakeholders' desiderata) in a variety of contexts. However, the literature on XAI is vast, spreads out across multiple largely disconnected disciplines, and it ofte… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 57 pages, 2 figures, 1 table, to be published in Artificial Intelligence, Markus Langer, Daniel Oster and Timo Speith share first-authorship of this paper

  9. Towards a Framework Combining Machine Ethics and Machine Explainability

    Authors: Kevin Baum, Holger Hermanns, Timo Speith

    Abstract: We find ourselves surrounded by a rapidly increasing number of autonomous and semi-autonomous systems. Two grand challenges arise from this development: Machine Ethics and Machine Explainability. Machine Ethics, on the one hand, is concerned with behavioral constraints for systems, so that morally acceptable, restricted behavior results; Machine Explainability, on the other hand, enables systems t… ▽ More

    Submitted 2 January, 2019; originally announced January 2019.

    Comments: In Proceedings CREST 2018, arXiv:1901.00073

    Journal ref: EPTCS 286, 2019, pp. 34-49