Skip to main content

Showing 1–4 of 4 results for author: Metz, Y

.
  1. arXiv:2401.01955  [pdf, other

    cs.HC cs.MM

    MULTI-CASE: A Transformer-based Ethics-aware Multimodal Investigative Intelligence Framework

    Authors: Maximilian T. Fischer, Yannick Metz, Lucas Joos, Matthias Miller, Daniel A. Keim

    Abstract: AI-driven models are increasingly deployed in operational analytics solutions, for instance, in investigative journalism or the intelligence community. Current approaches face two primary challenges: ethical and privacy concerns, as well as difficulties in efficiently combining heterogeneous data sources for multimodal analytics. To tackle the challenge of multimodal analytics, we present MULTI-CA… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 6 pages, 3 figures, 1 table

  2. arXiv:2308.04332  [pdf, other

    cs.LG cs.HC

    RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

    Authors: Yannick Metz, David Lindner, Raphaƫl Baur, Daniel Keim, Mennatallah El-Assady

    Abstract: To use reinforcement learning from human feedback (RLHF) in practical applications, it is crucial to learn reward models from diverse sources of human feedback and to consider human factors involved in providing feedback of different types. However, the systematic study of learning from diverse types of feedback is held back by limited standardized tooling available to researchers. To bridge this… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 14 pages, 3 figures

    Journal ref: ICML2023 Interactive Learning from Implicit Human Feedback Workshop

  3. arXiv:2210.03649  [pdf, other

    cs.LG cs.AI cs.MA cs.RO

    How to Enable Uncertainty Estimation in Proximal Policy Optimization

    Authors: Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann

    Abstract: While deep reinforcement learning (RL) agents have showcased strong results across many domains, a major concern is their inherent opaqueness and the safety of such systems in real-world use cases. To overcome these issues, we need agents that can quantify their uncertainty and detect out-of-distribution (OOD) states. Existing uncertainty estimation techniques, like Monte-Carlo Dropout or Deep Ens… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    ACM Class: I.2; I.2.6; I.2.8; I.2.9; I.2.10

  4. arXiv:2208.10481  [pdf, other

    cs.LG cs.AI cs.CR cs.CV cs.RO

    BARReL: Bottleneck Attention for Adversarial Robustness in Vision-Based Reinforcement Learning

    Authors: Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann

    Abstract: Robustness to adversarial perturbations has been explored in many areas of computer vision. This robustness is particularly relevant in vision-based reinforcement learning, as the actions of autonomous agents might be safety-critic or impactful in the real world. We investigate the susceptibility of vision-based reinforcement learning agents to gradient-based adversarial attacks and evaluate a pot… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 5 pages, 2 figures, 3 tables

    ACM Class: I.2.6; I.2.8; I.2.9; I.2.10; I.5.4