Skip to main content

Showing 1–19 of 19 results for author: Lindner, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06886  [pdf, other

    cs.DB

    Enabling Data Dependency-based Query Optimization

    Authors: Daniel Lindner, Daniel Ritter, Felix Naumann

    Abstract: Data dependency-based query optimization techniques can considerably improve database system performance: we apply three such optimization techniques to five database management systems (DBMSs) and observe throughput improvements between 5 % and 33 %. We address two key challenges to achieve these results: (i) efficiently identifying and extracting relevant dependencies from the data, and (ii) mak… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2403.13793  [pdf, other

    cs.LG

    Evaluating Frontier Models for Dangerous Capabilities

    Authors: Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar, Maria Abi Raad, Albert Webson, Lewis Ho, Sharon Lin, Sebastian Farquhar, Marcus Hutter, Gregoire Deletang, Anian Ruoss, Seliem El-Sayed, Sasha Brown, Anca Dragan, Rohin Shah , et al. (2 additional authors not shown)

    Abstract: To understand the risks posed by a new AI system, we must understand what it can and cannot do. Building on prior work, we introduce a programme of new "dangerous capability" evaluations and pilot them on Gemini 1.0 models. Our evaluations cover four areas: (1) persuasion and deception; (2) cyber-security; (3) self-proliferation; and (4) self-reasoning. We do not find evidence of strong dangerous… ▽ More

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  3. arXiv:2310.12921  [pdf, other

    cs.LG cs.AI

    Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning

    Authors: Juan Rocamonde, Victoriano Montesinos, Elvis Nava, Ethan Perez, David Lindner

    Abstract: Reinforcement learning (RL) requires either manually specifying a reward function, which is often infeasible, or learning a reward model from a large amount of human feedback, which is often very expensive. We study a more sample-efficient alternative: using pretrained vision-language models (VLMs) as zero-shot reward models (RMs) to specify tasks via natural language. We propose a natural and gen… ▽ More

    Submitted 14 March, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Presented at International Conference on Learning Representations (ICLR) 2024

  4. arXiv:2308.04332  [pdf, other

    cs.LG cs.HC

    RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

    Authors: Yannick Metz, David Lindner, Raphaël Baur, Daniel Keim, Mennatallah El-Assady

    Abstract: To use reinforcement learning from human feedback (RLHF) in practical applications, it is crucial to learn reward models from diverse sources of human feedback and to consider human factors involved in providing feedback of different types. However, the systematic study of learning from diverse types of feedback is held back by limited standardized tooling available to researchers. To bridge this… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 14 pages, 3 figures

    Journal ref: ICML2023 Interactive Learning from Implicit Human Feedback Workshop

  5. arXiv:2307.15217  [pdf, other

    cs.AI cs.CL cs.LG

    Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

    Authors: Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen , et al. (7 additional authors not shown)

    Abstract: Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state-of-the-art large language models (LLMs). Despite this popularity, there has been relatively little public work systematizing its flaws. In this paper, we (1) survey open problems and fundamental limitations of RLHF and rel… ▽ More

    Submitted 11 September, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  6. arXiv:2305.16147  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Safety Constraints from Demonstrations with Unknown Rewards

    Authors: David Lindner, Xin Chen, Sebastian Tschiatschek, Katja Hofmann, Andreas Krause

    Abstract: We propose Convex Constraint Learning for Reinforcement Learning (CoCoRL), a novel approach for inferring shared constraints in a Constrained Markov Decision Process (CMDP) from a set of safe demonstrations with possibly different reward functions. While previous work is limited to demonstrations with known rewards or fully known environment dynamics, CoCoRL can learn constraints from demonstratio… ▽ More

    Submitted 1 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Presented at the International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  7. arXiv:2301.05062  [pdf, other

    cs.LG cs.AI stat.ML

    Tracr: Compiled Transformers as a Laboratory for Interpretability

    Authors: David Lindner, János Kramár, Sebastian Farquhar, Matthew Rahtz, Thomas McGrath, Vladimir Mikulik

    Abstract: We show how to "compile" human-readable programs into standard decoder-only transformer models. Our compiler, Tracr, generates models with known structure. This structure can be used to design experiments. For example, we use it to study "superposition" in transformers that execute multi-step algorithms. Additionally, the known structure of Tracr-compiled models can serve as ground-truth for evalu… ▽ More

    Submitted 3 November, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Presented at NeurIPS 2023 (Spotlight)

  8. arXiv:2210.04610  [pdf, other

    cs.AI cs.CR cs.CV cs.CY cs.LG

    Red-Teaming the Stable Diffusion Safety Filter

    Authors: Javier Rando, Daniel Paleka, David Lindner, Lennart Heim, Florian Tramèr

    Abstract: Stable Diffusion is a recent open-source image generation model comparable to proprietary models such as DALLE, Imagen, or Parti. Stable Diffusion comes with a safety filter that aims to prevent generating explicit images. Unfortunately, the filter is obfuscated and poorly documented. This makes it hard for users to prevent misuse in their applications, and to understand the filter's limitations a… ▽ More

    Submitted 10 November, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: ML Safety Workshop NeurIPS 2022

  9. arXiv:2207.08645  [pdf, other

    cs.LG cs.AI stat.ML

    Active Exploration for Inverse Reinforcement Learning

    Authors: David Lindner, Andreas Krause, Giorgia Ramponi

    Abstract: Inverse Reinforcement Learning (IRL) is a powerful paradigm for inferring a reward function from expert demonstrations. Many IRL algorithms require a known transition model and sometimes even a known expert policy, or they at least require access to a generative model. However, these assumptions are too strong for many real-world applications, where the environment can be accessed only through seq… ▽ More

    Submitted 22 August, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Presented at Conference on Neural Information Processing Systems (NeurIPS), 2022

  10. arXiv:2206.13316  [pdf, other

    cs.LG cs.HC stat.ML

    Humans are not Boltzmann Distributions: Challenges and Opportunities for Modelling Human Feedback and Interaction in Reinforcement Learning

    Authors: David Lindner, Mennatallah El-Assady

    Abstract: Reinforcement learning (RL) commonly assumes access to well-specified reward functions, which many practical applications do not provide. Instead, recently, more work has explored learning what to do from interacting with humans. So far, most of these approaches model humans as being (nosily) rational and, in particular, giving unbiased feedback. We argue that these models are too simplistic and t… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted to Communication in Human-AI Interaction Workshop (CHAI) at IJCAI-ECAI-22

  11. arXiv:2206.05255  [pdf, other

    cs.LG cs.AI stat.ML

    Interactively Learning Preference Constraints in Linear Bandits

    Authors: David Lindner, Sebastian Tschiatschek, Katja Hofmann, Andreas Krause

    Abstract: We study sequential decision-making with known rewards and unknown constraints, motivated by situations where the constraints represent expensive-to-evaluate human preferences, such as safe and comfortable driving behavior. We formalize the challenge of interactively learning about these constraints as a novel linear bandit problem which we call constrained linear best-arm identification. To solve… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: Accepted to International Conference on Machine Learning (ICML), 2022

  12. GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

    Authors: Bhavya Sukhija, Matteo Turchetta, David Lindner, Andreas Krause, Sebastian Trimpe, Dominik Baumann

    Abstract: Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage. Most existing model-free learning methods that guarantee safety, i.e., no failures, during exploration are limited to local optima. A notable exception is the GoSafe algorithm, which, unfortunately, cannot handle high-dimensional systems and hence cannot be… ▽ More

    Submitted 12 June, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Journal ref: Artificial Intelligence, Volume 320, Year 2023

  13. arXiv:2106.01325  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Addressing the Long-term Impact of ML Decisions via Policy Regret

    Authors: David Lindner, Hoda Heidari, Andreas Krause

    Abstract: Machine Learning (ML) increasingly informs the allocation of opportunities to individuals and communities in areas such as lending, education, employment, and beyond. Such decisions often impact their subjects' future characteristics and capabilities in an a priori unknown fashion. The decision-maker, therefore, faces exploration-exploitation dilemmas akin to those in multi-armed bandits. Followin… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted to IJCAI 2021

  14. arXiv:2104.03946  [pdf, other

    cs.LG cs.AI stat.ML

    Learning What To Do by Simulating the Past

    Authors: David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

    Abstract: Since reward functions are hard to specify, recent work has focused on learning policies from human feedback. However, such approaches are impeded by the expense of acquiring such feedback. Recent work proposed that agents have access to a source of information that is effectively free: in any environment that humans have acted in, the state will already be optimized for human preferences, and thu… ▽ More

    Submitted 3 May, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: Presented at ICLR 2021

  15. arXiv:2104.00550  [pdf

    physics.soc-ph cs.SI

    Agent-based simulations for protecting nursing homes with prevention and vaccination strategies

    Authors: Jana Lasser, Johannes Zuber, Johannes Sorger, Elma Dervic, Katharina Ledebur, Simon David Lindner, Elisabeth Klager, Maria Kletečka-Pulker, Harald Willschke, Katrin Stangl, Sarah Stadtmann, Christian Haslinger, Peter Klimek, Thomas Wochele-Thoma

    Abstract: Due to its high lethality amongst the elderly, the safety of nursing homes has been of central importance during the COVID-19 pandemic. With test procedures becoming available at scale, such as antigen or RT-LAMP tests, and increasing availability of vaccinations, nursing homes might be able to safely relax prohibitory measures while controlling the spread of infections (meaning an average of one… ▽ More

    Submitted 14 June, 2021; v1 submitted 16 November, 2020; originally announced April 2021.

    Comments: Supplementary material is included in the manuscript PDF

  16. arXiv:2102.12466  [pdf, other

    cs.LG

    Information Directed Reward Learning for Reinforcement Learning

    Authors: David Lindner, Matteo Turchetta, Sebastian Tschiatschek, Kamil Ciosek, Andreas Krause

    Abstract: For many reinforcement learning (RL) applications, specifying a reward is difficult. This paper considers an RL setting where the agent obtains information about the reward only by querying an expert that can, for example, evaluate individual states or provide binary preferences over trajectories. From such expensive feedback, we aim to learn a model of the reward that allows standard RL algorithm… ▽ More

    Submitted 31 January, 2022; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Presented at Conference on Neural Information Processing Systems (NeurIPS), 2021

  17. arXiv:2101.12509  [pdf, ps, other

    cs.LG cs.AI

    Challenges for Using Impact Regularizers to Avoid Negative Side Effects

    Authors: David Lindner, Kyle Matoba, Alexander Meulemans

    Abstract: Designing reward functions for reinforcement learning is difficult: besides specifying which behavior is rewarded for a task, the reward also has to discourage undesired outcomes. Misspecified reward functions can lead to unintended negative side effects, and overall unsafe behavior. To overcome this problem, recent work proposed to augment the specified reward function with an impact regularizer… ▽ More

    Submitted 23 February, 2021; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: Presented at the SafeAI workshop at AAAI 2021

  18. arXiv:1907.00452  [pdf, ps, other

    cs.LG stat.ML

    Detecting Spiky Corruption in Markov Decision Processes

    Authors: Jason Mancuso, Tomasz Kisielewski, David Lindner, Alok Singh

    Abstract: Current reinforcement learning methods fail if the reward function is imperfect, i.e. if the agent observes reward different from what it actually receives. We study this problem within the formalism of Corrupt Reward Markov Decision Processes (CRMDPs). We show that if the reward corruption in a CRMDP is sufficiently "spiky", the environment is solvable. We fully characterize the regret bound of a… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

    Comments: paper accepted to the AI Safety Workshop at IJCAI-19

  19. arXiv:1903.11451  [pdf, other

    cs.SI cs.LG stat.ML

    Sensing Social Media Signals for Cryptocurrency News

    Authors: Johannes Beck, Roberta Huang, David Lindner, Tian Guo, Ce Zhang, Dirk Helbing, Nino Antulov-Fantulin

    Abstract: The ability to track and monitor relevant and important news in real-time is of crucial interest in multiple industrial sectors. In this work, we focus on the set of cryptocurrency news, which recently became of emerging interest to the general and financial audience. In order to track relevant news in real-time, we (i) match news from the web with tweets from social media, (ii) track their intrad… ▽ More

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: full version of the paper, that is accepted at ACM WWW '19 Conference, MSM'19 Workshop