Skip to main content

Showing 1–50 of 253 results for author: Krause, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16745  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Bandits with Preference Feedback: A Stackelberg Game Perspective

    Authors: Barna Pásztor, Parnian Kassraie, Andreas Krause

    Abstract: Bandits with preference feedback present a powerful tool for optimizing unknown target functions when only pairwise comparisons are allowed instead of direct value queries. This model allows for incorporating human feedback into online inference and optimization and has been employed in systems for fine-tuning large language models. The problem is well understood in simplified settings with linear… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 30 pages, 8 figures

  2. arXiv:2406.11601  [pdf, other

    cs.LG stat.ML

    Standardizing Structural Causal Models

    Authors: Weronika Ormaniec, Scott Sussex, Lars Lorch, Bernhard Schölkopf, Andreas Krause

    Abstract: Synthetic datasets generated by structural causal models (SCMs) are commonly used for benchmarking causal structure learning algorithms. However, the variances and pairwise correlations in SCM data tend to increase along the causal ordering. Several popular algorithms exploit these artifacts, possibly leading to conclusions that do not generalize to real-world settings. Existing metrics like… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.03932  [pdf, other

    cs.LG

    Breeding Programs Optimization with Reinforcement Learning

    Authors: Omar G. Younis, Luca Corinzia, Ioannis N. Athanasiadis, Andreas Krause, Joachim M. Buhmann, Matteo Turchetta

    Abstract: Crop breeding is crucial in improving agricultural productivity while potentially decreasing land usage, greenhouse gas emissions, and water consumption. However, breeding programs are challenging due to long turnover times, high-dimensional decision spaces, long-term objectives, and the need to adapt to rapid climate change. This paper introduces the use of Reinforcement Learning (RL) to optimize… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning

  4. arXiv:2406.01575  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Stochastic Bilevel Optimization with Lower-Level Contextual Markov Decision Processes

    Authors: Vinzenz Thoma, Barna Pasztor, Andreas Krause, Giorgia Ramponi, Yifan Hu

    Abstract: In various applications, the optimal policy in a strategic decision-making problem depends both on the environmental configuration and exogenous events. For these settings, we introduce Bilevel Optimization with Contextual Markov Decision Processes (BO-CMDP), a stochastic bilevel decision-making model, where the lower level consists of solving a contextual Markov Decision Process (CMDP). BO-CMDP c… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 54 pages, 18 Figures

  5. arXiv:2406.01175  [pdf, other

    cs.LG

    NeoRL: Efficient Exploration for Nonepisodic RL

    Authors: Bhavya Sukhija, Lenart Treven, Florian Dörfler, Stelian Coros, Andreas Krause

    Abstract: We study the problem of nonepisodic reinforcement learning (RL) for nonlinear dynamical systems, where the system dynamics are unknown and the RL agent has to learn from a single trajectory, i.e., without resets. We propose Nonepisodic Optimistic RL (NeoRL), an approach based on the principle of optimism in the face of uncertainty. NeoRL uses well-calibrated probabilistic models and plans optimist… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  6. arXiv:2406.01163  [pdf, other

    cs.LG

    When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL

    Authors: Lenart Treven, Bhavya Sukhija, Yarden As, Florian Dörfler, Andreas Krause

    Abstract: Reinforcement learning (RL) excels in optimizing policies for discrete-time Markov decision processes (MDP). However, various systems are inherently continuous in time, making discrete-time MDPs an inexact modeling choice. In many applications, such as greenhouse control or medical treatments, each interaction (measurement or switching of action) involves manual intervention and thus is inherently… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.05890  [pdf, other

    cs.LG cs.AI

    Safe Exploration Using Bayesian World Models and Log-Barrier Optimization

    Authors: Yarden As, Bhavya Sukhija, Andreas Krause

    Abstract: A major challenge in deploying reinforcement learning in online tasks is ensuring that safety is maintained throughout the learning process. In this work, we propose CERL, a new method for solving constrained Markov decision processes while kee** the policy safe during learning. Our method leverages Bayesian world models and suggests policies that are pessimistic w.r.t. the model's epistemic unc… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  8. arXiv:2403.19570  [pdf, other

    cs.LG

    GrINd: Grid Interpolation Network for Scattered Observations

    Authors: Andrzej Dulny, Paul Heinisch, Andreas Hotho, Anna Krause

    Abstract: Predicting the evolution of spatiotemporal physical systems from sparse and scattered observational data poses a significant challenge in various scientific domains. Traditional methods rely on dense grid-structured data, limiting their applicability in scenarios with sparse observations. To address this challenge, we introduce GrINd (Grid Interpolation Network for Scattered Observations), a novel… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  9. arXiv:2403.18438  [pdf, other

    cs.LG

    Global Vegetation Modeling with Pre-Trained Weather Transformers

    Authors: Pascal Janetzky, Florian Gallusser, Simon Hentschel, Andreas Hotho, Anna Krause

    Abstract: Accurate vegetation models can produce further insights into the complex interaction between vegetation activity and ecosystem processes. Previous research has established that long-term trends and short-term variability of temperature and precipitation affect vegetation activity. Motivated by the recent success of Transformer-based Deep Learning models for medium-range weather forecasting, we ada… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Tackling Climate Change with Machine Learning Workshop @ ICLR 2024

  10. arXiv:2403.16644  [pdf, other

    cs.RO cs.LG

    Bridging the Sim-to-Real Gap with Bayesian Inference

    Authors: Jonas Rothfuss, Bhavya Sukhija, Lenart Treven, Florian Dörfler, Stelian Coros, Andreas Krause

    Abstract: We present SIM-FSVGD for learning robot dynamics from data. As opposed to traditional methods, SIM-FSVGD leverages low-fidelity physical priors, e.g., in the form of simulators, to regularize the training of neural network models. While learning accurate dynamics already in the low data regime, SIM-FSVGD scales and excels also when more data is available. We empirically show that learning with imp… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  11. arXiv:2403.11827  [pdf, other

    cs.SD cs.LG eess.AS

    Sound Event Detection and Localization with Distance Estimation

    Authors: Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros

    Abstract: Sound Event Detection and Localization (SELD) is a combined task of identifying sound events and their corresponding direction-of-arrival (DOA). While this task has numerous applications and has been extensively researched in recent years, it fails to provide full information about the sound source position. In this paper, we overcome this problem by extending the task to Sound Event Detection, Lo… ▽ More

    Submitted 12 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted for the 32nd European Signal Processing Conference EUSIPCO 2024 in Lyon

  12. arXiv:2402.15898  [pdf, other

    cs.LG cs.AI

    Transductive Active Learning: Theory and Applications

    Authors: Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

    Abstract: We generalize active learning to address real-world settings with concrete prediction targets where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region. We analyze a family of decision rules that sample adaptively to minimize uncertainty about prediction targets. We are the first to show, under general regularity assumptions, that such… ▽ More

    Submitted 22 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.15441

  13. arXiv:2402.15441  [pdf, other

    cs.LG cs.AI

    Active Few-Shot Fine-Tuning

    Authors: Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

    Abstract: We study the question: How can we select the right data for fine-tuning to a specific task? We call this data selection problem active fine-tuning and show that it is an instance of transductive active learning, a novel generalization of classical active learning. We propose ITL, short for information-based transductive learning, an approach which samples adaptively to maximize information gained… ▽ More

    Submitted 21 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  14. arXiv:2402.08406  [pdf, other

    cs.LG

    Transition Constrained Bayesian Optimization via Markov Decision Processes

    Authors: Jose Pablo Folch, Calvin Tsay, Robert M Lee, Behrang Shafei, Weronika Ormaniec, Andreas Krause, Mark van der Wilk, Ruth Misener, Mojmír Mutný

    Abstract: Bayesian optimization is a methodology to optimize black-box functions. Traditionally, it focuses on the setting where you can arbitrarily query the search space. However, many real-life problems do not offer this flexibility; in particular, the search space of the next query may depend on previous ones. Example challenges arise in the physical sciences in the form of local movement constraints, r… ▽ More

    Submitted 29 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 10 pages main, 32 pages total, 16 figures, 2 tables, preprint

  15. arXiv:2402.06562  [pdf, other

    eess.SY cs.LG cs.RO math.OC

    Safe Guaranteed Exploration for Non-linear Systems

    Authors: Manish Prajapat, Johannes Köhler, Matteo Turchetta, Andreas Krause, Melanie N. Zeilinger

    Abstract: Safely exploring environments with a-priori unknown constraints is a fundamental challenge that restricts the autonomy of robots. While safety is paramount, guarantees on sufficient exploration are also crucial for ensuring autonomous task completion. To address these challenges, we propose a novel safe guaranteed exploration framework using optimal control, which achieves first-of-its-kind result… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  16. arXiv:2402.05724  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL

    Authors: Jiawei Huang, Niao He, Andreas Krause

    Abstract: We study the sample complexity of reinforcement learning (RL) in Mean-Field Games (MFGs) with model-based function approximation that requires strategic exploration to find a Nash Equilibrium policy. We introduce the Partial Model-Based Eluder Dimension (P-MBED), a more effective notion to characterize the model class complexity. Notably, P-MBED measures the complexity of the single-agent model cl… ▽ More

    Submitted 3 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: ICML 2024; 55 Pages

  17. arXiv:2401.08351  [pdf, other

    cs.LG cs.CR

    Personalized Federated Learning of Probabilistic Models: A PAC-Bayesian Approach

    Authors: Mahrokh Ghoddousi Boroujeni, Andreas Krause, Giancarlo Ferrari Trecate

    Abstract: Federated learning aims to infer a shared model from private and decentralized data stored locally by multiple clients. Personalized federated learning (PFL) goes one step further by adapting the global model to each client, enhancing the model's fit for different clients. A significant level of personalization is required for highly heterogeneous clients, but can be challenging to achieve especia… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  18. arXiv:2312.08307  [pdf, other

    physics.chem-ph cs.LG

    EquiReact: An equivariant neural network for chemical reactions

    Authors: Puck van Gerwen, Ksenia R. Briling, Charlotte Bunne, Vignesh Ram Somnath, Ruben Laplaza, Andreas Krause, Clemence Corminboeuf

    Abstract: Equivariant neural networks have considerably improved the accuracy and data-efficiency of predictions of molecular properties. Building on this success, we introduce EquiReact, an equivariant neural network to infer properties of chemical reactions, built from three-dimensional structures of reactants and products. We illustrate its competitive performance on the prediction of activation barriers… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 41 pages + SI (6 pages)

  19. arXiv:2311.16706  [pdf, ps, other

    cs.LG math.PR stat.ML

    Sinkhorn Flow: A Continuous-Time Framework for Understanding and Generalizing the Sinkhorn Algorithm

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Andreas Krause

    Abstract: Many problems in machine learning can be formulated as solving entropy-regularized optimal transport on the space of probability measures. The canonical approach involves the Sinkhorn iterates, renowned for their rich mathematical properties. Recently, the Sinkhorn algorithm has been recast within the mirror descent framework, thus benefiting from classical optimization theory insights. Here, we b… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  20. arXiv:2311.07558  [pdf, other

    cs.LG cs.RO

    Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning

    Authors: Arjun Bhardwaj, Jonas Rothfuss, Bhavya Sukhija, Yarden As, Marco Hutter, Stelian Coros, Andreas Krause

    Abstract: We introduce PACOH-RL, a novel model-based Meta-Reinforcement Learning (Meta-RL) algorithm designed to efficiently adapt control policies to changing dynamics. PACOH-RL meta-learns priors for the dynamics model, allowing swift adaptation to new dynamics with minimal interaction data. Existing Meta-RL methods require abundant meta-learning data, limiting their applicability in settings such as robo… ▽ More

    Submitted 6 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  21. arXiv:2311.04402  [pdf, other

    cs.LG stat.ML

    Likelihood Ratio Confidence Sets for Sequential Decision Making

    Authors: Nicolas Emmenegger, Mojmír Mutný, Andreas Krause

    Abstract: Certifiable, adaptive uncertainty estimates for unknown quantities are an essential ingredient of sequential decision-making algorithms. Standard approaches rely on problem-dependent concentration results and are limited to a specific combination of parameterization, noise family, and estimator. In this paper, we revisit the likelihood-based inference principle and propose to use likelihood ratios… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  22. arXiv:2311.02374  [pdf, other

    math.OC cs.LG

    Riemannian stochastic optimization methods avoid strict saddle points

    Authors: Ya-** Hsieh, Mohammad Reza Karimi, Andreas Krause, Panayotis Mertikopoulos

    Abstract: Many modern machine learning applications - from online principal component analysis to covariance matrix identification and dictionary learning - can be formulated as minimization problems on Riemannian manifolds, and are typically solved with a Riemannian stochastic gradient method (or some variant thereof). However, in many cases of interest, the resulting minimization problem is not geodesical… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 27 pages, 3 figures

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C48

  23. arXiv:2310.19848  [pdf, other

    cs.LG cs.RO math.OC

    Efficient Exploration in Continuous-time Model-based Reinforcement Learning

    Authors: Lenart Treven, Jonas Hübotter, Bhavya Sukhija, Florian Dörfler, Andreas Krause

    Abstract: Reinforcement learning algorithms typically consider discrete-time dynamics, even though the underlying systems are often continuous in time. In this paper, we introduce a model-based reinforcement learning algorithm that represents continuous-time dynamics using nonlinear ordinary differential equations (ODEs). We capture epistemic uncertainty using well-calibrated probabilistic models, and use t… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  24. arXiv:2310.19390  [pdf, other

    stat.ML cs.LG

    Implicit Manifold Gaussian Process Regression

    Authors: Bernardo Fichera, Viacheslav Borovitskiy, Andreas Krause, Aude Billard

    Abstract: Gaussian process regression is widely used because of its ability to provide well-calibrated uncertainty estimates and handle small or sparse datasets. However, it struggles with high-dimensional data. One possible way to scale this technique to higher dimensions is to leverage the implicit low-dimensional manifold upon which the data actually lies, as postulated by the manifold hypothesis. Prior… ▽ More

    Submitted 1 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  25. arXiv:2310.18824  [pdf, other

    stat.ML cs.LG

    Intrinsic Gaussian Vector Fields on Manifolds

    Authors: Daniel Robert-Nicoud, Andreas Krause, Viacheslav Borovitskiy

    Abstract: Various applications ranging from robotics to climate science require modeling signals on non-Euclidean domains, such as the sphere. Gaussian process models on manifolds have recently been proposed for such tasks, in particular when uncertainty quantification is needed. In the manifold setting, vector-valued signals can behave very differently from scalar-valued ones, with much of the progress so… ▽ More

    Submitted 31 March, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Version accepted at AISTATS 2024

  26. arXiv:2310.18535  [pdf, other

    math.OC cs.LG

    Contextual Stochastic Bilevel Optimization

    Authors: Yifan Hu, Jie Wang, Yao Xie, Andreas Krause, Daniel Kuhn

    Abstract: We introduce contextual stochastic bilevel optimization (CSBO) -- a stochastic bilevel optimization framework with the lower-level problem minimizing an expectation conditioned on some contextual information and the upper-level decision variable. This framework extends classical stochastic bilevel optimization when the lower-level decision maker responds optimally not only to the decision of the u… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: The paper is accepted by NeurIPS 2023

  27. arXiv:2310.17405  [pdf, other

    cs.LG

    Causal Modeling with Stationary Diffusions

    Authors: Lars Lorch, Andreas Krause, Bernhard Schölkopf

    Abstract: We develop a novel approach towards causal inference. Rather than structural equations over a causal graph, we learn stochastic differential equations (SDEs) whose stationary densities model a system's behavior under interventions. These stationary diffusion models do not require the formalism of causal graphs, let alone the common assumption of acyclicity. We show that in several cases, they gene… ▽ More

    Submitted 16 March, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: AISTATS 2024

  28. arXiv:2310.06177  [pdf, other

    cs.LG

    DockGame: Cooperative Games for Multimeric Rigid Protein Docking

    Authors: Vignesh Ram Somnath, Pier Giuseppe Sessa, Maria Rodriguez Martinez, Andreas Krause

    Abstract: Protein interactions and assembly formation are fundamental to most biological processes. Predicting the assembly structure from constituent proteins -- referred to as the protein docking task -- is thus a crucial step in protein design applications. Most traditional and deep learning methods for docking have focused mainly on binary docking, following either a search-based, regression-based, or g… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Under Review

  29. arXiv:2309.02236  [pdf, other

    cs.LG cs.AI stat.ML

    Distributionally Robust Model-based Reinforcement Learning with Large State Spaces

    Authors: Shyam Sundhar Ramesh, Pier Giuseppe Sessa, Yifan Hu, Andreas Krause, Ilija Bogunovic

    Abstract: Three major challenges in reinforcement learning are the complex dynamical systems with large state spaces, the costly data acquisition processes, and the deviation of real-world dynamics from the training environment deployment. To overcome these issues, we study distributionally robust Markov decision processes with continuous state spaces under the widely used Kullback-Leibler, chi-square, and… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Journal ref: AISTATS 2024

  30. arXiv:2308.01744  [pdf, other

    cs.LG

    Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning

    Authors: Pier Giuseppe Sessa, Pierre Laforgue, Nicolò Cesa-Bianchi, Andreas Krause

    Abstract: Multitask learning is a powerful framework that enables one to simultaneously learn multiple related tasks by sharing information between them. Quantifying uncertainty in the estimated tasks is of pivotal importance for many downstream applications, such as online or active learning. In this work, we provide novel multitask confidence intervals in the challenging agnostic setting, i.e., when neith… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  31. arXiv:2307.16625  [pdf, other

    cs.LG stat.ML

    Adversarial Causal Bayesian Optimization

    Authors: Scott Sussex, Pier Giuseppe Sessa, Anastasiia Makarova, Andreas Krause

    Abstract: In Causal Bayesian Optimization (CBO), an agent intervenes on an unknown structural causal model to maximize a downstream reward variable. In this paper, we consider the generalization where other agents or external events also intervene on the system, which is key for enabling adaptiveness to non-stationarities such as weather changes, market forces, or adversaries. We formalize this generalizati… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 21 pages, 8 figures

  32. arXiv:2307.13372  [pdf, other

    cs.LG

    Submodular Reinforcement Learning

    Authors: Manish Prajapat, Mojmír Mutný, Melanie N. Zeilinger, Andreas Krause

    Abstract: In reinforcement learning (RL), rewards of states are typically considered additive, and following the Markov assumption, they are $\textit{independent}$ of states visited previously. In many important applications, such as coverage control, experiment design and informative path planning, rewards naturally have diminishing returns, i.e., their value decreases in light of similar states visited pr… ▽ More

    Submitted 24 May, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Spotlight paper at ICLR 2024

  33. arXiv:2307.12897  [pdf, other

    stat.ML cs.AI cs.LG

    Anytime Model Selection in Linear Bandits

    Authors: Parnian Kassraie, Nicolas Emmenegger, Andreas Krause, Aldo Pacchiano

    Abstract: Model selection in the context of bandit optimization is a challenging problem, as it requires balancing exploration and exploitation not only for action selection, but also for model selection. One natural approach is to rely on online learning algorithms that treat different models as experts. Existing methods, however, scale poorly ($\text{poly}M$) with the number of models $M$ in terms of thei… ▽ More

    Submitted 12 November, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023, 37 pages

  34. arXiv:2306.17052  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Matej Jusup, Barna Pásztor, Tadeusz Janik, Kenan Zhang, Francesco Corman, Andreas Krause, Ilija Bogunovic

    Abstract: Many applications, e.g., in shared mobility, require coordinating a large number of agents. Mean-field reinforcement learning addresses the resulting scalability challenge by optimizing the policy of a representative agent interacting with the infinite population of identical agents instead of considering individual pairwise interactions. In this paper, we address an important generalization where… ▽ More

    Submitted 27 December, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 23 pages, 26 figures, 6 tables

  35. arXiv:2306.14511  [pdf, other

    cs.LG

    TaylorPDENet: Learning PDEs from non-grid Data

    Authors: Paul Heinisch, Andrzej Dulny, Anna Krause, Andreas Hotho

    Abstract: Modeling data obtained from dynamical systems has gained attention in recent years as a challenging task for machine learning models. Previous approaches assume the measurements to be distributed on a grid. However, for real-world applications like weather prediction, the observations are taken from arbitrary locations within the spatial domain. In this paper, we propose TaylorPDENet - a novel mac… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  36. arXiv:2306.13479  [pdf, other

    eess.SY cs.RO

    Safe Risk-averse Bayesian Optimization for Controller Tuning

    Authors: Christopher Koenig, Miks Ozols, Anastasia Makarova, Efe C. Balta, Andreas Krause, Alisa Rupenyan

    Abstract: Controller tuning and parameter optimization are crucial in system design to improve both the controller and underlying system performance. Bayesian optimization has been established as an efficient model-free method for controller tuning and adaptation. Standard methods, however, are not enough for high-precision systems to be robust with respect to unknown input-dependent noise and stable under… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  37. arXiv:2306.12371  [pdf, other

    cs.LG cs.RO eess.SY

    Optimistic Active Exploration of Dynamical Systems

    Authors: Bhavya Sukhija, Lenart Treven, Cansu Sancaktar, Sebastian Blaes, Stelian Coros, Andreas Krause

    Abstract: Reinforcement learning algorithms commonly seek to optimize policies for solving one particular task. How should we explore an unknown dynamical system such that the estimated model globally approximates the dynamics and allows us to solve multiple downstream tasks in a zero-shot manner? In this paper, we address this challenge, by develo** an algorithm -- OPAX -- for active exploration. OPAX us… ▽ More

    Submitted 30 October, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  38. arXiv:2306.09708  [pdf, other

    cs.CR

    "We've Disabled MFA for You": An Evaluation of the Security and Usability of Multi-Factor Authentication Recovery Deployments

    Authors: Sabrina Amft, Sandra Höltervennhoff, Nicolas Huaman, Alexander Krause, Lucy Simko, Yasemin Acar, Sascha Fahl

    Abstract: Multi-Factor Authentication is intended to strengthen the security of password-based authentication by adding another factor, such as hardware tokens or one-time passwords using mobile apps. However, this increased authentication security comes with potential drawbacks that can lead to account and asset loss. If users lose access to their additional authentication factors for any reason, they will… ▽ More

    Submitted 19 September, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

  39. arXiv:2306.09099  [pdf, other

    cs.LG

    Unbalanced Diffusion Schrödinger Bridge

    Authors: Matteo Pariset, Ya-** Hsieh, Charlotte Bunne, Andreas Krause, Valentin De Bortoli

    Abstract: Schrödinger bridges (SBs) provide an elegant framework for modeling the temporal evolution of populations in physical, chemical, or biological systems. Such natural processes are commonly subject to changes in population size over time due to the emergence of new species or birth and death events. However, existing neural parameterizations of SBs such as diffusion Schrödinger bridges (DSBs) are re… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  40. arXiv:2306.07749  [pdf, other

    cs.LG cs.GT cs.MA

    Provably Learning Nash Policies in Constrained Markov Potential Games

    Authors: Pragnya Alatur, Giorgia Ramponi, Niao He, Andreas Krause

    Abstract: Multi-agent reinforcement learning (MARL) addresses sequential decision-making problems with multiple agents, where each agent optimizes its own objective. In many real-world instances, the agents may not only want to optimize their objectives, but also ensure safe behavior. For example, in traffic routing, each car (agent) aims to reach its destination quickly (objective) while avoiding collision… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 30 pages

  41. arXiv:2306.07092  [pdf, other

    cs.RO cs.AI

    Tuning Legged Locomotion Controllers via Safe Bayesian Optimization

    Authors: Daniel Widmer, Dongho Kang, Bhavya Sukhija, Jonas Hübotter, Andreas Krause, Stelian Coros

    Abstract: This paper presents a data-driven strategy to streamline the deployment of model-based controllers in legged robotic hardware platforms. Our approach leverages a model-free safe learning algorithm to automate the tuning of control gains, addressing the mismatch between the simplified model used in the control formulation and the real system. This method substantially mitigates the risk of hazardou… ▽ More

    Submitted 25 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted to the 2023 Conference on Robot Learning (CoRL 2023.) The first two authors contributed equally. The supplementary video is available at https://youtu.be/zDBouUgegrU and the code implementation is available at https://github.com/lasgroup/gosafeopt

  42. DynaBench: A benchmark dataset for learning dynamical systems from low-resolution data

    Authors: Andrzej Dulny, Andreas Hotho, Anna Krause

    Abstract: Previous work on learning physical systems from data has focused on high-resolution grid-structured measurements. However, real-world knowledge of such systems (e.g. weather data) relies on sparsely scattered measuring stations. In this paper, we introduce a novel simulated benchmark dataset, DynaBench, for learning dynamical systems directly from sparsely scattered data without prior knowledge of… ▽ More

    Submitted 28 September, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: This version is the final camera-ready version that has been published in the Proceedings of ECML-PKDD 2023

    Journal ref: Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14169, p. 438-455. Springer, Cham

  43. arXiv:2305.16147  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Safety Constraints from Demonstrations with Unknown Rewards

    Authors: David Lindner, Xin Chen, Sebastian Tschiatschek, Katja Hofmann, Andreas Krause

    Abstract: We propose Convex Constraint Learning for Reinforcement Learning (CoCoRL), a novel approach for inferring shared constraints in a Constrained Markov Decision Process (CMDP) from a set of safe demonstrations with possibly different reward functions. While previous work is limited to demonstrations with known rewards or fully known environment dynamics, CoCoRL can learn constraints from demonstratio… ▽ More

    Submitted 1 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Presented at the International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  44. arXiv:2305.09779  [pdf, other

    cs.LG cs.AI

    A Scalable Walsh-Hadamard Regularizer to Overcome the Low-degree Spectral Bias of Neural Networks

    Authors: Ali Gorji, Andisheh Amrollahi, Andreas Krause

    Abstract: Despite the capacity of neural nets to learn arbitrary functions, models trained through gradient descent often exhibit a bias towards ``simpler'' functions. Various notions of simplicity have been introduced to characterize this behavior. Here, we focus on the case of neural networks with discrete (zero-one), high-dimensional, inputs through the lens of their Fourier (Walsh-Hadamard) transforms,… ▽ More

    Submitted 10 June, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted for the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023)

  45. arXiv:2305.05354  [pdf, other

    cs.RO cs.AI

    Safe Deep RL for Intraoperative Planning of Pedicle Screw Placement

    Authors: Yunke Ao, Hooman Esfandiari, Fabio Carrillo, Yarden As, Mazda Farshad, Benjamin F. Grewe, Andreas Krause, Philipp Fuernstahl

    Abstract: Spinal fusion surgery requires highly accurate implantation of pedicle screw implants, which must be conducted in critical proximity to vital structures with a limited view of anatomy. Robotic surgery systems have been proposed to improve placement accuracy, however, state-of-the-art systems suffer from the limitations of open-loop approaches, as they follow traditional concepts of preoperative pl… ▽ More

    Submitted 10 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 10 pages, 4 figures

  46. arXiv:2303.01076  [pdf, other

    cs.LG cs.AI stat.ML

    Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

    Authors: Jonas Rothfuss, Bhavya Sukhija, Tobias Birchler, Parnian Kassraie, Andreas Krause

    Abstract: We study the problem of conservative off-policy evaluation (COPE) where given an offline dataset of environment interactions, collected by other agents, we seek to obtain a (tight) lower bound on a policy's performance. This is crucial when deciding whether a given policy satisfies certain minimal performance/safety criteria before it can be deployed in the real world. To this end, we introduce HA… ▽ More

    Submitted 26 May, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI) 2023, first three authors contributed equally

  47. Machine Learning for QoS Prediction in Vehicular Communication: Challenges and Solution Approaches

    Authors: Alexandros Palaios, Christian L. Vielhaus, Daniel F. Külzer, Cara Watermann, Rodrigo Hernangomez, Sanket Partani, Philipp Geuer, Anton Krause, Raja Sattiraju, Martin Kasparick, Gerhard Fettweis, Frank H. P. Fitzek, Hans D. Schotten, Slawomir Stanczak

    Abstract: As cellular networks evolve towards the 6th generation, machine learning is seen as a key enabling technology to improve the capabilities of the network. Machine learning provides a methodology for predictive systems, which can make networks become proactive. This proactive behavior of the network can be leveraged to sustain, for example, a specific quality of service requirement. With predictive… ▽ More

    Submitted 22 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 18 pages, 12 Figures. Accepted on IEEE Access

  48. arXiv:2302.11419  [pdf, other

    cs.LG q-bio.QM

    Aligned Diffusion Schrödinger Bridges

    Authors: Vignesh Ram Somnath, Matteo Pariset, Ya-** Hsieh, Maria Rodriguez Martinez, Andreas Krause, Charlotte Bunne

    Abstract: Diffusion Schrödinger bridges (DSB) have recently emerged as a powerful framework for recovering stochastic dynamics via their marginal observations at different time points. Despite numerous successful applications, existing algorithms for solving DSBs have so far failed to utilize the structure of aligned data, which naturally arises in many biological phenomena. In this paper, we propose a nove… ▽ More

    Submitted 28 April, 2024; v1 submitted 22 February, 2023; originally announced February 2023.

  49. arXiv:2302.03683  [pdf, ps, other

    cs.LG stat.ML

    Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications

    Authors: Johannes Kirschner, Tor Lattimore, Andreas Krause

    Abstract: Partial monitoring is an expressive framework for sequential decision-making with an abundance of applications, including graph-structured and dueling bandits, dynamic pricing and transductive feedback models. We survey and extend recent results on the linear formulation of partial monitoring that naturally generalizes the standard linear bandit setting. The main result is that a single algorithm,… ▽ More

    Submitted 13 November, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  50. arXiv:2301.09943  [pdf, other

    cs.LG math.OC

    Learning To Dive In Branch And Bound

    Authors: Max B. Paulus, Andreas Krause

    Abstract: Primal heuristics are important for solving mixed integer linear programs, because they find feasible solutions that facilitate branch and bound search. A prominent group of primal heuristics are diving heuristics. They iteratively modify and resolve linear programs to conduct a depth-first search from any node in the search tree. Existing divers rely on generic decision rules that fail to exploit… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.