Skip to main content

Showing 101–150 of 342 results for author: Krause, A

.
  1. Turing and wave instabilities in hyperbolic reaction-diffusion systems: The role of second-order time derivatives and cross-diffusion terms on pattern formation

    Authors: Joshua Ritchie, Andrew L. Krause, Robert A. Van Gorder

    Abstract: Hyperbolic reaction-diffusion equations have recently attracted attention both for their application to a variety of biological and chemical phenomena, and for their distinct features in terms of propagation speed and novel instabilities not present in classical two-species reaction-diffusion systems. We explore the onset of diffusive instabilities and resulting pattern formation for such systems.… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

  2. arXiv:2204.04558  [pdf, other

    cs.RO cs.LG

    Gradient-Based Trajectory Optimization With Learned Dynamics

    Authors: Bhavya Sukhija, Nathanael Köhler, Miguel Zamora, Simon Zimmermann, Sebastian Curi, Andreas Krause, Stelian Coros

    Abstract: Trajectory optimization methods have achieved an exceptional level of performance on real-world robots in recent years. These methods heavily rely on accurate analytical models of the dynamics, yet some aspects of the physical world can only be captured to a limited extent. An alternative approach is to leverage machine learning techniques to learn a differentiable dynamics model of the system fro… ▽ More

    Submitted 25 June, 2023; v1 submitted 9 April, 2022; originally announced April 2022.

  3. arXiv:2204.03420  [pdf, ps, other

    math.KT math.AG math.AT

    On the K-theory of $\mathbb{Z}/p^n$ -- announcement

    Authors: Benjamin Antieau, Achim Krause, Thomas Nikolaus

    Abstract: We announce new methods for using prismatic cohomology to compute the K-groups of $\mathbb{Z}/p^n$ and related rings. We use computer algebra methods to compute these K-groups through a large range in specific cases and also obtain explicit formulas for their orders in large degrees.

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: Comments welcome!

  4. arXiv:2204.02337  [pdf, other

    cs.LG cs.AI q-bio.BM

    Multi-Scale Representation Learning on Proteins

    Authors: Vignesh Ram Somnath, Charlotte Bunne, Andreas Krause

    Abstract: Proteins are fundamental biological entities mediating key roles in cellular function and disease. This paper introduces a multi-scale graph construction of a protein -- HoloProt -- connecting surface to structure and sequence. The surface captures coarser details of the protein, while sequence as primary component and structure -- comprising secondary and tertiary components -- capture finer deta… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Neural Information Processing Systems 2021

  5. Tuning Particle Accelerators with Safety Constraints using Bayesian Optimization

    Authors: Johannes Kirschner, Mojmir Mutný, Andreas Krause, Jaime Coello de Portugal, Nicole Hiller, Jochem Snuverink

    Abstract: Tuning machine parameters of particle accelerators is a repetitive and time-consuming task that is challenging to automate. While many off-the-shelf optimization algorithms are available, in practice their use is limited because most methods do not account for safety-critical constraints in each iteration, such as loss signals or step-size limitations. One notable exception is safe Bayesian optimi… ▽ More

    Submitted 30 June, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  6. arXiv:2203.07322  [pdf, other

    cs.LG cs.MA

    Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation

    Authors: Pier Giuseppe Sessa, Maryam Kamgarpour, Andreas Krause

    Abstract: We consider model-based multi-agent reinforcement learning, where the environment transition model is unknown and can only be learned via expensive interactions with the environment. We propose H-MARL (Hallucinated Multi-Agent Reinforcement Learning), a novel sample-efficient algorithm that can efficiently balance exploration, i.e., learning about the environment, and exploitation, i.e., achieve g… ▽ More

    Submitted 10 July, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

  7. A Novel Physics-Regularized Interpretable Machine Learning Model for Grain Growth

    Authors: Weishi Yan, Joseph Melville, Vishal Yadav, Kristien Everett, Lin Yang, Michael S. Kesler, Amanda R. Krause, Michael R. Tonks, Joel B. Harley

    Abstract: Experimental grain growth observations often deviate from grain growth simulations, revealing that the governing rules for grain boundary motion are not fully understood. A novel deep learning model was developed to capture grain growth behavior from training data without making assumptions about the underlying physics. The Physics-Regularized Interpretable Machine Learning Microstructure Evolutio… ▽ More

    Submitted 17 August, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 31 pages, 12 figures. Accepted to Materials & Design. Code Available: https://github.com/EAGG-UF/PRIMME

  8. arXiv:2202.05722  [pdf, other

    cs.LG q-bio.QM

    The Schrödinger Bridge between Gaussian Measures has a Closed Form

    Authors: Charlotte Bunne, Ya-** Hsieh, Marco Cuturi, Andreas Krause

    Abstract: The static optimal transport $(\mathrm{OT})$ problem between Gaussians seeks to recover an optimal map, or more generally a coupling, to morph a Gaussian into another. It has been well studied and applied to a wide variety of tasks. Here we focus on the dynamic formulation of OT, also known as the Schrödinger bridge (SB) problem, which has recently seen a surge of interest in machine learning due… ▽ More

    Submitted 31 March, 2023; v1 submitted 11 February, 2022; originally announced February 2022.

  9. arXiv:2202.01850  [pdf, other

    stat.ML cs.AI cs.LG

    A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits

    Authors: Ilija Bogunovic, Zihan Li, Andreas Krause, Jonathan Scarlett

    Abstract: We consider the sequential optimization of an unknown, continuous, and expensive to evaluate reward function, from noisy and adversarially corrupted observed rewards. When the corruption attacks are subject to a suitable budget $C$ and the function lives in a Reproducing Kernel Hilbert Space (RKHS), the problem can be posed as corrupted Gaussian process (GP) bandit optimization. We propose a novel… ▽ More

    Submitted 28 March, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: Added references

  10. arXiv:2202.00602  [pdf, other

    stat.ML cs.AI cs.LG

    Meta-Learning Hypothesis Spaces for Sequential Decision-making

    Authors: Parnian Kassraie, Jonas Rothfuss, Andreas Krause

    Abstract: Obtaining reliable, adaptive confidence sets for prediction functions (hypotheses) is a central challenge in sequential decision-making tasks, such as bandits and model-based reinforcement learning. These confidence sets typically rely on prior assumptions on the hypothesis space, e.g., the known kernel of a Reproducing Kernel Hilbert Space (RKHS). Hand-designing such kernels is error prone, and m… ▽ More

    Submitted 17 June, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: 23 pages, 11 figures

  11. arXiv:2201.09802  [pdf, other

    cs.LG cs.AI cs.RO

    Constrained Policy Optimization via Bayesian World Models

    Authors: Yarden As, Ilnura Usmanova, Sebastian Curi, Andreas Krause

    Abstract: Improving sample-efficiency and safety are crucial challenges when deploying reinforcement learning in high-stakes real world applications. We propose LAMBDA, a novel model-based approach for policy optimization in safety critical tasks modeled via constrained Markov decision processes. Our approach utilizes Bayesian world models, and harnesses the resulting uncertainty to maximize optimistic uppe… ▽ More

    Submitted 6 February, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

  12. GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

    Authors: Bhavya Sukhija, Matteo Turchetta, David Lindner, Andreas Krause, Sebastian Trimpe, Dominik Baumann

    Abstract: Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage. Most existing model-free learning methods that guarantee safety, i.e., no failures, during exploration are limited to local optima. A notable exception is the GoSafe algorithm, which, unfortunately, cannot handle high-dimensional systems and hence cannot be… ▽ More

    Submitted 12 June, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Journal ref: Artificial Intelligence, Volume 320, Year 2023

  13. arXiv:2111.07786  [pdf, other

    cs.AI cs.LG

    Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking

    Authors: Octavian-Eugen Ganea, Xinyuan Huang, Charlotte Bunne, Yatao Bian, Regina Barzilay, Tommi Jaakkola, Andreas Krause

    Abstract: Protein complex formation is a central problem in biology, being involved in most of the cell's processes, and essential for applications, e.g. drug design or protein engineering. We tackle rigid body protein-protein docking, i.e., computationally predicting the 3D structure of a protein-protein complex from the individual unbound structures, assuming no conformational change within the proteins h… ▽ More

    Submitted 15 March, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Journal ref: Spotlight at ICLR 2022: International Conference on Learning Representations

  14. arXiv:2111.07671  [pdf, other

    cs.LG

    NeuralPDE: Modelling Dynamical Systems from Data

    Authors: Andrzej Dulny, Andreas Hotho, Anna Krause

    Abstract: Many physical processes such as weather phenomena or fluid mechanics are governed by partial differential equations (PDEs). Modelling such dynamical systems using Neural Networks is an active research field. However, current methods are still very limited, as they do not exploit the knowledge about the dynamical nature of the system, require extensive prior knowledge about the governing equations… ▽ More

    Submitted 11 October, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Journal ref: In KI 2022: Advances in Artificial Intelligence (pp. 75-89). Springer International Publishing (2022)

  15. arXiv:2111.05008  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Misspecified Gaussian Process Bandit Optimization

    Authors: Ilija Bogunovic, Andreas Krause

    Abstract: We consider the problem of optimizing a black-box function based on noisy bandit feedback. Kernelized bandit algorithms have shown strong empirical and theoretical performance for this problem. They heavily rely on the assumption that the model is well-specified, however, and can fail without it. Instead, we introduce a \emph{misspecified} kernelized bandit setting where the unknown function can b… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted to NeurIPS 2021

  16. arXiv:2111.03637  [pdf, other

    cs.LG

    Risk-averse Heteroscedastic Bayesian Optimization

    Authors: Anastasiia Makarova, Ilnura Usmanova, Ilija Bogunovic, Andreas Krause

    Abstract: Many black-box optimization tasks arising in high-stakes applications require risk-averse decisions. The standard Bayesian optimization (BO) paradigm, however, optimizes the expected value only. We generalize BO to trade mean and input-dependent variance of the objective, both of which we assume to be unknown a priori. In particular, we propose a novel risk-averse heteroscedastic Bayesian optimiza… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Journal ref: Advances in Neural Information Processing Systems, 2021

  17. arXiv:2110.14296  [pdf, other

    cs.LG eess.SY math.DS stat.ML

    Learning Stable Deep Dynamics Models for Partially Observed or Delayed Dynamical Systems

    Authors: Andreas Schlaginhaufen, Philippe Wenk, Andreas Krause, Florian Dörfler

    Abstract: Learning how complex dynamical systems evolve over time is a key challenge in system identification. For safety critical systems, it is often crucial that the learned model is guaranteed to converge to some equilibrium point. To this end, neural ODEs regularized with neural Lyapunov functions are a promising approach when states are fully observed. For practical applications however, partial obser… ▽ More

    Submitted 10 December, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021

    Journal ref: Advances in Neural Information Processing Systems, 2021

  18. arXiv:2110.11665  [pdf, other

    cs.LG stat.ML

    Diversified Sampling for Batched Bayesian Optimization with Determinantal Point Processes

    Authors: Elvis Nava, Mojmír Mutný, Andreas Krause

    Abstract: In Bayesian Optimization (BO) we study black-box function optimization with noisy point evaluations and Bayesian priors. Convergence of BO can be greatly sped up by batching, where multiple evaluations of the black-box function are performed in a single round. The main difficulty in this setting is to propose at the same time diverse and informative batches of evaluation points. In this work, we i… ▽ More

    Submitted 8 February, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: To be published in AISTATS 2022

  19. arXiv:2110.11181  [pdf, other

    cs.LG stat.ML

    Sensing Cox Processes via Posterior Sampling and Positive Bases

    Authors: Mojmír Mutný, Andreas Krause

    Abstract: We study adaptive sensing of Cox point processes, a widely used model from spatial statistics. We introduce three tasks: maximization of captured events, search for the maximum of the intensity function and learning level sets of the intensity function. We model the intensity function as a sample from a truncated Gaussian process, represented in a specially constructed positive basis. In this basi… ▽ More

    Submitted 29 March, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

  20. arXiv:2110.10809  [pdf, other

    cs.LG cs.AI cs.RO

    Hierarchical Skills for Efficient Exploration

    Authors: Jonas Gehring, Gabriel Synnaeve, Andreas Krause, Nicolas Usunier

    Abstract: In reinforcement learning, pre-trained low-level skills have the potential to greatly facilitate exploration. However, prior knowledge of the downstream task is required to strike the right balance between generality (fine-grained control) and specificity (faster learning) in skill design. In previous work on continuous control, the sensitivity of methods to this trade-off has not been addressed e… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: To appear in 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  21. arXiv:2110.03945  [pdf, other

    cs.LG

    Anomaly Detection in Beehives: An Algorithm Comparison

    Authors: Padraig Davidson, Michael Steininger, Florian Lautenschlager, Anna Krause, Andreas Hotho

    Abstract: Sensor-equipped beehives allow monitoring the living conditions of bees. Machine learning models can use the data of such hives to learn behavioral patterns and find anomalous events. One type of event that is of particular interest to apiarists for economical reasons is bee swarming. Other events of interest are behavioral anomalies from illness and technical anomalies, e.g. sensor failure. Beeke… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  22. Live Visualization of Dynamic Software Cities with Heat Map Overlays

    Authors: Alexander Krause, Malte Hansen, Wilhelm Hasselbring

    Abstract: The 3D city metaphor in software visualization is a well-explored rendering method. Numerous tools use their custom variation to visualize offline-analyzed data. Heat map overlays are one of these variants. They introduce a separate information layer in addition to the software city's own semantics. Results show that their usage facilitates program comprehension. In this paper, we present our he… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: 2021 Working Conference on Software Visualization (VISSOFT), 5 pages

    ACM Class: D.2.11

  23. arXiv:2109.12534  [pdf, other

    cs.LG stat.ML

    Data Summarization via Bilevel Optimization

    Authors: Zalán Borsos, Mojmír Mutný, Marco Tagliasacchi, Andreas Krause

    Abstract: The increasing availability of massive data sets poses a series of challenges for machine learning. Prominent among these is the need to learn models under hardware or human resource constraints. In such resource-constrained settings, a simple yet powerful approach is to operate on small subsets of the data. Coresets are weighted subsets of the data that provide approximation guarantees for the op… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

  24. arXiv:2109.09835  [pdf, ps, other

    math.OC

    Fast Projection Onto Convex Smooth Constraints

    Authors: Ilnura Usmanova, Maryam Kamgarpour, Andreas Krause, Kfir Yehuda Levy

    Abstract: The Euclidean projection onto a convex set is an important problem that arises in numerous constrained optimization tasks. Unfortunately, in many cases, computing projections is computationally demanding. In this work, we focus on projection problems where the constraints are smooth and the number of constraints is significantly smaller than the dimension. The runtime of existing approaches to sol… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  25. arXiv:2107.12033  [pdf, other

    cs.SD cs.LG eess.AS

    Joint Direction and Proximity Classification of Overlap** Sound Events from Binaural Audio

    Authors: Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros

    Abstract: Sound source proximity and distance estimation are of great interest in many practical applications, since they provide significant information for acoustic scene analysis. As both tasks share complementary qualities, ensuring efficient interaction between these two is crucial for a complete picture of an aural environment. In this paper, we aim to investigate several ways of performing joint prox… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  26. arXiv:2107.06327  [pdf, other

    cs.GT cs.LG

    Contextual Games: Multi-Agent Learning with Side Information

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Andreas Krause, Maryam Kamgarpour

    Abstract: We formulate the novel class of contextual games, a type of repeated games driven by contextual information at each round. By means of kernel-based regularity assumptions, we model the correlation between different contexts and game outcomes and propose a novel online (meta) algorithm that exploits such correlations to minimize the contextual regret of individual players. We define game-theoretic… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Journal ref: Proc. of Neural Information Processing Systems (NeurIPS), 2020

  27. arXiv:2107.06283  [pdf, other

    physics.comp-ph cs.ET

    Analog Computing for Molecular Dynamics

    Authors: Sven Köppel, Alexandra Krause, Bernd Ulmann

    Abstract: Modern analog computers are ideally suited to solving large systems of ordinary differential equations at high speed with low energy consumtion and limited accuracy. In this article, we survey N-body physics, applied to a simple water model inspired by force fields which are popular in molecular dynamics. We demonstrate a setup which simulate a single water molecule in time. To the best of our kno… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: 9 pages, 9 figures, submitted to Emerging Topics in Computing, IEEE Trans

    MSC Class: 82M37 ACM Class: J.2; J.3; G.1.7

    Journal ref: IJUC Volume 17, Number 4, p. 259-282 (2022)

  28. arXiv:2107.04050  [pdf, other

    stat.ML cs.LG cs.MA

    Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Barna Pásztor, Ilija Bogunovic, Andreas Krause

    Abstract: Learning in multi-agent systems is highly challenging due to several factors including the non-stationarity introduced by agents' interactions and the combinatorial nature of their state and action spaces. In particular, we consider the Mean-Field Control (MFC) problem which assumes an asymptotically infinite population of identical agents that aim to collaboratively maximize the collective reward… ▽ More

    Submitted 9 May, 2023; v1 submitted 8 July, 2021; originally announced July 2021.

    Journal ref: Pásztor, B., Krause, A., & Bogunovic, I. (2023). Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning. Transactions on Machine Learning Research

  29. arXiv:2107.03144  [pdf, other

    stat.ML cs.AI cs.LG

    Neural Contextual Bandits without Regret

    Authors: Parnian Kassraie, Andreas Krause

    Abstract: Contextual bandits are a rich model for sequential decision making given side information, with important applications, e.g., in recommender systems. We propose novel algorithms for contextual bandits harnessing neural networks to approximate the unknown reward function. We resolve the open problem of proving sublinear regret bounds in this setting for general context sequences, considering both f… ▽ More

    Submitted 28 February, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: 37 pages, 6 figures

  30. arXiv:2106.11609  [pdf, other

    cs.LG math.DS stat.ML

    Distributional Gradient Matching for Learning Uncertain Neural Dynamics Models

    Authors: Lenart Treven, Philippe Wenk, Florian Dörfler, Andreas Krause

    Abstract: Differential equations in general and neural ODEs in particular are an essential technique in continuous-time system identification. While many deterministic learning algorithms have been designed based on numerical integration via the adjoint method, many downstream tasks such as active learning, exploration in reinforcement learning, robust control, or filtering require accurate estimates of pre… ▽ More

    Submitted 15 October, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Published at NeurIPS 2021

    Journal ref: Advances in Neural Information Processing Systems, 2021

  31. arXiv:2106.08375  [pdf, other

    nlin.PS q-bio.CB

    Modern Perspectives on Near-Equilibrium Analysis of Turing Systems

    Authors: Andrew L. Krause, Eamonn A. Gaffney, Philip K. Maini, Václav Klika

    Abstract: In the nearly seven decades since the publication of Alan Turing's work on morphogenesis, enormous progress has been made in understanding both the mathematical and biological aspects of his proposed reaction-diffusion theory. Some of these developments were nascent in Turing's paper, and others have been due to new insights from modern mathematical techniques, advances in numerical simulations, a… ▽ More

    Submitted 13 September, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 21 pages, 6 figures

    MSC Class: 92C15 (primary) 35B36; 35K57 (secondary)

  32. arXiv:2106.07445  [pdf, other

    cs.LG cs.CR cs.CV math.OC stat.ML

    PopSkipJump: Decision-Based Attack for Probabilistic Classifiers

    Authors: Carl-Johann Simon-Gabriel, Noman Ahmed Sheikh, Andreas Krause

    Abstract: Most current classifiers are vulnerable to adversarial examples, small input perturbations that change the classification output. Many existing attack algorithms cover various settings, from white-box to black-box classifiers, but typically assume that the answers are deterministic and often fail when they are not. We therefore propose a new adversarial decision-based attack specifically designed… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: ICML'21. Code available at https://github.com/cjsg/PopSkipJump . 9 pages & 7 figures in main part, 14 pages & 10 figures in appendix

  33. arXiv:2106.06345  [pdf, other

    cs.LG

    Proximal Optimal Transport Modeling of Population Dynamics

    Authors: Charlotte Bunne, Laetitia Meng-Papaxanthos, Andreas Krause, Marco Cuturi

    Abstract: We propose a new approach to model the collective dynamics of a population of particles evolving with time. As is often the case in challenging scientific applications, notably single-cell genomics, measuring features for these particles requires destroying them. As a result, the population can only be monitored with periodic snapshots, obtained by sampling a few particles that are sacrificed in e… ▽ More

    Submitted 18 February, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

  34. arXiv:2106.04443  [pdf, other

    cs.LG cs.IT math.OC

    Robust Generalization despite Distribution Shift via Minimum Discriminating Information

    Authors: Tobias Sutter, Andreas Krause, Daniel Kuhn

    Abstract: Training models that perform well under distribution shifts is a central challenge in machine learning. In this paper, we introduce a modeling framework where, in addition to training data, we have partial structural knowledge of the shifted test distribution. We employ the principle of minimum discriminating information to embed the available prior knowledge, and use distributionally robust optim… ▽ More

    Submitted 26 October, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: 23 pages, 4 figures

    Journal ref: NeurIPS 2021

  35. arXiv:2106.03195  [pdf, other

    cs.LG cs.AI stat.ML

    Meta-Learning Reliable Priors in the Function Space

    Authors: Jonas Rothfuss, Dominique Heyn, **fan Chen, Andreas Krause

    Abstract: When data are scarce meta-learning can improve a learner's accuracy by harnessing previous experience from related learning tasks. However, existing methods have unreliable uncertainty estimates which are often overconfident. Addressing these shortcomings, we introduce a novel meta-learning framework, called F-PACOH, that treats meta-learned priors as stochastic processes and performs meta-level r… ▽ More

    Submitted 11 January, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: In Advances of Neural Information Processing Systems (NeurIPS) 2021

  36. arXiv:2106.02938  [pdf, other

    cs.LG

    Energy-Based Learning for Cooperative Games, with Applications to Valuation Problems in Machine Learning

    Authors: Yatao Bian, Yu Rong, Tingyang Xu, Jiaxiang Wu, Andreas Krause, Junzhou Huang

    Abstract: Valuation problems, such as feature interpretation, data valuation and model valuation for ensembles, become increasingly more important in many machine learning applications. Such problems are commonly solved by well-known game-theoretic criteria, such as Shapley value or Banzhaf value. In this work, we present a novel energy-based treatment for cooperative games, with a theoretical justification… ▽ More

    Submitted 12 May, 2022; v1 submitted 5 June, 2021; originally announced June 2021.

    Comments: ICLR 2022

  37. arXiv:2106.01325  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Addressing the Long-term Impact of ML Decisions via Policy Regret

    Authors: David Lindner, Hoda Heidari, Andreas Krause

    Abstract: Machine Learning (ML) increasingly informs the allocation of opportunities to individuals and communities in areas such as lending, education, employment, and beyond. Such decisions often impact their subjects' future characteristics and capabilities in an a priori unknown fashion. The decision-maker, therefore, faces exploration-exploitation dilemmas akin to those in multi-armed bandits. Followin… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted to IJCAI 2021

  38. arXiv:2105.14250  [pdf, other

    cs.CV cs.LG

    Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable Cross-Approximation

    Authors: Mikhail Usvyatsov, Anastasia Makarova, Rafael Ballester-Ripoll, Maxim Rakhuba, Andreas Krause, Konrad Schindler

    Abstract: We propose an end-to-end trainable framework that processes large-scale visual data tensors by looking at a fraction of their entries only. Our method combines a neural network encoder with a tensor train decomposition to learn a low-rank latent encoding, coupled with cross-approximation (CA) to learn the representation through a subset of the original samples. CA is an adaptive sampling algorithm… ▽ More

    Submitted 12 November, 2021; v1 submitted 29 May, 2021; originally announced May 2021.

    Journal ref: Proc. International Conference on Computer Vision (ICCV) 2021

  39. arXiv:2105.14024  [pdf, other

    cs.LG

    Near-Optimal Multi-Perturbation Experimental Design for Causal Structure Learning

    Authors: Scott Sussex, Andreas Krause, Caroline Uhler

    Abstract: Causal structure learning is a key problem in many domains. Causal structures can be learnt by performing experiments on the system of interest. We address the largely unexplored problem of designing a batch of experiments that each simultaneously intervene on multiple variables. While potentially more informative than the commonly considered single-variable interventions, selecting such intervent… ▽ More

    Submitted 24 November, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: 10 pages, 2 figures, appendix, to be published in 35th Conference on Neural Information Processing Systems (NeurIPS 2021), fixed typos and clarified wording

  40. arXiv:2105.11839  [pdf, other

    cs.LG stat.ML

    DiBS: Differentiable Bayesian Structure Learning

    Authors: Lars Lorch, Jonas Rothfuss, Bernhard Schölkopf, Andreas Krause

    Abstract: Bayesian structure learning allows inferring Bayesian network structure from data while reasoning about the epistemic uncertainty -- a key element towards enabling active causal discovery and designing interventions in real world systems. In this work, we propose a general, fully differentiable framework for Bayesian structure learning (DiBS) that operates in the continuous space of a latent proba… ▽ More

    Submitted 16 December, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: NeurIPS 2021; updated run time results

  41. arXiv:2105.11802  [pdf, other

    stat.ML cs.LG

    Bias-Robust Bayesian Optimization via Dueling Bandits

    Authors: Johannes Kirschner, Andreas Krause

    Abstract: We consider Bayesian optimization in settings where observations can be adversarially biased, for example by an uncontrolled hidden confounder. Our first contribution is a reduction of the confounded setting to the dueling bandit model. Then we propose a novel approach for dueling bandits based on information-directed sampling (IDS). Thereby, we obtain the first efficient kernelized algorithm for… ▽ More

    Submitted 9 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  42. arXiv:2105.10252  [pdf, other

    q-fin.GN q-fin.MF q-fin.PM q-fin.PR

    A note on the CAPM with endogenously consistent market returns

    Authors: Andreas Krause

    Abstract: I demonstrate that with the market return determined by the equilibrium returns of the CAPM, expected returns of an asset are affected by the risks of all assets jointly. Another implication is that the range of feasible market returns will be limited and dependent on the distribution of weights in the market portfolio. A large and well diversified market with no dominating asset will only return… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: 4 pages, 1 figure

  43. arXiv:2104.14113  [pdf, other

    cs.LG

    Regret Bounds for Gaussian-Process Optimization in Large Domains

    Authors: Manuel Wüthrich, Bernhard Schölkopf, Andreas Krause

    Abstract: The goal of this paper is to characterize Gaussian-Process optimization in the setting where the function domain is large relative to the number of admissible function evaluations, i.e., where it is impossible to find the global optimum. We provide upper bounds on the suboptimality (Bayesian simple regret) of the solution found by optimization strategies that are closely related to the widely used… ▽ More

    Submitted 24 January, 2022; v1 submitted 29 April, 2021; originally announced April 2021.

  44. arXiv:2104.08166  [pdf, other

    cs.LG cs.AI stat.ML

    Automatic Termination for Hyperparameter Optimization

    Authors: Anastasia Makarova, Huibin Shen, Valerio Perrone, Aaron Klein, Jean Baptiste Faddoul, Andreas Krause, Matthias Seeger, Cedric Archambeau

    Abstract: Bayesian optimization (BO) is a widely popular approach for the hyperparameter optimization (HPO) in machine learning. At its core, BO iteratively evaluates promising configurations until a user-defined budget, such as wall-clock time or number of iterations, is exhausted. While the final performance after tuning heavily depends on the provided budget, it is hard to pre-specify an optimal value in… ▽ More

    Submitted 22 July, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at AutoML Conference 2022

  45. arXiv:2103.10369  [pdf, other

    cs.LG cs.AI stat.ML

    Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning

    Authors: Sebastian Curi, Ilija Bogunovic, Andreas Krause

    Abstract: In real-world tasks, reinforcement learning (RL) agents frequently encounter situations that are not present during training time. To ensure reliable performance, the RL agents need to exhibit robustness against worst-case situations. The robust RL framework addresses this challenge via a worst-case optimization between an agent and an adversary. Previous robust RL algorithms are either sample ine… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  46. arXiv:2102.12466  [pdf, other

    cs.LG

    Information Directed Reward Learning for Reinforcement Learning

    Authors: David Lindner, Matteo Turchetta, Sebastian Tschiatschek, Kamil Ciosek, Andreas Krause

    Abstract: For many reinforcement learning (RL) applications, specifying a reward is difficult. This paper considers an RL setting where the agent obtains information about the reward only by querying an expert that can, for example, evaluate individual states or provide binary preferences over trajectories. From such expensive feedback, we aim to learn a model of the reward that allows standard RL algorithm… ▽ More

    Submitted 31 January, 2022; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Presented at Conference on Neural Information Processing Systems (NeurIPS), 2021

  47. arXiv:2102.05371  [pdf, other

    cs.LG

    Risk-Averse Offline Reinforcement Learning

    Authors: Núria Armengol Urpí, Sebastian Curi, Andreas Krause

    Abstract: Training Reinforcement Learning (RL) agents in high-stakes applications might be too prohibitive due to the risk associated to exploration. Thus, the agent can only use data previously collected by safe policies. While previous work considers optimizing the average performance using offline data, we focus on optimizing a risk-averse criteria, namely the CVaR. In particular, we present the Offline… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

  48. arXiv:2101.08534  [pdf, other

    stat.ML cs.LG

    Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback

    Authors: Marc Jourdan, Mojmír Mutný, Johannes Kirschner, Andreas Krause

    Abstract: Combinatorial bandits with semi-bandit feedback generalize multi-armed bandits, where the agent chooses sets of arms and observes a noisy reward for each arm contained in the chosen set. The action set satisfies a given structure such as forming a base of a matroid or a path in a graph. We focus on the pure-exploration problem of identifying the best arm with fixed confidence, as well as a more ge… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 45 pages. 3 tables. Appendices: from A to I. Figures: 1(a), 1(b), 2(a), 2(b), 3(a), 3(b), 3(c), 4(a), 4(b), 5(a), 5(b), 5(c), 5(d), 6(a), 6(b). To be published in the 32nd International Conference on Algorithmic Learning Theory and the Proceedings of Machine Learning Research vol 132:1-45, 2021

  49. arXiv:2101.07825  [pdf, other

    eess.SY cs.AI cs.LG cs.RO

    Safe and Efficient Model-free Adaptive Control via Bayesian Optimization

    Authors: Christopher König, Matteo Turchetta, John Lygeros, Alisa Rupenyan, Andreas Krause

    Abstract: Adaptive control approaches yield high-performance controllers when a precise system model or suitable parametrizations of the controller are available. Existing data-driven approaches for adaptive control mostly augment standard model-based methods with additional information about uncertainties in the dynamics or about disturbances. In this work, we propose a purely data-driven, model-free appro… ▽ More

    Submitted 2 March, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

  50. arXiv:2101.01816  [pdf, ps, other

    cs.GT

    Incentive-Compatible Forecasting Competitions

    Authors: Jens Witkowski, Rupert Freeman, Jennifer Wortman Vaughan, David M. Pennock, Andreas Krause

    Abstract: We initiate the study of incentive-compatible forecasting competitions in which multiple forecasters make predictions about one or more events and compete for a single prize. We have two objectives: (1) to incentivize forecasters to report truthfully and (2) to award the prize to the most accurate forecaster. Proper scoring rules incentivize truthful reporting if all forecasters are paid according… ▽ More

    Submitted 7 September, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

    Comments: 38 pages. Relative to the previous version Appendix A and Theorem 5 are new. This version additionally contains some expanded exposition