Search | arXiv e-print repository

arXiv:1906.02685 [pdf, other]

Stochastic Bandits with Context Distributions

Authors: Johannes Kirschner, Andreas Krause

Abstract: We introduce a stochastic contextual bandit model where at each time step the environment chooses a distribution over a context set and samples the context from this distribution. The learner observes only the context distribution while the exact context realization remains hidden. This allows for a broad range of applications where the context is stochastic or when the learner needs to predict the context. We adapt the UCB algorithm to this setting and show that it achieves an order-optimal high-probability bound on the cumulative regret for linear and kernelized reward functions. Our results strictly generalize previous work in the sense that both our model and the algorithm reduce to the standard setting when the environment chooses only Dirac delta distributions and therefore provides the exact context to the learner. We further analyze a variant where the learner observes the realized context after choosing the action. Finally, we demonstrate the proposed method on synthetic and real-world datasets.

Submitted 14 November, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

Comments: Accepted at NeurIPS 2019

arXiv:1905.05461 [pdf, other]

Learning Generative Models across Incomparable Spaces

Authors: Charlotte Bunne, David Alvarez-Melis, Andreas Krause, Stefanie Jegelka

Abstract: Generative Adversarial Networks have shown remarkable success in learning a distribution that faithfully recovers a reference distribution in its entirety. However, in some cases, we may want to only learn some aspects (e.g., cluster or manifold structure), while modifying others (e.g., style, orientation or dimension). In this work, we propose an approach to learn generative models across such incomparable spaces, and demonstrate how to steer the learned distribution towards target properties. A key component of our model is the Gromov-Wasserstein distance, a notion of discrepancy that compares distributions relationally rather than absolutely. While this framework subsumes current generative models in identically reproducing distributions, its inherent flexibility allows application to tasks in manifold learning, relational learning and cross-domain learning.

Submitted 15 May, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

Comments: International Conference on Machine Learning (ICML)

Journal ref: Proceedings of Machine Learning Research (PMLR), 97 (2019)

arXiv:1904.09683 [pdf, other]

doi 10.1007/s00285-021-01552-y

Turing conditions for pattern forming systems on evolving manifolds

Authors: Robert A. Van Gorder, Václav Klika, Andrew L. Krause

Abstract: The study of pattern-forming instabilities in reaction-diffusion systems on growing or otherwise time-dependent domains arises in a variety of settings, including applications in developmental biology, spatial ecology, and experimental chemistry. Analyzing such instabilities is complicated, as there is a strong dependence of any spatially homogeneous base states on time, and the resulting structure of the linearized perturbations used to determine the onset of instability is inherently non-autonomous. We obtain general conditions for the onset and structure of diffusion driven instabilities in reaction-diffusion systems on domains which evolve in time, in terms of the time-evolution of the Laplace-Beltrami spectrum for the domain and functions which specify the domain evolution. Our results give sufficient conditions for diffusive instabilities phrased in terms of differential inequalities which are both versatile and straightforward to implement, despite the generality of the studied problem. These conditions generalize a large number of results known in the literature, such as the algebraic inequalities commonly used as a sufficient criterion for the Turing instability on static domains, and approximate asymptotic results valid for specific types of growth, or specific domains. We demonstrate our general Turing conditions on a variety of domains with different evolution laws, and in particular show how insight can be gained even when the domain changes rapidly in time, or when the homogeneous state is oscillatory, such as in the case of Turing-Hopf instabilities. Extensions to higher-order spatial systems are also included as a way of demonstrating the generality of the approach.

Submitted 17 October, 2019; v1 submitted 21 April, 2019; originally announced April 2019.

MSC Class: 35B36; 92C15; 70K50; 58C40; 58J32

Journal ref: Journal of Mathematical Biology 82 (2021) 4

arXiv:1903.12416 [pdf, other]

Online Variance Reduction with Mixtures

Authors: Zalán Borsos, Sebastian Curi, Kfir Y. Levy, Andreas Krause

Abstract: Adaptive importance sampling for stochastic optimization is a promising approach that offers improved convergence through variance reduction. In this work, we propose a new framework for variance reduction that enables the use of mixtures over predefined sampling distributions, which can naturally encode prior knowledge about the data. While these sampling distributions are fixed, the mixture weights are adapted during the optimization process. We propose VRM, a novel and efficient adaptive scheme that asymptotically recovers the best mixture weights in hindsight and can also accommodate sampling distributions over sets of points. We empirically demonstrate the versatility of VRM in a range of applications.

Submitted 29 March, 2019; originally announced March 2019.

arXiv:1903.04626 [pdf, other]

Safe Convex Learning under Uncertain Constraints

Authors: Ilnura Usmanova, Andreas Krause, Maryam Kamgarpour

Abstract: We address the problem of minimizing a convex smooth function $f(x)$ over a compact polyhedral set $D$ given a stochastic zeroth-order constraint feedback model. This problem arises in safety-critical machine learning applications, such as personalized medicine and robotics. In such cases, one needs to ensure constraints are satisfied while exploring the decision space to find the optimum of the loss function. We propose a new variant of the Frank-Wolfe algorithm, which applies to the case of uncertain linear constraints. Using robust optimization, we provide the convergence rate of the algorithm while guaranteeing feasibility of all iterates, with high probability.

Submitted 9 December, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

Comments: 15 pages, 7 figures, AISTATS 2019

arXiv:1903.00950 [pdf, ps, other]

Bounding Inefficiency of Equilibria in Continuous Actions Games using Submodularity and Curvature

Authors: Pier Giuseppe Sessa, Maryam Kamgarpour, Andreas Krause

Abstract: Games with continuous strategy sets arise in several machine learning problems (e.g. adversarial learning). For such games, simple no-regret learning algorithms exist in several cases and ensure convergence to coarse correlated equilibria (CCE). The efficiency of such equilibria with respect to a social function, however, is not well understood. In this paper, we define the class of valid utility games with continuous strategies and provide efficiency bounds for their CCEs. Our bounds rely on the social function being a monotone DR-submodular function. We further refine our bounds based on the curvature of the social function. Furthermore, we extend our efficiency bounds to a class of non-submodular functions that satisfy approximate submodularity properties. Finally, we show that valid utility games with continuous strategies can be designed to maximize monotone DR-submodular functions subject to disjoint constraints with approximation guarantees. The approximation guarantees we derive are based on the efficiency of the equilibria of such games and can improve the existing ones in the literature. We illustrate and validate our results on a budget allocation game and a sensor coverage problem.

Submitted 3 March, 2019; originally announced March 2019.

arXiv:1902.08480 [pdf, other]

AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs

Authors: Gabriele Abbati, Philippe Wenk, Michael A Osborne, Andreas Krause, Bernhard Schölkopf, Stefan Bauer

Abstract: Stochastic differential equations are an important modeling class in many disciplines. Consequently, there exist many methods relying on various discretization and numerical integration schemes. In this paper, we propose a novel, probabilistic model for estimating the drift and diffusion given noisy observations of the underlying stochastic system. Using state-of-the-art adversarial and moment matching inference techniques, we avoid the discretization schemes of classical approaches. This leads to significant improvements in parameter accuracy and robustness given random initial guesses. On four established benchmark systems, we compare the performance of our algorithms to state-of-the-art solutions based on extended Kalman filtering and Gaussian processes.

Submitted 28 May, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

Comments: Published at the Thirty-sixth International Conference on Machine Learning (ICML 2019)

arXiv:1902.08036 [pdf, other]

Multi-Player Bandits: The Adversarial Case

Authors: Pragnya Alatur, Kfir Y. Levy, Andreas Krause

Abstract: We consider a setting where multiple players sequentially choose among a common set of actions (arms). Motivated by a cognitive radio networks application, we assume that players incur a loss upon colliding, and that communication between players is not possible. Existing approaches assume that the system is stationary. Yet this assumption is often violated in practice, e.g., due to signal strength fluctuations. In this work, we design the first Multi-player Bandit algorithm that provably works in arbitrarily changing environments, where the losses of the arms may even be chosen by an adversary. This resolves an open problem posed by Rosenski, Shamir, and Szlak (2016).

Submitted 21 February, 2019; originally announced February 2019.

arXiv:1902.06278 [pdf, other]

ODIN: ODE-Informed Regression for Parameter and State Inference in Time-Continuous Dynamical Systems

Authors: Philippe Wenk, Gabriele Abbati, Michael A Osborne, Bernhard Schölkopf, Andreas Krause, Stefan Bauer

Abstract: Parameter inference in ordinary differential equations is an important problem in many applied sciences and in engineering, especially in a data-scarce setting. In this work, we introduce a novel generative modeling approach based on constrained Gaussian processes and leverage it to build a computationally and data efficient algorithm for state and parameter inference. In an extensive set of experiments, our approach outperforms the current state of the art for parameter inference both in terms of accuracy and computational cost. It also shows promising results for the much more challenging problem of model selection.

Submitted 5 December, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

Comments: Published at the Thirty-fourth AAAI Conference on Artificial Intelligence

arXiv:1902.05981 [pdf, other]

Adaptive Sequence Submodularity

Authors: Marko Mitrovic, Ehsan Kazemi, Moran Feldman, Andreas Krause, Amin Karbasi

Abstract: In many machine learning applications, one needs to interactively select a sequence of items (e.g., recommending movies based on a user's feedback) or make sequential decisions in a certain order (e.g., guiding an agent through a series of states). Not only do sequences already pose a dauntingly large search space, but we must also take into account past observations, as well as the uncertainty of future outcomes. Without further structure, finding an optimal sequence is notoriously challenging, if not completely intractable. In this paper, we view the problem of adaptive and sequential decision making through the lens of submodularity and propose an adaptive greedy policy with strong theoretical guarantees. Additionally, to demonstrate the practical utility of our results, we run experiments on Amazon product recommendation and Wikipedia link prediction tasks.

Submitted 20 June, 2019; v1 submitted 15 February, 2019; originally announced February 2019.

arXiv:1902.04783 [pdf, other]

Mathematical Notions vs. Human Perception of Fairness: A Descriptive Approach to Fairness for Machine Learning

Authors: Megha Srivastava, Hoda Heidari, Andreas Krause

Abstract: Fairness for Machine Learning has received considerable attention recently. Various mathematical formulations of fairness have been proposed, and it has been shown that it is impossible to satisfy all of them simultaneously. The literature so far has dealt with these impossibility results by quantifying the tradeoffs between different formulations of fairness. Our work takes a different perspective on this issue. Rather than requiring all notions of fairness to (partially) hold at the same time, we ask which one of them is the most appropriate given the societal domain in which the decision-making model is to be deployed. We take a descriptive approach and set out to identify the notion of fairness that best captures lay people's perception of fairness. We run adaptive experiments designed to pinpoint the notion of fairness most compatible with each participant's choices through a small number of tests. Perhaps surprisingly, we find that the most simplistic mathematical definition of fairness, namely demographic parity, most closely matches people's idea of fairness in two distinct application scenarios. This conclusion remains intact even when we explicitly tell the participants about the alternative, more complicated definitions of fairness, and we reduce the cognitive burden of evaluating those notions for them. Our findings have important implications for the Fair ML literature and the discourse on formalizing algorithmic fairness.

Submitted 8 December, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

arXiv:1902.03229 [pdf, other]

Adaptive and Safe Bayesian Optimization in High Dimensions via One-Dimensional Subspaces

Authors: Johannes Kirschner, Mojmír Mutný, Nicole Hiller, Rasmus Ischebeck, Andreas Krause

Abstract: Bayesian optimization is known to be difficult to scale to high dimensions, because the acquisition step requires solving a non-convex optimization problem in the same search space. In order to scale the method and keep its benefits, we propose an algorithm (LineBO) that restricts the problem to a sequence of iteratively chosen one-dimensional sub-problems that can be solved efficiently. We show that our algorithm converges globally and obtains a fast local rate when the function is strongly convex. Further, if the objective has an invariant subspace, our method automatically adapts to the effective dimension without changing the algorithm. When combined with the SafeOpt algorithm to solve the sub-problems, we obtain the first safe Bayesian optimization algorithm with theoretical guarantees applicable in high-dimensional settings. We evaluate our method on multiple synthetic benchmarks, where we obtain competitive performance. Further, we deploy our algorithm to optimize the beam intensity of the Swiss Free Electron Laser with up to 40 parameters while satisfying safe operation constraints.

Submitted 28 May, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

arXiv:1901.03357 [pdf, other]

No-Regret Bayesian Optimization with Unknown Hyperparameters

Authors: Felix Berkenkamp, Angela P. Schoellig, Andreas Krause

Abstract: Bayesian optimization (BO) based on Gaussian process models is a powerful paradigm to optimize black-box functions that are expensive to evaluate. While several BO algorithms provably converge to the global optimum of the unknown function, they assume that the hyperparameters of the kernel are known in advance. This is not the case in practice and misspecification often causes these algorithms to converge to poor local optima. In this paper, we present the first BO algorithm that is provably no-regret and converges to the optimum without knowledge of the hyperparameters. During optimization we slowly adapt the hyperparameters of stationary kernels and thereby expand the associated function class over time, so that the BO algorithm considers more complex function candidates. Based on the theoretical insights, we propose several practical algorithms that achieve the empirical sample efficiency of BO with online hyperparameter estimation, but retain theoretical convergence guarantees. We evaluate our method on several benchmark problems.

Submitted 1 April, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

arXiv:1812.07544 [pdf, other]

Information-Directed Exploration for Deep Reinforcement Learning

Authors: Nikolay Nikolov, Johannes Kirschner, Felix Berkenkamp, Andreas Krause

Abstract: Efficient exploration remains a major challenge for reinforcement learning. One reason is that the variability of the returns often depends on the current state and action, and is therefore heteroscedastic. Classical exploration strategies such as upper confidence bound algorithms and Thompson sampling fail to appropriately account for heteroscedasticity, even in the bandit setting. Motivated by recent findings that address this issue in bandits, we propose to use Information-Directed Sampling (IDS) for exploration in reinforcement learning. As our main contribution, we build on recent advances in distributional reinforcement learning and propose a novel, tractable approximation of IDS for deep Q-learning. The resulting exploration strategy explicitly accounts for both parametric uncertainty and heteroscedastic observation noise. We evaluate our method on Atari games and demonstrate a significant improvement over alternative approaches.

Submitted 24 March, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

arXiv:1811.05788 [pdf, other]

Learning to Compensate Photovoltaic Power Fluctuations from Images of the Sky by Imitating an Optimal Policy

Authors: Robin Spiess, Felix Berkenkamp, Jan Poland, Andreas Krause

Abstract: The energy output of photovoltaic (PV) power plants depends on the environment and thus fluctuates over time. As a result, PV power can cause instability in the power grid, in particular when increasingly used. Limiting the rate of change of the power output is a common way to mitigate these fluctuations, often with the help of large batteries. A reactive controller that uses these batteries to compensate ramps works in practice, but causes stress on the battery due to a high energy throughput. In this paper, we present a deep learning approach that uses images of the sky to compensate power fluctuations predictively and reduces battery stress. In particular, we show that the optimal control policy can be computed using information that is only available in hindsight. Based on this, we use imitation learning to train a neural network that approximates this hindsight-optimal policy, but uses only currently available sky images and sensor data. We evaluate our method on a large dataset of measurements and images from a real power plant and show that the trained policy reduces stress on the battery.

Submitted 13 November, 2018; originally announced November 2018.

Comments: 7 pages, 7 figures

arXiv:1811.05512 [pdf, other]

A domain agnostic measure for monitoring and evaluating GANs

Authors: Paulina Grnarova, Kfir Y Levy, Aurelien Lucchi, Nathanael Perraudin, Ian Goodfellow, Thomas Hofmann, Andreas Krause

Abstract: Generative Adversarial Networks (GANs) have shown remarkable results in modeling complex distributions, but their evaluation remains an unsettled issue. Evaluations are essential for: (i) relative assessment of different models and (ii) monitoring the progress of a single model throughout training. The latter cannot be determined by simply inspecting the generator and discriminator loss curves as they behave non-intuitively. We leverage the notion of duality gap from game theory to propose a measure that addresses both (i) and (ii) at a low computational cost. Extensive experiments show the effectiveness of this measure to rank different GAN models and capture the typical GAN failure scenarios, including mode collapse and non-convergent behaviours. This evaluation metric also provides meaningful monitoring on the progression of the loss during training. It highly correlates with FID on natural image datasets, and with domain specific scores for text, sound and cosmology data where FID is not directly suitable. In particular, our proposed metric requires no labels or a pretrained classifier, making it domain agnostic.

Submitted 15 July, 2020; v1 submitted 13 November, 2018; originally announced November 2018.

arXiv:1810.11050 [pdf, ps, other]

C-motivic modular forms

Authors: Bogdan Gheorghe, Daniel C. Isaksen, Achim Krause, Nicolas Ricka

Abstract: We construct a topological model for cellular, 2-complete, stable C-motivic homotopy theory that uses no algebro-geometric foundations. We compute the Steenrod algebra in this context, and we construct a "motivic modular forms" spectrum over C.

Submitted 25 October, 2018; originally announced October 2018.

MSC Class: Primary 14F42; 55N34; 55S10; Secondary 55Q45; 55T15

arXiv:1809.03400 [pdf, other]

A Moral Framework for Understanding of Fair ML through Economic Models of Equality of Opportunity

Authors: Hoda Heidari, Michele Loi, Krishna P. Gummadi, Andreas Krause

Abstract: We map the recently proposed notions of algorithmic fairness to economic models of Equality of opportunity (EOP), an extensively studied ideal of fairness in political philosophy. We formally show that through our conceptual mapping, many existing definitions of algorithmic fairness, such as predictive value parity and equality of odds, can be interpreted as special cases of EOP. In this respect, our work serves as a unifying moral framework for understanding existing notions of algorithmic fairness. Most importantly, this framework allows us to explicitly spell out the moral assumptions underlying each notion of fairness, and interpret recent fairness impossibility results in a new light. Last but not least, inspired by luck egalitarian models of EOP, we propose a new family of measures for algorithmic fairness. We illustrate our proposal empirically and show that employing a measure of algorithmic (un)fairness when its underlying moral assumptions are not satisfied can have devastating consequences for the disadvantaged group's welfare.

Submitted 27 November, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

arXiv:1808.00924 [pdf, other]

The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamical Systems

Authors: Spencer M. Richards, Felix Berkenkamp, Andreas Krause

Abstract: Learning algorithms have shown considerable prowess in simulation by allowing robots to adapt to uncertain environments and improve their performance. However, such algorithms are rarely used in practice on safety-critical systems, since the learned policy typically does not yield any safety guarantees. That is, the required exploration may cause physical harm to the robot or its environment. In this paper, we present a method to learn accurate safety certificates for nonlinear, closed-loop dynamical systems. Specifically, we construct a neural network Lyapunov function and a training algorithm that adapts it to the shape of the largest safe region in the state space. The algorithm relies only on knowledge of inputs and outputs of the dynamics, rather than on any specific model structure. We demonstrate our method by learning the safe region of attraction for a simulated inverted pendulum. Furthermore, we discuss how our method can be used in safe learning algorithms together with statistical models of dynamical systems.

Submitted 1 October, 2018; v1 submitted 2 August, 2018; originally announced August 2018.

Comments: Proc. of the 2nd Conference on Robot Learning (CoRL 2018)

arXiv:1807.01808 [pdf, ps, other]

Discrete Sampling using Semigradient-based Product Mixtures

Authors: Alkis Gotovos, Hamed Hassani, Andreas Krause, Stefanie Jegelka

Abstract: We consider the problem of inference in discrete probabilistic models, that is, distributions over subsets of a finite ground set. These encompass a range of well-known models in machine learning, such as determinantal point processes and Ising models. Locally-moving Markov chain Monte Carlo algorithms, such as the Gibbs sampler, are commonly used for inference in such models, but their convergence is, at times, prohibitively slow. This is often caused by state-space bottlenecks that greatly hinder the movement of such samplers. We propose a novel sampling strategy that uses a specific mixture of product distributions to propose global moves and, thus, accelerate convergence. Furthermore, we show how to construct such a mixture using semigradient information. We illustrate the effectiveness of combining our sampler with existing ones, both theoretically on an example model, as well as practically on three models learned from real-world data sets.

Submitted 9 July, 2018; v1 submitted 4 July, 2018; originally announced July 2018.

arXiv:1806.07200 [pdf, other]

Adaptive Input Estimation in Linear Dynamical Systems with Applications to Learning-from-Observations

Authors: Sebastian Curi, Kfir Y. Levy, Andreas Krause

Abstract: We address the problem of estimating the inputs of a dynamical system from measurements of the system's outputs. To this end, we introduce a novel estimation algorithm that explicitly trades off bias and variance to optimally reduce the overall estimation error. This optimal trade-off is done efficiently and adaptively in every time step. Experimentally, we show that our method often produces estimates with substantially lower error compared to the state-of-the-art. Finally, we consider the more complex Learning-from-Observations framework, where an agent should learn a controller from the outputs of an expert's demonstration. We incorporate our estimation algorithm as a building block inside this framework and show that it enables learning controllers successfully.

Submitted 19 September, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

Comments: CDC 2019

arXiv:1806.04959 [pdf, other]

Fairness Behind a Veil of Ignorance: A Welfare Analysis for Automated Decision Making

Authors: Hoda Heidari, Claudio Ferrari, Krishna P. Gummadi, Andreas Krause

Abstract: We draw attention to an important, yet largely overlooked aspect of evaluating fairness for automated decision making systems---namely risk and welfare considerations. Our proposed family of measures corresponds to the long-established formulations of cardinal social welfare in economics, and is justified by the Rawlsian conception of fairness behind a veil of ignorance. The convex formulation of our welfare-based measures of fairness allows us to integrate them as a constraint into any convex loss minimization pipeline. Our empirical analysis reveals interesting trade-offs between our proposal and (a) prediction accuracy, (b) group discrimination, and (c) Dwork et al.'s notion of individual fairness. Furthermore and perhaps most importantly, our work provides both heuristic justification and empirical evidence suggesting that a lower-bound on our measures often leads to bounded inequality in algorithmic outcomes; hence presenting the first computationally feasible mechanism for bounding individual-level inequality.

Submitted 11 January, 2019; v1 submitted 13 June, 2018; originally announced June 2018.

Comments: Conference: Thirty-second Conference on Neural Information Processing Systems (NIPS 2018)

arXiv:1805.08322 [pdf, other]

Teaching Multiple Concepts to a Forgetful Learner

Authors: Anette Hunziker, Yuxin Chen, Oisin Mac Aodha, Manuel Gomez Rodriguez, Andreas Krause, Pietro Perona, Yisong Yue, Adish Singla

Abstract: How can we help a forgetful learner learn multiple concepts within a limited time frame? While there have been extensive studies in designing optimal schedules for teaching a single concept given a learner's memory model, existing approaches for teaching multiple concepts are typically based on heuristic scheduling techniques without theoretical guarantees. In this paper, we look at the problem from the perspective of discrete optimization and introduce a novel algorithmic framework for teaching multiple concepts with strong performance guarantees. Our framework is both generic, allowing the design of teaching schedules for different memory models, and also interactive, allowing the teacher to adapt the schedule to the underlying forgetting mechanisms of the learner. Furthermore, for a well-known memory model, we are able to identify a regime of model parameters where our framework is guaranteed to achieve high performance. We perform extensive evaluations using simulations along with real user studies in two concrete applications: (i) an educational app for online vocabulary teaching; and (ii) an app for teaching novices how to recognize animal species from images. Our results demonstrate the effectiveness of our algorithm compared to popular heuristic approaches.

Submitted 25 October, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

Comments: NeurIPS 2019

arXiv:1805.07482 [pdf, other]

Optimal DR-Submodular Maximization and Applications to Provable Mean Field Inference

Authors: An Bian, Joachim M. Buhmann, Andreas Krause

Abstract: Mean field inference in probabilistic models is generally a highly nonconvex problem. Existing optimization methods, e.g., coordinate ascent algorithms, can only generate local optima. In this work we propose provable mean field methods for probabilistic log-submodular models and their posterior agreement (PA) with strong approximation guarantees. The main algorithmic technique is a new Double Greedy scheme, termed DR-DoubleGreedy, for continuous DR-submodular maximization with box-constraints. It is a one-pass algorithm with linear time complexity, reaching the optimal 1/2 approximation ratio, which may be of independent interest. We validate the superior performance of our algorithms against baseline algorithms on both synthetic and real-world datasets.

Submitted 29 November, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

Comments: 28 pages

arXiv:1805.07095 [pdf, other]

Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations

Authors: Mark Pfeiffer, Samarth Shukla, Matteo Turchetta, Cesar Cadena, Andreas Krause, Roland Siegwart, Juan Nieto

Abstract: This work presents a case study of a learning-based approach for target driven map-less navigation. The underlying navigation model is an end-to-end neural network which is trained using a combination of expert demonstrations, imitation learning (IL) and reinforcement learning (RL). While RL and IL suffer from a large sample complexity and the distribution mismatch problem, respectively, we show that leveraging prior expert demonstrations for pre-training can reduce the training time to reach at least the same level of performance compared to plain RL by a factor of 5. We present a thorough evaluation of different combinations of expert demonstrations, different RL algorithms and reward functions, both in simulation and on a real robotic platform. Our results show that the final model outperforms both standalone approaches in the number of successful navigation tasks. In addition, the RL reward function can be significantly simplified when using pre-training, e.g. by using a sparse reward only. The learned navigation policy is able to generalize to unseen and real-world environments.

Submitted 31 August, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

Comments: 8 pages, submitted for publication in the IEEE Robotics and Automation Letters

arXiv:1804.04378 [pdf, other]

Fast Gaussian Process Based Gradient Matching for Parameter Identification in Systems of Nonlinear ODEs

Authors: Philippe Wenk, Alkis Gotovos, Stefan Bauer, Nico Gorbach, Andreas Krause, Joachim M. Buhmann

Abstract: Parameter identification and comparison of dynamical systems is a challenging task in many fields. Bayesian approaches based on Gaussian process regression over time-series data have been successfully applied to infer the parameters of a dynamical system without explicitly solving it. While the benefits in computational cost are well established, a rigorous mathematical framework has been missing. We offer a novel interpretation which leads to a better understanding and improvements in state-of-the-art performance in terms of accuracy for nonlinear dynamical systems.

Submitted 1 March, 2019; v1 submitted 12 April, 2018; originally announced April 2018.

Comments: accepted at AISTATS 2019

arXiv:1803.08287 [pdf, other]

Learning-based Model Predictive Control for Safe Exploration

Authors: Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Andreas Krause

Abstract: Learning-based methods have been successful in solving complex control tasks without significant prior knowledge about the system. However, these methods typically do not provide any safety guarantees, which prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that can provide provable high-probability safety guarantees. To this end, we exploit regularity assumptions on the dynamics in terms of a Gaussian process prior to construct provably accurate confidence intervals on predicted trajectories. Unlike previous approaches, we do not assume that model uncertainties are independent. Based on these predictions, we guarantee that trajectories satisfy safety constraints. Moreover, we use a terminal set constraint to recursively guarantee the existence of safe control actions at every iteration. In our experiments, we show that the resulting algorithm can be used to safely and efficiently explore and learn about dynamic systems.

Submitted 7 November, 2018; v1 submitted 22 March, 2018; originally announced March 2018.

Comments: Proc. of the Conference on Decision and Control, 2018

arXiv:1803.02147 [pdf, other]

Amplitude death criteria for coupled complex Ginzburg-Landau systems

Authors: Robert A. Van Gorder, Andrew L. Krause, James A. Kwiecinski

Abstract: Amplitude death, which occurs in a system when one or more macroscopic wavefunctions collapse to zero, has been observed in mutually coupled solid-state lasers, analog circuits, and thermoacoustic oscillators, to name a few applications. While studies have considered amplitude death on oscillator systems and in externally forced complex Ginzburg-Landau systems, a route to amplitude death has not been studied in autonomous continuum systems. We derive simple analytic conditions for the onset of amplitude death of one macroscopic wavefunction in a system of two coupled complex Ginzburg-Landau equations with general nonlinear self- and cross-interaction terms. Our results give a more general theoretical underpinning for recent amplitude death results reported in the literature, and suggest an approach for tuning parameters in such systems so that they either permit or prohibit amplitude death of a wavefunction (depending on the application). Numerical simulation of the coupled complex Ginzburg-Landau equations, for examples including cubic, cubic-quintic, and saturable nonlinearities, is used to illustrate the analytical results.

Submitted 29 January, 2019; v1 submitted 6 March, 2018; originally announced March 2018.

Comments: 7 pages, 4 figures

arXiv:1803.01785 [pdf, other]

Differentiable Submodular Maximization

Authors: Sebastian Tschiatschek, Aytunc Sahin, Andreas Krause

Abstract: We consider learning of submodular functions from data. These functions are important in machine learning and have a wide range of applications, e.g. data summarization, feature selection and active learning. Despite their combinatorial nature, submodular functions can be maximized approximately with strong theoretical guarantees in polynomial time. Typically, learning the submodular function and optimization of that function are treated separately, i.e. the function is first learned using a proxy objective and subsequently maximized. In contrast, we show how to perform learning and optimization jointly. By interpreting the output of greedy maximization algorithms as distributions over sequences of items and smoothening these distributions, we obtain a differentiable objective. In this way, we can differentiate through the maximization algorithms and optimize the model to work well with the optimization algorithm. We theoretically characterize the error made by our approach, yielding insights into the tradeoff of smoothness and accuracy. We demonstrate the effectiveness of our approach for jointly learning and optimizing on synthetic maximum cut data, and on real world applications such as product recommendation and image collection summarization.

Submitted 14 June, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

arXiv:1802.09110 [pdf, other]

Submodularity on Hypergraphs: From Sets to Sequences

Authors: Marko Mitrovic, Moran Feldman, Andreas Krause, Amin Karbasi

Abstract: In a nutshell, submodular functions encode an intuitive notion of diminishing returns. As a result, submodularity appears in many important machine learning tasks such as feature selection and data summarization. Although there has been a large volume of work devoted to the study of submodular functions in recent years, the vast majority of this work has been focused on algorithms that output sets, not sequences. However, in many settings, the order in which we output items can be just as important as the items themselves. To extend the notion of submodularity to sequences, we use a directed graph on the items where the edges encode the additional value of selecting items in a particular order. Existing theory is limited to the case where this underlying graph is a directed acyclic graph. In this paper, we introduce two new algorithms that provably give constant factor approximations for general graphs and hypergraphs having bounded in or out degrees. Furthermore, we show the utility of our new algorithms for real-world applications in movie recommendation, online link prediction, and the design of course sequences for MOOCs.

Submitted 15 March, 2018; v1 submitted 25 February, 2018; originally announced February 2018.

arXiv:1802.04715 [pdf, other]

Online Variance Reduction for Stochastic Optimization

Authors: Zalán Borsos, Andreas Krause, Kfir Y. Levy

Abstract: Modern stochastic optimization methods often rely on uniform sampling which is agnostic to the underlying characteristics of the data. This might degrade the convergence by yielding estimates that suffer from a high variance. A possible remedy is to employ non-uniform importance sampling techniques, which take the structure of the dataset into account. In this work, we investigate a recently proposed setting which poses variance reduction as an online optimization problem with bandit feedback. We devise a novel and efficient algorithm for this setting that finds a sequence of importance sampling distributions competitive with the best fixed distribution in hindsight, the first result of this kind. While we present our method for sampling datapoints, it naturally extends to selecting coordinates or even blocks thereof. Empirical validations underline the benefits of our method in several settings.

Submitted 6 June, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

Comments: COLT 2018

arXiv:1801.09667 [pdf, other]

Information Directed Sampling and Bandits with Heteroscedastic Noise

Authors: Johannes Kirschner, Andreas Krause

Abstract: In the stochastic bandit problem, the goal is to maximize an unknown function via a sequence of noisy evaluations. Typically, the observation noise is assumed to be independent of the evaluation point and to satisfy a tail bound uniformly on the domain; a restrictive assumption for many applications. In this work, we consider bandits with heteroscedastic noise, where we explicitly allow the noise distribution to depend on the evaluation point. We show that this leads to new trade-offs for information and regret, which are not taken into account by existing approaches like upper confidence bound algorithms (UCB) or Thompson Sampling. To address these shortcomings, we introduce a frequentist regret analysis framework, that is similar to the Bayesian framework of Russo and Van Roy (2014), and we prove a new high-probability regret bound for general, possibly randomized policies, which depends on a quantity we refer to as regret-information ratio. From this bound, we define a frequentist version of Information Directed Sampling (IDS) to minimize the regret-information ratio over all possible action sampling distributions. This further relies on concentration inequalities for online least squares regression in separable Hilbert spaces, which we generalize to the case of heteroscedastic noise. We then formulate several variants of IDS for linear and reproducing kernel Hilbert space response functions, yielding novel algorithms for Bayesian optimization. We also prove frequentist regret bounds, which in the homoscedastic case recover known bounds for UCB, but can be much better when the noise is heteroscedastic. Empirically, we demonstrate in a linear setting with heteroscedastic noise, that some of our methods can outperform UCB and Thompson Sampling, while staying competitive when the noise is homoscedastic.

Submitted 19 April, 2018; v1 submitted 29 January, 2018; originally announced January 2018.

Comments: Figure 1a,2a updated

arXiv:1801.05057 [pdf, ps, other]

A simple protocol for certifying graph states and applications in quantum networks

Authors: Damian Markham, Alexandra Krause

Abstract: We present a simple protocol for certifying graph states in quantum networks using stabiliser measurements. The certification statements can easily be applied to different protocols using graph states. We see for example how it can be used for measurement-based verified quantum computation, certified sampling of random unitaries, quantum metrology, and sharing quantum secrets over untrusted channels.

Submitted 15 January, 2018; originally announced January 2018.

Comments: 6 pages

arXiv:1801.02458 [pdf, other]

doi 10.1142/S0218127418300136

Chaotic dynamics in the planar gravitational many-body problem with rigid body rotations

Authors: James A. Kwiecinski, Attila Kovacs, Andrew L. Krause, Ferran Brosa Planella, Robert A. Van Gorder

Abstract: The discovery of Pluto's small moons in the last decade brought attention to the dynamics of the dwarf planet's satellites. With such systems in mind, we study a planar $N$-body system in which all the bodies are point masses, except for a single rigid body. We then present a reduced model consisting of a planar $N$-body problem with the rigid body treated as a 1D continuum (i.e. the body is treated as a rod with an arbitrary mass distribution). Such a model provides a good approximation to highly asymmetric geometries, such as the recently observed interstellar asteroid 'Oumuamua, but is also amenable to analysis. We analytically demonstrate the existence of homoclinic chaos in the case where one of the orbits is nearly circular by way of the Melnikov method, and give numerical evidence for chaos when the orbits are more complicated. We show that the extent of chaos in parameter space is strongly tied to the deviations from a purely circular orbit. These results suggest that chaos is ubiquitous in many-body problems when one or more of the rigid bodies exhibits non-spherical and highly asymmetric geometries. The excitation of chaotic rotations does not appear to require tidal dissipation, obliquity variation, or orbital resonance. Such dynamics give a possible explanation for routes to chaotic dynamics observed in $N$-body systems such as the Pluto system where some of the bodies are highly non-spherical.

Submitted 29 May, 2018; v1 submitted 3 January, 2018; originally announced January 2018.

Comments: 23 pages, 7 figures

arXiv:1711.09025 [pdf, other]

Fake News Detection in Social Networks via Crowd Signals

Authors: Sebastian Tschiatschek, Adish Singla, Manuel Gomez Rodriguez, Arpit Merchant, Andreas Krause

Abstract: Our work considers leveraging crowd signals for detecting fake news and is motivated by tools recently introduced by Facebook that enable users to flag fake news. By aggregating users' flags, our goal is to select a small subset of news every day, send them to an expert (e.g., via a third-party fact-checking organization), and stop the spread of news identified as fake by an expert. The main objective of our work is to minimize the spread of misinformation by stopping the propagation of fake news in the network. It is especially challenging to achieve this objective as it requires detecting fake news with high-confidence as quickly as possible. We show that in order to leverage users' flags efficiently, it is crucial to learn about users' flagging accuracy. We develop a novel algorithm, DETECTIVE, that performs Bayesian inference for detecting fake news and jointly learns about users' flagging accuracy over time. Our algorithm employs posterior sampling to actively trade off exploitation (selecting news that maximize the objective value at a given epoch) and exploration (selecting news that maximize the value of information towards learning about users' flagging accuracy). We demonstrate the effectiveness of our approach via extensive experiments and show the power of leveraging community signals for fake news detection.

Submitted 2 March, 2018; v1 submitted 24 November, 2017; originally announced November 2017.

arXiv:1711.08331 [pdf, other]

Learning User Preferences to Incentivize Exploration in the Sharing Economy

Authors: Christoph Hirnschall, Adish Singla, Sebastian Tschiatschek, Andreas Krause

Abstract: We study platforms in the sharing economy and discuss the need for incentivizing users to explore options that otherwise would not be chosen. For instance, rental platforms such as Airbnb typically rely on customer reviews to provide users with relevant information about different options. Yet, often a large fraction of options does not have any reviews available. Such options are frequently neglected as viable choices, and in turn are unlikely to be evaluated, creating a vicious cycle. Platforms can engage users to deviate from their preferred choice by offering monetary incentives for choosing a different option instead. To efficiently learn the optimal incentives to offer, we consider structural information in user preferences and introduce a novel algorithm - Coordinated Online Learning (CoOL) - for learning with structural information modeled as convex constraints. We provide formal guarantees on the performance of our algorithm and test the viability of our approach in a user study with data of apartments on Airbnb. Our findings suggest that our approach is well-suited to learn appropriate incentives and increase exploration on the investigated platform.

Submitted 24 November, 2017; v1 submitted 17 November, 2017; originally announced November 2017.

Comments: Longer version of AAAI'18 paper. arXiv admin note: text overlap with arXiv:1702.02849

arXiv:1711.06740 [pdf, other]

Information Gathering with Peers: Submodular Optimization with Peer-Prediction Constraints

Authors: Goran Radanovic, Adish Singla, Andreas Krause, Boi Faltings

Abstract: We study a problem of optimal information gathering from multiple data providers that need to be incentivized to provide accurate information. This problem arises in many real world applications that rely on crowdsourced data sets, but where the process of obtaining data is costly. A notable example of such a scenario is crowd sensing. To this end, we formulate the problem of optimal information gathering as maximization of a submodular function under a budget constraint, where the budget represents the total expected payment to data providers. Contrary to the existing approaches, we base our payments on incentives for accuracy and truthfulness, in particular, peer-prediction methods that score each of the selected data providers against its best peer, while ensuring that the minimum expected payment is above a given threshold. We first show that the problem at hand is hard to approximate within a constant factor that is not dependent on the properties of the payment function. However, for given topological and analytical properties of the instance, we construct two greedy algorithms, respectively called PPCGreedy and PPCGreedyIter, and establish theoretical bounds on their performance w.r.t. the optimal solution. Finally, we evaluate our methods using a realistic crowd sensing testbed.

Submitted 24 November, 2017; v1 submitted 17 November, 2017; originally announced November 2017.

Comments: Longer version of AAAI'18 paper

arXiv:1711.02515 [pdf, other]

Continuous DR-submodular Maximization: Structure and Algorithms

Authors: An Bian, Kfir Y. Levy, Andreas Krause, Joachim M. Buhmann

Abstract: DR-submodular continuous functions are important objectives with wide real-world applications spanning MAP inference in determinantal point processes (DPPs), and mean-field inference for probabilistic submodular models, amongst others. DR-submodularity captures a subclass of non-convex functions that enables both exact minimization and approximate maximization in polynomial time. In this work we study the problem of maximizing non-monotone DR-submodular continuous functions under general down-closed convex constraints. We start by investigating geometric properties that underlie such objectives, e.g., a strong relation between (approximately) stationary points and global optimum is proved. These properties are then used to devise two optimization algorithms with provable guarantees. Concretely, we first devise a "two-phase" algorithm with $1/4$ approximation guarantee. This algorithm allows the use of existing methods for finding (approximately) stationary points as a subroutine, thus, harnessing recent progress in non-convex optimization. Then we present a non-monotone Frank-Wolfe variant with $1/e$ approximation guarantee and sublinear convergence rate. Finally, we extend our approach to a broader class of generalized DR-submodular continuous functions, which captures a wider spectrum of applications. Our theoretical findings are validated on synthetic and real-world problem instances.

Submitted 24 May, 2019; v1 submitted 3 November, 2017; originally announced November 2017.

Comments: Published in NIPS 2017

arXiv:1711.01566 [pdf, other]

Stochastic Submodular Maximization: The Case of Coverage Functions

Authors: Mohammad Reza Karimi, Mario Lucic, Hamed Hassani, Andreas Krause

Abstract: Stochastic optimization of continuous objectives is at the heart of modern machine learning. However, many important problems are of discrete nature and often involve submodular objectives. We seek to unleash the power of stochastic continuous optimization, namely stochastic gradient descent and its variants, to such discrete problems. We first introduce the problem of stochastic submodular optimization, where one needs to optimize a submodular objective which is given as an expectation. Our model captures situations where the discrete objective arises as an empirical risk (e.g., in the case of exemplar-based clustering), or is given as an explicit stochastic model (e.g., in the case of influence maximization in social networks). By exploiting that common extensions act linearly on the class of submodular functions, we employ projected stochastic gradient ascent and its variants in the continuous domain, and perform rounding to obtain discrete solutions. We focus on the rich and widely used family of weighted coverage functions. We show that our approach yields solutions that are guaranteed to match the optimal approximation guarantees, while reducing the computational cost by several orders of magnitude, as we demonstrate empirically.

Submitted 5 November, 2017; originally announced November 2017.

Comments: 31st Conference on Neural Information Processing Systems (NIPS 2017)

arXiv:1710.07968 [pdf, other]

doi 10.1016/j.aop.2018.07.003

Coupled complex Ginzburg-Landau systems with saturable nonlinearity and asymmetric cross-phase modulation

Authors: Robert A. Van Gorder, Andrew L. Krause, Ferran Brosa Planella, Abigail M. Burton

Abstract: We formulate and study dynamics from a complex Ginzburg-Landau system with saturable nonlinearity, including asymmetric cross-phase modulation (XPM) parameters. Such equations can model phenomena described by complex Ginzburg-Landau systems under the added assumption of saturable media. When the saturation parameter is set to zero, we recover a general complex cubic Ginzburg-Landau system with XPM. We first derive conditions for the existence of bounded dynamics, approximating the absorbing set for solutions. We use this to then determine conditions for amplitude death of a single wavefunction. We also construct exact plane wave solutions, and determine conditions for their modulational instability. In a degenerate limit where dispersion and nonlinearity balance, we reduce our system to a saturable nonlinear Schrödinger system with XPM parameters, and we demonstrate the existence and behavior of spatially heterogeneous stationary solutions in this limit. Using numerical simulations we verify the aforementioned analytical results, while also demonstrating other interesting emergent features of the dynamics, such as spatiotemporal chaos in the presence of modulational instability. In other regimes, coherent patterns including uniform states or banded structures arise, corresponding to certain stable stationary states.
For sufficiently large yet equal XPM parameters, we observe a segregation of wavefunctions into different regions of the spatial domain, while when XPM parameters are large and take different values, one wavefunction may decay to zero in finite time over the spatial domain (in agreement with the amplitude death predicted analytically). While saturation will often regularize the dynamics, such transient dynamics can still be observed - and in some cases even prolonged - as the saturability of the media is increased, as the saturation may act to slow the timescale.

Submitted 17 June, 2018; v1 submitted 22 October, 2017; originally announced October 2017.

Comments: 36 pages

arXiv:1709.01006 [pdf, other]

Learning Implicit Generative Models Using Differentiable Graph Tests

Authors: Josip Djolonga, Andreas Krause

Abstract: Recently, there has been a growing interest in the problem of learning rich implicit models - those from which we can sample, but cannot evaluate their density. These models apply some parametric function, such as a deep network, to a base measure, and are learned end-to-end using stochastic optimization. One strategy of devising a loss function is through the statistics of two-sample tests - if we can fool a statistical test, the learned distribution should be a good model of the true data. However, not all tests can easily fit into this framework, as they might not be differentiable with respect to the data points, and hence with respect to the parameters of the implicit model. Motivated by this problem, in this paper we show how two such classical tests, the Friedman-Rafsky and k-nearest neighbour tests, can be effectively smoothed using ideas from undirected graphical models - the matrix tree theorem and cardinality potentials. Moreover, as we show experimentally, smoothing can significantly increase the power of the test, which might be of independent interest. Finally, we apply our method to learn implicit models.

Submitted 4 September, 2017; originally announced September 2017.

arXiv:1706.03968 [pdf, other]

Asynchronous Graph Pattern Matching on Multiprocessor Systems

Authors: Alexander Krause, Annett Ungethüm, Thomas Kissinger, Dirk Habich, Wolfgang Lehner

Abstract: Pattern matching on large graphs is the foundation for a variety of application domains. Strict latency requirements and continuously increasing graph sizes demand the usage of highly parallel in-memory graph processing engines that need to consider non-uniform memory access (NUMA) and concurrency issues to scale up on modern multiprocessor systems. To tackle these aspects, graph partitioning becomes increasingly important. Hence, we present a technique to process graph pattern matching on NUMA systems in this paper. As a scalable pattern matching processing infrastructure, we leverage a data-oriented architecture that preserves data locality and minimizes concurrency-related bottlenecks on NUMA systems. We show in detail how graph pattern matching can be asynchronously processed on a multiprocessor system.

Submitted 14 June, 2017; v1 submitted 13 June, 2017; originally announced June 2017.

Comments: 14 Pages, Extended version for ADBIS 2017

arXiv:1706.03583 [pdf, other]

Streaming Non-monotone Submodular Maximization: Personalized Video Summarization on the Fly

Authors: Baharan Mirzasoleiman, Stefanie Jegelka, Andreas Krause

Abstract: The need for real time analysis of rapidly producing data streams (e.g., video and image streams) motivated the design of streaming algorithms that can efficiently extract and summarize useful information from massive data "on the fly". Such problems can often be reduced to maximizing a submodular set function subject to various constraints. While efficient streaming methods have been recently developed for monotone submodular maximization, in a wide range of applications, such as video summarization, the underlying utility function is non-monotone, and there are often various constraints imposed on the optimization problem to consider privacy or personalization. We develop the first efficient single pass streaming algorithm, Streaming Local Search, that for any streaming monotone submodular maximization algorithm with approximation guarantee $α$ under a collection of independence systems ${\cal I}$, provides a constant $1/\big(1+2/\sqrt{α}+1/α+2d(1+\sqrt{α})\big)$ approximation guarantee for maximizing a non-monotone submodular function under the intersection of ${\cal I}$ and $d$ knapsack constraints. Our experiments show that for video summarization, our method runs more than 1700 times faster than previous work, while maintaining practically the same performance.

Submitted 26 December, 2017; v1 submitted 12 June, 2017; originally announced June 2017.

arXiv:1706.03269 [pdf, other]

An Online Learning Approach to Generative Adversarial Networks

Authors: Paulina Grnarova, Kfir Y. Levy, Aurelien Lucchi, Thomas Hofmann, Andreas Krause

Abstract: We consider the problem of training generative models with a Generative Adversarial Network (GAN). Although GANs can accurately model complex distributions, they are known to be difficult to train due to instabilities caused by a difficult minimax optimization problem. In this paper, we view the problem of training GANs as finding a mixed strategy in a zero-sum game. Building on ideas from online learning we propose a novel training method named Chekhov GAN. On the theory side, we show that our method provably converges to an equilibrium for semi-shallow GAN architectures, i.e. architectures where the discriminator is a one layer network and the generator is arbitrary. On the practical side, we develop an efficient heuristic guided by our theoretical results, which we apply to commonly used deep GAN architectures. On several real world tasks our approach exhibits improved stability and performance compared to standard GAN training.

Submitted 10 June, 2017; originally announced June 2017.

arXiv:1705.08551 [pdf, other]

Safe Model-based Reinforcement Learning with Stability Guarantees

Authors: Felix Berkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause

Abstract: Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world systems. As a consequence, learning algorithms are rarely applied on safety-critical systems in the real world. In this paper, we present a learning algorithm that explicitly considers safety, defined in terms of stability guarantees. Specifically, we extend control-theoretic results on Lyapunov stability verification and show how to use statistical models of the dynamics to obtain high-performance control policies with provable stability certificates. Moreover, under additional regularity assumptions in terms of a Gaussian process prior, we prove that one can effectively and safely collect data in order to learn about the dynamics and thus both improve control performance and expand the safe region of the state space. In our experiments, we show how the resulting algorithm can safely optimize a neural network policy on a simulated inverted pendulum, without the pendulum ever falling down.

Submitted 13 November, 2017; v1 submitted 23 May, 2017; originally announced May 2017.

Comments: Proc. of Neural Information Processing Systems (NIPS), 2017

arXiv:1705.07323 [pdf, ps, other]

doi 10.1103/PhysRevMaterials.1.064601

Simulation of Charge Transport in Organic Semiconductors: A Time-Dependent Multiscale Method Based on Non-Equilibrium Green's Functions

Authors: Susanne Leitherer, Christof M. Jäger, Andreas Krause, Marcus Halik, Tim Clark, Michael Thoss

Abstract: In weakly interacting organic semiconductors, static and dynamic disorder often have an important impact on transport properties. Describing charge transport in these systems requires an approach that correctly takes structural and electronic fluctuations into account. Here, we present a multiscale method based on a combination of molecular dynamics simulations, electronic structure calculations, and a transport theory that uses time-dependent non-equilibrium Green's functions. We apply the methodology to investigate the charge transport in C$_{60}$-containing self-assembled monolayers (SAMs), which are used in organic field-effect transistors.

Submitted 20 May, 2017; originally announced May 2017.

Comments: 5 pages, 4 figures

Journal ref: Phys. Rev. Materials 1, 064601 (2017)

arXiv:1703.08110 [pdf, other]

Training Gaussian Mixture Models at Scale via Coresets

Authors: Mario Lucic, Matthew Faulkner, Andreas Krause, Dan Feldman

Abstract: How can we train a statistical mixture model on a massive data set? In this work we show how to construct coresets for mixtures of Gaussians. A coreset is a weighted subset of the data, which guarantees that models fitting the coreset also provide a good fit for the original data set. We show that, perhaps surprisingly, Gaussian mixtures admit coresets of size polynomial in dimension and the number of mixture components, while being independent of the data set size. Hence, one can harness computationally intensive algorithms to compute a good approximation on a significantly smaller data set. More importantly, such coresets can be efficiently constructed both in distributed and streaming settings and do not impose restrictions on the data generating process. Our results rely on a novel reduction of statistical estimation to problems in computational geometry and new combinatorial complexity results for mixtures of Gaussians. Empirical evaluation on several real-world datasets suggests that our coreset-based approach enables significant reduction in training-time with negligible approximation error.

Submitted 15 January, 2018; v1 submitted 23 March, 2017; originally announced March 2017.

arXiv:1703.06476 [pdf, other]

Practical Coreset Constructions for Machine Learning

Authors: Olivier Bachem, Mario Lucic, Andreas Krause

Abstract: We investigate coresets - succinct, small summaries of large data sets - so that solutions found on the summary are provably competitive with solutions found on the full data set. We provide an overview of the state-of-the-art in coreset construction for machine learning. In Section 2, we present both the intuition behind and a theoretically sound framework to construct coresets for general problems and apply it to $k$-means clustering. In Section 3 we summarize existing coreset construction algorithms for a variety of machine learning problems such as maximum likelihood estimation of mixture models, Bayesian non-parametric models, principal component analysis, regression and general empirical risk minimization.

Submitted 4 June, 2017; v1 submitted 19 March, 2017; originally announced March 2017.

arXiv:1703.05452 [pdf, other]

Efficient Online Learning for Optimizing Value of Information: Theory and Application to Interactive Troubleshooting

Authors: Yuxin Chen, Jean-Michel Renders, Morteza Haghir Chehreghani, Andreas Krause

Abstract: We consider the optimal value of information (VoI) problem, where the goal is to sequentially select a set of tests with a minimal cost, so that one can efficiently make the best decision based on the observed outcomes. Existing algorithms are either heuristics with no guarantees, or scale poorly (with exponential run time in terms of the number of available tests). Moreover, these methods assume a known distribution over the test outcomes, which is often not the case in practice. We propose an efficient sampling-based online learning framework to address the above issues. First, assuming the distribution over hypotheses is known, we propose a dynamic hypothesis enumeration strategy, which allows efficient information gathering with strong theoretical guarantees. We show that with a sufficient number of samples, one can identify a near-optimal decision with high probability. Second, when the parameters of the hypotheses distribution are unknown, we propose an algorithm which learns the parameters progressively via posterior sampling in an online fashion. We further establish a rigorous bound on the expected regret. We demonstrate the effectiveness of our approach on a real-world interactive troubleshooting application and show that one can efficiently make high-quality decisions with low cost.

Submitted 17 July, 2017; v1 submitted 15 March, 2017; originally announced March 2017.

Comments: 18 pages, 6 figures, to appear in the Conference on Uncertainty in Artificial Intelligence (UAI) 2017

arXiv:1703.02100 [pdf, other]

Guarantees for Greedy Maximization of Non-submodular Functions with Applications

Authors: Andrew An Bian, Joachim M. Buhmann, Andreas Krause, Sebastian Tschiatschek

Abstract: We investigate the performance of the standard Greedy algorithm for cardinality constrained maximization of non-submodular nondecreasing set functions. While there are strong theoretical guarantees on the performance of Greedy for maximizing submodular functions, there are few guarantees for non-submodular ones. However, Greedy enjoys strong empirical performance for many important non-submodular functions, e.g., the Bayesian A-optimality objective in experimental design. We prove theoretical guarantees supporting the empirical performance. Our guarantees are characterized by a combination of the (generalized) curvature $α$ and the submodularity ratio $γ$. In particular, we prove that Greedy enjoys a tight approximation guarantee of $\frac{1}{α}(1- e^{-γα})$ for cardinality constrained maximization. In addition, we bound the submodularity ratio and curvature for several important real-world objectives, including the Bayesian A-optimality objective, the determinantal function of a square submatrix and certain linear programs with combinatorial constraints. We experimentally validate our theoretical findings for both synthetic and real-world applications.

Submitted 14 May, 2019; v1 submitted 6 March, 2017; originally announced March 2017.

Comments: published at ICML 2017. First author is now known as Yatao Bian. ORCID: https://orcid.org/0000-0002-2368-4084

Showing 201–250 of 342 results for author: Krause, A