Skip to main content

Showing 1–16 of 16 results for author: Kappen, H J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2005.06364  [pdf, other

    eess.SY cs.LG

    Adaptive Smoothing Path Integral Control

    Authors: Dominik Thalmeier, Hilbert J. Kappen, Simone Totaro, Vicenç Gómez

    Abstract: In Path Integral control problems a representation of an optimally controlled dynamical system can be formally computed and serve as a guidepost to learn a parametrized policy. The Path Integral Cross-Entropy (PICE) method tries to exploit this, but is hampered by poor sample efficiency. We propose a model-free algorithm called ASPIC (Adaptive Smoothing of Path Integral Control) that applies an in… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: 23 pages, 5 figures, NeurIPS 2019 Optimization Foundations of Reinforcement Learning Workshop (OptRL 2019)

  2. arXiv:1710.09825  [pdf, other

    cond-mat.dis-nn cs.LG cs.NE stat.ML

    On the role of synaptic stochasticity in training low-precision neural networks

    Authors: Carlo Baldassi, Federica Gerace, Hilbert J. Kappen, Carlo Lucibello, Luca Saglietti, Enzo Tartaglione, Riccardo Zecchina

    Abstract: Stochasticity and limited precision of synaptic weights in neural network models are key aspects of both biological and hardware modeling of learning processes. Here we show that a neural network model with stochastic binary weights naturally gives prominence to exponentially rare dense regions of solutions with a number of desirable properties such as robustness and good generalization performanc… ▽ More

    Submitted 19 March, 2018; v1 submitted 26 October, 2017; originally announced October 2017.

    Comments: 7 pages + 14 pages of supplementary material

    Journal ref: Phys. Rev. Lett. 120, 268103 (2018)

  3. Action selection in growing state spaces: Control of Network Structure Growth

    Authors: Dominik Thalmeier, Vicenç Gómez, Hilbert J. Kappen

    Abstract: The dynamical processes taking place on a network depend on its topology. Influencing the growth process of a network therefore has important implications on such dynamical processes. We formulate the problem of influencing the growth of a network as a stochastic optimal control problem in which a structural cost function penalizes undesired topologies. We approximate this control problem with a r… ▽ More

    Submitted 27 December, 2016; v1 submitted 23 June, 2016; originally announced June 2016.

    Comments: 23 pages, 7 figures

    Journal ref: Journal of Physics A: Mathematical and Theoretical, Volume 50, Number 3, 034006, 2017

  4. Particle Smoothing for Hidden Diffusion Processes: Adaptive Path Integral Smoother

    Authors: H. -Ch. Ruiz, H. J. Kappen

    Abstract: Particle smoothing methods are used for inference of stochastic processes based on noisy observations. Typically, the estimation of the marginal posterior distribution given all observations is cumbersome and computational intensive. In this paper, we propose a simple algorithm based on path integral control theory to estimate the smoothing distribution of continuous-time diffusion processes with… ▽ More

    Submitted 6 March, 2017; v1 submitted 1 May, 2016; originally announced May 2016.

    Comments: 16 pages, 13 figures

  5. Adaptive importance sampling for control and inference

    Authors: Hilbert Johan Kappen, Hans Christian Ruiz

    Abstract: Path integral (PI) control problems are a restricted class of non-linear control problems that can be solved formally as a Feyman-Kac path integral and can be estimated using Monte Carlo sampling. In this contribution we review path integral control theory in the finite horizon case. We subsequently focus on the problem how to compute and represent control solutions. Within the PI theory, the qu… ▽ More

    Submitted 2 September, 2015; v1 submitted 7 May, 2015; originally announced May 2015.

    Comments: 23 pages, 4 figures

  6. arXiv:1502.04548  [pdf, other

    eess.SY cs.MA cs.RO

    Real-Time Stochastic Optimal Control for Multi-agent Quadrotor Systems

    Authors: Vicenç Gómez, Sep Thijssen, Andrew Symington, Stephen Hailes, Hilbert J. Kappen

    Abstract: This paper presents a novel method for controlling teams of unmanned aerial vehicles using Stochastic Optimal Control (SOC) theory. The approach consists of a centralized high-level planner that computes optimal state trajectories as velocity sequences, and a platform-specific low-level controller which ensures that these velocity sequences are met. The planning task is expressed as a centralized… ▽ More

    Submitted 12 May, 2020; v1 submitted 16 February, 2015; originally announced February 2015.

    Comments: 17 pages, 8 figures, 26th International Conference on Automated Planning and Scheduling

  7. arXiv:1406.0993  [pdf, ps, other

    eess.SY cs.RO

    Latent Kullback Leibler Control for Continuous-State Systems using Probabilistic Graphical Models

    Authors: Takamitsu Matsubara, Vicenç Gómez, Hilbert J. Kappen

    Abstract: Kullback Leibler (KL) control problems allow for efficient computation of optimal control by solving a principal eigenvector problem. However, direct applicability of such framework to continuous state-action systems is limited. In this paper, we propose to embed a KL control problem in a probabilistic graphical model where observed variables correspond to the continuous (possibly high-dimensional… ▽ More

    Submitted 27 August, 2014; v1 submitted 4 June, 2014; originally announced June 2014.

    Comments: 9 pages, 5 figures, accepted in Uncertainty in Artificial Intelligence (UAI '14)

    ACM Class: I.2.8; I.2.9; G.3

  8. arXiv:1209.5656  [pdf, ps, other

    cs.IT cs.NI

    Learning Price-Elasticity of Smart Consumers in Power Distribution Systems

    Authors: Vicenç Gómez, Michael Chertkov, Scott Backhaus, Hilbert J. Kappen

    Abstract: Demand Response is an emerging technology which will transform the power grid of tomorrow. It is revolutionary, not only because it will enable peak load shaving and will add resources to manage large distribution systems, but mainly because it will tap into an almost unexplored and extremely powerful pool of resources comprised of many small individual consumers on distribution grids. However, to… ▽ More

    Submitted 25 September, 2012; originally announced September 2012.

    Comments: 6 pages, 5 figures, IEEE SmartGridComm 2012

    ACM Class: C.2.1; G.3

  9. arXiv:1203.0652  [pdf, ps, other

    cs.SI physics.soc-ph

    A likelihood-based framework for the analysis of discussion threads

    Authors: Vicenç Gómez, Hilbert J. Kappen, Nelly Litvak, Andreas Kaltenbrunner

    Abstract: Online discussion threads are conversational cascades in the form of posted messages that can be generally found in social systems that comprise many-to-many interaction such as blogs, news aggregators or bulletin board systems. We propose a framework based on generative models of growing trees to analyse the structure and evolution of discussion threads. We consider the growth of a discussion to… ▽ More

    Submitted 3 March, 2012; originally announced March 2012.

    Comments: 31 pages, 12 figures, journal

    ACM Class: G.3; H.5.4

  10. arXiv:1109.0486  [pdf, ps, other

    stat.ME cs.LG

    The Variational Garrote

    Authors: Hilbert J. Kappen, Vicenç Gómez

    Abstract: In this paper, we present a new variational method for sparse regression using $L_0$ regularization. The variational parameters appear in the approximate model in a way that is similar to Breiman's Garrote model. We refer to this method as the variational Garrote (VG). We show that the combination of the variational approximation and $L_0$ regularization has the effect of making the problem effect… ▽ More

    Submitted 12 November, 2012; v1 submitted 2 September, 2011; originally announced September 2011.

    Comments: 26 pages, 11 figures

  11. arXiv:1011.0673  [pdf, ps, other

    physics.data-an cs.SI physics.soc-ph

    Modeling the structure and evolution of discussion cascades

    Authors: Vicenç Gómez, Hilbert J. Kappen, Andreas Kaltenbrunner

    Abstract: We analyze the structure and evolution of discussion cascades in four popular websites: Slashdot, Barrapunto, Meneame and Wikipedia. Despite the big heterogeneities between these sites, a preferential attachment (PA) model with bias to the root can capture the temporal evolution of the observed trees and many of their statistical properties, namely, probability distributions of the branching facto… ▽ More

    Submitted 15 April, 2011; v1 submitted 2 November, 2010; originally announced November 2010.

    Comments: 10 pages, 11 figures

    ACM Class: J.4; G.2.2

    Journal ref: 22nd ACM conference on hypertext and hypermedia (HT 2011)

  12. arXiv:1004.2027  [pdf, ps, other

    cs.LG cs.AI eess.SY math.OC stat.ML

    Dynamic Policy Programming

    Authors: Mohammad Gheshlaghi Azar, Vicenc Gomez, Hilbert J. Kappen

    Abstract: In this paper, we propose a novel policy iteration method, called dynamic policy programming (DPP), to estimate the optimal policy in the infinite-horizon Markov decision processes. We prove the finite-iteration and asymptotic l\infty-norm performance-loss bounds for DPP in the presence of approximation/estimation error. The bounds are expressed in terms of the l\infty-norm of the average accumula… ▽ More

    Submitted 6 September, 2011; v1 submitted 12 April, 2010; originally announced April 2010.

    Comments: Submitted to Journal of Machine Learning Research

  13. arXiv:0901.0786  [pdf, ps, other

    cs.AI

    Approximate inference on planar graphs using Loop Calculus and Belief Propagation

    Authors: V. Gómez, H. J. Kappen, M. Chertkov

    Abstract: We introduce novel results for approximate inference on planar graphical models using the loop calculus framework. The loop calculus (Chertkov and Chernyak, 2006) allows to express the exact partition function of a graphical model as a finite sum of terms that can be evaluated once the belief propagation (BP) solution is known. In general, full summation over all correction terms is intractable.… ▽ More

    Submitted 25 May, 2009; v1 submitted 7 January, 2009; originally announced January 2009.

    Comments: 23 pages, 10 figures. Submitted to Journal of Machine Learning Research. Proceedings version accepted for UAI 2009

  14. arXiv:cs/0612109  [pdf, ps, other

    cs.AI

    Truncating the loop series expansion for Belief Propagation

    Authors: Vicenc Gomez, J. M. Mooij, H. J. Kappen

    Abstract: Recently, M. Chertkov and V.Y. Chernyak derived an exact expression for the partition sum (normalization constant) corresponding to a graphical model, which is an expansion around the Belief Propagation solution. By adding correction terms to the BP free energy, one for each "generalized loop" in the factor graph, the exact partition sum is obtained. However, the usually enormous number of gener… ▽ More

    Submitted 25 July, 2007; v1 submitted 21 December, 2006; originally announced December 2006.

    Comments: 31 pages, 12 figures, submitted to Journal of Machine Learning Research

    Journal ref: The Journal of Machine Learning Research, 8(Sep):1987--2016, 2007

  15. arXiv:cond-mat/0608312  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn cs.IT

    On Cavity Approximations for Graphical Models

    Authors: T. Rizzo, B. Wemmenhove, H. J. Kappen

    Abstract: We reformulate the Cavity Approximation (CA), a class of algorithms recently introduced for improving the Bethe approximation estimates of marginals in graphical models. In our new formulation, which allows for the treatment of multivalued variables, a further generalization to factor graphs with arbitrary order of interaction factors is explicitly carried out, and a message passing algorithm th… ▽ More

    Submitted 16 January, 2007; v1 submitted 14 August, 2006; originally announced August 2006.

    Comments: Extension to factor graphs and comments on related work added

  16. Sufficient conditions for convergence of the Sum-Product Algorithm

    Authors: Joris M. Mooij, Hilbert J. Kappen

    Abstract: We derive novel conditions that guarantee convergence of the Sum-Product algorithm (also known as Loopy Belief Propagation or simply Belief Propagation) to a unique fixed point, irrespective of the initial messages. The computational complexity of the conditions is polynomial in the number of variables. In contrast with previously existing conditions, our results are directly applicable to arbit… ▽ More

    Submitted 8 May, 2007; v1 submitted 8 April, 2005; originally announced April 2005.

    Comments: 15 pages, 5 figures. Major changes and new results in this revised version. Submitted to IEEE Transactions on Information Theory

    ACM Class: I.2.3; F.2.1

    Journal ref: IEEE Transactions on Information Theory, 53(12):4422-4437 Dec. 2007