Skip to main content

Showing 1–13 of 13 results for author: Wehenkel, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2111.02218  [pdf, other

    stat.ML cs.LG

    From global to local MDI variable importances for random forests and when they are Shapley values

    Authors: Antonio Sutera, Gilles Louppe, Van Anh Huynh-Thu, Louis Wehenkel, Pierre Geurts

    Abstract: Random forests have been widely used for their ability to provide so-called importance measures, which give insight at a global (per dataset) level on the relevance of input variables to predict a certain output. On the other hand, methods based on Shapley values have been introduced to refine the analysis of feature relevance in tree-based models to a local (per instance) level. In this context,… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia

  2. arXiv:2110.00301  [pdf, other

    cs.CR eess.SY

    Cyber-physical risk modeling with imperfect cyber-attackers

    Authors: Efthymios Karangelos, Louis Wehenkel

    Abstract: We model the risk posed by a malicious cyber-attacker seeking to induce grid insecurity by means of a load redistribution attack, while explicitly acknowledging that such an actor would plausibly base its decision strategy on imperfect information. More specifically, we introduce a novel formulation for the cyber-attacker's decision-making problem and analyze the distribution of decisions taken wi… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

  3. arXiv:1905.07558  [pdf, other

    stat.ML cs.LG

    Gradient tree boosting with random output projections for multi-label classification and multi-output regression

    Authors: Arnaud Joly, Louis Wehenkel, Pierre Geurts

    Abstract: In many applications of supervised learning, multiple classification or regression outputs have to be predicted jointly. We consider several extensions of gradient boosting to address such problems. We first propose a straightforward adaptation of gradient boosting exploiting multiple output regression trees as base learners. We then argue that this method is only expected to be optimal when the o… ▽ More

    Submitted 18 May, 2019; originally announced May 2019.

  4. arXiv:1801.00500  [pdf, other

    cs.CE

    Chance-Constrained Outage Scheduling using a Machine Learning Proxy

    Authors: Gal Dalal, Elad Gilboa, Shie Mannor, Louis Wehenkel

    Abstract: Outage scheduling aims at defining, over a horizon of several months to years, when different components needing maintenance should be taken out of operation. Its objective is to minimize operation-cost expectation while satisfying reliability-related constraints. We propose a distributed scenario-based chance-constrained optimization formulation for this problem. To tackle tractability issues ari… ▽ More

    Submitted 1 January, 2018; originally announced January 2018.

  5. arXiv:1709.01177  [pdf, other

    stat.ML cs.LG

    Random Subspace with Trees for Feature Selection Under Memory Constraints

    Authors: Antonio Sutera, Célia Châtel, Gilles Louppe, Louis Wehenkel, Pierre Geurts

    Abstract: Dealing with datasets of very high dimension is a major challenge in machine learning. In this paper, we consider the problem of feature selection in applications where the memory is not large enough to contain all features. In this setting, we propose a novel tree-based feature selection approach that builds a sequence of randomized trees on small subsamples of variables mixing both variables alr… ▽ More

    Submitted 6 September, 2017; v1 submitted 4 September, 2017; originally announced September 2017.

  6. arXiv:1611.10215  [pdf, other

    cs.LG cs.AI

    Unit Commitment using Nearest Neighbor as a Short-Term Proxy

    Authors: Gal Dalal, Elad Gilboa, Shie Mannor, Louis Wehenkel

    Abstract: We devise the Unit Commitment Nearest Neighbor (UCNN) algorithm to be used as a proxy for quickly approximating outcomes of short-term decisions, to make tractable hierarchical long-term assessment and planning for large power systems. Experimental results on updated versions of IEEE-RTS79 and IEEE-RTS96 show high accuracy measured on operational cost, achieved in runtimes that are lower in severa… ▽ More

    Submitted 28 February, 2018; v1 submitted 30 November, 2016; originally announced November 2016.

  7. arXiv:1605.03848  [pdf, other

    stat.ML cs.LG

    Context-dependent feature analysis with random forests

    Authors: Antonio Sutera, Gilles Louppe, Vân Anh Huynh-Thu, Louis Wehenkel, Pierre Geurts

    Abstract: In many cases, feature selection is often more complicated than identifying a single subset of input variables that would together explain the output. There may be interactions that depend on contextual information, i.e., variables that reveal to be relevant only in some specific circumstances. In this setting, the contribution of this paper is to extend the random forest variable importances fram… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.

    Comments: Accepted for presentation at UAI 2016

  8. arXiv:1404.6074  [pdf, other

    cs.LG stat.ML

    Classifying pairs with trees for supervised biological network inference

    Authors: Marie Schrynemackers, Louis Wehenkel, M. Madan Babu, Pierre Geurts

    Abstract: Networks are ubiquitous in biology and computational approaches have been largely investigated for their inference. In particular, supervised machine learning methods can be used to complete a partially known network by integrating various measurements. Two main supervised frameworks have been proposed: the local approach, which trains a separate model for each network node, and the global approac… ▽ More

    Submitted 24 April, 2014; originally announced April 2014.

    Comments: 22 pages

  9. Random forests with random projections of the output space for high dimensional multi-label classification

    Authors: Arnaud Joly, Pierre Geurts, Louis Wehenkel

    Abstract: We adapt the idea of random projections applied to the output space, so as to enhance tree-based ensemble methods in the context of multi-label classification. We show how learning time complexity can be reduced without affecting computational complexity and accuracy of predictions. We also show that random output space projections may be used in order to reach different bias-variance tradeoffs, o… ▽ More

    Submitted 29 September, 2014; v1 submitted 14 April, 2014; originally announced April 2014.

    Journal ref: Machine Learning and Knowledge Discovery in Databases, 2014, Part I, pp 607-622

  10. arXiv:1301.0553  [pdf

    cs.AI

    On the Construction of the Inclusion Boundary Neighbourhood for Markov Equivalence Classes of Bayesian Network Structures

    Authors: Vincent Auvray, Louis Wehenkel

    Abstract: The problem of learning Markov equivalence classes of Bayesian network structures may be solved by searching for the maximum of a scoring metric in a space of these classes. This paper deals with the definition and analysis of one such search space. We use a theoretically motivated neighbourhood, the inclusion boundary, and represent equivalence classes by essential graphs. We show that this searc… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-26-35

  11. arXiv:1208.4773  [pdf, ps, other

    eess.SY cs.AI cs.LG

    Optimized Look-Ahead Tree Policies: A Bridge Between Look-Ahead Tree Policies and Direct Policy Search

    Authors: Tobias Jung, Louis Wehenkel, Damien Ernst, Francis Maes

    Abstract: Direct policy search (DPS) and look-ahead tree (LT) policies are two widely used classes of techniques to produce high performance policies for sequential decision-making problems. To make DPS approaches work well, one crucial issue is to select an appropriate space of parameterized policies with respect to the targeted problem. A fundamental issue in LT approaches is that, to take good decisions,… ▽ More

    Submitted 23 August, 2012; originally announced August 2012.

    Comments: In Submission

  12. arXiv:1207.5208  [pdf, other

    cs.AI cs.LG stat.ML

    Meta-Learning of Exploration/Exploitation Strategies: The Multi-Armed Bandit Case

    Authors: Francis Maes, Damien Ernst, Louis Wehenkel

    Abstract: The exploration/exploitation (E/E) dilemma arises naturally in many subfields of Science. Multi-armed bandit problems formalize this dilemma in its canonical form. Most current research in this field focuses on generic solutions that can be applied to a wide range of problems. However, in practice, it is often the case that a form of prior information is available about the specific class of targe… ▽ More

    Submitted 22 July, 2012; originally announced July 2012.

    Comments: 16 pages, Springer Selection of papers of ICAART'12

  13. arXiv:1206.3236  [pdf

    cs.LG cs.DS stat.ML

    Learning Inclusion-Optimal Chordal Graphs

    Authors: Vincent Auvray, Louis Wehenkel

    Abstract: Chordal graphs can be used to encode dependency models that are representable by both directed acyclic and undirected graphs. This paper discusses a very simple and efficient algorithm to learn the chordal structure of a probabilistic model from data. The algorithm is a greedy hill-climbing search algorithm that uses the inclusion boundary neighborhood over chordal graphs. In the limit of a large… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-18-25