Skip to main content

Showing 1–9 of 9 results for author: Paulus, M B

.
  1. arXiv:2311.05650  [pdf, other

    math.OC cs.LG

    Learning to Configure Separators in Branch-and-Cut

    Authors: Sirui Li, Wenbin Ouyang, Max B. Paulus, Cathy Wu

    Abstract: Cutting planes are crucial in solving mixed integer linear programs (MILP) as they facilitate bound improvements on the optimal solution. Modern MILP solvers rely on a variety of separators to generate a diverse set of cutting planes by invoking the separators frequently during the solving process. This work identifies that MILP solvers can be drastically accelerated by appropriately selecting sep… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  2. arXiv:2301.09943  [pdf, other

    cs.LG math.OC

    Learning To Dive In Branch And Bound

    Authors: Max B. Paulus, Andreas Krause

    Abstract: Primal heuristics are important for solving mixed integer linear programs, because they find feasible solutions that facilitate branch and bound search. A prominent group of primal heuristics are diving heuristics. They iteratively modify and resolve linear programs to conduct a depth-first search from any node in the search tree. Existing divers rely on generic decision rules that fail to exploit… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

  3. arXiv:2209.12590  [pdf, other

    cs.LG

    Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs

    Authors: Đorđe Miladinović, Kumar Shridhar, Kushal Jain, Max B. Paulus, Joachim M. Buhmann, Mrinmaya Sachan, Carl Allen

    Abstract: In principle, applying variational autoencoders (VAEs) to sequential data offers a method for controlled sequence generation, manipulation, and structured representation learning. However, training sequence VAEs is challenging: autoregressive decoders can often explain the data without utilizing the latent space, known as posterior collapse. To mitigate this, state-of-the-art models weaken the pow… ▽ More

    Submitted 16 December, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: Accepted at NeurIPS 2022

  4. arXiv:2206.13414  [pdf, other

    cs.LG math.OC stat.ML

    Learning To Cut By Looking Ahead: Cutting Plane Selection via Imitation Learning

    Authors: Max B. Paulus, Giulia Zarpellon, Andreas Krause, Laurent Charlin, Chris J. Maddison

    Abstract: Cutting planes are essential for solving mixed-integer linear problems (MILPs), because they facilitate bound improvements on the optimal solution value. For selecting cuts, modern solvers rely on manually designed heuristics that are tuned to gauge the potential effectiveness of cuts. We show that a greedy selection rule explicitly looking ahead to select cuts that yield the best bound improvemen… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: ICML 2022

  5. arXiv:2202.08396  [pdf, other

    cs.LG cs.AI cs.LO

    Augment with Care: Contrastive Learning for Combinatorial Problems

    Authors: Haonan Duan, Pashootan Vaezipoor, Max B. Paulus, Yangjun Ruan, Chris J. Maddison

    Abstract: Supervised learning can improve the design of state-of-the-art solvers for combinatorial problems, but labelling large numbers of combinatorial instances is often impractical due to exponential worst-case complexity. Inspired by the recent success of contrastive pre-training for images, we conduct a scientific study of the effect of augmentation design on contrastive pre-training for the Boolean s… ▽ More

    Submitted 20 June, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  6. arXiv:2202.04910  [pdf, other

    cs.LG math.OC

    Instance-wise algorithm configuration with graph neural networks

    Authors: Romeo Valentin, Claudio Ferrari, Jérémy Scheurer, Andisheh Amrollahi, Chris Wendler, Max B. Paulus

    Abstract: We present our submission for the configuration task of the Machine Learning for Combinatorial Optimization (ML4CO) NeurIPS 2021 competition. The configuration task is to predict a good configuration of the open-source solver SCIP to solve a mixed integer linear program (MILP) efficiently. We pose this task as a supervised learning problem: First, we compile a large dataset of the solver performan… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 5 pages, 3 figures

  7. arXiv:2110.01515  [pdf, other

    cs.LG stat.ML

    A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning

    Authors: Iris A. M. Huijben, Wouter Kool, Max B. Paulus, Ruud J. G. van Sloun

    Abstract: The Gumbel-max trick is a method to draw a sample from a categorical distribution, given by its unnormalized (log-)probabilities. Over the past years, the machine learning community has proposed several extensions of this trick to facilitate, e.g., drawing multiple samples, sampling from structured domains, or gradient estimation for error backpropagation in neural network optimization. The goal o… ▽ More

    Submitted 8 March, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: Accepted as a survey article in IEEE TPAMI

  8. arXiv:2010.04838  [pdf, other

    stat.ML cs.LG

    Rao-Blackwellizing the Straight-Through Gumbel-Softmax Gradient Estimator

    Authors: Max B. Paulus, Chris J. Maddison, Andreas Krause

    Abstract: Gradient estimation in models with discrete latent variables is a challenging problem, because the simplest unbiased estimators tend to have high variance. To counteract this, modern estimators either introduce bias, rely on multiple function evaluations, or use learned, input-dependent baselines. Thus, there is a need for estimators that require minimal tuning, are computationally cheap, and have… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  9. arXiv:2006.08063  [pdf, other

    stat.ML cs.LG

    Gradient Estimation with Stochastic Softmax Tricks

    Authors: Max B. Paulus, Dami Choi, Daniel Tarlow, Andreas Krause, Chris J. Maddison

    Abstract: The Gumbel-Max trick is the basis of many relaxed gradient estimators. These estimators are easy to implement and low variance, but the goal of scaling them comprehensively to large combinatorial distributions is still outstanding. Working within the perturbation model framework, we introduce stochastic softmax tricks, which generalize the Gumbel-Softmax trick to combinatorial spaces. Our framewor… ▽ More

    Submitted 28 February, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020, final copy