Search | arXiv e-print repository

A New Class of Compact Formulations for Vehicle Routing Problems

Authors: Udayan Mandal, Amelia Regan, Louis Martin Rousseau, Julian Yarkony

Abstract: This paper introduces a novel compact mixed integer linear programming (MILP) formulation and a discretization discovery-based solution approach for the Vehicle Routing Problem with Time Windows (VRPTW). We aim to solve the optimization problem efficiently by constraining the linear programming (LP) solutions to use only flows corresponding to time and capacity-feasible routes that are locally ele… ▽ More This paper introduces a novel compact mixed integer linear programming (MILP) formulation and a discretization discovery-based solution approach for the Vehicle Routing Problem with Time Windows (VRPTW). We aim to solve the optimization problem efficiently by constraining the linear programming (LP) solutions to use only flows corresponding to time and capacity-feasible routes that are locally elementary (prohibiting cycles of customers localized in space). We employ a discretization discovery algorithm to refine the LP relaxation iteratively. This iterative process alternates between two steps: (1) increasing time/capacity/elementarity enforcement to increase the LP objective, albeit at the expense of increased complexity (more variables and constraints), and (2) decreasing enforcement without decreasing the LP objective to reduce complexity. This iterative approach ensures we produce an LP relaxation that closely approximates the optimal MILP objective with minimal complexity, facilitating an efficient solution via an off-the-shelf MILP solver. The effectiveness of our method is demonstrated through empirical evaluations on classical VRPTW instances. We showcase the efficiency of solving the final MILP and multiple iterations of LP relaxations, highlighting the decreased integrality gap of the final LP relaxation. We believe that our approach holds promise for addressing a wide range of routing problems within and beyond the VRPTW domain. △ Less

Submitted 29 February, 2024; originally announced March 2024.

arXiv:2304.11723 [pdf, other]

Graph Master and Local Area Routes for Efficient Column Generation for the Capacitated Vehicle Routing Problem with Time Windows

Authors: Udayan Mandal, Amelia Regan, Louis Martin Rousseau, Julian Yarkony

Abstract: In this research we consider the problem of accelerating the convergence of column generation (CG) for the weighted set cover formulation of the capacitated vehicle routing problem with time windows (CVRPTW). We adapt two new techniques, Local Area (LA) routes and Graph Master (GM) to these problems. LA-routes rely on pre-computing all lowest cost elementary sub-routes, called LA-arcs, where all c… ▽ More In this research we consider the problem of accelerating the convergence of column generation (CG) for the weighted set cover formulation of the capacitated vehicle routing problem with time windows (CVRPTW). We adapt two new techniques, Local Area (LA) routes and Graph Master (GM) to these problems. LA-routes rely on pre-computing all lowest cost elementary sub-routes, called LA-arcs, where all customers but the final customer are localized in space. LA-routes are constructed by concatenating LA-arcs where the final customer in a given LA-arc is the first customer in the subsequent LA-arc. To construct the lowest reduced cost elementary route during the pricing step of CG we apply a Decremental State Space Relaxation/time window discretization method over time, remaining demand, and customers visited; where the edges in the associated pricing graph are LA-arcs. To accelerate the convergence of CG we use an enhanced GM approach. We map each route generated during pricing to a strict total ordering of all customers, that respects the ordering of customers in the route; and somewhat preserves spatial locality. Each such strict total ordering is then mapped to a multi-graph where each node is associated with a tuple of customer, capacity remaining, and time remaining. Nodes are connected by feasible LA-arcs when the first/last customers in the LA-arc are less/greater than each intermediate customer (and each other) with respect to the total order. The multi-graph of a given route can express that route and other "related" routes; and every path from source to sink describes a feasible elementary route. Solving optimization over the restricted master problem over all multi-graphs is done efficiently by constructing the relevant nodes/edges on demand. △ Less

Submitted 23 April, 2023; originally announced April 2023.

arXiv:2209.12963 [pdf, other]

Local Area Routes and Valid Inequalities for Efficient Vehicle Routing

Authors: Udayan Mandal, Amelia Regan, Julian Yarkony

Abstract: In this research we introduce Local Area (LA) routes for improving the efficiency and tightness of column generation (CG) methods for solving vehicle routing problems (VRP). LA-routes rely on pre-computing the lowest cost elementary sub-route (called an LA-arc) for each tuple consisting of the following: (1) a (first) customer where the LA-arc begins, (2) a distant customer (from the first) where… ▽ More In this research we introduce Local Area (LA) routes for improving the efficiency and tightness of column generation (CG) methods for solving vehicle routing problems (VRP). LA-routes rely on pre-computing the lowest cost elementary sub-route (called an LA-arc) for each tuple consisting of the following: (1) a (first) customer where the LA-arc begins, (2) a distant customer (from the first) where the LA-arc ends, and (3) a set of intermediate customers near the first customer. LA-routes are constructed by concatenating LA-arcs where the final customer in a given LA-arc is the first customer in the subsequent LA-arc. A Decremental State Space Relaxation method is applied over LA-routes to construct the lowest reduced cost elementary route during the pricing step of CG. LA-route based solvers can be used to efficiently tighten the standard set cover VRP using a variant of subset row inequalities, which do not alter the structure of pricing. We incorporate LA-arcs into a novel CG stabilization scheme. Specifically each column generated during pricing is mapped to an ordered list of customers consistent with that column. An LA-arc is consistent with an ordering if the first/last customer in the arc come before/after all other customers in the LA-arc in the associated ordering respectively. Each such ordering is then mapped to a multi-graph where nodes correspond to (customer/demand) and edges correspond to LA-arcs consistent with that ordering. Hence any path from source to sink on the multi-graph is a feasible elementary route. The ordering for a column places customers spatially nearby in nearby positions on the ordering so that routes can be generated so as to permit spatially nearby customers to be visited without traveling far away first. We solve the restricted master problem over these graphs, which has special structure allowing for fast solution. △ Less

Submitted 30 January, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

Comments: The update on Jan 30 2023: Fixes many typos; Increases the clarity of the text; Adds additional experiments; added Rounded Capacity Inequalities; and changes the title. arXiv admin note: substantial text overlap with arXiv:2207.04520

arXiv:2207.04520 [pdf, other]

Local Area Routes for Vehicle Routing Problems

Authors: Udayan Mandal, Amelia Regan, Julian Yarkony

Abstract: We consider an approach for improving the efficiency of column generation (CG) methods for solving vehicle routing problems. We introduce Local Area (LA) route relaxations, an alternative/complement to the commonly used ng-route relaxations and Decremental State Space Relaxations (DSSR) inside of CG formulations. LA routes are a subset of ng-routes and a super-set of elementary routes. Normally, t… ▽ More We consider an approach for improving the efficiency of column generation (CG) methods for solving vehicle routing problems. We introduce Local Area (LA) route relaxations, an alternative/complement to the commonly used ng-route relaxations and Decremental State Space Relaxations (DSSR) inside of CG formulations. LA routes are a subset of ng-routes and a super-set of elementary routes. Normally, the pricing stage of CG must produce elementary routes, which are routes without repeated customers, using processes which can be computationally expensive. Non-elementary routes visit at least one customer more than once, creating a cycle. LA routes relax the constraint of being an elementary route in such a manner as to permit efficient pricing. LA routes are best understood in terms of ng-route relaxations. Ng-routes are routes which are permitted to have non-localized cycles in space; this means that at least one intermediate customer (called a breaker) in the cycle must consider the starting customer in the cycle to be spatially far away. LA routes are described using a set of special indexes corresponding to customers on the route ordered from the start to the end of the route. LA route relaxations further restrict the set of permitted cycles beyond that of ng-routes by additionally enforcing that the breaker must be a located at a special index where the set of special indexes is defined recursively as follows. The first special index in the route is at index 1 meaning that it is associated with the first customer in the route. The k'th special index corresponds to the first customer after the k-1'th special index, that is not considered to be a neighbor of (considered spatially far from) the customer located at the k-1'th special index. We demonstrate that LA route relaxations can significantly improve the computational speed of pricing when compared to the standard DSSR. △ Less

Submitted 10 July, 2022; originally announced July 2022.

arXiv:2202.01274 [pdf, other]

Principled Graph Management

Authors: Julian Yarkony, Amelia Regan

Abstract: Graph Generation is a recently introduced enhanced Column Generation algorithm for solving expanded Linear Programming relaxations of mixed integer linear programs without weakening the expanded relaxations which characterize these methods. To apply Graph Generation we must be able to map any given column generated during pricing to a small directed acyclic graph for which any path from source to… ▽ More Graph Generation is a recently introduced enhanced Column Generation algorithm for solving expanded Linear Programming relaxations of mixed integer linear programs without weakening the expanded relaxations which characterize these methods. To apply Graph Generation we must be able to map any given column generated during pricing to a small directed acyclic graph for which any path from source to sink describes a feasible column. This structure is easily satisfied for vehicle routing, crew scheduling and various logistics problems where pricing is a constrained shortest path problem. The construction of such graphs trades off the size/diversity of a subset of columns modeled by the graphs versus the additional computational time required to solve the problems induced by larger graphs. Graph Generation (GG) has two computational bottlenecks. The first is pricing. Pricing in GG and Column Generation (CG) is identical because of the structure of the problems solved. The second bottleneck is the restricted master problem (RMP), which is more computationally intensive in GG than in CG given the same number of columns generated. By design GG converges in fewer iterations than CG, and hence requires fewer calls to pricing. Therefore, when the computation time of GG is dominated by pricing, as opposed to solving the RMP, GG converges much faster than CG in terms of time. However GG need not converge faster than CG when the GG RMP, rather than pricing, dominates computation. In this paper we introduce Principled Graph Management (PGM), which is an algorithm to solve the GG RMP rapidly by exploiting its special structure. We demonstrate the effectiveness of PGM inside a GG solution to the classical Capacitated Vehicle Routing Problem. We demonstrate that PGM solves the GG RMP hundreds of times faster than the baseline solver and that the improvement in speed increases with problem size. △ Less

Submitted 2 February, 2022; originally announced February 2022.

Comments: arXiv admin note: text overlap with arXiv:2110.01070

arXiv:2110.01070 [pdf, other]

Graph Generation: A New Approach to Solving Expanded Linear Programming Relaxations

Authors: Julian Yarkony, Naveed Haghani, Amelia Regan

Abstract: In this article we introduce Graph Generation, an enhanced Column Generation (CG) algorithm for solving expanded linear programming relaxations of mixed integer linear programs. To apply Graph Generation, we must be able to map any given column to a small directed acyclic graph for which any path from source to sink describes a feasible column. This structure is easily satisfied for vehicle routin… ▽ More In this article we introduce Graph Generation, an enhanced Column Generation (CG) algorithm for solving expanded linear programming relaxations of mixed integer linear programs. To apply Graph Generation, we must be able to map any given column to a small directed acyclic graph for which any path from source to sink describes a feasible column. This structure is easily satisfied for vehicle routing and crew scheduling problems; and other such problems where pricing is a resource constrained shortest path problem. Such graphs are then added to the restricted master problem (RMP) when the corresponding column is generated during pricing. The use of Graph Generation does not weaken the linear programming relaxation being solved. At any given iteration of CG enhanced by Graph Generation; the technique permits the RMP to express a much wider set of columns than those generated during pricing, leading to faster convergence of CG. Graph Generation does not change the structure of the CG pricing problem. We show how the method can be applied in a general way, and then demonstrate the effectiveness of our approach on the classical Capacitated Vehicle Routing Problem. △ Less

Submitted 3 October, 2021; originally announced October 2021.

Comments: 19 pages

arXiv:2108.09233 [pdf, other]

Detour Dual Optimal Inequalities for Column Generation with Application to Routing and Location

Authors: Julian Yarkony, Naveed Haghani, Amelia Regan

Abstract: We consider the problem of accelerating column generation (CG) for logistics optimization problems using vehicle routing as an example. Without loss of generality, we focus on the Capacitated Vehicle Routing Problem (CVRP) via the addition of a new class of dual optimal inequalities (DOI) that incorporate information about detours from the vehicle routes. These inequalities extend the Smooth-DOI r… ▽ More We consider the problem of accelerating column generation (CG) for logistics optimization problems using vehicle routing as an example. Without loss of generality, we focus on the Capacitated Vehicle Routing Problem (CVRP) via the addition of a new class of dual optimal inequalities (DOI) that incorporate information about detours from the vehicle routes. These inequalities extend the Smooth-DOI recently introduced in the literature for the solution of certain classes of set-covering problems by CG. The Detour-DOI introduced in this article permit low cost swap operations between items on a given active route with items near to other items on that route to estimate (and bound) the values of the dual variables. Smooth-DOI in contrast only permit low cost swap operations between nearby items. The use of Detour-DOI permits a faster convergence of CG without weakening the linear programming relaxation. We then argue that these DOI can also be conveniently applied to single source capacitated facility location problems. These problems have been shown to be equivalent to a broad class of logistics optimization problems that include, for example telecommunication network design and production planning. The importance of develo** vastly more efficient column generation solvers cannot be overstated. Detour-DOI, which permit large numbers of columns to be expressed with a finite set of variables, contributes to this important endeavor. △ Less

Submitted 20 August, 2021; originally announced August 2021.

Comments: 26 pages, 6 figures

Report number: 2021.8.20 ACM Class: F.2

arXiv:2103.15234 [pdf, other]

Family Column Generation: A Principled Stabilized Column Generation Approach

Authors: Naveen Haghani, Julian Yarkony, Amelia Regan

Abstract: We tackle the problem of accelerating column generation (CG) approaches to set cover formulations in operations research. At each iteration of CG we generate a dual solution that approximately solves the LP over all columns consisting of a subset of columns in the nascent set. We refer to this linear program (LP) as the Family Restricted Master Problem (FRMP), which provides a tighter bound on the… ▽ More We tackle the problem of accelerating column generation (CG) approaches to set cover formulations in operations research. At each iteration of CG we generate a dual solution that approximately solves the LP over all columns consisting of a subset of columns in the nascent set. We refer to this linear program (LP) as the Family Restricted Master Problem (FRMP), which provides a tighter bound on the master problem at each iteration of CG, while preserving efficient inference. For example, in the single source capacitated facility location problem (SSCFLP) the family of a column $l$ associated with facility $f$ and customer set $N_l$ contains the set of columns associated with $f$ and the customer set that lies in the power set of $N_l$. The solution to FRMP optimization is attacked with a coordinate ascent method in the dual. The generation of direction of travel corresponds to solving the restricted master problem over columns corresponding to the reduced lowest cost column in each family given specific dual variables based on the incumbent dual, and is easily generated without resolving complex pricing problems. We apply our algorithm to the SSCFLP and demonstrate improved performance over two relevant baselines. △ Less

Submitted 28 March, 2021; originally announced March 2021.

Comments: 30 pages, 3 figures

arXiv:2103.08835 [pdf, other]

Multi-Robot Routing with Time Windows: A Column Generation Approach

Authors: Naveed Haghani, Jiaoyang Li, Sven Koenig, Gautam Kunapuli, Claudio Contardo, Amelia Regan, Julian Yarkony

Abstract: Robots performing tasks in warehouses provide the first example of wide-spread adoption of autonomous vehicles in transportation and logistics. The efficiency of these operations, which can vary widely in practice, are a key factor in the success of supply chains. In this work we consider the problem of coordinating a fleet of robots performing picking operations in a warehouse so as to maximize t… ▽ More Robots performing tasks in warehouses provide the first example of wide-spread adoption of autonomous vehicles in transportation and logistics. The efficiency of these operations, which can vary widely in practice, are a key factor in the success of supply chains. In this work we consider the problem of coordinating a fleet of robots performing picking operations in a warehouse so as to maximize the net profit achieved within a time period while respecting problem- and robot-specific constraints. We formulate the problem as a weighted set packing problem where the elements in consideration are items on the warehouse floor that can be picked up and delivered within specified time windows. We enforce the constraint that robots must not collide, that each item is picked up and delivered by at most one robot, and that the number of robots active at any time does not exceed the total number available. Since the set of routes is exponential in the size of the input, we attack optimization of the resulting integer linear program using column generation, where pricing amounts to solving an elementary resource-constrained shortest-path problem. We propose an efficient optimization scheme that avoids consideration of every increment within the time windows. We also propose a heuristic pricing algorithm that can efficiently solve the pricing subproblem. While this itself is an important problem, the insights gained from solving these problems effectively can lead to new advances in other time-widow constrained vehicle routing problems. △ Less

Submitted 15 March, 2021; originally announced March 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:2006.04856

arXiv:2006.04856 [pdf, other]

Integer Programming for Multi-Robot Planning: A Column Generation Approach

Authors: Naveed Haghani, Jiaoyang Li, Sven Koenig, Gautam Kunapuli, Claudio Contardo, Julian Yarkony

Abstract: We consider the problem of coordinating a fleet of robots in a warehouse so as to maximize the reward achieved within a time limit while respecting problem and robot specific constraints. We formulate the problem as a weighted set packing problem where elements are defined as being the space-time positions a robot can occupy and the items that can be picked up and delivered. We enforce that robots… ▽ More We consider the problem of coordinating a fleet of robots in a warehouse so as to maximize the reward achieved within a time limit while respecting problem and robot specific constraints. We formulate the problem as a weighted set packing problem where elements are defined as being the space-time positions a robot can occupy and the items that can be picked up and delivered. We enforce that robots do not collide, that each item is delivered at most once, and that the number of robots active at any time does not exceed the total number available. Since the set of robot routes is not enumerable, we attack optimization using column generation where pricing is a resource-constrained shortest-path problem. △ Less

Submitted 8 June, 2020; originally announced June 2020.

arXiv:2004.05499 [pdf, other]

Relaxed Dual Optimal Inequalities for Relaxed Columns: with Application to Vehicle Routing

Authors: Naveed Haghani, Claudio Contardo, Julian Yarkony

Abstract: We address the problem of accelerating column generation for set cover problems in which we relax the state space of the columns to do efficient pricing. We achieve this by adapting the recently introduced smooth and flexible dual optimal inequalities (DOI) for use with relaxed columns. Smooth DOI exploit the observation that similar items are nearly fungible, and hence should be associated with s… ▽ More We address the problem of accelerating column generation for set cover problems in which we relax the state space of the columns to do efficient pricing. We achieve this by adapting the recently introduced smooth and flexible dual optimal inequalities (DOI) for use with relaxed columns. Smooth DOI exploit the observation that similar items are nearly fungible, and hence should be associated with similarly valued dual variables. Flexible DOI exploit the observation that the change in cost of a column induced by removing an item can be bounded. We adapt these DOI to the problem of capacitated vehicle routing in the context of ng-route relaxations. We demonstrate significant speed ups on a benchmark data set, while provably not weakening the relaxation. △ Less

Submitted 11 April, 2020; originally announced April 2020.

arXiv:2001.02267 [pdf, other]

Smooth and flexible dual optimal inequalities

Authors: Naveed Haghani, Claudio Contardo, Julian Yarkony

Abstract: We address the problem of accelerating column generation (CG) for set-covering formulations via dual optimal inequalities (DOI). DOI use knowledge of the dual solution space to derive inequalities that might be violated by intermediate solutions to a restricted master problem, and as such are efficient at reducing the number of iterations and the oscillations of the dual variables commonly observe… ▽ More We address the problem of accelerating column generation (CG) for set-covering formulations via dual optimal inequalities (DOI). DOI use knowledge of the dual solution space to derive inequalities that might be violated by intermediate solutions to a restricted master problem, and as such are efficient at reducing the number of iterations and the oscillations of the dual variables commonly observed in column generation procedures. We study two novel classes of DOI which are referred to as Flexible DOI (F-DOI) and Smooth-DOI (S-DOI), respectively (and jointly as SF-DOI). F-DOI provide rebates for covering items more than necessary. S-DOI describe the payment of a penalty to permit the under-coverage of items in exchange for the over-inclusion of other items. Unlike other classes of DOI from the literature, the S-DOI and F-DOI rely on very little to no problem-specific knowledge, and as such have the potential to be applied to a vast number of problem domains. In particular, we illustrate the efficiency of the new inequalities by embedding them within a column generation solver for the single source capacitated facility location problem (SSCFLP). A speed-up of a factor of up to 130x can be observed as when compared to a non-stabilized variant of the same CG procedure to achieve the linear relaxation lower bound on problems with dense columns and structured assignments costs. △ Less

Submitted 7 January, 2020; originally announced January 2020.

arXiv:1912.02937 [pdf, other]

End-to-end Training of CNN-CRF via Differentiable Dual-Decomposition

Authors: Shaofei Wang, Vishnu Lokhande, Maneesh Singh, Konrad Kording, Julian Yarkony

Abstract: Modern computer vision (CV) is often based on convolutional neural networks (CNNs) that excel at hierarchical feature extraction. The previous generation of CV approaches was often based on conditional random fields (CRFs) that excel at modeling flexible higher order interactions. As their benefits are complementary they are often combined. However, these approaches generally use mean-field approx… ▽ More Modern computer vision (CV) is often based on convolutional neural networks (CNNs) that excel at hierarchical feature extraction. The previous generation of CV approaches was often based on conditional random fields (CRFs) that excel at modeling flexible higher order interactions. As their benefits are complementary they are often combined. However, these approaches generally use mean-field approximations and thus, arguably, did not directly optimize the real problem. Here we revisit dual-decomposition-based approaches to CRF optimization, an alternative to the mean-field approximation. These algorithms can efficiently and exactly solve sub-problems and directly optimize a convex upper bound of the real problem, providing optimality certificates on the way. Our approach uses a novel fixed-point iteration algorithm which enjoys dual-monotonicity, dual-differentiability and high parallelism. The whole system, CRF and CNN can thus be efficiently trained using back-propagation. We demonstrate the effectiveness of our system on semantic image segmentation, showing consistent improvement over baseline models. △ Less

Submitted 5 December, 2019; originally announced December 2019.

arXiv:1909.05460 [pdf, other]

Accelerating Column Generation via Flexible Dual Optimal Inequalities with Application to Entity Resolution

Authors: Vishnu Suresh Lokhande, Shaofei Wang, Maneesh Singh, Julian Yarkony

Abstract: In this paper, we introduce a new optimization approach to Entity Resolution. Traditional approaches tackle entity resolution with hierarchical clustering, which does not benefit from a formal optimization formulation. In contrast, we model entity resolution as correlation-clustering, which we treat as a weighted set-packing problem and write as an integer linear program (ILP). In this case source… ▽ More In this paper, we introduce a new optimization approach to Entity Resolution. Traditional approaches tackle entity resolution with hierarchical clustering, which does not benefit from a formal optimization formulation. In contrast, we model entity resolution as correlation-clustering, which we treat as a weighted set-packing problem and write as an integer linear program (ILP). In this case sources in the input data correspond to elements and entities in output data correspond to sets/clusters. We tackle optimization of weighted set packing by relaxing integrality in our ILP formulation. The set of potential sets/clusters can not be explicitly enumerated, thus motivating optimization via column generation. In addition to the novel formulation, we also introduce new dual optimal inequalities (DOI), that we call flexible dual optimal inequalities, which tightly lower-bound dual variables during optimization and accelerate column generation. We apply our formulation to entity resolution (also called de-duplication of records), and achieve state-of-the-art accuracy on two popular benchmark datasets. The project page is available at the following url, https://github.com/lokhande-vishnu/EntityResolution △ Less

Submitted 21 February, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

Comments: Accepted at AAAI20. Version update. Update the link to project page

arXiv:1902.05659 [pdf, other]

Massively Parallel Benders Decomposition for Correlation Clustering

Authors: Margret Keuper, Jovita Lukasik, Maneesh Singh, Julian Yarkony

Abstract: We tackle the problem of graph partitioning for image segmentation using correlation clustering (CC), which we treat as an integer linear program (ILP). We reformulate optimization in the ILP so as to admit efficient optimization via Benders decomposition, a classic technique from operations research. Our Benders decomposition formulation has many subproblems, each associated with a node in the CC… ▽ More We tackle the problem of graph partitioning for image segmentation using correlation clustering (CC), which we treat as an integer linear program (ILP). We reformulate optimization in the ILP so as to admit efficient optimization via Benders decomposition, a classic technique from operations research. Our Benders decomposition formulation has many subproblems, each associated with a node in the CC instance's graph, which are solved in parallel. Each Benders subproblem enforces the cycle inequalities corresponding to the negative weight edges attached to its corresponding node in the CC instance. We generate Magnanti-Wong Benders rows in addition to standard Benders rows, to accelerate optimization. Our Benders decomposition approach provides a promising new avenue to accelerate optimization for CC, and allows for massive parallelization. △ Less

Submitted 2 August, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

arXiv:1805.04958 [pdf, other]

Accelerating Message Passing for MAP with Benders Decomposition

Authors: Julian Yarkony, Shaofei Wang

Abstract: We introduce a novel mechanism to tighten the local polytope relaxation for MAP inference in Markov random fields with low state space variables. We consider a surjection of the variables to a set of hyper-variables and apply the local polytope relaxation over these hyper-variables. The state space of each individual hyper-variable is constructed to be enumerable while the vector product of pairs… ▽ More We introduce a novel mechanism to tighten the local polytope relaxation for MAP inference in Markov random fields with low state space variables. We consider a surjection of the variables to a set of hyper-variables and apply the local polytope relaxation over these hyper-variables. The state space of each individual hyper-variable is constructed to be enumerable while the vector product of pairs is not easily enumerable making message passing inference intractable. To circumvent the difficulty of enumerating the vector product of state spaces of hyper-variables we introduce a novel Benders decomposition approach. This produces an upper envelope describing the message constructed from affine functions of the individual variables that compose the hyper-variable receiving the message. The envelope is tight at the minimizers which are shared by the true message. Benders rows are constructed to be Pareto optimal and are generated using an efficient procedure targeted for binary problems. △ Less

Submitted 13 May, 2018; originally announced May 2018.

arXiv:1711.07794 [pdf, other]

Efficient Multi-Person Pose Estimation with Provable Guarantees

Authors: Shaofei Wang, Konrad Paul Kording, Julian Yarkony

Abstract: Multi-person pose estimation (MPPE) in natural images is key to the meaningful use of visual data in many fields including movement science, security, and rehabilitation. In this paper we tackle MPPE with a bottom-up approach, starting with candidate detections of body parts from a convolutional neural network (CNN) and grou** them into people. We formulate the grou** of body part detections i… ▽ More Multi-person pose estimation (MPPE) in natural images is key to the meaningful use of visual data in many fields including movement science, security, and rehabilitation. In this paper we tackle MPPE with a bottom-up approach, starting with candidate detections of body parts from a convolutional neural network (CNN) and grou** them into people. We formulate the grou** of body part detections into people as a minimum-weight set packing (MWSP) problem where the set of potential people is the power set of body part detections. We model the quality of a hypothesis of a person which is a set in the MWSP by an augmented tree-structured Markov random field where variables correspond to body-parts and their state-spaces correspond to the power set of the detections for that part. We describe a novel algorithm that combines efficiency with provable bounds on this MWSP problem. We employ an implicit column generation strategy where the pricing problem is formulated as a dynamic program. To efficiently solve this dynamic program we exploit the problem structure utilizing a nested Bender's decomposition (NBD) exact inference strategy which we speed up by recycling Bender's rows between calls to the pricing problem. We test our approach on the MPII-Multiperson dataset, showing that our approach obtains comparable results with the state-of-the-art algorithm for joint node labeling and grou** problems, and that NBD achieves considerable speed-ups relative to a naive dynamic programming approach. Typical algorithms that solve joint node labeling and grou** problems use heuristics and thus can not obtain proofs of optimality. Our approach, in contrast, proves that for over 99 percent of problem instances we find the globally optimal solution and otherwise provide upper/lower bounds. △ Less

Submitted 21 November, 2017; originally announced November 2017.

arXiv:1709.07337 [pdf, other]

Efficient Column Generation for Cell Detection and Segmentation

Authors: Chong Zhang, Shaofei Wang, Miguel A. Gonzalez-Ballester, Julian Yarkony

Abstract: We study the problem of instance segmentation in biological images with crowded and compact cells. We formulate this task as an integer program where variables correspond to cells and constraints enforce that cells do not overlap. To solve this integer program, we propose a column generation formulation where the pricing program is solved via exact optimization of very small scale integer programs… ▽ More We study the problem of instance segmentation in biological images with crowded and compact cells. We formulate this task as an integer program where variables correspond to cells and constraints enforce that cells do not overlap. To solve this integer program, we propose a column generation formulation where the pricing program is solved via exact optimization of very small scale integer programs. Column generation is tightened using odd set inequalities which fit elegantly into pricing problem optimization. Our column generation approach achieves fast stable anytime inference for our instance segmentation problems. We demonstrate on three distinct light microscopy datasets, with several hundred cells each, that our proposed algorithm rapidly achieves or exceeds state of the art accuracy. △ Less

Submitted 21 September, 2017; originally announced September 2017.

arXiv:1709.05982 [pdf, other]

Multi-Person Pose Estimation via Column Generation

Authors: Shaofei Wang, Chong Zhang, Miguel A. Gonzalez-Ballester, Alexander Ihler, Julian Yarkony

Abstract: We study the problem of multi-person pose estimation in natural images. A pose estimate describes the spatial position and identity (head, foot, knee, etc.) of every non-occluded body part of a person. Pose estimation is difficult due to issues such as deformation and variation in body configurations and occlusion of parts, while multi-person settings add complications such as an unknown number of… ▽ More We study the problem of multi-person pose estimation in natural images. A pose estimate describes the spatial position and identity (head, foot, knee, etc.) of every non-occluded body part of a person. Pose estimation is difficult due to issues such as deformation and variation in body configurations and occlusion of parts, while multi-person settings add complications such as an unknown number of people, with unknown appearance and possible interactions in their poses and part locations. We give a novel integer program formulation of the multi-person pose estimation problem, in which variables correspond to assignments of parts in the image to poses in a two-tier, hierarchical way. This enables us to develop an efficient custom optimization procedure based on column generation, where columns are produced by exact optimization of very small scale integer programs. We demonstrate improved accuracy and speed for our method on the MPII multi-person pose estimation benchmark. △ Less

Submitted 18 September, 2017; originally announced September 2017.

arXiv:1709.04411 [pdf, other]

Exploiting skeletal structure in computer vision annotation with Benders decomposition

Authors: Shaofei Wang, Konrad Kording, Julian Yarkony

Abstract: Many annotation problems in computer vision can be phrased as integer linear programs (ILPs). The use of standard industrial solvers does not to exploit the underlying structure of such problems eg, the skeleton in pose estimation. The leveraging of the underlying structure in conjunction with industrial solvers promises increases in both speed and accuracy. Such structure can be exploited using B… ▽ More Many annotation problems in computer vision can be phrased as integer linear programs (ILPs). The use of standard industrial solvers does not to exploit the underlying structure of such problems eg, the skeleton in pose estimation. The leveraging of the underlying structure in conjunction with industrial solvers promises increases in both speed and accuracy. Such structure can be exploited using Bender's decomposition, a technique from operations research, that solves complex ILPs or mixed integer linear programs by decomposing them into sub-problems that communicate via a master problem. The intuition is that conditioned on a small subset of the variables the solution to the remaining variables can be computed easily by taking advantage of properties of the ILP constraint matrix such as block structure. In this paper we apply Benders decomposition to a typical problem in computer vision where we have many sub-ILPs (eg, partitioning of detections, body-parts) coupled to a master ILP (eg, constructing skeletons). Dividing inference problems into a master problem and sub-problems motivates the development of a plethora of novel models, and inference approaches for the field of computer vision. △ Less

Submitted 13 September, 2017; originally announced September 2017.

arXiv:1612.00437 [pdf, other]

Efficient Pose and Cell Segmentation using Column Generation

Authors: Shaofei Wang, Chong Zhang, Miguel A. Gonzalez-Ballester, Julian Yarkony

Abstract: We study the problems of multi-person pose segmentation in natural images and instance segmentation in biological images with crowded cells. We formulate these distinct tasks as integer programs where variables correspond to poses/cells. To optimize, we propose a generic relaxation scheme for solving these combinatorial problems using a column generation formulation where the program for generatin… ▽ More We study the problems of multi-person pose segmentation in natural images and instance segmentation in biological images with crowded cells. We formulate these distinct tasks as integer programs where variables correspond to poses/cells. To optimize, we propose a generic relaxation scheme for solving these combinatorial problems using a column generation formulation where the program for generating a column is solved via exact optimization of very small scale integer programs. This results in efficient exploration of the spaces of poses and cells. △ Less

Submitted 1 December, 2016; originally announced December 2016.

arXiv:1602.04409 [pdf, other]

Convex Optimization For Non-Convex Problems via Column Generation

Authors: Julian Yarkony, Kamalika Chaudhuri

Abstract: We apply column generation to approximating complex structured objects via a set of primitive structured objects under either the cross entropy or L2 loss. We use L1 regularization to encourage the use of few structured primitive objects. We attack approximation using convex optimization over an infinite number of variables each corresponding to a primitive structured object that are generated on… ▽ More We apply column generation to approximating complex structured objects via a set of primitive structured objects under either the cross entropy or L2 loss. We use L1 regularization to encourage the use of few structured primitive objects. We attack approximation using convex optimization over an infinite number of variables each corresponding to a primitive structured object that are generated on demand by easy inference in the Lagrangian dual. We apply our approach to producing low rank approximations to large 3-way tensors. △ Less

Submitted 13 February, 2016; originally announced February 2016.

arXiv:1512.02413 [pdf, other]

Tracking Objects with Higher Order Interactions using Delayed Column Generation

Authors: Shaofei Wang, Steffen Wolf, Charless Fowlkes, Julian Yarkony

Abstract: We study the problem of multi-target tracking and data association in video. We formulate this in terms of selecting a subset of high-quality tracks subject to the constraint that no pair of selected tracks is associated with a common detection (of an object). This objective is equivalent to the classic NP-hard problem of finding a maximum-weight set packing (MWSP) where tracks correspond to sets… ▽ More We study the problem of multi-target tracking and data association in video. We formulate this in terms of selecting a subset of high-quality tracks subject to the constraint that no pair of selected tracks is associated with a common detection (of an object). This objective is equivalent to the classic NP-hard problem of finding a maximum-weight set packing (MWSP) where tracks correspond to sets and is made further difficult since the number of candidate tracks grows exponentially in the number of detections. We present a relaxation of this combinatorial problem that uses a column generation formulation where the pricing problem is solved via dynamic programming to efficiently explore the space of tracks. We employ row generation to tighten the bound in such a way as to preserve efficient inference in the pricing problem. We show the practical utility of this algorithm for tracking problems in natural and biological video datasets. △ Less

Submitted 9 August, 2016; v1 submitted 8 December, 2015; originally announced December 2015.

arXiv:1511.01994 [pdf, other]

Next Generation Multicuts for Semi-Planar Graphs

Authors: Julian Yarkony

Abstract: We study the problem of multicut segmentation. We introduce modified versions of the Semi-PlanarCC based on bounding Lagrange multipliers. We apply our work to natural image segmentation. We study the problem of multicut segmentation. We introduce modified versions of the Semi-PlanarCC based on bounding Lagrange multipliers. We apply our work to natural image segmentation. △ Less

Submitted 6 November, 2015; originally announced November 2015.

arXiv:1507.02407 [pdf, other]

Planar Ultrametric Rounding for Image Segmentation

Authors: Julian Yarkony, Charless C. Fowlkes

Abstract: We study the problem of hierarchical clustering on planar graphs. We formulate this in terms of an LP relaxation of ultrametric rounding. To solve this LP efficiently we introduce a dual cutting plane scheme that uses minimum cost perfect matching as a subroutine in order to efficiently explore the space of planar partitions. We apply our algorithm to the problem of hierarchical image segmentation… ▽ More We study the problem of hierarchical clustering on planar graphs. We formulate this in terms of an LP relaxation of ultrametric rounding. To solve this LP efficiently we introduce a dual cutting plane scheme that uses minimum cost perfect matching as a subroutine in order to efficiently explore the space of planar partitions. We apply our algorithm to the problem of hierarchical image segmentation. △ Less

Submitted 9 September, 2015; v1 submitted 9 July, 2015; originally announced July 2015.

MSC Class: 68T45

arXiv:1208.0378 [pdf, other]

Fast Planar Correlation Clustering for Image Segmentation

Authors: Julian Yarkony, Alexander T. Ihler, Charless C. Fowlkes

Abstract: We describe a new optimization scheme for finding high-quality correlation clusterings in planar graphs that uses weighted perfect matching as a subroutine. Our method provides lower-bounds on the energy of the optimal correlation clustering that are typically fast to compute and tight in practice. We demonstrate our algorithm on the problem of image segmentation where this approach outperforms ex… ▽ More We describe a new optimization scheme for finding high-quality correlation clusterings in planar graphs that uses weighted perfect matching as a subroutine. Our method provides lower-bounds on the energy of the optimal correlation clustering that are typically fast to compute and tight in practice. We demonstrate our algorithm on the problem of image segmentation where this approach outperforms existing global optimization techniques in minimizing the objective and is competitive with the state of the art in producing high-quality segmentations. △ Less

Submitted 1 August, 2012; originally announced August 2012.

Comments: This is the extended version of a paper to appear at the 12th European Conference on Computer Vision (ECCV 2012)

arXiv:1202.3771 [pdf]

Tightening MRF Relaxations with Planar Subproblems

Authors: Julian Yarkony, Ragib Morshed, Alexander T. Ihler, Charless C. Fowlkes

Abstract: We describe a new technique for computing lower-bounds on the minimum energy configuration of a planar Markov Random Field (MRF). Our method successively adds large numbers of constraints and enforces consistency over binary projections of the original problem state space. These constraints are represented in terms of subproblems in a dual-decomposition framework that is optimized using subgradien… ▽ More We describe a new technique for computing lower-bounds on the minimum energy configuration of a planar Markov Random Field (MRF). Our method successively adds large numbers of constraints and enforces consistency over binary projections of the original problem state space. These constraints are represented in terms of subproblems in a dual-decomposition framework that is optimized using subgradient techniques. The complete set of constraints we consider enforces cycle consistency over the original graph. In practice we find that the method converges quickly on most problems with the addition of a few subproblems and outperforms existing methods for some interesting classes of hard potentials. △ Less

Submitted 14 February, 2012; originally announced February 2012.

Report number: UAI-P-2011-PG-770-777

arXiv:1104.1204 [pdf, other]

Planar Cycle Covering Graphs

Authors: Julian Yarkony, Alexander T. Ihler, Charless C. Fowlkes

Abstract: We describe a new variational lower-bound on the minimum energy configuration of a planar binary Markov Random Field (MRF). Our method is based on adding auxiliary nodes to every face of a planar embedding of the graph in order to capture the effect of unary potentials. A ground state of the resulting approximation can be computed efficiently by reduction to minimum-weight perfect matching. We sho… ▽ More We describe a new variational lower-bound on the minimum energy configuration of a planar binary Markov Random Field (MRF). Our method is based on adding auxiliary nodes to every face of a planar embedding of the graph in order to capture the effect of unary potentials. A ground state of the resulting approximation can be computed efficiently by reduction to minimum-weight perfect matching. We show that optimization of variational parameters achieves the same lower-bound as dual-decomposition into the set of all cycles of the original graph. We demonstrate that our variational optimization converges quickly and provides high-quality solutions to hard combinatorial problems 10-100x faster than competing algorithms that optimize the same bound. △ Less

Submitted 6 April, 2011; originally announced April 2011.

Showing 1–28 of 28 results for author: Yarkony, J