-
A New Class of Compact Formulations for Vehicle Routing Problems
Authors:
Udayan Mandal,
Amelia Regan,
Louis Martin Rousseau,
Julian Yarkony
Abstract:
This paper introduces a novel compact mixed integer linear programming (MILP) formulation and a discretization discovery-based solution approach for the Vehicle Routing Problem with Time Windows (VRPTW). We aim to solve the optimization problem efficiently by constraining the linear programming (LP) solutions to use only flows corresponding to time and capacity-feasible routes that are locally ele…
▽ More
This paper introduces a novel compact mixed integer linear programming (MILP) formulation and a discretization discovery-based solution approach for the Vehicle Routing Problem with Time Windows (VRPTW). We aim to solve the optimization problem efficiently by constraining the linear programming (LP) solutions to use only flows corresponding to time and capacity-feasible routes that are locally elementary (prohibiting cycles of customers localized in space).
We employ a discretization discovery algorithm to refine the LP relaxation iteratively. This iterative process alternates between two steps: (1) increasing time/capacity/elementarity enforcement to increase the LP objective, albeit at the expense of increased complexity (more variables and constraints), and (2) decreasing enforcement without decreasing the LP objective to reduce complexity. This iterative approach ensures we produce an LP relaxation that closely approximates the optimal MILP objective with minimal complexity, facilitating an efficient solution via an off-the-shelf MILP solver.
The effectiveness of our method is demonstrated through empirical evaluations on classical VRPTW instances. We showcase the efficiency of solving the final MILP and multiple iterations of LP relaxations, highlighting the decreased integrality gap of the final LP relaxation. We believe that our approach holds promise for addressing a wide range of routing problems within and beyond the VRPTW domain.
△ Less
Submitted 29 February, 2024;
originally announced March 2024.
-
Graph Master and Local Area Routes for Efficient Column Generation for the Capacitated Vehicle Routing Problem with Time Windows
Authors:
Udayan Mandal,
Amelia Regan,
Louis Martin Rousseau,
Julian Yarkony
Abstract:
In this research we consider the problem of accelerating the convergence of column generation (CG) for the weighted set cover formulation of the capacitated vehicle routing problem with time windows (CVRPTW). We adapt two new techniques, Local Area (LA) routes and Graph Master (GM) to these problems. LA-routes rely on pre-computing all lowest cost elementary sub-routes, called LA-arcs, where all c…
▽ More
In this research we consider the problem of accelerating the convergence of column generation (CG) for the weighted set cover formulation of the capacitated vehicle routing problem with time windows (CVRPTW). We adapt two new techniques, Local Area (LA) routes and Graph Master (GM) to these problems. LA-routes rely on pre-computing all lowest cost elementary sub-routes, called LA-arcs, where all customers but the final customer are localized in space. LA-routes are constructed by concatenating LA-arcs where the final customer in a given LA-arc is the first customer in the subsequent LA-arc. To construct the lowest reduced cost elementary route during the pricing step of CG we apply a Decremental State Space Relaxation/time window discretization method over time, remaining demand, and customers visited; where the edges in the associated pricing graph are LA-arcs. To accelerate the convergence of CG we use an enhanced GM approach. We map each route generated during pricing to a strict total ordering of all customers, that respects the ordering of customers in the route; and somewhat preserves spatial locality. Each such strict total ordering is then mapped to a multi-graph where each node is associated with a tuple of customer, capacity remaining, and time remaining. Nodes are connected by feasible LA-arcs when the first/last customers in the LA-arc are less/greater than each intermediate customer (and each other) with respect to the total order. The multi-graph of a given route can express that route and other "related" routes; and every path from source to sink describes a feasible elementary route. Solving optimization over the restricted master problem over all multi-graphs is done efficiently by constructing the relevant nodes/edges on demand.
△ Less
Submitted 23 April, 2023;
originally announced April 2023.
-
Local Area Routes and Valid Inequalities for Efficient Vehicle Routing
Authors:
Udayan Mandal,
Amelia Regan,
Julian Yarkony
Abstract:
In this research we introduce Local Area (LA) routes for improving the efficiency and tightness of column generation (CG) methods for solving vehicle routing problems (VRP). LA-routes rely on pre-computing the lowest cost elementary sub-route (called an LA-arc) for each tuple consisting of the following: (1) a (first) customer where the LA-arc begins, (2) a distant customer (from the first) where…
▽ More
In this research we introduce Local Area (LA) routes for improving the efficiency and tightness of column generation (CG) methods for solving vehicle routing problems (VRP). LA-routes rely on pre-computing the lowest cost elementary sub-route (called an LA-arc) for each tuple consisting of the following: (1) a (first) customer where the LA-arc begins, (2) a distant customer (from the first) where the LA-arc ends, and (3) a set of intermediate customers near the first customer. LA-routes are constructed by concatenating LA-arcs where the final customer in a given LA-arc is the first customer in the subsequent LA-arc. A Decremental State Space Relaxation method is applied over LA-routes to construct the lowest reduced cost elementary route during the pricing step of CG. LA-route based solvers can be used to efficiently tighten the standard set cover VRP using a variant of subset row inequalities, which do not alter the structure of pricing. We incorporate LA-arcs into a novel CG stabilization scheme. Specifically each column generated during pricing is mapped to an ordered list of customers consistent with that column. An LA-arc is consistent with an ordering if the first/last customer in the arc come before/after all other customers in the LA-arc in the associated ordering respectively. Each such ordering is then mapped to a multi-graph where nodes correspond to (customer/demand) and edges correspond to LA-arcs consistent with that ordering. Hence any path from source to sink on the multi-graph is a feasible elementary route. The ordering for a column places customers spatially nearby in nearby positions on the ordering so that routes can be generated so as to permit spatially nearby customers to be visited without traveling far away first. We solve the restricted master problem over these graphs, which has special structure allowing for fast solution.
△ Less
Submitted 30 January, 2023; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Local Area Routes for Vehicle Routing Problems
Authors:
Udayan Mandal,
Amelia Regan,
Julian Yarkony
Abstract:
We consider an approach for improving the efficiency of column generation (CG) methods for solving vehicle routing problems. We introduce Local Area (LA) route relaxations, an alternative/complement to the commonly used ng-route relaxations and Decremental State Space Relaxations (DSSR) inside of CG formulations. LA routes are a subset of ng-routes and a super-set of elementary routes. Normally, t…
▽ More
We consider an approach for improving the efficiency of column generation (CG) methods for solving vehicle routing problems. We introduce Local Area (LA) route relaxations, an alternative/complement to the commonly used ng-route relaxations and Decremental State Space Relaxations (DSSR) inside of CG formulations. LA routes are a subset of ng-routes and a super-set of elementary routes. Normally, the pricing stage of CG must produce elementary routes, which are routes without repeated customers, using processes which can be computationally expensive. Non-elementary routes visit at least one customer more than once, creating a cycle. LA routes relax the constraint of being an elementary route in such a manner as to permit efficient pricing. LA routes are best understood in terms of ng-route relaxations. Ng-routes are routes which are permitted to have non-localized cycles in space; this means that at least one intermediate customer (called a breaker) in the cycle must consider the starting customer in the cycle to be spatially far away. LA routes are described using a set of special indexes corresponding to customers on the route ordered from the start to the end of the route. LA route relaxations further restrict the set of permitted cycles beyond that of ng-routes by additionally enforcing that the breaker must be a located at a special index where the set of special indexes is defined recursively as follows. The first special index in the route is at index 1 meaning that it is associated with the first customer in the route. The k'th special index corresponds to the first customer after the k-1'th special index, that is not considered to be a neighbor of (considered spatially far from) the customer located at the k-1'th special index. We demonstrate that LA route relaxations can significantly improve the computational speed of pricing when compared to the standard DSSR.
△ Less
Submitted 10 July, 2022;
originally announced July 2022.
-
Principled Graph Management
Authors:
Julian Yarkony,
Amelia Regan
Abstract:
Graph Generation is a recently introduced enhanced Column Generation algorithm for solving expanded Linear Programming relaxations of mixed integer linear programs without weakening the expanded relaxations which characterize these methods. To apply Graph Generation we must be able to map any given column generated during pricing to a small directed acyclic graph for which any path from source to…
▽ More
Graph Generation is a recently introduced enhanced Column Generation algorithm for solving expanded Linear Programming relaxations of mixed integer linear programs without weakening the expanded relaxations which characterize these methods. To apply Graph Generation we must be able to map any given column generated during pricing to a small directed acyclic graph for which any path from source to sink describes a feasible column. This structure is easily satisfied for vehicle routing, crew scheduling and various logistics problems where pricing is a constrained shortest path problem. The construction of such graphs trades off the size/diversity of a subset of columns modeled by the graphs versus the additional computational time required to solve the problems induced by larger graphs.
Graph Generation (GG) has two computational bottlenecks. The first is pricing. Pricing in GG and Column Generation (CG) is identical because of the structure of the problems solved. The second bottleneck is the restricted master problem (RMP), which is more computationally intensive in GG than in CG given the same number of columns generated. By design GG converges in fewer iterations than CG, and hence requires fewer calls to pricing. Therefore, when the computation time of GG is dominated by pricing, as opposed to solving the RMP, GG converges much faster than CG in terms of time. However GG need not converge faster than CG when the GG RMP, rather than pricing, dominates computation.
In this paper we introduce Principled Graph Management (PGM), which is an algorithm to solve the GG RMP rapidly by exploiting its special structure. We demonstrate the effectiveness of PGM inside a GG solution to the classical Capacitated Vehicle Routing Problem. We demonstrate that PGM solves the GG RMP hundreds of times faster than the baseline solver and that the improvement in speed increases with problem size.
△ Less
Submitted 2 February, 2022;
originally announced February 2022.
-
Graph Generation: A New Approach to Solving Expanded Linear Programming Relaxations
Authors:
Julian Yarkony,
Naveed Haghani,
Amelia Regan
Abstract:
In this article we introduce Graph Generation, an enhanced Column Generation (CG) algorithm for solving expanded linear programming relaxations of mixed integer linear programs. To apply Graph Generation, we must be able to map any given column to a small directed acyclic graph for which any path from source to sink describes a feasible column. This structure is easily satisfied for vehicle routin…
▽ More
In this article we introduce Graph Generation, an enhanced Column Generation (CG) algorithm for solving expanded linear programming relaxations of mixed integer linear programs. To apply Graph Generation, we must be able to map any given column to a small directed acyclic graph for which any path from source to sink describes a feasible column. This structure is easily satisfied for vehicle routing and crew scheduling problems; and other such problems where pricing is a resource constrained shortest path problem. Such graphs are then added to the restricted master problem (RMP) when the corresponding column is generated during pricing. The use of Graph Generation does not weaken the linear programming relaxation being solved. At any given iteration of CG enhanced by Graph Generation; the technique permits the RMP to express a much wider set of columns than those generated during pricing, leading to faster convergence of CG. Graph Generation does not change the structure of the CG pricing problem. We show how the method can be applied in a general way, and then demonstrate the effectiveness of our approach on the classical Capacitated Vehicle Routing Problem.
△ Less
Submitted 3 October, 2021;
originally announced October 2021.
-
Detour Dual Optimal Inequalities for Column Generation with Application to Routing and Location
Authors:
Julian Yarkony,
Naveed Haghani,
Amelia Regan
Abstract:
We consider the problem of accelerating column generation (CG) for logistics optimization problems using vehicle routing as an example. Without loss of generality, we focus on the Capacitated Vehicle Routing Problem (CVRP) via the addition of a new class of dual optimal inequalities (DOI) that incorporate information about detours from the vehicle routes. These inequalities extend the Smooth-DOI r…
▽ More
We consider the problem of accelerating column generation (CG) for logistics optimization problems using vehicle routing as an example. Without loss of generality, we focus on the Capacitated Vehicle Routing Problem (CVRP) via the addition of a new class of dual optimal inequalities (DOI) that incorporate information about detours from the vehicle routes. These inequalities extend the Smooth-DOI recently introduced in the literature for the solution of certain classes of set-covering problems by CG. The Detour-DOI introduced in this article permit low cost swap operations between items on a given active route with items near to other items on that route to estimate (and bound) the values of the dual variables. Smooth-DOI in contrast only permit low cost swap operations between nearby items. The use of Detour-DOI permits a faster convergence of CG without weakening the linear programming relaxation. We then argue that these DOI can also be conveniently applied to single source capacitated facility location problems. These problems have been shown to be equivalent to a broad class of logistics optimization problems that include, for example telecommunication network design and production planning. The importance of develo** vastly more efficient column generation solvers cannot be overstated. Detour-DOI, which permit large numbers of columns to be expressed with a finite set of variables, contributes to this important endeavor.
△ Less
Submitted 20 August, 2021;
originally announced August 2021.
-
Family Column Generation: A Principled Stabilized Column Generation Approach
Authors:
Naveen Haghani,
Julian Yarkony,
Amelia Regan
Abstract:
We tackle the problem of accelerating column generation (CG) approaches to set cover formulations in operations research. At each iteration of CG we generate a dual solution that approximately solves the LP over all columns consisting of a subset of columns in the nascent set. We refer to this linear program (LP) as the Family Restricted Master Problem (FRMP), which provides a tighter bound on the…
▽ More
We tackle the problem of accelerating column generation (CG) approaches to set cover formulations in operations research. At each iteration of CG we generate a dual solution that approximately solves the LP over all columns consisting of a subset of columns in the nascent set. We refer to this linear program (LP) as the Family Restricted Master Problem (FRMP), which provides a tighter bound on the master problem at each iteration of CG, while preserving efficient inference. For example, in the single source capacitated facility location problem (SSCFLP) the family of a column $l$ associated with facility $f$ and customer set $N_l$ contains the set of columns associated with $f$ and the customer set that lies in the power set of $N_l$. The solution to FRMP optimization is attacked with a coordinate ascent method in the dual. The generation of direction of travel corresponds to solving the restricted master problem over columns corresponding to the reduced lowest cost column in each family given specific dual variables based on the incumbent dual, and is easily generated without resolving complex pricing problems. We apply our algorithm to the SSCFLP and demonstrate improved performance over two relevant baselines.
△ Less
Submitted 28 March, 2021;
originally announced March 2021.
-
Multi-Robot Routing with Time Windows: A Column Generation Approach
Authors:
Naveed Haghani,
Jiaoyang Li,
Sven Koenig,
Gautam Kunapuli,
Claudio Contardo,
Amelia Regan,
Julian Yarkony
Abstract:
Robots performing tasks in warehouses provide the first example of wide-spread adoption of autonomous vehicles in transportation and logistics. The efficiency of these operations, which can vary widely in practice, are a key factor in the success of supply chains. In this work we consider the problem of coordinating a fleet of robots performing picking operations in a warehouse so as to maximize t…
▽ More
Robots performing tasks in warehouses provide the first example of wide-spread adoption of autonomous vehicles in transportation and logistics. The efficiency of these operations, which can vary widely in practice, are a key factor in the success of supply chains. In this work we consider the problem of coordinating a fleet of robots performing picking operations in a warehouse so as to maximize the net profit achieved within a time period while respecting problem- and robot-specific constraints. We formulate the problem as a weighted set packing problem where the elements in consideration are items on the warehouse floor that can be picked up and delivered within specified time windows. We enforce the constraint that robots must not collide, that each item is picked up and delivered by at most one robot, and that the number of robots active at any time does not exceed the total number available. Since the set of routes is exponential in the size of the input, we attack optimization of the resulting integer linear program using column generation, where pricing amounts to solving an elementary resource-constrained shortest-path problem. We propose an efficient optimization scheme that avoids consideration of every increment within the time windows. We also propose a heuristic pricing algorithm that can efficiently solve the pricing subproblem. While this itself is an important problem, the insights gained from solving these problems effectively can lead to new advances in other time-widow constrained vehicle routing problems.
△ Less
Submitted 15 March, 2021;
originally announced March 2021.
-
Integer Programming for Multi-Robot Planning: A Column Generation Approach
Authors:
Naveed Haghani,
Jiaoyang Li,
Sven Koenig,
Gautam Kunapuli,
Claudio Contardo,
Julian Yarkony
Abstract:
We consider the problem of coordinating a fleet of robots in a warehouse so as to maximize the reward achieved within a time limit while respecting problem and robot specific constraints. We formulate the problem as a weighted set packing problem where elements are defined as being the space-time positions a robot can occupy and the items that can be picked up and delivered. We enforce that robots…
▽ More
We consider the problem of coordinating a fleet of robots in a warehouse so as to maximize the reward achieved within a time limit while respecting problem and robot specific constraints. We formulate the problem as a weighted set packing problem where elements are defined as being the space-time positions a robot can occupy and the items that can be picked up and delivered. We enforce that robots do not collide, that each item is delivered at most once, and that the number of robots active at any time does not exceed the total number available. Since the set of robot routes is not enumerable, we attack optimization using column generation where pricing is a resource-constrained shortest-path problem.
△ Less
Submitted 8 June, 2020;
originally announced June 2020.
-
Relaxed Dual Optimal Inequalities for Relaxed Columns: with Application to Vehicle Routing
Authors:
Naveed Haghani,
Claudio Contardo,
Julian Yarkony
Abstract:
We address the problem of accelerating column generation for set cover problems in which we relax the state space of the columns to do efficient pricing. We achieve this by adapting the recently introduced smooth and flexible dual optimal inequalities (DOI) for use with relaxed columns. Smooth DOI exploit the observation that similar items are nearly fungible, and hence should be associated with s…
▽ More
We address the problem of accelerating column generation for set cover problems in which we relax the state space of the columns to do efficient pricing. We achieve this by adapting the recently introduced smooth and flexible dual optimal inequalities (DOI) for use with relaxed columns. Smooth DOI exploit the observation that similar items are nearly fungible, and hence should be associated with similarly valued dual variables. Flexible DOI exploit the observation that the change in cost of a column induced by removing an item can be bounded. We adapt these DOI to the problem of capacitated vehicle routing in the context of ng-route relaxations. We demonstrate significant speed ups on a benchmark data set, while provably not weakening the relaxation.
△ Less
Submitted 11 April, 2020;
originally announced April 2020.
-
Smooth and flexible dual optimal inequalities
Authors:
Naveed Haghani,
Claudio Contardo,
Julian Yarkony
Abstract:
We address the problem of accelerating column generation (CG) for set-covering formulations via dual optimal inequalities (DOI). DOI use knowledge of the dual solution space to derive inequalities that might be violated by intermediate solutions to a restricted master problem, and as such are efficient at reducing the number of iterations and the oscillations of the dual variables commonly observe…
▽ More
We address the problem of accelerating column generation (CG) for set-covering formulations via dual optimal inequalities (DOI). DOI use knowledge of the dual solution space to derive inequalities that might be violated by intermediate solutions to a restricted master problem, and as such are efficient at reducing the number of iterations and the oscillations of the dual variables commonly observed in column generation procedures. We study two novel classes of DOI which are referred to as Flexible DOI (F-DOI) and Smooth-DOI (S-DOI), respectively (and jointly as SF-DOI). F-DOI provide rebates for covering items more than necessary. S-DOI describe the payment of a penalty to permit the under-coverage of items in exchange for the over-inclusion of other items. Unlike other classes of DOI from the literature, the S-DOI and F-DOI rely on very little to no problem-specific knowledge, and as such have the potential to be applied to a vast number of problem domains. In particular, we illustrate the efficiency of the new inequalities by embedding them within a column generation solver for the single source capacitated facility location problem (SSCFLP). A speed-up of a factor of up to 130x can be observed as when compared to a non-stabilized variant of the same CG procedure to achieve the linear relaxation lower bound on problems with dense columns and structured assignments costs.
△ Less
Submitted 7 January, 2020;
originally announced January 2020.
-
End-to-end Training of CNN-CRF via Differentiable Dual-Decomposition
Authors:
Shaofei Wang,
Vishnu Lokhande,
Maneesh Singh,
Konrad Kording,
Julian Yarkony
Abstract:
Modern computer vision (CV) is often based on convolutional neural networks (CNNs) that excel at hierarchical feature extraction. The previous generation of CV approaches was often based on conditional random fields (CRFs) that excel at modeling flexible higher order interactions. As their benefits are complementary they are often combined. However, these approaches generally use mean-field approx…
▽ More
Modern computer vision (CV) is often based on convolutional neural networks (CNNs) that excel at hierarchical feature extraction. The previous generation of CV approaches was often based on conditional random fields (CRFs) that excel at modeling flexible higher order interactions. As their benefits are complementary they are often combined. However, these approaches generally use mean-field approximations and thus, arguably, did not directly optimize the real problem. Here we revisit dual-decomposition-based approaches to CRF optimization, an alternative to the mean-field approximation. These algorithms can efficiently and exactly solve sub-problems and directly optimize a convex upper bound of the real problem, providing optimality certificates on the way. Our approach uses a novel fixed-point iteration algorithm which enjoys dual-monotonicity, dual-differentiability and high parallelism. The whole system, CRF and CNN can thus be efficiently trained using back-propagation. We demonstrate the effectiveness of our system on semantic image segmentation, showing consistent improvement over baseline models.
△ Less
Submitted 5 December, 2019;
originally announced December 2019.
-
Accelerating Column Generation via Flexible Dual Optimal Inequalities with Application to Entity Resolution
Authors:
Vishnu Suresh Lokhande,
Shaofei Wang,
Maneesh Singh,
Julian Yarkony
Abstract:
In this paper, we introduce a new optimization approach to Entity Resolution. Traditional approaches tackle entity resolution with hierarchical clustering, which does not benefit from a formal optimization formulation. In contrast, we model entity resolution as correlation-clustering, which we treat as a weighted set-packing problem and write as an integer linear program (ILP). In this case source…
▽ More
In this paper, we introduce a new optimization approach to Entity Resolution. Traditional approaches tackle entity resolution with hierarchical clustering, which does not benefit from a formal optimization formulation. In contrast, we model entity resolution as correlation-clustering, which we treat as a weighted set-packing problem and write as an integer linear program (ILP). In this case sources in the input data correspond to elements and entities in output data correspond to sets/clusters. We tackle optimization of weighted set packing by relaxing integrality in our ILP formulation. The set of potential sets/clusters can not be explicitly enumerated, thus motivating optimization via column generation. In addition to the novel formulation, we also introduce new dual optimal inequalities (DOI), that we call flexible dual optimal inequalities, which tightly lower-bound dual variables during optimization and accelerate column generation. We apply our formulation to entity resolution (also called de-duplication of records), and achieve state-of-the-art accuracy on two popular benchmark datasets. The project page is available at the following url, https://github.com/lokhande-vishnu/EntityResolution
△ Less
Submitted 21 February, 2020; v1 submitted 12 September, 2019;
originally announced September 2019.
-
Massively Parallel Benders Decomposition for Correlation Clustering
Authors:
Margret Keuper,
Jovita Lukasik,
Maneesh Singh,
Julian Yarkony
Abstract:
We tackle the problem of graph partitioning for image segmentation using correlation clustering (CC), which we treat as an integer linear program (ILP). We reformulate optimization in the ILP so as to admit efficient optimization via Benders decomposition, a classic technique from operations research. Our Benders decomposition formulation has many subproblems, each associated with a node in the CC…
▽ More
We tackle the problem of graph partitioning for image segmentation using correlation clustering (CC), which we treat as an integer linear program (ILP). We reformulate optimization in the ILP so as to admit efficient optimization via Benders decomposition, a classic technique from operations research. Our Benders decomposition formulation has many subproblems, each associated with a node in the CC instance's graph, which are solved in parallel. Each Benders subproblem enforces the cycle inequalities corresponding to the negative weight edges attached to its corresponding node in the CC instance. We generate Magnanti-Wong Benders rows in addition to standard Benders rows, to accelerate optimization. Our Benders decomposition approach provides a promising new avenue to accelerate optimization for CC, and allows for massive parallelization.
△ Less
Submitted 2 August, 2019; v1 submitted 14 February, 2019;
originally announced February 2019.
-
Accelerating Message Passing for MAP with Benders Decomposition
Authors:
Julian Yarkony,
Shaofei Wang
Abstract:
We introduce a novel mechanism to tighten the local polytope relaxation for MAP inference in Markov random fields with low state space variables. We consider a surjection of the variables to a set of hyper-variables and apply the local polytope relaxation over these hyper-variables. The state space of each individual hyper-variable is constructed to be enumerable while the vector product of pairs…
▽ More
We introduce a novel mechanism to tighten the local polytope relaxation for MAP inference in Markov random fields with low state space variables. We consider a surjection of the variables to a set of hyper-variables and apply the local polytope relaxation over these hyper-variables. The state space of each individual hyper-variable is constructed to be enumerable while the vector product of pairs is not easily enumerable making message passing inference intractable.
To circumvent the difficulty of enumerating the vector product of state spaces of hyper-variables we introduce a novel Benders decomposition approach. This produces an upper envelope describing the message constructed from affine functions of the individual variables that compose the hyper-variable receiving the message. The envelope is tight at the minimizers which are shared by the true message. Benders rows are constructed to be Pareto optimal and are generated using an efficient procedure targeted for binary problems.
△ Less
Submitted 13 May, 2018;
originally announced May 2018.
-
Efficient Multi-Person Pose Estimation with Provable Guarantees
Authors:
Shaofei Wang,
Konrad Paul Kording,
Julian Yarkony
Abstract:
Multi-person pose estimation (MPPE) in natural images is key to the meaningful use of visual data in many fields including movement science, security, and rehabilitation. In this paper we tackle MPPE with a bottom-up approach, starting with candidate detections of body parts from a convolutional neural network (CNN) and grou** them into people. We formulate the grou** of body part detections i…
▽ More
Multi-person pose estimation (MPPE) in natural images is key to the meaningful use of visual data in many fields including movement science, security, and rehabilitation. In this paper we tackle MPPE with a bottom-up approach, starting with candidate detections of body parts from a convolutional neural network (CNN) and grou** them into people. We formulate the grou** of body part detections into people as a minimum-weight set packing (MWSP) problem where the set of potential people is the power set of body part detections. We model the quality of a hypothesis of a person which is a set in the MWSP by an augmented tree-structured Markov random field where variables correspond to body-parts and their state-spaces correspond to the power set of the detections for that part.
We describe a novel algorithm that combines efficiency with provable bounds on this MWSP problem. We employ an implicit column generation strategy where the pricing problem is formulated as a dynamic program. To efficiently solve this dynamic program we exploit the problem structure utilizing a nested Bender's decomposition (NBD) exact inference strategy which we speed up by recycling Bender's rows between calls to the pricing problem.
We test our approach on the MPII-Multiperson dataset, showing that our approach obtains comparable results with the state-of-the-art algorithm for joint node labeling and grou** problems, and that NBD achieves considerable speed-ups relative to a naive dynamic programming approach. Typical algorithms that solve joint node labeling and grou** problems use heuristics and thus can not obtain proofs of optimality. Our approach, in contrast, proves that for over 99 percent of problem instances we find the globally optimal solution and otherwise provide upper/lower bounds.
△ Less
Submitted 21 November, 2017;
originally announced November 2017.
-
Efficient Column Generation for Cell Detection and Segmentation
Authors:
Chong Zhang,
Shaofei Wang,
Miguel A. Gonzalez-Ballester,
Julian Yarkony
Abstract:
We study the problem of instance segmentation in biological images with crowded and compact cells. We formulate this task as an integer program where variables correspond to cells and constraints enforce that cells do not overlap. To solve this integer program, we propose a column generation formulation where the pricing program is solved via exact optimization of very small scale integer programs…
▽ More
We study the problem of instance segmentation in biological images with crowded and compact cells. We formulate this task as an integer program where variables correspond to cells and constraints enforce that cells do not overlap. To solve this integer program, we propose a column generation formulation where the pricing program is solved via exact optimization of very small scale integer programs. Column generation is tightened using odd set inequalities which fit elegantly into pricing problem optimization. Our column generation approach achieves fast stable anytime inference for our instance segmentation problems. We demonstrate on three distinct light microscopy datasets, with several hundred cells each, that our proposed algorithm rapidly achieves or exceeds state of the art accuracy.
△ Less
Submitted 21 September, 2017;
originally announced September 2017.
-
Multi-Person Pose Estimation via Column Generation
Authors:
Shaofei Wang,
Chong Zhang,
Miguel A. Gonzalez-Ballester,
Alexander Ihler,
Julian Yarkony
Abstract:
We study the problem of multi-person pose estimation in natural images. A pose estimate describes the spatial position and identity (head, foot, knee, etc.) of every non-occluded body part of a person. Pose estimation is difficult due to issues such as deformation and variation in body configurations and occlusion of parts, while multi-person settings add complications such as an unknown number of…
▽ More
We study the problem of multi-person pose estimation in natural images. A pose estimate describes the spatial position and identity (head, foot, knee, etc.) of every non-occluded body part of a person. Pose estimation is difficult due to issues such as deformation and variation in body configurations and occlusion of parts, while multi-person settings add complications such as an unknown number of people, with unknown appearance and possible interactions in their poses and part locations. We give a novel integer program formulation of the multi-person pose estimation problem, in which variables correspond to assignments of parts in the image to poses in a two-tier, hierarchical way. This enables us to develop an efficient custom optimization procedure based on column generation, where columns are produced by exact optimization of very small scale integer programs. We demonstrate improved accuracy and speed for our method on the MPII multi-person pose estimation benchmark.
△ Less
Submitted 18 September, 2017;
originally announced September 2017.
-
Exploiting skeletal structure in computer vision annotation with Benders decomposition
Authors:
Shaofei Wang,
Konrad Kording,
Julian Yarkony
Abstract:
Many annotation problems in computer vision can be phrased as integer linear programs (ILPs). The use of standard industrial solvers does not to exploit the underlying structure of such problems eg, the skeleton in pose estimation. The leveraging of the underlying structure in conjunction with industrial solvers promises increases in both speed and accuracy. Such structure can be exploited using B…
▽ More
Many annotation problems in computer vision can be phrased as integer linear programs (ILPs). The use of standard industrial solvers does not to exploit the underlying structure of such problems eg, the skeleton in pose estimation. The leveraging of the underlying structure in conjunction with industrial solvers promises increases in both speed and accuracy. Such structure can be exploited using Bender's decomposition, a technique from operations research, that solves complex ILPs or mixed integer linear programs by decomposing them into sub-problems that communicate via a master problem. The intuition is that conditioned on a small subset of the variables the solution to the remaining variables can be computed easily by taking advantage of properties of the ILP constraint matrix such as block structure. In this paper we apply Benders decomposition to a typical problem in computer vision where we have many sub-ILPs (eg, partitioning of detections, body-parts) coupled to a master ILP (eg, constructing skeletons). Dividing inference problems into a master problem and sub-problems motivates the development of a plethora of novel models, and inference approaches for the field of computer vision.
△ Less
Submitted 13 September, 2017;
originally announced September 2017.
-
Efficient Pose and Cell Segmentation using Column Generation
Authors:
Shaofei Wang,
Chong Zhang,
Miguel A. Gonzalez-Ballester,
Julian Yarkony
Abstract:
We study the problems of multi-person pose segmentation in natural images and instance segmentation in biological images with crowded cells. We formulate these distinct tasks as integer programs where variables correspond to poses/cells. To optimize, we propose a generic relaxation scheme for solving these combinatorial problems using a column generation formulation where the program for generatin…
▽ More
We study the problems of multi-person pose segmentation in natural images and instance segmentation in biological images with crowded cells. We formulate these distinct tasks as integer programs where variables correspond to poses/cells. To optimize, we propose a generic relaxation scheme for solving these combinatorial problems using a column generation formulation where the program for generating a column is solved via exact optimization of very small scale integer programs. This results in efficient exploration of the spaces of poses and cells.
△ Less
Submitted 1 December, 2016;
originally announced December 2016.
-
Convex Optimization For Non-Convex Problems via Column Generation
Authors:
Julian Yarkony,
Kamalika Chaudhuri
Abstract:
We apply column generation to approximating complex structured objects via a set of primitive structured objects under either the cross entropy or L2 loss. We use L1 regularization to encourage the use of few structured primitive objects. We attack approximation using convex optimization over an infinite number of variables each corresponding to a primitive structured object that are generated on…
▽ More
We apply column generation to approximating complex structured objects via a set of primitive structured objects under either the cross entropy or L2 loss. We use L1 regularization to encourage the use of few structured primitive objects. We attack approximation using convex optimization over an infinite number of variables each corresponding to a primitive structured object that are generated on demand by easy inference in the Lagrangian dual. We apply our approach to producing low rank approximations to large 3-way tensors.
△ Less
Submitted 13 February, 2016;
originally announced February 2016.
-
Tracking Objects with Higher Order Interactions using Delayed Column Generation
Authors:
Shaofei Wang,
Steffen Wolf,
Charless Fowlkes,
Julian Yarkony
Abstract:
We study the problem of multi-target tracking and data association in video. We formulate this in terms of selecting a subset of high-quality tracks subject to the constraint that no pair of selected tracks is associated with a common detection (of an object). This objective is equivalent to the classic NP-hard problem of finding a maximum-weight set packing (MWSP) where tracks correspond to sets…
▽ More
We study the problem of multi-target tracking and data association in video. We formulate this in terms of selecting a subset of high-quality tracks subject to the constraint that no pair of selected tracks is associated with a common detection (of an object). This objective is equivalent to the classic NP-hard problem of finding a maximum-weight set packing (MWSP) where tracks correspond to sets and is made further difficult since the number of candidate tracks grows exponentially in the number of detections. We present a relaxation of this combinatorial problem that uses a column generation formulation where the pricing problem is solved via dynamic programming to efficiently explore the space of tracks. We employ row generation to tighten the bound in such a way as to preserve efficient inference in the pricing problem. We show the practical utility of this algorithm for tracking problems in natural and biological video datasets.
△ Less
Submitted 9 August, 2016; v1 submitted 8 December, 2015;
originally announced December 2015.
-
Next Generation Multicuts for Semi-Planar Graphs
Authors:
Julian Yarkony
Abstract:
We study the problem of multicut segmentation. We introduce modified versions of the Semi-PlanarCC based on bounding Lagrange multipliers. We apply our work to natural image segmentation.
We study the problem of multicut segmentation. We introduce modified versions of the Semi-PlanarCC based on bounding Lagrange multipliers. We apply our work to natural image segmentation.
△ Less
Submitted 6 November, 2015;
originally announced November 2015.
-
Planar Ultrametric Rounding for Image Segmentation
Authors:
Julian Yarkony,
Charless C. Fowlkes
Abstract:
We study the problem of hierarchical clustering on planar graphs. We formulate this in terms of an LP relaxation of ultrametric rounding. To solve this LP efficiently we introduce a dual cutting plane scheme that uses minimum cost perfect matching as a subroutine in order to efficiently explore the space of planar partitions. We apply our algorithm to the problem of hierarchical image segmentation…
▽ More
We study the problem of hierarchical clustering on planar graphs. We formulate this in terms of an LP relaxation of ultrametric rounding. To solve this LP efficiently we introduce a dual cutting plane scheme that uses minimum cost perfect matching as a subroutine in order to efficiently explore the space of planar partitions. We apply our algorithm to the problem of hierarchical image segmentation.
△ Less
Submitted 9 September, 2015; v1 submitted 9 July, 2015;
originally announced July 2015.
-
Fast Planar Correlation Clustering for Image Segmentation
Authors:
Julian Yarkony,
Alexander T. Ihler,
Charless C. Fowlkes
Abstract:
We describe a new optimization scheme for finding high-quality correlation clusterings in planar graphs that uses weighted perfect matching as a subroutine. Our method provides lower-bounds on the energy of the optimal correlation clustering that are typically fast to compute and tight in practice. We demonstrate our algorithm on the problem of image segmentation where this approach outperforms ex…
▽ More
We describe a new optimization scheme for finding high-quality correlation clusterings in planar graphs that uses weighted perfect matching as a subroutine. Our method provides lower-bounds on the energy of the optimal correlation clustering that are typically fast to compute and tight in practice. We demonstrate our algorithm on the problem of image segmentation where this approach outperforms existing global optimization techniques in minimizing the objective and is competitive with the state of the art in producing high-quality segmentations.
△ Less
Submitted 1 August, 2012;
originally announced August 2012.
-
Tightening MRF Relaxations with Planar Subproblems
Authors:
Julian Yarkony,
Ragib Morshed,
Alexander T. Ihler,
Charless C. Fowlkes
Abstract:
We describe a new technique for computing lower-bounds on the minimum energy configuration of a planar Markov Random Field (MRF). Our method successively adds large numbers of constraints and enforces consistency over binary projections of the original problem state space. These constraints are represented in terms of subproblems in a dual-decomposition framework that is optimized using subgradien…
▽ More
We describe a new technique for computing lower-bounds on the minimum energy configuration of a planar Markov Random Field (MRF). Our method successively adds large numbers of constraints and enforces consistency over binary projections of the original problem state space. These constraints are represented in terms of subproblems in a dual-decomposition framework that is optimized using subgradient techniques. The complete set of constraints we consider enforces cycle consistency over the original graph. In practice we find that the method converges quickly on most problems with the addition of a few subproblems and outperforms existing methods for some interesting classes of hard potentials.
△ Less
Submitted 14 February, 2012;
originally announced February 2012.
-
Planar Cycle Covering Graphs
Authors:
Julian Yarkony,
Alexander T. Ihler,
Charless C. Fowlkes
Abstract:
We describe a new variational lower-bound on the minimum energy configuration of a planar binary Markov Random Field (MRF). Our method is based on adding auxiliary nodes to every face of a planar embedding of the graph in order to capture the effect of unary potentials. A ground state of the resulting approximation can be computed efficiently by reduction to minimum-weight perfect matching. We sho…
▽ More
We describe a new variational lower-bound on the minimum energy configuration of a planar binary Markov Random Field (MRF). Our method is based on adding auxiliary nodes to every face of a planar embedding of the graph in order to capture the effect of unary potentials. A ground state of the resulting approximation can be computed efficiently by reduction to minimum-weight perfect matching. We show that optimization of variational parameters achieves the same lower-bound as dual-decomposition into the set of all cycles of the original graph. We demonstrate that our variational optimization converges quickly and provides high-quality solutions to hard combinatorial problems 10-100x faster than competing algorithms that optimize the same bound.
△ Less
Submitted 6 April, 2011;
originally announced April 2011.