HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: subdepth
  • failed: boxedminipage
  • failed: xifthen

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: arXiv.org perpetual non-exclusive license
arXiv:2401.14317v1 [cs.DS] 25 Jan 2024

Maximizing the Minimum Eigenvalue in Constant Dimension

Adam Brown [email protected], [email protected]; supported in part by NSF CCF-2106444 and NSF CCF-1910423. Georgia Institute of Technology. Aditi Laddha [email protected]; supported in part by the Institute for Foundations of Data Science at Yale and NSF CCF- 2007443. Yale University. Mohit Singh11footnotemark: 1 Georgia Institute of Technology.
Abstract

In an instance of the minimum eigenvalue problem, we are given a collection of n𝑛nitalic_n vectors v1,,vndsubscript𝑣1subscript𝑣𝑛superscript𝑑v_{1},\ldots,v_{n}\subset{\mathbb{R}^{d}}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, and the goal is to pick a subset B[n]𝐵delimited-[]𝑛B\subseteq[n]italic_B ⊆ [ italic_n ] of given vectors to maximize the minimum eigenvalue of the matrix iBvivisubscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in B}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Often, additional combinatorial constraints such as cardinality constraint (|B|k)𝐵𝑘\left(|B|\leq k\right)( | italic_B | ≤ italic_k ) or matroid constraint (B𝐵Bitalic_B is a basis of a matroid defined on [n]delimited-[]𝑛[n][ italic_n ]) must be satisfied by the chosen set of vectors. The minimum eigenvalue problem with matroid constraints models a wide variety of problems including the Santa Clause problem, the E-design problem, and the constructive Kadison-Singer problem.

In this paper, we give a randomized algorithm that finds a set B[n]𝐵delimited-[]𝑛B\subseteq[n]italic_B ⊆ [ italic_n ] subject to any matroid constraint whose minimum eigenvalue is at least (1ϵ)1italic-ϵ(1-\epsilon)( 1 - italic_ϵ ) times the optimum, with high probability. The running time of the algorithm is O(nO(dlog(d)/ϵ2))𝑂superscript𝑛𝑂𝑑𝑑superscriptitalic-ϵ2O\left(n^{O(d\log(d)/\epsilon^{2})}\right)italic_O ( italic_n start_POSTSUPERSCRIPT italic_O ( italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ). In particular, our results give a polynomial time asymptotic scheme when the dimension of the vectors is constant. Our algorithm uses a convex programming relaxation of the problem after guessing a rescaling which allows us to apply pipage rounding and matrix Chernoff inequalities to round to a good solution. The key new component is a structural lemma which enables us to “guess” the appropriate rescaling, which could be of independent interest. Our approach generalizes the approximation guarantee to monotone, homogeneous functions and as such we can maximize det(iBvivi)1/dsuperscriptsubscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1𝑑\det(\sum_{i\in B}v_{i}v_{i}^{\top})^{1/d}roman_det ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 1 / italic_d end_POSTSUPERSCRIPT, or minimize any norm of the eigenvalues of the matrix (iBvivi)1superscriptsubscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1\left(\sum_{i\in B}v_{i}v_{i}^{\top}\right)^{-1}( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, with the same running time under some mild assumptions. As a byproduct, we also get a simple algorithm for an algorithmic version of Kadison-Singer problem.

1 Introduction

Subset selection problems with spectral objectives offer a natural model for studying problems in a variety of fields, including numerical linear algebra [AB13], graph theory [BSS09], convex geometry [SEFM15, Nik15], resource allocation [AS07, CCK09], and optimal design of experiments [AZLSW17, NST19, LZ21], all under a single umbrella.

In this work, we consider the minimum eigenvalue problem. In an instance of a minimum eigenvalue problem, we are given a collection of n𝑛nitalic_n vectors v1,,vndsubscript𝑣1subscript𝑣𝑛superscript𝑑v_{1},\ldots,v_{n}\in{\mathbb{R}^{d}}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, and the goal is to pick a subset B[n]𝐵delimited-[]𝑛B\subseteq[n]italic_B ⊆ [ italic_n ] of given vectors to maximize the minimum eigenvalue of the matrix iBvivisubscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in B}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. The selected set B𝐵Bitalic_B must satisfy additional constraints such as cardinality, partition, or more generally matroid constraints. While much of the focus in previous works [AZLSW17, NST19, LZ21] has been on cardinality constraints, in this work, we consider general matroid constraints. The generality of matroid constraints allows us to model the algorithmic version of the Kadison Singer problem [MSS15, JMS22] as well as the Santa Claus allocation problem [AS07, AFS12, Fei08, CCK09, DRZ20] as a special case of the minimum eigenvalue problem.

The “discrepancy” formulation of the Kadison-Singer problem (shown to be equivalent to the original formulation in [Wea04] and proved in [MSS15]) states that given a set of vectors v1,,vmdsubscript𝑣1subscript𝑣𝑚superscript𝑑v_{1},\ldots,v_{m}\in\mathbb{R}^{d}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT with viαdelimited-∥∥subscript𝑣𝑖𝛼\left\lVert v_{i}\right\rVert\leq\alpha∥ italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ ≤ italic_α and ivivi=Idsubscript𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topsubscript𝐼𝑑\sum_{i}v_{i}v_{i}^{\top}=I_{d}∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT = italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT, there exists a partition of [m]delimited-[]𝑚[m][ italic_m ] into two subsets S1,S2subscript𝑆1subscript𝑆2S_{1},S_{2}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, such that for every j𝑗jitalic_j, iSjvivisubscript𝑖subscript𝑆𝑗subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in S_{j}}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT spectrally approximates Id/2subscript𝐼𝑑2I_{d}/2italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT / 2 to an additive factor of O(α)𝑂𝛼O(\alpha)italic_O ( italic_α ). Algorithmically, finding such a partition is equivalent to solving an instance of the minimum eigenvalue maximization problem under partition matroid constraints (see Section 2.1).

Another classical application of the minimum eigenvalue problem arises in the area of optimal design of experiments in statistics [Puk06, AZLSW17]. The goal in the design of experiments is to select a subset of vectors S𝑆Sitalic_S from a given list of vectors {v1,,vn}subscript𝑣1subscript𝑣𝑛\{v_{1},\ldots,v_{n}\}{ italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } such that certain measures of the covariance matrix (iSvivi)1superscriptsubscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖top1\left(\sum_{i\in S}v_{i}v_{i}^{\top}\right)^{-1}( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT are small. In particular, minimizing the maximum eigenvalue of the covariance matrix, classically known as the E𝐸Eitalic_E-design problem in statistics, is exactly the minimum eigenvalue problem. While much of the previous work has focused on the case when the selected set of measurements S𝑆Sitalic_S must satisfy cardinality constraints, our work generalizes this problem to be studied under general matroid constraints.

1.1 Our Results and Contributions

In this work, we present an approximation algorithm for the minimum eigenvalue problem for all matroids. We use the randomized rounding technique of pipage rounding to give a polynomial time approximation scheme (PTAS) when the dimension is constant.

Theorem 1

For any ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0 there is an O(nO(dlog(d)/ϵ2))𝑂superscript𝑛𝑂𝑑𝑑superscriptitalic-ϵ2O\left(n^{O(d\log(d)/\epsilon^{2})}\right)italic_O ( italic_n start_POSTSUPERSCRIPT italic_O ( italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT )-time algorithm which, given a collection of vectors v1,,vndsubscript𝑣1normal-…subscript𝑣𝑛superscript𝑑v_{1},\ldots,v_{n}\in\mathbb{R}^{d}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and a matroid =([n],)delimited-[]𝑛\mathcal{M}=([n],\mathcal{I})caligraphic_M = ( [ italic_n ] , caligraphic_I ) returns a set B𝐵B\in\mathcal{I}italic_B ∈ caligraphic_I such that with probability at least 1d41superscript𝑑41-d^{-4}1 - italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT

λmin(iBvivi)(1ϵ)maxBλmin(iBvivi).subscript𝜆subscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵsubscriptsuperscript𝐵subscript𝜆subscript𝑖superscript𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top\lambda_{\min}\left(\sum_{i\in B}v_{i}v_{i}^{\top}\right)\geq(1-\epsilon)\cdot% \max_{B^{\star}\in\mathcal{I}}\;\lambda_{\min}\left(\sum_{i\in B^{\star}}v_{i}% v_{i}^{\top}\right).italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ≥ ( 1 - italic_ϵ ) ⋅ roman_max start_POSTSUBSCRIPT italic_B start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∈ caligraphic_I end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) .

Our result generalizes to give a PTAS (for constant dimension) when the objective is a general matrix function satisfying certain technical properties. In particular, this implies that a similar result as in Theorem 1 is achievable when the objective is to maximize the determinant of iBvivisubscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in B}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT or to minimize any norm of the eigenvalues of (iBvivi)1superscriptsubscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1(\sum_{i\in B}v_{i}v_{i}^{\top})^{-1}( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT.

Theorem 2

Suppose we have a collection of vectors 𝒱=(v1,,vn)d𝒱subscript𝑣1normal-…subscript𝑣𝑛superscript𝑑\mathcal{V}=(v_{1},\ldots,v_{n})\in\mathbb{R}^{d}caligraphic_V = ( italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, and a matroid =([n],)delimited-[]𝑛\mathcal{M}=([n],\mathcal{I})caligraphic_M = ( [ italic_n ] , caligraphic_I ). Let f:𝕊d+:f:\mathbb{S}_{d}^{+}:\rightarrow\mathbb{R}italic_f : blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT : → blackboard_R be a concave, monotone, and homogeneous function given with a value and first order oracle. For any ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, there is an O(nO(dlog(d)/ϵ2))𝑂superscript𝑛𝑂𝑑𝑑superscriptitalic-ϵ2O\left(n^{O(d\log(d)/\epsilon^{2})}\right)italic_O ( italic_n start_POSTSUPERSCRIPT italic_O ( italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT )-time randomized algorithm, which takes (𝒱,,f)𝒱𝑓(\mathcal{V},\mathcal{M},f)( caligraphic_V , caligraphic_M , italic_f ) as input and returns a set B𝐵B\in\mathcal{I}italic_B ∈ caligraphic_I such that with probability at least 1d41superscript𝑑41-d^{-4}1 - italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT,

f(iBvivi)(1ϵ)maxBf(iBvivi).𝑓subscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵsubscriptsuperscript𝐵𝑓subscript𝑖superscript𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖topf\left(\sum_{i\in B}v_{i}v_{i}^{\top}\right)\geq(1-\epsilon)\cdot\max_{B^{% \star}\in\mathcal{I}}\;f\left(\sum_{i\in B^{\star}}v_{i}v_{i}^{\top}\right)\,.italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ≥ ( 1 - italic_ϵ ) ⋅ roman_max start_POSTSUBSCRIPT italic_B start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∈ caligraphic_I end_POSTSUBSCRIPT italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) .

Although Theorem 2 is stated in terms of maximizing concave functions, our algorithm can also be applied to minimize monotone and homogeneous convex functions (e.g., trace(iBvivi)1tracesuperscriptsubscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1\mathrm{trace}(\sum_{i\in B}v_{i}v_{i}^{\top})^{-1}roman_trace ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT) by considering the natural convex relaxation of the function over the matroid base polytope and using the same rounding strategy.

Technical Overview.

The first natural direction is to construct a convex programming relaxation for the problem and aim to apply randomized rounding methods to it.

maxλmin(X)X=i=1nxivivix𝒫()x0subscript𝜆𝑋missing-subexpression𝑋superscriptsubscript𝑖1𝑛subscript𝑥𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topmissing-subexpression𝑥𝒫missing-subexpression𝑥0\begin{array}[]{cl}\max&\lambda_{\min}\left(X\right)\\ &X=\sum\limits_{i=1}^{n}x_{i}\cdot v_{i}v_{i}^{\top}\\ &x\in\mathcal{P}(\mathcal{M})\\ &x\geq 0\end{array}start_ARRAY start_ROW start_CELL roman_max end_CELL start_CELL italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_X ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_X = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x ∈ caligraphic_P ( caligraphic_M ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x ≥ 0 end_CELL end_ROW end_ARRAY (CP)

Here, 𝒫()𝒫\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ) denotes the matroid base polytope of \mathcal{M}caligraphic_M. Unfortunately, this direct approach faces problems as this natural relaxation has an unbounded integrality gap even in very special cases (see Appendix A.1). The main challenge is the presence of long vectors that contribute significantly towards the optimum solution. A natural way to formalize the contribution of a vector is to consider its leverage score. Indeed, if T𝑇Titalic_T denotes the optimum solution and AT=iTvivisubscript𝐴𝑇subscript𝑖𝑇subscript𝑣𝑖superscriptsubscript𝑣𝑖topA_{T}=\sum_{i\in T}v_{i}v_{i}^{\top}italic_A start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ italic_T end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, let li=viAT1visubscript𝑙𝑖superscriptsubscript𝑣𝑖topsuperscriptsubscript𝐴𝑇1subscript𝑣𝑖l_{i}=v_{i}^{\top}A_{T}^{-1}v_{i}italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT be the leverage score of visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and let S={iT:liϵ2logd}𝑆conditional-set𝑖𝑇subscript𝑙𝑖superscriptitalic-ϵ2𝑑S=\{i\in T:l_{i}\geq\frac{\epsilon^{2}}{\log d}\}italic_S = { italic_i ∈ italic_T : italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG roman_log italic_d end_ARG } be the set of vectors in the optimum solution with large leverage scores. The bound ϵ2logdsuperscriptitalic-ϵ2𝑑\frac{\epsilon^{2}}{\log d}divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG roman_log italic_d end_ARG is chosen to allow randomized rounding methods to work (see Lemma 2 for details). Using the simple fact that the sum of leverage scores of all vectors in the optimum solution is exactly d𝑑ditalic_d, it follows that |S|dlogdϵ2𝑆𝑑𝑑superscriptitalic-ϵ2|S|\leq\frac{d\log d}{\epsilon^{2}}| italic_S | ≤ divide start_ARG italic_d roman_log italic_d end_ARG start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG. Thus we could easily enumerate all such subsets S𝑆Sitalic_S in time nO(dlogd/ϵ2)superscript𝑛𝑂𝑑𝑑superscriptitalic-ϵ2n^{O(d\log d/\epsilon^{2})}italic_n start_POSTSUPERSCRIPT italic_O ( italic_d roman_log italic_d / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT. For each such guess S𝑆Sitalic_S, we include S𝑆Sitalic_S in our solution and solve the convex program. We then apply the randomized rounding method to the solution of the convex program. Unfortunately, the challenge lies in ensuring that the convex program not only selects the vectors in S𝑆Sitalic_S (this can be easily done by setting their indicator variable to one) but also avoids selecting all vectors not in T𝑇Titalic_T that have a large leverage score. The latter is crucial for the randomized rounding approach to work effectively. Unfortunately, since we did not guess T𝑇Titalic_T, we have no way to insist that we do not pick these vectors in the convex program.

To address this problem, we present a new structural lemma that enables us to compute the leverage score as given by matrix AS=iSviviTsubscript𝐴𝑆subscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖𝑇A_{S}=\sum_{i\in S}v_{i}v_{i}^{T}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT. The lemma shows that there are few vectors with large leverage scores, even when using ASsubscript𝐴𝑆A_{S}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT instead of ATsubscript𝐴𝑇A_{T}italic_A start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT. Observe that AS1AT1succeeds-or-equalssuperscriptsubscript𝐴𝑆1superscriptsubscript𝐴𝑇1A_{S}^{-1}\succeq A_{T}^{-1}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⪰ italic_A start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT and therefore, the leverage scores with respect to ASsubscript𝐴𝑆A_{S}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT are larger. Nevertheless, we still show a similar bound in the following lemma.

Lemma 1

For any set T𝑇Titalic_T and a set of vectors {vi:iT}conditional-setsubscript𝑣𝑖𝑖𝑇\{v_{i}:i\in T\}{ italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_i ∈ italic_T } in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT such that iTvivisubscript𝑖𝑇subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in T}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ italic_T end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is invertible, there exists a subset ST𝑆𝑇S\subseteq Titalic_S ⊆ italic_T such that |S|=O(dlog(d)/ϵ2)𝑆𝑂𝑑𝑑superscriptitalic-ϵ2|S|=O(d\log(d)/\epsilon^{2})| italic_S | = italic_O ( italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), AS=iSvivisubscript𝐴𝑆subscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖topA_{S}=\sum_{i\in S}v_{i}v_{i}^{\top}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is invertible, and for all iT\S𝑖normal-\𝑇𝑆i\in T\backslash Sitalic_i ∈ italic_T \ italic_S,

viAS1viϵ210log(d).superscriptsubscript𝑣𝑖topsuperscriptsubscript𝐴𝑆1subscript𝑣𝑖superscriptitalic-ϵ210𝑑v_{i}^{\top}A_{S}^{-1}v_{i}\leq\frac{\epsilon^{2}}{10\log(d)}\,.italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log ( italic_d ) end_ARG .

With the help of Lemma 1, we can now guess the set S𝑆Sitalic_S and insist that the convex program includes all these vectors in the chosen subset. More importantly, it allows us to insist that all vectors visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT not in S𝑆Sitalic_S such that viAS1vi>ϵ210log(d)superscriptsubscript𝑣𝑖topsuperscriptsubscript𝐴𝑆1subscript𝑣𝑖superscriptitalic-ϵ210𝑑v_{i}^{\top}A_{S}^{-1}v_{i}>\frac{\epsilon^{2}}{10\log(d)}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log ( italic_d ) end_ARG not be included in the chosen solution. The last step can be done since we have guessed the set S𝑆Sitalic_S. This allows us to apply the randomized rounding approach to the convex programming solution.

There are some points worth mentioning about the randomized rounding approach. When the constraint matroid is a partition matroid, randomized rounding is a natural choice: for each part, the convex programming solution can be interpreted as a probability distribution over vectors in that part. Independently, for each part, pick one of the vectors with probability given by the convex programming solution. A simple application of the matrix Chernoff bound and the fact that leverage scores are all small due to Lemma 1 gives us the desired result. Due to the simplicity of the approach for partition matroids as well as the applicability of these constraints, we first prove the result for partition matroids in Section 2. We also show the application of our result to obtain an algorithmic version of the Kadison-Singer problem [MSS15] for constant dimension. We slightly improve the run time compared to the recent work [JMS22].

Corollary 1

Suppose we are given collection of vectors 𝒰=(u1,,un)d𝒰subscript𝑢1normal-…subscript𝑢𝑛superscript𝑑\mathcal{U}=(u_{1},\ldots,u_{n})\in\mathbb{R}^{d}caligraphic_U = ( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT with ui2αsuperscriptnormsubscript𝑢𝑖2𝛼\|u_{i}\|^{2}\leq\alpha∥ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_α for any i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] and i=1nuiui=Idsuperscriptsubscript𝑖1𝑛subscript𝑢𝑖superscriptsubscript𝑢𝑖topsubscript𝐼𝑑\sum_{i=1}^{n}u_{i}u_{i}^{\top}=I_{d}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT = italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT and a constant c>0𝑐0c>0italic_c > 0 such that there exists a set T*superscript𝑇T^{*}italic_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT satisfying

(12cα)IdiT*uiui(12+cα)Id.precedes-or-equals12𝑐𝛼subscript𝐼𝑑subscript𝑖superscript𝑇subscript𝑢𝑖superscriptsubscript𝑢𝑖topprecedes-or-equals12𝑐𝛼subscript𝐼𝑑\left(\frac{1}{2}-c\sqrt{\alpha}\right)I_{d}\preceq\sum_{i\in T^{*}}u_{i}u_{i}% ^{\top}\preceq\left(\frac{1}{2}+c\sqrt{\alpha}\right)I_{d}\,.( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_c square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪯ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_c square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT .

For any ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, there exists a randomized algorithm such which given 𝒰𝒰\mathcal{U}caligraphic_U and c𝑐citalic_c as input, returns a set T𝑇Titalic_T such that

(1ϵ)(12cα)IdiTuiui(1+ϵ)(12+cα)Id,precedes-or-equals1italic-ϵ12𝑐𝛼subscript𝐼𝑑subscript𝑖𝑇subscript𝑢𝑖superscriptsubscript𝑢𝑖topprecedes-or-equals1italic-ϵ12𝑐𝛼subscript𝐼𝑑(1-\epsilon)\cdot\left(\frac{1}{2}-c\sqrt{\alpha}\right)I_{d}\preceq\sum_{i\in T% }u_{i}u_{i}^{\top}\preceq(1+\epsilon)\cdot\left(\frac{1}{2}+c\sqrt{\alpha}% \right)I_{d}\,,( 1 - italic_ϵ ) ⋅ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_c square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_T end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪯ ( 1 + italic_ϵ ) ⋅ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_c square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ,

with probability at least 1O(d4)1𝑂superscript𝑑41-O(d^{-4})1 - italic_O ( italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT ). The run time of the algorithm is O(nO(dlogd/ϵ2))𝑂superscript𝑛𝑂𝑑𝑑superscriptitalic-ϵ2O(n^{O(d\log{d}/\epsilon^{2})})italic_O ( italic_n start_POSTSUPERSCRIPT italic_O ( italic_d roman_log italic_d / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ).

For general matroids, a straightforward application of randomized rounding does not work since it will not ensure that the chosen set is an independent set in the matroid. Instead, we use pipage rounding for general matroids, which involves randomly walking in the matroid polytope to return a vertex while ensuring that the output solution has even better concentration than is given by independent randomized rounding. To show these concentration results, we build on the work of Harvey and Olver [HO14] and give lower tail bounds on the distribution obtained via pipage rounding in Lemma 4.

1.2 Related Work

The minimum eigenvalue problem with partition constraints can be interpreted as a generalization of the max-min allocation problem. In the case of cardinality constraints, it can also model problems from experimental design and spectral sparsification. We give an overview of prior work for these special cases.

Max-min allocation and Santa Claus:

In the max-min allocation problem, we are given a set [d]delimited-[]𝑑[d][ italic_d ] of agents and a set [n]delimited-[]𝑛[n][ italic_n ] of items where agent j[d]𝑗delimited-[]𝑑j\in[d]italic_j ∈ [ italic_d ] has valuation hij0subscript𝑖𝑗0h_{ij}\geq 0italic_h start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≥ 0 for item i𝑖iitalic_i. The goal is to select an assignment σ:[n][d]:𝜎delimited-[]𝑛delimited-[]𝑑\sigma:[n]\rightarrow[d]italic_σ : [ italic_n ] → [ italic_d ] which maximizes

minj[d]i:σ(i)=jhij.subscript𝑗delimited-[]𝑑subscript:𝑖𝜎𝑖𝑗subscript𝑖𝑗\min_{j\in[d]}\sum_{i:\sigma(i)=j}h_{ij}.roman_min start_POSTSUBSCRIPT italic_j ∈ [ italic_d ] end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i : italic_σ ( italic_i ) = italic_j end_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT .

This can be seen as a special case of the minimum eigenvalue problem with partition constraints.

Bansal and Sviridenko [BS06] introduced the configuration LP as a relaxation for the max-min allocation problem but showed that it has an integrality gap of Ω(n)Ω𝑛\Omega(\sqrt{n})roman_Ω ( square-root start_ARG italic_n end_ARG ) [BS06]. Asadpour and Saberi [AS07] gave a rounding scheme for the same LP, which achieves an O(nlog3n)𝑂𝑛superscript3𝑛O(\sqrt{n}\log^{3}n)italic_O ( square-root start_ARG italic_n end_ARG roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_n )-approximation. This was later improved by Chakrabarty et al. [CCK09] to an O~(nϵ)~𝑂superscript𝑛italic-ϵ\tilde{O}(n^{\epsilon})over~ start_ARG italic_O end_ARG ( italic_n start_POSTSUPERSCRIPT italic_ϵ end_POSTSUPERSCRIPT )-approximation for any ϵΩ(loglogn/logn)italic-ϵΩ𝑛𝑛\epsilon\in\Omega(\log\log n/\log n)italic_ϵ ∈ roman_Ω ( roman_log roman_log italic_n / roman_log italic_n ) by iteratively constructing new instances with smaller integrality gap.

A further special case is the Santa Claus problem where each item i𝑖iitalic_i has an intrinsic value Hi0subscript𝐻𝑖0H_{i}\geq 0italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 0 such that hij{0,Hi}subscript𝑖𝑗0subscript𝐻𝑖h_{ij}\in\{0,H_{i}\}italic_h start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∈ { 0 , italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } for all players j[d]𝑗delimited-[]𝑑j\in[d]italic_j ∈ [ italic_d ]. Here, Bansal and Sviridenko [BS06] used the configuration LP to find an O(loglogn/logloglogn)𝑂𝑛𝑛O(\log\log n/\log\log\log n)italic_O ( roman_log roman_log italic_n / roman_log roman_log roman_log italic_n )-approximation. Feige [Fei08] non-constructively showed a constant upper bound on the integrality gap of the configuration LP for the Santa Claus problem by iteratively applying the Lovász Local Lemma. The current best bound is due to Haxell and Szabó [HS23], who used new topological techniques to show that the integrality gap is at most 3.5343.5343.5343.534. Bounds on the integrality gap do not immediately lead to efficient approximation algorithms, but Davies et al. [DRZ20] recently gave an algorithm for a more general setting that can be used to achieve a (4+ϵ)4italic-ϵ(4+\epsilon)( 4 + italic_ϵ )-approximation for the Santa Claus problem.

Experimental Design (E-optimal Design):

Even with cardinality constraints (uniform matroid of rank k𝑘kitalic_k), the minimum eigenvalue problem is NP-hard [cMI09]. Allen-Zhu et al. [AZLSW17] showed that it is possible to deterministically find a (1ϵ)1italic-ϵ(1-\epsilon)( 1 - italic_ϵ )-approximation so long as kΩ(d/ϵ2)𝑘Ω𝑑superscriptitalic-ϵ2k\geq\Omega(d/\epsilon^{2})italic_k ≥ roman_Ω ( italic_d / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) by rounding the natural convex relaxation. They also conjectured that this requirement was necessary. This conjecture was confirmed in [NST19], where they showed an integrality gap instance for the convex relaxation. Recently Lau and Zhou [LZ21] have built on the regret minimization framework from [AZLSW17] to show that a modified local search algorithm with a “smoothed” objective works as long as there is a near-optimal solution with a good condition number.

Spectral Sparsification and Kadison-Singer.

The problem of rounding the natural convex programming relaxation for the minimum eigenvalue problem is closely related to spectral sparsification [BSS09] and the Kadison-Singer problem [MSS15]. In spectral sparsification [BSS09], the goal is to pick a small subset of vectors S[n]𝑆delimited-[]𝑛S\subseteq[n]italic_S ⊆ [ italic_n ] such that iSwivivisubscript𝑖𝑆subscript𝑤𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in S}w_{i}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT spectrally approximates i[n]vivisubscript𝑖delimited-[]𝑛subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in[n]}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT for some weights wisubscript𝑤𝑖w_{i}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. In the cardinality constrained minimum eigenvalue problem, rounding the convex programming solution involves finding a small set S𝑆Sitalic_S, such that iSvivisubscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in S}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT spectrally approximates i[n]xivivisubscript𝑖delimited-[]𝑛subscript𝑥𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖top\sum_{i\in[n]}x_{i}v_{i}v_{i}^{\top}∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, where the weights xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT form the solution to the convex relaxation. Indeed [AZLSW17] essentially build on this connection to obtain their results for the E𝐸Eitalic_E-design problem discussed earlier. The Kadison-Singer problem [MSS15] is closely related to the minimum eigenvalue problem under a partition matroid constraint. We utilize this connection in Corollary 1 to give an algorithmic version of the Kadison-Singer problem for constant dimensions. More generally, the Kadison-Singer problem can be reformulated as showing that the integrality gap of the natural relaxation of the minimum eigenvalue problem under partition matroid constraints is at most 1/(1ϵ)11italic-ϵ1/(1-\epsilon)1 / ( 1 - italic_ϵ ) if the length of each vector is at most O(ϵ)𝑂italic-ϵO(\epsilon)italic_O ( italic_ϵ ). We discuss this connection in Section 4.

2 The Algorithm for Partition Matroids

To highlight the main idea of our algorithm, we first prove Theorem 1 for the special case of partition matroid. Let =(E,)𝐸\mathcal{M}=(E,\mathcal{I})caligraphic_M = ( italic_E , caligraphic_I ) be a partition matroid where E=P1Pk𝐸subscript𝑃1subscript𝑃𝑘E=P_{1}\cup\cdots\cup P_{k}italic_E = italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ ⋯ ∪ italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be a disjoint union of parts with each part containing n𝑛nitalic_n elements, and we have a collection of vectors vijsubscript𝑣𝑖𝑗v_{ij}italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT for i[k]𝑖delimited-[]𝑘i\in[k]italic_i ∈ [ italic_k ] and jPi𝑗subscript𝑃𝑖j\in P_{i}italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The goal is to select an element σ(i)Pi𝜎𝑖subscript𝑃𝑖\sigma(i)\in P_{i}italic_σ ( italic_i ) ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for each i𝑖iitalic_i to maximize λmin(i=1kviσ(i)viσ(i)).subscript𝜆superscriptsubscript𝑖1𝑘subscript𝑣𝑖𝜎𝑖superscriptsubscript𝑣𝑖𝜎𝑖top\lambda_{\min}\left(\sum_{i=1}^{k}v_{i\sigma(i)}v_{i\sigma(i)}^{\top}\right).italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) .

We can construct the natural convex relaxation of this problem as follows. For each i[k]𝑖delimited-[]𝑘i\in[k]italic_i ∈ [ italic_k ] and jPi𝑗subscript𝑃𝑖j\in P_{i}italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we add a decision variable xijsubscript𝑥𝑖𝑗x_{ij}italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT which represents whether we select the vector vjsubscript𝑣𝑗v_{j}italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT from part Pisubscript𝑃𝑖P_{i}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i.e., if σ(i)=j𝜎𝑖𝑗\sigma(i)=jitalic_σ ( italic_i ) = italic_j. Then we get the convex program

maxλmin(X)X=i=1kjPixijvijvijjPixij=1,i[k]x0subscript𝜆𝑋missing-subexpression𝑋superscriptsubscript𝑖1𝑘subscript𝑗subscript𝑃𝑖subscript𝑥𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topmissing-subexpressionformulae-sequencesubscript𝑗subscript𝑃𝑖subscript𝑥𝑖𝑗1for-all𝑖delimited-[]𝑘missing-subexpression𝑥0\begin{array}[]{cl}\max&\lambda_{\min}\left(X\right)\\ &X=\sum\limits_{i=1}^{k}\sum\limits_{j\in P_{i}}x_{ij}\cdot v_{ij}v_{ij}^{\top% }\\ &\sum\limits_{j\in P_{i}}x_{ij}=1,\quad\forall i\in[k]\\ &x\geq 0\end{array}start_ARRAY start_ROW start_CELL roman_max end_CELL start_CELL italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_X ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_X = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 1 , ∀ italic_i ∈ [ italic_k ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x ≥ 0 end_CELL end_ROW end_ARRAY

The constraint jPixij=1subscript𝑗subscript𝑃𝑖subscript𝑥𝑖𝑗1\sum_{j\in P_{i}}x_{ij}=1∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 1 ensures that we have a probability distribution over the possible assignments within each part in the optimal solution.

Given an optimal solution xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT with value OPT𝑂𝑃𝑇OPTitalic_O italic_P italic_T, a natural rounding strategy is to round independently within each part. Following this rounding strategy, we get a rank 1111 random matrix Misubscript𝑀𝑖M_{i}italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for each part Pisubscript𝑃𝑖P_{i}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT with

Pr(Mi=vijvij)=xij,jPi.formulae-sequencePrsubscript𝑀𝑖subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topsubscriptsuperscript𝑥𝑖𝑗for-all𝑗subscript𝑃𝑖\operatorname{\mathrm{Pr}}(M_{i}=v_{ij}v_{ij}^{\top})=x^{\star}_{ij}\,,\quad% \forall j\in P_{i}\,.roman_Pr ( italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) = italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT , ∀ italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT .

The following matrix concentration inequality bounds the probability of failure of this rounding strategy.

Theorem 3

[Tro15, Theorem 5.1.1] Consider independent random matrices M1,,Mk𝕊d+subscript𝑀1normal-…subscript𝑀𝑘superscriptsubscript𝕊𝑑M_{1},\ldots,M_{k}\in\mathbb{S}_{d}^{+}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT. Set

μmin=λmin(𝔼[i=1kMi]).subscript𝜇subscript𝜆𝔼superscriptsubscript𝑖1𝑘subscript𝑀𝑖\mu_{\min}=\lambda_{\min}\left(\operatorname{\mathbb{E}}\left[\sum_{i=1}^{k}M_% {i}\right]\right)\,.italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( blackboard_E [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ) .

If λmax(Mi)Rsubscript𝜆subscript𝑀𝑖𝑅\lambda_{\max}(M_{i})\leq Ritalic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ italic_R for all i[k]𝑖delimited-[]𝑘i\in[k]italic_i ∈ [ italic_k ] a.s. then

Pr(λmin(i=1kMi)<(1ϵ)μmin)dexp(ϵ2μmin2R).Prsubscript𝜆superscriptsubscript𝑖1𝑘subscript𝑀𝑖1italic-ϵsubscript𝜇𝑑superscriptitalic-ϵ2subscript𝜇𝑚𝑖𝑛2𝑅\operatorname{\mathrm{Pr}}\left(\lambda_{\min}\left(\sum_{i=1}^{k}M_{i}\right)% <(1-\epsilon)\mu_{\min}\right)\leq d\cdot\exp\left(\frac{-\epsilon^{2}\mu_{min% }}{2R}\right).roman_Pr ( italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) < ( 1 - italic_ϵ ) italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ) ≤ italic_d ⋅ roman_exp ( divide start_ARG - italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_R end_ARG ) .

If we round according to the optimal solution xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT then 𝔼[i=1kMi]=i=1kjPixijvijvij𝔼superscriptsubscript𝑖1𝑘subscript𝑀𝑖superscriptsubscript𝑖1𝑘subscript𝑗subscript𝑃𝑖superscriptsubscript𝑥𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗top\operatorname{\mathbb{E}}\left[\sum_{i=1}^{k}M_{i}\right]=\sum_{i=1}^{k}\sum_{% j\in P_{i}}x_{ij}^{\star}v_{ij}v_{ij}^{\top}blackboard_E [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT.

So μmin=OPTsubscript𝜇𝑂𝑃𝑇\mu_{\min}=OPTitalic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = italic_O italic_P italic_T, and since for our particular case Misubscript𝑀𝑖M_{i}italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are rank 1111, R=maxiλmax(Mi)=maxijvij2𝑅subscript𝑖subscript𝜆subscript𝑀𝑖subscript𝑖𝑗superscriptnormsubscript𝑣𝑖𝑗2R=\max_{i}\lambda_{\max}(M_{i})=\max_{ij}\|v_{ij}\|^{2}italic_R = roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = roman_max start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∥ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. To bound the failure probability, we want Rϵ2/log(d)𝑅superscriptitalic-ϵ2𝑑R\approx\epsilon^{2}/\log(d)italic_R ≈ italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / roman_log ( italic_d ), which in turn requires that maxijvij2=O(ϵ2/log(d))subscript𝑖𝑗superscriptnormsubscript𝑣𝑖𝑗2𝑂superscriptitalic-ϵ2𝑑\max_{ij}\|v_{ij}\|^{2}=O(\epsilon^{2}/\log(d))roman_max start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∥ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_O ( italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / roman_log ( italic_d ) ). This is a very strong assumption on an instance.

The plan is to “guess” a suitable change of basis such that all the vectors in the support of our optimal solution have a small norm. This will be useful because of the following standard, but slightly more flexible, version of the preceding matrix concentration inequality.

Corollary 2

Consider independent random matrices M1,,Mk𝕊d+subscript𝑀1normal-…subscript𝑀𝑘superscriptsubscript𝕊𝑑M_{1},\ldots,M_{k}\in\mathbb{S}_{d}^{+}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT and let A𝐴Aitalic_A be an arbitrary positive definite matrix. Define μmin:=λmin(A1/2𝔼[i=1kMi]A1/2).assignsubscript𝜇subscript𝜆superscript𝐴12𝔼superscriptsubscript𝑖1𝑘subscript𝑀𝑖superscript𝐴12\mu_{\min}:=\lambda_{\min}\left(A^{-1/2}\operatorname{\mathbb{E}}\left[\sum_{i% =1}^{k}M_{i}\right]A^{-1/2}\right).italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT := italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT blackboard_E [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) . If λmax(A1/2MiA1/2)Rsubscript𝜆superscript𝐴12subscript𝑀𝑖superscript𝐴12𝑅\lambda_{\max}(A^{-1/2}M_{i}A^{-1/2})\leq Ritalic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) ≤ italic_R for all i[k]𝑖delimited-[]𝑘i\in[k]italic_i ∈ [ italic_k ] a.s. then

Pr(i=1kMi(1ϵ)μminA)dexp(ϵ2μmin2R).Prnot-succeeds-nor-equalssuperscriptsubscript𝑖1𝑘subscript𝑀𝑖1italic-ϵsubscript𝜇𝐴𝑑superscriptitalic-ϵ2subscript𝜇𝑚𝑖𝑛2𝑅\operatorname{\mathrm{Pr}}\left(\sum_{i=1}^{k}M_{i}\nsucceq(1-\epsilon)\mu_{% \min}\cdot A\right)\leq d\cdot\exp\left(\frac{-\epsilon^{2}\mu_{min}}{2R}% \right).roman_Pr ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋡ ( 1 - italic_ϵ ) italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ⋅ italic_A ) ≤ italic_d ⋅ roman_exp ( divide start_ARG - italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_R end_ARG ) .

Again, since Misubscript𝑀𝑖M_{i}italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is rank 1111 for our case, we have R=maxi[k]λmax(A1/2MiA1/2)=maxi,jvijA1vij.𝑅subscript𝑖delimited-[]𝑘subscript𝜆superscript𝐴12subscript𝑀𝑖superscript𝐴12subscript𝑖𝑗superscriptsubscript𝑣𝑖𝑗topsuperscript𝐴1subscript𝑣𝑖𝑗R=\max_{i\in[k]}\;\lambda_{\max}(A^{-1/2}M_{i}A^{-1/2})=\max_{i,j}\;v_{ij}^{% \top}A^{-1}v_{ij}\,.italic_R = roman_max start_POSTSUBSCRIPT italic_i ∈ [ italic_k ] end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) = roman_max start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT . So, to use this corollary, we first need to find a matrix A𝐴Aitalic_A such that vijA1vij=O(ϵ2/log(d))superscriptsubscript𝑣𝑖𝑗topsuperscript𝐴1subscript𝑣𝑖𝑗𝑂superscriptitalic-ϵ2𝑑v_{ij}^{\top}A^{-1}v_{ij}=O(\epsilon^{2}/\log(d))italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = italic_O ( italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / roman_log ( italic_d ) ) for all [i][k],jPiformulae-sequencedelimited-[]𝑖delimited-[]𝑘𝑗subscript𝑃𝑖[i]\in[k],j\in P_{i}[ italic_i ] ∈ [ italic_k ] , italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We will only need to consider matrices A𝐴Aitalic_A of a specific form that uses the input vectors.

Given a subset SE𝑆𝐸S\subseteq Eitalic_S ⊆ italic_E, we define AS:=(i,j)Svijvijassignsubscript𝐴𝑆subscript𝑖𝑗𝑆subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topA_{S}:=\sum_{(i,j)\in S}v_{ij}v_{ij}^{\top}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT := ∑ start_POSTSUBSCRIPT ( italic_i , italic_j ) ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, and consider the set of long vectors in the norm induced by ASsubscript𝐴𝑆A_{S}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT: L(S):={(i,j)E\S:vijAS1vij>ϵ210log(d)}.assign𝐿𝑆conditional-set𝑖𝑗\𝐸𝑆superscriptsubscript𝑣𝑖𝑗topsuperscriptsubscript𝐴𝑆1subscript𝑣𝑖𝑗superscriptitalic-ϵ210𝑑L(S):=\left\{(i,j)\in E\backslash S:v_{ij}^{\top}A_{S}^{-1}v_{ij}>\frac{% \epsilon^{2}}{10\log(d)}\right\}.italic_L ( italic_S ) := { ( italic_i , italic_j ) ∈ italic_E \ italic_S : italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT > divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log ( italic_d ) end_ARG } . For a fixed set S𝑆Sitalic_S, the following convex program ensures that S𝑆Sitalic_S is included in the solution and no “long” vectors from L(S)𝐿𝑆L(S)italic_L ( italic_S ) are included in the solution.

maxλmin(X)X=i=1kjPixijvijvijjPixij=1,i[k]xij=0,(i,j)L(S)xij=1,(i,j)Sx0subscript𝜆𝑋missing-subexpression𝑋superscriptsubscript𝑖1𝑘subscript𝑗subscript𝑃𝑖subscript𝑥𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topmissing-subexpressionformulae-sequencesubscript𝑗subscript𝑃𝑖subscript𝑥𝑖𝑗1for-all𝑖delimited-[]𝑘missing-subexpressionformulae-sequencesubscript𝑥𝑖𝑗0for-all𝑖𝑗𝐿𝑆missing-subexpressionformulae-sequencesubscript𝑥𝑖𝑗1for-all𝑖𝑗𝑆missing-subexpression𝑥0\begin{array}[]{cl}\max&\lambda_{\min}\left(X\right)\\ &X=\sum\limits_{i=1}^{k}\sum\limits_{j\in P_{i}}x_{ij}\cdot v_{ij}v_{ij}^{\top% }\\ &\sum\limits_{j\in P_{i}}x_{ij}=1,\,\,\forall i\in[k]\\ &x_{ij}=0,\,\,\forall(i,j)\in L(S)\\ &x_{ij}=1,\,\,\forall(i,j)\in S\\ &x\geq 0\end{array}start_ARRAY start_ROW start_CELL roman_max end_CELL start_CELL italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_X ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_X = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 1 , ∀ italic_i ∈ [ italic_k ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 0 , ∀ ( italic_i , italic_j ) ∈ italic_L ( italic_S ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 1 , ∀ ( italic_i , italic_j ) ∈ italic_S end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x ≥ 0 end_CELL end_ROW end_ARRAY (CP(S))

Because of the extra constraints excluding “long” vectors, we could now use the flexible matrix concentration inequalities to randomly round the optimal solution.

But, it is not clear that there is a good choice of S𝑆Sitalic_S for which the convex program CP(S) is still a relaxation of the original problem. Lemma 1, which we restate here for the reader’s convenience, shows that there exists a suitable set S𝑆Sitalic_S that is not too large.

See 1

The proof of this lemma is inspired by the local search algorithm of [MSTX19].

At first glance, it may not be apparent why a subset satisfying the conditions of Lemma 1 should exist. However, in the proof, we show that any subset of T𝑇Titalic_T that is locally optimal with respect to a local search criteria indeed satisfies the guarantees of Lemma 1.

Proof  (of Lemma 1) We consider the local search process of [MSTX19]. Starting with a set S𝑆Sitalic_S of size \ellroman_ℓ such that A=iSvivi𝐴subscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖topA=\sum_{i\in S}v_{i}v_{i}^{\top}italic_A = ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is invertible, we apply the following update rule. For any jT\S𝑗\𝑇𝑆j\in T\backslash Sitalic_j ∈ italic_T \ italic_S and iS𝑖𝑆i\in Sitalic_i ∈ italic_S, if det(A)<det(Avivi+vjvj)𝐴𝐴subscript𝑣𝑖superscriptsubscript𝑣𝑖topsubscript𝑣𝑗superscriptsubscript𝑣𝑗top\det(A)<\det(A-v_{i}v_{i}^{\top}+v_{j}v_{j}^{\top})roman_det ( italic_A ) < roman_det ( italic_A - italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ), update S={S\{i}}{j}𝑆\𝑆𝑖𝑗S=\{S\backslash\{i\}\}\cup\{j\}italic_S = { italic_S \ { italic_i } } ∪ { italic_j } and iterate.

Let ST𝑆𝑇S\subseteq Titalic_S ⊆ italic_T be a locally optimal (under single element swaps) solution for this process (such an S𝑆Sitalic_S corresponds to the locally optimal solution determinant maximization problem subject to the cardinality constraint |S|𝑆|S|\leq\ell| italic_S | ≤ roman_ℓ), and let A=iSvivi𝐴subscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖topA=\sum_{i\in S}v_{i}v_{i}^{\top}italic_A = ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. More concretely, this means that for all iS𝑖𝑆i\in Sitalic_i ∈ italic_S and jT\S𝑗\𝑇𝑆j\in T\backslash Sitalic_j ∈ italic_T \ italic_S,

det(A)det(Avivi+vjvj).𝐴𝐴subscript𝑣𝑖superscriptsubscript𝑣𝑖topsubscript𝑣𝑗superscriptsubscript𝑣𝑗top\det(A)\geq\det(A-v_{i}v_{i}^{\top}+v_{j}v_{j}^{\top})\,.roman_det ( italic_A ) ≥ roman_det ( italic_A - italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) .

We calculate the determinant on the right-hand side using the matrix determinant lemma,

det(Avivi+vjvj)𝐴subscript𝑣𝑖superscriptsubscript𝑣𝑖topsubscript𝑣𝑗superscriptsubscript𝑣𝑗top\displaystyle\det(A-v_{i}v_{i}^{\top}+v_{j}v_{j}^{\top})roman_det ( italic_A - italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) =det(A+[vivj][vivj])=det(A)det(I2+[vivj]A1[vivj])absent𝐴matrixsubscript𝑣𝑖subscript𝑣𝑗superscriptmatrixsubscript𝑣𝑖subscript𝑣𝑗top𝐴subscript𝐼2superscriptmatrixsubscript𝑣𝑖subscript𝑣𝑗topsuperscript𝐴1matrixsubscript𝑣𝑖subscript𝑣𝑗\displaystyle=\det\left(A+\begin{bmatrix}v_{i}&v_{j}\end{bmatrix}\begin{% bmatrix}-v_{i}&v_{j}\end{bmatrix}^{\top}\right)=\det(A)\cdot\det\left(I_{2}+% \begin{bmatrix}-v_{i}&v_{j}\end{bmatrix}^{\top}A^{-1}\begin{bmatrix}v_{i}&v_{j% }\end{bmatrix}\right)= roman_det ( italic_A + [ start_ARG start_ROW start_CELL italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] [ start_ARG start_ROW start_CELL - italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) = roman_det ( italic_A ) ⋅ roman_det ( italic_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + [ start_ARG start_ROW start_CELL - italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT [ start_ARG start_ROW start_CELL italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] )
=det(A)((1viA1vi)(1+vjA1vj)+(viA1vj)2).absent𝐴1superscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑖1superscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗superscriptsuperscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑗2\displaystyle=\det(A)\cdot\left((1-v_{i}^{\top}A^{-1}v_{i})(1+v_{j}^{\top}A^{-% 1}v_{j})+(v_{i}^{\top}A^{-1}v_{j})^{2}\right)\,.= roman_det ( italic_A ) ⋅ ( ( 1 - italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 + italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + ( italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

So local optimality implies that for every iS𝑖𝑆i\in Sitalic_i ∈ italic_S and jS𝑗𝑆j\notin Sitalic_j ∉ italic_S, (1viA1vi)(1+vjA1vj)+(viA1vj)211superscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑖1superscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗superscriptsuperscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑗21(1-v_{i}^{\top}A^{-1}v_{i})(1+v_{j}^{\top}A^{-1}v_{j})+(v_{i}^{\top}A^{-1}v_{j% })^{2}\leq 1( 1 - italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 + italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + ( italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ 1. Rearranging this inequality we get

vjA1vj(viA1vi)(vjA1vj)+(viA1vj)2viA1vi.superscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗superscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑖superscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗superscriptsuperscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑗2superscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑖v_{j}^{\top}A^{-1}v_{j}-(v_{i}^{\top}A^{-1}v_{i})\cdot(v_{j}^{\top}A^{-1}v_{j}% )+(v_{i}^{\top}A^{-1}v_{j})^{2}\leq v_{i}^{\top}A^{-1}v_{i}\,.italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - ( italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⋅ ( italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + ( italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT . (1)

Note that iSviA1vi=A,A1=dsubscript𝑖𝑆superscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑖𝐴superscript𝐴1𝑑\sum_{i\in S}v_{i}^{\top}A^{-1}v_{i}=\langle A,A^{-1}\rangle=d∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ⟨ italic_A , italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⟩ = italic_d and iS(viA1vj)2=vjA1vjsubscript𝑖𝑆superscriptsuperscriptsubscript𝑣𝑖topsuperscript𝐴1subscript𝑣𝑗2superscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗\sum_{i\in S}(v_{i}^{\top}A^{-1}v_{j})^{2}=v_{j}^{\top}A^{-1}v_{j}∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. So for a fixed jT\S𝑗\𝑇𝑆j\in T\backslash Sitalic_j ∈ italic_T \ italic_S, summing equation (1) over all iS𝑖𝑆i\in Sitalic_i ∈ italic_S implies vjA1vjdvjA1vj+vjA1vjdsuperscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗𝑑superscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗superscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗𝑑\ell\cdot v_{j}^{\top}A^{-1}v_{j}-d\cdot v_{j}^{\top}A^{-1}v_{j}+v_{j}^{\top}A% ^{-1}v_{j}\leq droman_ℓ ⋅ italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_d ⋅ italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_d. Rearranging, we see that for any jT\S𝑗\𝑇𝑆j\in T\backslash Sitalic_j ∈ italic_T \ italic_S,

vjA1vjdd+1=ϵ210log(d),superscriptsubscript𝑣𝑗topsuperscript𝐴1subscript𝑣𝑗𝑑𝑑1superscriptitalic-ϵ210𝑑v_{j}^{\top}A^{-1}v_{j}\leq\frac{d}{\ell-d+1}=\frac{\epsilon^{2}}{10\log(d)}\,,italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ divide start_ARG italic_d end_ARG start_ARG roman_ℓ - italic_d + 1 end_ARG = divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log ( italic_d ) end_ARG ,

where the last equality follows by choosing =10dlog(d)/ϵ2+d110𝑑𝑑superscriptitalic-ϵ2𝑑1\ell=10d\log(d)/\epsilon^{2}+d-1roman_ℓ = 10 italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_d - 1. \Box

We will apply this lemma to the case when T={viσ(i):i[k]}𝑇conditional-setsubscript𝑣𝑖superscript𝜎𝑖𝑖delimited-[]𝑘T=\{v_{i\sigma^{\star}(i)}:i\in[k]\}italic_T = { italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT : italic_i ∈ [ italic_k ] }m where σ*superscript𝜎\sigma^{*}italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is the choice function that maximizes the minimum eigenvalue, i.e., when T𝑇Titalic_T contains the vectors from an optimal integral assignment. In particular, we get the following corollary.

Lemma 2

There is a subset SE𝑆𝐸S\subseteq Eitalic_S ⊆ italic_E such that |S|=O(dlog(d)/ϵ2)𝑆𝑂𝑑𝑑superscriptitalic-ϵ2|S|=O(d\log(d)/\epsilon^{2})| italic_S | = italic_O ( italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), and the convex program CP(S) is a relaxation for the minimum eigenvalue problem.

As d𝑑ditalic_d is a constant, the size of the set S𝑆Sitalic_S we search for is also constant. Thus, there are at most O(nO(dlog(d)/ϵ2))𝑂superscript𝑛𝑂𝑑𝑑superscriptitalic-ϵ2O(n^{O(d\log(d)/\epsilon^{2})})italic_O ( italic_n start_POSTSUPERSCRIPT italic_O ( italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) possible choices for S𝑆Sitalic_S. We will consider each choice in turn to guess the correct set. Note that trying every set of the appropriate size will be the dominant factor in determining the algorithm’s runtime.

The following lemma proves that for any fixed subset S𝑆Sitalic_S, rounding the optimal solution to CP(S) gives a good approximation to the optimal value of CP(S).

Lemma 3

Let SE𝑆𝐸S\subseteq Eitalic_S ⊆ italic_E be an independent set, and let xsuperscript𝑥normal-⋆x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT be the optimal solution to CP(S). Then rounding randomly in each part outputs an assignment σ:[k]Enormal-:𝜎normal-→delimited-[]𝑘𝐸\sigma:[k]\rightarrow Eitalic_σ : [ italic_k ] → italic_E with σ(i)Pi𝜎𝑖subscript𝑃𝑖\sigma(i)\in P_{i}italic_σ ( italic_i ) ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT such that

Pr[λmin(i=1kviσ(i)viσ(i))<(1ϵ)λmin(i=1kjPixijvijvij)]<d4.Prsubscript𝜆superscriptsubscript𝑖1𝑘subscript𝑣𝑖𝜎𝑖superscriptsubscript𝑣𝑖𝜎𝑖top1italic-ϵsubscript𝜆superscriptsubscript𝑖1𝑘subscript𝑗subscript𝑃𝑖subscriptsuperscript𝑥𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topsuperscript𝑑4\operatorname{\mathrm{Pr}}\left[\lambda_{\min}\left(\sum_{i=1}^{k}v_{i\sigma(i% )}v_{i\sigma(i)}^{\top}\right)<(1-\epsilon)\cdot\lambda_{\min}\left(\sum_{i=1}% ^{k}\sum_{j\in P_{i}}x^{\star}_{ij}\,v_{ij}v_{ij}^{\top}\right)\right]<d^{-4}.roman_Pr [ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) < ( 1 - italic_ϵ ) ⋅ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ] < italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT .

Proof  Let X=i=1kjPixijvijvij𝑋superscriptsubscript𝑖1𝑘subscript𝑗subscript𝑃𝑖subscriptsuperscript𝑥𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topX=\sum_{i=1}^{k}\sum_{j\in P_{i}}x^{\star}_{ij}\,v_{ij}v_{ij}^{\top}italic_X = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. The matrix Xsuperscript𝑋X^{\star}italic_X start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT contains vijvijsubscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topv_{ij}v_{ij}^{\top}italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT with coefficient 1111 for every (i,j)S𝑖𝑗𝑆(i,j)\in S( italic_i , italic_j ) ∈ italic_S. Thus XiSvivisucceeds-or-equals𝑋subscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖topX\succeq\sum_{i\in S}v_{i}v_{i}^{\top}italic_X ⪰ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, so vjX1vjϵ210log(d)superscriptsubscript𝑣𝑗topsuperscript𝑋1subscript𝑣𝑗superscriptitalic-ϵ210𝑑v_{j}^{\top}X^{-1}v_{j}\leq\frac{\epsilon^{2}}{10\log(d)}italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log ( italic_d ) end_ARG for all jLS𝑗𝐿𝑆j\notin L\cup Sitalic_j ∉ italic_L ∪ italic_S. For the purposes of the analysis, for every jS𝑗𝑆j\in Sitalic_j ∈ italic_S we can replace the vector visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT with (viX1vi)10log(d)/ϵ2superscriptsubscript𝑣𝑖topsuperscript𝑋1subscript𝑣𝑖10𝑑superscriptitalic-ϵ2(v_{i}^{\top}X^{-1}v_{i})\cdot\sqrt{10\log(d)/\epsilon^{2}}( italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⋅ square-root start_ARG 10 roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG copies of the same vector scaled down to have squared-length at most ϵ2/(10log(d))superscriptitalic-ϵ210𝑑\epsilon^{2}/(10\log(d))italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 10 roman_log ( italic_d ) ) with respect to X𝑋Xitalic_X. Since all elements of S𝑆Sitalic_S get value 1111 in xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, we can similarly extend the vector xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT so that it has a 1111 in all the copied entries. Since these values of x*superscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT are deterministic, nothing changes about the resulting distribution over matrices, but we can now assume that vijX1vij<ϵ2/(10log(d))superscriptsubscript𝑣𝑖𝑗topsuperscript𝑋1subscript𝑣𝑖𝑗superscriptitalic-ϵ210𝑑v_{ij}^{\top}X^{-1}v_{ij}<\epsilon^{2}/(10\log(d))italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT < italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 10 roman_log ( italic_d ) ) for all (i,j)𝑖𝑗(i,j)( italic_i , italic_j ) in the support of xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT.

Next, define random matrices M1,,Mksubscript𝑀1subscript𝑀𝑘M_{1},\ldots,M_{k}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT such that for any i[k]𝑖delimited-[]𝑘i\in[k]italic_i ∈ [ italic_k ], Pr(Mi=vijvij)=xijPrsubscript𝑀𝑖subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topsubscriptsuperscript𝑥𝑖𝑗\operatorname{\mathrm{Pr}}\left(M_{i}=v_{ij}v_{ij}^{\top}\right)=x^{\star}_{ij}roman_Pr ( italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) = italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT for all jPi𝑗subscript𝑃𝑖j\in P_{i}italic_j ∈ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We then apply Corollary 2 with A=X𝐴𝑋A=Xitalic_A = italic_X on random matrices M1,,Mksubscript𝑀1subscript𝑀𝑘M_{1},\ldots,M_{k}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Since xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is not supported on L𝐿Litalic_L,

R=maxiλmax(X1/2MiX1/2)ϵ210log(d).𝑅subscript𝑖subscript𝜆superscript𝑋12subscript𝑀𝑖superscript𝑋12superscriptitalic-ϵ210𝑑R=\max_{i}\;\lambda_{\max}(X^{-1/2}M_{i}X^{-1/2})\leq\frac{\epsilon^{2}}{10% \log(d)}\,.italic_R = roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) ≤ divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log ( italic_d ) end_ARG .

In addition, as 𝔼(iMi)=X𝔼subscript𝑖subscript𝑀𝑖𝑋\operatorname{\mathbb{E}}\left(\sum_{i}M_{i}\right)=Xblackboard_E ( ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_X by definition, we have μmin=1subscript𝜇1\mu_{\min}=1italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 1.

Thus, if σ:[k]E:𝜎delimited-[]𝑘𝐸\sigma:[k]\rightarrow Eitalic_σ : [ italic_k ] → italic_E is the choice function obtained by independent rounding,

Prσ[i=1kviσ(i)viσ(i)(1ϵ)X]subscriptPr𝜎not-succeeds-nor-equalssuperscriptsubscript𝑖1𝑘subscript𝑣𝑖𝜎𝑖superscriptsubscript𝑣𝑖𝜎𝑖top1italic-ϵ𝑋\displaystyle\operatorname{\mathrm{Pr}}_{\sigma}\left[\sum_{i=1}^{k}v_{i\sigma% (i)}v_{i\sigma(i)}^{\top}\nsucceq(1-\epsilon)X\right]roman_Pr start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⋡ ( 1 - italic_ϵ ) italic_X ] dexp(5log(d))=d4.absent𝑑5𝑑superscript𝑑4\displaystyle\leq d\cdot\exp(-5\log(d))=d^{-4}.≤ italic_d ⋅ roman_exp ( - 5 roman_log ( italic_d ) ) = italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT .

We conclude that λmin(i=1kviσ(i)viσ(i))(1ϵ)λmin(X)subscript𝜆superscriptsubscript𝑖1𝑘subscript𝑣𝑖𝜎𝑖superscriptsubscript𝑣𝑖𝜎𝑖top1italic-ϵsubscript𝜆𝑋\lambda_{\min}\left(\sum_{i=1}^{k}v_{i\sigma(i)}v_{i\sigma(i)}^{\top}\right)% \geq(1-\epsilon)\lambda_{\min}\left(X\right)italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ≥ ( 1 - italic_ϵ ) italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_X ) with probability at least 1d41superscript𝑑41-d^{-4}1 - italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT. \Box

Combining this lemma with the earlier guarantee that there exists a set S𝑆Sitalic_S of reasonable size such that CP(S) is a relaxation, we get the following algorithm: try all possible choices for the set S𝑆Sitalic_S and return the solution with the best objective.

Algorithm 1 Algorithm to find an approximation to OPT𝑂𝑃𝑇OPTitalic_O italic_P italic_T
1:Input: Partition matroid \mathcal{M}caligraphic_M with k𝑘kitalic_k parts P1,,Pksubscript𝑃1subscript𝑃𝑘P_{1},\ldots,P_{k}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
2:for each S[n]𝑆delimited-[]𝑛S\subseteq[n]italic_S ⊆ [ italic_n ] such that |S|=10dlog(d)/ϵ2+d1𝑆10𝑑𝑑superscriptitalic-ϵ2𝑑1|S|=10d\log(d)/\epsilon^{2}+d-1| italic_S | = 10 italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_d - 1 do
3:     x*superscript𝑥absentx^{*}\leftarrowitalic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ← optimal solution of CP(S) for matroid \mathcal{M}caligraphic_M
4:     For each i[k]𝑖delimited-[]𝑘i\in[k]italic_i ∈ [ italic_k ], assign σS(i)=jsubscript𝜎𝑆𝑖𝑗\sigma_{S}(i)=jitalic_σ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ( italic_i ) = italic_j with probability xij*subscriptsuperscript𝑥𝑖𝑗x^{*}_{ij}italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT
5:end for
6:Return the choice function σSsubscript𝜎𝑆\sigma_{S}italic_σ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT which maximizes λmin(iviσS(i)viσS(i))subscript𝜆subscript𝑖subscript𝑣𝑖subscript𝜎𝑆𝑖superscriptsubscript𝑣𝑖subscript𝜎𝑆𝑖top\lambda_{\min}\left(\sum_{i}v_{i\sigma_{S}(i)}v_{i\sigma_{S}(i)}^{\top}\right)italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) over all choices of S𝑆Sitalic_S

Proof  (of Theorem 1 for Partition Matroids) By Lemma 2 there is a set SE𝑆𝐸S\subseteq Eitalic_S ⊆ italic_E with |S|=O(dlogd/ϵ2)𝑆𝑂𝑑𝑑superscriptitalic-ϵ2|S|=O(d\log d/\epsilon^{2})| italic_S | = italic_O ( italic_d roman_log italic_d / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) such that CP(S) is a relaxation.

Let x*superscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT be the optimal value of CP(S) and let σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT be the choice function of the optimal basis for the minimum eigenvalue problem. Since CP(S) is a relaxation, we have λmin(i=1kviσ*(i)viσ*(i))λmin(ijxij*vijvij)subscript𝜆superscriptsubscript𝑖1𝑘subscript𝑣𝑖superscript𝜎𝑖superscriptsubscript𝑣𝑖superscript𝜎𝑖topsubscript𝜆subscript𝑖𝑗subscriptsuperscript𝑥𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗top\lambda_{\min}\left(\sum_{i=1}^{k}v_{i\sigma^{*}(i)}v_{i\sigma^{*}(i)}^{\top}% \right)\leq\lambda_{\min}\left(\sum_{ij}x^{*}_{ij}v_{ij}v_{ij}^{\top}\right)italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ≤ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ).

Lemma 3 implies that with high probability, the choice function obtained by rounding x*superscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, σSsubscript𝜎𝑆\sigma_{S}italic_σ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT, is a good approximation to CP(S). So combining Lemma 3 with the previous inequality gives

λmin(i=1kviσS(i)viσS(i))subscript𝜆superscriptsubscript𝑖1𝑘subscript𝑣𝑖subscript𝜎𝑆𝑖superscriptsubscript𝑣𝑖subscript𝜎𝑆𝑖top\displaystyle\lambda_{\min}\left(\sum_{i=1}^{k}v_{i\sigma_{S}(i)}v_{i\sigma_{S% }(i)}^{\top}\right)italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) (1ϵ)λmin(ijxij*vijvij)(1ϵ)λmin(i=1kviσ*(i)viσ*(i)),absent1italic-ϵsubscript𝜆subscript𝑖𝑗subscriptsuperscript𝑥𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗top1italic-ϵsubscript𝜆superscriptsubscript𝑖1𝑘subscript𝑣𝑖superscript𝜎𝑖superscriptsubscript𝑣𝑖superscript𝜎𝑖top\displaystyle\geq(1-\epsilon)\cdot\lambda_{\min}\left(\sum_{ij}x^{*}_{ij}v_{ij% }v_{ij}^{\top}\right)\geq(1-\epsilon)\cdot\lambda_{\min}\left(\sum_{i=1}^{k}v_% {i\sigma^{*}(i)}v_{i\sigma^{*}(i)}^{\top}\right)\,,≥ ( 1 - italic_ϵ ) ⋅ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ≥ ( 1 - italic_ϵ ) ⋅ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ,

with probability at least 1d41superscript𝑑41-d^{-4}1 - italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT. Since we iterate over all choice functions in step 6 of Algorithm 1, we will output a choice function σ𝜎\sigmaitalic_σ which is at least as good as σSsubscript𝜎𝑆\sigma_{S}italic_σ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT with the same probability. \Box

2.1 Application: Algorithmic Kadison-Singer Problem

The Kadison-Singer conjecture was resolved in [MSS15] using the following theorem which can be interpreted as a generalization of Weaver’s conjecture  [Wea04].

Theorem 4

[MSS15, Corollary 1.5 with r=2𝑟2r=2italic_r = 2] Let u1,,umdsubscript𝑢1normal-…subscript𝑢𝑚superscript𝑑u_{1},\ldots,u_{m}\in\mathbb{R}^{d}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT be vectors such that i=1muiui=Isuperscriptsubscript𝑖1𝑚subscript𝑢𝑖superscriptsubscript𝑢𝑖top𝐼\sum_{i=1}^{m}u_{i}u_{i}^{\top}=I∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT = italic_I and ui2αsuperscriptnormsubscript𝑢𝑖2𝛼\|u_{i}\|^{2}\leq\alpha∥ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_α for all i.𝑖i.italic_i . There exists a set T[m]𝑇delimited-[]𝑚T\subseteq[m]italic_T ⊆ [ italic_m ] such that

(123α)IdiTuiui(12+3α)Id.precedes-or-equals123𝛼subscript𝐼𝑑subscript𝑖𝑇subscript𝑢𝑖superscriptsubscript𝑢𝑖topprecedes-or-equals123𝛼subscript𝐼𝑑\left(\frac{1}{2}-3\sqrt{\alpha}\right)I_{d}\preceq\sum_{i\in T}u_{i}u_{i}^{% \top}\preceq\left(\frac{1}{2}+3\sqrt{\alpha}\right)I_{d}.( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - 3 square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_T end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪯ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG + 3 square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT .

Their proof is based on analyzing interlacing families of polynomials and does not lead to an efficient algorithm to find such a subset T𝑇Titalic_T.

In [JMS22], they introduce an algorithmic form of the Kadison-Singer problem, which asks to find such a subset assuming it exists. For a constant c>0𝑐0c>0italic_c > 0 and a set of vectors u1,,umdsubscript𝑢1subscript𝑢𝑚superscript𝑑u_{1},\ldots,u_{m}\in\mathbb{R}^{d}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT such that ui2αsuperscriptnormsubscript𝑢𝑖2𝛼\|u_{i}\|^{2}\leq\alpha∥ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_α, i=1muiui=Isuperscriptsubscript𝑖1𝑚subscript𝑢𝑖superscriptsubscript𝑢𝑖top𝐼\sum_{i=1}^{m}u_{i}u_{i}^{\top}=I∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT = italic_I where there exists a subset T[m]𝑇delimited-[]𝑚T\subseteq[m]italic_T ⊆ [ italic_m ] satisfying

(12cα)IiTuiui(12+cα),precedes-or-equals12𝑐𝛼𝐼subscript𝑖𝑇subscript𝑢𝑖superscriptsubscript𝑢𝑖topprecedes-or-equals12𝑐𝛼\left(\frac{1}{2}-c\sqrt{\alpha}\right)I\preceq\sum_{i\in T}u_{i}u_{i}^{\top}% \preceq\left(\frac{1}{2}+c\sqrt{\alpha}\right),( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_c square-root start_ARG italic_α end_ARG ) italic_I ⪯ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_T end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪯ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_c square-root start_ARG italic_α end_ARG ) , (2)

the goal is actually to find a set T[m]𝑇delimited-[]𝑚T\subseteq[m]italic_T ⊆ [ italic_m ] which satisfies the above condition. This problem is FNP-hard when c=1/(42)𝑐142c=1/(4\sqrt{2})italic_c = 1 / ( 4 square-root start_ARG 2 end_ARG ) for general values of d𝑑ditalic_d [JMS22, Theorem 2].

Their main result [JMS22, Theorem 1] is an algorithm with running time

O((mk)poly(m,d)) for k=O(dϵ2log(d)log(1cα)),𝑂binomial𝑚𝑘poly𝑚𝑑 for 𝑘𝑂𝑑superscriptitalic-ϵ2𝑑1𝑐𝛼O\left(\binom{m}{k}\cdot\text{poly}(m,d)\right)\text{ for }k=O\left(\frac{d}{% \epsilon^{2}}\log(d)\log\left(\frac{1}{c\sqrt{\alpha}}\right)\right)\,,italic_O ( ( FRACOP start_ARG italic_m end_ARG start_ARG italic_k end_ARG ) ⋅ poly ( italic_m , italic_d ) ) for italic_k = italic_O ( divide start_ARG italic_d end_ARG start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG roman_log ( italic_d ) roman_log ( divide start_ARG 1 end_ARG start_ARG italic_c square-root start_ARG italic_α end_ARG end_ARG ) ) ,

which returns a set T[m]superscript𝑇delimited-[]𝑚T^{\prime}\subseteq[m]italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ⊆ [ italic_m ] such that

(1ϵ)(12cα)IiTuiui(1+ϵ)(12+cα)I,precedes-or-equals1italic-ϵ12𝑐𝛼𝐼subscript𝑖superscript𝑇subscript𝑢𝑖superscriptsubscript𝑢𝑖topprecedes-or-equals1italic-ϵ12𝑐𝛼𝐼(1-\epsilon)\left(\frac{1}{2}-c\sqrt{\alpha}\right)I\preceq\sum_{i\in T^{% \prime}}u_{i}u_{i}^{\top}\preceq(1+\epsilon)\left(\frac{1}{2}+c\sqrt{\alpha}% \right)I,( 1 - italic_ϵ ) ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_c square-root start_ARG italic_α end_ARG ) italic_I ⪯ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪯ ( 1 + italic_ϵ ) ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_c square-root start_ARG italic_α end_ARG ) italic_I , (3)

In this section, we will show how to use the rounding technique for partition matroids to give a simpler algorithm that achieves the same guarantee with the same run time, except we save the small dependence on log(1/cα)1𝑐𝛼\log(1/c\sqrt{\alpha})roman_log ( 1 / italic_c square-root start_ARG italic_α end_ARG ) in the exponent.

Proof  (of Corollary 1) Given vectors u1,,umdsubscript𝑢1subscript𝑢𝑚superscript𝑑u_{1},\ldots,u_{m}\in\mathbb{R}^{d}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, we construct an instance of the minimum eigenvalue with partition constraints as follows. Let E={1,2}×[m]𝐸12delimited-[]𝑚E=\{1,2\}\times[m]italic_E = { 1 , 2 } × [ italic_m ], with m𝑚mitalic_m parts P1,,Pmsubscript𝑃1subscript𝑃𝑚P_{1},\ldots,P_{m}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_P start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT so that Pi={(i,1),(i,2)}subscript𝑃𝑖𝑖1𝑖2P_{i}=\{(i,1),(i,2)\}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { ( italic_i , 1 ) , ( italic_i , 2 ) } for i[m]𝑖delimited-[]𝑚i\in[m]italic_i ∈ [ italic_m ]. For each i[m]𝑖delimited-[]𝑚i\in[m]italic_i ∈ [ italic_m ] define the vectors

vi1=[ui0]2d, and vi2=[0ui]2d.formulae-sequencesubscript𝑣𝑖1matrixsubscript𝑢𝑖0superscript2𝑑 and subscript𝑣𝑖2matrix0subscript𝑢𝑖superscript2𝑑v_{i1}=\begin{bmatrix}u_{i}\\ 0\end{bmatrix}\in\mathbb{R}^{2d},\text{ and }v_{i2}=\begin{bmatrix}0\\ u_{i}\end{bmatrix}\in\mathbb{R}^{2d}.italic_v start_POSTSUBSCRIPT italic_i 1 end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL 0 end_CELL end_ROW end_ARG ] ∈ blackboard_R start_POSTSUPERSCRIPT 2 italic_d end_POSTSUPERSCRIPT , and italic_v start_POSTSUBSCRIPT italic_i 2 end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL 0 end_CELL end_ROW start_ROW start_CELL italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ∈ blackboard_R start_POSTSUPERSCRIPT 2 italic_d end_POSTSUPERSCRIPT .

To see how v𝑣vitalic_v and u𝑢uitalic_u are related, note that for any δ[0,1/2)𝛿012\delta\in[0,1/2)italic_δ ∈ [ 0 , 1 / 2 ) there is a choice function σ:[m]{1,2}:𝜎delimited-[]𝑚12\sigma:[m]\rightarrow\{1,2\}italic_σ : [ italic_m ] → { 1 , 2 } such that

(12δ)I2di=1mviσ(i)viσ(i)precedes-or-equals12𝛿subscript𝐼2𝑑superscriptsubscript𝑖1𝑚subscript𝑣𝑖𝜎𝑖superscriptsubscript𝑣𝑖𝜎𝑖top\left(\frac{1}{2}-\delta\right)I_{2d}\preceq\sum_{i=1}^{m}v_{i\sigma(i)}v_{i% \sigma(i)}^{\top}( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_δ ) italic_I start_POSTSUBSCRIPT 2 italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT (4)

if and only if there is a set T[m]𝑇delimited-[]𝑚T\subseteq[m]italic_T ⊆ [ italic_m ] such that

(12δ)IiTuiui(12+δ)I.precedes-or-equals12𝛿𝐼subscript𝑖𝑇subscript𝑢𝑖superscriptsubscript𝑢𝑖topprecedes-or-equals12𝛿𝐼\left(\frac{1}{2}-\delta\right)I\preceq\sum_{i\in T}u_{i}u_{i}^{\top}\preceq% \left(\frac{1}{2}+\delta\right)I.( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_δ ) italic_I ⪯ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_T end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪯ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_δ ) italic_I . (5)

Given σ𝜎\sigmaitalic_σ satisfying (4), let X1:=i:σ(i)=1uiuiassignsubscript𝑋1subscript:𝑖𝜎𝑖1subscript𝑢𝑖superscriptsubscript𝑢𝑖topX_{1}:=\sum_{i:\sigma(i)=1}u_{i}u_{i}^{\top}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT := ∑ start_POSTSUBSCRIPT italic_i : italic_σ ( italic_i ) = 1 end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and X2:=i:σ(i)=2uiuiassignsubscript𝑋2subscript:𝑖𝜎𝑖2subscript𝑢𝑖superscriptsubscript𝑢𝑖topX_{2}:=\sum_{i:\sigma(i)=2}u_{i}u_{i}^{\top}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT := ∑ start_POSTSUBSCRIPT italic_i : italic_σ ( italic_i ) = 2 end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Then X1subscript𝑋1X_{1}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and X2subscript𝑋2X_{2}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are respectively the first and second diagonal d×d𝑑𝑑d\times ditalic_d × italic_d block of i=1mviσ(i)viσ(i)superscriptsubscript𝑖1𝑚subscript𝑣𝑖𝜎𝑖superscriptsubscript𝑣𝑖𝜎𝑖top\sum_{i=1}^{m}v_{i\sigma(i)}v_{i\sigma(i)}^{\top}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Therefore (12δ)I2di=1mviσ(i)viσ(i)precedes-or-equals12𝛿subscript𝐼2𝑑superscriptsubscript𝑖1𝑚subscript𝑣𝑖𝜎𝑖superscriptsubscript𝑣𝑖𝜎𝑖top\left(\frac{1}{2}-\delta\right)I_{2d}\preceq\sum_{i=1}^{m}v_{i\sigma(i)}v_{i% \sigma(i)}^{\top}( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_δ ) italic_I start_POSTSUBSCRIPT 2 italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT if and only if X1(12δ)Idsucceeds-or-equalssubscript𝑋112𝛿subscript𝐼𝑑X_{1}\succeq\left(\frac{1}{2}-\delta\right)I_{d}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⪰ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_δ ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT and X2(12δ)Idsucceeds-or-equalssubscript𝑋212𝛿subscript𝐼𝑑X_{2}\succeq\left(\frac{1}{2}-\delta\right)I_{d}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⪰ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_δ ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT. In addition, since X1+X2=Idsubscript𝑋1subscript𝑋2subscript𝐼𝑑X_{1}+X_{2}=I_{d}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT, this is equivalent to

(12δ)Idi:σ(i)=1uiui=IdX2(12+δ)Id.precedes-or-equals12𝛿subscript𝐼𝑑subscript:𝑖𝜎𝑖1subscript𝑢𝑖subscript𝑢𝑖subscript𝐼𝑑subscript𝑋2precedes-or-equals12𝛿subscript𝐼𝑑\left(\frac{1}{2}-\delta\right)I_{d}\preceq\sum_{i:\sigma(i)=1}u_{i}u_{i}=I_{d% }-X_{2}\preceq\left(\frac{1}{2}+\delta\right)I_{d}.( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_δ ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i : italic_σ ( italic_i ) = 1 end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT - italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⪯ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_δ ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT .

We then use Algorithm 1 to find a (1ϵ)1italic-ϵ(1-\epsilon)( 1 - italic_ϵ ) approximate solution σ:[m]{1,2}:𝜎delimited-[]𝑚12\sigma:[m]\rightarrow\{1,2\}italic_σ : [ italic_m ] → { 1 , 2 } to input \mathcal{M}caligraphic_M and vectors vijsubscript𝑣𝑖𝑗v_{ij}italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT. Since we assume there is a set T𝑇Titalic_T satisfying (2), Theorem 1 implies that with probability at least 1O(d4)1𝑂superscript𝑑41-O(d^{-4})1 - italic_O ( italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT ), Algorithm 1 will return a choice function σ*superscript𝜎\sigma^{*}italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT such that (1ϵ)(12cα)I2di=1mviσ*(i)viσ*(i)precedes-or-equals1italic-ϵ12𝑐𝛼subscript𝐼2𝑑superscriptsubscript𝑖1𝑚subscript𝑣𝑖superscript𝜎𝑖superscriptsubscript𝑣𝑖superscript𝜎𝑖top(1-\epsilon)\left(\frac{1}{2}-c\sqrt{\alpha}\right)I_{2d}\preceq\sum_{i=1}^{m}% v_{i\sigma^{*}(i)}v_{i\sigma^{*}(i)}^{\top}( 1 - italic_ϵ ) ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_c square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT 2 italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, and we will return the set T={i[m]:σ*(i)=1}superscript𝑇conditional-set𝑖delimited-[]𝑚superscript𝜎𝑖1T^{\prime}=\{i\in[m]:\sigma^{*}(i)=1\}italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { italic_i ∈ [ italic_m ] : italic_σ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_i ) = 1 }.

From the equivalence between (4) and (5), the set T={i[m]:σ(i)=1}superscript𝑇conditional-set𝑖delimited-[]𝑚𝜎𝑖1T^{\prime}=\{i\in[m]:\sigma(i)=1\}italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { italic_i ∈ [ italic_m ] : italic_σ ( italic_i ) = 1 } satisfies (3)

(1ϵ)(12cα)IdiTuiui(1+ϵ)(12+cα)Id.precedes-or-equals1italic-ϵ12𝑐𝛼subscript𝐼𝑑subscript𝑖superscript𝑇subscript𝑢𝑖subscript𝑢𝑖precedes-or-equals1italic-ϵ12𝑐𝛼subscript𝐼𝑑(1-\epsilon)\left(\frac{1}{2}-c\sqrt{\alpha}\right)I_{d}\preceq\sum_{i\in T^{% \prime}}u_{i}u_{i}\preceq(1+\epsilon)\left(\frac{1}{2}+c\sqrt{\alpha}\right)I_% {d}\,.( 1 - italic_ϵ ) ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG - italic_c square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⪯ ( 1 + italic_ϵ ) ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG + italic_c square-root start_ARG italic_α end_ARG ) italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT .

\Box

3 General Matroid Constraints

In the general form of the problem, we are given a collection of vectors v1,,vndsubscript𝑣1subscript𝑣𝑛superscript𝑑v_{1},\ldots,v_{n}\in\mathbb{R}^{d}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and a matroid =([n],)delimited-[]𝑛\mathcal{M}=([n],\mathcal{I})caligraphic_M = ( [ italic_n ] , caligraphic_I ), and the goal is to find a basis B𝐵B\in\mathcal{I}italic_B ∈ caligraphic_I which maximizes λmin(iBvivi).subscript𝜆subscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top\lambda_{\min}\left(\sum_{i\in B}v_{i}v_{i}^{\top}\right).italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) . For background on matroids, see Appendix B.1.

For a general matroid, the idea of finding a linear transformation under which all elements in the optimal solution have a small norm generalizes easily. So we can use the same approach of first guessing a set SE𝑆𝐸S\subseteq Eitalic_S ⊆ italic_E on a reasonable size and then solving the convex relaxation of the problem conditioned on S𝑆Sitalic_S being included in the solution.

Given a subset S[n]𝑆delimited-[]𝑛S\subseteq[n]italic_S ⊆ [ italic_n ], we can again set AS=iSvivisubscript𝐴𝑆subscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖topA_{S}=\sum_{i\in S}v_{i}v_{i}^{\top}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, and consider the set of long vectors:

L(S)={i[n]\S:viAS1vi>ϵ210log(d)}.𝐿𝑆conditional-set𝑖\delimited-[]𝑛𝑆superscriptsubscript𝑣𝑖topsuperscriptsubscript𝐴𝑆1subscript𝑣𝑖superscriptitalic-ϵ210𝑑L(S)=\left\{i\in[n]\backslash S:v_{i}^{\top}A_{S}^{-1}v_{i}>\frac{\epsilon^{2}% }{10\log(d)}\right\}.italic_L ( italic_S ) = { italic_i ∈ [ italic_n ] \ italic_S : italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log ( italic_d ) end_ARG } .

For a matroid \mathcal{M}caligraphic_M, let 𝒫(M)[0,1]n𝒫𝑀superscript01𝑛\mathcal{P}(M)\subseteq[0,1]^{n}caligraphic_P ( italic_M ) ⊆ [ 0 , 1 ] start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT denote the matroid base polytope. Then the following is the natural convex programming relaxation which excludes the “long” vectors.

maxλmin(X)X=i=1nxivijvijx𝒫()xi=0,iL(S)xi=1,iSx0subscript𝜆𝑋missing-subexpression𝑋superscriptsubscript𝑖1𝑛subscript𝑥𝑖subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topmissing-subexpression𝑥𝒫missing-subexpressionformulae-sequencesubscript𝑥𝑖0for-all𝑖𝐿𝑆missing-subexpressionformulae-sequencesubscript𝑥𝑖1for-all𝑖𝑆missing-subexpression𝑥0\begin{array}[]{cl}\max&\lambda_{\min}\left(X\right)\\ &X=\sum\limits_{i=1}^{n}x_{i}\cdot v_{ij}v_{ij}^{\top}\\ &x\in\mathcal{P}(\mathcal{M})\\ &x_{i}=0,\,\,\forall i\in L(S)\\ &x_{i}=1,\,\,\forall i\in S\\ &x\geq 0\end{array}start_ARRAY start_ROW start_CELL roman_max end_CELL start_CELL italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_X ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_X = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x ∈ caligraphic_P ( caligraphic_M ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0 , ∀ italic_i ∈ italic_L ( italic_S ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 , ∀ italic_i ∈ italic_S end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x ≥ 0 end_CELL end_ROW end_ARRAY (CP(S))

This convex program can be solved in polynomial time (see Appendix B.1). Just like in the partition case, Lemma 1 guarantees that there is a set S𝑆Sitalic_S for which CP(S) is a relaxation for the minimum eigenvalue problem. As before, after solving CP(S), we can guarantee that all the vectors in the fractional support of the optimal solution will have a small norm with respect to ASsubscript𝐴𝑆A_{S}italic_A start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT.

The challenge in extending the earlier approach to general matroid constraints comes from the rounding step. For a partition matroid, we could simply round the fractional optimum of CP(S) independently in each part to obtain a basis. However, for more general constraints, it is not so clear how to round a fractional solution to a basis.

Instead of rounding independently, we will use the technique of pipage rounding to find a basis. The following lemma is the lower-tail version of the same concentration inequality proved in [HO14]. For completeness, we will include a proof of the version we need in Appendix B.

Lemma 4

Let 𝒫()𝒫\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ) be a matroid base polytope and x𝒫()𝑥𝒫x\in\mathcal{P}(\mathcal{M})italic_x ∈ caligraphic_P ( caligraphic_M ). Let M1,,Mmsubscript𝑀1normal-…subscript𝑀𝑚M_{1},\ldots,M_{m}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_M start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT be self-adjoint matrices that satisfy λmax(Mi)Rsubscript𝜆subscript𝑀𝑖𝑅\lambda_{\max}(M_{i})\leq Ritalic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ italic_R. Let μ=λmin(i[n]xiMi)𝜇subscript𝜆subscript𝑖delimited-[]𝑛subscript𝑥𝑖subscript𝑀𝑖\mu=\lambda_{\min}\left(\sum_{i\in[n]}x_{i}M_{i}\right)italic_μ = italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). If randomized pipage rounding (Algorithm 3) starts at x𝑥xitalic_x and outputs the extreme point x^=χ(B)normal-^𝑥𝜒𝐵\hat{x}=\chi(B)over^ start_ARG italic_x end_ARG = italic_χ ( italic_B ) of 𝒫()𝒫\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ), then we have

Pr[iBMi(1ϵ)μ]dexp(ϵ2μ2R).Prsubscript𝑖𝐵subscript𝑀𝑖1italic-ϵ𝜇𝑑superscriptitalic-ϵ2𝜇2𝑅\operatorname{\mathrm{Pr}}\left[\sum_{i\in B}M_{i}\leq(1-\epsilon)\cdot\mu% \right]\leq d\cdot\exp\left(\frac{-\epsilon^{2}\mu}{2R}\right)\,.roman_Pr [ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ ( 1 - italic_ϵ ) ⋅ italic_μ ] ≤ italic_d ⋅ roman_exp ( divide start_ARG - italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ end_ARG start_ARG 2 italic_R end_ARG ) .

We use this lemma to generalize our earlier approach to all matroids.

Lemma 5

Let SE𝑆𝐸S\subseteq Eitalic_S ⊆ italic_E be an independent set in \mathcal{M}caligraphic_M and let xsuperscript𝑥normal-⋆x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT be the optimal solution to CP(S). Then pipage rounding starting at xsuperscript𝑥normal-⋆x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT outputs a basis B𝐵Bitalic_B such that

Pr[λmin(iBvivi)<(1ϵ)λmin(i[n]xivivi)]<d4.Prsubscript𝜆subscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵsubscript𝜆subscript𝑖delimited-[]𝑛subscriptsuperscript𝑥𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topsuperscript𝑑4\operatorname{\mathrm{Pr}}\left[\lambda_{\min}\left(\sum_{i\in B}v_{i}v_{i}^{% \top}\right)<(1-\epsilon)\lambda_{\min}\left(\sum_{i\in[n]}x^{\star}_{i}\,v_{i% }v_{i}^{\top}\right)\right]<d^{-4}.roman_Pr [ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) < ( 1 - italic_ϵ ) italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ] < italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT .

The proof is identical to that of Lemma 3, except we use the matrix concentration inequality from Lemma 4.

Using this lemma, the following algorithm gives a (1ϵ)1italic-ϵ(1-\epsilon)( 1 - italic_ϵ )-approximation with high probability.

Algorithm 2 Algorithm to find an approximation to OPT𝑂𝑃𝑇OPTitalic_O italic_P italic_T
1:for each S[n]𝑆delimited-[]𝑛S\subseteq[n]italic_S ⊆ [ italic_n ] such that |S|=10dlog(d)/ϵ2+d1𝑆10𝑑𝑑superscriptitalic-ϵ2𝑑1|S|=10d\log(d)/\epsilon^{2}+d-1| italic_S | = 10 italic_d roman_log ( italic_d ) / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_d - 1 do
2:     Solve CP(S) to get optimal solution xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT.
3:     Let BSsubscript𝐵𝑆absentB_{S}\leftarrowitalic_B start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ← basis returned by Algorithm 3 for input xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT
4:end for
5:Return the basis BSsubscript𝐵𝑆B_{S}italic_B start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT with the best objective

3.1 Proof of Theorem 1 and Theorem 2

In this section we prove Theorem 2. The proof of Theorem 1 follows identically.

Proof  (of Theorem 2) Let B*=argmaxBf(iBvivi)superscript𝐵subscript𝐵𝑓subscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖topB^{*}=\arg\max_{B\in\mathcal{I}}f(\sum_{i\in B}v_{i}v_{i}^{\top})italic_B start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = roman_arg roman_max start_POSTSUBSCRIPT italic_B ∈ caligraphic_I end_POSTSUBSCRIPT italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) and let OPT=f(iB*vivi)𝑂𝑃𝑇𝑓subscript𝑖superscript𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖topOPT=f\left(\sum_{i\in B^{*}}v_{i}v_{i}^{\top}\right)italic_O italic_P italic_T = italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ). Let SB*𝑆superscript𝐵S\subseteq B^{*}italic_S ⊆ italic_B start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT such that |S|=O(dlogd/ϵ2)𝑆𝑂𝑑𝑑superscriptitalic-ϵ2|S|=O(d\log{d}/\epsilon^{2})| italic_S | = italic_O ( italic_d roman_log italic_d / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), A=(i,j)Svijvij𝐴subscript𝑖𝑗𝑆subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topA=\sum_{(i,j)\in S}v_{ij}v_{ij}^{\top}italic_A = ∑ start_POSTSUBSCRIPT ( italic_i , italic_j ) ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is invertible, and for all (i,j)B*\S𝑖𝑗\superscript𝐵𝑆(i,j)\in B^{*}\backslash S( italic_i , italic_j ) ∈ italic_B start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT \ italic_S, vijA1vijϵ2/10dlogdsuperscriptsubscript𝑣𝑖𝑗topsuperscript𝐴1subscript𝑣𝑖𝑗superscriptitalic-ϵ210𝑑𝑑v_{ij}^{\top}A^{-1}v_{ij}\leq\epsilon^{2}/10d\log{d}italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≤ italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 10 italic_d roman_log italic_d. By Lemma 1, such a set S𝑆Sitalic_S exists.

Let x*superscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT be the optimal solution of CP(S). Since the indicator vector of B*superscript𝐵B^{*}italic_B start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT satisfies the constraints of CP(S), OPT=f(iB*vivi)f(iExi*vivi)𝑂𝑃𝑇𝑓subscript𝑖superscript𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top𝑓subscript𝑖𝐸subscriptsuperscript𝑥𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topOPT=f\left(\sum_{i\in B^{*}}v_{i}v_{i}^{\top}\right)\leq f\left(\sum_{i\in E}x% ^{*}_{i}\cdot v_{i}v_{i}^{\top}\right)italic_O italic_P italic_T = italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ≤ italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_E end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ). Therefore,

Pr[f(iB~vivi)<(1ϵ)OPT]Pr[f(iB~vivi)<(1ϵ)f(iExi*vivi)].Pr𝑓subscript𝑖~𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵ𝑂𝑃𝑇Pr𝑓subscript𝑖~𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵ𝑓subscript𝑖𝐸subscriptsuperscript𝑥𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖top\Pr\left[f(\sum_{i\in\tilde{B}}v_{i}v_{i}^{\top})<(1-\epsilon)\cdot OPT\right]% \leq\Pr\left[f(\sum_{i\in\tilde{B}}v_{i}v_{i}^{\top})<(1-\epsilon)\cdot f\left% (\sum_{i\in E}x^{*}_{i}\cdot v_{i}v_{i}^{\top}\right)\right]\,.roman_Pr [ italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ over~ start_ARG italic_B end_ARG end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) < ( 1 - italic_ϵ ) ⋅ italic_O italic_P italic_T ] ≤ roman_Pr [ italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ over~ start_ARG italic_B end_ARG end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) < ( 1 - italic_ϵ ) ⋅ italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_E end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ] .

Let X:=iExi*viviassign𝑋subscript𝑖𝐸subscriptsuperscript𝑥𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topX:=\sum_{i\in E}x^{*}_{i}\cdot v_{i}v_{i}^{\top}italic_X := ∑ start_POSTSUBSCRIPT italic_i ∈ italic_E end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Since f𝑓fitalic_f is monotone and homogeneous, we have

Pr[f(iB~vivi)<(1ϵ)f(X)]Pr𝑓subscript𝑖~𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵ𝑓𝑋\displaystyle\Pr\left[f(\sum_{i\in\tilde{B}}v_{i}v_{i}^{\top})<(1-\epsilon)% \cdot f(X)\right]roman_Pr [ italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ over~ start_ARG italic_B end_ARG end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) < ( 1 - italic_ϵ ) ⋅ italic_f ( italic_X ) ] =Pr[f(iB~vivi)<f((1ϵ)X)]absentPr𝑓subscript𝑖~𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top𝑓1italic-ϵ𝑋\displaystyle=\Pr\left[f(\sum_{i\in\tilde{B}}v_{i}v_{i}^{\top})<f((1-\epsilon)% \cdot X)\right]= roman_Pr [ italic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ over~ start_ARG italic_B end_ARG end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) < italic_f ( ( 1 - italic_ϵ ) ⋅ italic_X ) ] (6)
Pr[iB~vivi(1ϵ)X]absentPrnot-succeeds-nor-equalssubscript𝑖~𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵ𝑋\displaystyle\leq\Pr\left[\sum_{i\in\tilde{B}}v_{i}v_{i}^{\top}\nsucceq(1-% \epsilon)\cdot X\right]≤ roman_Pr [ ∑ start_POSTSUBSCRIPT italic_i ∈ over~ start_ARG italic_B end_ARG end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⋡ ( 1 - italic_ϵ ) ⋅ italic_X ]
=Pr[iB~X1/2viviX1/2(1ϵ)I]absentPrnot-succeeds-nor-equalssubscript𝑖~𝐵superscript𝑋12subscript𝑣𝑖superscriptsubscript𝑣𝑖topsuperscript𝑋121italic-ϵ𝐼\displaystyle=\Pr\left[\sum_{i\in\tilde{B}}X^{-1/2}v_{i}v_{i}^{\top}X^{-1/2}% \nsucceq(1-\epsilon)\cdot I\right]= roman_Pr [ ∑ start_POSTSUBSCRIPT italic_i ∈ over~ start_ARG italic_B end_ARG end_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ⋡ ( 1 - italic_ϵ ) ⋅ italic_I ]
=Pr[λmin(iB~X1/2viviX1/2)(1ϵ)]absentPrsubscript𝜆subscript𝑖~𝐵superscript𝑋12subscript𝑣𝑖superscriptsubscript𝑣𝑖topsuperscript𝑋121italic-ϵ\displaystyle=\Pr\left[\lambda_{\min}(\sum_{i\in\tilde{B}}X^{-1/2}v_{i}v_{i}^{% \top}X^{-1/2})\leq(1-\epsilon)\right]= roman_Pr [ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ over~ start_ARG italic_B end_ARG end_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) ≤ ( 1 - italic_ϵ ) ] (7)

Similar to the proof of Lemma 3, we will apply Lemma 4 to random matrices Mi=vivisubscript𝑀𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topM_{i}=v_{i}v_{i}^{\top}italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT after appropriate transformations to ensure R=O(ϵ2/logd)𝑅𝑂superscriptitalic-ϵ2𝑑R=O(\epsilon^{2}/\log{d})italic_R = italic_O ( italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / roman_log italic_d ). First note that since xi*=1subscriptsuperscript𝑥𝑖1x^{*}_{i}=1italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 for any iS𝑖𝑆i\in Sitalic_i ∈ italic_S, we have iB𝑖𝐵i\in Bitalic_i ∈ italic_B as pipage rounding does not change the integral elements of x*superscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. Therefore iSviviXprecedes-or-equalssubscript𝑖𝑆subscript𝑣𝑖superscriptsubscript𝑣𝑖top𝑋\sum_{i\in S}v_{i}v_{i}^{\top}\preceq X∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪯ italic_X, and for any iB\S𝑖\𝐵𝑆i\in B\backslash Sitalic_i ∈ italic_B \ italic_S,

λmax(X1/2viviX1/2)=viX1vivi(jSvjvj)1viϵ210logd.subscript𝜆superscript𝑋12subscript𝑣𝑖superscriptsubscript𝑣𝑖topsuperscript𝑋12superscriptsubscript𝑣𝑖topsuperscript𝑋1subscript𝑣𝑖superscriptsubscript𝑣𝑖topsuperscriptsubscript𝑗𝑆subscript𝑣𝑗superscriptsubscript𝑣𝑗top1subscript𝑣𝑖superscriptitalic-ϵ210𝑑\lambda_{\max}(X^{-1/2}v_{i}v_{i}^{\top}X^{-1/2})=v_{i}^{\top}X^{-1}v_{i}\leq v% _{i}^{\top}\left(\sum_{j\in S}v_{j}v_{j}^{\top}\right)^{-1}v_{i}\leq\frac{% \epsilon^{2}}{10\log{d}}.italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) = italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log italic_d end_ARG .

For any iS𝑖𝑆i\in Sitalic_i ∈ italic_S, viX1vi1subscript𝑣𝑖superscript𝑋1superscriptsubscript𝑣𝑖top1v_{i}X^{-1}v_{i}^{\top}\leq 1italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ≤ 1, and since xi*=1superscriptsubscript𝑥𝑖1x_{i}^{*}=1italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = 1 we can replace Mi=vivisubscript𝑀𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topM_{i}=v_{i}v_{i}^{\top}italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT with r=10logd/ϵ2𝑟10𝑑superscriptitalic-ϵ2r=10\log{d}/\epsilon^{2}italic_r = 10 roman_log italic_d / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT matrices Mi1,,Mirsuperscriptsubscript𝑀𝑖1superscriptsubscript𝑀𝑖𝑟M_{i}^{1},\ldots,M_{i}^{r}italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , … , italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT such that Mij=ϵ210logdvivisuperscriptsubscript𝑀𝑖𝑗superscriptitalic-ϵ210𝑑subscript𝑣𝑖superscriptsubscript𝑣𝑖topM_{i}^{j}=\frac{\epsilon^{2}}{10\log{d}}v_{i}v_{i}^{\top}italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = divide start_ARG italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 roman_log italic_d end_ARG italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. This ensures that λmax(X1/2MijX1/2)ϵ2/10logdsubscript𝜆superscript𝑋12superscriptsubscript𝑀𝑖𝑗superscript𝑋12superscriptitalic-ϵ210𝑑\lambda_{\max}(X^{-1/2}M_{i}^{j}X^{-1/2})\leq\epsilon^{2}/10\log{d}italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) ≤ italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 10 roman_log italic_d for all j[r]𝑗delimited-[]𝑟j\in[r]italic_j ∈ [ italic_r ]. Applying Lemma 4 on random matrices {Mi}iB\S,{Mij}iS,j[r]subscriptsubscript𝑀𝑖𝑖\𝐵𝑆subscriptsuperscriptsubscript𝑀𝑖𝑗formulae-sequence𝑖𝑆𝑗delimited-[]𝑟\{M_{i}\}_{i\in B\backslash S},\{M_{i}^{j}\}_{i\in S,j\in[r]}{ italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ italic_B \ italic_S end_POSTSUBSCRIPT , { italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_i ∈ italic_S , italic_j ∈ [ italic_r ] end_POSTSUBSCRIPT gives

Pr[λmin(iB~X1/2viviX1/2)(1ϵ)]d4.Prsubscript𝜆subscript𝑖~𝐵superscript𝑋12subscript𝑣𝑖superscriptsubscript𝑣𝑖topsuperscript𝑋121italic-ϵsuperscript𝑑4\Pr\left[\lambda_{\min}(\sum_{i\in\tilde{B}}X^{-1/2}v_{i}v_{i}^{\top}X^{-1/2})% \leq(1-\epsilon)\right]\leq d^{-4}\,.roman_Pr [ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ over~ start_ARG italic_B end_ARG end_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) ≤ ( 1 - italic_ϵ ) ] ≤ italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT .

If B𝐵Bitalic_B is the basis returned by Algorithm 2, then f(B)f(B~)𝑓𝐵𝑓~𝐵f(B)\geq f(\tilde{B})italic_f ( italic_B ) ≥ italic_f ( over~ start_ARG italic_B end_ARG ). Using the above inequality with (7) implies that f(iBvivi)(1ϵ)OPT𝑓subscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵ𝑂𝑃𝑇f(\sum_{i\in B}v_{i}v_{i}^{\top})\geq(1-\epsilon)\cdot OPTitalic_f ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ≥ ( 1 - italic_ϵ ) ⋅ italic_O italic_P italic_T with probability at least 1d41superscript𝑑41-d^{-4}1 - italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT. \Box

3.2 Pipage Rounding

The purpose of this section is to give an explanation of the pipage rounding technique and motivate the proof of Lemma 4. For a detailed discussion of pipage rounding, see [HO14].

For a set SE𝑆𝐸S\subseteq Eitalic_S ⊆ italic_E, let x*superscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT be the optimal solution of CP(S). If we round x*superscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT independently, i.e., add element i𝑖iitalic_i to the output with probability xi*superscriptsubscript𝑥𝑖x_{i}^{*}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, then the set obtained, say B𝐵Bitalic_B, might not be independent in \mathcal{M}caligraphic_M. But the following concentration inequality would still hold due to independence,

Pr[λmin(iBvivi)<(1ϵ)λmin(i[n]xivivi)]<d4.Prsubscript𝜆subscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵsubscript𝜆subscript𝑖delimited-[]𝑛subscriptsuperscript𝑥𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topsuperscript𝑑4\operatorname{\mathrm{Pr}}\left[\lambda_{\min}\left(\sum_{i\in B}v_{i}v_{i}^{% \top}\right)<(1-\epsilon)\lambda_{\min}\left(\sum_{i\in[n]}x^{\star}_{i}\,v_{i% }v_{i}^{\top}\right)\right]<d^{-4}.roman_Pr [ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) < ( 1 - italic_ϵ ) italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ] < italic_d start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT . (8)

The main idea behind pipage rounding is to iteratively transform a point x𝒫()𝑥𝒫x\in\mathcal{P}(\mathcal{M})italic_x ∈ caligraphic_P ( caligraphic_M ) to a basis of \mathcal{M}caligraphic_M while ensuring that the failure probability from equation (8) does not increase.

For a point x[0,1]n𝑥superscript01𝑛x\in[0,1]^{n}italic_x ∈ [ 0 , 1 ] start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, let D(x)𝐷𝑥D(x)italic_D ( italic_x ) represent the corresponding product distribution over {0,1}nsuperscript01𝑛\{0,1\}^{n}{ 0 , 1 } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT with marginals given by x𝑥xitalic_x, i.e., include element i𝑖iitalic_i in the output with probability xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. For x[0,1]n𝑥superscript01𝑛x\in[0,1]^{n}italic_x ∈ [ 0 , 1 ] start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, define

pϵ(x):=PrBD(x)(λmin(iBvivi)(1ϵ)λmin(i[n]xivivi)).assignsubscript𝑝italic-ϵ𝑥subscriptPrsimilar-to𝐵𝐷𝑥subscript𝜆subscript𝑖𝐵subscript𝑣𝑖superscriptsubscript𝑣𝑖top1italic-ϵsubscript𝜆subscript𝑖delimited-[]𝑛subscript𝑥𝑖subscript𝑣𝑖superscriptsubscript𝑣𝑖topp_{\epsilon}(x):=\Pr_{B\sim D(x)}\left(\lambda_{\min}(\sum_{i\in B}v_{i}v_{i}^% {\top})\leq(1-\epsilon)\cdot\lambda_{\min}\left(\sum_{i\in[n]}x_{i}v_{i}v_{i}^% {\top}\right)\right)\,.italic_p start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ( italic_x ) := roman_Pr start_POSTSUBSCRIPT italic_B ∼ italic_D ( italic_x ) end_POSTSUBSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ italic_B end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ≤ ( 1 - italic_ϵ ) ⋅ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ) .

So pϵ(x)subscript𝑝italic-ϵ𝑥p_{\epsilon}(x)italic_p start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ( italic_x ) is the failure probability of getting a (1ϵ)1italic-ϵ(1-\epsilon)( 1 - italic_ϵ )-approximation when rounding independently at point x𝑥xitalic_x. [HO14] showed that there exists a function gϵ(x)subscript𝑔italic-ϵ𝑥g_{\epsilon}(x)italic_g start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ( italic_x ) s.t. pϵ(x)gϵ(x)dexp(ϵ2μmin2R)subscript𝑝italic-ϵ𝑥subscript𝑔italic-ϵ𝑥𝑑superscriptitalic-ϵ2subscript𝜇𝑚𝑖𝑛2𝑅p_{\epsilon}(x)\leq g_{\epsilon}(x)\leq d\cdot\exp\left(\frac{-\epsilon^{2}\mu% _{min}}{2R}\right)italic_p start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ( italic_x ) ≤ italic_g start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ( italic_x ) ≤ italic_d ⋅ roman_exp ( divide start_ARG - italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_R end_ARG ) and gϵsubscript𝑔italic-ϵg_{\epsilon}italic_g start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT is concave under swaps, i.e., for all a,b[n]𝑎𝑏delimited-[]𝑛a,b\in[n]italic_a , italic_b ∈ [ italic_n ] and x𝒫()𝑥𝒫x\in\mathcal{P}(\mathcal{M})italic_x ∈ caligraphic_P ( caligraphic_M ) the map zgϵ(x+z(eaeb))maps-to𝑧subscript𝑔italic-ϵ𝑥𝑧subscript𝑒𝑎subscript𝑒𝑏z\mapsto g_{\epsilon}(x+z(e_{a}-e_{b}))italic_z ↦ italic_g start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ( italic_x + italic_z ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ) is concave.

So, if x𝑥xitalic_x is not an extreme point of 𝒫()𝒫\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ), then there exist a,b[n]𝑎𝑏delimited-[]𝑛a,b\in[n]italic_a , italic_b ∈ [ italic_n ] and ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0 such that x±ϵ(ea+eb)𝒫()plus-or-minus𝑥italic-ϵsubscript𝑒𝑎subscript𝑒𝑏𝒫x\pm\epsilon(e_{a}+e_{b})\in\mathcal{P}(\mathcal{M})italic_x ± italic_ϵ ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT + italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ∈ caligraphic_P ( caligraphic_M ). Let l=min{z:x+z(eaeb)𝒫()}𝑙:𝑧𝑥𝑧subscript𝑒𝑎subscript𝑒𝑏𝒫l=\min\{z:x+z(e_{a}-e_{b})\in\mathcal{P}(\mathcal{M})\}italic_l = roman_min { italic_z : italic_x + italic_z ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ∈ caligraphic_P ( caligraphic_M ) } and u=max{z:x+z(eaeb)𝒫()u=\max\{z:x+z(e_{a}-e_{b})\in\mathcal{P}(\mathcal{M})italic_u = roman_max { italic_z : italic_x + italic_z ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ∈ caligraphic_P ( caligraphic_M ).

With this, we can define xl=x+l(eaeb)superscript𝑥𝑙𝑥𝑙subscript𝑒𝑎subscript𝑒𝑏x^{l}=x+l(e_{a}-e_{b})italic_x start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT = italic_x + italic_l ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) and xu=x+u(eaeb)superscript𝑥𝑢𝑥𝑢subscript𝑒𝑎subscript𝑒𝑏x^{u}=x+u(e_{a}-e_{b})italic_x start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT = italic_x + italic_u ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ). Since g(x+z(eaeb))𝑔𝑥𝑧subscript𝑒𝑎subscript𝑒𝑏g(x+z(e_{a}-e_{b}))italic_g ( italic_x + italic_z ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ) is concave as a function of z𝑧zitalic_z, we know that either g(xl)g(x)𝑔superscript𝑥𝑙𝑔𝑥g(x^{l})\leq g(x)italic_g ( italic_x start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) ≤ italic_g ( italic_x ) or g(xu)g(x)𝑔superscript𝑥𝑢𝑔𝑥g(x^{u})\leq g(x)italic_g ( italic_x start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ) ≤ italic_g ( italic_x ). Moreover, both xlsuperscript𝑥𝑙x^{l}italic_x start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT and xusuperscript𝑥𝑢x^{u}italic_x start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT are on a lower dimensional face than the initial point x𝑥xitalic_x. Thus, for any initial point x0𝒫()subscript𝑥0𝒫x_{0}\in\mathcal{P}(\mathcal{M})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_P ( caligraphic_M ), a total of m𝑚mitalic_m iterations suffice to find an extreme point with x^^𝑥\hat{x}over^ start_ARG italic_x end_ARG with g(x^)g(x0)𝑔^𝑥𝑔subscript𝑥0g(\hat{x})\leq g(x_{0})italic_g ( over^ start_ARG italic_x end_ARG ) ≤ italic_g ( italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ).

In randomized pipage starting at x𝒫()𝑥𝒫x\in\mathcal{P}(\mathcal{M})italic_x ∈ caligraphic_P ( caligraphic_M ), our next iterate xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT of the rounding procedure will be xlsuperscript𝑥𝑙x^{l}italic_x start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT with probability uul𝑢𝑢𝑙\frac{u}{u-l}divide start_ARG italic_u end_ARG start_ARG italic_u - italic_l end_ARG and xusuperscript𝑥𝑢x^{u}italic_x start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT with probability lul𝑙𝑢𝑙\frac{-l}{u-l}divide start_ARG - italic_l end_ARG start_ARG italic_u - italic_l end_ARG. This ensures that 𝔼(x)=x𝔼superscript𝑥𝑥\operatorname{\mathbb{E}}(x^{\prime})=xblackboard_E ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_x, and the concavity under swaps guarantees that 𝔼[gϵ(x)]gϵ(x)𝔼subscript𝑔italic-ϵsuperscript𝑥subscript𝑔italic-ϵ𝑥\operatorname{\mathbb{E}}[g_{\epsilon}(x^{\prime})]\leq g_{\epsilon}(x)blackboard_E [ italic_g start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ] ≤ italic_g start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ( italic_x ) by Jensen’s in the variable z𝑧zitalic_z. If we start at a point x0𝒫())x_{0}\in\mathcal{P}(\mathcal{M}))italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_P ( caligraphic_M ) ) and iterate this random procedure m𝑚mitalic_m times, we get an extreme point x^^𝑥\hat{x}over^ start_ARG italic_x end_ARG which satisfies 𝔼[x^]=x0𝔼^𝑥subscript𝑥0\operatorname{\mathbb{E}}[\hat{x}]=x_{0}blackboard_E [ over^ start_ARG italic_x end_ARG ] = italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and 𝔼[g(x^)]g(x0)𝔼𝑔^𝑥𝑔subscript𝑥0\operatorname{\mathbb{E}}[g(\hat{x})]\leq g(x_{0})blackboard_E [ italic_g ( over^ start_ARG italic_x end_ARG ) ] ≤ italic_g ( italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ).

This gives the intuition behind the proof of Lemma 6, and leads to the following algorithm.

Algorithm 3 Randomized Pipage Rounding
1:Input: Point x𝒫()𝑥𝒫x\in\mathcal{P}(\mathcal{M})italic_x ∈ caligraphic_P ( caligraphic_M ), where 𝒫()𝒫\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ) is a matroid base polytope
2:while x𝑥xitalic_x is not integral do
3:     a,b𝑎𝑏absenta,b\leftarrowitalic_a , italic_b ← distinct elements of [n]delimited-[]𝑛[n][ italic_n ] s.t. ϵ>0italic-ϵ0\exists\epsilon>0∃ italic_ϵ > 0 with x±ϵ(eaeb)𝒫()plus-or-minus𝑥italic-ϵsubscript𝑒𝑎subscript𝑒𝑏𝒫x\pm\epsilon(e_{a}-e_{b})\in\mathcal{P}(\mathcal{M})italic_x ± italic_ϵ ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ∈ caligraphic_P ( caligraphic_M )
4:     min{y0:xy(eaeb)𝒫()}:𝑦0𝑥𝑦subscript𝑒𝑎subscript𝑒𝑏𝒫\ell\leftarrow\min\{y\geq 0:x-y(e_{a}-e_{b})\in\mathcal{P}(\mathcal{M})\}roman_ℓ ← roman_min { italic_y ≥ 0 : italic_x - italic_y ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ∈ caligraphic_P ( caligraphic_M ) }
5:     hmax{y0:x+y(eaeb)𝒫()}:𝑦0𝑥𝑦subscript𝑒𝑎subscript𝑒𝑏𝒫h\leftarrow\max\{y\geq 0:x+y(e_{a}-e_{b})\in\mathcal{P}(\mathcal{M})\}italic_h ← roman_max { italic_y ≥ 0 : italic_x + italic_y ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ∈ caligraphic_P ( caligraphic_M ) }
6:     x{x(eaeb)w.p. /(+h)x+h(eaeb)w.p. h/(+h)𝑥cases𝑥subscript𝑒𝑎subscript𝑒𝑏w.p. 𝑥subscript𝑒𝑎subscript𝑒𝑏w.p. x\leftarrow\begin{cases}x-\ell(e_{a}-e_{b})&\text{w.p. }\ell/(\ell+h)\\ x+h(e_{a}-e_{b})&\text{w.p. }h/(\ell+h)\end{cases}italic_x ← { start_ROW start_CELL italic_x - roman_ℓ ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) end_CELL start_CELL w.p. roman_ℓ / ( roman_ℓ + italic_h ) end_CELL end_ROW start_ROW start_CELL italic_x + italic_h ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) end_CELL start_CELL w.p. italic_h / ( roman_ℓ + italic_h ) end_CELL end_ROW
7:end while
8:Return basis B𝒫()𝐵𝒫B\in\mathcal{P}(\mathcal{M})italic_B ∈ caligraphic_P ( caligraphic_M ) with indicator vector x𝑥xitalic_x

4 Conclusion and Remarks

The resolution of the Kadison-Singer problem in [MSS15] using the interlacing families of polynomials implies the following existential result about maximizing the minimum eigenvalue under partition matroid constraints.

Theorem 5

[MSS15, Theorem 1.4] For ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0 and vectors {vij}i[k],j[n]dsubscriptsubscript𝑣𝑖𝑗formulae-sequence𝑖delimited-[]𝑘𝑗delimited-[]𝑛superscript𝑑\{v_{ij}\}_{i\in[k],j\in[n]}\in\mathbb{R}^{d}{ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ [ italic_k ] , italic_j ∈ [ italic_n ] end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT with vij2ϵsuperscriptnormsubscript𝑣𝑖𝑗2italic-ϵ\|v_{ij}\|^{2}\leq\epsilon∥ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_ϵ for all i[k],j[n]formulae-sequence𝑖delimited-[]𝑘𝑗delimited-[]𝑛i\in[k],j\in[n]italic_i ∈ [ italic_k ] , italic_j ∈ [ italic_n ], if there exist xij0subscript𝑥𝑖𝑗0x_{ij}\geq 0italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≥ 0 such that

i=1kj=1nxijvijvij=Id and j=1nxij=1 for all i[k],formulae-sequencesuperscriptsubscript𝑖1𝑘superscriptsubscript𝑗1𝑛subscript𝑥𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑣𝑖𝑗topsubscript𝐼𝑑 and superscriptsubscript𝑗1𝑛subscript𝑥𝑖𝑗1 for all 𝑖delimited-[]𝑘\sum_{i=1}^{k}\sum_{j=1}^{n}x_{ij}\cdot v_{ij}v_{ij}^{\top}=I_{d}\quad\text{ % and }\quad\sum_{j=1}^{n}x_{ij}=1\text{ for all }i\in[k],∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT = italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT and ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 1 for all italic_i ∈ [ italic_k ] ,

then there exists a choice function σ:[k][n]normal-:𝜎normal-→delimited-[]𝑘delimited-[]𝑛\sigma:[k]\rightarrow[n]italic_σ : [ italic_k ] → [ italic_n ] such that

(1ϵ)2Idi=1kviσ(i)viσ(i)(1+ϵ)2Id.precedes-or-equalssuperscript1italic-ϵ2subscript𝐼𝑑superscriptsubscript𝑖1𝑘subscript𝑣𝑖𝜎𝑖superscriptsubscript𝑣𝑖𝜎𝑖topprecedes-or-equalssuperscript1italic-ϵ2subscript𝐼𝑑(1-\sqrt{\epsilon})^{2}\cdot I_{d}\preceq\sum_{i=1}^{k}v_{i\sigma(i)}v_{i% \sigma(i)}^{\top}\preceq(1+\sqrt{\epsilon})^{2}\cdot I_{d}.( 1 - square-root start_ARG italic_ϵ end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⋅ italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ⪯ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_σ ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⪯ ( 1 + square-root start_ARG italic_ϵ end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⋅ italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT .

We can state this result equivalently as an “existential” rounding result. When vij2ϵsuperscriptnormsubscript𝑣𝑖𝑗2italic-ϵ\|v_{ij}\|^{2}\leq\epsilon∥ italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_ϵ, Theorem 5 implies that the integrality gap of the natural convex relaxation (CP) for the minimum eigenvalue problem with partition constraints is only 1/(1ϵ)21superscript1italic-ϵ21/(1-\sqrt{\epsilon})^{2}1 / ( 1 - square-root start_ARG italic_ϵ end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. It is an open problem to efficiently round the solution to the convex relaxation with comparable guarantees for any dimension d𝑑ditalic_d.

More generally, the problem of designing an approximation algorithm for the minimum eigenvalue problem under partition or matroid constraints in arbitrary dimensions remains wide open. However, checking whether there is a solution with a non-zero objective can be solved in polynomial time solvable through matroid intersection. Recently, there has been significant progress in the case of maximizing the determinant [Nik15, SEFM15, NS16, AGV18, SX18, MNST20, BLP+{}^{+}start_FLOATSUPERSCRIPT + end_FLOATSUPERSCRIPT22], but it remains open whether those techniques can be utilized for the minimum eigenvalue problem.

References

  • [AB13] Haim Avron and Christos Boutsidis. Faster subset selection for matrices and applications. SIAM Journal on Matrix Analysis and Applications, 34(4):1464–1499, 2013.
  • [AFS12] Arash Asadpour, Uriel Feige, and Amin Saberi. Santa claus meets hypergraph matchings. ACM Trans. Algorithms, 8(3), jul 2012.
  • [AGV18] Nima Anari, Shayan Oveis Gharan, and Cynthia Vinzant. Log-concave polynomials, entropy, and a deterministic approximation algorithm for counting bases of matroids. In 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS), pages 35–46. IEEE, 2018.
  • [AS07] Arash Asadpour and Amin Saberi. An approximation algorithm for max-min fair allocation of indivisible goods. In Proceedings of the Thirty-Ninth Annual ACM Symposium on Theory of Computing, STOC ’07, page 114–121, New York, NY, USA, 2007. Association for Computing Machinery.
  • [AZLSW17] Zeyuan Allen-Zhu, Yuanzhi Li, Aarti Singh, and Yining Wang. Near-optimal discrete optimization for experimental design: A regret minimization approach. Mathematical Programming, 186, 11 2017.
  • [BLP+{}^{+}start_FLOATSUPERSCRIPT + end_FLOATSUPERSCRIPT22] Adam Brown, Aditi Laddha, Madhusudhan Pittu, Mohit Singh, and Prasad Tetali. Determinant maximization via matroid intersection algorithms. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 255–266, Los Alamitos, CA, USA, nov 2022. IEEE Computer Society.
  • [BS06] Nikhil Bansal and Maxim Sviridenko. The santa claus problem. In Proceedings of the Thirty-Eighth Annual ACM Symposium on Theory of Computing, STOC ’06, page 31–40, New York, NY, USA, 2006. Association for Computing Machinery.
  • [BSS09] Joshua D. Batson, Daniel A. Spielman, and Nikhil Srivastava. Twice-ramanujan sparsifiers. In Proceedings of the Forty-First Annual ACM Symposium on Theory of Computing, STOC ’09, page 255–262, New York, NY, USA, 2009. Association for Computing Machinery.
  • [CCK09] Deeparnab Chakrabarty, Julia Chuzhoy, and Sanjeev Khanna. On allocating goods to maximize fairness. In Proceedings of the 2009 50th Annual IEEE Symposium on Foundations of Computer Science, FOCS ’09, page 107–116, USA, 2009. IEEE Computer Society.
  • [cMI09] Ali Çivril and Malik Magdon-Ismail. On selecting a maximum volume sub-matrix of a matrix and related problems. Theoretical Computer Science, 410:4801–4811, 2009.
  • [Cun84] W. H. Cunningham. Testing membership in matroid polyhedra. Journal of Combinatorial Theory, Series B, 36:161–188, 1984.
  • [DRZ20] Sami Davies, Thomas Rothvoss, and Yihao Zhang. A tale of santa claus, hypergraphs and matroids. In Proceedings of the Thirty-First Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’20, page 2748–2757, USA, 2020. Society for Industrial and Applied Mathematics.
  • [Fei08] Uriel Feige. On allocations that maximize fairness. In Proceedings of the Nineteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’08, page 287–293, USA, 2008. Society for Industrial and Applied Mathematics.
  • [HO14] Nicholas JA Harvey and Neil Olver. Pipage rounding, pessimistic estimators and matrix concentration. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms, pages 926–945. SIAM, 2014.
  • [HS23] Penny Haxell and Tibor Szabó. Improved integrality gap in max-min allocation: or topology at the north pole. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2875–2897, 2023.
  • [JMS22] Ben Jourdan, Peter Macgregor, and He Sun. Is the algorithmic kadison-singer problem hard?, 2022. arxiv.longhoe.net:2205.02161.
  • [LZ21] Lap Chi Lau and Hong Zhou. A local search framework for experimental design. In Proceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’21, page 1039–1058, USA, 2021. Society for Industrial and Applied Mathematics.
  • [MNST20] Vivek Madan, Aleksandar Nikolov, Mohit Singh, and Uthaipon Tantipongpipat. Maximizing determinants under matroid constraints. In 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS), pages 565–576, 2020.
  • [MSS15] Adam W. Marcus, Daniel A. Spielman, and Nikhil Srivastava. Interlacing families ii: Mixed characteristic polynomials and the kadison–singer problem. Annals of Mathematics, 182:327–350, 2015.
  • [MSTX19] Vivek Madan, Mohit Singh, Uthaipon Tantipongpipat, and Weijun Xie. Combinatorial algorithms for optimal design. In Proceedings of the Thirty-Second Conference on Learning Theory, 06 2019.
  • [Nik15] Aleksandar Nikolov. Randomized rounding for the largest simplex problem. In Proceedings of the Forty-Seventh Annual ACM Symposium on Theory of Computing, STOC ’15, page 861–870, New York, NY, USA, 2015. Association for Computing Machinery.
  • [NS16] Aleksandar Nikolov and Mohit Singh. Maximizing determinants under partition constraints. In Proceedings of the Forty-Eighth Annual ACM Symposium on Theory of Computing, STOC ’16, page 192–201, New York, NY, USA, 2016. Association for Computing Machinery.
  • [NST19] Aleksandar Nikolov, Mohit Singh, and Uthaipon Tao Tantipongpipat. Proportional volume sampling and approximation algorithms for a-optimal design. Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 1369–1386, 2019.
  • [Puk06] Friedrich Pukelsheim. Optimal design of experiments. SIAM, 2006.
  • [SEFM15] Marco Di Summa, Friedrich Eisenbrand, Yuri Faenza, and Carsten Moldenhauer. On largest volume simplices and sub-determinants. In Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’15, page 315–323, USA, 2015. Society for Industrial and Applied Mathematics.
  • [SX18] Mohit Singh and Weijun Xie. Approximate positive correlated distributions and approximation algorithms for D-optimal design. In Proceedings of SODA, 2018.
  • [Tro15] Joel A. Tropp. An introduction to matrix concentration inequalities. Foundations and Trends® in Machine Learning, 8(1-2):1–230, 2015.
  • [Wea04] Nik Weaver. The kadison–singer problem in discrepancy theory. Discrete mathematics, 278(1-3):227–239, 2004.

Appendix A Omitted proofs

Proof  (of Corollary 2) This is a simple calculation, using the fact the the semidefinite order is preserved under conjugation.

Pr(i=1kMi(1ϵ)μminA)Prnot-succeeds-nor-equalssuperscriptsubscript𝑖1𝑘subscript𝑀𝑖1italic-ϵsubscript𝜇𝐴\displaystyle\operatorname{\mathrm{Pr}}\left(\sum_{i=1}^{k}M_{i}\nsucceq(1-% \epsilon)\mu_{\min}\cdot A\right)roman_Pr ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋡ ( 1 - italic_ϵ ) italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ⋅ italic_A ) =Pr(i=1kA1/2MiA1/2(1ϵ)μminI)absentPrnot-succeeds-nor-equalssuperscriptsubscript𝑖1𝑘superscript𝐴12subscript𝑀𝑖superscript𝐴121italic-ϵsubscript𝜇𝐼\displaystyle=\operatorname{\mathrm{Pr}}\left(\sum_{i=1}^{k}A^{-1/2}M_{i}A^{-1% /2}\nsucceq(1-\epsilon)\mu_{\min}\cdot I\right)= roman_Pr ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ⋡ ( 1 - italic_ϵ ) italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ⋅ italic_I )
=Pr(λmin(i=1kA1/2MiA1/2)<(1ϵ)μmin)absentPrsubscript𝜆superscriptsubscript𝑖1𝑘superscript𝐴12subscript𝑀𝑖superscript𝐴121italic-ϵsubscript𝜇\displaystyle=\operatorname{\mathrm{Pr}}\left(\lambda_{\min}\left(\sum_{i=1}^{% k}A^{-1/2}M_{i}A^{-1/2}\right)<(1-\epsilon)\mu_{\min}\right)= roman_Pr ( italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ) < ( 1 - italic_ϵ ) italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT )
dexp(ϵ2μmin2R).absent𝑑superscriptitalic-ϵ2subscript𝜇𝑚𝑖𝑛2𝑅\displaystyle\leq d\cdot\exp\left(\frac{-\epsilon^{2}\mu_{min}}{2R}\right)\,.≤ italic_d ⋅ roman_exp ( divide start_ARG - italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_R end_ARG ) .

\Box

A.1 Integrality Gap Example

Consider the vectors v1=e1,v2=e1,v3=e2,v4=e3formulae-sequencesubscript𝑣1subscript𝑒1formulae-sequencesubscript𝑣2subscript𝑒1formulae-sequencesubscript𝑣3subscript𝑒2subscript𝑣4subscript𝑒3v_{1}=e_{1},v_{2}=e_{1},v_{3}=e_{2},v_{4}=e_{3}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = italic_e start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT in 3superscript3\mathbb{R}^{3}blackboard_R start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT and a partition matroid =([4],)delimited-[]4\mathcal{M}=([4],\mathcal{I})caligraphic_M = ( [ 4 ] , caligraphic_I ) defined by the bases {1,2,3},{1,2,4}123124\{1,2,3\},\{1,2,4\}{ 1 , 2 , 3 } , { 1 , 2 , 4 }. The optimal value of maximizing the minimum eigenvalue for this instance is 00 as we are forced to pick v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in any basis and they are linearly dependent.

The convex relaxation of maximizing the minimum eigenvalue for this instance is given by

maxλmin(X)X=x1v1v1+x2v2v2+x3v3v3+x4v4v4x1=1,i[k]x2=1,i[k]x3+x4=1,i[k]x0subscript𝜆𝑋missing-subexpression𝑋subscript𝑥1subscript𝑣1superscriptsubscript𝑣1topsubscript𝑥2subscript𝑣2superscriptsubscript𝑣2topsubscript𝑥3subscript𝑣3superscriptsubscript𝑣3topsubscript𝑥4subscript𝑣4superscriptsubscript𝑣4topmissing-subexpressionformulae-sequencesubscript𝑥11for-all𝑖delimited-[]𝑘missing-subexpressionformulae-sequencesubscript𝑥21for-all𝑖delimited-[]𝑘missing-subexpressionformulae-sequencesubscript𝑥3subscript𝑥41for-all𝑖delimited-[]𝑘missing-subexpression𝑥0\begin{array}[]{cl}\max&\lambda_{\min}\left(X\right)\\ &X=x_{1}\cdot v_{1}v_{1}^{\top}+x_{2}\cdot v_{2}v_{2}^{\top}+x_{3}\cdot v_{3}v% _{3}^{\top}+x_{4}\cdot v_{4}v_{4}^{\top}\\ &x_{1}=1,\quad\forall i\in[k]\\ &x_{2}=1,\quad\forall i\in[k]\\ &x_{3}+x_{4}=1,\quad\forall i\in[k]\\ &x\geq 0\end{array}start_ARRAY start_ROW start_CELL roman_max end_CELL start_CELL italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_X ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_X = italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ⋅ italic_v start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 , ∀ italic_i ∈ [ italic_k ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1 , ∀ italic_i ∈ [ italic_k ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT + italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = 1 , ∀ italic_i ∈ [ italic_k ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_x ≥ 0 end_CELL end_ROW end_ARRAY (CP)

The optimum of (CP) is attained when x1=x2=1subscript𝑥1subscript𝑥21x_{1}=x_{2}=1italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1 and x3=x4subscript𝑥3subscript𝑥4x_{3}=x_{4}italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT which gives

X=2e1e1+12e2e2+12e3e3.𝑋2subscript𝑒1superscriptsubscript𝑒1top12subscript𝑒2superscriptsubscript𝑒2top12subscript𝑒3superscriptsubscript𝑒3topX=2e_{1}e_{1}^{\top}+\frac{1}{2}e_{2}e_{2}^{\top}+\frac{1}{2}e_{3}e_{3}^{\top}\,.italic_X = 2 italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_e start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT .

So the optimal value of (CP) is 1/2121/21 / 2, whereas the true optimal is 00.

Appendix B Matroids and Pipage Rounding

In this section, we provide the necessary background on matroids, as well as the lower tail versions of lemmas from [HO14], which let us prove Lemma 4.

B.1 Matroids

A pair =(E,)𝐸\mathcal{M}=(E,\mathcal{I})caligraphic_M = ( italic_E , caligraphic_I ) is a matroid if E𝐸Eitalic_E is a finite set and \mathcal{I}caligraphic_I is a collection of subsets of E𝐸Eitalic_E satisfying

  1. (1)

    If I𝐼I\in\mathcal{I}italic_I ∈ caligraphic_I and JI𝐽𝐼J\subseteq Iitalic_J ⊆ italic_I then J𝐽J\in\mathcal{I}italic_J ∈ caligraphic_I, and

  2. (2)

    If I,J𝐼𝐽I,J\in\mathcal{I}italic_I , italic_J ∈ caligraphic_I and |I|<|J|𝐼𝐽|I|<|J|| italic_I | < | italic_J | then there is eJ\I𝑒\𝐽𝐼e\in J\backslash Iitalic_e ∈ italic_J \ italic_I such that I{e}𝐼𝑒I\cup\{e\}\in\mathcal{I}italic_I ∪ { italic_e } ∈ caligraphic_I.

The sets in \mathcal{I}caligraphic_I are referred to as the independent sets of the matroid \mathcal{M}caligraphic_M. The maximal sets in \mathcal{I}caligraphic_I are called bases, and it is a consequence of the matroid axioms that all bases have the same cardinality. For a subset UE𝑈𝐸U\subseteq Eitalic_U ⊆ italic_E, we denote my r(U)𝑟𝑈r(U)italic_r ( italic_U ) the maximum size of an independent set in U𝑈Uitalic_U and call this the rank of U𝑈Uitalic_U. In this notation, we can say that every basis of \mathcal{M}caligraphic_M has cardinality exactly r(E)𝑟𝐸r(E)italic_r ( italic_E ). Given a matroid \mathcal{M}caligraphic_M, the matroid base polytope is the convex hull of indicator vectors of the bases of \mathcal{M}caligraphic_M, and is denoted 𝒫()𝒫\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ). The base polytope has the following linear description

𝒫()𝒫\displaystyle\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ) =conv{χ(B):B a basis of }absentconvconditional-set𝜒𝐵𝐵 a basis of \displaystyle=\text{conv}\left\{\chi(B):B\text{ a basis of }\mathcal{M}\right\}= conv { italic_χ ( italic_B ) : italic_B a basis of caligraphic_M }
={xE:eExe=r(E),eUxer(U)UE,x0}.absentconditional-set𝑥superscript𝐸formulae-sequenceformulae-sequencesubscript𝑒𝐸subscript𝑥𝑒𝑟𝐸subscript𝑒𝑈subscript𝑥𝑒𝑟𝑈for-all𝑈𝐸𝑥0\displaystyle=\left\{x\in\mathbb{R}^{E}:\sum_{e\in E}x_{e}=r(E),\sum_{e\in U}x% _{e}\leq r(U)\,\,\forall U\subseteq E,x\geq 0\right\}.= { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_E end_POSTSUPERSCRIPT : ∑ start_POSTSUBSCRIPT italic_e ∈ italic_E end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = italic_r ( italic_E ) , ∑ start_POSTSUBSCRIPT italic_e ∈ italic_U end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≤ italic_r ( italic_U ) ∀ italic_U ⊆ italic_E , italic_x ≥ 0 } .

Cunningham  [Cun84] showed that given x+E𝑥subscriptsuperscript𝐸x\in\mathbb{R}^{E}_{+}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_E end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT, it is possible to find a violated constraint for 𝒫()𝒫\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ) in strongly polynomial time using only an independence oracle for the matroid \mathcal{M}caligraphic_M.

B.2 Pipage Rounding

The following theorem follows from the discussion in Section 3.2.

Theorem 6

[HO14] There is a randomized polynomial time algorithm that, given x0𝒫()subscript𝑥0𝒫x_{0}\in\mathcal{P}(\mathcal{M})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_P ( caligraphic_M ), outputs an extreme point x^normal-^𝑥\hat{x}over^ start_ARG italic_x end_ARG of 𝒫()𝒫\mathcal{P}(\mathcal{M})caligraphic_P ( caligraphic_M ) with 𝔼[x^]=x0𝔼normal-^𝑥subscript𝑥0\operatorname{\mathbb{E}}[\hat{x}]=x_{0}blackboard_E [ over^ start_ARG italic_x end_ARG ] = italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and such that for any g𝑔gitalic_g concave under swaps 𝔼[g(x^)]g(x)𝔼𝑔normal-^𝑥𝑔𝑥\operatorname{\mathbb{E}}[g(\hat{x})]\leq g(x)blackboard_E [ italic_g ( over^ start_ARG italic_x end_ARG ) ] ≤ italic_g ( italic_x ).

We will mainly make use of this theorem through the following claim. The conditions of the claim come from pessimistic estimators, but not all of them are strictly necessary. For a point x[0,1]n𝑥superscript01𝑛x\in[0,1]^{n}italic_x ∈ [ 0 , 1 ] start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, let D(x)𝐷𝑥D(x)italic_D ( italic_x ) represent the corresponding product distribution over {0,1}nsuperscript01𝑛\{0,1\}^{n}{ 0 , 1 } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT with marginals given by x𝑥xitalic_x.

Lemma 6

[HO14] Let {0,1}nsuperscript01𝑛\mathcal{E}\subseteq\{0,1\}^{n}caligraphic_E ⊆ { 0 , 1 } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and g:𝒫()normal-:𝑔normal-→𝒫g:\mathcal{P}(\mathcal{M})\rightarrow\mathbb{R}italic_g : caligraphic_P ( caligraphic_M ) → blackboard_R satisfy

PrxD(x)[x]g(x), andsubscriptPrsimilar-to𝑥𝐷𝑥𝑥𝑔𝑥 and\displaystyle\operatorname{\mathrm{Pr}}_{x\sim D(x)}[x\in\mathcal{E}]\leq g(x)% ,\text{ and }roman_Pr start_POSTSUBSCRIPT italic_x ∼ italic_D ( italic_x ) end_POSTSUBSCRIPT [ italic_x ∈ caligraphic_E ] ≤ italic_g ( italic_x ) , and
min{g(xxiei),g(x+(1xi)ei)}g(x)𝑔𝑥subscript𝑥𝑖subscript𝑒𝑖𝑔𝑥1subscript𝑥𝑖subscript𝑒𝑖𝑔𝑥\displaystyle\min\{g(x-x_{i}e_{i}),g(x+(1-x_{i})e_{i})\}\leq g(x)roman_min { italic_g ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_g ( italic_x + ( 1 - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) } ≤ italic_g ( italic_x )

for all x[0,1]n𝑥superscript01𝑛x\in[0,1]^{n}italic_x ∈ [ 0 , 1 ] start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, and g𝑔gitalic_g be concave under swaps. If pipage rounding is started at an initial point x0Psubscript𝑥0𝑃x_{0}\in Pitalic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_P and x^normal-^𝑥\hat{x}over^ start_ARG italic_x end_ARG is the random extreme point, then Pr[x^]g(x0)normal-Prnormal-^𝑥𝑔subscript𝑥0\operatorname{\mathrm{Pr}}[\hat{x}\in\mathcal{E}]\leq g(x_{0})roman_Pr [ over^ start_ARG italic_x end_ARG ∈ caligraphic_E ] ≤ italic_g ( italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ).

Essentially, this lemma says that if we have a pessimistic estimator which is concave under swaps, then pipage rounding has the same type of concentration behavior as independent rounding, but will actually return a vertex of the matroid polytope.

For our particular application, we will be choosing the function g𝑔gitalic_g to be an estimator for matrix concentration due to Tropp [Tro15].

Theorem 7

[Tro15, Theorem 5.1.1] Let M1,,Mnsubscript𝑀1normal-…subscript𝑀𝑛M_{1},\ldots,M_{n}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT be self-adjoint matrices with λmax(Mi)Rsubscript𝜆subscript𝑀𝑖𝑅\lambda_{\max}(M_{i})\leq Ritalic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ italic_R for all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] and let μmin=λmin(𝔼x𝒟(x)[i[n]xiMi]\mu_{\min}=\lambda_{\min}(\operatorname{\mathbb{E}}_{x\sim\mathcal{D}(x)}[\sum% _{i\in[n]}x_{i}M_{i}]italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( blackboard_E start_POSTSUBSCRIPT italic_x ∼ caligraphic_D ( italic_x ) end_POSTSUBSCRIPT [ ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ]. For t𝑡t\in\mathbb{R}italic_t ∈ blackboard_R, we have the bound

PrxD(x)[λmin(i[n]xiMi)t]infθ<0gt,θ(x)subscriptPrsimilar-to𝑥𝐷𝑥subscript𝜆subscript𝑖delimited-[]𝑛subscript𝑥𝑖subscript𝑀𝑖𝑡subscriptinfimum𝜃0subscript𝑔𝑡𝜃𝑥\operatorname{\mathrm{Pr}}_{x\sim D(x)}\left[\lambda_{\min}\left(\sum_{i\in[n]% }x_{i}M_{i}\right)\leq t\right]\leq\inf_{\theta<0}g_{t,\theta}(x)roman_Pr start_POSTSUBSCRIPT italic_x ∼ italic_D ( italic_x ) end_POSTSUBSCRIPT [ italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ italic_t ] ≤ roman_inf start_POSTSUBSCRIPT italic_θ < 0 end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t , italic_θ end_POSTSUBSCRIPT ( italic_x )

where gt,θ(x)=eθttrexp(i[n]log𝔼eθxiMi)subscript𝑔𝑡𝜃𝑥normal-⋅superscript𝑒𝜃𝑡normal-trsubscript𝑖delimited-[]𝑛𝔼superscript𝑒normal-⋅𝜃subscript𝑥𝑖subscript𝑀𝑖g_{t,\theta}(x)=e^{-\theta t}\cdot\mathrm{tr}\exp\left(\sum_{i\in[n]}\log% \operatorname{\mathbb{E}}e^{\theta x_{i}\cdot M_{i}}\right)italic_g start_POSTSUBSCRIPT italic_t , italic_θ end_POSTSUBSCRIPT ( italic_x ) = italic_e start_POSTSUPERSCRIPT - italic_θ italic_t end_POSTSUPERSCRIPT ⋅ roman_tr roman_exp ( ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT roman_log blackboard_E italic_e start_POSTSUPERSCRIPT italic_θ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ). Furthermore, for t=(1ϵ)μmin𝑡1italic-ϵsubscript𝜇t=(1-\epsilon)\mu_{\min}italic_t = ( 1 - italic_ϵ ) italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT,

gt,θ(x)d(eϵ(1ϵ)1ϵ)μmin/R.subscript𝑔𝑡𝜃𝑥𝑑superscriptsuperscript𝑒italic-ϵsuperscript1italic-ϵ1italic-ϵsubscript𝜇𝑅g_{t,\theta}(x)\leq d\cdot\left(\frac{e^{-\epsilon}}{(1-\epsilon)^{1-\epsilon}% }\right)^{\mu_{\min}/R}.italic_g start_POSTSUBSCRIPT italic_t , italic_θ end_POSTSUBSCRIPT ( italic_x ) ≤ italic_d ⋅ ( divide start_ARG italic_e start_POSTSUPERSCRIPT - italic_ϵ end_POSTSUPERSCRIPT end_ARG start_ARG ( 1 - italic_ϵ ) start_POSTSUPERSCRIPT 1 - italic_ϵ end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT / italic_R end_POSTSUPERSCRIPT .

This is the lower-tail version of the same concentration inequality which was used in [HO14]. In that paper, they provide an upper-tail version of Lemma 4 using a new generalization of Lieb’s concavity theorem, stated below.

Lemma 7

[HO14] Let L𝕊d𝐿subscript𝕊𝑑L\in\mathbb{S}_{d}italic_L ∈ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT, C1,C2𝕊d++subscript𝐶1subscript𝐶2superscriptsubscript𝕊𝑑absentC_{1},C_{2}\in\mathbb{S}_{d}^{++}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + + end_POSTSUPERSCRIPT, and K1,K2𝕊d+subscript𝐾1subscript𝐾2superscriptsubscript𝕊𝑑K_{1},K_{2}\in\mathbb{S}_{d}^{+}italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT. Then the univariate function

ztrexp(L+log(C1+zK1)+log(C2zK2))𝑧tr𝐿subscript𝐶1𝑧subscript𝐾1subscript𝐶2𝑧subscript𝐾2z\rightarrow\mathrm{tr}\exp\left(L+\log(C_{1}+zK_{1})+\log(C_{2}-zK_{2})\right)italic_z → roman_tr roman_exp ( italic_L + roman_log ( italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_z italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + roman_log ( italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_z italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) )

is concave in a neighborhood of 00.

As a consequence, we get the following lemma.

Lemma 8

For θ<0𝜃0\theta<0italic_θ < 0, all x[0,1]m𝑥superscript01𝑚x\in[0,1]^{m}italic_x ∈ [ 0 , 1 ] start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, the function

gt,θ(x)=eθttrexp(i[m]log𝔼x𝒟(x)eθxiMi)subscript𝑔𝑡𝜃𝑥superscript𝑒𝜃𝑡trsubscript𝑖delimited-[]𝑚subscript𝔼similar-to𝑥𝒟𝑥superscript𝑒𝜃subscript𝑥𝑖subscript𝑀𝑖g_{t,\theta}(x)=e^{-\theta t}\cdot\mathrm{tr}\exp\left(\sum_{i\in[m]}\log% \operatorname{\mathbb{E}}_{x\sim\mathcal{D}(x)}e^{\theta x_{i}\cdot M_{i}}\right)italic_g start_POSTSUBSCRIPT italic_t , italic_θ end_POSTSUBSCRIPT ( italic_x ) = italic_e start_POSTSUPERSCRIPT - italic_θ italic_t end_POSTSUPERSCRIPT ⋅ roman_tr roman_exp ( ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_m ] end_POSTSUBSCRIPT roman_log blackboard_E start_POSTSUBSCRIPT italic_x ∼ caligraphic_D ( italic_x ) end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_θ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT )

is concave under swaps.

Proof  (of Lemma 8) Let Ci:=𝔼x𝒟(x)[eθxiMi]=xieθMi+(1xi)I0assignsubscript𝐶𝑖subscript𝔼similar-to𝑥𝒟𝑥superscript𝑒𝜃subscript𝑥𝑖subscript𝑀𝑖subscript𝑥𝑖superscript𝑒𝜃subscript𝑀𝑖1subscript𝑥𝑖𝐼succeeds0C_{i}:=\operatorname{\mathbb{E}}_{x\sim\mathcal{D}(x)}[e^{\theta x_{i}\cdot M_% {i}}]=x_{i}\cdot e^{\theta M_{i}}+(1-x_{i})\cdot I\succ 0italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT := blackboard_E start_POSTSUBSCRIPT italic_x ∼ caligraphic_D ( italic_x ) end_POSTSUBSCRIPT [ italic_e start_POSTSUPERSCRIPT italic_θ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_e start_POSTSUPERSCRIPT italic_θ italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + ( 1 - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⋅ italic_I ≻ 0, and for any i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ]

𝔼x𝒟(x+zei)[eθxiMi]=(xi+z)eθMi+(1xiz)I=Ciz(IeθMi).subscript𝔼similar-to𝑥𝒟𝑥𝑧subscript𝑒𝑖superscript𝑒𝜃subscript𝑥𝑖subscript𝑀𝑖subscript𝑥𝑖𝑧superscript𝑒𝜃subscript𝑀𝑖1subscript𝑥𝑖𝑧𝐼subscript𝐶𝑖𝑧𝐼superscript𝑒𝜃subscript𝑀𝑖\displaystyle\operatorname{\mathbb{E}}_{x\sim\mathcal{D}(x+ze_{i})}[e^{\theta x% _{i}\cdot M_{i}}]=(x_{i}+z)e^{\theta M_{i}}+(1-x_{i}-z)\cdot I=C_{i}-z\cdot(I-% e^{\theta M_{i}}).blackboard_E start_POSTSUBSCRIPT italic_x ∼ caligraphic_D ( italic_x + italic_z italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT [ italic_e start_POSTSUPERSCRIPT italic_θ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] = ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_z ) italic_e start_POSTSUPERSCRIPT italic_θ italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + ( 1 - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_z ) ⋅ italic_I = italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_z ⋅ ( italic_I - italic_e start_POSTSUPERSCRIPT italic_θ italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) .

Then a,b[n]for-all𝑎𝑏delimited-[]𝑛\forall a,b\in[n]∀ italic_a , italic_b ∈ [ italic_n ],

gt,θ(x+z(eaeb))subscript𝑔𝑡𝜃𝑥𝑧subscript𝑒𝑎subscript𝑒𝑏\displaystyle g_{t,\theta}(x+z(e_{a}-e_{b}))italic_g start_POSTSUBSCRIPT italic_t , italic_θ end_POSTSUBSCRIPT ( italic_x + italic_z ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) )
=eθttrexp(i[n]\{a,b}logCi+log(Cb+z(IeθMb))+log(Caz(IeθMa)))absentsuperscript𝑒𝜃𝑡trsubscript𝑖\delimited-[]𝑛𝑎𝑏subscript𝐶𝑖subscript𝐶𝑏𝑧𝐼superscript𝑒𝜃subscript𝑀𝑏subscript𝐶𝑎𝑧𝐼superscript𝑒𝜃subscript𝑀𝑎\displaystyle=e^{-\theta t}\cdot\mathrm{tr}\exp\left(\sum_{i\in[n]\backslash\{% a,b\}}\log C_{i}+\log\left(C_{b}+z\cdot(I-e^{\theta M_{b}})\right)+\log\left(C% _{a}-z\cdot(I-e^{\theta M_{a}})\right)\right)= italic_e start_POSTSUPERSCRIPT - italic_θ italic_t end_POSTSUPERSCRIPT ⋅ roman_tr roman_exp ( ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] \ { italic_a , italic_b } end_POSTSUBSCRIPT roman_log italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + roman_log ( italic_C start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT + italic_z ⋅ ( italic_I - italic_e start_POSTSUPERSCRIPT italic_θ italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ) + roman_log ( italic_C start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_z ⋅ ( italic_I - italic_e start_POSTSUPERSCRIPT italic_θ italic_M start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ) )
=eθttrexp(L+log(Cb+zKb)+log(CazKa)),absentsuperscript𝑒𝜃𝑡tr𝐿subscript𝐶𝑏𝑧subscript𝐾𝑏subscript𝐶𝑎𝑧subscript𝐾𝑎\displaystyle=e^{-\theta t}\cdot\mathrm{tr}\exp\left(L+\log\left(C_{b}+z\cdot K% _{b}\right)+\log\left(C_{a}-z\cdot K_{a}\right)\right)\,,= italic_e start_POSTSUPERSCRIPT - italic_θ italic_t end_POSTSUPERSCRIPT ⋅ roman_tr roman_exp ( italic_L + roman_log ( italic_C start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT + italic_z ⋅ italic_K start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) + roman_log ( italic_C start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_z ⋅ italic_K start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ) ,

where Ka=(IeθMa)subscript𝐾𝑎𝐼superscript𝑒𝜃subscript𝑀𝑎K_{a}=(I-e^{\theta M_{a}})italic_K start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ( italic_I - italic_e start_POSTSUPERSCRIPT italic_θ italic_M start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ), Kb=(IeθMb)subscript𝐾𝑏𝐼superscript𝑒𝜃subscript𝑀𝑏K_{b}=(I-e^{\theta M_{b}})italic_K start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = ( italic_I - italic_e start_POSTSUPERSCRIPT italic_θ italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ), and L=i[n]\{a,b}logCi𝕊d𝐿subscript𝑖\delimited-[]𝑛𝑎𝑏subscript𝐶𝑖subscript𝕊𝑑L=\sum_{i\in[n]\backslash\{a,b\}}\log C_{i}\in\mathbb{S}_{d}italic_L = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] \ { italic_a , italic_b } end_POSTSUBSCRIPT roman_log italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT. If θ0𝜃0\theta\leq 0italic_θ ≤ 0, then Ka0succeeds-or-equalssubscript𝐾𝑎0K_{a}\succeq 0italic_K start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ⪰ 0 and Kb0succeeds-or-equalssubscript𝐾𝑏0K_{b}\succeq 0italic_K start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ⪰ 0. Using Lemma 7, zgt,θ(x+z(eaeb))𝑧subscript𝑔𝑡𝜃𝑥𝑧subscript𝑒𝑎subscript𝑒𝑏z\rightarrow g_{t,\theta}(x+z(e_{a}-e_{b}))italic_z → italic_g start_POSTSUBSCRIPT italic_t , italic_θ end_POSTSUBSCRIPT ( italic_x + italic_z ( italic_e start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) ) is concave in z𝑧zitalic_z, and the result follows. \Box

Combining Lemma 8 and Theorem 7 with Lemma 6, we obtain Lemma 4.