-
Online Min-Max Paging
Authors:
Ashish Chiplunkar,
Monika Henzinger,
Sagar Sudhir Kale,
Maximilian Vötsch
Abstract:
Motivated by fairness requirements in communication networks, we introduce a natural variant of the online paging problem, called \textit{min-max} paging, where the objective is to minimize the maximum number of faults on any page. While the classical paging problem, whose objective is to minimize the total number of faults, admits $k$-competitive deterministic and $O(\log k)$-competitive randomized algorithms, we show that min-max paging does not admit a $c(k)$-competitive algorithm for any function $c$. Specifically, we prove that the randomized competitive ratio of min-max paging is $Ω(\log(n))$ and its deterministic competitive ratio is $Ω(k\log(n)/\log(k))$, where $n$ is the total number of pages ever requested.
We design a fractional algorithm for paging with a more general objective -- minimize the value of an $n$-variate differentiable convex function applied to the vector of the number of faults on each page. This gives an $O(\log(n)\log(k))$-competitive fractional algorithm for min-max paging. We show how to round such a fractional algorithm with at most a $k$ factor loss in the competitive ratio, resulting in a deterministic $O(k\log(n)\log(k))$-competitive algorithm for min-max paging. This matches our lower bound modulo a $\mathrm{poly}(\log(k))$ factor. We also give a randomized rounding algorithm that results in an $O(\log^2 n \log k)$-competitive algorithm.
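To make the difference between the two objectives concrete, the following is a minimal sketch that counts faults per page under a simple eviction policy. LRU is used here purely as an illustrative assumption; it is not the algorithm of the paper, and the function name `lru_faults` and the request sequence are hypothetical.

```python
from collections import Counter, OrderedDict


def lru_faults(requests, k):
    """Simulate LRU with a cache of k pages; return per-page fault counts."""
    cache = OrderedDict()  # keys ordered from least to most recently used
    faults = Counter()
    for page in requests:
        if page in cache:
            cache.move_to_end(page)  # hit: mark as most recently used
            continue
        faults[page] += 1  # miss: this page incurs a fault
        if len(cache) >= k:
            cache.popitem(last=False)  # evict the least recently used page
        cache[page] = True
    return faults


requests = [1, 2, 3, 1, 2, 3]  # hypothetical request sequence
faults = lru_faults(requests, k=2)
total_faults = sum(faults.values())  # classical paging objective
max_faults = max(faults.values())    # min-max paging objective
```

Classical paging would score this run by `total_faults`, whereas min-max paging scores it by `max_faults`, the largest number of faults suffered by any single page.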
Submitted 6 December, 2022;
originally announced December 2022.
-
Prophet Inequality: Order selection beats random order
Authors:
Archit Bubna,
Ashish Chiplunkar
Abstract:
In the prophet inequality problem, a gambler faces a sequence of items arriving online with values drawn independently from known distributions. On seeing an item, the gambler must choose whether to accept its value as her reward and quit the game, or reject it and continue. The gambler's aim is to maximize her expected reward relative to the expected maximum of the values of all items. Since the seventies, a tight bound of 1/2 has been known for this competitive ratio in the setting where the items arrive in an adversarial order (Krengel and Sucheston, 1977, 1978). However, the optimum ratio still remains unknown in the order selection setting, where the gambler selects the arrival order, as well as in prophet secretary, where the items arrive in a random order. Moreover, it is not even known whether a separation exists between the two settings.
In this paper, we show that the power of order selection allows the gambler to guarantee a strictly better competitive ratio than if the items arrive randomly. For the order selection setting, we identify an instance for which Peng and Tang's (FOCS'22) state-of-the-art algorithm performs no better than their claimed competitive ratio of (approximately) 0.7251, thus illustrating the need for an improved approach. We therefore extend their design and provide a more general algorithm design framework, using which we show that their ratio can be beaten, by designing a 0.7258-competitive algorithm. For the random order setting, we improve upon Correa, Saona and Ziliotto's (SODA'19) 0.732-hardness result to show a hardness of 0.7254 for general algorithms - even in the setting where the gambler knows the arrival order beforehand, thus establishing a separation between the order selection and random order settings.
Submitted 22 May, 2023; v1 submitted 8 November, 2022;
originally announced November 2022.
-
Factorial Lower Bounds for (Almost) Random Order Streams
Authors:
Ashish Chiplunkar,
John Kallaugher,
Michael Kapralov,
Eric Price
Abstract:
In this paper we introduce and study the \textsc{StreamingCycles} problem, a random order streaming version of the Boolean Hidden Hypermatching problem that has been instrumental in streaming lower bounds over the past decade. In this problem the edges of a graph $G$, comprising $n/\ell$ disjoint length-$\ell$ cycles on $n$ vertices, are partitioned randomly among $n$ players. Every edge is annotated with an independent uniformly random bit, and the players' task is to output the parity of some cycle in $G$ after one round of sequential communication.
Our main result is an $\ell^{Ω(\ell)}$ lower bound on the communication complexity of \textsc{StreamingCycles}, which is tight up to constant factors in $\ell$. Applications of our lower bound for \textsc{StreamingCycles} include an essentially tight lower bound for component collection in (almost) random order graph streams, making progress towards a conjecture of Peng and Sohler [SODA'18] and the first exponential space lower bounds for random walk generation.
Submitted 19 September, 2022; v1 submitted 19 October, 2021;
originally announced October 2021.
-
The Randomized Competitive Ratio of Weighted $k$-server is at least Exponential
Authors:
Nikhil Ayyadevara,
Ashish Chiplunkar
Abstract:
The weighted $k$-server problem is a natural generalization of the $k$-server problem in which the cost incurred in moving a server is the distance traveled times the weight of the server. Even after almost three decades since the seminal work of Fiat and Ricklin (1994), the competitive ratio of this problem remains poorly understood, even on the simplest class of metric spaces -- the uniform metric spaces. In particular, in the case of randomized algorithms against the oblivious adversary, neither a better upper bound than the doubly exponential deterministic upper bound, nor a better lower bound than the logarithmic lower bound of unweighted $k$-server, is known. In this article, we make significant progress towards understanding the randomized competitive ratio of weighted $k$-server on uniform metrics. We cut down the triply exponential gap between the upper and lower bound to a singly exponential gap by proving that the competitive ratio is at least exponential in $k$, substantially improving on the previously known lower bound of about $\ln k$.
Submitted 14 July, 2021; v1 submitted 22 February, 2021;
originally announced February 2021.
-
How to Solve Fair $k$-Center in Massive Data Models
Authors:
Ashish Chiplunkar,
Sagar Kale,
Sivaramakrishnan Natarajan Ramamoorthy
Abstract:
Fueled by massive data, important decision making is being automated with the help of algorithms; therefore, fairness in algorithms has become an especially important research topic. In this work, we design new streaming and distributed algorithms for the fair $k$-center problem that models fair data summarization. The streaming and distributed models of computation have the attractive feature of being able to handle massive data sets that do not fit into main memory. Our main contributions are: (a) the first distributed algorithm, which has a provably constant approximation ratio and is extremely parallelizable, and (b) a two-pass streaming algorithm with a provable approximation guarantee matching the best known algorithm (which is not a streaming algorithm). Our algorithms have the advantages of being easy to implement in practice, being fast with linear running times, having very small working memory and communication, and outperforming existing algorithms on several real and synthetic data sets. To complement our distributed algorithm, we also give a hardness result for natural distributed algorithms, which holds even for the special case of $k$-center.
Submitted 24 February, 2020; v1 submitted 18 February, 2020;
originally announced February 2020.
-
Testing Graph Clusterability: Algorithms and Lower Bounds
Authors:
Ashish Chiplunkar,
Michael Kapralov,
Sanjeev Khanna,
Aida Mousavifar,
Yuval Peres
Abstract:
We consider the problem of testing graph cluster structure: given access to a graph $G=(V, E)$, can we quickly determine whether the graph can be partitioned into a few clusters with good inner conductance, or is far from any such graph? This is a generalization of the well-studied problem of testing graph expansion, where one wants to distinguish between the graph having good expansion (i.e.\ being a good single cluster) and the graph having a sparse cut (i.e.\ being a union of at least two clusters). A recent work of Czumaj, Peng, and Sohler (STOC'15) gave an ingenious sublinear time algorithm for testing $k$-clusterability in time $\tilde{O}(n^{1/2} \text{poly}(k))$: their algorithm implicitly embeds a random sample of vertices of the graph into Euclidean space, and then clusters the samples based on estimates of Euclidean distances between the points. This yields a very efficient testing algorithm, but only works if the cluster structure is very strong: it is necessary to assume that the gap between conductances of accepted and rejected graphs is at least logarithmic in the size of the graph $G$. In this paper we show how one can leverage more refined geometric information, namely angles as opposed to distances, to obtain a sublinear time tester that works even when the gap is a sufficiently large constant. Our tester is based on the singular value decomposition of a natural matrix derived from random walk transition probabilities from a small sample of seed nodes.
We complement our algorithm with a matching lower bound on the query complexity of testing clusterability. Our lower bound is based on a novel property testing problem, which we analyze using Fourier analytic tools. As a byproduct of our techniques, we also achieve new lower bounds for the problem of approximating MAX-CUT value in sublinear time.
Submitted 18 September, 2018; v1 submitted 14 August, 2018;
originally announced August 2018.
-
Set Cover with Delay -- Clairvoyance is not Required
Authors:
Yossi Azar,
Ashish Chiplunkar,
Shay Kutten,
Noam Touitou
Abstract:
In most online problems with delay, clairvoyance (i.e. knowing the future delay of a request upon its arrival) is required for polylogarithmic competitiveness. In this paper, we show that this is not the case for set cover with delay (SCD) -- specifically, we present the first non-clairvoyant algorithm, which is $O(\log n \log m)$-competitive, where $n$ is the number of elements and $m$ is the number of sets. This matches the best known result for the classic online set cover (a special case of non-clairvoyant SCD). Moreover, clairvoyance does not allow for significant improvement - we present lower bounds of $Ω(\sqrt{\log n})$ and $Ω(\sqrt{\log m})$ for SCD which apply for the clairvoyant case. In addition, the competitiveness of our algorithm does not depend on the number of requests. Such a guarantee on the size of the universe alone was not previously known even for the clairvoyant case - the only previously-known algorithm (due to Carrasco et al.) is clairvoyant, with competitiveness that grows with the number of requests. For the special case of vertex cover with delay, we show a simpler, deterministic algorithm which is $3$-competitive (and also non-clairvoyant).
Submitted 22 June, 2020; v1 submitted 23 July, 2018;
originally announced July 2018.
-
Prophet Secretary: Surpassing the $1-1/e$ Barrier
Authors:
Yossi Azar,
Ashish Chiplunkar,
Haim Kaplan
Abstract:
In the Prophet Secretary problem, samples from a known set of probability distributions arrive one by one in a uniformly random order, and an algorithm must irrevocably pick one of the samples as soon as it arrives. The goal is to maximize the expected value of the sample picked relative to the expected maximum of the distributions. This is one of the simplest and most fundamental problems in online decision making, and it models the process of selling one item to a sequence of customers. For a closely related problem called the Prophet Inequality, where the order of the random variables is adversarial, it is known that one can achieve in expectation $1/2$ of the expected maximum, and no better ratio is possible. For the Prophet Secretary problem, that is, when the variables arrive in a random order, Esfandiari et al.\ (ESA 2015) showed that one can actually get $1-1/e$ of the maximum. The $1-1/e$ bound was recently extended to more general settings (Ehsani et al., 2017). Given these results, one might be tempted to believe that $1-1/e$ is the correct bound. We show that this is not the case by providing an algorithm for the Prophet Secretary problem that beats the $1-1/e$ bound and achieves $1-1/e+1/400$ of the optimum value. We also prove a hardness result on the performance of algorithms under a natural restriction which we call deterministic distribution-insensitivity.
Submitted 6 November, 2017;
originally announced November 2017.
-
Polylogarithmic Bounds on the Competitiveness of Min-cost (Bipartite) Perfect Matching with Delays
Authors:
Yossi Azar,
Ashish Chiplunkar,
Haim Kaplan
Abstract:
We consider the problem of online Min-cost Perfect Matching with Delays (MPMD) recently introduced by Emek et al.\ (STOC 2016). This problem is defined on an underlying $n$-point metric space. An adversary presents real-time requests online at points of the metric space, and the algorithm is required to match them, possibly after keeping them waiting for some time. The cost incurred is the sum of the distances between matched pairs of points (the connection cost), and the sum of the waiting times of the requests (the delay cost). We present an algorithm with a competitive ratio of $O(\log n)$, which improves the upper bound of $O(\log^2n+\logΔ)$ of Emek et al.\ by removing the dependence on $Δ$, the aspect ratio of the metric space (which can be unbounded as a function of $n$). The core of our algorithm is a deterministic algorithm for MPMD on metrics induced by edge-weighted trees of height $h$, whose cost is guaranteed to be at most $O(1)$ times the connection cost plus $O(h)$ times the delay cost of every feasible solution. The reduction from MPMD on arbitrary metrics to MPMD on trees is achieved using the result on embedding $n$-point metric spaces into distributions over weighted hierarchically separated trees of height $O(\log n)$, with distortion $O(\log n)$. We also prove a lower bound of $Ω(\sqrt{\log n})$ on the competitive ratio of any randomized algorithm. This is the first lower bound which increases with $n$, and is attained on the metric of $n$ equally spaced points on a line.
The problem of Min-cost Bipartite Perfect Matching with Delays (MBPMD) is the same as MPMD except that every request is either positive or negative, and requests can be matched only if they have opposite polarity. We prove an upper bound of $O(\log n)$ and a lower bound of $Ω(\log^{1/3}n)$ on the competitive ratio of MBPMD with a more involved analysis.
Submitted 17 October, 2016;
originally announced October 2016.
-
On Randomized Algorithms for Matching in the Online Preemptive Model
Authors:
Ashish Chiplunkar,
Sumedh Tirodkar,
Sundar Vishwanathan
Abstract:
We investigate the power of randomized algorithms for the maximum cardinality matching (MCM) and the maximum weight matching (MWM) problems in the online preemptive model. In this model, the edges of a graph are revealed one by one and the algorithm is required to always maintain a valid matching. On seeing an edge, the algorithm has to either accept or reject the edge. If accepted, then the adjacent edges are discarded. The complexity of the problem is settled for deterministic algorithms.
Almost nothing is known for randomized algorithms. A lower bound of $1.693$ is known for MCM with a trivial upper bound of $2$. An upper bound of $5.356$ is known for MWM. We initiate a systematic study of the same in this paper with an aim to isolate and understand the difficulty. We begin with a primal-dual analysis of the deterministic algorithm due to McGregor. All deterministic lower bounds are on instances which are trees at every step. For this class of (unweighted) graphs we present a randomized algorithm which is $\frac{28}{15}$-competitive. The analysis is a considerable extension of the (simple) primal-dual analysis for the deterministic case. The key new technique is that the distribution of primal charge to dual variables depends on the "neighborhood" and needs to be done after having seen the entire input. The assignment is asymmetric, in that edges may assign different charges to the two end-points. Also, the proof depends on a non-trivial structural statement on the performance of the algorithm on the input tree.
The other main result of this paper is an extension of the deterministic lower bound of Varadaraja to a natural class of randomized algorithms which decide whether to accept a new edge or not using independent random choices.
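As a point of reference for the model described above, here is a minimal sketch of a greedy rule for MCM that accepts an edge only when both endpoints are free, so it never preempts. It maintains a maximal matching, which is a standard way to obtain a factor of $2$ for cardinality matching; that this is the rule behind the "trivial upper bound" is an assumption on our part, and the function name `greedy_online_matching` is hypothetical.

```python
def greedy_online_matching(edge_stream):
    """Process edges one by one, always maintaining a valid matching.

    An edge is accepted only if neither endpoint is already matched,
    so no previously accepted edge is ever discarded.
    """
    matched = set()  # vertices covered by the current matching
    matching = []    # accepted edges, in order of acceptance
    for u, v in edge_stream:
        if u != v and u not in matched and v not in matched:
            matching.append((u, v))
            matched.update((u, v))
    return matching


# Path a-b-c-d with the middle edge revealed first: greedy keeps one edge,
# while the optimum matching {a-b, c-d} has two.
bad_order = greedy_online_matching([("b", "c"), ("a", "b"), ("c", "d")])
good_order = greedy_online_matching([("a", "b"), ("c", "d"), ("b", "c")])
```

The path example shows why order matters: on the same graph the rule ends with one edge or two depending on which edge arrives first.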
Submitted 2 July, 2015; v1 submitted 30 December, 2014;
originally announced December 2014.
-
Approximating the Regular Graphic TSP in near linear time
Authors:
Ashish Chiplunkar,
Sundar Vishwanathan
Abstract:
We present a randomized approximation algorithm for computing traveling salesperson tours in undirected regular graphs. Given an $n$-vertex, $k$-regular graph, the algorithm computes a tour of length at most $\left(1+\frac{7}{\ln k-O(1)}\right)n$, with high probability, in $O(nk \log k)$ time. This improves upon a recent result by Vishnoi (FOCS 2012) for the same problem, in terms of both approximation factor and running time. The key ingredient of our algorithm is a technique that uses edge-coloring algorithms to sample a cycle cover with $O(n/\log k)$ cycles with high probability, in near linear time.
Additionally, we also give a deterministic $\frac{3}{2}+O\left(\frac{1}{\sqrt{k}}\right)$ factor approximation algorithm running in time $O(nk)$.
Submitted 13 June, 2014; v1 submitted 9 April, 2014;
originally announced April 2014.
-
On Randomized Memoryless Algorithms for the Weighted $k$-server Problem
Authors:
Ashish Chiplunkar,
Sundar Vishwanathan
Abstract:
The weighted $k$-server problem is a generalization of the $k$-server problem in which the cost of moving a server of weight $β_i$ through a distance $d$ is $β_i\cdot d$. The weighted server problem on uniform spaces models caching where caches have different write costs. We prove tight bounds on the performance of randomized memoryless algorithms for this problem on uniform metric spaces. We prove that there is an $α_k$-competitive memoryless algorithm for this problem, where $α_k=α_{k-1}^2+3α_{k-1}+1$; $α_1=1$. On the other hand we also prove that no randomized memoryless algorithm can have competitive ratio better than $α_k$.
To prove the upper bound of $α_k$ we develop a framework to bound from above the competitive ratio of any randomized memoryless algorithm for this problem. The key technical contribution is a method for working with potential functions defined implicitly as the solution of a linear system. The result is robust in the sense that a small change in the probabilities used by the algorithm results in a small change in the upper bound on the competitive ratio. The above result has two important implications. Firstly this yields an $α_k$-competitive memoryless algorithm for the weighted $k$-server problem on uniform spaces. This is the first competitive algorithm for $k>2$ which is memoryless. Secondly, this helps us prove that the Harmonic algorithm, which chooses probabilities in inverse proportion to weights, has a competitive ratio of $kα_k$.
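The recurrence $α_k=α_{k-1}^2+3α_{k-1}+1$ with $α_1=1$ grows very quickly, which a short computation makes visible. The sketch below only evaluates the recurrence and the stated $kα_k$ ratio for the Harmonic algorithm; the function names `alpha` and `harmonic_ratio` are hypothetical.

```python
def alpha(k):
    """Evaluate alpha_k = alpha_{k-1}^2 + 3*alpha_{k-1} + 1, with alpha_1 = 1."""
    if k < 1:
        raise ValueError("k must be a positive integer")
    a = 1  # alpha_1
    for _ in range(k - 1):
        a = a * a + 3 * a + 1
    return a


def harmonic_ratio(k):
    """Competitive ratio k * alpha_k stated for the Harmonic algorithm."""
    return k * alpha(k)


first_values = [alpha(k) for k in range(1, 5)]  # [1, 5, 41, 1805]
```

Since each step roughly squares the previous value, $α_k$ is doubly exponential in $k$.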
Submitted 29 May, 2014; v1 submitted 1 January, 2013;
originally announced January 2013.
-
Metrical Service Systems with Multiple Servers
Authors:
Ashish Chiplunkar,
Sundar Vishwanathan
Abstract:
We study the problem of metrical service systems with multiple servers (MSSMS), which generalizes two well-known problems -- the $k$-server problem, and metrical service systems. The MSSMS problem is to service requests, each of which is an $l$-point subset of a metric space, using $k$ servers, with the objective of minimizing the total distance traveled by the servers.
Feuerstein initiated a study of this problem by proving upper and lower bounds on the deterministic competitive ratio for uniform metric spaces. We improve Feuerstein's analysis of the upper bound and prove that his algorithm achieves a competitive ratio of $k({{k+l}\choose{l}}-1)$. In the randomized online setting, for uniform metric spaces, we give an algorithm which achieves a competitive ratio $\mathcal{O}(k^3\log l)$, beating the deterministic lower bound of ${{k+l}\choose{l}}-1$. We prove that any randomized algorithm for MSSMS on uniform metric spaces must be $Ω(\log kl)$-competitive. We then prove an improved lower bound of ${{k+2l-1}\choose{k}}-{{k+l-1}\choose{k}}$ on the competitive ratio of any deterministic algorithm for $(k,l)$-MSSMS, on general metric spaces. In the offline setting, we give a pseudo-approximation algorithm for $(k,l)$-MSSMS on general metric spaces, which achieves an approximation ratio of $l$ using $kl$ servers. We also prove a matching hardness result, that a pseudo-approximation with less than $kl$ servers is unlikely, even for uniform metric spaces. For general metric spaces, we highlight the limitations of a few popular techniques, that have been used in algorithm design for the $k$-server problem and metrical service systems.
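The binomial expressions above are easy to misread, so the following sketch simply evaluates them for given $k$ and $l$. It restates the formulas from the abstract and nothing more; the function names are hypothetical.

```python
from math import comb


def uniform_det_upper(k, l):
    """Upper bound k * (C(k+l, l) - 1) for Feuerstein's algorithm on uniform metrics."""
    return k * (comb(k + l, l) - 1)


def uniform_det_lower(k, l):
    """Deterministic lower bound C(k+l, l) - 1 on uniform metrics."""
    return comb(k + l, l) - 1


def general_det_lower(k, l):
    """Deterministic lower bound C(k+2l-1, k) - C(k+l-1, k) on general metrics."""
    return comb(k + 2 * l - 1, k) - comb(k + l - 1, k)


# Example with k = 2 servers and requests that are l = 2 point subsets.
bounds = (uniform_det_upper(2, 2), uniform_det_lower(2, 2), general_det_lower(2, 2))
```

For $k=l=2$ this gives $10$, $5$, and $7$ respectively.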
Submitted 21 May, 2014; v1 submitted 23 June, 2012;
originally announced June 2012.