Showing 1–43 of 43 results for author: Rubinfeld, R

  1. arXiv:2403.00129  [pdf, ps, other

    cs.DS

    Average-Case Local Computation Algorithms

    Authors: Amartya Shankha Biswas, Ruidi Cao, Edward Pyne, Ronitt Rubinfeld

    Abstract: We initiate the study of Local Computation Algorithms on average case inputs. In the Local Computation Algorithm (LCA) model, we are given probe access to a huge graph, and asked to answer membership queries about some combinatorial structure on the graph, answering each query with sublinear work. For instance, an LCA for the $k$-spanner problem gives access to a sparse subgraph $H\subseteq G$ t…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 27 pages

  2. arXiv:2211.03232  [pdf, other

    cs.LG stat.ML

    Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks

    Authors: Anders Aamand, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Nicholas Schiefer, Sandeep Silwal, Tal Wagner

    Abstract: Recent work shows that the expressive power of Graph Neural Networks (GNNs) in distinguishing non-isomorphic graphs is exactly the same as that of the Weisfeiler-Lehman (WL) graph test. In particular, they show that the WL test can be simulated by GNNs. However, those simulations involve neural networks for the 'combine' function of size polynomial or even exponential in the number of graph nodes…

    Submitted 21 December, 2022; v1 submitted 6 November, 2022; originally announced November 2022.

    Comments: 22 pages, 5 figures, published at NeurIPS 2022. Updated funding statements

  3. arXiv:2204.11894  [pdf, other

    cs.DS

    Properly learning monotone functions via local reconstruction

    Authors: Jane Lange, Ronitt Rubinfeld, Arsen Vasilyan

    Abstract: We give a $2^{\tilde{O}(\sqrt{n}/ε)}$-time algorithm for properly learning monotone Boolean functions under the uniform distribution over $\{0,1\}^n$. Our algorithm is robust to adversarial label noise and has a running time nearly matching that of the state-of-the-art improper learning algorithm of Bshouty and Tamon (JACM '96) and an information-theoretic lower bound of Blais et al (RANDOM '15).…

    Submitted 27 March, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: FOCS 2022

  4. arXiv:2204.07196  [pdf, ps, other

    cs.LG cs.DS

    Testing distributional assumptions of learning algorithms

    Authors: Ronitt Rubinfeld, Arsen Vasilyan

    Abstract: There are many high dimensional function classes that have fast agnostic learning algorithms when assumptions on the distribution of examples can be made, such as Gaussianity or uniformity over the domain. But how can one be confident that data indeed satisfies such assumption, so that one can trust in output quality of the agnostic learning algorithm? We propose a model by which to systematically…

    Submitted 19 November, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    ACM Class: F.3

  5. arXiv:2203.09572  [pdf, other

    cs.DS cs.LG

    Triangle and Four Cycle Counting with Predictions in Graph Streams

    Authors: Justin Y. Chen, Talya Eden, Piotr Indyk, Honghao Lin, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal, Tal Wagner, David P. Woodruff, Michael Zhang

    Abstract: We propose data-driven one-pass streaming algorithms for estimating the number of triangles and four cycles, two fundamental problems in graph analytics that are widely studied in the graph data stream literature. Recently, (Hsu 2018) and (Jiang 2020) applied machine learning techniques in other data stream problems, using a trained oracle that can predict certain properties of the stream elements…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: To be presented at ICLR 2022

  6. arXiv:2107.06582  [pdf, other

    cs.DS

    Towards a Decomposition-Optimal Algorithm for Counting and Sampling Arbitrary Motifs in Sublinear Time

    Authors: Amartya Shankha Biswas, Talya Eden, Ronitt Rubinfeld

    Abstract: We consider the problem of sampling and approximately counting an arbitrary given motif $H$ in a graph $G$, where access to $G$ is given via queries: degree, neighbor, and pair, as well as uniform edge sample queries. Previous algorithms for these tasks were based on a decomposition of $H$ into a collection of odd cycles and stars, denoted…

    Submitted 19 July, 2021; v1 submitted 14 July, 2021; originally announced July 2021.

  7. arXiv:2106.08396  [pdf, other

    cs.LG cs.DS math.ST

    Learning-based Support Estimation in Sublinear Time

    Authors: Talya Eden, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal, Tal Wagner

    Abstract: We consider the problem of estimating the number of distinct elements in a large data set (or, equivalently, the support size of the distribution induced by the data set) from a random sample of its elements. The problem occurs in many applications, including biology, genomics, computer systems and linguistics. A line of research spanning the last decade resulted in algorithms that estimate the su…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: 17 pages. Published as a conference paper in ICLR 2021

  8. arXiv:2102.07740  [pdf, other

    cs.DS

    Local Access to Random Walks

    Authors: Amartya Shankha Biswas, Edward Pyne, Ronitt Rubinfeld

    Abstract: For a graph $G$ on $n$ vertices, naively sampling the position of a random walk at time $t$ requires work $Ω(t)$. We desire local access algorithms supporting $\text{position}(G,s,t)$ queries, which return the position of a random walk from some start vertex $s$ at time $t$, where the joint distribution of returned positions is $1/\text{poly}(n)$ close to the uniform distribution over such walk…

    Submitted 15 February, 2021; originally announced February 2021.

  9. arXiv:2012.15002  [pdf, ps, other

    cs.DS

    New Partitioning Techniques and Faster Algorithms for Approximate Interval Scheduling

    Authors: Spencer Compton, Slobodan Mitrović, Ronitt Rubinfeld

    Abstract: Interval scheduling is a basic problem in the theory of algorithms and a classical task in combinatorial optimization. We develop a set of techniques for partitioning and grouping jobs based on their starting and ending times, that enable us to view an instance of interval scheduling on many jobs as a union of multiple interval scheduling instances, each containing only a few jobs. Instantiating t…

    Submitted 23 February, 2023; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: Main result (Theorem 2) has stronger guarantees, updates/queries now in $\operatorname{poly}(\log(n),\frac{1}{\varepsilon})$ time

  10. arXiv:2010.02888  [pdf, other

    cs.LG math.ST stat.ML

    Testing Tail Weight of a Distribution Via Hazard Rate

    Authors: Maryam Aliakbarpour, Amartya Shankha Biswas, Kavya Ravichandran, Ronitt Rubinfeld

    Abstract: Understanding the shape of a distribution of data is of interest to people in a great variety of fields, as it may affect the types of algorithms used for that data. We study one such problem in the framework of distribution property testing, characterizing the number of samples required to distinguish whether a distribution has a certain property or is far from having that property. In particu…

    Submitted 4 December, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

  11. arXiv:2008.08032  [pdf, ps, other

    cs.DS

    Sampling Multiple Edges Efficiently

    Authors: Talya Eden, Saleet Mossel, Ronitt Rubinfeld

    Abstract: We present a sublinear time algorithm that allows one to sample multiple edges from a distribution that is pointwise $ε$-close to the uniform distribution, in an \emph{amortized-efficient} fashion. We consider the adjacency list query model, where access to a graph $G$ is given via degree and neighbor queries. The problem of sampling a single edge in this model has been raised by Eden and Rosenb…

    Submitted 19 July, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

    ACM Class: F.2.2; G.2.2

  12. arXiv:2008.03891  [pdf, other

    cs.DB

    Rapid Approximate Aggregation with Distribution-Sensitive Interval Guarantees

    Authors: Stephen Macke, Maryam Aliakbarpour, Ilias Diakonikolas, Aditya Parameswaran, Ronitt Rubinfeld

    Abstract: Aggregating data is fundamental to data analytics, data exploration, and OLAP. Approximate query processing (AQP) techniques are often used to accelerate computation of aggregates using samples, for which confidence intervals (CIs) are widely used to quantify the associated error. CIs used in practice fall into two categories: techniques that are tight but not correct, i.e., they yield tight inter…

    Submitted 10 August, 2020; originally announced August 2020.

  13. arXiv:2006.05028  [pdf, other

    cs.DS cs.LG

    Online Page Migration with ML Advice

    Authors: Piotr Indyk, Frederik Mallmann-Trenn, Slobodan Mitrović, Ronitt Rubinfeld

    Abstract: We consider online algorithms for the {\em page migration problem} that use predictions, potentially imperfect, to improve their performance. The best known online algorithms for this problem, due to Westbrook'94 and Bienkowski et al'17, have competitive ratios strictly bounded away from 1. In contrast, we show that if the algorithm is given a prediction of the input sequence, then it can achieve…

    Submitted 8 June, 2020; originally announced June 2020.

  14. arXiv:2002.08299  [pdf, other

    cs.DS cs.DC

    Massively Parallel Algorithms for Small Subgraph Counting

    Authors: Amartya Shankha Biswas, Talya Eden, Quanquan C. Liu, Slobodan Mitrović, Ronitt Rubinfeld

    Abstract: Over the last two decades, frameworks for distributed-memory parallel computation, such as MapReduce, Hadoop, Spark and Dryad, have gained significant popularity with the growing prevalence of large network datasets. The Massively Parallel Computation (MPC) model is the de-facto standard for studying graph algorithms in these frameworks theoretically. Subgraph counting is one such fundamental prob…

    Submitted 18 July, 2022; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Abstract truncated per arXiv requirements

  15. Monotone probability distributions over the Boolean cube can be learned with sublinear samples

    Authors: Ronitt Rubinfeld, Arsen Vasilyan

    Abstract: A probability distribution over the Boolean cube is monotone if flipping the value of a coordinate from zero to one can only increase the probability of an element. Given samples of an unknown monotone distribution over the Boolean cube, we give (to our knowledge) the first algorithm that learns an approximation of the distribution in statistical distance using a number of samples that is sublinea…

    Submitted 9 February, 2020; originally announced February 2020.

  16. arXiv:1910.14154  [pdf, ps, other

    cs.DS cs.DC

    Improved Local Computation Algorithm for Set Cover via Sparsification

    Authors: Christoph Grunau, Slobodan Mitrović, Ronitt Rubinfeld, Ali Vakilian

    Abstract: We design a Local Computation Algorithm (LCA) for the set cover problem. Given a set system where each set has size at most $s$ and each element is contained in at most $t$ sets, the algorithm reports whether a given set is in some fixed set cover whose expected size is $O(\log{s})$ times the minimum fractional set cover value. Our algorithm requires…

    Submitted 5 November, 2019; v1 submitted 30 October, 2019; originally announced October 2019.

    Comments: To appear in ACM-SIAM Symposium on Discrete Algorithms (SODA 2020)

  17. arXiv:1907.03190  [pdf, ps, other

    math.ST cs.DS stat.ML

    Testing Mixtures of Discrete Distributions

    Authors: Maryam Aliakbarpour, Ravi Kumar, Ronitt Rubinfeld

    Abstract: There has been significant study on the sample complexity of testing properties of distributions over large domains. For many properties, it is known that the sample complexity can be substantially smaller than the domain size. For example, over a domain of size $n$, distinguishing the uniform distribution from distributions that are far from uniform in $\ell_1$-distance uses only $O(\sqrt{n})$ sa…

    Submitted 6 July, 2019; originally announced July 2019.

    Comments: Appeared in COLT 2019

  18. arXiv:1907.03182  [pdf, ps, other

    cs.DS math.ST stat.ML

    Towards Testing Monotonicity of Distributions Over General Posets

    Authors: Maryam Aliakbarpour, Themis Gouleakis, John Peebles, Ronitt Rubinfeld, Anak Yodpinyanee

    Abstract: In this work, we consider the sample complexity required for testing the monotonicity of distributions over partial orders. A distribution $p$ over a poset is monotone if, for any pair of domain elements $x$ and $y$ such that $x \preceq y$, $p(x) \leq p(y)$. To understand the sample complexity of this problem, we introduce a new property called bigness over a finite domain, where the distribution…

    Submitted 6 July, 2019; originally announced July 2019.

    Comments: Appeared in COLT 2019

  19. arXiv:1904.06745  [pdf, ps, other

    cs.DS

    Approximating the noise sensitivity of a monotone Boolean function

    Authors: Ronitt Rubinfeld, Arsen Vasilyan

    Abstract: The noise sensitivity of a Boolean function $f: \{0,1\}^n \rightarrow \{0,1\}$ is one of its fundamental properties. A function of a positive noise parameter $δ$, it is denoted as $NS_δ[f]$. Here we study the algorithmic problem of approximating it for monotone $f$, such that $NS_δ[f] \geq 1/n^{C}$ for constant $C$, and where $δ$ satisfies $1/n \leq δ\leq 1/2$. For such $f$ and $δ$, we give a rand…

    Submitted 14 April, 2019; originally announced April 2019.

  20. arXiv:1902.08266  [pdf, other

    cs.DS cs.DM

    Local Computation Algorithms for Spanners

    Authors: Merav Parter, Ronitt Rubinfeld, Ali Vakilian, Anak Yodpinyanee

    Abstract: A graph spanner is a fundamental graph structure that faithfully preserves the pairwise distances in the input graph up to a small multiplicative stretch. The common objective in the computation of spanners is to achieve the best-known existential size-stretch trade-off efficiently. Classical models and algorithmic analysis of graph spanners essentially assume that the algorithm can read the inp…

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: An extended abstract appeared in the proceedings of ITCS 2019

  21. arXiv:1902.03534  [pdf, ps, other

    cs.DS cs.DM

    Set Cover in Sub-linear Time

    Authors: Piotr Indyk, Sepideh Mahabadi, Ronitt Rubinfeld, Ali Vakilian, Anak Yodpinyanee

    Abstract: We study the classic set cover problem from the perspective of sub-linear algorithms. Given access to a collection of $m$ sets over $n$ elements in the query model, we show that sub-linear algorithms derived from existing techniques have almost tight query complexities. On one hand, first we show an adaptation of the streaming algorithm presented in Har-Peled et al. [2016] to the sub-linear quer…

    Submitted 9 February, 2019; originally announced February 2019.

  22. arXiv:1802.08237  [pdf, ps, other

    cs.DS cs.DC

    Improved Massively Parallel Computation Algorithms for MIS, Matching, and Vertex Cover

    Authors: Mohsen Ghaffari, Themis Gouleakis, Christian Konrad, Slobodan Mitrović, Ronitt Rubinfeld

    Abstract: We present $O(\log\log n)$-round algorithms in the Massively Parallel Computation (MPC) model, with $\tilde{O}(n)$ memory per machine, that compute a maximal independent set, a $1+ε$ approximation of maximum matching, and a $2+ε$ approximation of minimum vertex cover, for any $n$-vertex graph and any constant $ε>0$. These improve the state of the art as follows: - Our MIS algorithm leads to a si…

    Submitted 17 March, 2022; v1 submitted 22 February, 2018; originally announced February 2018.

  23. arXiv:1711.10692  [pdf, other

    cs.DS

    Local Access to Huge Random Objects through Partial Sampling

    Authors: Amartya Shankha Biswas, Ronitt Rubinfeld, Anak Yodpinyanee

    Abstract: Consider an algorithm performing a computation on a huge random object. Is it necessary to generate the entire object up front, or is it possible to provide query access to the object and sample it incrementally "on-the-fly"? Such an implementation should emulate the object by answering queries in a manner consistent with a random instance sampled from the true distribution. Our first set of res…

    Submitted 5 December, 2020; v1 submitted 29 November, 2017; originally announced November 2017.

  24. arXiv:1707.05497  [pdf, other

    cs.LG cs.DS cs.IT stat.ML

    Differentially Private Identity and Closeness Testing of Discrete Distributions

    Authors: Maryam Aliakbarpour, Ilias Diakonikolas, Ronitt Rubinfeld

    Abstract: We investigate the problems of identity and closeness testing over a discrete population from random samples. Our goal is to develop efficient testers while guaranteeing Differential Privacy to the individuals of the population. We describe an approach that yields sample-efficient differentially private testers for these problems. Our theoretical results show that there exist private identity and…

    Submitted 18 July, 2017; originally announced July 2017.

    Comments: Submitted, May 2017

  25. arXiv:1604.07038  [pdf, ps, other

    cs.DS

    A Local Algorithm for Constructing Spanners in Minor-Free Graphs

    Authors: Reut Levi, Dana Ron, Ronitt Rubinfeld

    Abstract: Constructing a spanning tree of a graph is one of the most basic tasks in graph theory. We consider this problem in the setting of local algorithms: one wants to quickly determine whether a given edge $e$ is in a specific spanning tree, without computing the whole spanning tree, but rather by inspecting the local neighborhood of $e$. The challenge is to maintain consistency. That is, to answer que…

    Submitted 24 April, 2016; originally announced April 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1402.3609

  26. arXiv:1601.04233  [pdf, other

    cs.DS

    Sublinear-Time Algorithms for Counting Star Subgraphs with Applications to Join Selectivity Estimation

    Authors: Maryam Aliakbarpour, Amartya Shankha Biswas, Themistoklis Gouleakis, John Peebles, Ronitt Rubinfeld, Anak Yodpinyanee

    Abstract: We study the problem of estimating the value of sums of the form $S_p \triangleq \sum \binom{x_i}{p}$ when one has the ability to sample $x_i \geq 0$ with probability proportional to its magnitude. When $p=2$, this problem is equivalent to estimating the selectivity of a self-join query in database systems when one can sample rows randomly. We also study the special case when $\{x_i\}$ is the degr…

    Submitted 16 January, 2016; originally announced January 2016.

    Comments: 21 pages

  27. arXiv:1507.03558  [pdf, ps, other

    cs.DS cs.CC math.PR math.ST

    Testing Shape Restrictions of Discrete Distributions

    Authors: Clément L. Canonne, Ilias Diakonikolas, Themis Gouleakis, Ronitt Rubinfeld

    Abstract: We study the question of testing structured properties (classes) of discrete distributions. Specifically, given sample access to an arbitrary distribution $D$ over $[n]$ and a property $\mathcal{P}$, the goal is to distinguish between $D\in\mathcal{P}$ and $\ell_1(D,\mathcal{P})>\varepsilon$. We develop a general algorithm for this question, which applies to a large range of "shape-constrained" pr…

    Submitted 21 January, 2016; v1 submitted 13 July, 2015; originally announced July 2015.

  28. arXiv:1504.06544  [pdf, ps, other

    cs.DS cs.LG math.PR

    Sampling Correctors

    Authors: Clément Canonne, Themis Gouleakis, Ronitt Rubinfeld

    Abstract: In many situations, sample data is obtained from a noisy or imperfect source. In order to address such corruptions, this paper introduces the concept of a sampling corrector. Such algorithms use structure that the distribution is purported to have, in order to allow one to make "on-the-fly" corrections to samples drawn from probability distributions. These algorithms then act as filters between th…

    Submitted 31 March, 2018; v1 submitted 24 April, 2015; originally announced April 2015.

  29. arXiv:1502.04022  [pdf, ps, other

    cs.DS

    Local Computation Algorithms for Graphs of Non-Constant Degrees

    Authors: Reut Levi, Ronitt Rubinfeld, Anak Yodpinyanee

    Abstract: In the model of \emph{local computation algorithms} (LCAs), we aim to compute the queried part of the output by examining only a small (sublinear) portion of the input. Many recently developed LCAs on graph problems achieve time and space complexities with very low dependence on $n$, the number of vertices. Nonetheless, these complexities are generally at least exponential in $d$, the upper bound…

    Submitted 13 February, 2015; originally announced February 2015.

  30. arXiv:1502.00413  [pdf, ps, other

    math.CO cs.DS

    Constructing Near Spanning Trees with Few Local Inspections

    Authors: Reut Levi, Guy Moshkovitz, Dana Ron, Ronitt Rubinfeld, Asaf Shapira

    Abstract: Constructing a spanning tree of a graph is one of the most basic tasks in graph theory. Motivated by several recent studies of local graph algorithms, we consider the following variant of this problem. Let G be a connected bounded-degree graph. Given an edge $e$ in $G$ we would like to decide whether $e$ belongs to a connected subgraph $G'$ consisting of $(1+ε)n$ edges (for a prespecified constant…

    Submitted 3 February, 2015; v1 submitted 2 February, 2015; originally announced February 2015.

    Comments: References fixed

  31. A Self-Tester for Linear Functions over the Integers with an Elementary Proof of Correctness

    Authors: Sheela Devadas, Ronitt Rubinfeld

    Abstract: We present simple, self-contained proofs of correctness for algorithms for linearity testing and program checking of linear functions on finite subsets of integers represented as n-bit numbers. In addition we explore a generalization of self-testing to homomorphisms on a multidimensional vector space. We show that our self-testing algorithm for the univariate case can be directly generalized to ve…

    Submitted 22 June, 2015; v1 submitted 17 December, 2014; originally announced December 2014.

  32. arXiv:1412.3040  [pdf, other

    cs.DB

    Rapid Sampling for Visualizations with Ordering Guarantees

    Authors: Albert Kim, Eric Blais, Aditya Parameswaran, Piotr Indyk, Sam Madden, Ronitt Rubinfeld

    Abstract: Visualizations are frequently used as a means to understand trends and gather insights from datasets, but often take a long time to generate. In this paper, we focus on the problem of rapidly generating approximate visualizations while preserving crucial visual properties of interest to analysts. Our primary focus will be on sampling algorithms that preserve the visual property of ordering; our…

    Submitted 9 December, 2014; originally announced December 2014.

    Comments: Tech Report. 17 pages. Condensed version to appear in VLDB Vol. 8 No. 5

  33. arXiv:1402.3835  [pdf, ps, other

    cs.DS cs.DM math.PR math.ST

    Testing probability distributions underlying aggregated data

    Authors: Clément Canonne, Ronitt Rubinfeld

    Abstract: In this paper, we analyze and study a hybrid model for testing and learning probability distributions. Here, in addition to samples, the testing algorithm is provided with one of two different types of oracles to the unknown distribution $D$ over $[n]$. More precisely, we define both the dual and cumulative dual access models, in which the algorithm $A$ can both sample from $D$ and respectively, f…

    Submitted 16 February, 2014; originally announced February 2014.

  34. arXiv:1402.3609  [pdf, ps, other

    cs.DS

    Local Algorithms for Sparse Spanning Graphs

    Authors: Reut Levi, Dana Ron, Ronitt Rubinfeld

    Abstract: Constructing a spanning tree of a graph is one of the most basic tasks in graph theory. We consider a relaxed version of this problem in the setting of local algorithms. The relaxation is that the constructed subgraph is a sparse spanning subgraph containing at most $(1+ε)n$ edges (where $n$ is the number of vertices and $ε$ is a given approximation/sparsity parameter). In the local setting, the g…

    Submitted 27 April, 2021; v1 submitted 14 February, 2014; originally announced February 2014.

    Comments: Upper bounds for expanding graphs and minor free graphs

  35. arXiv:1301.2495  [pdf, ps, other

    cs.DS

    A simple online competitive adaptation of Lempel-Ziv compression with efficient random access support

    Authors: Akashnil Dutta, Reut Levi, Dana Ron, Ronitt Rubinfeld

    Abstract: We present a simple adaptation of the Lempel Ziv 78' (LZ78) compression scheme ({\em IEEE Transactions on Information Theory, 1978}) that supports efficient random access to the input string. Namely, given query access to the compressed string, it is possible to efficiently recover any symbol of the input string. The compression algorithm is given as input a parameter $\epsilon > 0$, and with very high…

    Submitted 11 January, 2013; originally announced January 2013.

  36. arXiv:1208.2956  [pdf, ps, other

    cs.DS

    Local reconstructors and tolerant testers for connectivity and diameter

    Authors: Andrea Campagna, Alan Guo, Ronitt Rubinfeld

    Abstract: A local property reconstructor for a graph property is an algorithm which, given oracle access to the adjacency list of a graph that is "close" to having the property, provides oracle access to the adjacency matrix of a "correction" of the graph, i.e. a graph which has the property and is close to the given graph. For this model, we achieve local property reconstructors for the properties of conne…

    Submitted 21 June, 2013; v1 submitted 14 August, 2012; originally announced August 2012.

    Comments: 21 pages, updated abstract, improved exposition

  37. arXiv:1110.1079  [pdf, ps, other

    cs.DS

    A Near-Optimal Sublinear-Time Algorithm for Approximating the Minimum Vertex Cover Size

    Authors: Krzysztof Onak, Dana Ron, Michal Rosen, Ronitt Rubinfeld

    Abstract: We give a nearly optimal sublinear-time algorithm for approximating the size of a minimum vertex cover in a graph G. The algorithm may query the degree deg(v) of any vertex v of its choice, and for each 1 <= i <= deg(v), it may ask for the i-th neighbor of v. Letting VC_opt(G) denote the minimum size of vertex cover in G, the algorithm outputs, with high constant success probability, an estimate V…

    Submitted 5 October, 2011; originally announced October 2011.

  38. arXiv:1109.6178  [pdf, ps, other

    cs.DS

    Space-efficient Local Computation Algorithms

    Authors: Noga Alon, Ronitt Rubinfeld, Shai Vardi, Ning Xie

    Abstract: Recently Rubinfeld et al. (ICS 2011, pp. 223--238) proposed a new model of sublinear algorithms called \emph{local computation algorithms}. In this model, a computation problem $F$ may have more than one legal solution and each of them consists of many bits. The local computation algorithm for $F$ should answer in an online fashion, for any index $i$, the $i^{\mathrm{th}}$ bit of some legal soluti…

    Submitted 29 November, 2011; v1 submitted 28 September, 2011; originally announced September 2011.

  39. arXiv:1104.1377  [pdf, ps, other

    cs.DS

    Fast Local Computation Algorithms

    Authors: Ronitt Rubinfeld, Gil Tamir, Shai Vardi, Ning Xie

    Abstract: For input $x$, let $F(x)$ denote the set of outputs that are the "legal" answers for a computational problem $F$. Suppose $x$ and members of $F(x)$ are so large that there is not time to read them in their entirety. We propose a model of {\em local computation algorithms} which for a given input $x$, support queries by a user to values of specified locations $y_i$ in a legal output $y \in F(x)$. W…

    Submitted 7 April, 2011; originally announced April 2011.

    Comments: A preliminary version of this paper appeared in ICS 2011, pp. 223-238

  40. arXiv:1101.5345  [pdf, ps, other

    cs.DS cs.DM

    Approximating the Influence of a monotone Boolean function in O(\sqrt{n}) query complexity

    Authors: Dana Ron, Ronitt Rubinfeld, Muli Safra, Omri Weinstein

    Abstract: The {\em Total Influence} ({\em Average Sensitivity}) of a discrete function is one of its fundamental measures. We study the problem of approximating the total influence of a monotone Boolean function $f: \{\pm1\}^n \longrightarrow \{\pm1\}$, which we denote by $I[f]$. We present a randomized algorithm that approximates the influence of such…

    Submitted 27 January, 2011; originally announced January 2011.

  41. arXiv:1009.5397  [pdf, ps, other

    cs.DS math.PR math.ST

    Testing Closeness of Discrete Distributions

    Authors: Tugkan Batu, Lance Fortnow, Ronitt Rubinfeld, Warren D. Smith, Patrick White

    Abstract: Given samples from two distributions over an $n$-element set, we wish to test whether these distributions are statistically close. We present an algorithm which uses sublinear in $n$, specifically, $O(n^{2/3}ε^{-8/3}\log n)$, independent samples from each distribution, runs in time linear in the sample size, makes no assumptions about the structure of the distributions, and distinguishes the cases…

    Submitted 4 November, 2010; v1 submitted 27 September, 2010; originally announced September 2010.

    Comments: 26 pages, A preliminary version of this paper appeared in the 41st Symposium on Foundations of Computer Science, 2000, Redondo Beach, CA, A comment from W.D. Smith has been added on the title page

    ACM Class: F.2.2; G.3

  42. arXiv:0904.0292  [pdf, ps, other

    cs.DS

    Sublinear Time Algorithms for Earth Mover's Distance

    Authors: Khanh Do Ba, Huy L Nguyen, Huy N Nguyen, Ronitt Rubinfeld

    Abstract: We study the problem of estimating the Earth Mover's Distance (EMD) between probability distributions when given access only to samples. We give closeness testers and additive-error estimators over domains in $[0, Δ]^d$, with sample complexities independent of domain size - permitting the testability even of continuous distributions over infinite domains. Instead, our algorithms depend on other…

    Submitted 1 April, 2009; originally announced April 2009.

    Comments: 12 pages

  43. arXiv:0706.1084  [pdf, ps, other

    cs.DS

    Sublinear Algorithms for Approximating String Compressibility

    Authors: Sofya Raskhodnikova, Dana Ron, Ronitt Rubinfeld, Adam Smith

    Abstract: We raise the question of approximating the compressibility of a string with respect to a fixed compression scheme, in sublinear time. We study this question in detail for two popular lossless compression schemes: run-length encoding (RLE) and Lempel-Ziv (LZ), and present sublinear algorithms for approximating compressibility with respect to both schemes. We also give several lower bounds that sh…

    Submitted 7 June, 2007; originally announced June 2007.

    Comments: To appear in the proceedings of RANDOM 2007