Skip to main content

Showing 1–18 of 18 results for author: Aamand, A

.
  1. arXiv:2312.07535  [pdf, other

    cs.DS cs.LG

    Improved Frequency Estimation Algorithms with and without Predictions

    Authors: Anders Aamand, Justin Y. Chen, Huy Lê Nguyen, Sandeep Silwal, Ali Vakilian

    Abstract: Estimating frequencies of elements appearing in a data stream is a key task in large-scale data analysis. Popular sketching approaches to this problem (e.g., CountMin and CountSketch) come with worst-case guarantees that probabilistically bound the error of the estimated frequencies for any possible input. The work of Hsu et al. (2019) introduced the idea of using machine learning to tailor sketch… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  2. arXiv:2309.16840  [pdf, other

    cs.DS cs.LG

    Constant Approximation for Individual Preference Stable Clustering

    Authors: Anders Aamand, Justin Y. Chen, Allen Liu, Sandeep Silwal, Pattara Sukprasert, Ali Vakilian, Fred Zhang

    Abstract: Individual preference (IP) stability, introduced by Ahmadi et al. (ICML 2022), is a natural clustering objective inspired by stability and fairness constraints. A clustering is $α$-IP stable if the average distance of every data point to its own cluster is at most $α$ times the average distance to any other cluster. Unfortunately, determining if a dataset admits a $1$-IP stable clustering is NP-Ha… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 20 pages

  3. arXiv:2306.11312  [pdf, other

    cs.DS cs.LG stat.ML

    Data Structures for Density Estimation

    Authors: Anders Aamand, Alexandr Andoni, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal

    Abstract: We study statistical/computational tradeoffs for the following density estimation problem: given $k$ distributions $v_1, \ldots, v_k$ over a discrete domain of size $n$, and sampling access to a distribution $p$, identify $v_i$ that is "close" to $p$. Our main result is the first data structure that, given a sublinear (in $n$) number of samples from $p$, identifies $v_i$ in time sublinear in $k$.… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: To appear at ICML'23

  4. arXiv:2303.01453  [pdf, other

    cs.DS cs.LG

    Improved Space Bounds for Learning with Experts

    Authors: Anders Aamand, Justin Y. Chen, Huy Lê Nguyen, Sandeep Silwal

    Abstract: We give improved tradeoffs between space and regret for the online learning with expert advice problem over $T$ days with $n$ experts. Given a space budget of $n^δ$ for $δ\in (0,1)$, we provide an algorithm achieving regret $\tilde{O}(n^2 T^{1/(1+δ)})$, improving upon the regret bound $\tilde{O}(n^2 T^{2/(2+δ)})$ in the recent work of [PZ23]. The improvement is particularly salient in the regime… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  5. arXiv:2211.03232  [pdf, other

    cs.LG stat.ML

    Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks

    Authors: Anders Aamand, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Nicholas Schiefer, Sandeep Silwal, Tal Wagner

    Abstract: Recent work shows that the expressive power of Graph Neural Networks (GNNs) in distinguishing non-isomorphic graphs is exactly the same as that of the Weisfeiler-Lehman (WL) graph test. In particular, they show that the WL test can be simulated by GNNs. However, those simulations involve neural networks for the 'combine' function of size polynomial or even exponential in the number of graph nodes… ▽ More

    Submitted 21 December, 2022; v1 submitted 6 November, 2022; originally announced November 2022.

    Comments: 22 pages,5 figures, published at NeurIPS 2022. Updated funding statements

  6. arXiv:2112.03791  [pdf, other

    cs.CG cs.DS

    Online Sorting and Translational Packing of Convex Polygons

    Authors: Anders Aamand, Mikkel Abrahamsen, Lorenzo Beretta, Linda Kleist

    Abstract: We investigate several online packing problems in which convex polygons arrive one by one and have to be placed irrevocably into a container, while the aim is to minimize the used space. Among other variants, we consider strip packing and bin packing, where the container is the infinite horizontal strip $[0,\infty)\times [0,1]$ or a collection of $1 \times 1$ bins, respectively. We draw interest… ▽ More

    Submitted 8 April, 2024; v1 submitted 7 December, 2021; originally announced December 2021.

  7. arXiv:2110.11439  [pdf, other

    cs.DS cs.LG

    (Optimal) Online Bipartite Matching with Degree Information

    Authors: Anders Aamand, Justin Y. Chen, Piotr Indyk

    Abstract: We propose a model for online graph problems where algorithms are given access to an oracle that predicts (e.g., based on modeling assumptions or on past data) the degrees of nodes in the graph. Within this model, we study the classic problem of online bipartite matching, and a natural greedy matching algorithm called MinPredictedDegree, which uses predictions of the degrees of offline nodes. For… ▽ More

    Submitted 14 November, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: To appear in NeurIPS'22. A prior version of this work was titled "(Optimal) Online Bipartite Matching with Predicted Degrees"

  8. arXiv:2104.05093  [pdf, other

    cs.DS

    Load Balancing with Dynamic Set of Balls and Bins

    Authors: Anders Aamand, Jakob Bæk Tejs Knudsen, Mikkel Thorup

    Abstract: In dynamic load balancing, we wish to distribute balls into bins in an environment where both balls and bins can be added and removed. We want to minimize the maximum load of any bin but we also want to minimize the number of balls and bins affected when adding or removing a ball or a bin. We want a hashing-style solution where we given the ID of a ball can find its bin efficiently. We are given… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted at STOC'21

  9. arXiv:2104.03721  [pdf, other

    math.PR

    On Sums of Monotone Random Integer Variables

    Authors: Anders Aamand, Noga Alon, Jakob Bæk Tejs Knudsen, Mikkel Thorup

    Abstract: We say that a random integer variable $X$ is monotone if the modulus of the characteristic function of $X$ is decreasing on $[0,π]$. This is the case for many commonly encountered variables, e.g., Bernoulli, Poisson and geometric random variables. In this note, we provide estimates for the probability that the sum of independent monotone integer variables attains precisely a specific value. We do… ▽ More

    Submitted 13 April, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: 8 pages

  10. arXiv:2011.10983  [pdf, other

    cs.CG cs.DS

    Tiling with Squares and Packing Dominos in Polynomial Time

    Authors: Anders Aamand, Mikkel Abrahamsen, Thomas D. Ahle, Peter M. R. Rasmussen

    Abstract: A polyomino is a polygonal region with axis parallel edges and corners of integral coordinates, which may have holes. In this paper, we consider planar tiling and packing problems with polyomino pieces and a polyomino container $P$. We give two polynomial time algorithms, one for deciding if $P$ can be tiled with $k\times k$ squares for any fixed $k$ which can be part of the input (that is, decidi… ▽ More

    Submitted 9 August, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

    Comments: Compared to the first version, running times for domino packing have been improved and a simpler algorithm has been described

  11. arXiv:2004.01156  [pdf, other

    cs.DS

    No Repetition: Fast Streaming with Highly Concentrated Hashing

    Authors: Anders Aamand, Debarati Das, Evangelos Kipouridis, Jakob B. T. Knudsen, Peter M. R. Rasmussen, Mikkel Thorup

    Abstract: To get estimators that work within a certain error bound with high probability, a common strategy is to design one that works with constant probability, and then boost the probability using independent repetitions. Important examples of this approach are small space algorithms for estimating the number of distinct elements in a stream, or estimating the set similarity between large sets. Using sta… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: 10 pages

  12. arXiv:1909.00852  [pdf, other

    cs.CG

    Disks in Curves of Bounded Convex Curvature

    Authors: Anders Aamand, Mikkel Abrahamsen, Mikkel Thorup

    Abstract: We say that a simple, closed curve $γ$ in the plane has bounded convex curvature if for every point $x$ on $γ$, there is an open unit disk $U_x$ and $\varepsilon_x>0$ such that $x\in\partial U_x$ and $B_{\varepsilon_x}(x)\cap U_x\subset\text{Int}\;γ$. We prove that the interior of every curve of bounded convex curvature contains an open unit disk.

    Submitted 2 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in The American Mathematical Monthly

    MSC Class: 51M04 (Primary) 53A04 (Secondary)

  13. arXiv:1908.05198  [pdf, other

    cs.DS

    (Learned) Frequency Estimation Algorithms under Zipfian Distribution

    Authors: Anders Aamand, Piotr Indyk, Ali Vakilian

    Abstract: \begin{abstract} The frequencies of the elements in a data stream are an important statistical measure and the task of estimating them arises in many applications within data analysis and machine learning. Two of the most popular algorithms for this problem, Count-Min and Count-Sketch, are widely used in practice. In a recent work [Hsu et al., ICLR'19], it was shown empirically that augmenting C… ▽ More

    Submitted 11 August, 2020; v1 submitted 14 August, 2019; originally announced August 2019.

  14. arXiv:1905.00369  [pdf, other

    cs.DS

    Fast hashing with Strong Concentration Bounds

    Authors: Anders Aamand, Jakob B. T. Knudsen, Mathias B. T. Knudsen, Peter M. R. Rasmussen, Mikkel Thorup

    Abstract: Previous work on tabulation hashing by Patrascu and Thorup from STOC'11 on simple tabulation and from SODA'13 on twisted tabulation offered Chernoff-style concentration bounds on hash based sums, e.g., the number of balls/keys hashing to a given bin, but under some quite severe restrictions on the expected values of these sums. The basic idea in tabulation hashing is to view a key as consisting of… ▽ More

    Submitted 10 August, 2020; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: 54 pages, 3 figures. An extended abstract appeared at the 52nd Annual ACM Symposium on Theory of Computing (STOC20)

  15. arXiv:1902.01732  [pdf, other

    cs.CG

    Classifying Convex Bodies by their Contact and Intersection Graphs

    Authors: Anders Aamand, Mikkel Abrahamsen, Jakob Bæk Tejs Knudsen, Peter Michael Reichstein Rasmussen

    Abstract: Suppose that $A$ is a convex body in the plane and that $A_1,\dots,A_n$ are translates of $A$. Such translates give rise to an intersection graph of $A$, $G=(V,E)$, with vertices $V=\{1,\dots,n\}$ and edges $E=\{uv\mid A_u\cap A_v\neq \emptyset\}$. The subgraph $G'=(V, E')$ satisfying that $E'\subset E$ is the set of edges $uv$ for which the interiors of $A_u$ and $A_v$ are disjoint is a unit dist… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

    Comments: 19 pages, 7 figures

    MSC Class: 52C05

  16. arXiv:1810.13187  [pdf, ps, other

    cs.DS

    Non-Empty Bins with Simple Tabulation Hashing

    Authors: Anders Aamand, Mikkel Thorup

    Abstract: We consider the hashing of a set $X\subseteq U$ with $|X|=m$ using a simple tabulation hash function $h:U\to [n]=\{0,\dots,n-1\}$ and analyse the number of non-empty bins, that is, the size of $h(X)$. We show that the expected size of $h(X)$ matches that with fully random hashing to within low-order terms. We also provide concentration bounds. The number of non-empty bins is a fundamental measure… ▽ More

    Submitted 31 October, 2018; originally announced October 2018.

    Comments: To appear at SODA'19

  17. arXiv:1804.09684  [pdf, other

    cs.DS

    Power of $d$ Choices with Simple Tabulation

    Authors: Anders Aamand, Mathias Bæk Tejs Knudsen, Mikkel Thorup

    Abstract: Suppose that we are to place $m$ balls into $n$ bins sequentially using the $d$-choice paradigm: For each ball we are given a choice of $d$ bins, according to $d$ hash functions $h_1,\dots,h_d$ and we place the ball in the least loaded of these bins breaking ties arbitrarily. Our interest is in the number of balls in the fullest bin after all $m$ balls have been placed. Azar et al. [STOC'94] pro… ▽ More

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: Accepted at ICALP 2018

  18. arXiv:1708.07389  [pdf, ps, other

    cs.DS cs.DM

    One-Way Trail Orientations

    Authors: Anders Aamand, Niklas Hjuler, Jacob Holm, Eva Rotenberg

    Abstract: Given a graph, does there exist an orientation of the edges such that the resulting directed graph is strongly connected? Robbins' theorem [Robbins, Am. Math. Monthly, 1939] states that such an orientation exists if and only if the graph is $2$-edge connected. A natural extension of this problem is the following: Suppose that the edges of the graph is partitioned into trails. Can we orient the t… ▽ More

    Submitted 24 August, 2017; originally announced August 2017.

    Comments: Earlier version submitted to SODA'17