Skip to main content

Showing 1–50 of 85 results for author: Seshadhri, C

.
  1. arXiv:2406.08624  [pdf, other

    cs.DS cs.DM cs.SI

    A Sublinear Algorithm for Approximate Shortest Paths in Large Networks

    Authors: Sabyasachi Basu, Nadia Kōshima, Talya Eden, Omri Ben-Eliezer, C. Seshadhri

    Abstract: Computing distances and finding shortest paths in massive real-world networks is a fundamental algorithmic task in network analysis. There are two main approaches to solving this task. On one hand are traversal-based algorithms like bidirectional breadth-first search (BiBFS) with no preprocessing step and slow individual distance inquiries. On the other hand are indexing-based approaches, which ma… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.00262  [pdf, other

    cs.DS cs.DC

    Improved Massively Parallel Triangle Counting in $O(1)$ Rounds

    Authors: Quanquan C. Liu, C. Seshadhri

    Abstract: In this short note, we give a novel algorithm for $O(1)$ round triangle counting in bounded arboricity graphs. Counting triangles in $O(1)$ rounds (exactly) is listed as one of the interesting remaining open problems in the recent survey of Im et al. [IKLMV23]. The previous paper of Biswas et al. [BELMR20], which achieved the best bounds under this setting, used $O(\log \log n)$ rounds in sublinea… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: To appear in PODC 2024

  3. arXiv:2311.09584  [pdf, other

    cs.DS cs.DM

    A Dichotomy Hierarchy Characterizing Linear Time Subgraph Counting in Bounded Degeneracy Graphs

    Authors: Daniel Paul-Pena, C. Seshadhri

    Abstract: Subgraph and homomorphism counting are fundamental algorithmic problems. Given a constant-sized pattern graph $H$ and a large input graph $G$, we wish to count the number of $H$-homomorphisms/subgraphs in $G$. Given the massive sizes of real-world graphs and the practical importance of counting problems, we focus on when (near) linear time algorithms are possible. The seminal work of Chiba-Nishize… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  4. arXiv:2304.01416  [pdf, other

    cs.DS

    A $d^{1/2+o(1)}$ Monotonicity Tester for Boolean Functions on $d$-Dimensional Hypergrids

    Authors: Hadley Black, Deeparnab Chakrabarty, C. Seshadhri

    Abstract: Monotonicity testing of Boolean functions on the hypergrid, $f:[n]^d \to \{0,1\}$, is a classic topic in property testing. Determining the non-adaptive complexity of this problem is an important open question. For arbitrary $n$, [Black-Chakrabarty-Seshadhri, SODA 2020] describe a tester with query complexity $\widetilde{O}(\varepsilon^{-4/3}d^{5/6})$. This complexity is independent of $n$, but has… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  5. arXiv:2303.14550  [pdf, other

    cs.SI math.OC

    Theoretical bounds on the network community profile from low-rank semi-definite programming

    Authors: Yufan Huang, C. Seshadhri, David F. Gleich

    Abstract: We study a new connection between a technical measure called $μ$-conductance that arises in the study of Markov chains for sampling convex bodies and the network community profile that characterizes size-resolved properties of clusters and communities in social and information networks. The idea of $μ$-conductance is similar to the traditional graph conductance, but disregards sets with small volu… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

  6. arXiv:2211.08605  [pdf, other

    cs.DS cs.DM

    A Dichotomy Theorem for Linear Time Homomorphism Orbit Counting in Bounded Degeneracy Graphs

    Authors: Daniel Paul-Pena, C. Seshadhri

    Abstract: Counting the number of homomorphisms of a pattern graph H in a large input graph G is a fundamental problem in computer science. There are myriad applications of this problem in databases, graph algorithms, and network science. Often, we need more than just the total count. Especially in large network analysis, we wish to compute, for each vertex v of G, the number of H-homomorphisms that v partic… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  7. arXiv:2211.06352  [pdf, other

    cs.SI cs.DM cs.DS

    Spectral Triadic Decompositions of Real-World Networks

    Authors: Sabyasachi Basu, Suman Kalyan Bera, C. Seshadhri

    Abstract: A fundamental problem in mathematics and network analysis is to find conditions under which a graph can be partitioned into smaller pieces. The most important tool for this partitioning is the Fiedler vector or discrete Cheeger inequality. These results relate the graph spectrum (eigenvalues of the normalized adjacency matrix) to the ability to break a graph into two pieces, with few edge deletion… ▽ More

    Submitted 8 May, 2024; v1 submitted 11 November, 2022; originally announced November 2022.

  8. arXiv:2211.05281  [pdf, other

    cs.DS cs.DM

    Directed Isoperimetric Theorems for Boolean Functions on the Hypergrid and an $\widetilde{O}(n\sqrt{d})$ Monotonicity Tester

    Authors: Hadley Black, Deeparnab Chakrabarty, C. Seshadhri

    Abstract: The problem of testing monotonicity for Boolean functions on the hypergrid, $f:[n]^d \to \{0,1\}$ is a classic topic in property testing. When $n=2$, the domain is the hypercube. For the hypercube case, a breakthrough result of Khot-Minzer-Safra (FOCS 2015) gave a non-adaptive, one-sided tester making $\widetilde{O}(\varepsilon^{-2}\sqrt{d})$ queries. Up to polylog $d$ and $\varepsilon$ factors, t… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  9. arXiv:2201.08481  [pdf, other

    cs.SI cs.AI cs.DS

    Classic Graph Structural Features Outperform Factorization-Based Graph Embedding Methods on Community Labeling

    Authors: Andrew Stolman, Caleb Levy, C. Seshadhri, Aneesh Sharma

    Abstract: Graph representation learning (also called graph embeddings) is a popular technique for incorporating network structure into machine learning models. Unsupervised graph embedding methods aim to capture graph structure by learning a low-dimensional vector representation (the embedding) for each node. Despite the widespread use of these embeddings for a variety of downstream transductive machine lea… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  10. The complexity of testing all properties of planar graphs, and the role of isomorphism

    Authors: Sabyasachi Basu, Akash Kumar, C. Seshadhri

    Abstract: Consider property testing on bounded degree graphs and let $\varepsilon>0$ denote the proximity parameter. A remarkable theorem of Newman-Sohler (SICOMP 2013) asserts that all properties of planar graphs (more generally hyperfinite) are testable with query complexity only depending on $\varepsilon$. Recent advances in testing minor-freeness have proven that all additive and monotone properties of… ▽ More

    Submitted 25 August, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

    Journal ref: Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), 1702-1714

  11. arXiv:2106.02762  [pdf, other

    cs.SI cs.DS

    Faster and Generalized Temporal Triangle Counting, via Degeneracy Ordering

    Authors: Noujan Pashanasangi, C. Seshadhri

    Abstract: Triangle counting is a fundamental technique in network analysis, that has received much attention in various input models. The vast majority of triangle counting algorithms are targeted to static graphs. Yet, many real-world graphs are directed and temporal, where edges come with timestamps. Temporal triangles yield much more information, since they account for both the graph topology and the tim… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: To be published in KDD 2021

  12. arXiv:2104.11079  [pdf, other

    cs.AI cs.CE

    Randomized Algorithms for Scientific Computing (RASC)

    Authors: Aydin Buluc, Tamara G. Kolda, Stefan M. Wild, Mihai Anitescu, Anthony DeGennaro, John Jakeman, Chandrika Kamath, Ramakrishnan Kannan, Miles E. Lopes, Per-Gunnar Martinsson, Kary Myers, Jelani Nelson, Juan M. Restrepo, C. Seshadhri, Draguna Vrabie, Brendt Wohlberg, Stephen J. Wright, Chao Yang, Peter Zwart

    Abstract: Randomized algorithms have propelled advances in artificial intelligence and represent a foundational research area in advancing AI for Science. Future advancements in DOE Office of Science priority areas such as climate science, astrophysics, fusion, advanced materials, combustion, and quantum computing all require randomized algorithms for surmounting challenges of complexity, robustness, and sc… ▽ More

    Submitted 21 March, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

  13. arXiv:2102.00556  [pdf, ps, other

    cs.DS

    Random walks and forbidden minors III: poly(d/ε)-time partition oracles for minor-free graph classes

    Authors: Akash Kumar, C. Seshadhri, Andrew Stolman

    Abstract: Consider the family of bounded degree graphs in any minor-closed family (such as planar graphs). Let d be the degree bound and n be the number of vertices of such a graph. Graphs in these classes have hyperfinite decompositions, where, for a sufficiently small \e > 0, one removes \edn edges to get connected components of size independent of n. An important tool for sublinear algorithms and propert… ▽ More

    Submitted 2 May, 2021; v1 submitted 31 January, 2021; originally announced February 2021.

    Comments: 31 pages

  14. arXiv:2010.08083  [pdf, ps, other

    cs.DS cs.CC

    Near-Linear Time Homomorphism Counting in Bounded Degeneracy Graphs: The Barrier of Long Induced Cycles

    Authors: Suman K. Bera, Noujan Pashanasangi, C. Seshadhri

    Abstract: Counting homomorphisms of a constant sized pattern graph $H$ in an input graph $G$ is a fundamental computational problem. There is a rich history of studying the complexity of this problem, under various constraints on the input $G$ and the pattern $H$. Given the significance of this problem and the large sizes of modern inputs, we investigate when near-linear time algorithms are possible. We foc… ▽ More

    Submitted 18 November, 2020; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: To be published in Symposium on Discrete Algorithms (SODA) 2021 Added conclusion section in the new version

  15. arXiv:2010.05998  [pdf, ps, other

    cs.CC cs.DS math.CO

    Counting Subgraphs in Degenerate Graphs

    Authors: Suman K. Bera, Lior Gishboliner, Yevgeny Levanzov, C. Seshadhri, Asaf Shapira

    Abstract: We consider the problem of counting the number of copies of a fixed graph $H$ within an input graph $G$. This is one of the most well-studied algorithmic graph problems, with many theoretical and practical applications. We focus on solving this problem when the input $G$ has bounded degeneracy. This is a rich family of graphs, containing all graphs without a fixed minor (e.g. planar graphs), as we… ▽ More

    Submitted 9 December, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

  16. arXiv:2007.15743  [pdf, ps, other

    cs.DS cs.SI

    Distribution-Free Models of Social Networks

    Authors: Tim Roughgarden, C. Seshadhri

    Abstract: The structure of large-scale social networks has predominantly been articulated using generative models, a form of average-case analysis. This chapter surveys recent proposals of more robust models of such networks. These models posit deterministic and empirically supported combinatorial structure rather than a specific probability distribution. We discuss the formal definitions of these models an… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

    Comments: Chapter 28 of the book Beyond the Worst-Case Analysis of Algorithms, edited by Tim Roughgarden and published by Cambridge University Press (2020)

  17. arXiv:2007.09768  [pdf, other

    math.CO cs.DM cs.DS

    FPT Algorithms for Finding Near-Cliques in $c$-Closed Graphs

    Authors: Balaram Behera, Edin Husić, Shweta Jain, Tim Roughgarden, C. Seshadhri

    Abstract: Finding large cliques or cliques missing a few edges is a fundamental algorithmic task in the study of real-world graphs, with applications in community detection, pattern recognition, and clustering. A number of effective backtracking-based heuristics for these problems have emerged from recent empirical work in social network analysis. Given the NP-hardness of variants of clique counting, these… ▽ More

    Submitted 19 November, 2021; v1 submitted 19 July, 2020; originally announced July 2020.

    Comments: Accepted to ITCS 2022

    MSC Class: 68W01; 68R10; 05C85

  18. Provably and Efficiently Approximating Near-cliques using the Turán Shadow: PEANUTS

    Authors: Shweta Jain, C. Seshadhri

    Abstract: Clique and near-clique counts are important graph properties with applications in graph generation, graph modeling, graph analytics, community detection among others. They are the archetypal examples of dense subgraphs. While there are several different definitions of near-cliques, most of them share the attribute that they are cliques that are missing a small number of edges. Clique counting is i… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: The Web Conference, 2020 (WWW)

  19. arXiv:2006.11947  [pdf, other

    cs.DS

    How to Count Triangles, without Seeing the Whole Graph

    Authors: Suman K. Bera, C. Seshadhri

    Abstract: Triangle counting is a fundamental problem in the analysis of large graphs. There is a rich body of work on this problem, in varying streaming and distributed models, yet all these algorithms require reading the whole input graph. In many scenarios, we do not have access to the whole graph, and can only sample a small portion of the graph (typically through crawling). In such a setting, how can we… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in KDD 2020

  20. arXiv:2003.13151  [pdf, ps, other

    cs.DS

    How the Degeneracy Helps for Triangle Counting in Graph Streams

    Authors: Suman K. Bera, C. Seshadhri

    Abstract: We revisit the well-studied problem of triangle count estimation in graph streams. Given a graph represented as a stream of $m$ edges, our aim is to compute a $(1\pm\varepsilon)$-approximation to the triangle count $T$, using a small space algorithm. For arbitrary order and a constant number of passes, the space complexity is known to be essentially $Θ(\min(m^{3/2}/T, m/\sqrt{T}))$ (McGregor et al… ▽ More

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: Accepted for publication in PODS'2020

  21. arXiv:2003.12635  [pdf, other

    cs.LG cs.DM cs.SI stat.ML

    The impossibility of low rank representations for triangle-rich complex networks

    Authors: C. Seshadhri, Aneesh Sharma, Andrew Stolman, Ashish Goel

    Abstract: The study of complex networks is a significant development in modern science, and has enriched the social sciences, biology, physics, and computer science. Models and algorithms for such networks are pervasive in our society, and impact human behavior via social networks, search engines, and recommender systems to name a few. A widely used algorithmic technique for modeling such complex networks i… ▽ More

    Submitted 27 March, 2020; originally announced March 2020.

    Journal ref: PNAS, March 2020

  22. The Power of Pivoting for Exact Clique Counting

    Authors: Shweta Jain, C. Seshadhri

    Abstract: Clique counting is a fundamental task in network analysis, and even the simplest setting of $3$-cliques (triangles) has been the center of much recent research. Getting the count of $k$-cliques for larger $k$ is algorithmically challenging, due to the exponential blowup in the search space of large cliques. But a number of recent applications (especially for community detection or clustering) use… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: 10 pages, WSDM 2020

  23. arXiv:1911.10616  [pdf, other

    cs.SI cs.DS

    Efficiently Counting Vertex Orbits of All 5-vertex Subgraphs, by EVOKE

    Authors: Noujan Pashanasangi, C. Seshadhri

    Abstract: Subgraph counting is a fundamental task in network analysis. Typically, algorithmic work is on total counting, where we wish to count the total frequency of a (small) pattern subgraph in a large input data set. But many applications require local counts (also called vertex orbit counts) wherein, for every vertex $v$ of the input graph, one needs the count of the pattern subgraph involving $v$. Thi… ▽ More

    Submitted 12 December, 2019; v1 submitted 24 November, 2019; originally announced November 2019.

    Comments: We replaced the previous version with the full version

  24. arXiv:1911.05896  [pdf, ps, other

    cs.DS

    Linear Time Subgraph Counting, Graph Degeneracy, and the Chasm at Size Six

    Authors: Suman K. Bera, Noujan Pashanasangi, C. Seshadhri

    Abstract: We consider the problem of counting all $k$-vertex subgraphs in an input graph, for any constant $k$. This problem (denoted sub-cnt$_k$) has been studied extensively in both theory and practice. In a classic result, Chiba and Nishizeki (SICOMP 85) gave linear time algorithms for clique and 4-cycle counting for bounded degeneracy graphs. This is a rich class of sparse graphs that contains, for exam… ▽ More

    Submitted 27 November, 2019; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: The previous version did not handle the case of k=8. We corrected that in this version

  25. arXiv:1904.01055  [pdf, ps, other

    cs.DM math.CO

    Random walks and forbidden minors II: A $\text{poly}(d\varepsilon^{-1})$-query tester for minor-closed properties of bounded-degree graphs

    Authors: Akash Kumar, C. Seshadhri, Andrew Stolman

    Abstract: Let $G$ be a graph with $n$ vertices and maximum degree $d$. Fix some minor-closed property $\mathcal{P}$ (such as planarity). We say that $G$ is $\varepsilon$-far from $\mathcal{P}$ if one has to remove $\varepsilon dn$ edges to make it have $\mathcal{P}$. The problem of property testing $\mathcal{P}$ was introduced in the seminal work of Benjamini-Schramm-Shapira (STOC 2008) that gave a tester w… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

  26. arXiv:1811.04425  [pdf, ps, other

    cs.DS

    Faster sublinear approximations of $k$-cliques for low arboricity graphs

    Authors: Talya Eden, Dana Ron, C. Seshadhri

    Abstract: Given query access to an undirected graph $G$, we consider the problem of computing a $(1\pmε)$-approximation of the number of $k$-cliques in $G$. The standard query model for general graphs allows for degree queries, neighbor queries, and pair queries. Let $n$ be the number of vertices, $m$ be the number of edges, and $n_k$ be the number of $k$-cliques. Previous work by Eden, Ron and Seshadhri (S… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

  27. arXiv:1811.01427  [pdf, other

    cs.DM cs.CC

    Domain Reduction for Monotonicity Testing: A $o(d)$ Tester for Boolean Functions in $d$-Dimensions

    Authors: Hadley Black, Deeparnab Chakrabarty, C. Seshadhri

    Abstract: We describe a $\tilde{O}(d^{5/6})$-query monotonicity tester for Boolean functions $f:[n]^d \to \{0,1\}$ on the $n$-hypergrid. This is the first $o(d)$ monotonicity tester with query complexity independent of $n$. Motivated by this independence of $n$, we initiate the study of monotonicity testing of measurable Boolean functions $f:\mathbb{R}^d \to \{0,1\}$ over the continuous domain, where the di… ▽ More

    Submitted 9 December, 2019; v1 submitted 4 November, 2018; originally announced November 2018.

  28. arXiv:1805.08187  [pdf, ps, other

    cs.DM cs.DS

    Finding forbidden minors in sublinear time: a $n^{1/2+o(1)}$-query one-sided tester for minor closed properties on bounded degree graphs

    Authors: Akash Kumar, C. Seshadhri, Andrew Stolman

    Abstract: Let $G$ be an undirected, bounded degree graph with $n$ vertices. Fix a finite graph $H$, and suppose one must remove $\varepsilon n$ edges from $G$ to make it $H$-minor free (for some small constant $\varepsilon > 0$). We give an $n^{1/2+o(1)}$-time randomized procedure that, with high probability, finds an $H$-minor in such a graph. As an application, suppose one must remove $\varepsilon n$ edge… ▽ More

    Submitted 27 August, 2018; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: 31 pages

  29. arXiv:1804.07431  [pdf, other

    math.CO cs.DM cs.DS cs.SI

    Finding Cliques in Social Networks: A New Distribution-Free Model

    Authors: Jacob Fox, Tim Roughgarden, C. Seshadhri, Fan Wei, Nicole Wein

    Abstract: We propose a new distribution-free model of social networks. Our definitions are motivated by one of the most universal signatures of social networks, triadic closure---the property that pairs of vertices with common neighbors tend to be adjacent. Our most basic definition is that of a "$c$-closed" graph, where for every pair of vertices $u,v$ with at least $c$ common neighbors, $u$ and $v$ are ad… ▽ More

    Submitted 19 April, 2018; originally announced April 2018.

    Comments: main text 13 pages; 2 figures; appendix 9 pages

    MSC Class: 68W01; 68R10; 05C85; 05D99

  30. arXiv:1801.02816  [pdf, ps, other

    cs.DS cs.CC cs.DM

    Adaptive Boolean Monotonicity Testing in Total Influence Time

    Authors: Deeparnab Chakrabarty, C. Seshadhri

    Abstract: The problem of testing monotonicity of a Boolean function $f:\{0,1\}^n \to \{0,1\}$ has received much attention recently. Denoting the proximity parameter by $\varepsilon$, the best tester is the non-adaptive $\widetilde{O}(\sqrt{n}/\varepsilon^2)$ tester of Khot-Minzer-Safra (FOCS 2015). Let $I(f)$ denote the total influence of $f$. We give an adaptive tester whose running time is… ▽ More

    Submitted 9 January, 2018; originally announced January 2018.

  31. arXiv:1710.10545  [pdf, other

    cs.DM cs.CC cs.DS

    A $o(d) \cdot \text{polylog}~n$ Monotonicity Tester for Boolean Functions over the Hypergrid $[n]^d$

    Authors: Hadley Black, Deeparnab Chakrabarty, C. Seshadhri

    Abstract: We study monotonicity testing of Boolean functions over the hypergrid $[n]^d$ and design a non-adaptive tester with $1$-sided error whose query complexity is $\tilde{O}(d^{5/6})\cdot \text{poly}(\log n,1/ε)$. Previous to our work, the best known testers had query complexity linear in $d$ but independent of $n$. We improve upon these testers as long as $n = 2^{d^{o(1)}}$. To obtain our results, w… ▽ More

    Submitted 28 October, 2017; originally announced October 2017.

  32. arXiv:1710.08607  [pdf, other

    cs.SI cs.DS math.ST

    Provable and practical approximations for the degree distribution using sublinear graph samples

    Authors: Talya Eden, Shweta Jain, Ali Pinar, Dana Ron, C. Seshadhri

    Abstract: The degree distribution is one of the most fundamental properties used in the analysis of massive graphs. There is a large literature on graph sampling, where the goal is to estimate properties (especially the degree distribution) of a large graph through a small, random sample. The degree distribution estimation poses a significant challenge, due to its heavy-tailed nature and the large variance… ▽ More

    Submitted 28 August, 2018; v1 submitted 24 October, 2017; originally announced October 2017.

    Comments: Longer version of the WWW 2018 submission

  33. arXiv:1707.04858  [pdf, ps, other

    cs.DS

    On Approximating the Number of $k$-cliques in Sublinear Time

    Authors: Talya Eden, Dana Ron, C. Seshadhri

    Abstract: We study the problem of approximating the number of $k$-cliques in a graph when given query access to the graph. We consider the standard query model for general graphs via (1) degree queries, (2) neighbor queries and (3) pair queries. Let $n$ denote the number of vertices in the graph, $m$ the number of edges, and $C_k$ the number of $k$-cliques. We design an algorithm that outputs a… ▽ More

    Submitted 12 March, 2018; v1 submitted 16 July, 2017; originally announced July 2017.

  34. arXiv:1706.00053  [pdf, ps, other

    cs.CC cs.DM

    A Lower Bound for Nonadaptive, One-Sided Error Testing of Unateness of Boolean Functions over the Hypercube

    Authors: Roksana Baleshzar, Deeparnab Chakrabarty, Ramesh Krishnan S. Pallavoor, Sofya Raskhodnikova, C. Seshadhri

    Abstract: A Boolean function $f:\{0,1\}^d \mapsto \{0,1\}$ is unate if, along each coordinate, the function is either nondecreasing or nonincreasing. In this note, we prove that any nonadaptive, one-sided error unateness tester must make $Ω(\frac{d}{\log d})$ queries. This result improves upon the $Ω(\frac{d}{\log^2 d})$ lower bound for the same class of testers due to Chen et al. (STOC, 2017).

    Submitted 31 May, 2017; originally announced June 2017.

  35. Local Algorithms for Hierarchical Dense Subgraph Discovery

    Authors: Ahmet Erdem Sariyuce, C. Seshadhri, Ali Pinar

    Abstract: Finding the dense regions of a graph and relations among them is a fundamental problem in network analysis. Core and truss decompositions reveal dense subgraphs with hierarchical relations. The incremental nature of algorithms for computing these decompositions and the need for global information at each step of the algorithm hinders scalable parallelization and approximations since the densest re… ▽ More

    Submitted 14 September, 2018; v1 submitted 2 April, 2017; originally announced April 2017.

  36. arXiv:1703.05199  [pdf, ps, other

    cs.DS cs.DM

    Optimal Unateness Testers for Real-Valued Functions: Adaptivity Helps

    Authors: Roksana Baleshzar, Deeparnab Chakrabarty, Ramesh Krishnan S. Pallavoor, Sofya Raskhodnikova, C. Seshadhri

    Abstract: We study the problem of testing unateness of functions $f:\{0,1\}^d \to \mathbb{R}.$ We give a $O(\frac{d}ε \cdot \log\frac{d}ε)$-query nonadaptive tester and a $O(\frac{d}ε)$-query adaptive tester and show that both testers are optimal for a fixed distance parameter $ε$. Previously known unateness testers worked only for Boolean functions, and their query complexity had worse dependence on the di… ▽ More

    Submitted 15 March, 2017; originally announced March 2017.

  37. arXiv:1703.01054  [pdf, other

    cs.SI cs.DC cs.DS

    When Hashes Met Wedges: A Distributed Algorithm for Finding High Similarity Vectors

    Authors: Aneesh Sharma, C. Seshadhri, Ashish Goel

    Abstract: Finding similar user pairs is a fundamental task in social networks, with numerous applications in ranking and personalization tasks such as link prediction and tie strength detection. A common manifestation of user similarity is based upon network structure: each user is represented by a vector that represents the user's network connections, where pairwise cosine similarity among these vectors de… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

  38. arXiv:1611.05561  [pdf, other

    cs.SI cs.DS

    A Fast and Provable Method for Estimating Clique Counts Using Turán's Theorem

    Authors: Shweta Jain, C. Seshadhri

    Abstract: Clique counts reveal important properties about the structure of massive graphs, especially social networks. The simple setting of just 3-cliques (triangles) has received much attention from the research community. For larger cliques (even, say 6-cliques) the problem quickly becomes intractable because of combinatorial explosion. Most methods used for triangle counting do not scale for large cliqu… ▽ More

    Submitted 28 August, 2018; v1 submitted 16 November, 2016; originally announced November 2016.

    Comments: Added a link to the code

  39. arXiv:1610.09411  [pdf, other

    cs.SI cs.DS

    ESCAPE: Efficiently Counting All 5-Vertex Subgraphs

    Authors: Ali Pinar, C. Seshadhri, V. Vishal

    Abstract: Counting the frequency of small subgraphs is a fundamental technique in network analysis across various domains, most notably in bioinformatics and social networks. The special case of triangle counting has received much attention. Getting results for 4-vertex or 5-vertex patterns is highly challenging, and there are few practical results known that can scale to massive sizes. We introduce an al… ▽ More

    Submitted 28 October, 2016; originally announced October 2016.

  40. arXiv:1608.06980  [pdf, ps, other

    cs.DS cs.DM

    A $\widetilde{O}(n)$ Non-Adaptive Tester for Unateness

    Authors: Deeparnab Chakrabarty, C. Seshadhri

    Abstract: Khot and Shinkar (RANDOM, 2016) recently describe an adaptive, $O(n \log(n)/\varepsilon)$-query tester for unateness of Boolean functions $f:\{0,1\}^n \to \{0,1\}$. In this note we describe a simple non-adaptive, $O(n \log(n/\varepsilon)/\varepsilon)$ -query tester for unateness for functions over the hypercube with any ordered range.

    Submitted 2 September, 2016; v1 submitted 24 August, 2016; originally announced August 2016.

    Comments: We mention the relation of our algorithm to Levin's investment strategy, as pointed out by Oded Goldreich

  41. arXiv:1604.03661  [pdf, ps, other

    cs.DS cs.DM

    Sublinear Time Estimation of Degree Distribution Moments: The Degeneracy Connection

    Authors: Talya Eden, Dana Ron, C. Seshadhri

    Abstract: We revisit the classic problem of estimating the degree distribution moments of an undirected graph. Consider an undirected graph $G=(V,E)$ with $n$ vertices, and define (for $s > 0$) $μ_s = \frac{1}{n}\cdot\sum_{v \in V} d^s_v$. Our aim is to estimate $μ_s$ within a multiplicative error of $(1+ε)$ (for a given approximation parameter $ε>0$) in sublinear time. We consider the sparse graph model th… ▽ More

    Submitted 16 February, 2017; v1 submitted 13 April, 2016; originally announced April 2016.

  42. arXiv:1506.08258  [pdf, other

    cs.CE cs.DC cs.MS

    Trigger detection for adaptive scientific workflows using percentile sampling

    Authors: Janine C. Bennett, Ankit Bhagatwala, Jacqueline H. Chen, C. Seshadhri, Ali Pinar, Maher Salloum

    Abstract: Increasing complexity of scientific simulations and HPC architectures are driving the need for adaptive workflows, where the composition and execution of computational and data manipulation steps dynamically depend on the evolutionary state of the simulation itself. Consider for example, the frequency of data storage. Critical phases of the simulation should be captured with high frequency and wit… ▽ More

    Submitted 27 June, 2015; originally announced June 2015.

  43. Diamond Sampling for Approximate Maximum All-pairs Dot-product (MAD) Search

    Authors: Grey Ballard, Ali Pinar, Tamara G. Kolda, C. Seshadhri

    Abstract: Given two sets of vectors, $A = \{{a_1}, \dots, {a_m}\}$ and $B=\{{b_1},\dots,{b_n}\}$, our problem is to find the top-$t$ dot products, i.e., the largest $|{a_i}\cdot{b_j}|$ among all possible pairs. This is a fundamental mathematical problem that appears in numerous data applications involving similarity search, link prediction, and collaborative filtering. We propose a sampling-based approach t… ▽ More

    Submitted 18 June, 2015; v1 submitted 11 June, 2015; originally announced June 2015.

    Journal ref: ICDM 2015: Proceedings of the 2015 IEEE International Conference on Data Mining, pp. 11-20, November 2015

  44. arXiv:1506.02574  [pdf, other

    cs.DS

    Catching the head, tail, and everything in between: a streaming algorithm for the degree distribution

    Authors: Olivia Simpson, C. Seshadhri, Andrew McGregor

    Abstract: The degree distribution is one of the most fundamental graph properties of interest for real-world graphs. It has been widely observed in numerous domains that graphs typically have a tailed or scale-free degree distribution. While the average degree is usually quite small, the variance is quite high and there are vertices with degrees at all scales. We focus on the problem of approximating the de… ▽ More

    Submitted 25 November, 2015; v1 submitted 8 June, 2015; originally announced June 2015.

  45. arXiv:1505.01927  [pdf, ps, other

    cs.DS cs.DM cs.SI

    A simpler sublinear algorithm for approximating the triangle count

    Authors: C. Seshadhri

    Abstract: A recent result of Eden, Levi, and Ron (ECCC 2015) provides a sublinear time algorithm to estimate the number of triangles in a graph. Given an undirected graph $G$, one can query the degree of a vertex, the existence of an edge between vertices, and the $i$th neighbor of a vertex. Suppose the graph has $n$ vertices, $m$ edges, and $t$ triangles. In this model, Eden et al provided a… ▽ More

    Submitted 8 May, 2015; originally announced May 2015.

  46. Approximately Counting Triangles in Sublinear Time

    Authors: Talya Eden, Amit Levi, Dana Ron, C. Seshadhri

    Abstract: We consider the problem of estimating the number of triangles in a graph. This problem has been extensively studied in both theory and practice, but all existing algorithms read the entire graph. In this work we design a {\em sublinear-time\/} algorithm for approximating the number of triangles in a graph, where the algorithm is given query access to the graph. The allowed queries are degree queri… ▽ More

    Submitted 22 September, 2015; v1 submitted 3 April, 2015; originally announced April 2015.

    Comments: To appear in the 56th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2015)

  47. arXiv:1411.4942  [pdf, other

    cs.DS cs.DM cs.SI

    Path Sampling: A Fast and Provable Method for Estimating 4-Vertex Subgraph Counts

    Authors: Madhav Jha, C. Seshadhri, Ali Pinar

    Abstract: Counting the frequency of small subgraphs is a fundamental technique in network analysis across various domains, most notably in bioinformatics and social networks. The special case of triangle counting has received much attention. Getting results for 4-vertex patterns is highly challenging, and there are few practical results known that can scale to massive sizes. Indeed, even a highly tuned enum… ▽ More

    Submitted 18 November, 2014; originally announced November 2014.

  48. arXiv:1411.3312  [pdf, other

    cs.SI

    Finding the Hierarchy of Dense Subgraphs using Nucleus Decompositions

    Authors: Ahmet Erdem Sariyuce, C. Seshadhri, Ali Pinar, Umit V. Catalyurek

    Abstract: Finding dense substructures in a graph is a fundamental graph mining operation, with applications in bioinformatics, social networks, and visualization to name a few. Yet most standard formulations of this problem (like clique, quasiclique, k-densest subgraph) are NP-hard. Furthermore, the goal is rarely to find the "true optimum", but to identify many (if not all) dense substructures, understand… ▽ More

    Submitted 9 March, 2015; v1 submitted 12 November, 2014; originally announced November 2014.

  49. arXiv:1411.2689  [pdf, other

    cs.CG

    Avoiding the Global Sort: A Faster Contour Tree Algorithm

    Authors: Benjamin Raichel, C. Seshadhri

    Abstract: We revisit the classical problem of computing the \emph{contour tree} of a scalar field $f:\mathbb{M} \to \mathbb{R}$, where $\mathbb{M}$ is a triangulated simplicial mesh in $\mathbb{R}^d$. The contour tree is a fundamental topological structure that tracks the evolution of level sets of $f$ and has numerous applications in data analysis and visualization. All existing algorithms begin with a g… ▽ More

    Submitted 10 December, 2015; v1 submitted 10 November, 2014; originally announced November 2014.

  50. arXiv:1409.4360  [pdf, other

    nlin.CD cs.DM nlin.CG

    Characterizing short-term stability for Boolean networks over any distribution of transfer functions

    Authors: C. Seshadhri, Andrew M. Smith, Yevgeniy Vorobeychik, Jackson Mayo, Robert C. Armstrong

    Abstract: We present a characterization of short-term stability of random Boolean networks under \emph{arbitrary} distributions of transfer functions. Given any distribution of transfer functions for a random Boolean network, we present a formula that decides whether short-term chaos (damage spreading) will happen. We provide a formal proof for this formula, and empirically show that its predictions are acc… ▽ More

    Submitted 15 September, 2014; originally announced September 2014.

    Journal ref: Phys. Rev. E 94, 012301 (2016)