-
Towards analyzing large graphs with quantum annealing and quantum gate computers
Abstract: The use of quantum computing in graph community detection and regularity checking related to Szemeredi's Regularity Lemma (SRL) are demonstrated with D-Wave Systems' quantum annealer and simulations. We demonstrate the capability of quantum computing in solving hard problems relevant to big data. A new community detection algorithm based on SRL is also introduced and tested. In worst case scenario… ▽ More
Submitted 30 June, 2020; originally announced June 2020.
Comments: Extended version of a conference paper, IEEE BigData 2019, Los Angeles U.S.A. International Journal of Data Mining Science (IJDAT), 2020
ACM Class: I.5
-
Analysis of large sparse graphs using regular decomposition of graph distance matrices
Abstract: Statistical analysis of large and sparse graphs is a challenging problem in data science due to the high dimensionality and nonlinearity of the problem. This paper presents a fast and scalable algorithm for partitioning such graphs into disjoint groups based on observed graph distances from a set of reference nodes. The resulting partition provides a low-dimensional approximation of the full dista… ▽ More
Submitted 21 December, 2018; v1 submitted 26 November, 2018; originally announced November 2018.
Comments: IEEE BigData 2018 Conference Workshop, Advances in High Dimensional Big Data, 10.-13.12. 2018, Seattle USA
-
Regular decomposition of large graphs and other structures: scalability and robustness towards missing data
Abstract: A method for compression of large graphs and matrices to a block structure is further developed. Szemerédi's regularity lemma is used as a generic motivation of the significance of stochastic block models. Another ingredient of the method is Rissanen's minimum description length principle (MDL). We continue our previous work on the subject, considering cases of missing data and scaling of algorith… ▽ More
Submitted 23 November, 2017; originally announced November 2017.
Comments: Accepted for publication in: Fourth International Workshop on High Performance Big Graph Data Management, Analysis, and Mining, December 11, 2017, Bosto U.S.A
-
Regular Decomposition: an information and graph theoretic approach to stochastic block models
Abstract: A method for compression of large graphs and non-negative matrices to a block structure is proposed. Szemerédi's regularity lemma is used as heuristic motivation of the significance of stochastic block models. Another ingredient of the method is Rissanen's minimum description length principle (MDL). We propose practical algorithms and provide theoretical results on the accuracy of the method.
Submitted 12 August, 2019; v1 submitted 24 April, 2017; originally announced April 2017.
Comments: Simulation example added. Poisson block model code length estimates changed
MSC Class: 68P30
-
On the stability of two-chunk file-sharing systems
Abstract: We consider five different peer-to-peer file sharing systems with two chunks, with the aim of finding chunk selection algorithms that have provably stable performance with any input rate and assuming non-altruistic peers who leave the system immediately after downloading the second chunk. We show that many algorithms that first looked promising lead to unstable or oscillating behavior. However,… ▽ More
Submitted 29 October, 2009; originally announced October 2009.
Comments: 19 pages, 7 figures
MSC Class: 60K25; 68M14
Journal ref: Queueing Systems (2011) 67: 183