Predicting Graph Categories from Structural Properties
Authors:
James P. Canning,
Emma E. Ingram,
Sammantha Nowak-Wolff,
Adriana M. Ortiz,
Nesreen K. Ahmed,
Ryan A. Rossi,
Karl R. B. Schmitt,
Sucheta Soundarajan
Abstract:
This paper has been withdrawn from arXiv.org due to a disagreement among the authors related to several peer-review comments received prior to submission on arXiv.org. Even though the current version of this paper is withdrawn, there was no disagreement between authors on the novel work in this paper.
One specific issue was the discussion of related work by Ikehara \& Clauset (found on page 8 of…
▽ More
This paper has been withdrawn from arXiv.org due to a disagreement among the authors related to several peer-review comments received prior to submission on arXiv.org. Even though the current version of this paper is withdrawn, there was no disagreement between authors on the novel work in this paper.
One specific issue was the discussion of related work by Ikehara \& Clauset (found on page 8 of the previously posted version). Peer-review comments on a similar version made ALL authors aware that the discussion misrepresented their work prior to submission to arXiv.org. However, some authors choose to post to arXiv a minimally updated version without the consent of all authors or properly addressing this attribution issue.
================
Original Paper Abstract:
Complex networks are often categorized according to the underlying phenomena that they represent such as molecular interactions, re-tweets, and brain activity. In this work, we investigate the problem of predicting the category (domain) of arbitrary networks. This includes complex networks from different domains as well as synthetically generated graphs from five different network models. A classification accuracy of $96.6\%$ is achieved using a random forest classifier with both real and synthetic networks. This work makes two important findings. First, our results indicate that complex networks from various domains have distinct structural properties that allow us to predict with high accuracy the category of a new previously unseen network. Second, synthetic graphs are trivial to classify as the classification model can predict with near-certainty the network model used to generate it. Overall, the results demonstrate that networks drawn from different domains (and network models) are trivial to distinguish using only a handful of simple structural properties.
△ Less
Submitted 1 August, 2019; v1 submitted 7 May, 2018;
originally announced May 2018.
Network Classification and Categorization
Authors:
James P. Canning,
Emma E. Ingram,
Sammantha Nowak-Wolff,
Adriana M. Ortiz,
Nesreen K. Ahmed,
Ryan A. Rossi,
Karl R. B. Schmitt,
Sucheta Soundarajan
Abstract:
To the best of our knowledge, this paper presents the first large-scale study that tests whether network categories (e.g., social networks vs. web graphs) are distinguishable from one another (using both categories of real-world networks and synthetic graphs). A classification accuracy of $94.2\%$ was achieved using a random forest classifier with both real and synthetic networks. This work makes…
▽ More
To the best of our knowledge, this paper presents the first large-scale study that tests whether network categories (e.g., social networks vs. web graphs) are distinguishable from one another (using both categories of real-world networks and synthetic graphs). A classification accuracy of $94.2\%$ was achieved using a random forest classifier with both real and synthetic networks. This work makes two important findings. First, real-world networks from various domains have distinct structural properties that allow us to predict with high accuracy the category of an arbitrary network. Second, classifying synthetic networks is trivial as our models can easily distinguish between synthetic graphs and the real-world networks they are supposed to model.
△ Less
Submitted 13 September, 2017;
originally announced September 2017.