-
Spectra of random graphs with community structure and arbitrary degrees
Authors:
Xiao Zhang,
Raj Rao Nadakuditi,
M. E. J. Newman
Abstract:
Using methods from random matrix theory researchers have recently calculated the full spectra of random networks with arbitrary degrees and with community structure. Both reveal interesting spectral features, including deviations from the Wigner semicircle distribution and phase transitions in the spectra of community structured networks. In this paper we generalize both calculations, giving a pre…
▽ More
Using methods from random matrix theory researchers have recently calculated the full spectra of random networks with arbitrary degrees and with community structure. Both reveal interesting spectral features, including deviations from the Wigner semicircle distribution and phase transitions in the spectra of community structured networks. In this paper we generalize both calculations, giving a prescription for calculating the spectrum of a network with both community structure and an arbitrary degree distribution. In general the spectrum has two parts, a continuous spectral band, which can depart strongly from the classic semicircle form, and a set of outlying eigenvalues that indicate the presence of communities.
△ Less
Submitted 30 September, 2013;
originally announced October 2013.
-
AdS-Sliced Flavor Branes and Adding Flavor to the Janus Solution
Authors:
Adam B. Clark,
Nathan Crossette,
George M. Newman,
Andrea Rommal
Abstract:
We implement D7 flavor branes in AdS-sliced coordinates on $AdS_5\times S^5$ with the ansatz that the brane fluctuates only in the warped ($μ$) direction in this slicing, which is particularly appropriate for studying the Janus solution. The natural field theory dual in this slicing is $\mathcal{N}=4$ super Yang-Mills on two copies of $AdS_4$. Branes extending from $μ=\pmπ/2$ can end at different…
▽ More
We implement D7 flavor branes in AdS-sliced coordinates on $AdS_5\times S^5$ with the ansatz that the brane fluctuates only in the warped ($μ$) direction in this slicing, which is particularly appropriate for studying the Janus solution. The natural field theory dual in this slicing is $\mathcal{N}=4$ super Yang-Mills on two copies of $AdS_4$. Branes extending from $μ=\pmπ/2$ can end at different locations, giving rise to quarks with piecewise constant mass on each $AdS_4$ half-space, jum** discontinuously between them. A second class of flavor brane solutions exists in this coordinate system, dubbed "continuous" flavor branes, that extend across the entire range of $μ$. We propose that the correct dual interpretation of "disconnected" flavor brane in this ansatz is a quark hypermultiplet with constant mass on one of the AdS$_4$ half-spaces with totally reflecting boundary conditions at the boundary of AdS$_4$; whereas the dual interpretation of a continuous flavor brane has totally transparent boundary conditions. Numerical studies indicate that AdS-sliced D7 flavor branes of both classes exhibit spontaneous chiral symmetry breaking, as non-zero vev persists for solutions of the equation of motion down to zero mass. Continuous flavor branes in this ansatz exhibit many other surprising behaviors: their masses seem to be capped at a modest value near $m=0.551$ in units of the inverse AdS radius, and there may be a phase transition between continuous branes of different configurations. We also numerically study quark states in Janus. The behavior of mass and vev is similar in Janus, including the existence of chiral symmetry breaking at zero mass, though qualitative features of the phase diagram change (sometimes significantly) as the Janus parameter $c_0$ increases.
△ Less
Submitted 2 October, 2013; v1 submitted 30 September, 2013;
originally announced September 2013.
-
The Brownian Net with Killing
Authors:
Charles M. Newman,
K. Ravishankar,
Emmanuel Schertzer
Abstract:
Motivated by its relevance for the study of perturbations of one-dimensional voter models, including stochastic Potts models at low temperature, we consider diffusively rescaled coalescing random walks with branching and killing. Our main result is convergence to a new continuum process, in which the random space-time paths of the Sun-Swart Brownian net are terminated at a Poisson cloud of killing…
▽ More
Motivated by its relevance for the study of perturbations of one-dimensional voter models, including stochastic Potts models at low temperature, we consider diffusively rescaled coalescing random walks with branching and killing. Our main result is convergence to a new continuum process, in which the random space-time paths of the Sun-Swart Brownian net are terminated at a Poisson cloud of killing points. We also prove existence of a percolation transition as the killing rate varies. Key issues for convergence are the relations of the discrete model killing points and their Poisson intensity measure to the continuum counterparts.
△ Less
Submitted 22 September, 2013;
originally announced September 2013.
-
Spectral community detection in sparse networks
Authors:
M. E. J. Newman
Abstract:
Spectral methods based on the eigenvectors of matrices are widely used in the analysis of network data, particularly for community detection and graph partitioning. Standard methods based on the adjacency matrix and related matrices, however, break down for very sparse networks, which includes many networks of practical interest. As a solution to this problem it has been recently proposed that we…
▽ More
Spectral methods based on the eigenvectors of matrices are widely used in the analysis of network data, particularly for community detection and graph partitioning. Standard methods based on the adjacency matrix and related matrices, however, break down for very sparse networks, which includes many networks of practical interest. As a solution to this problem it has been recently proposed that we focus instead on the spectrum of the non-backtracking matrix, an alternative matrix representation of a network that shows better behavior in the sparse limit. Inspired by this suggestion, we here make use of a relaxation method to derive a spectral community detection algorithm that works well even in the sparse regime where other methods break down. Interestingly, however, the matrix at the heart of the method, it turns out, is not exactly the non-backtracking matrix, but a variant of it with a somewhat different definition. We study the behavior of this variant matrix for both artificial and real-world networks and find it to have desirable properties, especially in the common case of networks with broad degree distributions, for which it appears to have a better behaved spectrum and eigenvectors than the original non-backtracking matrix.
△ Less
Submitted 29 August, 2013;
originally announced August 2013.
-
Spectral methods for network community detection and graph partitioning
Authors:
M. E. J. Newman
Abstract:
We consider three distinct and well studied problems concerning network structure: community detection by modularity maximization, community detection by statistical inference, and normalized-cut graph partitioning. Each of these problems can be tackled using spectral algorithms that make use of the eigenvectors of matrix representations of the network. We show that with certain choices of the fre…
▽ More
We consider three distinct and well studied problems concerning network structure: community detection by modularity maximization, community detection by statistical inference, and normalized-cut graph partitioning. Each of these problems can be tackled using spectral algorithms that make use of the eigenvectors of matrix representations of the network. We show that with certain choices of the free parameters appearing in these spectral algorithms the algorithms for all three problems are, in fact, identical, and hence that, at least within the spectral approximations used here, there is no difference between the modularity- and inference-based community detection methods, or between either and graph partitioning.
△ Less
Submitted 29 July, 2013;
originally announced July 2013.
-
Planar Ising magnetization field II. Properties of the critical and near-critical scaling limits
Authors:
Federico Camia,
Christophe Garban,
Charles M. Newman
Abstract:
In [CGN12], we proved that the renormalized critical Ising magnetization fields $Φ^a:= a^{15/8} \sum_{x\in a\, \Z^2} σ_x \, δ_x$ converge as $a\to 0$ to a random distribution that we denoted by $Φ^\infty$. The purpose of this paper is to establish some fundamental properties satisfied by this $Φ^\infty$ and the near-critical fields $Φ^{\infty,h}$. More precisely, we obtain the following results. \…
▽ More
In [CGN12], we proved that the renormalized critical Ising magnetization fields $Φ^a:= a^{15/8} \sum_{x\in a\, \Z^2} σ_x \, δ_x$ converge as $a\to 0$ to a random distribution that we denoted by $Φ^\infty$. The purpose of this paper is to establish some fundamental properties satisfied by this $Φ^\infty$ and the near-critical fields $Φ^{\infty,h}$. More precisely, we obtain the following results. \bi [(i)] If $A\subset \C$ is a smooth bounded domain and if $m=m_A := <{Φ^\infty, 1_A}$ denotes the limiting rescaled magnetization in $A$, then there is a constant $c=c_A>0$ such that {equation*} \log \Pb{m > x} \underset{x\to \infty}{\sim} -c \; x^{16}\,.{equation*} In particular, this provides an alternative proof that the field $Φ^\infty$ is non-Gaussian (another proof of this fact would use the $n$-point correlation functions established in \cite{CHI} which do not satisfy Wick's formula). [(ii)] The random variable $m=m_A$ has a smooth {\it density} and one has more precisely the following bound on its Fourier transform: $|\Eb{e^{i\,t m}} |\le e^{- \tilde{c}\, |t|^{16/15}}$. [(iii)] There exists a one-parameter family $Φ^{\infty,h}$ of near-critical scaling limits for the magnetization field in the plane with vanishingly small external magnetic field. \ei
△ Less
Submitted 15 July, 2013;
originally announced July 2013.
-
Community detection and graph partitioning
Authors:
M. E. J. Newman
Abstract:
Many methods have been proposed for community detection in networks. Some of the most promising are methods based on statistical inference, which rest on solid mathematical foundations and return excellent results in practice. In this paper we show that two of the most widely used inference methods can be mapped directly onto versions of the standard minimum-cut graph partitioning problem, which a…
▽ More
Many methods have been proposed for community detection in networks. Some of the most promising are methods based on statistical inference, which rest on solid mathematical foundations and return excellent results in practice. In this paper we show that two of the most widely used inference methods can be mapped directly onto versions of the standard minimum-cut graph partitioning problem, which allows us to apply any of the many well-understood partitioning algorithms to the solution of community detection problems. We illustrate the approach by adapting the Laplacian spectral partitioning method to perform community inference, testing the resulting algorithm on a range of examples, including computer-generated and real-world networks. Both the quality of the results and the running time rival the best previous methods.
△ Less
Submitted 21 May, 2013;
originally announced May 2013.
-
Interacting epidemics and coinfection on contact networks
Authors:
M. E. J. Newman,
C. R. Ferrario
Abstract:
The spread of certain diseases can be promoted, in some cases substantially, by prior infection with another disease. One example is that of HIV, whose immunosuppressant effects significantly increase the chances of infection with other pathogens. Such coinfection processes, when combined with nontrivial structure in the contact networks over which diseases spread, can lead to complex patterns of…
▽ More
The spread of certain diseases can be promoted, in some cases substantially, by prior infection with another disease. One example is that of HIV, whose immunosuppressant effects significantly increase the chances of infection with other pathogens. Such coinfection processes, when combined with nontrivial structure in the contact networks over which diseases spread, can lead to complex patterns of epidemiological behavior. Here we consider a mathematical model of two diseases spreading through a single population, where infection with one disease is dependent on prior infection with the other. We solve exactly for the sizes of the outbreaks of both diseases in the limit of large population size, along with the complete phase diagram of the system. Among other things, we use our model to demonstrate how diseases can be controlled not only by reducing the rate of their spread, but also by reducing the spread of other infections upon which they depend.
△ Less
Submitted 20 May, 2013;
originally announced May 2013.
-
Nature vs. Nurture: Predictability in Low-Temperature Ising Dynamics
Authors:
J. Ye,
J. Machta,
C. M. Newman,
D. L. Stein
Abstract:
Consider a dynamical many-body system with a random initial state subsequently evolving through stochastic dynamics. What is the relative importance of the initial state ("nature") vs. the realization of the stochastic dynamics ("nurture") in predicting the final state? We examined this question for the two-dimensional Ising ferromagnet following an initial deep quench from $T=\infty$ to $T=0$. We…
▽ More
Consider a dynamical many-body system with a random initial state subsequently evolving through stochastic dynamics. What is the relative importance of the initial state ("nature") vs. the realization of the stochastic dynamics ("nurture") in predicting the final state? We examined this question for the two-dimensional Ising ferromagnet following an initial deep quench from $T=\infty$ to $T=0$. We performed Monte Carlo studies on the overlap between "identical twins" raised in independent dynamical environments, up to size $L=500$. Our results suggest an overlap decaying with time as $t^{-θ_h}$ with $θ_h = 0.22 \pm 0.02$; the same exponent holds for a quench to low but nonzero temperature. This "heritability exponent" may equal the persistence exponent for the 2D Ising ferromagnet, but the two differ more generally.
△ Less
Submitted 23 October, 2013; v1 submitted 15 May, 2013;
originally announced May 2013.
-
Coauthorship and citation in scientific publishing
Authors:
Travis Martin,
Brian Ball,
Brian Karrer,
M. E. J. Newman
Abstract:
A large number of published studies have examined the properties of either networks of citation among scientific papers or networks of coauthorship among scientists. Here, using an extensive data set covering more than a century of physics papers published in the Physical Review, we study a hybrid coauthorship/citation network that combines the two, which we analyze to gain insight into the correl…
▽ More
A large number of published studies have examined the properties of either networks of citation among scientific papers or networks of coauthorship among scientists. Here, using an extensive data set covering more than a century of physics papers published in the Physical Review, we study a hybrid coauthorship/citation network that combines the two, which we analyze to gain insight into the correlations and interactions between authorship and citation. Among other things, we investigate the extent to which individuals tend to cite themselves or their collaborators more than others, the extent to which they cite themselves or their collaborators more quickly after publication, and the extent to which they tend to return the favor of a citation from another scientist.
△ Less
Submitted 1 April, 2013;
originally announced April 2013.
-
Coarsening in 2D slabs
Authors:
Michael Damron,
Hana Kogan,
Charles M. Newman,
Vladas Sidoravicius
Abstract:
We study coarsening; that is, the zero-temperature limit of Glauber dynamics in the standard Ising model on slabs S_k = Z^2 x {0, ..., k-1} of all thicknesses k \geq 2 (with free and periodic boundary conditions in the third coordinate). We show that with free boundary conditions, for k \geq 3, some sites fixate for large times and some do not, whereas for k=2, all sites fixate. With periodic boun…
▽ More
We study coarsening; that is, the zero-temperature limit of Glauber dynamics in the standard Ising model on slabs S_k = Z^2 x {0, ..., k-1} of all thicknesses k \geq 2 (with free and periodic boundary conditions in the third coordinate). We show that with free boundary conditions, for k \geq 3, some sites fixate for large times and some do not, whereas for k=2, all sites fixate. With periodic boundary conditions, for k \geq 4, some sites fixate and others do not, while for k=2 and 3, all sites fixate.
△ Less
Submitted 11 March, 2013;
originally announced March 2013.
-
Absence of site percolation at criticality in Z^2 x {0,1}
Authors:
Michael Damron,
Charles M. Newman,
Vladas Sidoravicius
Abstract:
In this note we consider site percolation on a two dimensional sandwich of thickness two, the graph Z^2 x {0,1}. We prove that there is no percolation at the critical point. The same arguments are valid for a sandwich of thickness three with periodic boundary conditions. It remains an open problem to extend this result to other sandwiches.
In this note we consider site percolation on a two dimensional sandwich of thickness two, the graph Z^2 x {0,1}. We prove that there is no percolation at the critical point. The same arguments are valid for a sandwich of thickness three with periodic boundary conditions. It remains an open problem to extend this result to other sandwiches.
△ Less
Submitted 17 November, 2012;
originally announced November 2012.
-
On the knot Floer filtration of the concordance group
Authors:
Stephen Hancock,
Jennifer Hom,
Michael Newman
Abstract:
The knot Floer complex together with the associated concordance invariant epsilon can be used to define a filtration on the smooth concordance group. We show that the indexing set of this filtration contains the natural numbers cross the integers as an ordered subset.
The knot Floer complex together with the associated concordance invariant epsilon can be used to define a filtration on the smooth concordance group. We show that the indexing set of this filtration contains the natural numbers cross the integers as an ordered subset.
△ Less
Submitted 6 February, 2014; v1 submitted 15 October, 2012;
originally announced October 2012.
-
First-principles multiway spectral partitioning of graphs
Authors:
Maria A. Riolo,
M. E. J. Newman
Abstract:
We consider the minimum-cut partitioning of a graph into more than two parts using spectral methods. While there exist well-established spectral algorithms for this problem that give good results, they have traditionally not been well motivated. Rather than being derived from first principles by minimizing graph cuts, they are typically presented without direct derivation and then proved after the…
▽ More
We consider the minimum-cut partitioning of a graph into more than two parts using spectral methods. While there exist well-established spectral algorithms for this problem that give good results, they have traditionally not been well motivated. Rather than being derived from first principles by minimizing graph cuts, they are typically presented without direct derivation and then proved after the fact to work. In this paper, we take a contrasting approach in which we start with a matrix formulation of the minimum cut problem and then show, via a relaxed optimization, how it can be mapped onto a spectral embedding defined by the leading eigenvectors of the graph Laplacian. The end result is an algorithm that is similar in spirit to, but different in detail from, previous spectral partitioning approaches. In tests of the algorithm we find that it outperforms previous approaches on certain particularly difficult partitioning problems.
△ Less
Submitted 19 July, 2013; v1 submitted 26 September, 2012;
originally announced September 2012.
-
Spectra of random graphs with arbitrary expected degrees
Authors:
Raj Rao Nadakuditi,
M. E. J. Newman
Abstract:
We study random graphs with arbitrary distributions of expected degree and derive expressions for the spectra of their adjacency and modularity matrices. We give a complete prescription for calculating the spectra that is exact in the limit of large network size and large vertex degrees. We also study the effect on the spectra of hubs in the network, vertices of unusually high degree, and show tha…
▽ More
We study random graphs with arbitrary distributions of expected degree and derive expressions for the spectra of their adjacency and modularity matrices. We give a complete prescription for calculating the spectra that is exact in the limit of large network size and large vertex degrees. We also study the effect on the spectra of hubs in the network, vertices of unusually high degree, and show that these produce isolated eigenvalues outside the main spectral band, akin to impurity states in condensed matter systems, with accompanying eigenvectors that are strongly localized around the hubs. We also give numerical results that confirm our analytic expressions.
△ Less
Submitted 6 August, 2012;
originally announced August 2012.
-
Free and Very Free Morphisms into a Fermat Hypersurface
Authors:
Tabes Bridges,
Rankeya Datta,
Joseph Eddy,
Michael Newman,
John Yu
Abstract:
This paper studies the existence of free and very free curves on the degree 5 Fermat hypersurface in P^5 over a field of characteristic 2. We find that such curves exist in degrees 8 and 9 and not in lower degrees.
This paper studies the existence of free and very free curves on the degree 5 Fermat hypersurface in P^5 over a field of characteristic 2. We find that such curves exist in degrees 8 and 9 and not in lower degrees.
△ Less
Submitted 25 July, 2012; v1 submitted 20 July, 2012;
originally announced July 2012.
-
Friendship networks and social status
Authors:
Brian Ball,
M. E. J. Newman
Abstract:
In empirical studies of friendship networks participants are typically asked, in interviews or questionnaires, to identify some or all of their close friends, resulting in a directed network in which friendships can, and often do, run in only one direction between a pair of individuals. Here we analyze a large collection of such networks representing friendships among students at US high and junio…
▽ More
In empirical studies of friendship networks participants are typically asked, in interviews or questionnaires, to identify some or all of their close friends, resulting in a directed network in which friendships can, and often do, run in only one direction between a pair of individuals. Here we analyze a large collection of such networks representing friendships among students at US high and junior-high schools and show that the pattern of unreciprocated friendships is far from random. In every network, without exception, we find that there exists a ranking of participants, from low to high, such that almost all unreciprocated friendships consist of a lower-ranked individual claiming friendship with a higher-ranked one. We present a maximum-likelihood method for deducing such rankings from observed network data and conjecture that the rankings produced reflect a measure of social status. We note in particular that reciprocated and unreciprocated friendships obey different statistics, suggesting different formation processes, and that rankings are correlated with other characteristics of the participants that are traditionally associated with status, such as age and overall popularity as measured by total number of friends.
△ Less
Submitted 30 May, 2012;
originally announced May 2012.
-
The Ising magnetization exponent on Z^2 is 1/15
Authors:
Federico Camia,
Christophe Garban,
Charles M. Newman
Abstract:
We prove that for the Ising model defined on the plane $\Z^2$ at $β=β_c$, the average magnetization under an external magnetic field $h>0$ behaves exactly like \[{σ_0}_{β_c, h} \asymp h^{\frac 1 {15}}\,. \] The proof, which is surprisingly simple compared to an analogous result for percolation (i.e. that $θ(p)=(p-p_c)^{5/36+o(1)}$ on the triangular lattice \cite{\SmirnovWerner,\KestenScaling}) rel…
▽ More
We prove that for the Ising model defined on the plane $\Z^2$ at $β=β_c$, the average magnetization under an external magnetic field $h>0$ behaves exactly like \[{σ_0}_{β_c, h} \asymp h^{\frac 1 {15}}\,. \] The proof, which is surprisingly simple compared to an analogous result for percolation (i.e. that $θ(p)=(p-p_c)^{5/36+o(1)}$ on the triangular lattice \cite{\SmirnovWerner,\KestenScaling}) relies on the GHS inequality as well as the RSW theorem for FK percolation from \cite{\RSWfk}. The use of GHS to obtain inequalities involving critical exponents is not new; in this paper we show how it can be combined with RSW to obtain matching upper and lower bounds for the average magnetization.
△ Less
Submitted 17 June, 2013; v1 submitted 30 May, 2012;
originally announced May 2012.
-
Planar Ising magnetization field I. Uniqueness of the critical scaling limit
Authors:
Federico Camia,
Christophe Garban,
Charles M. Newman
Abstract:
The aim of this paper is to prove the following result. Consider the critical Ising model on the rescaled grid $a\mathbb{Z}^2$, then the renormalized magnetization field \[Φ^a:=a^{15/8}\sum_{x\in a\mathbb{Z}^2}σ_xδ_x,\] seen as a random distribution (i.e., generalized function) on the plane, has a unique scaling limit as the mesh size $a\searrow0$. The limiting field is conformally covariant.
The aim of this paper is to prove the following result. Consider the critical Ising model on the rescaled grid $a\mathbb{Z}^2$, then the renormalized magnetization field \[Φ^a:=a^{15/8}\sum_{x\in a\mathbb{Z}^2}σ_xδ_x,\] seen as a random distribution (i.e., generalized function) on the plane, has a unique scaling limit as the mesh size $a\searrow0$. The limiting field is conformally covariant.
△ Less
Submitted 6 March, 2015; v1 submitted 30 May, 2012;
originally announced May 2012.
-
Spin Glasses: Old and New Complexity
Authors:
D. L. Stein,
C. M. Newman
Abstract:
Spin glasses are disordered magnetic systems that exhibit a variety of properties that are characteristic of complex systems. After a brief review of basic spin glass concepts, their use in areas such as computer science, biology, and other fields will be explored. This use and its underlying basis will be termed old complexity. Newer concepts and ideas flowing from more recent studies of spin gla…
▽ More
Spin glasses are disordered magnetic systems that exhibit a variety of properties that are characteristic of complex systems. After a brief review of basic spin glass concepts, their use in areas such as computer science, biology, and other fields will be explored. This use and its underlying basis will be termed old complexity. Newer concepts and ideas flowing from more recent studies of spin glasses will then be discussed, leading to a proposal for a kind of new complexity.
△ Less
Submitted 15 May, 2012;
originally announced May 2012.
-
Graph spectra and the detectability of community structure in networks
Authors:
Raj Rao Nadakuditi,
M. E. J. Newman
Abstract:
We study networks that display community structure -- groups of nodes within which connections are unusually dense. Using methods from random matrix theory, we calculate the spectra of such networks in the limit of large size, and hence demonstrate the presence of a phase transition in matrix methods for community detection, such as the popular modularity maximization method. The transition separa…
▽ More
We study networks that display community structure -- groups of nodes within which connections are unusually dense. Using methods from random matrix theory, we calculate the spectra of such networks in the limit of large size, and hence demonstrate the presence of a phase transition in matrix methods for community detection, such as the popular modularity maximization method. The transition separates a regime in which such methods successfully detect the community structure from one in which the structure is present but is not detected. By comparing these results with recent analyses of maximum-likelihood methods we are able to show that spectral modularity maximization is an optimal detection method in the sense that no other method will succeed in the regime where the modularity method fails.
△ Less
Submitted 8 May, 2012;
originally announced May 2012.
-
Is the missing axiom of matroid theory lost forever?
Authors:
Dillon Mayhew,
Mike Newman,
Geoff Whittle
Abstract:
We conjecture that it is not possible to finitely axiomatize matroid representability in monadic second-order logic for matroids, and we describe some partial progress towards this conjecture. We present a collection of sentences in monadic second-order logic and show that it is possible to finitely axiomatize matroids using only sentences in this collection. Moreover, we can also axiomatize repre…
▽ More
We conjecture that it is not possible to finitely axiomatize matroid representability in monadic second-order logic for matroids, and we describe some partial progress towards this conjecture. We present a collection of sentences in monadic second-order logic and show that it is possible to finitely axiomatize matroids using only sentences in this collection. Moreover, we can also axiomatize representability over any fixed finite field (assuming Rota's conjecture holds). We prove that it is not possible to finitely axiomatize representability, or representability over any fixed infinite field, using sentences from the collection.
△ Less
Submitted 13 February, 2016; v1 submitted 16 April, 2012;
originally announced April 2012.
-
A note on packing spanning trees in graphs and bases in matroids
Authors:
Robert F. Bailey,
Mike Newman,
Brett Stevens
Abstract:
We consider the class of graphs for which the edge connectivity is equal to the maximum number of edge-disjoint spanning trees, and the natural generalization to matroids, where the cogirth is equal to the number of disjoint bases. We provide descriptions of such graphs and matroids, showing that such a graph (or matroid) has a unique decomposition. In the case of graphs, our results are relevant…
▽ More
We consider the class of graphs for which the edge connectivity is equal to the maximum number of edge-disjoint spanning trees, and the natural generalization to matroids, where the cogirth is equal to the number of disjoint bases. We provide descriptions of such graphs and matroids, showing that such a graph (or matroid) has a unique decomposition. In the case of graphs, our results are relevant for certain communication protocols.
△ Less
Submitted 7 February, 2014; v1 submitted 5 March, 2012;
originally announced March 2012.
-
Ergodicity and Percolation for Variants of One-dimensional Voter Models
Authors:
Y. Mohylevskyy,
C. M. Newman,
K. Ravishankar
Abstract:
We study variants of one-dimensional q-color voter models in discrete time. In addition to the usual voter model transitions in which a color is chosen from the left or right neighbor of a site there are two types of noisy transitions. One is bulk nucleation where a new random color is chosen. The other is boundary nucleation where a random color is chosen only if the two neighbors have distinct c…
▽ More
We study variants of one-dimensional q-color voter models in discrete time. In addition to the usual voter model transitions in which a color is chosen from the left or right neighbor of a site there are two types of noisy transitions. One is bulk nucleation where a new random color is chosen. The other is boundary nucleation where a random color is chosen only if the two neighbors have distinct colors. We prove under a variety of conditions on q and the magnitudes of the two noise parameters that the system is ergodic, i.e., there is convergence to a unique invariant distribution. The methods are percolation-based using the graphical structure of the model which consists of coalescing random walks combined with branching (boundary nucleation) and dying (bulk nucleation).
△ Less
Submitted 23 April, 2013; v1 submitted 8 December, 2011;
originally announced December 2011.
-
Complex Systems: A Survey
Authors:
M. E. J. Newman
Abstract:
A complex system is a system composed of many interacting parts, often called agents, which displays collective behavior that does not follow trivially from the behaviors of the individual parts. Examples include condensed matter systems, ecosystems, stock markets and economies, biological evolution, and indeed the whole of human society. Substantial progress has been made in the quantitative unde…
▽ More
A complex system is a system composed of many interacting parts, often called agents, which displays collective behavior that does not follow trivially from the behaviors of the individual parts. Examples include condensed matter systems, ecosystems, stock markets and economies, biological evolution, and indeed the whole of human society. Substantial progress has been made in the quantitative understanding of complex systems, particularly since the 1980s, using a combination of basic theory, much of it derived from physics, and computer simulation. The subject is a broad one, drawing on techniques and ideas from a wide range of areas. Here I give a survey of the main themes and methods of complex systems science and an annotated bibliography of resources, ranging from classic papers to recent books and reviews.
△ Less
Submitted 6 December, 2011;
originally announced December 2011.
-
Competing epidemics on complex networks
Authors:
Brian Karrer,
M. E. J. Newman
Abstract:
Human diseases spread over networks of contacts between individuals and a substantial body of recent research has focused on the dynamics of the spreading process. Here we examine a model of two competing diseases spreading over the same network at the same time, where infection with either disease gives an individual subsequent immunity to both. Using a combination of analytic and numerical metho…
▽ More
Human diseases spread over networks of contacts between individuals and a substantial body of recent research has focused on the dynamics of the spreading process. Here we examine a model of two competing diseases spreading over the same network at the same time, where infection with either disease gives an individual subsequent immunity to both. Using a combination of analytic and numerical methods, we derive the phase diagram of the system and estimates of the expected final numbers of individuals infected with each disease. The system shows an unusual dynamical transition between dominance of one disease and dominance of the other as a function of their relative rates of growth. Close to this transition the final outcomes show strong dependence on stochastic fluctuations in the early stages of growth, dependence that decreases with increasing network size, but does so sufficiently slowly as still to be easily visible in systems with millions or billions of individuals. In most regions of the phase diagram we find that one disease eventually dominates while the other reaches only a vanishing fraction of the network, but the system also displays a significant coexistence regime in which both diseases reach epidemic proportions and infect an extensive fraction of the network.
△ Less
Submitted 17 May, 2011;
originally announced May 2011.
-
An efficient and principled method for detecting communities in networks
Authors:
Brian Ball,
Brian Karrer,
M. E. J. Newman
Abstract:
A fundamental problem in the analysis of network data is the detection of network communities, groups of densely interconnected nodes, which may be overlap** or disjoint. Here we describe a method for finding overlap** communities based on a principled statistical approach using generative network models. We show how the method can be implemented using a fast, closed-form expectation-maximizat…
▽ More
A fundamental problem in the analysis of network data is the detection of network communities, groups of densely interconnected nodes, which may be overlap** or disjoint. Here we describe a method for finding overlap** communities based on a principled statistical approach using generative network models. We show how the method can be implemented using a fast, closed-form expectation-maximization algorithm that allows us to analyze networks of millions of nodes in reasonable running times. We test the method both on real-world networks and on synthetic benchmarks and find that it gives results competitive with previous methods. We also show that the same approach can be used to extract nonoverlap** community divisions via a relaxation method, and demonstrate that the algorithm is competitively fast and accurate for the nonoverlap** problem.
△ Less
Submitted 18 April, 2011;
originally announced April 2011.
-
Stochastic blockmodels and community structure in networks
Authors:
Brian Karrer,
M. E. J. Newman
Abstract:
Stochastic blockmodels have been proposed as a tool for detecting community structure in networks as well as for generating synthetic networks for use as benchmarks. Most blockmodels, however, ignore variation in vertex degree, making them unsuitable for applications to real-world networks, which typically display broad degree distributions that can significantly distort the results. Here we demon…
▽ More
Stochastic blockmodels have been proposed as a tool for detecting community structure in networks as well as for generating synthetic networks for use as benchmarks. Most blockmodels, however, ignore variation in vertex degree, making them unsuitable for applications to real-world networks, which typically display broad degree distributions that can significantly distort the results. Here we demonstrate how the generalization of blockmodels to incorporate this missing element leads to an improved objective function for community detection in complex networks. We also propose a heuristic algorithm for community detection using this objective function or its non-degree-corrected counterpart and show that the degree-corrected version dramatically outperforms the uncorrected one in both real-world and synthetic networks.
△ Less
Submitted 23 August, 2010;
originally announced August 2010.
-
Random graphs containing arbitrary distributions of subgraphs
Authors:
Brian Karrer,
M. E. J. Newman
Abstract:
Traditional random graph models of networks generate networks that are locally tree-like, meaning that all local neighborhoods take the form of trees. In this respect such models are highly unrealistic, most real networks having strongly non-tree-like neighborhoods that contain short loops, cliques, or other biconnected subgraphs. In this paper we propose and analyze a new class of random graph…
▽ More
Traditional random graph models of networks generate networks that are locally tree-like, meaning that all local neighborhoods take the form of trees. In this respect such models are highly unrealistic, most real networks having strongly non-tree-like neighborhoods that contain short loops, cliques, or other biconnected subgraphs. In this paper we propose and analyze a new class of random graph models that incorporates general subgraphs, allowing for non-tree-like neighborhoods while still remaining solvable for many fundamental network properties. Among other things we give solutions for the size of the giant component, the position of the phase transition at which the giant component appears, and percolation properties for both site and bond percolation on networks generated by the model.
△ Less
Submitted 10 May, 2010;
originally announced May 2010.
-
A message passing approach for general epidemic models
Authors:
Brian Karrer,
M. E. J. Newman
Abstract:
In most models of the spread of disease over contact networks it is assumed that the probabilities per unit time of disease transmission and recovery from disease are constant, implying exponential distributions of the time intervals for transmission and recovery. Time intervals for real diseases, however, have distributions that in most cases are far from exponential, which leads to disagreements…
▽ More
In most models of the spread of disease over contact networks it is assumed that the probabilities per unit time of disease transmission and recovery from disease are constant, implying exponential distributions of the time intervals for transmission and recovery. Time intervals for real diseases, however, have distributions that in most cases are far from exponential, which leads to disagreements, both qualitative and quantitative, with the models. In this paper, we study a generalized version of the SIR (susceptible-infected-recovered) model of epidemic disease that allows for arbitrary distributions of transmission and recovery times. Standard differential equation approaches cannot be used for this generalized model, but we show that the problem can be reformulated as a time-dependent message passing calculation on the appropriate contact network. The calculation is exact on trees (i.e., loopless networks) or locally tree-like networks (such as random graphs) in the large system size limit. On non-tree-like networks we show that the calculation gives a rigorous bound on the size of disease outbreaks. We demonstrate the method with applications to two specific models and the results compare favorably with numerical simulations.
△ Less
Submitted 22 July, 2010; v1 submitted 29 March, 2010;
originally announced March 2010.
-
Random graph models for directed acyclic networks
Authors:
Brian Karrer,
M. E. J. Newman
Abstract:
We study random graph models for directed acyclic graphs, an important class of networks that includes citation networks, food webs, and feed-forward neural networks among others. We propose two specific models, roughly analogous to the fixed edge number and fixed edge probability variants of traditional undirected random graphs. We calculate a number of properties of these models, including par…
▽ More
We study random graph models for directed acyclic graphs, an important class of networks that includes citation networks, food webs, and feed-forward neural networks among others. We propose two specific models, roughly analogous to the fixed edge number and fixed edge probability variants of traditional undirected random graphs. We calculate a number of properties of these models, including particularly the probability of connection between a given pair of vertices, and compare the results with real-world acyclic network data finding that theory and measurements agree surprisingly well -- far better than the often poor agreement of other random graph models with their corresponding real-world networks.
△ Less
Submitted 24 July, 2009;
originally announced July 2009.
-
Fixation for Distributed Clustering Processes
Authors:
Marcelo R. Hilario,
Oren Louidor,
Charles M. Newman,
Leonardo T. Rolla,
Scott Sheffield,
Vladas Sidoravicius
Abstract:
We study a discrete-time resource flow in $Z^d$, where wealthier vertices attract the resources of their less rich neighbors. For any translation-invariant probability distribution of initial resource quantities, we prove that the flow at each vertex terminates after finitely many steps. This answers (a generalized version of) a question posed by van den Berg and Meester in 1991. The proof uses th…
▽ More
We study a discrete-time resource flow in $Z^d$, where wealthier vertices attract the resources of their less rich neighbors. For any translation-invariant probability distribution of initial resource quantities, we prove that the flow at each vertex terminates after finitely many steps. This answers (a generalized version of) a question posed by van den Berg and Meester in 1991. The proof uses the mass-transport principle and extends to other graphs.
△ Less
Submitted 1 August, 2017; v1 submitted 17 June, 2009;
originally announced June 2009.
-
Random graphs with clustering
Authors:
M. E. J. Newman
Abstract:
We offer a solution to a long-standing problem in the physics of networks, the creation of a plausible, solvable model of a network that displays clustering or transitivity -- the propensity for two neighbors of a network node also to be neighbors of one another. We show how standard random graph models can be generalized to incorporate clustering and give exact solutions for various properties…
▽ More
We offer a solution to a long-standing problem in the physics of networks, the creation of a plausible, solvable model of a network that displays clustering or transitivity -- the propensity for two neighbors of a network node also to be neighbors of one another. We show how standard random graph models can be generalized to incorporate clustering and give exact solutions for various properties of the resulting networks, including sizes of network components, size of the giant component if there is one, position of the phase transition at which the giant component forms, and position of the phase transition for percolation on the network.
△ Less
Submitted 23 March, 2009;
originally announced March 2009.
-
Random hypergraphs and their applications
Authors:
Gourab Ghoshal,
Vinko Zlatic,
Guido Caldarelli,
M. E. J. Newman
Abstract:
In the last few years we have witnessed the emergence, primarily in on-line communities, of new types of social networks that require for their representation more complex graph structures than have been employed in the past. One example is the folksonomy, a tripartite structure of users, resources, and tags -- labels collaboratively applied by the users to the resources in order to impart meani…
▽ More
In the last few years we have witnessed the emergence, primarily in on-line communities, of new types of social networks that require for their representation more complex graph structures than have been employed in the past. One example is the folksonomy, a tripartite structure of users, resources, and tags -- labels collaboratively applied by the users to the resources in order to impart meaningful structure on an otherwise undifferentiated database. Here we propose a mathematical model of such tripartite structures which represents them as random hypergraphs. We show that it is possible to calculate many properties of this model exactly in the limit of large network size and we compare the results against observations of a real folksonomy, that of the on-line photography web site Flickr. We show that in some cases the model matches the properties of the observed network well, while in others there are significant differences, which we find to be attributable to the practice of multiple tagging, i.e., the application by a single user of many tags to one resource, or one tag to many resources.
△ Less
Submitted 2 March, 2009;
originally announced March 2009.
-
Random acyclic networks
Authors:
Brian Karrer,
M. E. J. Newman
Abstract:
Directed acyclic graphs are a fundamental class of networks that includes citation networks, food webs, and family trees, among others. Here we define a random graph model for directed acyclic graphs and give solutions for a number of the model's properties, including connection probabilities and component sizes, as well as a fast algorithm for simulating the model on a computer. We compare the…
▽ More
Directed acyclic graphs are a fundamental class of networks that includes citation networks, food webs, and family trees, among others. Here we define a random graph model for directed acyclic graphs and give solutions for a number of the model's properties, including connection probabilities and component sizes, as well as a fast algorithm for simulating the model on a computer. We compare the predictions of the model to a real-world network of citations between physics papers and find surprisingly good agreement, suggesting that the structure of the real network may be quite well described by the random graph.
△ Less
Submitted 23 February, 2009;
originally announced February 2009.
-
Ising (Conformal) Fields and Cluster Area Measures
Authors:
Federico Camia,
Charles M. Newman
Abstract:
We provide a representation for the scaling limit of the d=2 critical Ising magnetization field as a (conformal) random field using SLE (Schramm-Loewner Evolution) clusters and associated renormalized area measures. The renormalized areas are from the scaling limit of the critical FK (Fortuin-Kasteleyn) clusters and the random field is a convergent sum of the area measures with random signs. Ext…
▽ More
We provide a representation for the scaling limit of the d=2 critical Ising magnetization field as a (conformal) random field using SLE (Schramm-Loewner Evolution) clusters and associated renormalized area measures. The renormalized areas are from the scaling limit of the critical FK (Fortuin-Kasteleyn) clusters and the random field is a convergent sum of the area measures with random signs. Extensions to off-critical scaling limits, to d=3 and to Potts models are also considered.
△ Less
Submitted 21 December, 2008;
originally announced December 2008.
-
Hierarchical structure and the prediction of missing links in networks
Authors:
Aaron Clauset,
Cristopher Moore,
M. E. J. Newman
Abstract:
Networks have in recent years emerged as an invaluable tool for describing and quantifying complex systems in many branches of science. Recent studies suggest that networks often exhibit hierarchical organization, where vertices divide into groups that further subdivide into groups of groups, and so forth over multiple scales. In many cases these groups are found to correspond to known functiona…
▽ More
Networks have in recent years emerged as an invaluable tool for describing and quantifying complex systems in many branches of science. Recent studies suggest that networks often exhibit hierarchical organization, where vertices divide into groups that further subdivide into groups of groups, and so forth over multiple scales. In many cases these groups are found to correspond to known functional units, such as ecological niches in food webs, modules in biochemical networks (protein interaction networks, metabolic networks, or genetic regulatory networks), or communities in social networks. Here we present a general technique for inferring hierarchical structure from network data and demonstrate that the existence of hierarchy can simultaneously explain and quantitatively reproduce many commonly observed topological properties of networks, such as right-skewed degree distributions, high clustering coefficients, and short path lengths. We further show that knowledge of hierarchical structure can be used to predict missing connections in partially known networks with high accuracy, and for more general network structures than competing techniques. Taken together, our results suggest that hierarchy is a central organizing principle of complex networks, capable of offering insight into many network phenomena.
△ Less
Submitted 4 November, 2008;
originally announced November 2008.
-
The first-mover advantage in scientific publication
Authors:
M. E. J. Newman
Abstract:
Mathematical models of the scientific citation process predict a strong "first-mover" effect under which the first papers in a field will, essentially regardless of content, receive citations at a rate enormously higher than papers published later. Moreover papers are expected to retain this advantage in perpetuity -- they should receive more citations indefinitely, no matter how many other pape…
▽ More
Mathematical models of the scientific citation process predict a strong "first-mover" effect under which the first papers in a field will, essentially regardless of content, receive citations at a rate enormously higher than papers published later. Moreover papers are expected to retain this advantage in perpetuity -- they should receive more citations indefinitely, no matter how many other papers are published after them. We test this conjecture against data from a selection of fields and in several cases find a first-mover effect of a magnitude similar to that predicted by the theory. Were we wearing our cynical hat today, we might say that the scientist who wants to become famous is better off -- by a wide margin -- writing a modest paper in next year's hottest field than an outstanding paper in this year's. On the other hand, there are some papers, albeit only a small fraction, that buck the trend and attract significantly more citations than theory predicts despite having relatively late publication dates. We suggest that papers of this kind, though they often receive comparatively few citations overall, are probably worthy of our attention.
△ Less
Submitted 2 September, 2008;
originally announced September 2008.
-
Exceptional Times for the Dynamical Discrete Web
Authors:
L. R. G. Fontes,
C. M. Newman,
K. Ravishankar,
E. Schertzer
Abstract:
The dynamical discrete web (DyDW),introduced in recent work of Howitt and Warren, is a system of coalescing simple symmetric one-dimensional random walks which evolve in an extra continuous dynamical time parameter τ. The evolution is by independent updating of the underlying Bernoulli variables indexed by discrete space-time that define the discrete web at any fixed τ. In this paper, we study t…
▽ More
The dynamical discrete web (DyDW),introduced in recent work of Howitt and Warren, is a system of coalescing simple symmetric one-dimensional random walks which evolve in an extra continuous dynamical time parameter τ. The evolution is by independent updating of the underlying Bernoulli variables indexed by discrete space-time that define the discrete web at any fixed τ. In this paper, we study the existence of exceptional (random) values of τwhere the paths of the web do not behave like usual random walks and the Hausdorff dimension of the set of exceptional such τ. Our results are motivated by those about exceptional times for dynamical percolation in high dimension by Häggstrom, Peres and Steif, and in dimension two by Schramm and Steif. The exceptional behavior of the walks in the DyDW is rather different from the situation for the dynamical random walks of Benjamini, Häggstrom, Peres and Steif. For example, we prove that the walk from the origin S^τ_0 violates the law of the iterated logarithm (LIL) on a set of τof Hausdorff dimension one. We also discuss how these and other results extend to the dynamical Brownian web, the natural scaling limit of the DyDW.
△ Less
Submitted 26 August, 2008;
originally announced August 2008.
-
Marking (1,2) Points of the Brownian Web and Applications
Authors:
C. M. Newman,
K. Ravishankar,
E. Schertzer
Abstract:
The Brownian web (BW), which developed from the work of Arratia and then Tóth and Werner, is a random collection of paths (with specified starting points) in one plus one dimensional space-time that arises as the scaling limit of the discrete web (DW) of coalescing simple random walks. Two recently introduced extensions of the BW, the Brownian net (BN) constructed by Sun and Swart, and the dynam…
▽ More
The Brownian web (BW), which developed from the work of Arratia and then Tóth and Werner, is a random collection of paths (with specified starting points) in one plus one dimensional space-time that arises as the scaling limit of the discrete web (DW) of coalescing simple random walks. Two recently introduced extensions of the BW, the Brownian net (BN) constructed by Sun and Swart, and the dynamical Brownian web (DyBW) proposed by Howitt and Warren, are (or should be) scaling limits of corresponding discrete extensions of the DW -- the discrete net (DN) and the dynamical discrete web (DyDW). These discrete extensions have a natural geometric structure in which the underlying Bernoulli left or right "arrow" structure of the DW is extended by means of branching (i.e., allowing left and right simultaneously) to construct the DN or by means of switching (i.e., from left to right and vice-versa) to construct the DyDW. In this paper we show that there is a similar structure in the continuum where arrow direction is replaced by the left or right parity of the (1,2) space-time points of the BW (points with one incoming path from the past and two outgoing paths to the future, only one of which is a continuation of the incoming path). We then provide a complete construction of the DyBW and an alternate construction of the BN to that of Sun and Swart by proving that the switching or branching can be implemented by a Poissonian marking of the (1,2) points.
△ Less
Submitted 29 June, 2009; v1 submitted 1 June, 2008;
originally announced June 2008.
-
A Percolation-Theoretic Approach to Spin Glass Phase Transitions
Authors:
J. Machta,
C. M. Newman,
D. L. Stein
Abstract:
The magnetically ordered, low temperature phase of Ising ferro- magnets is manifested within the associated Fortuin-Kasteleyn (FK) random cluster representation by the occurrence of a single positive density percolating cluster. In this paper, we review our recent work on the percolation signature for Ising spin glass ordering -- both in the short-range Edwards-Anderson (EA) and infinite-range S…
▽ More
The magnetically ordered, low temperature phase of Ising ferro- magnets is manifested within the associated Fortuin-Kasteleyn (FK) random cluster representation by the occurrence of a single positive density percolating cluster. In this paper, we review our recent work on the percolation signature for Ising spin glass ordering -- both in the short-range Edwards-Anderson (EA) and infinite-range Sherrington-Kirkpatrick (SK) models -- within a two-replica FK representation and also in the different Chayes-Machta-Redner two-replica graphical representation. Numerical studies of the $\pm J$ EA model in dimension three and rigorous results for the SK model are consistent in supporting the conclusion that the signature of spin-glass order in these models is the existence of a single percolating cluster of maximal density normally coexisting with a second percolating cluster of lower density.
△ Less
Submitted 6 May, 2008;
originally announced May 2008.
-
Percolation in the Sherrington-Kirkpatrick Spin Glass
Authors:
J. Machta,
C. M. Newman,
D. L. Stein
Abstract:
We present extended versions and give detailed proofs of results concerning percolation (using various sets of two-replica bond occupation variables) in Sherrington-Kirkpatrick spin glasses (with zero external field) that were first given in an earlier paper by the same authors. We also explain how ultrametricity is manifested by the densities of large percolating clusters. Our main theorems con…
▽ More
We present extended versions and give detailed proofs of results concerning percolation (using various sets of two-replica bond occupation variables) in Sherrington-Kirkpatrick spin glasses (with zero external field) that were first given in an earlier paper by the same authors. We also explain how ultrametricity is manifested by the densities of large percolating clusters. Our main theorems concern the connection between these densities and the usual spin overlap distribution. Their corollaries are that the ordered spin glass phase is characterized by a unique percolating cluster of maximal density (normally coexisting with a second cluster of nonzero but lower density). The proofs involve comparison inequalities between SK multireplica bond occupation variables and the independent variables of standard Erdos-Renyi random graphs.
△ Less
Submitted 19 October, 2007; v1 submitted 8 October, 2007;
originally announced October 2007.
-
The Effect of Pure State Structure on Nonequilibrium Dynamics
Authors:
C. M. Newman,
D. L. Stein
Abstract:
Motivated by short-range Ising spin glasses, we review some rigorous results and their consequences for the relation between the number/nature of equilibrium pure states and nonequilibrium dynamics. Two of the consequences for spin glass dynamics following a deep quench to a temperature with broken spin flip symmetry are: (1) Almost all initial configurations lie on the boundary between the basi…
▽ More
Motivated by short-range Ising spin glasses, we review some rigorous results and their consequences for the relation between the number/nature of equilibrium pure states and nonequilibrium dynamics. Two of the consequences for spin glass dynamics following a deep quench to a temperature with broken spin flip symmetry are: (1) Almost all initial configurations lie on the boundary between the basins of attraction of multiple pure states. (2) Unless there are uncountably many pure states with almost all pairs having zero overlap, there can be no equilibration to a pure state as time goes to infinity. We discuss the relevance of these results to the difficulty of equilibration of spin glasses. We also review some results concerning the ``nature vs. nurture'' problem of whether the large-time behavior of both ferromagnets and spin glasses following a deep quench is determined more by the initial configuration or by the dynamics realization.
△ Less
Submitted 2 October, 2007;
originally announced October 2007.
-
Community structure in directed networks
Authors:
E. A. Leicht,
M. E. J. Newman
Abstract:
We consider the problem of finding communities or modules in directed networks. The most common approach to this problem in the previous literature has been simply to ignore edge direction and apply methods developed for community discovery in undirected networks, but this approach discards potentially useful information contained in the edge directions. Here we show how the widely used benefit…
▽ More
We consider the problem of finding communities or modules in directed networks. The most common approach to this problem in the previous literature has been simply to ignore edge direction and apply methods developed for community discovery in undirected networks, but this approach discards potentially useful information contained in the edge directions. Here we show how the widely used benefit function known as modularity can be generalized in a principled fashion to incorporate the information contained in edge directions. This in turn allows us to find communities by maximizing the modularity over possible divisions of a network, which we do using an algorithm based on the eigenvectors of the corresponding modularity matrix. This method is shown to give demonstrably better results than previous methods on a variety of test networks, both real and computer-generated.
△ Less
Submitted 27 September, 2007;
originally announced September 2007.
-
Robustness of community structure in networks
Authors:
Brian Karrer,
Elizaveta Levina,
M. E. J. Newman
Abstract:
The discovery of community structure is a common challenge in the analysis of network data. Many methods have been proposed for finding community structure, but few have been proposed for determining whether the structure found is statistically significant or whether, conversely, it could have arisen purely as a result of chance. In this paper we show that the significance of community structure…
▽ More
The discovery of community structure is a common challenge in the analysis of network data. Many methods have been proposed for finding community structure, but few have been proposed for determining whether the structure found is statistically significant or whether, conversely, it could have arisen purely as a result of chance. In this paper we show that the significance of community structure can be effectively quantified by measuring its robustness to small perturbations in network structure. We propose a suitable method for perturbing networks and a measure of the resulting change in community structure and use them to assess the significance of community structure in a variety of networks, both real and computer generated.
△ Less
Submitted 13 September, 2007;
originally announced September 2007.
-
Bicomponents and the robustness of networks to failure
Authors:
M. E. J. Newman,
Gourab Ghoshal
Abstract:
A common definition of a robust connection between two nodes in a network such as a communication network is that there should be at least two independent paths connecting them, so that the failure of no single node in the network causes them to become disconnected. This definition leads us naturally to consider bicomponents, subnetworks in which every node has a robust connection of this kind t…
▽ More
A common definition of a robust connection between two nodes in a network such as a communication network is that there should be at least two independent paths connecting them, so that the failure of no single node in the network causes them to become disconnected. This definition leads us naturally to consider bicomponents, subnetworks in which every node has a robust connection of this kind to every other. Here we study bicomponents in both real and model networks using a combination of exact analytic techniques and numerical methods. We show that standard network models predict there to be essentially no small bicomponents in most networks, but there may be a giant bicomponent, whose presence coincides with the presence of the ordinary giant component, and we find that real networks seem by and large to follow this pattern, although there are some interesting exceptions. We study the size of the giant bicomponent as nodes in the network fail, using a specially developed computer algorithm based on data trees, and find in some cases that our networks are quite robust to failure, with large bicomponents persisting until almost all vertices have been removed.
△ Less
Submitted 20 August, 2007;
originally announced August 2007.
-
Component sizes in networks with arbitrary degree distributions
Authors:
M. E. J. Newman
Abstract:
We give an exact solution for the complete distribution of component sizes in random networks with arbitrary degree distributions. The solution tells us the probability that a randomly chosen node belongs to a component of size s, for any s. We apply our results to networks with the three most commonly studied degree distributions -- Poisson, exponential, and power-law -- as well as to the calcu…
▽ More
We give an exact solution for the complete distribution of component sizes in random networks with arbitrary degree distributions. The solution tells us the probability that a randomly chosen node belongs to a component of size s, for any s. We apply our results to networks with the three most commonly studied degree distributions -- Poisson, exponential, and power-law -- as well as to the calculation of cluster sizes for bond percolation on networks, which correspond to the sizes of outbreaks of SIR epidemic processes on the same networks. For the particular case of the power-law degree distribution, we show that the component size distribution itself follows a power law everywhere below the phase transition at which a giant component forms, but takes an exponential form when a giant component is present.
△ Less
Submitted 30 June, 2007;
originally announced July 2007.
-
The Percolation Signature of the Spin Glass Transition
Authors:
J. Machta,
C. M. Newman,
D. L. Stein
Abstract:
Magnetic ordering at low temperature for Ising ferromagnets manifests itself within the associated Fortuin-Kasteleyn (FK) random cluster representation as the occurrence of a single positive density percolating network. In this paper we investigate the percolation signature for Ising spin glass ordering -- both in short-range (EA) and infinite-range (SK) models -- within a two-replica FK represe…
▽ More
Magnetic ordering at low temperature for Ising ferromagnets manifests itself within the associated Fortuin-Kasteleyn (FK) random cluster representation as the occurrence of a single positive density percolating network. In this paper we investigate the percolation signature for Ising spin glass ordering -- both in short-range (EA) and infinite-range (SK) models -- within a two-replica FK representation and also within the different Chayes-Machta-Redner two-replica graphical representation. Based on numerical studies of the $\pm J$ EA model in three dimensions and on rigorous results for the SK model, we conclude that the spin glass transition corresponds to the appearance of {\it two} percolating clusters of {\it unequal} densities.
△ Less
Submitted 30 June, 2007;
originally announced July 2007.
-
Power-law distributions in empirical data
Authors:
Aaron Clauset,
Cosma Rohilla Shalizi,
M. E. J. Newman
Abstract:
Power-law distributions occur in many situations of scientific interest and have significant consequences for our understanding of natural and man-made phenomena. Unfortunately, the detection and characterization of power laws is complicated by the large fluctuations that occur in the tail of the distribution -- the part of the distribution representing large but rare events -- and by the diffic…
▽ More
Power-law distributions occur in many situations of scientific interest and have significant consequences for our understanding of natural and man-made phenomena. Unfortunately, the detection and characterization of power laws is complicated by the large fluctuations that occur in the tail of the distribution -- the part of the distribution representing large but rare events -- and by the difficulty of identifying the range over which power-law behavior holds. Commonly used methods for analyzing power-law data, such as least-squares fitting, can produce substantially inaccurate estimates of parameters for power-law distributions, and even in cases where such methods return accurate answers they are still unsatisfactory because they give no indication of whether the data obey a power law at all. Here we present a principled statistical framework for discerning and quantifying power-law behavior in empirical data. Our approach combines maximum-likelihood fitting methods with goodness-of-fit tests based on the Kolmogorov-Smirnov statistic and likelihood ratios. We evaluate the effectiveness of the approach with tests on synthetic data and give critical comparisons to previous approaches. We also apply the proposed methods to twenty-four real-world data sets from a range of different disciplines, each of which has been conjectured to follow a power-law distribution. In some cases we find these conjectures to be consistent with the data while in others the power law is ruled out.
△ Less
Submitted 2 February, 2009; v1 submitted 7 June, 2007;
originally announced June 2007.
-
Large-scale structure of time evolving citation networks
Authors:
E. A. Leicht,
Gavin Clarkson,
Kerby Shedden,
M. E. J. Newman
Abstract:
In this paper we examine a number of methods for probing and understanding the large-scale structure of networks that evolve over time. We focus in particular on citation networks, networks of references between documents such as papers, patents, or court cases. We describe three different methods of analysis, one based on an expectation-maximization algorithm, one based on modularity optimizati…
▽ More
In this paper we examine a number of methods for probing and understanding the large-scale structure of networks that evolve over time. We focus in particular on citation networks, networks of references between documents such as papers, patents, or court cases. We describe three different methods of analysis, one based on an expectation-maximization algorithm, one based on modularity optimization, and one based on eigenvector centrality. Using the network of citations between opinions of the United States Supreme Court as an example, we demonstrate how each of these methods can reveal significant structural divisions in the network, and how, ultimately, the combination of all three can help us develop a coherent overall picture of the network's shape.
△ Less
Submitted 5 June, 2007; v1 submitted 31 May, 2007;
originally announced June 2007.