Search | arXiv e-print repository

Community Detection in Hypergraphs via Mutual Information Maximization

Authors: Jurgen Kritschgau, Daniel Kaiser, Oliver Alvarado Rodriguez, Ilya Amburg, Jessalyn Bolkema, Thomas Grubb, Fangfei Lan, Sepideh Maleki, Phil Chodrow, Bill Kay

Abstract: The hypergraph community detection problem seeks to identify groups of related nodes in hypergraph data. We propose an information-theoretic hypergraph community detection algorithm which compresses the observed data in terms of community labels and community-edge intersections. This algorithm can also be viewed as maximum-likelihood inference in a degree-corrected microcanonical stochastic blockm… ▽ More The hypergraph community detection problem seeks to identify groups of related nodes in hypergraph data. We propose an information-theoretic hypergraph community detection algorithm which compresses the observed data in terms of community labels and community-edge intersections. This algorithm can also be viewed as maximum-likelihood inference in a degree-corrected microcanonical stochastic blockmodel. We perform the inference/compression step via simulated annealing. Unlike several recent algorithms based on canonical models, our microcanonical algorithm does not require inference of statistical parameters such as node degrees or pairwise group connection rates. Through synthetic experiments, we find that our algorithm succeeds down to recently-conjectured thresholds for sparse random hypergraphs. We also find competitive performance in cluster recovery tasks on several hypergraph data sets. △ Less

Submitted 8 August, 2023; originally announced August 2023.

Comments: Submitted

arXiv:2303.09282 [pdf, ps, other]

Data science and social justice in the mathematics community

Authors: Quindel Jones, Andrés R. Vindas Meléndez, Ariana Mendible, Manuchehr Aminian, Heather Z. Brooks, Nathan Alexander, Carrie Diaz Eaton, Philip Chodrow

Abstract: Data science for social justice (DS4SJ) is data-scientific work that supports the liberation of oppressed and marginalized people. By nature, this work lies at the intersection of technical scholarship and activist practice. We discuss this growing efforts in DS4SJ within the broad mathematics community. We begin by defining terms and offering a series of guiding principles for engaging in critica… ▽ More Data science for social justice (DS4SJ) is data-scientific work that supports the liberation of oppressed and marginalized people. By nature, this work lies at the intersection of technical scholarship and activist practice. We discuss this growing efforts in DS4SJ within the broad mathematics community. We begin by defining terms and offering a series of guiding principles for engaging in critical data science work, providing examples of how these principles play out in practice. We then highlight the roles that DS4SJ can play in the scholarship and pedagogy of practicing mathematicians. We focus in particular on the engagement of early-career mathematicians in DS4SJ, which we illustrate through a series of four personal vignettes. While the primary aim of DS4SJ is to achieve impact for marginalized communities, we also argue that engagement with DS4SJ can benefit the entire mathematical ecosystem, including researchers, instructors, students, departments, institutes, and professional societies. We close with reflections on how these various actors can support ongoing efforts in data science for social justice. △ Less

Submitted 14 March, 2023; originally announced March 2023.

arXiv:2209.07004 [pdf, other]

Emergence of polarization in a sigmoidal bounded-confidence model of opinion dynamics

Authors: Heather Z. Brooks, Philip S. Chodrow, Mason A. Porter

Abstract: We study a nonlinear bounded-confidence model (BCM) of continuous-time opinion dynamics on networks with both persuadable individuals and zealots. The model is parameterized by a scalar $γ$, which controls the steepness of a smooth influence function. This influence function encodes the relative weights that nodes place on the opinions of other nodes. When $γ= 0$, this influence function recovers… ▽ More We study a nonlinear bounded-confidence model (BCM) of continuous-time opinion dynamics on networks with both persuadable individuals and zealots. The model is parameterized by a scalar $γ$, which controls the steepness of a smooth influence function. This influence function encodes the relative weights that nodes place on the opinions of other nodes. When $γ= 0$, this influence function recovers Taylor's averaging model; when $γ\rightarrow \infty$, the influence function converges to that of a modified Hegselmann--Krause (HK) BCM. Unlike the classical HK model, however, our sigmoidal bounded-confidence model (SBCM) is smooth for any finite $γ$. We show that the set of steady states of our SBCM is qualitatively similar to that of the Taylor model when $γ$ is small and that the set of steady states approaches a subset of the set of steady states of a modified HK model as $γ\rightarrow \infty$. For several special graph topologies, we give analytical descriptions of important features of the space of steady states. A notable result is a closed-form relationship between the stability of a polarized state and the graph topology in a simple model of echo chambers in social networks. Because the influence function of our BCM is smooth, we are able to study it with linear stability analysis, which is difficult to employ with the usual discontinuous influence functions in BCMs. △ Less

Submitted 29 July, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

Comments: 29 pages, 7 figures

MSC Class: 91D30; 37N99

arXiv:2204.13586 [pdf, other]

Nonbacktracking spectral clustering of nonuniform hypergraphs

Authors: Philip Chodrow, Nicole Eikmeier, Jamie Haddock

Abstract: Spectral methods offer a tractable, global framework for clustering in graphs via eigenvector computations on graph matrices. Hypergraph data, in which entities interact on edges of arbitrary size, poses challenges for matrix representations and therefore for spectral clustering. We study spectral clustering for nonuniform hypergraphs based on the hypergraph nonbacktracking operator. After reviewi… ▽ More Spectral methods offer a tractable, global framework for clustering in graphs via eigenvector computations on graph matrices. Hypergraph data, in which entities interact on edges of arbitrary size, poses challenges for matrix representations and therefore for spectral clustering. We study spectral clustering for nonuniform hypergraphs based on the hypergraph nonbacktracking operator. After reviewing the definition of this operator and its basic properties, we prove a theorem of Ihara-Bass type which allows eigenpair computations to take place on a smaller matrix, often enabling faster computation. We then propose an alternating algorithm for inference in a hypergraph stochastic blockmodel via linearized belief-propagation which involves a spectral clustering step again using nonbacktracking operators. We provide proofs related to this algorithm that both formalize and extend several previous results. We pose several conjectures about the limits of spectral methods and detectability in hypergraph stochastic blockmodels in general, supporting these with in-expectation analysis of the eigeinpairs of our studied operators. We perform experiments in real and synthetic data that demonstrate the benefits of hypergraph methods over graph-based ones when interactions of different sizes carry different information about cluster structure. △ Less

Submitted 3 September, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

Comments: Main text: 26 pages, 6 figures. Appendix and references: 23 pages, 4 figures

MSC Class: 05C50; 05C65; 15A18; 62H30; 62R07; 91D30

arXiv:2101.09611 [pdf, other]

Generative hypergraph clustering: from blockmodels to modularity

Authors: Philip S. Chodrow, Nate Veldt, Austin R. Benson

Abstract: Hypergraphs are a natural modeling paradigm for a wide range of complex relational systems. A standard analysis task is to identify clusters of closely related or densely interconnected nodes. Many graph algorithms for this task are based on variants of the stochastic blockmodel, a random graph with flexible cluster structure. However, there are few models and algorithms for hypergraph clustering.… ▽ More Hypergraphs are a natural modeling paradigm for a wide range of complex relational systems. A standard analysis task is to identify clusters of closely related or densely interconnected nodes. Many graph algorithms for this task are based on variants of the stochastic blockmodel, a random graph with flexible cluster structure. However, there are few models and algorithms for hypergraph clustering. Here, we propose a Poisson degree-corrected hypergraph stochastic blockmodel (DCHSBM), a generative model of clustered hypergraphs with heterogeneous node degrees and edge sizes. Approximate maximum-likelihood inference in the DCHSBM naturally leads to a clustering objective that generalizes the popular modularity objective for graphs. We derive a general Louvain-type algorithm for this objective, as well as a a faster, specialized "All-Or-Nothing" (AON) variant in which edges are expected to lie fully within clusters. This special case encompasses a recent proposal for modularity in hypergraphs, while also incorporating flexible resolution and edge-size parameters. We show that AON hypergraph Louvain is highly scalable, including as an example an experiment on a synthetic hypergraph of one million nodes. We also demonstrate through synthetic experiments that the detectability regimes for hypergraph community detection differ from methods based on dyadic graph projections. We use our generative model to analyze different patterns of higher-order structure in school contact networks, U.S. congressional bill cosponsorship, U.S. congressional committees, product categories in co-purchasing behavior, and hotel locations from web browsing sessions, finding interpretable higher-order structure. We then study the behavior of our AON hypergraph Louvain algorithm, finding that it is able to recover ground truth clusters in empirical data sets exhibiting the corresponding higher-order structure. △ Less

Submitted 18 August, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

Comments: 23 pages + 7 pages of supplementary information, 3 tables, 4 figures

Journal ref: Science Advances,7:eabh1303, 2021

arXiv:2007.04448 [pdf, other]

doi 10.1073/pnas.2015188118

Emergence of Hierarchy in Networked Endorsement Dynamics

Authors: Mari Kawakatsu, Philip S. Chodrow, Nicole Eikmeier, Daniel B. Larremore

Abstract: Many social and biological systems are characterized by enduring hierarchies, including those organized around prestige in academia, dominance in animal groups, and desirability in online dating. Despite their ubiquity, the general mechanisms that explain the creation and endurance of such hierarchies are not well understood. We introduce a generative model for the dynamics of hierarchies using ti… ▽ More Many social and biological systems are characterized by enduring hierarchies, including those organized around prestige in academia, dominance in animal groups, and desirability in online dating. Despite their ubiquity, the general mechanisms that explain the creation and endurance of such hierarchies are not well understood. We introduce a generative model for the dynamics of hierarchies using time-varying networks in which new links are formed based on the preferences of nodes in the current network and old links are forgotten over time. The model produces a range of hierarchical structures, ranging from egalitarianism to bistable hierarchies, and we derive critical points that separate these regimes in the limit of long system memory. Importantly, our model supports statistical inference, allowing for a principled comparison of generative mechanisms using data. We apply the model to study hierarchical structures in empirical data on hiring patterns among mathematicians, dominance relations among parakeets, and friendships among members of a fraternity, observing several persistent patterns as well as interpretable differences in the generative mechanisms favored by each. Our work contributes to the growing literature on statistically grounded models of time-varying networks. △ Less

Submitted 7 May, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

Comments: Updates corresponding to final published version

Journal ref: Proceedings of the National Academy of Sciences, 118(16) (2021)

arXiv:1911.01331 [pdf, other]

Annotated Hypergraphs: Models and Applications

Authors: Philip Chodrow, Andrew Mellor

Abstract: Hypergraphs offer a natural modeling language for studying polyadic interactions between sets of entities. Many polyadic interactions are asymmetric, with nodes playing distinctive roles. In an academic collaboration network, for example, the order of authors on a paper often reflects the nature of their contributions to the completed work. To model these networks, we introduce \emph{annotated hyp… ▽ More Hypergraphs offer a natural modeling language for studying polyadic interactions between sets of entities. Many polyadic interactions are asymmetric, with nodes playing distinctive roles. In an academic collaboration network, for example, the order of authors on a paper often reflects the nature of their contributions to the completed work. To model these networks, we introduce \emph{annotated hypergraphs} as natural polyadic generalizations of directed graphs. Annotated hypergraphs form a highly general framework for incorporating metadata into polyadic graph models. To facilitate data analysis with annotated hypergraphs, we construct a role-aware configuration null model for these structures and prove an efficient Markov Chain Monte Carlo scheme for sampling from it. We proceed to formulate several metrics and algorithms for the analysis of annotated hypergraphs. Several of these, such as assortativity and modularity, naturally generalize dyadic counterparts. Other metrics, such as local role densities, are unique to the setting of annotated hypergraphs. We illustrate our techniques on six digital social networks, and present a detailed case-study of the Enron email data set. △ Less

Submitted 4 November, 2019; originally announced November 2019.

Comments: 22 pages, 6 figures, 2 tables

arXiv:1909.09037 [pdf, other]

Moments of Uniform Random Multigraphs with Fixed Degree Sequences

Authors: Philip S. Chodrow

Abstract: We study the expected adjacency matrix of a uniformly random multigraph with fixed degree sequence $\mathbf{d} \in \mathbb{Z}_+^n$. This matrix arises in a variety of analyses of networked data sets, including modularity-maximization and mean-field theories of spreading processes. Its structure is well-understood for large, sparse, simple graphs: the expected number of edges between nodes $i$ and… ▽ More We study the expected adjacency matrix of a uniformly random multigraph with fixed degree sequence $\mathbf{d} \in \mathbb{Z}_+^n$. This matrix arises in a variety of analyses of networked data sets, including modularity-maximization and mean-field theories of spreading processes. Its structure is well-understood for large, sparse, simple graphs: the expected number of edges between nodes $i$ and $j$ is roughly $\frac{d_id_j}{\sum_\ell{d_\ell}}$. Many network data sets are neither large, sparse, nor simple, and in these cases the standard approximation no longer applies. We derive a novel estimator using a dynamical approach: the estimator emerges from the stationarity conditions of a class of Markov Chain Monte Carlo algorithms for graph sampling. We derive error bounds for this estimator, and provide an efficient scheme with which to compute it. We test the estimator on synthetic and empirical degree sequences, finding that it enjoys relative error against ground truth a full order of magnitude smaller than the standard approximation. We then compare modularity maximization techniques using both the standard and novel estimator, finding that the qualitative structure of the optimization landscape depends significantly on the estimator choice. Our results emphasize the importance of using carefully specified random graph models in data scientific applications. △ Less

Submitted 6 February, 2020; v1 submitted 19 September, 2019; originally announced September 2019.

Comments: 31 pages, 5 figures, 5 pages of supplementary material

MSC Class: 05C80; 05C82; 91D30; 62-07; 65C05

arXiv:1902.09302 [pdf, other]

Configuration Models of Random Hypergraphs

Authors: Philip S. Chodrow

Abstract: Many empirical networks are intrinsically polyadic, with interactions occurring within groups of agents of arbitrary size. There are, however, few flexible null models that can support statistical inference for such polyadic networks. We define a class of null random hypergraphs that hold constant both the node degree and edge dimension sequences, generalizing the classical dyadic configuration mo… ▽ More Many empirical networks are intrinsically polyadic, with interactions occurring within groups of agents of arbitrary size. There are, however, few flexible null models that can support statistical inference for such polyadic networks. We define a class of null random hypergraphs that hold constant both the node degree and edge dimension sequences, generalizing the classical dyadic configuration model. We provide a Markov Chain Monte Carlo scheme for sampling from these models, and discuss connections and distinctions between our proposed models and previous approaches. We then illustrate these models through a triplet of applications. We start with two classical network topics -- triadic clustering and degree-assortativity. In each, we emphasize the importance of randomizing over hypergraph space rather than projected graph space, showing that this choice can dramatically alter statistical inference and study findings. We then define and study the edge intersection profile of a hypergraph as a measure of higher-order correlation between edges, and derive asymptotic approximations under the stub-labeled null. Our experiments emphasize the ability of explicit, statistically-grounded polyadic modeling to significantly enhance the toolbox of network data science. We close with suggestions for multiple avenues of future work. △ Less

Submitted 13 December, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

Comments: Major revisions to all text and figures

arXiv:1901.09456 [pdf, other]

Log-minor distributions and an application to estimating mean subsystem entropy

Authors: Alice C. Schwarze, Philip S. Chodrow, Mason A. Porter

Abstract: A common task in physics, information theory, and other fields is the analysis of properties of subsystems of a given system. Given the covariance matrix $M$ of a system of $n$ coupled variables, the covariance matrices of the subsystems are principal submatrices of $M$. The rapid growth with $n$ of the set of principal submatrices makes it impractical to exhaustively study each submatrix for even… ▽ More A common task in physics, information theory, and other fields is the analysis of properties of subsystems of a given system. Given the covariance matrix $M$ of a system of $n$ coupled variables, the covariance matrices of the subsystems are principal submatrices of $M$. The rapid growth with $n$ of the set of principal submatrices makes it impractical to exhaustively study each submatrix for even modestly-sized systems. It is therefore of great interest to derive methods for approximating the distributions of important submatrix properties for a given matrix. Motivated by the importance of differential entropy as a systemic measure of disorder, we study the distribution of log-determinants of principal $k\times k$ submatrices when the covariance matrix has bounded condition number. We derive upper bounds for the right tail and the variance of the distribution of minors, and we use these in turn to derive upper bounds on the standard error of the sample mean of subsystem entropy. Our results demonstrate that, despite the rapid growth of the set of subsystems with $n$, the number of samples that are needed to bound the sampling error is asymptotically independent of $n$. Instead, it is sufficient to increase the number of samples in linear proportion to $k$ to achieve a desired sampling accuracy. △ Less

Submitted 27 January, 2019; originally announced January 2019.

Comments: Keywords: empirical distributions, determinants, sampling error, positive-definite matrices, random matrices

MSC Class: 15B99; 15A15; 60E15; 93A10

arXiv:1812.05464 [pdf, other]

doi 10.1137/18M1232346

Local Symmetry and Global Structure in Adaptive Voter Models

Authors: Philip S. Chodrow, Peter J. Mucha

Abstract: Adaptive voter models (AVMs) are simple mechanistic systems that model the emergence of mesoscopic structure from local networked processes driven by conflict and homophily. AVMs display rich behavior, including a phase transition from a fully-fragmented regime of "echo-chambers" to a regime of persistent disagreement governed by low-dimensional quasistable manifolds. Many extant methods for appro… ▽ More Adaptive voter models (AVMs) are simple mechanistic systems that model the emergence of mesoscopic structure from local networked processes driven by conflict and homophily. AVMs display rich behavior, including a phase transition from a fully-fragmented regime of "echo-chambers" to a regime of persistent disagreement governed by low-dimensional quasistable manifolds. Many extant methods for approximating the behavior of AVMs are either restricted in scope, expensive in computation, or inaccurate in predicting important statistics. In this work, we develop a novel, second-order moment closure approximation method for binary-state rewire-to-random and rewire-to-same model variants. We incorporate a small amount of noise via a random mutation term, which renders the system ergodic. Using ergodicity, we then approximate the voting process, which is non-Markovian in the second moments of the system, with a Markovian term near the phase transition. This approximation exploits an asymmetry between different classes of voting events. The resulting scheme enables us to predict the location of the phase transition and the active edge density in the regime of persistent disagreement, across the entire space of parameters and opinion densities. Numerically, our results are nearly exact for the rewire-to-random model, and competitive with other current approaches for the rewire-to-same model. Moreover, our computations display constant scaling in the mean degree, enabling approximations for denser systems than previously possible. We conclude with suggestions for model refinements and extensions. △ Less

Submitted 24 December, 2019; v1 submitted 13 December, 2018; originally announced December 2018.

Comments: 20 Pages, 4 figures. Final revised copy

Journal ref: SIAM Journal on Applied Mathematics (2020)

arXiv:1708.07459 [pdf, ps, other]

Divergence, Entropy, Information: An Opinionated Introduction to Information Theory

Authors: Philip Chodrow

Abstract: Information theory is a mathematical theory of learning with deep connections with topics as diverse as artificial intelligence, statistical physics, and biological evolution. Many primers on information theory paint a broad picture with relatively little mathematical sophistication, while many others develop specific application areas in detail. In contrast, these informal notes aim to outline so… ▽ More Information theory is a mathematical theory of learning with deep connections with topics as diverse as artificial intelligence, statistical physics, and biological evolution. Many primers on information theory paint a broad picture with relatively little mathematical sophistication, while many others develop specific application areas in detail. In contrast, these informal notes aim to outline some elements of the information-theoretic "way of thinking," by cutting a rapid and interesting path through some of the theory's foundational concepts and results. They are aimed at practicing systems scientists who are interested in exploring potential connections between information theory and their own fields. The main mathematical prerequisite for the notes is comfort with elementary probability, including sample spaces, conditioning, and expectations. We take the Kullback-Leibler divergence as our most basic concept, and then proceed to develop the entropy and mutual information. We discuss some of the main results, including the Chernoff bounds as a characterization of the divergence; Gibbs' Theorem; and the Data Processing Inequality. A recurring theme is that the definitions of information theory support natural theorems that sound ``obvious'' when translated into English. More pithily, ``information theory makes common sense precise.'' Since the focus of the notes is not primarily on technical details, proofs are provided only where the relevant techniques are illustrative of broader themes. Otherwise, proofs and intriguing tangents are referenced in liberally-sprinkled footnotes. The notes close with a highly nonexhaustive list of references to resources and other perspectives on the field. △ Less

Submitted 23 March, 2019; v1 submitted 24 August, 2017; originally announced August 2017.

Comments: 18 pages, typo fixes and other style revisions

arXiv:1205.7003 [pdf, ps, other]

Upper and lower bounds for the iterates of order-preserving homogeneous maps on cones

Authors: Philip Chodrow, Cole Franks, Brian Lins

Abstract: We define upper bound and lower bounds for order-preserving homogeneous of degree one maps on a proper closed cone in $\R^n$ in terms of the cone spectral radius. We also define weak upper and lower bounds for these maps. For a proper closed cone $C \subset \R^n$, we prove that any order-preserving homogeneous of degree one map $f: \inter C \rightarrow \inter C$ has a lower bound. If $C$ is polyhe… ▽ More We define upper bound and lower bounds for order-preserving homogeneous of degree one maps on a proper closed cone in $\R^n$ in terms of the cone spectral radius. We also define weak upper and lower bounds for these maps. For a proper closed cone $C \subset \R^n$, we prove that any order-preserving homogeneous of degree one map $f: \inter C \rightarrow \inter C$ has a lower bound. If $C$ is polyhedral, we prove that the map $f$ has a weak upper bound. We give examples of weak upper bounds for certain order-preserving homogeneous of degree one maps defined on the interior of $\R^n_+$. △ Less

Submitted 31 May, 2012; originally announced May 2012.

MSC Class: 47H07 (Primary) 15B48 (Secondary)

Showing 1–13 of 13 results for author: Chodrow, P