Dynamics of Affective Polarization: From Consensus to Partisan Divides

Authors: Buddhika Nettasinghe, Allon G. Percus, Kristina Lerman

Abstract: Politically divided societies are also often divided emotionally: people like and trust those with similar political views (in-group favoritism) while disliking and distrusting those with different views (out-group animosity). This phenomenon, called affective polarization, influences individual decisions, including seemingly apolitical choices such as whether to wear a mask or what car to buy. We present a dynamical model of decision-making in an affectively polarized society, identifying three potential global outcomes separated by a sharp boundary in the parameter space: consensus, partisan polarization, and non-partisan polarization. Analysis reveals that larger out-group animosity compared to in-group favoritism, i.e., more hate than love, is sufficient for polarization, while larger in-group favoritism compared to out-group animosity, i.e., more love than hate, is necessary for consensus. We also show that, counter-intuitively, increasing cross-party connections facilitates polarization, and that by emphasizing partisan differences, mass media creates self-fulfilling prophecies that lead to polarization. Affective polarization also creates tipping points in the opinion landscape where one group suddenly reverses their trends. Our findings aid in understanding and addressing the cascading effects of affective polarization, offering insights for strategies to mitigate polarization.

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2310.02479 [pdf, other]

Addressing Quantum's "Fine Print": State Preparation and Information Extraction for Quantum Algorithms and Geologic Fracture Networks

Authors: Jessie M. Henderson, John Kath, John K. Golden, Allon G. Percus, Daniel O'Malley

Abstract: Quantum algorithms provide an exponential speedup for solving certain classes of linear systems, including those that model geologic fracture flow. However, this revolutionary gain in efficiency does not come without difficulty. Quantum algorithms require that problems satisfy not only algorithm-specific constraints, but also application-specific ones. Otherwise, the quantum advantage carefully attained through algorithmic ingenuity can be entirely negated. Previous work addressing quantum algorithms for geologic fracture flow has illustrated core algorithmic approaches while incrementally removing assumptions. This work addresses two further requirements for solving geologic fracture flow systems with quantum algorithms: efficient system state preparation and efficient information extraction. Our approach to addressing each is consistent with an overall exponential speed-up.

Submitted 3 October, 2023; originally announced October 2023.

Comments: 13 pages, 12 figures, LA-UR-23-31328

arXiv:2306.03416 [pdf, other]

Bayesian Learning of Gas Transport in Three-Dimensional Fracture Networks

Authors: Yingqi Shi, Donald J. Berry, John Kath, Shams Lodhy, An Ly, Allon G. Percus, Jeffrey D. Hyman, Kelly Moran, Justin Strait, Matthew R. Sweeney, Hari S. Viswanathan, Philip H. Stauffer

Abstract: Modeling gas flow through fractures of subsurface rock is a particularly challenging problem because of the heterogeneous nature of the material. High-fidelity simulations using discrete fracture network (DFN) models are one methodology for predicting gas particle breakthrough times at the surface, but are computationally demanding. We propose a Bayesian machine learning method that serves as an efficient surrogate model, or emulator, for these three-dimensional DFN simulations. Our model trains on a small quantity of simulation data and, using a graph/path-based decomposition of the fracture network, rapidly predicts quantiles of the breakthrough time distribution. The approach, based on Gaussian Process Regression (GPR), outputs predictions that are within 20-30% of high-fidelity DFN simulation results. Unlike previously proposed methods, it also provides uncertainty quantification, outputting confidence intervals that are essential given the uncertainty inherent in subsurface modeling. Our trained model runs within a fraction of a second, which is considerably faster than other methods with comparable accuracy and multiple orders of magnitude faster than high-fidelity simulations.

Submitted 6 June, 2023; originally announced June 2023.

Report number: LA-UR-23-25597

arXiv:2304.03479 [pdf, other]

doi 10.1103/PhysRevE.107.L042301

Clique Densification in Networks

Authors: Haochen Pi, Keith Burghardt, Allon G. Percus, Kristina Lerman

Abstract: Real-world networks are rarely static. Recently, there has been increasing interest in both network growth and network densification, in which the number of edges scales superlinearly with the number of nodes. Less studied but equally important, however, are scaling laws of higher-order cliques, which can drive clustering and network redundancy. In this paper, we study how cliques grow with network size, by analyzing several empirical networks from emails to Wikipedia interactions. Our results show superlinear scaling laws whose exponents increase with clique size, in contrast to predictions from a previous model. We then show that these results are in qualitative agreement with a new model that we propose, the Local Preferential Attachment Model, where an incoming node links not only to a target node but also to its higher-degree neighbors. Our results provide new insights into how networks grow and where network redundancy occurs.

Submitted 7 April, 2023; originally announced April 2023.

Comments: 14 pages, 11 figures. Paper is in press at Physical Review E

arXiv:2101.11056 [pdf, other]

A Model of Densifying Collaboration Networks

Authors: Keith A. Burghardt, Allon G. Percus, Kristina Lerman

Abstract: Research collaborations provide the foundation for scientific advances, but we have only recently begun to understand how they form and grow on a global scale. Here we analyze a model of the growth of research collaboration networks to explain the empirical observations that the number of collaborations scales superlinearly with institution size, though at different rates (heterogeneous densification), the number of institutions grows as a power of the number of researchers (Heaps' law) and institution sizes approximate Zipf's law. This model has three mechanisms: (i) researchers are preferentially hired by large institutions, (ii) new institutions trigger more potential institutions, and (iii) researchers collaborate with friends-of-friends. We show agreement between these assumptions and empirical data, through analysis of co-authorship networks spanning two centuries. We then develop a theoretical understanding of this model, which reveals emergent heterogeneous scaling such that the number of collaborations between institutions scales with an institution's size.

Submitted 26 January, 2021; originally announced January 2021.

Comments: arXiv admin note: text overlap with arXiv:2001.08734

arXiv:2001.08734 [pdf, other]

The Emergence of Heterogeneous Scaling in Research Institutions

Authors: Keith A. Burghardt, Zihao He, Allon G. Percus, Kristina Lerman

Abstract: Research institutions provide the infrastructure for scientific discovery, yet their role in the production of knowledge is not well characterized. To address this gap, we analyze interactions of researchers within and between institutions from millions of scientific papers. Our analysis reveals that the number of collaborations scales superlinearly with institution size, though at different rates (heterogeneous densification). We also find that the number of institutions scales with the number of researchers as a power law (Heaps' law) and institution sizes approximate Zipf's law. These patterns can be reproduced by a simple model with three mechanisms: (i) researchers collaborate with friends-of-friends, (ii) new institutions trigger more potential institutions, and (iii) researchers are preferentially hired by large institutions. This model reveals an economy of scale in research: larger institutions grow faster and amplify collaborations. Our work provides a new understanding of emergent behavior in research institutions and how they facilitate innovation.

Submitted 26 January, 2021; v1 submitted 23 January, 2020; originally announced January 2020.

Comments: 31 pages double-spaced (12 pages main text) and 23 figures (3 figures main text)

arXiv:1910.09538 [pdf, other]

The Transsortative Structure of Networks

Authors: Xin-Zeng Wu, Allon G. Percus, Keith Burghardt, Kristina Lerman

Abstract: Network topologies can be non-trivial, due to the complex underlying behaviors that form them. While past research has shown that some processes on networks may be characterized by low-order statistics describing nodes and their neighbors, such as degree assortativity, these quantities fail to capture important sources of variation in network structure. We introduce a property called transsortativity that describes correlations among a node's neighbors, generalizing these statistics from immediate one-hop neighbors to two-hop neighbors. We describe how transsortativity can be systematically varied, independently of the network's degree distribution and assortativity. Moreover, we show that it can significantly impact the spread of contagions as well as the perceptions of neighbors, known as the majority illusion. Our work improves our ability to create and analyze more realistic models of complex networks.

Submitted 21 October, 2019; originally announced October 2019.

Comments: 6 pages, 5 figures

arXiv:1810.06118 [pdf, other]

doi 10.1016/j.commatsci.2019.02.046

Learning to fail: Predicting fracture evolution in brittle material models using recurrent graph convolutional neural networks

Authors: Max Schwarzer, Bryce Rogan, Yadong Ruan, Zhengming Song, Diana Y. Lee, Allon G. Percus, Viet T. Chau, Bryan A. Moore, Esteban Rougier, Hari S. Viswanathan, Gowri Srinivasan

Abstract: We propose a machine learning approach to address a key challenge in materials science: predicting how fractures propagate in brittle materials under stress, and how these materials ultimately fail. Our methods use deep learning and train on simulation data from high-fidelity models, emulating the results of these models while avoiding the overwhelming computational demands associated with running a statistically significant sample of simulations. We employ a graph convolutional network that recognizes features of the fracturing material and a recurrent neural network that models the evolution of these features, along with a novel form of data augmentation that compensates for the modest size of our training data. We simultaneously generate predictions for qualitatively distinct material properties. Results on fracture damage and length are within 3% of their simulated values, and results on time to material failure, which is notoriously difficult to predict even with high-fidelity models, are within approximately 15% of simulated values. Once trained, our neural networks generate predictions within seconds, rather than the hours needed to run a single simulation.

Submitted 15 March, 2019; v1 submitted 14 October, 2018; originally announced October 2018.

Report number: LA-UR-18-29693

Journal ref: Computational Materials Science 162, 322-332 (2019)

arXiv:1807.05472 [pdf, ps, other]

doi 10.1103/PhysRevE.98.022321

Degree Correlations Amplify the Growth of Cascades in Networks

Authors: Xin-Zeng Wu, Peter G. Fennell, Allon G. Percus, Kristina Lerman

Abstract: Networks facilitate the spread of cascades, allowing a local perturbation to percolate via interactions between nodes and their neighbors. We investigate how network structure affects the dynamics of a spreading cascade. By accounting for the joint degree distribution of a network within a generating function framework, we can quantify how degree correlations affect both the onset of global cascades and the propensity of nodes of specific degree class to trigger large cascades. However, not all degree correlations are equally important in a spreading process. We introduce a new measure of degree assortativity that accounts for correlations among nodes relevant to a spreading cascade. We show that the critical point defining the onset of global cascades has a monotone relationship to this new assortativity measure. In addition, we show that the choice of nodes to seed the largest cascades is strongly affected by degree correlations. Contrary to traditional wisdom, when degree assortativity is positive, low degree nodes are more likely to generate largest cascades. Our work suggests that it may be possible to tailor spreading processes by manipulating the higher-order structure of networks.

Submitted 14 July, 2018; originally announced July 2018.

Comments: 9 pages, 8 figures

Journal ref: Phys. Rev. E 98, 022321 (2018)

arXiv:1802.06287 [pdf, other]

Unsupervised vehicle recognition using incremental reseeding of acoustic signatures

Authors: Justin Sunu, Blake Hunter, Allon G. Percus

Abstract: Vehicle recognition and classification have broad applications, ranging from traffic flow management to military target identification. We demonstrate an unsupervised method for automated identification of moving vehicles from roadside audio sensors. Using a short-time Fourier transform to decompose audio signals, we treat the frequency signature in each time window as an individual data point. We then use a spectral embedding for dimensionality reduction. Based on the leading eigenvectors, we relate the performance of an incremental reseeding algorithm to that of spectral clustering. We find that incremental reseeding accurately identifies individual vehicles using their acoustic signatures.

Submitted 17 February, 2018; originally announced February 2018.

arXiv:1705.09869 [pdf, other]

Dimensionality reduction for acoustic vehicle classification with spectral embedding

Authors: Justin Sunu, Allon G. Percus

Abstract: We propose a method for recognizing moving vehicles, using data from roadside audio sensors. This problem has applications ranging widely, from traffic analysis to surveillance. We extract a frequency signature from the audio signal using a short-time Fourier transform, and treat each time window as an individual data point to be classified. By applying a spectral embedding, we decrease the dimensionality of the data sufficiently for K-nearest neighbors to provide accurate vehicle identification.

Submitted 17 February, 2018; v1 submitted 27 May, 2017; originally announced May 2017.

Comments: Proceedings of the 15th IEEE International Conference on Networking, Sensing and Control (2018)

arXiv:1705.09866 [pdf, other]

doi 10.1007/s10596-018-9720-1

Machine learning for graph-based representations of three-dimensional discrete fracture networks

Authors: Manuel Valera, Zhengyang Guo, Priscilla Kelly, Sean Matz, Vito Adrian Cantu, Allon G. Percus, Jeffrey D. Hyman, Gowri Srinivasan, Hari S. Viswanathan

Abstract: Structural and topological information play a key role in modeling flow and transport through fractured rock in the subsurface. Discrete fracture network (DFN) computational suites such as dfnWorks are designed to simulate flow and transport in such porous media. Flow and transport calculations reveal that a small backbone of fractures exists, where most flow and transport occurs. Restricting the flowing fracture network to this backbone provides a significant reduction in the network's effective size. However, the particle tracking simulations needed to determine the reduction are computationally intensive. Such methods may be impractical for large systems or for robust uncertainty quantification of fracture networks, where thousands of forward simulations are needed to bound system behavior. In this paper, we develop an alternative network reduction approach to characterizing transport in DFNs, by combining graph theoretical and machine learning methods. We consider a graph representation where nodes signify fractures and edges denote their intersections. Using random forest and support vector machines, we rapidly identify a subnetwork that captures the flow patterns of the full DFN, based primarily on node centrality features in the graph. Our supervised learning techniques train on particle-tracking backbone paths found by dfnWorks, but run in negligible time compared to those simulations. We find that our predictions can reduce the network to approximately 20% of its original size, while still generating breakthrough curves consistent with those of the original network.

Submitted 29 January, 2018; v1 submitted 27 May, 2017; originally announced May 2017.

Comments: Computational Geosciences (2018)

Report number: LA-UR-17-24300

Journal ref: Computational Geosciences 22, 695-710 (2018)

arXiv:1612.08200 [pdf, ps, other]

Neighbor-Neighbor Correlations Explain Measurement Bias in Networks

Authors: Xin-Zeng Wu, Allon G. Percus, Kristina Lerman

Abstract: In numerous physical models on networks, dynamics are based on interactions that exclusively involve properties of a node's nearest neighbors. However, a node's local view of its neighbors may systematically bias perceptions of network connectivity or the prevalence of certain traits. We investigate the strong friendship paradox, which occurs when the majority of a node's neighbors have more neighbors than does the node itself. We develop a model to predict the magnitude of the paradox, showing that it is enhanced by negative correlations between degrees of neighboring nodes. We then show that by including neighbor-neighbor correlations, which are degree correlations one step beyond those of neighboring nodes, we accurately predict the impact of the strong friendship paradox in real-world networks. Understanding how the paradox biases local observations can inform better measurements of network structure and our understanding of collective phenomena.

Submitted 24 December, 2016; originally announced December 2016.

arXiv:1405.4332 [pdf, other]

Partitioning Networks with Node Attributes by Compressing Information Flow

Authors: Laura M. Smith, Linhong Zhu, Kristina Lerman, Allon G. Percus

Abstract: Real-world networks are often organized as modules or communities of similar nodes that serve as functional units. These networks are also rich in content, with nodes having distinguishing features or attributes. In order to discover a network's modular structure, it is necessary to take into account not only its links but also node attributes. We describe an information-theoretic method that identifies modules by compressing descriptions of information flow on a network. Our formulation introduces node content into the description of information flow, which we then minimize to discover groups of nodes with similar attributes that also tend to trap the flow of information. The method has several advantages: it is conceptually simple and does not require ad-hoc parameters to specify the number of modules or to control the relative contribution of links and node attributes to network structure. We apply the proposed method to partition real-world networks with known community structure. We demonstrate that adding node attributes helps recover the underlying community structure in content-rich networks more effectively than using links alone. In addition, we show that our method is faster and more accurate than alternative state-of-the-art algorithms.

Submitted 16 May, 2014; originally announced May 2014.

Comments: 10 pages

arXiv:1405.2102 [pdf, other]

Improving Image Clustering using Sparse Text and the Wisdom of the Crowds

Authors: Anna Ma, Arjuna Flenner, Deanna Needell, Allon G. Percus

Abstract: We propose a method to improve image clustering using sparse text and the wisdom of the crowds. In particular, we present a method to fuse two different kinds of document features, image and text features, and use a common dictionary or "wisdom of the crowds" as the connection between the two different kinds of documents. With the proposed fusion matrix, we use topic modeling via non-negative matrix factorization to cluster documents.

Submitted 8 May, 2014; originally announced May 2014.

arXiv:1306.1298 [pdf, other]

Multiclass Semi-Supervised Learning on Graphs using Ginzburg-Landau Functional Minimization

Authors: Cristina Garcia-Cardona, Arjuna Flenner, Allon G. Percus

Abstract: We present a graph-based variational algorithm for classification of high-dimensional data, generalizing the binary diffuse interface model to the case of multiple classes. Motivated by total variation techniques, the method involves minimizing an energy functional made up of three terms. The first two terms promote a stepwise continuous classification function with sharp transitions between classes, while preserving symmetry among the class labels. The third term is a data fidelity term, allowing us to incorporate prior information into the model in a semi-supervised framework. The performance of the algorithm on synthetic data, as well as on the COIL and MNIST benchmark datasets, is competitive with state-of-the-art graph-based multiclass segmentation methods.

Submitted 6 June, 2013; originally announced June 2013.

Comments: 16 pages, to appear in Springer's Lecture Notes in Computer Science volume "Pattern Recognition Applications and Methods 2013", part of series on Advances in Intelligent and Soft Computing

ACM Class: I.5.3

arXiv:1303.2663 [pdf, other]

doi 10.1103/PhysRevE.88.042813

Spectral Clustering with Epidemic Diffusion

Authors: Laura M. Smith, Kristina Lerman, Cristina Garcia-Cardona, Allon G. Percus, Rumi Ghosh

Abstract: Spectral clustering is widely used to partition graphs into distinct modules or communities. Existing methods for spectral clustering use the eigenvalues and eigenvectors of the graph Laplacian, an operator that is closely associated with random walks on graphs. We propose a new spectral partitioning method that exploits the properties of epidemic diffusion. An epidemic is a dynamic process that, unlike the random walk, simultaneously transitions to all the neighbors of a given node. We show that the replicator, an operator describing epidemic diffusion, is equivalent to the symmetric normalized Laplacian of a reweighted graph with edges reweighted by the eigenvector centralities of their incident nodes. Thus, more weight is given to edges connecting more central nodes. We describe a method that partitions the nodes based on the componentwise ratio of the replicator's second eigenvector to the first, and compare its performance to traditional spectral clustering techniques on synthetic graphs with known community structure. We demonstrate that the replicator gives preference to dense, clique-like structures, enabling it to more effectively discover communities that may be obscured by dense intercommunity linking.

Submitted 4 October, 2013; v1 submitted 11 March, 2013; originally announced March 2013.

Comments: 6 pages, to appear in Physical Review E

ACM Class: I.5.3

arXiv:1301.7320 [pdf, ps, other]

The phase transition in inhomogeneous random intersection graphs

Authors: Milan Bradonjić, Aric Hagberg, Nicolas W. Hengartner, Nathan Lemons, Allon G. Percus

Abstract: We analyze the component evolution in inhomogeneous random intersection graphs when the average degree is close to 1. As the average degree increases, the size of the largest component in the random intersection graph goes through a phase transition. We give bounds on the size of the largest components before and after this transition. We also prove that the largest component after the transition is unique. These results are similar to the phase transition in Erdős-Rényi random graphs; one notable difference is that the jump in the size of the largest component varies in size depending on the parameters of the random intersection graph.

Submitted 30 January, 2013; originally announced January 2013.

Comments: 18 pages

arXiv:1212.0945 [pdf, other]

Multiclass Diffuse Interface Models for Semi-Supervised Learning on Graphs

Authors: Cristina Garcia-Cardona, Arjuna Flenner, Allon G. Percus

Abstract: We present a graph-based variational algorithm for multiclass classification of high-dimensional data, motivated by total variation techniques. The energy functional is based on a diffuse interface model with a periodic potential. We augment the model by introducing an alternative measure of smoothness that preserves symmetry among the class labels. Through this modification of the standard Laplacian, we construct an efficient multiclass method that allows for sharp transitions between classes. The experimental results demonstrate that our approach is competitive with the state of the art among other graph-based algorithms.

Submitted 5 December, 2012; originally announced December 2012.

Comments: 9 pages, to appear in Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods (ICPRAM 2013)

ACM Class: I.5.3

arXiv:1005.5475 [pdf, ps, other]

doi 10.1007/978-3-642-18009-5_5

Component Evolution in General Random Intersection Graphs

Authors: Milan Bradonjic, Aric Hagberg, Nicolas W. Hengartner, Allon G. Percus

Abstract: Random intersection graphs (RIGs) are an important random structure with applications in social networks, epidemic networks, blog readership, and wireless sensor networks. RIGs can be interpreted as a model for large randomly formed non-metric data sets. We analyze the component evolution in general RIGs, and give conditions on existence and uniqueness of the giant component. Our techniques generalize existing methods for analysis of component evolution: we analyze survival and extinction properties of a dependent, inhomogeneous Galton-Watson branching process on general RIGs. Our analysis relies on bounding the branching processes and inherits the fundamental concepts of the study of component evolution in Erdős-Rényi graphs. The major challenge comes from the underlying structure of RIGs, which involves both the set of nodes and the set of attributes, as well as the set of different probabilities among the nodes and attributes.

Submitted 29 May, 2010; originally announced May 2010.

arXiv:0808.1549 [pdf, ps, other]

doi 10.1063/1.3043666

The Peculiar Phase Structure of Random Graph Bisection

Authors: Allon G. Percus, Gabriel Istrate, Bruno Goncalves, Robert Z. Sumi, Stefan Boettcher

Abstract: The mincut graph bisection problem involves partitioning the n vertices of a graph into disjoint subsets, each containing exactly n/2 vertices, while minimizing the number of "cut" edges with an endpoint in each subset. When considered over sparse random graphs, the phase structure of the graph bisection problem displays certain familiar properties, but also some surprises. It is known that when the mean degree is below the critical value of 2 log 2, the cutsize is zero with high probability. We study how the minimum cutsize increases with mean degree above this critical threshold, finding a new analytical upper bound that improves considerably upon previous bounds. Combined with recent results on expander graphs, our bound suggests the unusual scenario that random graph bisection is replica symmetric up to and beyond the critical threshold, with a replica symmetry breaking transition possibly taking place above the threshold. An intriguing algorithmic consequence is that although the problem is NP-hard, we can find near-optimal cutsizes (whose ratio to the optimal value approaches 1 asymptotically) in polynomial time for typical instances near the phase transition.

Submitted 19 November, 2008; v1 submitted 11 August, 2008; originally announced August 2008.

Comments: substantially revised section 2, changed figures 3, 4 and 6, made minor stylistic changes and added references

Report number: LA-UR 08-5099

Journal ref: J. Math. Phys. 49, 125219 (2008)
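
The bisection objective defined in the abstract above can be made concrete with a small brute-force sketch. This is only an illustration of the problem definition (equal-sized subsets, count of edges with an endpoint in each), not the paper's method; the function name `min_bisection_cutsize` and the edge-list representation are assumptions made for this example, and the exhaustive search is practical only for very small graphs.

```python
from itertools import combinations


def min_bisection_cutsize(n, edges):
    """Minimum number of cut edges over all splits into two subsets of size n/2.

    Hypothetical helper for illustration: vertices are 0..n-1 and `edges`
    is a list of (u, v) pairs. Exhaustive search, so exponential in n.
    """
    if n < 2 or n % 2 != 0:
        raise ValueError("n must be an even integer >= 2")
    best = None
    # Pin vertex 0 to the first subset so each bisection is visited once.
    for rest in combinations(range(1, n), n // 2 - 1):
        side_a = {0, *rest}
        # An edge is cut when exactly one endpoint lies in side_a.
        cut = sum((u in side_a) != (v in side_a) for u, v in edges)
        if best is None or cut < best:
            best = cut
    return best


if __name__ == "__main__":
    # A 4-cycle: every balanced split cuts at least two edges.
    print(min_bisection_cutsize(4, [(0, 1), (1, 2), (2, 3), (3, 0)]))
```

Two disconnected edges on four vertices give a cutsize of zero, which is the small-scale analogue of the zero-cutsize regime the abstract describes for sparse graphs.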

arXiv:cs/0503082 [pdf, ps, other]

Spines of Random Constraint Satisfaction Problems: Definition and Connection with Computational Complexity

Authors: Gabriel Istrate, Stefan Boettcher, Allon G. Percus

Abstract: We study the connection between the order of phase transitions in combinatorial problems and the complexity of decision algorithms for such problems. We rigorously show that, for a class of random constraint satisfaction problems, a limited connection between the two phenomena indeed exists. Specifically, we extend the definition of the spine order parameter of Bollobas et al. to random constraint satisfaction problems, rigorously showing that for such problems a discontinuity of the spine is associated with a $2^{Ω(n)}$ resolution complexity (and thus a $2^{Ω(n)}$ complexity of DPLL algorithms) on random instances. The two phenomena have a common underlying cause: the emergence of ``large'' (linear size) minimally unsatisfiable subformulas of a random formula at the satisfiability phase transition. We present several further results that add weight to the intuition that random constraint satisfaction problems with a sharp threshold and a continuous spine are ``qualitatively similar to random 2-SAT''. Finally, we argue that it is the spine rather than the backbone parameter whose continuity has implications for the decision complexity of combinatorial problems, and we provide experimental evidence that the two parameters can behave in a different manner.

Submitted 29 March, 2005; originally announced March 2005.

Comments: A revised version of this paper will appear in Annals of Mathematics and Artificial Intelligence

arXiv:cond-mat/0402282 [pdf, ps, other]

doi 10.1103/PhysRevE.69.066703

Extremal Optimization at the Phase Transition of the 3-Coloring Problem

Authors: Stefan Boettcher, Allon G. Percus

Abstract: We investigate the phase transition of the 3-coloring problem on random graphs, using the extremal optimization heuristic. 3-coloring is among the hardest combinatorial optimization problems and is closely related to a 3-state anti-ferromagnetic Potts model. Like many other such optimization problems, it has been shown to exhibit a phase transition in its ground state behavior under variation of a system parameter: the graph's mean vertex degree. This phase transition is often associated with the instances of highest complexity. We use extremal optimization to measure the ground state cost and the ``backbone'', an order parameter related to ground state overlap, averaged over a large number of instances near the transition for random graphs of size $n$ up to 512. For graphs up to this size, benchmarks show that extremal optimization reaches ground states and explores a sufficient number of them to give the correct backbone value after about $O(n^{3.5})$ update steps. Finite size scaling gives a critical mean degree value $α_{\rm c}=4.703(28)$. Furthermore, the exploration of the degenerate ground states indicates that the backbone order parameter, measuring the constrainedness of the problem, exhibits a first-order phase transition.

Submitted 10 February, 2004; originally announced February 2004.

Comments: RevTex4, 8 pages, 4 postscript figures, related information available at http://www.physics.emory.edu/faculty/boettcher/

Journal ref: Physical Review E 69, 066703 (2004).

arXiv:cond-mat/0301035 [pdf, ps, other]

doi 10.1073/pnas.1635191100

Scaling and Universality in Continuous Length Combinatorial Optimization

Authors: David Aldous, Allon G. Percus

Abstract: We consider combinatorial optimization problems defined over random ensembles, and study how solution cost increases when the optimal solution undergoes a small perturbation delta. For the minimum spanning tree, the increase in cost scales as delta^2; for the mean-field and Euclidean minimum matching and traveling salesman problems in dimension d>=2, the increase scales as delta^3; this is observed in Monte Carlo simulations in d=2,3,4 and in theoretical analysis of a mean-field model. We speculate that the scaling exponent could serve to classify combinatorial optimization problems into a small number of distinct categories, similar to universality classes in statistical physics.

Submitted 13 August, 2003; v1 submitted 3 January, 2003; originally announced January 2003.

Comments: 5 pages; 3 figures

Report number: LA-UR-02-7322

arXiv:cs/0209030 [pdf, ps, other]

Extremal Optimization: an Evolutionary Local-Search Algorithm

Authors: Stefan Boettcher, Allon G. Percus

Abstract: A recently introduced general-purpose heuristic for finding high-quality solutions for many hard optimization problems is reviewed. The method is inspired by recent progress in understanding far-from-equilibrium phenomena in terms of {\em self-organized criticality,} a concept introduced to describe emergent complexity in physical systems. This method, called {\em extremal optimization,} successively replaces the value of extremely undesirable variables in a sub-optimal solution with new, random ones. Large, avalanche-like fluctuations in the cost function self-organize from this dynamics, effectively scaling barriers to explore local optima in distant neighborhoods of the configuration space while eliminating the need to tune parameters. Drawing upon models used to simulate the dynamics of granular media, evolution, or geology, extremal optimization complements approximation methods inspired by equilibrium statistical physics, such as {\em simulated annealing}. It may be but one example of applying new insights into {\em non-equilibrium phenomena} systematically to hard optimization problems. This method is widely applicable and so far has proved competitive with -- and even superior to -- more elaborate general-purpose heuristics on testbeds of constrained optimization problems with up to $10^5$ variables, such as bipartitioning, coloring, and satisfiability. Analysis of a suitable model predicts the only free parameter of the method in accordance with all experimental results.

Submitted 26 September, 2002; originally announced September 2002.

Comments: Latex, 17 pages, to appear in the {\it Proceedings of the 8th INFORMS Computing Society Conference,} (2003)

ACM Class: I.2.8

arXiv:cond-mat/0104214 [pdf, ps, other]

doi 10.1103/PhysRevE.64.026114

Extremal Optimization for Graph Partitioning

Authors: S. Boettcher, A. G. Percus

Abstract: Extremal optimization is a new general-purpose method for approximating solutions to hard optimization problems. We study the method in detail by way of the NP-hard graph partitioning problem. We discuss the scaling behavior of extremal optimization, focusing on the convergence of the average run as a function of runtime and system size. The method has a single free parameter, which we determine numerically and justify using a simple argument. Our numerical results demonstrate that on random graphs, extremal optimization maintains consistent accuracy for increasing system sizes, with an approximation error decreasing over runtime roughly as a power law t^(-0.4). On geometrically structured graphs, the scaling of results from the average run suggests that these are far from optimal, with large fluctuations between individual trials. But when only the best runs are considered, results consistent with theoretical arguments are recovered.

Submitted 11 April, 2001; originally announced April 2001.

Comments: 34 pages, RevTex4, 1 table and 20 ps-figures included, related papers available at http://www.physics.emory.edu/faculty/boettcher/

Journal ref: Phys. Rev. E, 64 (2001) 026114
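
The abstract above describes extremal optimization only in outline: repeatedly replace the most undesirable variables of a single solution, with one free parameter. The following is a rough sketch of that idea for graph bipartitioning, under assumptions that the listing does not state: the per-vertex fitness (fraction of a vertex's edges that stay inside its own subset), the power-law rank selection with exponent `tau`, and the swap move that keeps the subsets balanced are illustrative choices, and the function name and default values are hypothetical.

```python
import random


def extremal_optimization_bisection(n, edges, steps=2000, tau=1.4, seed=0):
    """Heuristic balanced bipartition; returns (best_cut, best_side).

    Illustrative sketch only. Vertices are 0..n-1 (n even, n >= 2) and
    `edges` is a list of (u, v) pairs. `tau` is the assumed free parameter.
    """
    rng = random.Random(seed)
    neighbors = [[] for _ in range(n)]
    for u, v in edges:
        neighbors[u].append(v)
        neighbors[v].append(u)

    # Random balanced starting assignment: side[v] is 0 or 1.
    order = list(range(n))
    rng.shuffle(order)
    side = [0] * n
    for v in order[: n // 2]:
        side[v] = 1

    def cutsize():
        return sum(side[u] != side[v] for u, v in edges)

    def fitness(v):
        # Assumed fitness: fraction of v's edges that are not cut.
        if not neighbors[v]:
            return 1.0
        return sum(side[w] == side[v] for w in neighbors[v]) / len(neighbors[v])

    # Rank k (1 = least fit) is selected with weight k**(-tau).
    weights = [k ** -tau for k in range(1, n + 1)]
    best_cut, best_side = cutsize(), side[:]
    for _ in range(steps):
        ranked = sorted(range(n), key=fitness)  # least fit first
        v = rng.choices(ranked, weights=weights)[0]
        # Swap with a random vertex on the other side to preserve balance.
        partner = rng.choice([w for w in range(n) if side[w] != side[v]])
        side[v], side[partner] = side[partner], side[v]
        cut = cutsize()
        if cut < best_cut:
            best_cut, best_side = cut, side[:]
    return best_cut, best_side


if __name__ == "__main__":
    # Two triangles joined by a single edge.
    g = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
    print(extremal_optimization_bisection(6, g))
```

Every move is accepted regardless of whether it improves the cut, so the sketch records the best partition seen so far rather than the final state. Recomputing the cutsize and all fitnesses at each step keeps the code short at the expense of speed.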

arXiv:cond-mat/0010337 [pdf, ps, other]

doi 10.1103/PhysRevLett.86.5211

Optimization with Extremal Dynamics

Authors: S. Boettcher, A. G. Percus

Abstract: We explore a new general-purpose heuristic for finding high-quality solutions to hard optimization problems. The method, called extremal optimization, is inspired by self-organized criticality, a concept introduced to describe emergent complexity in physical systems. Extremal optimization successively replaces extremely undesirable variables of a single sub-optimal solution with new, random ones. Large fluctuations ensue, that efficiently explore many local optima. With only one adjustable parameter, the heuristic's performance has proven competitive with more elaborate methods, especially near phase transitions which are believed to coincide with the hardest instances. We use extremal optimization to elucidate the phase transition in the 3-coloring problem, and we provide independent confirmation of previously reported extrapolations for the ground-state energy of +-J spin glasses in d=3 and 4.

Submitted 8 April, 2001; v1 submitted 23 October, 2000; originally announced October 2000.

Comments: 4 pages, RevTex4, 1 table and 3 ps-figures included, as to appear in PRL, related papers available at http://www.physics.emory.edu/faculty/boettcher/

Journal ref: Phys. Rev. Lett, 86 (2001) 5211

arXiv:math/9904056 [pdf, ps, other]

Extremal Optimization: Methods derived from Co-Evolution

Authors: Stefan Boettcher, Allon G. Percus

Abstract: We describe a general-purpose method for finding high-quality solutions to hard optimization problems, inspired by self-organized critical models of co-evolution such as the Bak-Sneppen model. The method, called Extremal Optimization, successively eliminates extremely undesirable components of sub-optimal solutions, rather than ``breeding'' better components. In contrast to Genetic Algorithms which operate on an entire ``gene-pool'' of possible solutions, Extremal Optimization improves on a single candidate solution by treating each of its components as species co-evolving according to Darwinian principles. Unlike Simulated Annealing, its non-equilibrium approach effects an algorithm requiring few parameters to tune. With only one adjustable parameter, its performance proves competitive with, and often superior to, more elaborate stochastic optimization procedures. We demonstrate it here on two classic hard optimization problems: graph partitioning and the traveling salesman problem.

Submitted 13 April, 1999; originally announced April 1999.

Comments: 8 pages, Latex, 5 ps-figures included. To appear in ``GECCO-99: Proceedings of the Genetic and Evolutionary Computation Conference,'' (Morgan Kaufmann, San Francisco, 1999)

arXiv:cond-mat/9803104 [pdf, ps, other]

The Traveling Salesman and Related Stochastic Problems

Authors: A. G. Percus

Abstract: In the traveling salesman problem, one must find the length of the shortest closed tour visiting given ``cities''. We study the stochastic version of the problem, taking the locations of cities and the distances separating them to be random variables drawn from an ensemble. We consider first the ensemble where cities are placed in Euclidean space. We investigate how the optimum tour length scales with number of cities and with number of spatial dimensions. We then examine the analytical theory behind the random link ensemble, where distances between cities are independent random variables. Finally, we look at the related geometric issue of nearest neighbor distances, and find some remarkable universalities.

Submitted 10 March, 1998; originally announced March 1998.

Comments: PhD Thesis; 106 pages. Longer version (1841K) with higher quality reprint images is available at http://www.lanl.gov/home/percus/

arXiv:math/9802117 [pdf, ps, other]

Scaling universalities of kth-nearest neighbor distances on closed manifolds

Authors: A. G. Percus, O. C. Martin

Abstract: Take N sites distributed randomly and uniformly on a smooth closed surface. We express the expected distance <D_k(N)> from an arbitrary point on the surface to its kth-nearest neighboring site, in terms of the function A(l) giving the area of a disc of radius l about that point. We then find two universalities. First, for a flat surface, where A(l)=πl^2, the k-dependence and the N-dependence separate in <D_k(N)>. All kth-nearest neighbor distances thus have the same scaling law in N. Second, for a curved surface, the average \int <D_k(N)> dμ over the surface is a topological invariant at leading and subleading order in a large N expansion. The 1/N scaling series then depends, up through O(1/N), only on the surface's topology and not on its precise shape. We discuss the case of higher dimensions (d>2), and also interpret our results using Regge calculus.

Submitted 25 February, 1998; originally announced February 1998.

Comments: 14 pages, 2 figures; submitted to Advances in Applied Mathematics

MSC Class: 60D05 (Primary); 51H25 (Secondary)

Journal ref: Advances in Applied Mathematics 21 (1998) 424-436; published version available at web page http://www.lanl.gov/home/percus

arXiv:cond-mat/9802295 [pdf, ps, other]

doi 10.1023/A:1004570713967

The stochastic traveling salesman problem: Finite size scaling and the cavity prediction

Authors: A. G. Percus, O. C. Martin

Abstract: We study the random link traveling salesman problem, where lengths l_ij between city i and city j are taken to be independent, identically distributed random variables. We discuss a theoretical approach, the cavity method, that has been proposed for finding the optimal tour length over this random ensemble, given the assumption of replica symmetry. Using finite size scaling and a renormalized model, we test the cavity predictions against the results of simulations, and find excellent agreement over a range of distributions. We thus provide numerical evidence that the replica symmetric solution to this problem is the correct one. Finally, we note a surprising result concerning the distribution of kth-nearest neighbor links in optimal tours, and invite a theoretical understanding of this phenomenon.

Submitted 6 November, 1998; v1 submitted 26 February, 1998; originally announced February 1998.

Comments: 21 pages, 7 figures; to appear in Journal of Statistical Physics (March 1999); this revision contains final version incorporating some changes

Report number: LA-UR-98-4032

Journal ref: Journal of Statistical Physics 94:5/6 (1999) 739-758

arXiv:cond-mat/9607080 [pdf, ps, other]

doi 10.1051/jp1:1997129

The random link approximation for the Euclidean traveling salesman problem

Authors: N. J. Cerf, J. Boutet de Monvel, O. Bohigas, O. C. Martin, A. G. Percus

Abstract: The traveling salesman problem (TSP) consists of finding the length of the shortest closed tour visiting N ``cities''. We consider the Euclidean TSP where the cities are distributed randomly and independently in a d-dimensional unit hypercube. Working with periodic boundary conditions and inspired by a remarkable universality in the kth nearest neighbor distribution, we find for the average optimum tour length <L_E> = beta_E(d) N^{1-1/d} [1+O(1/N)] with beta_E(2) = 0.7120 +- 0.0002 and beta_E(3) = 0.6979 +- 0.0002. We then derive analytical predictions for these quantities using the random link approximation, where the lengths between cities are taken as independent random variables. From the ``cavity'' equations developed by Krauth, Mezard and Parisi, we calculate the associated random link values beta_RL(d). For d=1,2,3, numerical results show that the random link approximation is a good one, with a discrepancy of less than 2.1% between beta_E(d) and beta_RL(d). For large d, we argue that the approximation is exact up to O(1/d^2) and give a conjecture for beta_E(d), in terms of a power series in 1/d, specifying both leading and subleading coefficients.

Submitted 9 March, 1998; v1 submitted 11 July, 1996; originally announced July 1996.

Comments: 29 pages, 6 figures; formatting and typos corrected

Report number: IPNO/TH 96-07

Journal ref: Journal de Physique I 7:1 (1997) 117-136

Showing 1–32 of 32 results for author: Percus, A G