-
Characterization of MKIDs for CMB observation at 220 GHz with the South Pole Telescope
Authors:
Karia R. Dibert,
Peter S. Barry,
Adam J. Anderson,
Bradford A. Benson,
Thomas Cecil,
Clarence L. Chang,
Kyra N. Fichman,
Kirit Karkare,
Juliang Li,
Tyler Natoli,
Zhaodi Pan,
Maclean Rouble,
Erik Shirokoff,
Matthew Young
Abstract:
We present an updated design of the 220 GHz microwave kinetic inductance detector (MKID) pixel for SPT-3G+, the next-generation camera for the South Pole Telescope. We show results of the dark testing of a 63-pixel array with mean inductor quality factor $Q_i = 4.8 \times 10^5$, aluminum inductor transition temperature $T_c = 1.19$ K, and kinetic inductance fraction $α_k = 0.32$. We optically characterize both the microstrip-coupled and CPW-coupled resonators, and find both have a spectral response close to prediction with an optical efficiency of $η\sim 70\%$. However, we find slightly lower optical response on the lower edge of the band than predicted, with neighboring dark detectors showing more response in this region, though at a level consistent with less than 5\% frequency shift relative to the optical detectors. The detectors show polarized response consistent with expectations, with a cross-polar response of $\sim 10\%$ for both detector orientations.
Submitted 3 April, 2023;
originally announced April 2023.
-
Noise Optimization for MKIDs with Different Design Geometries and Material Selections
Authors:
Z. Pan,
K. R. Dibert,
J. Zhang,
P. S. Barry,
A. J. Anderson,
A. N. Bender,
B. A. Benson,
T. Cecil,
C. L. Chang,
R. Gualtieri,
J. Li,
M. Lisovenko,
V. Novosad,
M. Rouble,
G. Wang,
V. Yefremenko
Abstract:
The separation and optimization of noise components is critical to microwave-kinetic inductance detector (MKID) development. We analyze the effect of several changes to the lumped-element inductor and interdigitated capacitor geometry on the noise performance of a series of MKIDs intended for millimeter-wavelength experiments. We extract the contributions from two-level system noise in the dielectric layer, the generation-recombination noise intrinsic to the superconducting thin-film, and system white noise from each detector noise power spectrum and characterize how these noise components depend on detector geometry, material, and measurement conditions such as driving power and temperature. We observe a reduction in the amplitude of two-level system noise with both an elevated sample temperature and an increased gap between the fingers within the interdigitated capacitors for both aluminum and niobium detectors. We also verify the expected reduction of the generation-recombination noise and associated quasiparticle lifetime with reduced inductor volume. This study also iterates over different materials, including aluminum, niobium, and aluminum manganese, and compares the results with an underlying physical model.
Submitted 3 April, 2023;
originally announced April 2023.
-
Hypergraph patterns and collaboration structure
Authors:
Jonas L. Juul,
Austin R. Benson,
Jon Kleinberg
Abstract:
Humans collaborate in different contexts such as in creative or scientific projects, in workplaces and in sports. Depending on the project and external circumstances, a newly formed collaboration may include people that have collaborated before, and people with no collaboration history. Such existing relationships between team members have been reported to influence the performance of teams. However, it is not clear how existing relationships between team members should be quantified, and whether some relationships are more likely to occur in new collaborations than others. Here we introduce a new family of structural patterns, m-patterns, which formalize relationships between collaborators and we study the prevalence of such structures in data and a simple random-hypergraph null model. We analyze the frequency with which different collaboration structures appear in our null model and show how such frequencies depend on size and hyperedge density in the hypergraphs. Comparing the null model to data of human and non-human collaborations, we find that some collaboration structures are vastly under- and overrepresented in empirical datasets. Finally, we find that structures of scientific collaborations on COVID-19 papers in some cases are statistically significantly different from those of non-COVID-19 papers. Examining citation counts for 4 different scientific fields, we also find indications that repeat collaborations are more successful for 2-author scientific publications and less successful for 3-author scientific publications as compared to other collaboration structures.
Submitted 5 October, 2022;
originally announced October 2022.
-
Conceptual Design of the Modular Detector and Readout System for the CMB-S4 survey experiment
Authors:
D. R. Barron,
Z. Ahmed,
J. Aguilar,
A. J. Anderson,
C. F. Baker,
P. S. Barry,
J. A. Beall,
A. N. Bender,
B. A. Benson,
R. W. Besuner,
T. W. Cecil,
C. L. Chang,
S. C. Chapman,
G. E. Chesmore,
G. Derylo,
W. B. Doriese,
S. M. Duff,
T. Elleflot,
J. P. Filippini,
B. Flaugher,
J. G. Gomez,
P. K. Grimes,
R. Gualtieri,
I. Gullett,
G. Haller
, et al. (25 additional authors not shown)
Abstract:
We present the conceptual design of the modular detector and readout system for the Cosmic Microwave Background Stage 4 (CMB-S4) ground-based survey experiment. CMB-S4 will map the cosmic microwave background (CMB) and the millimeter-wave sky to unprecedented sensitivity, using 500,000 superconducting detectors observing from Chile and Antarctica to map over 60 percent of the sky. The fundamental building block of the detector and readout system is a detector module package operated at 100 mK, which is connected to a readout and amplification chain that carries signals out to room temperature. It uses arrays of feedhorn-coupled orthomode transducers (OMT) that collect optical power from the sky onto dc-voltage-biased transition-edge sensor (TES) bolometers. The resulting current signal in the TESs is then amplified by a two-stage cryogenic Superconducting Quantum Interference Device (SQUID) system with a time-division multiplexer to reduce wire count, and matching room-temperature electronics to condition and transmit signals to the data acquisition system. Sensitivity and systematics requirements are being developed for the detector and readout system over a wide range of observing bands (20 to 300 GHz) and optical powers to accomplish CMB-S4's science goals. While the design incorporates the successes of previous generations of CMB instruments, CMB-S4 requires an order of magnitude more detectors than any prior experiment. This requires fabrication of complex superconducting circuits on over 10 square meters of silicon, as well as significant amounts of precision wiring, assembly and cryogenic testing.
Submitted 3 August, 2022;
originally announced August 2022.
-
Parallelized Domain Decomposition for Multi-Dimensional Lagrangian Random Walk, Mass-Transfer Particle Tracking Schemes
Authors:
Lucas Schauer,
Michael J. Schmidt,
Nicholas B. Engdahl,
Stephen D. Pankavich,
David A. Benson,
Diogo Bolster
Abstract:
We develop a multi-dimensional, parallelized domain decomposition strategy (DDC) for mass-transfer particle tracking (MTPT) methods. These methods are a type of Lagrangian algorithm for simulating reactive transport and are able to be parallelized by employing large numbers of CPU cores to accelerate run times. In this work, we investigate different procedures for "tiling" the domain in two and three dimensions (2-d and 3-d), as this type of formal DDC construction is currently limited to 1-d. An optimal tiling is prescribed based on physical problem parameters and the number of available CPU cores, as each tiling provides distinct results in both accuracy and run time. We further extend the most efficient technique to 3-d for comparison, leading to an analytical discussion of the effect of dimensionality on strategies for implementing DDC schemes. Increasing computational resources (cores) within the DDC method produces a trade-off between inter-node communication and on-node work. For an optimally subdivided diffusion problem, the 2-d parallelized algorithm achieves nearly perfect linear speedup in comparison with the serial run up to around 2700 cores, reducing a 5-hour simulation to 8 seconds, and the 3-d algorithm maintains appreciable speedup up to 1700 cores.
Submitted 7 April, 2022;
originally announced April 2022.
-
A Computational Information Criterion for Particle-Tracking with Sparse or Noisy Data
Authors:
Nhat Thanh Tran,
David A. Benson,
Michael J. Schmidt,
Stephen D. Pankavich
Abstract:
Traditional probabilistic methods for the simulation of advection-diffusion equations (ADEs) often overlook the entropic contribution of the discretization, e.g., the number of particles, within associated numerical methods. Many times, the gain in accuracy of a highly discretized numerical model is outweighed by its associated computational costs or the noise within the data. We address the question of how many particles are needed in a simulation to best approximate and estimate parameters in one-dimensional advective-diffusive transport. To do so, we use the well-known Akaike Information Criterion (AIC) and a recently-developed correction called the Computational Information Criterion (COMIC) to guide the model selection process. Random-walk and mass-transfer particle tracking methods are employed to solve the model equations at various levels of discretization. Numerical results demonstrate that the COMIC provides an optimal number of particles that can describe a more efficient model in terms of parameter estimation and model prediction compared to the model selected by the AIC even when the data is sparse or noisy, the sampling volume is not uniform throughout the physical domain, or the error distribution of the data is non-IID Gaussian.
Submitted 13 June, 2021;
originally announced June 2021.
-
A nonlinear diffusion method for semi-supervised learning on hypergraphs
Authors:
Francesco Tudisco,
Konstantin Prokopchik,
Austin R. Benson
Abstract:
Hypergraphs are a common model for multiway relationships in data, and hypergraph semi-supervised learning is the problem of assigning labels to all nodes in a hypergraph, given labels on just a few nodes. Diffusions and label spreading are classical techniques for semi-supervised learning in the graph setting, and there are some standard ways to extend them to hypergraphs. However, these methods are linear models, and do not offer an obvious way of incorporating node features for making predictions. Here, we develop a nonlinear diffusion process on hypergraphs that spreads both features and labels following the hypergraph structure, which can be interpreted as a hypergraph equilibrium network. Even though the process is nonlinear, we show global convergence to a unique limiting point for a broad class of nonlinearities, which is the global optimum of an interpretable, regularized semi-supervised learning loss function. The limiting point serves as a node embedding from which we make predictions with a linear model. Our approach is much more accurate than several hypergraph neural networks, and also takes less time to train.
Submitted 11 February, 2022; v1 submitted 27 March, 2021;
originally announced March 2021.
-
Higher-order Network Analysis Takes Off, Fueled by Classical Ideas and New Data
Authors:
Austin R. Benson,
David F. Gleich,
Desmond J. Higham
Abstract:
Higher-order network analysis uses the ideas of hypergraphs, simplicial complexes, multilinear and tensor algebra, and more, to study complex systems. These are by now well-established mathematical abstractions. What's new is that the ideas can be tested and refined on the type of large-scale data arising in today's digital world. This research area therefore is making an impact across many applications. Here, we provide a brief history, guide, and survey.
Submitted 8 March, 2021;
originally announced March 2021.
-
Random Graphs with Prescribed $K$-Core Sequences: A New Null Model for Network Analysis
Authors:
Katherine Van Koevering,
Austin R. Benson,
Jon Kleinberg
Abstract:
In the analysis of large-scale network data, a fundamental operation is the comparison of observed phenomena to the predictions provided by null models: when we find an interesting structure in a family of real networks, it is important to ask whether this structure is also likely to arise in random networks with similar characteristics to the real ones. A long-standing challenge in network analysis has been the relative scarcity of reasonable null models for networks; arguably the most common such model has been the configuration model, which starts with a graph $G$ and produces a random graph with the same node degrees as $G$. This leads to a very weak form of null model, since fixing the node degrees does not preserve many of the crucial properties of the network, including the structure of its subgraphs.
Guided by this challenge, we propose a new family of network null models that operate on the $k$-core decomposition. For a graph $G$, the $k$-core is its maximal subgraph of minimum degree $k$; and the core number of a node $v$ in $G$ is the largest $k$ such that $v$ belongs to the $k$-core of $G$. We provide the first efficient sampling algorithm to solve the following basic combinatorial problem: given a graph $G$, produce a random graph sampled nearly uniformly from among all graphs with the same sequence of core numbers as $G$. This opens the opportunity to compare observed networks $G$ with random graphs that exhibit the same core numbers, a comparison that preserves aspects of the structure of $G$ that are not captured by more local measures like the degree sequence. We illustrate the power of this core-based null model on some fundamental tasks in network analysis, including the enumeration of network motifs.
Submitted 24 February, 2021;
originally announced February 2021.
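The core-number definition quoted in the abstract above can be illustrated with the standard "peeling" procedure: repeatedly remove a node of minimum remaining degree and record the largest such degree seen so far. The sketch below only computes core numbers; it is not the sampling algorithm the paper proposes, and the function name `core_numbers` and the edge-list input format are assumptions made for illustration.

```python
from collections import defaultdict


def core_numbers(edges):
    """Return {node: core number} for an undirected simple graph.

    `edges` is an iterable of (u, v) pairs (hypothetical input format).
    Uses the peeling procedure; this simple version is O(n^2) for clarity.
    """
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops
            adj[u].add(v)
            adj[v].add(u)

    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    remaining = set(adj)
    core = {}
    k = 0
    while remaining:
        # Peel a node of minimum remaining degree.
        v = min(remaining, key=degree.get)
        # The core number never decreases as peeling proceeds.
        k = max(k, degree[v])
        core[v] = k
        remaining.remove(v)
        for w in adj[v]:
            if w in remaining:
                degree[w] -= 1
    return core


# A triangle a-b-c with a pendant node d attached to c.
example = core_numbers([("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")])
```

In the example, the triangle nodes belong to the 2-core while the pendant node belongs only to the 1-core, so two graphs with the same degree sequence can still differ in their core-number sequence.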
-
Generative hypergraph clustering: from blockmodels to modularity
Authors:
Philip S. Chodrow,
Nate Veldt,
Austin R. Benson
Abstract:
Hypergraphs are a natural modeling paradigm for a wide range of complex relational systems. A standard analysis task is to identify clusters of closely related or densely interconnected nodes. Many graph algorithms for this task are based on variants of the stochastic blockmodel, a random graph with flexible cluster structure. However, there are few models and algorithms for hypergraph clustering. Here, we propose a Poisson degree-corrected hypergraph stochastic blockmodel (DCHSBM), a generative model of clustered hypergraphs with heterogeneous node degrees and edge sizes. Approximate maximum-likelihood inference in the DCHSBM naturally leads to a clustering objective that generalizes the popular modularity objective for graphs. We derive a general Louvain-type algorithm for this objective, as well as a faster, specialized "All-Or-Nothing" (AON) variant in which edges are expected to lie fully within clusters. This special case encompasses a recent proposal for modularity in hypergraphs, while also incorporating flexible resolution and edge-size parameters. We show that AON hypergraph Louvain is highly scalable, including as an example an experiment on a synthetic hypergraph of one million nodes. We also demonstrate through synthetic experiments that the detectability regimes for hypergraph community detection differ from methods based on dyadic graph projections. We use our generative model to analyze different patterns of higher-order structure in school contact networks, U.S. congressional bill cosponsorship, U.S. congressional committees, product categories in co-purchasing behavior, and hotel locations from web browsing sessions, finding interpretable higher-order structure. We then study the behavior of our AON hypergraph Louvain algorithm, finding that it is able to recover ground truth clusters in empirical data sets exhibiting the corresponding higher-order structure.
Submitted 18 August, 2021; v1 submitted 23 January, 2021;
originally announced January 2021.
-
Nonparametric, data-based kernel interpolation for particle-tracking simulations and kernel density estimation
Authors:
David A Benson,
Diogo Bolster,
Stephen Pankavich,
Michael J Schmidt
Abstract:
Traditional interpolation techniques for particle tracking include binning and convolutional formulas that use pre-determined (i.e., closed-form, parametric) kernels. In many instances, the particles are introduced as point sources in time and space, so the cloud of particles (either in space or time) is a discrete representation of the Green's function of an underlying PDE. As such, each particle is a sample from the Green's function; therefore, each particle should be distributed according to the Green's function. In short, the kernel of a convolutional interpolation of the particle sample "cloud" should be a replica of the cloud itself. This idea gives rise to an iterative method by which the form of the kernel may be discerned in the process of interpolating the Green's function. When the Green's function is a density, this method is broadly applicable to interpolating a kernel density estimate based on random data drawn from a single distribution. We formulate and construct the algorithm and demonstrate its ability to perform kernel density estimation of skewed and/or heavy-tailed data including breakthrough curves.
Submitted 13 October, 2020;
originally announced October 2020.
-
A simple bipartite graph projection model for clustering in networks
Authors:
Austin R. Benson,
Paul Liu,
Hao Yin
Abstract:
Graph datasets are frequently constructed by a projection of a bipartite graph, where two nodes are connected in the projection if they share a common neighbor in the bipartite graph; for example, a coauthorship graph is a projection of an author-publication bipartite graph. Analyzing the structure of the projected graph is common, but we do not have a good understanding of the consequences of the projection on such analyses. Here, we propose and analyze a random graph model to study what properties we can expect from the projection step. Our model is based on a Chung-Lu random graph for constructing the bipartite representation, which enables us to rigorously analyze the projected graph. We show that common network properties such as sparsity, heavy-tailed degree distributions, local clustering at nodes, the inverse relationship between node degree, and global transitivity can be explained and analyzed through this simple model. We also develop a fast sampling algorithm for our model, which we show is provably optimal for certain input distributions. Numerical simulations where model parameters come from real-world datasets show that much of the clustering behavior in some datasets can just be explained by the projection step.
Submitted 1 July, 2020;
originally announced July 2020.
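The projection step described in the abstract above (two nodes are connected if they share a common neighbor in the bipartite graph) can be sketched in a few lines. This is a minimal illustration of the projection only, not the paper's random graph model or its sampling algorithm; the function name `project` and the (author, paper) pair format are assumptions chosen to match the coauthorship example.

```python
from collections import defaultdict
from itertools import combinations


def project(author_paper_pairs):
    """Project an author-paper bipartite graph onto the authors.

    `author_paper_pairs` is an iterable of (author, paper) pairs
    (hypothetical input format). Returns a set of undirected edges,
    each stored as a sorted (u, v) tuple with u < v.
    """
    paper_to_authors = defaultdict(set)
    for author, paper in author_paper_pairs:
        paper_to_authors[paper].add(author)

    edges = set()
    for authors in paper_to_authors.values():
        # Every pair of coauthors on the same paper becomes an edge,
        # so a paper with s authors contributes a clique on s nodes.
        for u, v in combinations(sorted(authors), 2):
            edges.add((u, v))
    return edges


pairs = [("A", "p1"), ("B", "p1"), ("B", "p2"), ("C", "p2"),
         ("D", "p3"), ("E", "p3"), ("F", "p3")]
projected = project(pairs)
```

Because each paper with several authors contributes a full clique (here, the three-author paper `p3` yields a triangle), the projection creates triangles by construction, which is one way to see why clustering in a projected graph can arise from the projection step itself.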
-
Hypergraph Clustering for Finding Diverse and Experienced Groups
Authors:
Ilya Amburg,
Nate Veldt,
Austin R. Benson
Abstract:
When forming a team or group of individuals, we often seek a balance of expertise in a particular task while at the same time maintaining diversity of skills within each group. Here, we view the problem of finding diverse and experienced groups as clustering in hypergraphs with multiple edge types. The input data is a hypergraph with multiple hyperedge types -- representing information about past experiences of groups of individuals -- and the output is groups of nodes. In contrast to related problems on fair or balanced clustering, we model diversity in terms of variety of past experience (instead of, e.g., protected attributes), with a goal of forming groups that have both experience and diversity with respect to participation in edge types. In other words, both diversity and experience are measured from the types of the hyperedges.
Our clustering model is based on a regularized version of an edge-based hypergraph clustering objective, and we also show how naive objectives actually have no diversity-experience tradeoff. Although our objective function is NP-hard to optimize, we design an efficient 2-approximation algorithm and also show how to compute bounds for the regularization hyperparameter that lead to meaningful diversity-experience tradeoffs. We demonstrate an application of this framework in online review platforms, where the goal is to curate sets of user reviews for a product type. In this context, "experience" corresponds to users familiar with the type of product, and "diversity" to users that have reviewed related products.
Submitted 27 October, 2020; v1 submitted 10 June, 2020;
originally announced June 2020.
-
Nonlinear Higher-Order Label Spreading
Authors:
Francesco Tudisco,
Austin R. Benson,
Konstantin Prokopchik
Abstract:
Label spreading is a general technique for semi-supervised learning with point cloud or network data, which can be interpreted as a diffusion of labels on a graph. While there are many variants of label spreading, nearly all of them are linear models, where the incoming information to a node is a weighted sum of information from neighboring nodes. Here, we add nonlinearity to label spreading through nonlinear functions of higher-order structure in the graph, namely triangles in the graph. For a broad class of nonlinear functions, we prove convergence of our nonlinear higher-order label spreading algorithm to the global solution of a constrained semi-supervised loss function. We demonstrate the efficiency and efficacy of our approach on a variety of point cloud and network datasets, where the nonlinear higher-order model compares favorably to classical label spreading, as well as hypergraph models and graph neural networks.
Submitted 8 June, 2020;
originally announced June 2020.
-
Broadband, millimeter-wave antireflection coatings for large-format, cryogenic aluminum oxide optics
Authors:
A. Nadolski,
J. D. Vieira,
J. A. Sobrin,
A. M. Kofman,
P. A. R. Ade,
Z. Ahmed,
A. J. Anderson,
J. S. Avva,
R. Basu Thakur,
A. N. Bender,
B. A. Benson,
L. Bryant,
J. E. Carlstrom,
F. W. Carter,
T. W. Cecil,
C. L. Chang,
J. R. Cheshire IV,
G. E. Chesmore,
J. F. Cliche,
A. Cukierman,
T. de Haan,
M. Dierickx,
J. Ding,
D. Dutcher,
W. Everett
, et al. (64 additional authors not shown)
Abstract:
We present two prescriptions for broadband (~77 - 252 GHz), millimeter-wave antireflection coatings for cryogenic, sintered polycrystalline aluminum oxide optics: one for large-format (700 mm diameter) planar and plano-convex elements, the other for densely packed arrays of quasi-optical elements, in our case 5 mm diameter half-spheres (called "lenslets"). The coatings comprise three layers of commercially-available, polytetrafluoroethylene-based, dielectric sheet material. The lenslet coating is molded to fit the 150 mm diameter arrays directly while the large-diameter lenses are coated using a tiled approach. We review the fabrication processes for both prescriptions then discuss laboratory measurements of their transmittance and reflectance. In addition, we present the inferred refractive indices and loss tangents for the coating materials and the aluminum oxide substrate. We find that at 150 GHz and 300 K the large-format coating sample achieves (97 +/- 2)% transmittance and the lenslet coating sample achieves (94 +/- 3)% transmittance.
Submitted 2 March, 2020; v1 submitted 6 December, 2019;
originally announced December 2019.
-
Clustering in graphs and hypergraphs with categorical edge labels
Authors:
Ilya Amburg,
Nate Veldt,
Austin R. Benson
Abstract:
Modern graph or network datasets often contain rich structure that goes beyond simple pairwise connections between nodes. This calls for complex representations that can capture, for instance, edges of different types as well as so-called "higher-order interactions" that involve more than two nodes at a time. However, we have fewer rigorous methods that can provide insight from such representations. Here, we develop a computational framework for the problem of clustering hypergraphs with categorical edge labels (or different interaction types), where clusters correspond to groups of nodes that frequently participate in the same type of interaction.
Our methodology is based on a combinatorial objective function that is related to correlation clustering on graphs but enables the design of much more efficient algorithms that also seamlessly generalize to hypergraphs. When there are only two label types, our objective can be optimized in polynomial time, using an algorithm based on minimum cuts. Minimizing our objective becomes NP-hard with more than two label types, but we develop fast approximation algorithms based on linear programming relaxations that have theoretical cluster quality guarantees. We demonstrate the efficacy of our algorithms and the scope of the model through problems in edge-label community detection, clustering with temporal data, and exploratory data analysis.
Submitted 17 February, 2020; v1 submitted 22 October, 2019;
originally announced October 2019.
-
Reactive Particle-tracking Solutions to a Benchmark Problem on Heavy Metal Cycling in Lake Sediments
Authors:
Michael J. Schmidt,
Stephen D. Pankavich,
Alexis Navarre-Sitchler,
Nicholas B. Engdahl,
Diogo Bolster,
David A. Benson
Abstract:
Geochemical systems are known to exhibit highly variable spatiotemporal behavior. This may be observed both in non-smooth concentration curves in space for a single sampling time and also in variability between samples taken from the same location at different times. However, most models that are designed to simulate these systems provide only single-solution smooth curves and fail to capture the noise and variability seen in the data. We apply a recently developed reactive particle-tracking method to a system that displays highly complex geochemical behavior. When the method is made to most closely resemble a corresponding Eulerian method, in its unperturbed form, we see a near-exact match between solutions of the two models. More importantly, we consider two approaches for perturbing the model and find that the spatially perturbed condition is able to capture a greater degree of the variability present in the data. This method of perturbation is a task to which particle methods are uniquely suited and Eulerian models are not well-suited. Additionally, because of the nature of the algorithm, noisy spatial gradients can be highly resolved by a large number of mobile particles, and this incurs negligible computational cost, as compared to expensive chemistry calculations.
Submitted 26 August, 2019;
originally announced August 2019.
-
Performance of Al-Mn Transition-Edge Sensor Bolometers in SPT-3G
Authors:
A. J. Anderson,
P. A. R. Ade,
Z. Ahmed,
J. S. Avva,
P. S. Barry,
R. Basu Thakur,
A. N. Bender,
B. A. Benson,
L. Bryant,
K. Byrum,
J. E. Carlstrom,
F. W. Carter,
T. W. Cecil,
C. L. Chang,
H. -M. Cho,
J. F. Cliche,
A. Cukierman,
T. de Haan,
E. V. Denison,
J. Ding,
M. A. Dobbs,
D. Dutcher,
W. Everett,
K. R. Ferguson,
A. Foster
, et al. (64 additional authors not shown)
Abstract:
SPT-3G is a polarization-sensitive receiver, installed on the South Pole Telescope, that measures the anisotropy of the cosmic microwave background (CMB) from degree to arcminute scales. The receiver consists of ten 150~mm-diameter detector wafers, containing a total of 16,000 transition-edge sensor (TES) bolometers observing at 95, 150, and 220 GHz. During the 2018-2019 austral summer, one of these detector wafers was replaced by a new wafer fabricated with Al-Mn TESs instead of the Ti/Au design originally deployed for SPT-3G. We present the results of in-lab characterization and on-sky performance of this Al-Mn wafer, including electrical and thermal properties, optical efficiency measurements, and noise-equivalent temperature. In addition, we discuss and account for several calibration-related systematic errors that affect measurements made using frequency-domain multiplexing readout electronics.
Submitted 27 July, 2019;
originally announced July 2019.
-
On-sky performance of the SPT-3G frequency-domain multiplexed readout
Authors:
A. N. Bender,
A. J. Anderson,
J. S. Avva,
P. A. R. Ade,
Z. Ahmed,
P. S. Barry,
R. Basu Thakur,
B. A. Benson,
L. Bryant,
K. Byrum,
J. E. Carlstrom,
F. W. Carter,
T. W. Cecil,
C. L. Chang,
H. -M. Cho,
J. F. Cliche,
A. Cukierman,
T. de Haan,
E. V. Denison,
J. Ding,
M. A. Dobbs,
D. Dutcher,
W. Everett,
K. R. Ferguson,
A. Foster
, et al. (64 additional authors not shown)
Abstract:
Frequency-domain multiplexing (fMux) is an established technique for the readout of large arrays of transition edge sensor (TES) bolometers. Each TES in a multiplexing module has a unique AC voltage bias that is selected by a resonant filter. This scheme enables the operation and readout of multiple bolometers on a single pair of wires, reducing thermal loading onto sub-Kelvin stages. The current receiver on the South Pole Telescope, SPT-3G, uses a 68x fMux system to operate its large-format camera of $\sim$16,000 TES bolometers. We present here the successful implementation and performance of the SPT-3G readout as measured on-sky. Characterization of the noise reveals a median pair-differenced 1/f knee frequency of 33 mHz, indicating that low-frequency noise in the readout will not limit SPT-3G's measurements of sky power on large angular scales. Measurements also show that the median readout white noise level in each of the SPT-3G observing bands is below the expectation for photon noise, demonstrating that SPT-3G is operating in the photon-noise-dominated regime.
Submitted 25 July, 2019;
originally announced July 2019.
-
Pairwise Link Prediction
Authors:
Huda Nassar,
Austin R. Benson,
David F. Gleich
Abstract:
Link prediction is a common problem in network science that transects many disciplines. The goal is to forecast the appearance of new links or to find links missing in the network. Typical methods for link prediction use the topology of the network to predict the most likely future or missing connections between a pair of nodes. However, network evolution is often mediated by higher-order structures involving more than pairs of nodes; for example, cliques on three nodes (also called triangles) are key to the structure of social networks, but the standard link prediction framework does not directly predict these structures. To address this gap, we propose a new link prediction task called "pairwise link prediction" that directly targets the prediction of new triangles, where one is tasked with finding which nodes are most likely to form a triangle with a given edge. We develop two PageRank-based methods for our pairwise link prediction problem and make natural extensions to existing link prediction methods. Our experiments on a variety of networks show that diffusion-based methods are less sensitive to the type of graphs used and more consistent in their results. We also show how our pairwise link prediction framework can be used to get better predictions within the context of standard link prediction evaluation.
Submitted 10 July, 2019;
originally announced July 2019.
-
Measuring Directed Triadic Closure with Closure Coefficients
Authors:
Hao Yin,
Austin R. Benson,
Johan Ugander
Abstract:
Recent work studying triadic closure in undirected graphs has drawn attention to the distinction between measures that focus on the "center" node of a wedge (i.e., length-2 path) vs. measures that focus on the "initiator," a distinction with considerable consequences. Existing measures in directed graphs, meanwhile, have all been center-focused. In this work, we propose a family of eight directed closure coefficients that measure the frequency of triadic closure in directed graphs from the perspective of the node initiating closure. The eight coefficients correspond to different labelled wedges, where the initiator and center nodes are labelled, and we observe dramatic empirical variation in these coefficients on real-world networks, even in cases when the induced directed triangles are isomorphic. To understand this phenomenon, we examine the theoretical behavior of our closure coefficients under a directed configuration model. Our analysis illustrates an underlying connection between the closure coefficients and moments of the joint in- and out-degree distributions of the network, offering an explanation of the observed asymmetries. We also use our directed closure coefficients as predictors in two machine learning tasks. We find interpretable models with AUC scores above 0.92 in class-balanced binary prediction, substantially outperforming models that use traditional center-focused measures.
Submitted 7 February, 2020; v1 submitted 25 May, 2019;
originally announced May 2019.
-
A Compact Millimeter-Wavelength Fourier-Transform Spectrometer
Authors:
Zhaodi Pan,
Mira Liu,
Ritoban Basu Thakur,
Bradford A. Benson,
Dale J. Fixsen,
Hazal Goksu,
Eleanor Rath,
Stephan S. Meyer
Abstract:
We have constructed a Fourier-transform spectrometer (FTS) operating between 50 and 330 GHz with minimum volume (355 x 260 x 64 mm) and weight (13 lbs) while maximizing optical throughput (100 $\mathrm{mm}^2$ sr) and optimizing the spectral resolution (4 GHz). This FTS is designed as a polarizing Martin-Puplett interferometer with unobstructed input and output in which both input polarizations undergo interference. The instrument construction is simple with mirrors milled on the box walls and one motorized stage as the single moving element. We characterize the performance of the FTS, compare the measurements to an optical simulation, and discuss features that relate to details of the FTS design. The simulation is also used to determine the tolerance of optical alignments for the required specifications. We detail the FTS mechanical design and provide the control software as well as the analysis code online.
Submitted 17 May, 2019;
originally announced May 2019.
-
Entropy: The former trouble with particles (including a new numerical model computational penalty for the Akaike information criterion)
Authors:
David A. Benson,
Stephen Pankavich,
Michael Schmidt,
Guillem Sole-Mari
Abstract:
Traditional random-walk particle-tracking (PT) models of advection and dispersion do not track entropy, because particle masses remain constant. Newer mass-transfer particle tracking (MTPT) models have the ability to do so because masses of all compounds may change along trajectories. Additionally, the probability mass functions (PMF) of these MTPT models may be compared to continuous solutions with probability density functions, when a consistent definition of entropy (or similarly, the dilution index) is constructed. This definition reveals that every numerical model incurs a computational entropy. Similar to Akaike's entropic penalty for larger numbers of adjustable parameters, the computational complexity of a model (e.g., number of nodes) adds to the entropy and, as such, must be penalized. The MTPT method can use a particle-collision based kernel or an SPH-derived adaptive kernel. The latter is more representative of a locally well-mixed system (i.e., one in which the dispersion tensor equally represents mixing and solute spreading), while the former better represents the separate processes of mixing versus spreading. We use computational means to demonstrate the viability of each of these methods.
Submitted 23 April, 2019;
originally announced May 2019.
-
Network interpolation
Authors:
Thomas Reeves,
Anil Damle,
Austin R. Benson
Abstract:
Given a set of snapshots from a temporal network we develop, analyze, and experimentally validate a so-called network interpolation scheme. Our method allows us to build a plausible, albeit random, sequence of graphs that transition between any two given graphs. Importantly, our model is well characterized by a Markov chain, and we leverage this representation to analytically estimate the hitting time (to a predefined distance to the target graph) and long term behavior of our model. These observations also serve to provide interpretation and justification for a rate parameter in our model. Lastly, through a mix of synthetic and real-world data experiments we demonstrate that our model builds reasonable graph trajectories between snapshots, as measured through various graph statistics. In these experiments, we find that our interpolation scheme compares favorably to common network growth models, such as preferential attachment and triadic closure.
Submitted 19 February, 2021; v1 submitted 3 May, 2019;
originally announced May 2019.
-
Modeling and Analysis of Tagging Networks in Stack Exchange Communities
Authors:
Xiang Fu,
Shangdi Yu,
Austin R. Benson
Abstract:
Large Question-and-Answer (Q&A) platforms support diverse knowledge curation on the Web. While researchers have studied user behavior on the platforms in a variety of contexts, there is relatively little insight into important by-products of user behavior that also encode knowledge. Here, we analyze and model the macroscopic structure of tags applied by users to annotate and catalog questions, using a collection of 168 Stack Exchange websites. We find striking similarity in tagging structure across these Stack Exchange communities, even though each community evolves independently (albeit under similar guidelines). Using our empirical findings, we develop a simple generative model that creates random bipartite graphs of tags and questions. Our model accounts for the tag frequency distribution but does not explicitly account for co-tagging correlations. Even under these constraints, we demonstrate empirically and theoretically that our model can reproduce a number of statistical properties of the co-tagging graph that links tags appearing in the same post.
Submitted 6 February, 2019;
originally announced February 2019.
-
Numerical Equivalence Between SPH and Probabilistic Mass Transfer Methods for Lagrangian Simulation of Dispersion
Authors:
Guillem Sole-Mari,
Michael J. Schmidt,
Stephen D. Pankavich,
David A. Benson
Abstract:
Several Lagrangian methodologies have been proposed in recent years to simulate advection-dispersion of solutes in fluids as a mass exchange between numerical particles carrying the fluid. In this paper, we unify these methodologies, showing that mass transfer particle tracking (MTPT) algorithms can be framed within the context of smoothed particle hydrodynamics (SPH), provided the choice of a Gaussian smoothing kernel whose bandwidth depends on the dispersion and the time discretization. Numerical simulations are performed for a simple dispersion problem, and they are compared to an analytical solution. Based on the results, we advocate for the use of a kernel bandwidth of the size of the characteristic dispersion length $\ell=\sqrt{2DΔt}$, at least given a "dense enough" distribution of particles, for in this case the mass transfer operation is not just an approximation, but in fact the exact solution, of the solute's displacement by dispersion in a time step.
Submitted 22 February, 2019; v1 submitted 21 December, 2018;
originally announced December 2018.
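As a hedged illustration of the kernel bandwidth advocated in the abstract above, the following Python sketch computes the characteristic dispersion length $\ell=\sqrt{2DΔt}$ and uses it as the standard deviation of a Gaussian kernel in one dimension. The function names, the 1-D setting, and the simple normalization of the weights are assumptions made for illustration; they are not taken from the paper.

```python
import math


def dispersion_length(D, dt):
    """Characteristic dispersion length ell = sqrt(2 * D * dt)."""
    return math.sqrt(2.0 * D * dt)


def gaussian_weights(positions, i, D, dt):
    """Normalized Gaussian kernel weights between particle i and all particles.

    Assumption: 1-D positions and weights normalized to sum to one; this is an
    illustrative choice, not the normalization used by any specific MTPT code.
    """
    ell = dispersion_length(D, dt)
    raw = [math.exp(-((x - positions[i]) ** 2) / (2.0 * ell ** 2)) for x in positions]
    total = sum(raw)
    return [w / total for w in raw]


# Hypothetical example: three particles, D = 0.5, dt = 1.0, so ell = 1.0.
weights = gaussian_weights([0.0, 1.0, -1.0], 0, 0.5, 1.0)
```

In this sketch, particles at equal distance from particle 0 receive equal weight, and the weights shrink as the separation grows relative to $\ell$.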
-
Link Prediction in Networks with Core-Fringe Data
Authors:
Austin R. Benson,
Jon Kleinberg
Abstract:
Data collection often involves the partial measurement of a larger system. A common example arises in collecting network data: we often obtain network datasets by recording all of the interactions among a small set of core nodes, so that we end up with a measurement of the network consisting of these core nodes along with a potentially much larger set of fringe nodes that have links to the core. Given the ubiquity of this process for assembling network data, it is crucial to understand the role of such a `core-fringe' structure.
Here we study how the inclusion of fringe nodes affects the standard task of network link prediction. One might initially think the inclusion of any additional data is useful, and hence that it should be beneficial to include all fringe nodes that are available. However, we find that this is not true; in fact, there is substantial variability in the value of the fringe nodes for prediction. Once an algorithm is selected, in some datasets, including any additional data from the fringe can actually hurt prediction performance; in other datasets, including some amount of fringe information is useful before prediction performance saturates or even declines; and in further cases, including the entire fringe leads to the best performance. While such variety might seem surprising, we show that these behaviors are exhibited by simple random graph models.
Submitted 5 March, 2019; v1 submitted 28 November, 2018;
originally announced November 2018.
-
Accelerating and parallelizing Lagrangian simulations of mixing-limited reactive transport
Authors:
Nicholas B. Engdahl,
Michael J. Schmidt,
David A. Benson
Abstract:
Recent advances in random-walk particle-tracking have enabled direct simulation of mixing and reactions on particles by allowing the particles to interact with each other using a multi-point mass transfer scheme. The mass transfer scheme allows separation of mixing and spreading processes, among other advantages, but it is computationally expensive because its speed depends on the number of interacting particle pairs. This note explores methods for relieving the computational bottleneck caused by the mass transfer step, and we use these algorithms to develop a new parallel, interacting particle model. The new model is a combination of a sparse search algorithm and a novel domain-decomposition scheme, both of which offer significant speedup relative to the reference case--even when they are executed serially. We combine the strengths of these methods to create a parallel particle scheme that is highly accurate and efficient with run times that scale as $1 / P$ for a fixed number of particles, where $P$ is the number of computational cores being used. The new parallel model is a significant advance because it enables efficient simulation of large particle ensembles that are needed for environmental simulations, and also because it can naturally pair with parallel geochemical solvers to create a practical Lagrangian tool for simulating mixing and reactions in complex chemical systems.
Submitted 7 February, 2019; v1 submitted 13 November, 2018;
originally announced November 2018.
-
Choosing to Grow a Graph: Modeling Network Formation as Discrete Choice
Authors:
Jan Overgoor,
Austin R. Benson,
Johan Ugander
Abstract:
We provide a framework for modeling social network formation through conditional multinomial logit models from discrete choice and random utility theory, in which each new edge is viewed as a "choice" made by a node to connect to another node, based on (generic) features of the other nodes available to make a connection. This perspective on network formation unifies existing models such as preferential attachment, triadic closure, and node fitness, which are all special cases, and thereby provides a flexible means for conceptualizing, estimating, and comparing models. The lens of discrete choice theory also provides several new tools for analyzing social network formation; for example, the significance of node features can be evaluated in a statistically rigorous manner, and mixtures of existing models can be estimated by adapting known expectation-maximization algorithms. We demonstrate the flexibility of our framework through examples that analyze a number of synthetic and real-world datasets. For example, we provide rigorous methods for estimating preferential attachment models and show how to separate the effects of preferential attachment and triadic closure. Non-parametric estimates of the importance of degree show a highly linear trend, and we expose the importance of looking carefully at nodes with degree zero. Examining the formation of a large citation graph, we find evidence for an increased role of degree when accounting for age.
Submitted 21 May, 2020; v1 submitted 12 November, 2018;
originally announced November 2018.
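To make the "edge as a choice" idea in the abstract above concrete, here is a minimal Python sketch of conditional multinomial logit choice probabilities, $P(j) = \exp(θ \cdot x_j) / \sum_k \exp(θ \cdot x_k)$. The function name and the single log-degree feature are assumptions chosen for illustration: with that feature and coefficient 1, the choice probability becomes proportional to degree, which is one way to see preferential attachment as a special case.

```python
import math


def choice_probabilities(features, theta):
    """Multinomial logit probabilities over a set of candidate nodes.

    features: one feature vector per candidate node (hypothetical inputs).
    theta: coefficient vector of the same length as each feature vector.
    """
    utilities = [sum(t * x for t, x in zip(theta, f)) for f in features]
    # Subtract the maximum utility for numerical stability.
    u_max = max(utilities)
    exp_u = [math.exp(u - u_max) for u in utilities]
    total = sum(exp_u)
    return [e / total for e in exp_u]


# Assumed example: three candidates with degrees 1, 2, and 5.
degrees = [1, 2, 5]
probs = choice_probabilities([[math.log(d)] for d in degrees], [1.0])
```

Here `probs` is proportional to `degrees`, so the candidate with degree 5 is chosen with probability 5/8.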
-
On the separate treatment of mixing and spreading by the reactive-particle-tracking algorithm: An example of accurate upscaling of reactive Poiseuille flow
Authors:
David A. Benson,
Diogo Bolster,
Stephen Pankavich
Abstract:
The Eulerian advection-dispersion-reaction equation (ADRE) suffers the well-known scale-effect of reduced apparent reaction rates between chemically dissimilar fluids at larger scales (or dimensional averaging). The dispersion tensor in the ADRE must equally and simultaneously account for both solute mixing and spreading. Recent reactive-particle-tracking (RPT) algorithms can, by separate mechanisms, simulate 1) smaller-scale mixing by inter-particle mass transfer, and 2) mass spreading by traditional random walks. To test the supposition that the RPT can accurately track these separate mechanisms, we upscale reactive transport in Hagen-Poiseuille flow between two plates. The simple upscaled 1-D RPT model with one velocity value, an upscaled Taylor macro-dispersivity, and the local molecular diffusion coefficient matches the results obtained from a detailed 2-D model with fully described velocity and diffusion. Both models use the same thermodynamic reaction rate, because the rate is not forced to absorb the loss of information upon upscaling. Analytic and semi-analytic upscaling is also performed using volume averaging and ensemble streamtube techniques. Volume averaging does not perform as well as the RPT, while ensemble streamtubes (using an effective dispersion coefficient along with macro-dispersion) perform almost exactly the same as RPT.
Submitted 23 October, 2018;
originally announced November 2018.
-
Design and characterization of the SPT-3G receiver
Authors:
J. A. Sobrin,
P. A. R. Ade,
Z. Ahmed,
A. J. Anderson,
J. S. Avva,
R. Basu Thakur,
A. N. Bender,
B. A. Benson,
J. E. Carlstrom,
F. W. Carter,
T. W. Cecil,
C. L. Chang,
J. F. Cliche,
A. Cukierman,
T. de Haan,
J. Ding,
M. A. Dobbs,
D. Dutcher,
W. Everett,
A. Foster,
J. Gallichio,
A. Gilbert,
J. C. Groh,
S. T. Guns,
N. W. Halverson
, et al. (46 additional authors not shown)
Abstract:
The SPT-3G receiver was commissioned in early 2017 on the 10-meter South Pole Telescope (SPT) to map anisotropies in the cosmic microwave background (CMB). New optics, detector, and readout technologies have yielded a multichroic, high-resolution, low-noise camera with impressive throughput and sensitivity, offering the potential to improve our understanding of inflationary physics, astroparticle physics, and growth of structure. We highlight several key features and design principles of the new receiver, and summarize its performance to date.
Submitted 31 August, 2018;
originally announced September 2018.
-
Broadband anti-reflective coatings for cosmic microwave background experiments
Authors:
A. Nadolski,
A. M. Kofman,
J. D. Vieira,
P. A. R. Ade,
Z. Ahmed,
A. J. Anderson,
J. S. Avva,
R. Basu Thakur,
A. N. Bender,
B. A. Benson,
J. E. Carlstrom,
F. W. Carter,
T. W. Cecil,
C. L. Chang,
J. F. Cliche,
A. Cukierman,
T. de Haan,
J. Ding,
M. A. Dobbs,
D. Dutcher,
W. Everett,
A. Foster,
J. Fu,
J. Gallicchio,
A. Gilbert
, et al. (49 additional authors not shown)
Abstract:
The desire for higher sensitivity has driven ground-based cosmic microwave background (CMB) experiments to employ ever larger focal planes, which in turn require larger reimaging optics. Practical limits to the maximum size of these optics motivate the development of quasi-optically-coupled (lenslet-coupled), multi-chroic detectors. These detectors can be sensitive across a broader bandwidth compared to waveguide-coupled detectors. However, the increase in bandwidth comes at a cost: the lenses (up to $\sim$700 mm diameter) and lenslets ($\sim$5 mm diameter, hemispherical lenses on the focal plane) used in these systems are made from high-refractive-index materials (such as silicon or amorphous aluminum oxide) that reflect nearly a third of the incident radiation. In order to maximize the faint CMB signal that reaches the detectors, the lenses and lenslets must be coated with an anti-reflective (AR) material. The AR coating must maximize radiation transmission in scientifically interesting bands and be cryogenically stable. Such a coating was developed for the third-generation camera, SPT-3G, of the South Pole Telescope (SPT) experiment, but the materials and techniques used in the development are general to AR coatings for mm-wave optics. The three-layer polytetrafluoroethylene-based AR coating is broadband, inexpensive, and can be manufactured with simple tools. The coating is field tested; AR-coated focal plane elements were deployed in the 2016-2017 austral summer and AR-coated reimaging optics were deployed in 2017-2018.
Submitted 31 August, 2018;
originally announced September 2018.
-
Random Spatial Network Models with Core-Periphery Structure
Authors:
Junteng Jia,
Austin R. Benson
Abstract:
Core-periphery structure is a common property of complex networks, which is a composition of tightly connected groups of core vertices and sparsely connected periphery vertices. This structure frequently emerges in traffic systems, biology, and social networks via underlying spatial positioning of the vertices. While core-periphery structure is ubiquitous, there have been limited attempts at modeling network data with this structure. Here, we develop a generative, random network model with core-periphery structure that jointly accounts for topological and spatial information by "core scores" of vertices. Our model achieves substantially higher likelihood than existing generative models of core-periphery structure, and we demonstrate how the core scores can be used in downstream data mining tasks, such as predicting airline traffic and classifying fungal networks. We also develop nearly linear time algorithms for learning model parameters and network sampling by using a method akin to the fast multipole method, a technique traditional to computational physics, which allow us to scale to networks with millions of vertices with minor tradeoffs in accuracy.
Submitted 27 May, 2019; v1 submitted 20 August, 2018;
originally announced August 2018.
-
Three hypergraph eigenvector centralities
Authors:
Austin R. Benson
Abstract:
Eigenvector centrality is a standard network analysis tool for determining the importance of (or ranking of) entities in a connected system that is represented by a graph. However, many complex systems and datasets have natural multi-way interactions that are more faithfully modeled by a hypergraph. Here we extend the notion of graph eigenvector centrality to uniform hypergraphs. Traditional graph eigenvector centralities are given by a positive eigenvector of the adjacency matrix, which is guaranteed to exist by the Perron-Frobenius theorem under some mild conditions. The natural representation of a hypergraph is a hypermatrix (colloquially, a tensor). Using recently established Perron-Frobenius theory for tensors, we develop three tensor eigenvector centralities for hypergraphs, each with different interpretations. We show that these centralities can reveal different information on real-world data by analyzing hypergraphs constructed from n-gram frequencies, co-tagging on Stack Exchange, and drug combinations observed in patient emergency room visits.
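As a rough illustration of the traditional graph case mentioned in this abstract (not the tensor centralities the paper develops), the positive eigenvector of an adjacency matrix can be approximated by power iteration. The sketch below is an assumption-laden toy: the function name, the dict-of-sets input format, and the identity shift used to avoid oscillation on bipartite graphs are all illustrative choices, not taken from the paper.

```python
def eigenvector_centrality(adj, iters=1000, tol=1e-12):
    """Approximate the positive eigenvector of A by power iteration on (A + I).

    adj: dict mapping node -> set of neighbours (hypothetical input format)
    for a connected, undirected graph. Adding the identity leaves the
    eigenvectors of A unchanged but prevents oscillation on bipartite graphs.
    """
    nodes = sorted(adj)
    x = {v: 1.0 for v in nodes}
    for _ in range(iters):
        # One multiplication by (A + I): own score plus neighbours' scores.
        y = {v: x[v] + sum(x[u] for u in adj[v]) for v in nodes}
        norm = sum(y.values())
        y = {v: val / norm for v, val in y.items()}  # scores sum to 1
        if max(abs(y[v] - x[v]) for v in nodes) < tol:
            return y
        x = y
    return x


# Toy graph: a triangle 0-1-2 with a pendant node 3 attached to node 0.
paw = {0: {1, 2, 3}, 1: {0, 2}, 2: {0, 1}, 3: {0}}
scores = eigenvector_centrality(paw)
print(max(scores, key=scores.get))  # node with the highest centrality
```

In this toy graph node 0 touches every other node, so it receives the largest score, and the pendant node 3 receives the smallest.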
Submitted 22 March, 2019; v1 submitted 25 July, 2018;
originally announced July 2018.
-
Random Walks on Simplicial Complexes and the normalized Hodge 1-Laplacian
Authors:
Michael T. Schaub,
Austin R. Benson,
Paul Horn,
Gabor Lippner,
Ali Jadbabaie
Abstract:
Focusing on coupling between edges, we generalize the relationship between the normalized graph Laplacian and random walks on graphs by devising an appropriate normalization for the Hodge Laplacian -- the generalization of the graph Laplacian for simplicial complexes -- and relate this to a random walk on edges. Importantly, these random walks are intimately connected to the topology of the simplicial complex, just as random walks on graphs are related to the topology of the graph. This serves as a foundational step towards incorporating Laplacian-based analytics for higher-order interactions. We demonstrate how to use these dynamics for data analytics that extract information about the edge-space of a simplicial complex that complements and extends graph-based analysis. Specifically, we use our normalized Hodge Laplacian to derive spectral embeddings for examining trajectory data of ocean drifters near Madagascar and also develop a generalization of personalized PageRank for the edge-space of simplicial complexes to analyze a book co-purchasing dataset.
Submitted 6 November, 2019; v1 submitted 13 July, 2018;
originally announced July 2018.
-
A Lagrangian Method for Reactive Transport with Solid/Aqueous Chemical Phase Interaction
Authors:
Michael J. Schmidt,
Stephen D. Pankavich,
Alexis Navarre-Sitchler,
David A. Benson
Abstract:
A significant drawback of Lagrangian (particle-tracking) reactive transport models has been their inability to properly simulate interactions between solid and liquid chemical phases, such as dissolution and precipitation reactions. This work addresses that problem by implementing a mass-transfer algorithm between mobile and immobile sets of particles that allows aqueous species of reactant that are undergoing transport to interact with stationary solid species. This mass-transfer algorithm is demonstrated to solve the diffusion equation and thus does not introduce any spurious mixing. The algorithm is capable of simulating an arbitrarily small level of diffusion, and can be combined with diffusive random walks to simulate the desired level of diffusion in a reactive transport system.
Submitted 7 February, 2019; v1 submitted 15 May, 2018;
originally announced May 2018.
-
Found Graph Data and Planted Vertex Covers
Authors:
Austin R. Benson,
Jon Kleinberg
Abstract:
A typical way in which network data is recorded is to measure all the interactions among a specified set of core nodes; this produces a graph containing this core together with a potentially larger set of fringe nodes that have links to the core. Interactions between pairs of nodes in the fringe, however, are not recorded by this process, and hence not present in the resulting graph data. For example, a phone service provider may only have records of calls in which at least one of the participants is a customer; this can include calls between a customer and a non-customer, but not between pairs of non-customers.
Knowledge of which nodes belong to the core is an important piece of metadata that is crucial for interpreting the network dataset. But in many cases, this metadata is not available, either because it has been lost due to difficulties in data provenance, or because the network consists of found data obtained in settings such as counter-surveillance. This leads to a natural algorithmic problem, namely the recovery of the core set. Since the core set forms a vertex cover of the graph, we essentially have a planted vertex cover problem, but with an arbitrary underlying graph. We develop a theoretical framework for analyzing this planted vertex cover problem, based on results in the theory of fixed-parameter tractability, together with algorithms for recovering the core. Our algorithms are fast, simple to implement, and outperform several methods based on network core-periphery structure on various real-world datasets.
Submitted 3 May, 2018;
originally announced May 2018.
-
On the accuracy of simulating mixing by random-walk particle-based mass-transfer algorithms
Authors:
Michael J. Schmidt,
Stephen D. Pankavich,
David A. Benson
Abstract:
Several algorithms have been used for mass transfer between particles undergoing advective and macro-dispersive random walks. The mass transfer between particles is required for general reactions on, and among, particles. The mass transfer is shown to be diffusive, and may be simulated using implicit, explicit, or mixed methods. All algorithms investigated are accurate to $\mathcal{O}(\Delta t)$. For $N$ particles, the implicit and semi-implicit methods require inverse matrix solutions and $\mathcal{O}(N^3)$ calculations. The explicit methods use forward matrix solves and require only $\mathcal{O}(N^2)$ calculations. Practically, this means that naive implementations with more than about 5,000 particles run more reliably using explicit methods.
Submitted 9 May, 2018; v1 submitted 6 March, 2018;
originally announced March 2018.
-
Simplicial Closure and higher-order link prediction
Authors:
Austin R. Benson,
Rediet Abebe,
Michael T. Schaub,
Ali Jadbabaie,
Jon Kleinberg
Abstract:
Networks provide a powerful formalism for modeling complex systems by using a model of pairwise interactions. But much of the structure within these systems involves interactions that take place among more than two nodes at once; for example, communication within a group rather than person-to-person, collaboration among a team rather than a pair of coauthors, or biological interaction between a set of molecules rather than just two. Such higher-order interactions are ubiquitous, but their empirical study has received limited attention, and little is known about possible organizational principles of such structures. Here we study the temporal evolution of 19 datasets with explicit accounting for higher-order interactions. We show that there is a rich variety of structure in our datasets but datasets from the same system types have consistent patterns of higher-order structure. Furthermore, we find that tie strength and edge density are competing positive indicators of higher-order organization, and these trends are consistent across interactions involving differing numbers of nodes. To systematically further the study of theories for such higher-order structures, we propose higher-order link prediction as a benchmark problem to assess models and algorithms that predict higher-order structure. We find fundamental differences from traditional pairwise link prediction, with a greater role for local rather than long-range information in predicting the appearance of new interactions.
Submitted 11 December, 2018; v1 submitted 19 February, 2018;
originally announced February 2018.
-
Tools for higher-order network analysis
Authors:
Austin R. Benson
Abstract:
Networks are a fundamental model of complex systems throughout the sciences, and network datasets are typically analyzed through lower-order connectivity patterns described at the level of individual nodes and edges. However, higher-order connectivity patterns captured by small subgraphs, also called network motifs, describe the fundamental structures that control and mediate the behavior of many complex systems. We develop three tools for network analysis that use higher-order connectivity patterns to gain new insights into network datasets: (1) a framework to cluster nodes into modules based on joint participation in network motifs; (2) a generalization of the clustering coefficient measurement to investigate higher-order closure patterns; and (3) a definition of network motifs for temporal networks and fast algorithms for counting them. Using these tools, we analyze data from biology, ecology, economics, neuroscience, online social networks, scientific collaborations, telecommunications, transportation, and the World Wide Web.
Submitted 19 February, 2018;
originally announced February 2018.
-
Higher-order clustering in networks
Authors:
Hao Yin,
Austin R. Benson,
Jure Leskovec
Abstract:
A fundamental property of complex networks is the tendency for edges to cluster. The extent of the clustering is typically quantified by the clustering coefficient, which is the probability that a length-2 path is closed, i.e., induces a triangle in the network. However, higher-order cliques beyond triangles are crucial to understanding complex networks, and the clustering behavior with respect to such higher-order network structures is not well understood. Here we introduce higher-order clustering coefficients that measure the closure probability of higher-order network cliques and provide a more comprehensive view of how the edges of complex networks cluster. Our higher-order clustering coefficients are a natural generalization of the traditional clustering coefficient. We derive several properties about higher-order clustering coefficients and analyze them under common random graph models. Finally, we use higher-order clustering coefficients to gain new insights into the structure of real-world networks from several domains.
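The traditional clustering coefficient described in this abstract, the probability that a length-2 path is closed into a triangle, can be computed directly by enumerating such paths. The sketch below covers only that classical quantity, not the higher-order coefficients the paper introduces; the function name and the dict-of-sets graph format are illustrative assumptions.

```python
from itertools import combinations


def global_clustering_coefficient(adj):
    """Fraction of length-2 paths u - v - w whose endpoints u and w are linked.

    adj: dict mapping node -> set of neighbours (hypothetical input format)
    for an undirected graph without self-loops.
    """
    wedges = 0
    closed = 0
    for center, nbrs in adj.items():
        # Every unordered pair of neighbours of `center` is one length-2 path.
        for u, w in combinations(sorted(nbrs), 2):
            wedges += 1
            if w in adj[u]:
                closed += 1
    return closed / wedges if wedges else 0.0


# Toy graph: a triangle 0-1-2 with a pendant node 3 attached to node 0.
paw = {0: {1, 2, 3}, 1: {0, 2}, 2: {0, 1}, 3: {0}}
print(global_clustering_coefficient(paw))  # 0.6
```

The toy graph has five length-2 paths, three of which lie on the single triangle, which gives 3/5.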
Submitted 4 January, 2018; v1 submitted 12 April, 2017;
originally announced April 2017.
-
Motifs in Temporal Networks
Authors:
Ashwin Paranjape,
Austin R. Benson,
Jure Leskovec
Abstract:
Networks are a fundamental tool for modeling complex systems in a variety of domains including social and communication networks as well as biology and neuroscience. Small subgraph patterns in networks, called network motifs, are crucial to understanding the structure and function of these systems. However, the role of network motifs in temporal networks, which contain many timestamped links between the nodes, is not yet well understood.
Here we develop a notion of a temporal network motif as an elementary unit of temporal networks and provide a general methodology for counting such motifs. We define temporal network motifs as induced subgraphs on sequences of temporal edges, design fast algorithms for counting temporal motifs, and prove their runtime complexity. Our fast algorithms achieve up to 56.5x speedup compared to a baseline method. Furthermore, we use our algorithms to count temporal motifs in a variety of networks. Results show that networks from different domains have significantly different motif counts, whereas networks from the same domain tend to have similar motif counts. We also find that different motifs occur at different time scales, which provides further insights into structure and function of temporal networks.
Submitted 29 December, 2016;
originally announced December 2016.
-
Higher-order organization of complex networks
Authors:
Austin R. Benson,
David F. Gleich,
Jure Leskovec
Abstract:
Networks are a fundamental tool for understanding and modeling complex systems in physics, biology, neuroscience, engineering, and social science. Many networks are known to exhibit rich, lower-order connectivity patterns that can be captured at the level of individual nodes and edges. However, higher-order organization of complex networks---at the level of small network subgraphs---remains largely unknown. Here we develop a generalized framework for clustering networks based on higher-order connectivity patterns. This framework provides mathematical guarantees on the optimality of obtained clusters and scales to networks with billions of edges. The framework reveals higher-order organization in a number of networks including information propagation units in neuronal networks and hub structure in transportation networks. Results show that networks exhibit rich higher-order organizational structures that are exposed by clustering based on higher-order connectivity patterns.
Submitted 26 December, 2016;
originally announced December 2016.
-
The Astropy Problem
Authors:
Demitri Muna,
Michael Alexander,
Alice Allen,
Richard Ashley,
Daniel Asmus,
Ruyman Azzollini,
Michele Bannister,
Rachael Beaton,
Andrew Benson,
G. Bruce Berriman,
Maciej Bilicki,
Peter Boyce,
Joanna Bridge,
Jan Cami,
Eryn Cangi,
Xian Chen,
Nicholas Christiny,
Christopher Clark,
Michelle Collins,
Johan Comparat,
Neil Cook,
Darren Croton,
Isak Delberth Davids,
Éric Depagne,
John Donor
, et al. (129 additional authors not shown)
Abstract:
The Astropy Project (http://astropy.org) is, in its own words, "a community effort to develop a single core package for Astronomy in Python and foster interoperability between Python astronomy packages." For five years this project has been managed, written, and operated as a grassroots, self-organized, almost entirely volunteer effort while the software is used by the majority of the astronomical community. Despite this, the project has always been and remains to this day effectively unfunded. Further, contributors receive little or no formal recognition for creating and supporting what is now critical software. This paper explores the problem in detail, outlines possible solutions to correct this, and presents a few suggestions on how to address the sustainability of general purpose astronomical software.
Submitted 10 October, 2016;
originally announced October 2016.
-
Tensor Spectral Clustering for Partitioning Higher-order Network Structures
Authors:
Austin R. Benson,
David F. Gleich,
Jure Leskovec
Abstract:
Spectral graph theory-based methods represent an important class of tools for studying the structure of networks. Spectral methods are based on a first-order Markov chain derived from a random walk on the graph and thus they cannot take advantage of important higher-order network substructures such as triangles, cycles, and feed-forward loops. Here we propose a Tensor Spectral Clustering (TSC) algorithm that allows for modeling higher-order network structures in a graph partitioning framework. Our TSC algorithm allows the user to specify which higher-order network structures (cycles, feed-forward loops, etc.) should be preserved by the network clustering. Higher-order network structures of interest are represented using a tensor, which we then partition by developing a multilinear spectral method. Our framework can be applied to discovering layered flows in networks as well as graph anomaly detection, which we illustrate on synthetic networks. In directed networks, a higher-order structure of particular interest is the directed 3-cycle, which captures feedback loops in networks. We demonstrate that our TSC algorithm produces large partitions that cut fewer directed 3-cycles than standard spectral clustering algorithms.
Submitted 17 February, 2015;
originally announced February 2015.
-
A Study of Al-Mn Transition Edge Sensor Engineering for Stability
Authors:
E. M. George,
J. E. Austermann,
J. A. Beall,
D. Becker,
B. A. Benson,
L. E. Bleem,
J. E. Carlstrom,
C. L. Chang,
H.-M. Cho,
A. T. Crites,
M. A. Dobbs,
W. Everett,
N. W. Halverson,
J. W. Henning,
G. C. Hilton,
W. L. Holzapfel,
J. Hubmayr,
K. D. Irwin,
D. Li,
M. Lueker,
J. J. McMahon,
J. Mehl,
J. Montgomery,
T. Natoli,
J. P. Nibarger
, et al. (10 additional authors not shown)
Abstract:
The stability of Al-Mn transition edge sensor (TES) bolometers is studied as we vary the engineered TES transition, heat capacity, and/or coupling between the heat capacity and TES. We present thermal structure measurements of each of the 39 designs tested. The data is accurately fit by a two-body bolometer model, which allows us to extract the basic TES parameters that affect device stability. We conclude that parameters affecting device stability can be engineered for optimal device operation, and present the model parameters extracted for the different TES designs.
Submitted 10 November, 2013;
originally announced November 2013.
-
Frequency Multiplexed SQUID Readout of Large Bolometer Arrays for Cosmic Microwave Background Measurements
Authors:
M. A. Dobbs,
M. Lueker,
K. A. Aird,
A. N. Bender,
B. A. Benson,
L. E. Bleem,
J. E. Carlstrom,
C. L. Chang,
H.-M. Cho,
J. Clarke,
T. M. Crawford,
A. T. Crites,
D. I. Flanigan,
T. de Haan,
E. M. George,
N. W. Halverson,
W. L. Holzapfel,
J. D. Hrubes,
B. R. Johnson,
J. Joseph,
R. Keisler,
J. Kennedy,
Z. Kermish,
T. M. Lanting,
A. T. Lee
, et al. (22 additional authors not shown)
Abstract:
A technological milestone for experiments employing Transition Edge Sensor (TES) bolometers operating at sub-kelvin temperature is the deployment of detector arrays with 100s--1000s of bolometers. One key technology for such arrays is readout multiplexing: the ability to read out many sensors simultaneously on the same set of wires. This paper describes a frequency-domain multiplexed readout system which has been developed for and deployed on the APEX-SZ and South Pole Telescope millimeter wavelength receivers. In this system, the detector array is divided into modules of seven detectors, and each bolometer within the module is biased with a unique ~MHz sinusoidal carrier such that the individual bolometer signals are well separated in frequency space. The currents from all bolometers in a module are summed together and pre-amplified with Superconducting Quantum Interference Devices (SQUIDs) operating at 4 K. Room-temperature electronics demodulate the carriers to recover the bolometer signals, which are digitized separately and stored to disk. This readout system contributes little noise relative to the detectors themselves, is remarkably insensitive to unwanted microphonic excitations, and provides a technology pathway to multiplexing larger numbers of sensors.
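The principle behind the readout described in this abstract, one carrier per bolometer, a summed signal on shared wiring, and demodulation to separate the channels again, can be sketched numerically. This is a toy model under stated assumptions, not the deployed system: the function names are hypothetical, the carriers are a few hertz rather than the ~MHz carriers the abstract mentions, each bolometer signal is held constant, and noise, SQUIDs, and digitization are ignored.

```python
import math


def multiplex(amplitudes, carriers_hz, n_samples, duration_s):
    """Sum one sinusoidal carrier per detector, scaled by that detector's signal."""
    ts = [duration_s * k / n_samples for k in range(n_samples)]
    summed = [
        sum(a * math.sin(2 * math.pi * f * t) for a, f in zip(amplitudes, carriers_hz))
        for t in ts
    ]
    return ts, summed


def demodulate(ts, summed, carrier_hz):
    """Recover one detector's amplitude by mixing with its carrier and averaging."""
    mixed = [s * math.sin(2 * math.pi * carrier_hz * t) for s, t in zip(summed, ts)]
    return 2.0 * sum(mixed) / len(mixed)


# One module of seven detectors; amplitudes and carrier frequencies are toy values.
amplitudes = [0.2, 0.4, 0.6, 0.8, 1.0, 1.2, 1.4]
carriers_hz = [3, 5, 7, 11, 13, 17, 19]
ts, summed = multiplex(amplitudes, carriers_hz, n_samples=1000, duration_s=1.0)
recovered = [demodulate(ts, summed, f) for f in carriers_hz]
print([round(r, 6) for r in recovered])
```

Because each toy carrier completes a whole number of cycles in the sampled window, the carriers are mutually orthogonal and every amplitude is recovered from the single summed signal.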
Submitted 17 July, 2012; v1 submitted 18 December, 2011;
originally announced December 2011.