-
Game Connectivity and Adaptive Dynamics
Authors:
Tom Johnston,
Michael Savery,
Alex Scott,
Bassel Tarbush
Abstract:
We analyse the typical structure of games in terms of the connectivity properties of their best-response graphs. Our central result shows that almost every game that is 'generic' (without indifferences) and has a pure Nash equilibrium and a 'large' number of players is connected, meaning that every action profile that is not a pure Nash equilibrium can reach every pure Nash equilibrium via best-response paths. This has important implications for dynamics in games. In particular, we show that there are simple, uncoupled, adaptive dynamics for which period-by-period play converges almost surely to a pure Nash equilibrium in almost every large generic game that has one (which contrasts with the known fact that there is no such dynamic that leads almost surely to a pure Nash equilibrium in every generic game that has one). We build on recent results in probabilistic combinatorics for our characterisation of game connectivity.
Submitted 3 November, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
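The connectivity notion in the abstract above can be illustrated on small games. The following is a minimal sketch, not the authors' code: it assumes a best-response graph whose directed edges move one player at a time to that player's best response, treats pure Nash equilibria as the vertices with no outgoing edge, and uses hypothetical helper names (`random_generic_game`, `best_response_graph`, `is_connected`).

```python
import itertools
import random


def random_generic_game(n_players, n_actions, seed=0):
    """Draw i.i.d. uniform payoffs; indifferences then occur with probability zero."""
    rng = random.Random(seed)
    profiles = list(itertools.product(range(n_actions), repeat=n_players))
    return [{p: rng.random() for p in profiles} for _ in range(n_players)]


def best_response_graph(payoff, n_players, n_actions):
    """Map each profile to the profiles reached when one player switches to a best response."""
    edges = {}
    for p in itertools.product(range(n_actions), repeat=n_players):
        out = []
        for i in range(n_players):
            # Player i's best action against the others' current actions
            # (unique when the game has no indifferences).
            best = max(range(n_actions),
                       key=lambda a: payoff[i][p[:i] + (a,) + p[i + 1:]])
            if best != p[i]:
                out.append(p[:i] + (best,) + p[i + 1:])
        edges[p] = out
    return edges


def reachable(edges, start):
    """Set of profiles reachable from start along best-response edges."""
    seen, stack = {start}, [start]
    while stack:
        for q in edges[stack.pop()]:
            if q not in seen:
                seen.add(q)
                stack.append(q)
    return seen


def is_connected(edges):
    """True if every non-equilibrium profile can reach every pure Nash equilibrium."""
    equilibria = {p for p, out in edges.items() if not out}
    if not equilibria:
        return False  # the property is only stated for games that have an equilibrium
    return all(equilibria <= reachable(edges, p) for p in edges if edges[p])
```

Enumerating all profiles is exponential in the number of players, so this is only practical for toy instances; the result in the abstract concerns the large-player regime.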
-
Molecular Communication for Quorum Sensing Inspired Cooperative Drug Delivery
Authors:
Yuting Fang,
Stuart T. Johnston,
Matt Faria,
Xinyu Huang,
Andrew W. Eckford,
Jamie Evans
Abstract:
A cooperative drug delivery system is proposed, where quorum sensing (QS), a density-dependent bacterial behavior coordination mechanism, is employed by synthetic bacterium-based nanomachines (B-NMs) for controllable drug delivery. In our proposed system, drug delivery is only triggered when there are enough QS molecules, which in turn only happens when there are enough B-NMs. This means the proposed system can be used to achieve a high release rate of drug molecules from a high number of B-NMs when the population density of B-NMs may not be known. Analytical expressions for i) the expected activation probability of the B-NM due to randomly-distributed B-NMs and ii) the expected aggregate absorption rate of drug molecules due to randomly-distributed QS-activated B-NMs are derived. Analytical results are verified by particle-based simulations. The derived results can help to predict and control the impact of environmental factors (e.g. diffusion coefficient and degradation rate) on the absorption rate of drug molecules since rigorous diffusion-based molecular channels are considered. Our results show that the activation probability at the B-NM increases as this B-NM is located closer to the center of the B-NM population and the aggregate absorption rate of the drug molecules non-linearly increases as the population density increases.
Submitted 14 February, 2023;
originally announced March 2023.
-
Kinetic Langevin MCMC Sampling Without Gradient Lipschitz Continuity -- the Strongly Convex Case
Authors:
Tim Johnston,
Iosif Lytras,
Sotirios Sabanis
Abstract:
In this article we consider sampling from log-concave distributions in the Hamiltonian setting, without assuming that the objective gradient is globally Lipschitz. We propose two algorithms based on monotone polygonal (tamed) Euler schemes to sample from a target measure, and provide non-asymptotic 2-Wasserstein distance bounds between the law of the process of each algorithm and the target measure. Finally, we apply these results to bound the excess risk optimization error of the associated optimization problem.
Submitted 19 January, 2023;
originally announced January 2023.
-
Shotgun assembly of random graphs
Authors:
Tom Johnston,
Gal Kronenberg,
Alexander Roberts,
Alex Scott
Abstract:
In the graph shotgun assembly problem, we are given the balls of radius $r$ around each vertex of a graph and asked to reconstruct the graph. We study the shotgun assembly of the Erdős-Rényi random graph $\mathcal G(n,p)$ from a wide range of values of $r$. We determine the threshold for reconstructibility for each $r\geq 3$, extending and improving substantially on results of Mossel and Ross for $r=3$. For $r=2$, we give upper and lower bounds that improve on results of Gaudio and Mossel by polynomial factors. We also give a sharpening of a result of Huang and Tikhomirov for $r=1$.
Submitted 3 June, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
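The input to the shotgun assembly problem described above can be sketched as follows. This is an illustrative sketch under stated assumptions: the ball of radius $r$ is taken to be the subgraph induced on vertices within distance $r$ of the centre, vertex labels are kept for simplicity (in the reconstruction problem the balls would typically be given only up to isomorphism), and the function names `gnp` and `ball` are hypothetical.

```python
import itertools
import random
from collections import deque


def gnp(n, p, seed=0):
    """Sample an Erdős-Rényi graph G(n, p) as an adjacency dictionary."""
    rng = random.Random(seed)
    adj = {v: set() for v in range(n)}
    for u, v in itertools.combinations(range(n), 2):
        if rng.random() < p:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def ball(adj, centre, r):
    """Return (vertices, edges) of the subgraph induced on vertices within distance r of centre."""
    dist = {centre: 0}
    queue = deque([centre])
    while queue:
        u = queue.popleft()
        if dist[u] == r:
            continue  # do not explore beyond radius r
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    verts = set(dist)
    # Keep every edge of the graph with both endpoints inside the ball.
    edges = {(u, w) for u in verts for w in adj[u] if w in verts and u < w}
    return verts, edges


def all_balls(adj, r):
    """The collection of radius-r balls, one per vertex."""
    return {v: ball(adj, v, r) for v in adj}
```

For example, `all_balls(gnp(50, 0.05), 2)` produces one ball per vertex of a sampled graph; the reconstruction question is whether such a collection determines the graph.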
-
Analysis of MC Systems Employing Receivers Covered by Heterogeneous Receptors
Authors:
Xinyu Huang,
Yuting Fang,
Stuart T. Johnston,
Matthew Faria,
Nan Yang,
Robert Schober
Abstract:
This paper investigates the channel impulse response (CIR), i.e., the molecule hitting rate, of a molecular communication (MC) system employing an absorbing receiver (RX) covered by multiple non-overlapping receptors. In this system, receptors are heterogeneous, i.e., they may have different sizes and arbitrary locations. Furthermore, we consider two types of transmitter (TX), namely a point TX and a membrane fusion (MF)-based spherical TX. We assume the point TX or the center of the MF-based TX has a fixed distance to the center of the RX. Given this fixed distance, the TX can be at different locations and the CIR of the RX depends on the exact location of the TX. By averaging over all possible TX locations, we analyze the expected molecule hitting rate at the RX as a function of the sizes and locations of the receptors, where we assume molecule degradation may occur during the propagation of the signaling molecules. Notably, our analysis is valid for different numbers, a wide range of sizes, and arbitrary locations of the receptors, and its accuracy is confirmed via particle-based simulations. Exploiting our numerical results, we show that the expected number of absorbed molecules at the RX increases with the number of receptors, when the total area on the RX surface covered by receptors is fixed. Based on the derived analytical expressions, we compare different geometric receptor distributions by examining the expected number of absorbed molecules at the RX. We show that evenly distributed receptors result in a larger number of absorbed molecules than other distributions. We further compare three models that combine different types of TXs and RXs.
Submitted 28 April, 2022;
originally announced April 2022.
-
Analysis of Receiver Covered by Heterogeneous Receptors in Molecular Communications
Authors:
Xinyu Huang,
Yuting Fang,
Stuart T. Johnston,
Matthew Faria,
Nan Yang,
Robert Schober
Abstract:
This paper analyzes the channel impulse response of an absorbing receiver (RX) covered by multiple non-overlapping heterogeneous receptors with different sizes and arbitrary locations in a molecular communication system. In this system, a point transmitter (TX) is assumed to be uniformly located on a virtual sphere at a fixed distance from the RX. Considering molecule degradation during the propagation from the TX to the RX, the expected molecule hitting rate at the RX over varying locations of the TX is analyzed as a function of the size and location of each receptor. Notably, this analytical result is applicable for different numbers, sizes, and locations of receptors, and its accuracy is demonstrated via particle-based simulations. Numerical results show that (i) the expected number of absorbed molecules at the RX increases with an increasing number of receptors, when the total area of receptors on the RX surface is fixed, and (ii) evenly distributed receptors lead to the largest expected number of absorbed molecules.
Submitted 15 February, 2022; v1 submitted 3 November, 2021;
originally announced November 2021.
-
DeepMerge II: Building Robust Deep Learning Algorithms for Merging Galaxy Identification Across Domains
Authors:
A. Ćiprijanović,
D. Kafkes,
K. Downey,
S. Jenkins,
G. N. Perdue,
S. Madireddy,
T. Johnston,
G. F. Snyder,
B. Nord
Abstract:
In astronomy, neural networks are often trained on simulation data with the prospect of being used on telescope observations. Unfortunately, training a model on simulation data and then applying it to instrument data leads to a substantial and potentially even detrimental decrease in model accuracy on the new target dataset. Simulated and instrument data represent different data domains, and for an algorithm to work in both, domain-invariant learning is necessary. Here we employ domain adaptation techniques, namely Maximum Mean Discrepancy (MMD) as an additional transfer loss and Domain Adversarial Neural Networks (DANNs), and demonstrate their viability to extract domain-invariant features within the astronomical context of classifying merging and non-merging galaxies. Additionally, we explore the use of Fisher loss and entropy minimization to enforce better in-domain class discriminability. We show that the addition of each domain adaptation technique improves the performance of a classifier when compared to conventional deep learning algorithms. We demonstrate this on two examples: between two Illustris-1 simulated datasets of distant merging galaxies, and between Illustris-1 simulated data of nearby merging galaxies and observed data from the Sloan Digital Sky Survey. The use of domain adaptation techniques in our experiments leads to an increase of target domain classification accuracy of up to ${\sim}20\%$. With further development, these techniques will allow astronomers to successfully implement neural network models trained on simulation data to efficiently detect and study astrophysical objects in current and future large-scale astronomical surveys.
Submitted 1 March, 2021;
originally announced March 2021.
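To make the MMD transfer loss mentioned in the abstract above concrete, here is a minimal sketch of a squared-MMD estimate between two samples of feature vectors. It is an illustration, not the authors' implementation: the Gaussian (RBF) kernel, the single fixed `bandwidth`, the biased estimator, and the function names are all assumptions, and a real training loop would compute this on network features inside a deep learning framework.

```python
import math


def rbf(x, y, bandwidth=1.0):
    """Gaussian kernel between two feature vectors (an assumed kernel choice)."""
    sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq / (2.0 * bandwidth ** 2))


def mmd_squared(source, target, bandwidth=1.0):
    """Biased estimate of squared MMD between source-domain and target-domain samples."""
    def mean_kernel(xs, ys):
        total = sum(rbf(x, y, bandwidth) for x in xs for y in ys)
        return total / (len(xs) * len(ys))

    # MMD^2 = E[k(s, s')] + E[k(t, t')] - 2 E[k(s, t)]
    return (mean_kernel(source, source)
            + mean_kernel(target, target)
            - 2.0 * mean_kernel(source, target))
```

The estimate is zero when the two samples coincide and grows as the feature distributions separate, which is why adding it to the classification loss pushes a network towards domain-invariant features.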
-
PEng4NN: An Accurate Performance Estimation Engine for Efficient Automated Neural Network Architecture Search
Authors:
Ariel Keller Rorabaugh,
Silvina Caíno-Lores,
Michael R. Wyatt II,
Travis Johnston,
Michela Taufer
Abstract:
Neural network (NN) models are increasingly used in scientific simulations, AI, and other high performance computing (HPC) fields to extract knowledge from datasets. Each dataset requires a tailored NN model architecture, but designing structures by hand is a time-consuming and error-prone process. Neural architecture search (NAS) automates the design of NN architectures. NAS attempts to find well-performing NN models for specialized datasets, where performance is measured by key metrics that capture the NN capabilities (e.g., accuracy of classification of samples in a dataset). Existing NAS methods are resource intensive, especially when searching for highly accurate models for larger and larger datasets.
To address this problem, we propose a performance estimation strategy that reduces the resources for training NNs and increases NAS throughput without jeopardizing accuracy. We implement our strategy via an engine called PEng4NN that plugs into existing NAS methods; in doing so, PEng4NN predicts the final accuracy of NNs early in the training process, informs the NAS of NN performance, and thus enables the NAS to terminate training NNs early. We assess our engine on three diverse datasets (i.e., CIFAR-100, Fashion MNIST, and SVHN). By reducing the training epochs needed, our engine achieves substantial throughput gain; on average, our engine saves 61% to 82% of training epochs, increasing throughput by a factor of 2.5 to 5 compared to a state-of-the-art NAS method. We achieve this gain without compromising accuracy, as we demonstrate with two key outcomes. First, across all our tests, between 74% and 97% of the ground truth best models lie in our set of predicted best models. Second, the accuracy distributions of the ground truth best models and our predicted best models are comparable, with the mean accuracy values differing by at most 0.7 percentage points across all tests.
Submitted 11 August, 2021; v1 submitted 11 January, 2021;
originally announced January 2021.
-
Domain adaptation techniques for improved cross-domain study of galaxy mergers
Authors:
A. Ćiprijanović,
D. Kafkes,
S. Jenkins,
K. Downey,
G. N. Perdue,
S. Madireddy,
T. Johnston,
B. Nord
Abstract:
In astronomy, neural networks are often trained on simulated data with the prospect of being applied to real observations. Unfortunately, simply training a deep neural network on images from one domain does not guarantee satisfactory performance on new images from a different domain. The ability to share cross-domain knowledge is the main advantage of modern deep domain adaptation techniques. Here we demonstrate the use of two techniques - Maximum Mean Discrepancy (MMD) and adversarial training with Domain Adversarial Neural Networks (DANN) - for the classification of distant galaxy mergers from the Illustris-1 simulation, where the two domains presented differ only due to inclusion of observational noise. We show how the addition of either MMD or adversarial training greatly improves the performance of the classifier on the target domain when compared to conventional machine learning algorithms, thereby demonstrating great promise for their use in astronomy.
Submitted 13 November, 2020; v1 submitted 6 November, 2020;
originally announced November 2020.
-
Multi-Objective Optimization for Size and Resilience of Spiking Neural Networks
Authors:
Mihaela Dimovska,
Travis Johnston,
Catherine D. Schuman,
J. Parker Mitchell,
Thomas E. Potok
Abstract:
Inspired by the connectivity mechanisms in the brain, neuromorphic computing architectures model Spiking Neural Networks (SNNs) in silicon. As such, neuromorphic architectures are designed and developed with the goal of having small, low power chips that can perform control and machine learning tasks. However, the power consumption of the developed hardware can greatly depend on the size of the network that is being evaluated on the chip. Furthermore, the accuracy of a trained SNN that is evaluated on chip can change due to voltage and current variations in the hardware that perturb the learned weights of the network. While efforts are made on the hardware side to minimize those perturbations, a software based strategy to make the deployed networks more resilient can help further alleviate that issue. In this work, we study Spiking Neural Networks in two neuromorphic architecture implementations with the goal of decreasing their size, while at the same time increasing their resiliency to hardware faults. We leverage an evolutionary algorithm to train the SNNs and propose a multiobjective fitness function to optimize the size and resiliency of the SNN. We demonstrate that this strategy leads to well-performing, small-sized networks that are more resilient to hardware faults.
Submitted 4 February, 2020;
originally announced February 2020.
-
Exascale Deep Learning to Accelerate Cancer Research
Authors:
Robert M. Patton,
J. Travis Johnston,
Steven R. Young,
Catherine D. Schuman,
Thomas E. Potok,
Derek C. Rose,
Seung-Hwan Lim,
Junghoon Chae,
Le Hou,
Shahira Abousamra,
Dimitris Samaras,
Joel Saltz
Abstract:
Deep learning, through the use of neural networks, has demonstrated remarkable ability to automate many routine tasks when presented with sufficient data for training. The neural network architecture (e.g. number of layers, types of layers, connections between layers, etc.) plays a critical role in determining what, if anything, the neural network is able to learn from the training data. The trend for neural network architectures, especially those trained on ImageNet, has been to grow ever deeper and more complex. The result has been ever increasing accuracy on benchmark datasets with the cost of increased computational demands. In this paper we demonstrate that neural network architectures can be automatically generated, tailored for a specific application, with dual objectives: accuracy of prediction and speed of prediction. Using MENNDL--an HPC-enabled software stack for neural architecture search--we generate a neural network with comparable accuracy to state-of-the-art networks on a cancer pathology dataset that is also $16\times$ faster at inference. The speedup in inference is necessary because of the volume and velocity of cancer pathology data; specifically, the previous state-of-the-art networks are too slow for individual researchers without access to HPC systems to keep pace with the rate of data generation. Our new model enables researchers with modest computational resources to analyze newly generated data faster than it is collected.
Submitted 26 September, 2019;
originally announced September 2019.
-
Lipschitz bijections between boolean functions
Authors:
Tom Johnston,
Alex Scott
Abstract:
We answer four questions from a recent paper of Rao and Shinkar on Lipschitz bijections between functions from $\{0,1\}^n$ to $\{0,1\}$. (1) We show that there is no $O(1)$-bi-Lipschitz bijection from $\mathrm{Dictator}$ to $\mathrm{XOR}$ such that each output bit depends on $O(1)$ input bits. (2) We give a construction for a mapping from $\mathrm{XOR}$ to $\mathrm{Majority}$ which has average stretch $O(\sqrt{n})$, matching a previously known lower bound. (3) We give a 3-Lipschitz embedding $φ: \{0,1\}^n \to \{0,1\}^{2n+1}$ such that $\mathrm{XOR}(x) = \mathrm{Majority}(φ(x))$ for all $x \in \{0,1\}^n$. (4) We show that with high probability there is an $O(1)$-bi-Lipschitz mapping from $\mathrm{Dictator}$ to a uniformly random balanced function.
Submitted 10 December, 2021; v1 submitted 21 December, 2018;
originally announced December 2018.
-
Cospectral mates for the union of some classes in the Johnson association scheme
Authors:
Sebastian M. Cioabă,
Willem H. Haemers,
Travis Johnston,
Matt McGinnis
Abstract:
Let $n\geq k\geq 2$ be two integers and $S$ a subset of $\{0,1,\dots,k-1\}$. The graph $J_{S}(n,k)$ has as vertices the $k$-subsets of the $n$-set $[n]=\{1,\dots,n\}$ and two $k$-subsets $A$ and $B$ are adjacent if $|A\cap B|\in S$. In this paper, we use Godsil-McKay switching to prove that for $m\geq 0$, $k\geq \max(m+2,3)$ and $S = \{0, 1, ..., m\}$, the graphs $J_S(3k-2m-1,k)$ are not determined by spectrum and for $m\geq 2$, $n\geq 4m+2$ and $S = \{0,1,...,m\}$ the graphs $J_{S}(n,2m+1)$ are not determined by spectrum. We also report some computational searches for Godsil-McKay switching sets in the union of classes in the Johnson scheme for $k\leq 5$.
Submitted 8 August, 2017; v1 submitted 30 January, 2017;
originally announced January 2017.
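The definition of $J_S(n,k)$ in the abstract above translates directly into a brute-force construction. The sketch below is only an illustration of that definition (the function name `johnson_union_graph` is hypothetical, and no Godsil-McKay switching is attempted): vertices are the $k$-subsets of $\{1,\dots,n\}$, and two distinct subsets are adjacent when the size of their intersection lies in $S$.

```python
import itertools


def johnson_union_graph(n, k, S):
    """Adjacency dictionary of J_S(n, k): k-subsets of [n], adjacent iff |A ∩ B| is in S."""
    vertices = [frozenset(c) for c in itertools.combinations(range(1, n + 1), k)]
    adj = {v: set() for v in vertices}
    # Distinct k-subsets intersect in at most k-1 elements, matching S ⊆ {0, ..., k-1}.
    for a, b in itertools.combinations(vertices, 2):
        if len(a & b) in S:
            adj[a].add(b)
            adj[b].add(a)
    return adj
```

With $S = \{0\}$ this gives a Kneser graph (for $n=5$, $k=2$, the Petersen graph), and with $S = \{k-1\}$ the usual Johnson graph; the adjacency dictionary can then be converted to a matrix for spectral computations in a numerical library of choice.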
-
In-Situ Data Analysis of Protein Folding Trajectories
Authors:
Travis Johnston,
Boyu Zhang,
Adam Liwo,
Silvia Crivelli,
Michela Taufer
Abstract:
The transition from petascale to exascale computers is characterized by substantial changes in the computer architectures and technologies. The research community relying on computational simulations is being forced to revisit the algorithms for data generation and analysis due to various concerns, such as higher degrees of concurrency, deeper memory hierarchies, substantial I/O and communication constraints. Simulations today typically save all data to analyze later. Simulations at the exascale will require us to analyze data as it is generated and save only what is really needed for analysis, which must be performed predominately in-situ, i.e., executed sufficiently fast locally, limiting memory and disk usage, and avoiding the need to move large data across nodes.
In this paper, we present a distributed method that enables in-situ data analysis for large protein folding trajectory datasets. Traditional trajectory analysis methods currently follow a centralized approach that moves the trajectory datasets to a centralized node and processes the data only after simulations have been completed. Our method, on the other hand, captures conformational information in-situ using local data only while reducing the storage space needed for the part of the trajectory under consideration. This method processes the input trajectory data in one pass, breaks from the centralized approach of traditional analysis, avoids the movement of trajectory data, and still builds the global knowledge on the formation of individual $α$-helices or $β$-strands as trajectory frames are generated.
Submitted 30 October, 2015; v1 submitted 29 October, 2015;
originally announced October 2015.
-
Random Voting Effects in Social-Digital Spaces: A case study of Reddit Post Submissions
Authors:
Maria Glenski,
Thomas J. Johnston,
Tim Weninger
Abstract:
At a time when information seekers first turn to digital sources for news and opinion, it is critical that we understand the role that social media plays in human behavior. This is especially true when information consumers also act as information producers and editors by their online activity. In order to better understand the effects that editorial ratings have on online human behavior, we report the results of a large-scale in-vivo experiment in social media. We find that small, random rating manipulations on social media submissions created significant changes in downstream ratings resulting in significantly different final outcomes. Positive treatment resulted in a positive effect that increased the final rating by 11.02% on average. Compared to the control group, positive treatment also increased the probability of reaching a high rating (>=2000) by 24.6%. Contrary to the results of related work we also find that negative treatment resulted in a negative effect that decreased the final rating by 5.15% on average.
Submitted 5 June, 2015;
originally announced June 2015.