-
Thermodynamic Transferability in Coarse-Grained Force Fields using Graph Neural Networks
Authors:
Emily Shinkle,
Aleksandra Pachalieva,
Riti Bahl,
Sakib Matin,
Brendan Gifford,
Galen T. Craven,
Nicholas Lubbers
Abstract:
Coarse-graining is a molecular modeling technique in which an atomistic system is represented in a simplified fashion that retains the most significant system features that contribute to a target output, while removing the degrees of freedom that are less relevant. This reduction in model complexity allows coarse-grained molecular simulations to reach increased spatial and temporal scales compared…
▽ More
Coarse-graining is a molecular modeling technique in which an atomistic system is represented in a simplified fashion that retains the most significant system features that contribute to a target output, while removing the degrees of freedom that are less relevant. This reduction in model complexity allows coarse-grained molecular simulations to reach increased spatial and temporal scales compared to corresponding all-atom models. A core challenge in coarse-graining is to construct a force field that represents the interactions in the new representation in a way that preserves the atomistic-level properties. Many approaches to building coarse-grained force fields have limited transferability between different thermodynamic conditions as a result of averaging over internal fluctuations at a specific thermodynamic state point. Here, we use a graph-convolutional neural network architecture, the Hierarchically Interacting Particle Neural Network with Tensor Sensitivity (HIP-NN-TS), to develop a highly automated training pipeline for coarse grained force fields which allows for studying the transferability of coarse-grained models based on the force-matching approach. We show that this approach not only yields highly accurate force fields, but also that these force fields are more transferable through a variety of thermodynamic conditions. These results illustrate the potential of machine learning techniques such as graph neural networks to improve the construction of transferable coarse-grained force fields.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Machine learning potentials with Iterative Boltzmann Inversion: training to experiment
Authors:
Sakib Matin,
Alice Allen,
Justin S. Smith,
Nicholas Lubbers,
Ryan B. Jadrich,
Richard A. Messerly,
Benjamin T. Nebgen,
Ying Wai Li,
Sergei Tretiak,
Kipton Barros
Abstract:
Methodologies for training machine learning potentials (MLPs) to quantum-mechanical simulation data have recently seen tremendous progress. Experimental data has a very different character than simulated data, and most MLP training procedures cannot be easily adapted to incorporate both types of data into the training process. We investigate a training procedure based on Iterative Boltzmann Invers…
▽ More
Methodologies for training machine learning potentials (MLPs) to quantum-mechanical simulation data have recently seen tremendous progress. Experimental data has a very different character than simulated data, and most MLP training procedures cannot be easily adapted to incorporate both types of data into the training process. We investigate a training procedure based on Iterative Boltzmann Inversion that produces a pair potential correction to an existing MLP, using equilibrium radial distribution function data. By applying these corrections to a MLP for pure aluminum based on Density Functional Theory, we observe that the resulting model largely addresses previous overstructuring in the melt phase. Interestingly, the corrected MLP also exhibits improved performance in predicting experimental diffusion constants, which are not included in the training procedure. The presented method does not require auto-differentiating through a molecular dynamics solver, and does not make assumptions about the MLP architecture. The results suggest a practical framework of incorporating experimental data into machine learning models to improve accuracy of molecular dynamics simulations.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Learning Together: Towards foundational models for machine learning interatomic potentials with meta-learning
Authors:
Alice E. A. Allen,
Nicholas Lubbers,
Sakib Matin,
Justin Smith,
Richard Messerly,
Sergei Tretiak,
Kipton Barros
Abstract:
The development of machine learning models has led to an abundance of datasets containing quantum mechanical (QM) calculations for molecular and material systems. However, traditional training methods for machine learning models are unable to leverage the plethora of data available as they require that each dataset be generated using the same QM method. Taking machine learning interatomic potentia…
▽ More
The development of machine learning models has led to an abundance of datasets containing quantum mechanical (QM) calculations for molecular and material systems. However, traditional training methods for machine learning models are unable to leverage the plethora of data available as they require that each dataset be generated using the same QM method. Taking machine learning interatomic potentials (MLIPs) as an example, we show that meta-learning techniques, a recent advancement from the machine learning community, can be used to fit multiple levels of QM theory in the same training process. Meta-learning changes the training procedure to learn a representation that can be easily re-trained to new tasks with small amounts of data. We then demonstrate that meta-learning enables simultaneously training to multiple large organic molecule datasets. As a proof of concept, we examine the performance of a MLIP refit to a small drug-like molecule and show that pre-training potentials to multiple levels of theory with meta-learning improves performance. This difference in performance can be seen both in the reduced error and in the improved smoothness of the potential energy surface produced. We therefore show that meta-learning can utilize existing datasets with inconsistent QM levels of theory to produce models that are better at specializing to new datasets. This opens new routes for creating pre-trained, foundational models for interatomic potentials.
△ Less
Submitted 8 July, 2023;
originally announced July 2023.
-
Cluster Scaling and Critical Points: A Cautionary Tale
Authors:
W. Klein,
Harvey Gould,
Sakib Matin
Abstract:
Many systems in nature are conjectured to exist at a critical point, including the brain and earthquake faults. The primary reason for this conjecture is that the distribution of clusters (avalanches of firing neurons in the brain or regions of slip in earthquake faults) can be described by a power law. Because there are other mechanisms such as $1/f$ noise that can produce power laws, other crite…
▽ More
Many systems in nature are conjectured to exist at a critical point, including the brain and earthquake faults. The primary reason for this conjecture is that the distribution of clusters (avalanches of firing neurons in the brain or regions of slip in earthquake faults) can be described by a power law. Because there are other mechanisms such as $1/f$ noise that can produce power laws, other criteria that the cluster critical exponents must satisfy can be used to conclude whether or not the observed power law behavior indicates an underlying critical point rather than an alternate mechanism. We show how a possible misinterpretation of the cluster scaling data can lead to incorrectly conclude that the measured critical exponents do not satisfy these criteria. Examples of the possible misinterpretation of the data for one-dimensional random site percolation and the one-dimensional Ising model are presented. We stress that the interpretation of a power law cluster distribution indicating the presence of a critical point is subtle, and its misinterpretation might lead to the abandonment of a promising area of research.
△ Less
Submitted 21 March, 2023; v1 submitted 8 July, 2022;
originally announced July 2022.
-
Scaling of causal neural avalanches in a neutral model
Authors:
Sakib Matin,
Thomas Tenzin,
W. Klein
Abstract:
Neural avalanches are collective firings of neurons that exhibit emergent scale-free behavior. Understanding the nature and distribution of these avalanches is an important element in understanding how the brain functions. We study a model of neural avalanches for which the dynamics are governed by neutral theory. The neural avalanches are defined using causal connections between the firing neuron…
▽ More
Neural avalanches are collective firings of neurons that exhibit emergent scale-free behavior. Understanding the nature and distribution of these avalanches is an important element in understanding how the brain functions. We study a model of neural avalanches for which the dynamics are governed by neutral theory. The neural avalanches are defined using causal connections between the firing neurons. We analyze the scaling of causal neural avalanches as the critical point is approached from the absorbing phase. By using cluster analysis tools from percolation theory, we characterize the critical properties of the neural avalanches. We identify the tuning parameters consistent with experiments. The scaling hypothesis provides a unified explanation of the power laws which characterize the critical point. The critical exponents characterizing the avalanche distributions and divergence of the response functions are consistent with the predictions of the scaling hypothesis. We use a universal scaling function for the avalanche profile to find that the firing rates for avalanches of different durations show data collapse after appropriate rescaling. We also find data collapse for the avalanche distribution functions, which is stronger evidence of criticality than just the existence of power laws. Critical slowing-down and power law relaxation of avalanches is observed as the system is tuned to its critical point. We discuss how our results motivate future empirical studies of criticality in the brain.
△ Less
Submitted 14 January, 2021; v1 submitted 15 September, 2019;
originally announced September 2019.
-
Prediction in a driven-dissipative system displaying a continuous phase transition
Authors:
Chon-Kit Pun,
Sakib Matin,
W. Klein,
Harvey Gould
Abstract:
Prediction in complex systems at criticality is believed to be very difficult, if not impossible. Of particular interest is whether earthquakes, whose distribution follows a power law (Gutenberg-Richter) distribution, are in principle unpredictable. We study the predictability of event sizes in the Olmai-Feder-Christensen model at different proximities to criticality using a convolutional neural n…
▽ More
Prediction in complex systems at criticality is believed to be very difficult, if not impossible. Of particular interest is whether earthquakes, whose distribution follows a power law (Gutenberg-Richter) distribution, are in principle unpredictable. We study the predictability of event sizes in the Olmai-Feder-Christensen model at different proximities to criticality using a convolutional neural network. The distribution of event sizes satisfies a power law with a cutoff for large events. We find that prediction decreases as criticality is approached and that prediction is possible only for large, non-scaling events. Our results suggest that earthquake faults that satisfy Gutenberg-Richter scaling are difficult to forecast.
△ Less
Submitted 26 July, 2019;
originally announced July 2019.
-
Boomerang: Rebounding the Consequences of Reputation Feedback on Crowdsourcing Platforms
Authors:
Snehalkumar,
S. Gaikwad,
Durim Morina,
Adam Ginzberg,
Catherine Mullings,
Shirish Goyal,
Dilrukshi Gamage,
Christopher Diemert,
Mathias Burton,
Sharon Zhou,
Mark Whiting,
Karolina Ziulkoski,
Alipta Ballav,
Aaron Gilbee,
Senadhipathige S. Niranga,
Vibhor Sehgal,
Jasmine Lin,
Leonardy Kristianto,
Angela Richmond-Fuller,
Jeff Regino,
Nalin Chhibber,
Dinesh Majeti,
Sachin Sharma,
Kamila Mananova,
Dinesh Dhakal
, et al. (13 additional authors not shown)
Abstract:
Paid crowdsourcing platforms suffer from low-quality work and unfair rejections, but paradoxically, most workers and requesters have high reputation scores. These inflated scores, which make high-quality work and workers difficult to find, stem from social pressure to avoid giving negative feedback. We introduce Boomerang, a reputation system for crowdsourcing that elicits more accurate feedback b…
▽ More
Paid crowdsourcing platforms suffer from low-quality work and unfair rejections, but paradoxically, most workers and requesters have high reputation scores. These inflated scores, which make high-quality work and workers difficult to find, stem from social pressure to avoid giving negative feedback. We introduce Boomerang, a reputation system for crowdsourcing that elicits more accurate feedback by rebounding the consequences of feedback directly back onto the person who gave it. With Boomerang, requesters find that their highly-rated workers gain earliest access to their future tasks, and workers find tasks from their highly-rated requesters at the top of their task feed. Field experiments verify that Boomerang causes both workers and requesters to provide feedback that is more closely aligned with their private opinions. Inspired by a game-theoretic notion of incentive-compatibility, Boomerang opens opportunities for interaction design to incentivize honest reporting over strategic dishonesty.
△ Less
Submitted 14 April, 2019;
originally announced April 2019.
-
Novel effective ergodicity breaking phase transition in a driven-dissipative system
Authors:
Sakib Matin,
Chon-Kit Pun,
Harvey Gould,
W. Klein
Abstract:
We show that the Olami-Feder-Christensen model exhibits an effective ergodicity breaking transition as the noise is varied. Above the critical noise, the average stress on each site converges to the global average. Below the critical noise, the stress on individual sites becomes trapped in different limit cycles. We use ideas from the study of dynamical systems and compute recurrence plots and the…
▽ More
We show that the Olami-Feder-Christensen model exhibits an effective ergodicity breaking transition as the noise is varied. Above the critical noise, the average stress on each site converges to the global average. Below the critical noise, the stress on individual sites becomes trapped in different limit cycles. We use ideas from the study of dynamical systems and compute recurrence plots and the recurrence rate. We identify the order parameter as the recurrence rate averaged over all sites and find numerical evidence that the transition can be characterized by exponents that are consistent with hyperscaling.
△ Less
Submitted 28 July, 2019; v1 submitted 29 March, 2019;
originally announced March 2019.
-
Genetic drift in range expansions is very sensitive to density feedback in dispersal and growth
Authors:
Gabriel Birzu,
Sakib Matin,
Oskar Hallatschek,
Kirill S. Korolev
Abstract:
Theory predicts rapid genetic drift during invasions, yet many expanding populations maintain high genetic diversity. We find that genetic drift is dramatically suppressed when dispersal rates increase with the population density because many more migrants from the diverse, high-density regions arrive at the expansion edge. When density-dependence is weak or negative, the effective population size…
▽ More
Theory predicts rapid genetic drift during invasions, yet many expanding populations maintain high genetic diversity. We find that genetic drift is dramatically suppressed when dispersal rates increase with the population density because many more migrants from the diverse, high-density regions arrive at the expansion edge. When density-dependence is weak or negative, the effective population size of the front scales only logarithmically with the carrying capacity. The dependence, however, switches to a sublinear power law and then to a linear increase as the density-dependence becomes strongly positive. We develop a unified framework revealing that the transitions between different regimes of diversity loss are controlled by a single, universal parameter: the ratio of the expansion velocity to the geometric mean of dispersal and growth rates at expansion edge. Our results suggest that positive density-dependence could dramatically alter evolution in expanding populations even when its contributions to the expansion velocity is small.
△ Less
Submitted 27 March, 2019;
originally announced March 2019.
-
Pinned, locked, pushed, and pulled traveling waves in structured environments
Authors:
Ching-Hao Wang,
Sakib Matin,
Ashish B. George,
Kirill S. Korolev
Abstract:
Traveling fronts describe the transition between two alternative states in a great number of physical and biological systems. Examples include the spread of beneficial mutations, chemical reactions, and the invasions by foreign species. In homogeneous environments, the alternative states are separated by a smooth front moving at a constant velocity. This simple picture can break down in structured…
▽ More
Traveling fronts describe the transition between two alternative states in a great number of physical and biological systems. Examples include the spread of beneficial mutations, chemical reactions, and the invasions by foreign species. In homogeneous environments, the alternative states are separated by a smooth front moving at a constant velocity. This simple picture can break down in structured environments such as tissues, patchy landscapes, and microfluidic devices. Habitat fragmentation can pin the front at a particular location or lock invasion velocities into specific values. Locked velocities are not sensitive to moderate changes in dispersal or growth and are determined by the spatial and temporal periodicity of the environment. The synchronization with the environment results in discontinuous fronts that propagate as periodic pulses. We characterize the transition from continuous to locked invasions and show that it is controlled by positive density-dependence in dispersal or growth. We also demonstrate that velocity locking is robust to demographic and environmental fluctuations and examine stochastic dynamics and evolution in locked invasions.
△ Less
Submitted 10 July, 2020; v1 submitted 6 June, 2018;
originally announced June 2018.
-
Universal fluctuations in growth dynamics of economic systems
Authors:
Nathan C. Frey,
Sakib Matin,
H. Eugene Stanley,
Michael Salinger
Abstract:
The growth of business firms is an example of a system of complex interacting units that resembles complex interacting systems in nature such as earthquakes. Remarkably, work in econophysics has provided evidence that the statistical properties of the growth of business firms follow the same sorts of power laws that characterize physical systems near their critical points. Given how economies chan…
▽ More
The growth of business firms is an example of a system of complex interacting units that resembles complex interacting systems in nature such as earthquakes. Remarkably, work in econophysics has provided evidence that the statistical properties of the growth of business firms follow the same sorts of power laws that characterize physical systems near their critical points. Given how economies change over time, whether these statistical properties are persistent, robust, and universal like those of physical systems remains an open question. Here, we show that the scaling properties of firm growth previously demonstrated for publicly-traded U.S. manufacturing firms from 1974 to 1993 apply to the same sorts of firms from 1993 to 2015, to firms in other broad sectors (such as materials), and to firms in new sectors (such as Internet services). We measure virtually the same scaling exponent for manufacturing for the 1993 to 2015 period as for the 1974 to 1993 period and virtually the same scaling exponent for other sectors as for manufacturing. Furthermore, we show that fluctuations of the growth rate for new industries self-organize into a power law over relatively short time scales.
△ Less
Submitted 21 May, 2018; v1 submitted 5 December, 2017;
originally announced December 2017.
-
Prototype Tasks: Improving Crowdsourcing Results through Rapid, Iterative Task Design
Authors:
Snehalkumar "Neil" S. Gaikwad,
Nalin Chhibber,
Vibhor Sehgal,
Alipta Ballav,
Catherine Mullings,
Ahmed Nasser,
Angela Richmond-Fuller,
Aaron Gilbee,
Dilrukshi Gamage,
Mark Whiting,
Sharon Zhou,
Sekandar Matin,
Senadhipathige Niranga,
Shirish Goyal,
Dinesh Majeti,
Preethi Srinivas,
Adam Ginzberg,
Kamila Mananova,
Karolina Ziulkoski,
Jeff Regino,
Tejas Sarma,
Akshansh Sinha,
Abhratanu Paul,
Christopher Diemert,
Mahesh Murag
, et al. (4 additional authors not shown)
Abstract:
Low-quality results have been a long-standing problem on microtask crowdsourcing platforms, driving away requesters and justifying low wages for workers. To date, workers have been blamed for low-quality results: they are said to make as little effort as possible, do not pay attention to detail, and lack expertise. In this paper, we hypothesize that requesters may also be responsible for low-quali…
▽ More
Low-quality results have been a long-standing problem on microtask crowdsourcing platforms, driving away requesters and justifying low wages for workers. To date, workers have been blamed for low-quality results: they are said to make as little effort as possible, do not pay attention to detail, and lack expertise. In this paper, we hypothesize that requesters may also be responsible for low-quality work: they launch unclear task designs that confuse even earnest workers, under-specify edge cases, and neglect to include examples. We introduce prototype tasks, a crowdsourcing strategy requiring all new task designs to launch a small number of sample tasks. Workers attempt these tasks and leave feedback, enabling the re- quester to iterate on the design before publishing it. We report a field experiment in which tasks that underwent prototype task iteration produced higher-quality work results than the original task designs. With this research, we suggest that a simple and rapid iteration cycle can improve crowd work, and we provide empirical evidence that requester "quality" directly impacts result quality.
△ Less
Submitted 18 July, 2017;
originally announced July 2017.
-
Crowd Guilds: Worker-led Reputation and Feedback on Crowdsourcing Platforms
Authors:
Mark E. Whiting,
Dilrukshi Gamage,
Snehalkumar S. Gaikwad,
Aaron Gilbee,
Shirish Goyal,
Alipta Ballav,
Dinesh Majeti,
Nalin Chhibber,
Angela Richmond-Fuller,
Freddie Vargus,
Tejas Seshadri Sarma,
Varshine Chandrakanthan,
Teogenes Moura,
Mohamed Hashim Salih,
Gabriel Bayomi Tinoco Kalejaiye,
Adam Ginzberg,
Catherine A. Mullings,
Yoni Dayan,
Kristy Milland,
Henrique Orefice,
Jeff Regino,
Sayna Parsi,
Kunz Mainali,
Vibhor Sehgal,
Sekandar Matin
, et al. (3 additional authors not shown)
Abstract:
Crowd workers are distributed and decentralized. While decentralization is designed to utilize independent judgment to promote high-quality results, it paradoxically undercuts behaviors and institutions that are critical to high-quality work. Reputation is one central example: crowdsourcing systems depend on reputation scores from decentralized workers and requesters, but these scores are notoriou…
▽ More
Crowd workers are distributed and decentralized. While decentralization is designed to utilize independent judgment to promote high-quality results, it paradoxically undercuts behaviors and institutions that are critical to high-quality work. Reputation is one central example: crowdsourcing systems depend on reputation scores from decentralized workers and requesters, but these scores are notoriously inflated and uninformative. In this paper, we draw inspiration from historical worker guilds (e.g., in the silk trade) to design and implement crowd guilds: centralized groups of crowd workers who collectively certify each other's quality through double-blind peer assessment. A two-week field experiment compared crowd guilds to a traditional decentralized crowd work model. Crowd guilds produced reputation signals more strongly correlated with ground-truth worker quality than signals available on current crowd working platforms, and more accurate than in the traditional model.
△ Less
Submitted 28 February, 2017; v1 submitted 4 November, 2016;
originally announced November 2016.