Search | arXiv e-print repository

Thermodynamic Transferability in Coarse-Grained Force Fields using Graph Neural Networks

Authors: Emily Shinkle, Aleksandra Pachalieva, Riti Bahl, Sakib Matin, Brendan Gifford, Galen T. Craven, Nicholas Lubbers

Abstract: Coarse-graining is a molecular modeling technique in which an atomistic system is represented in a simplified fashion that retains the most significant system features that contribute to a target output, while removing the degrees of freedom that are less relevant. This reduction in model complexity allows coarse-grained molecular simulations to reach increased spatial and temporal scales compared… ▽ More Coarse-graining is a molecular modeling technique in which an atomistic system is represented in a simplified fashion that retains the most significant system features that contribute to a target output, while removing the degrees of freedom that are less relevant. This reduction in model complexity allows coarse-grained molecular simulations to reach increased spatial and temporal scales compared to corresponding all-atom models. A core challenge in coarse-graining is to construct a force field that represents the interactions in the new representation in a way that preserves the atomistic-level properties. Many approaches to building coarse-grained force fields have limited transferability between different thermodynamic conditions as a result of averaging over internal fluctuations at a specific thermodynamic state point. Here, we use a graph-convolutional neural network architecture, the Hierarchically Interacting Particle Neural Network with Tensor Sensitivity (HIP-NN-TS), to develop a highly automated training pipeline for coarse grained force fields which allows for studying the transferability of coarse-grained models based on the force-matching approach. We show that this approach not only yields highly accurate force fields, but also that these force fields are more transferable through a variety of thermodynamic conditions. These results illustrate the potential of machine learning techniques such as graph neural networks to improve the construction of transferable coarse-grained force fields. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 31 pages, 6 figures + TOC figure + SI (15 pages, 3 figures)

arXiv:2307.04712 [pdf, other]

Machine learning potentials with Iterative Boltzmann Inversion: training to experiment

Authors: Sakib Matin, Alice Allen, Justin S. Smith, Nicholas Lubbers, Ryan B. Jadrich, Richard A. Messerly, Benjamin T. Nebgen, Ying Wai Li, Sergei Tretiak, Kipton Barros

Abstract: Methodologies for training machine learning potentials (MLPs) to quantum-mechanical simulation data have recently seen tremendous progress. Experimental data has a very different character than simulated data, and most MLP training procedures cannot be easily adapted to incorporate both types of data into the training process. We investigate a training procedure based on Iterative Boltzmann Invers… ▽ More Methodologies for training machine learning potentials (MLPs) to quantum-mechanical simulation data have recently seen tremendous progress. Experimental data has a very different character than simulated data, and most MLP training procedures cannot be easily adapted to incorporate both types of data into the training process. We investigate a training procedure based on Iterative Boltzmann Inversion that produces a pair potential correction to an existing MLP, using equilibrium radial distribution function data. By applying these corrections to a MLP for pure aluminum based on Density Functional Theory, we observe that the resulting model largely addresses previous overstructuring in the melt phase. Interestingly, the corrected MLP also exhibits improved performance in predicting experimental diffusion constants, which are not included in the training procedure. The presented method does not require auto-differentiating through a molecular dynamics solver, and does not make assumptions about the MLP architecture. The results suggest a practical framework of incorporating experimental data into machine learning models to improve accuracy of molecular dynamics simulations. △ Less

Submitted 10 July, 2023; originally announced July 2023.

arXiv:2307.04012 [pdf, other]

Learning Together: Towards foundational models for machine learning interatomic potentials with meta-learning

Authors: Alice E. A. Allen, Nicholas Lubbers, Sakib Matin, Justin Smith, Richard Messerly, Sergei Tretiak, Kipton Barros

Abstract: The development of machine learning models has led to an abundance of datasets containing quantum mechanical (QM) calculations for molecular and material systems. However, traditional training methods for machine learning models are unable to leverage the plethora of data available as they require that each dataset be generated using the same QM method. Taking machine learning interatomic potentia… ▽ More The development of machine learning models has led to an abundance of datasets containing quantum mechanical (QM) calculations for molecular and material systems. However, traditional training methods for machine learning models are unable to leverage the plethora of data available as they require that each dataset be generated using the same QM method. Taking machine learning interatomic potentials (MLIPs) as an example, we show that meta-learning techniques, a recent advancement from the machine learning community, can be used to fit multiple levels of QM theory in the same training process. Meta-learning changes the training procedure to learn a representation that can be easily re-trained to new tasks with small amounts of data. We then demonstrate that meta-learning enables simultaneously training to multiple large organic molecule datasets. As a proof of concept, we examine the performance of a MLIP refit to a small drug-like molecule and show that pre-training potentials to multiple levels of theory with meta-learning improves performance. This difference in performance can be seen both in the reduced error and in the improved smoothness of the potential energy surface produced. We therefore show that meta-learning can utilize existing datasets with inconsistent QM levels of theory to produce models that are better at specializing to new datasets. This opens new routes for creating pre-trained, foundational models for interatomic potentials. △ Less

Submitted 8 July, 2023; originally announced July 2023.

arXiv:2207.04142 [pdf, other]

Cluster Scaling and Critical Points: A Cautionary Tale

Authors: W. Klein, Harvey Gould, Sakib Matin

Abstract: Many systems in nature are conjectured to exist at a critical point, including the brain and earthquake faults. The primary reason for this conjecture is that the distribution of clusters (avalanches of firing neurons in the brain or regions of slip in earthquake faults) can be described by a power law. Because there are other mechanisms such as $1/f$ noise that can produce power laws, other crite… ▽ More Many systems in nature are conjectured to exist at a critical point, including the brain and earthquake faults. The primary reason for this conjecture is that the distribution of clusters (avalanches of firing neurons in the brain or regions of slip in earthquake faults) can be described by a power law. Because there are other mechanisms such as $1/f$ noise that can produce power laws, other criteria that the cluster critical exponents must satisfy can be used to conclude whether or not the observed power law behavior indicates an underlying critical point rather than an alternate mechanism. We show how a possible misinterpretation of the cluster scaling data can lead to incorrectly conclude that the measured critical exponents do not satisfy these criteria. Examples of the possible misinterpretation of the data for one-dimensional random site percolation and the one-dimensional Ising model are presented. We stress that the interpretation of a power law cluster distribution indicating the presence of a critical point is subtle, and its misinterpretation might lead to the abandonment of a promising area of research. △ Less

Submitted 21 March, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

arXiv:1909.06813 [pdf, other]

doi 10.1103/PhysRevResearch.3.013107

Scaling of causal neural avalanches in a neutral model

Authors: Sakib Matin, Thomas Tenzin, W. Klein

Abstract: Neural avalanches are collective firings of neurons that exhibit emergent scale-free behavior. Understanding the nature and distribution of these avalanches is an important element in understanding how the brain functions. We study a model of neural avalanches for which the dynamics are governed by neutral theory. The neural avalanches are defined using causal connections between the firing neuron… ▽ More Neural avalanches are collective firings of neurons that exhibit emergent scale-free behavior. Understanding the nature and distribution of these avalanches is an important element in understanding how the brain functions. We study a model of neural avalanches for which the dynamics are governed by neutral theory. The neural avalanches are defined using causal connections between the firing neurons. We analyze the scaling of causal neural avalanches as the critical point is approached from the absorbing phase. By using cluster analysis tools from percolation theory, we characterize the critical properties of the neural avalanches. We identify the tuning parameters consistent with experiments. The scaling hypothesis provides a unified explanation of the power laws which characterize the critical point. The critical exponents characterizing the avalanche distributions and divergence of the response functions are consistent with the predictions of the scaling hypothesis. We use a universal scaling function for the avalanche profile to find that the firing rates for avalanches of different durations show data collapse after appropriate rescaling. We also find data collapse for the avalanche distribution functions, which is stronger evidence of criticality than just the existence of power laws. Critical slowing-down and power law relaxation of avalanches is observed as the system is tuned to its critical point. We discuss how our results motivate future empirical studies of criticality in the brain. △ Less

Submitted 14 January, 2021; v1 submitted 15 September, 2019; originally announced September 2019.

Journal ref: Phys. Rev. Research 3, 013107 (2021)

arXiv:1907.11790 [pdf, other]

doi 10.1103/PhysRevE.101.022102

Prediction in a driven-dissipative system displaying a continuous phase transition

Authors: Chon-Kit Pun, Sakib Matin, W. Klein, Harvey Gould

Abstract: Prediction in complex systems at criticality is believed to be very difficult, if not impossible. Of particular interest is whether earthquakes, whose distribution follows a power law (Gutenberg-Richter) distribution, are in principle unpredictable. We study the predictability of event sizes in the Olmai-Feder-Christensen model at different proximities to criticality using a convolutional neural n… ▽ More Prediction in complex systems at criticality is believed to be very difficult, if not impossible. Of particular interest is whether earthquakes, whose distribution follows a power law (Gutenberg-Richter) distribution, are in principle unpredictable. We study the predictability of event sizes in the Olmai-Feder-Christensen model at different proximities to criticality using a convolutional neural network. The distribution of event sizes satisfies a power law with a cutoff for large events. We find that prediction decreases as criticality is approached and that prediction is possible only for large, non-scaling events. Our results suggest that earthquake faults that satisfy Gutenberg-Richter scaling are difficult to forecast. △ Less

Submitted 26 July, 2019; originally announced July 2019.

Comments: 12 pages, 6 figures

Journal ref: Phys. Rev. E 101, 022102 (2020)

arXiv:1904.06722 [pdf, other]

doi 10.1145/2984511.2984542

Boomerang: Rebounding the Consequences of Reputation Feedback on Crowdsourcing Platforms

Authors: Snehalkumar, S. Gaikwad, Durim Morina, Adam Ginzberg, Catherine Mullings, Shirish Goyal, Dilrukshi Gamage, Christopher Diemert, Mathias Burton, Sharon Zhou, Mark Whiting, Karolina Ziulkoski, Alipta Ballav, Aaron Gilbee, Senadhipathige S. Niranga, Vibhor Sehgal, Jasmine Lin, Leonardy Kristianto, Angela Richmond-Fuller, Jeff Regino, Nalin Chhibber, Dinesh Majeti, Sachin Sharma, Kamila Mananova, Dinesh Dhakal , et al. (13 additional authors not shown)

Abstract: Paid crowdsourcing platforms suffer from low-quality work and unfair rejections, but paradoxically, most workers and requesters have high reputation scores. These inflated scores, which make high-quality work and workers difficult to find, stem from social pressure to avoid giving negative feedback. We introduce Boomerang, a reputation system for crowdsourcing that elicits more accurate feedback b… ▽ More Paid crowdsourcing platforms suffer from low-quality work and unfair rejections, but paradoxically, most workers and requesters have high reputation scores. These inflated scores, which make high-quality work and workers difficult to find, stem from social pressure to avoid giving negative feedback. We introduce Boomerang, a reputation system for crowdsourcing that elicits more accurate feedback by rebounding the consequences of feedback directly back onto the person who gave it. With Boomerang, requesters find that their highly-rated workers gain earliest access to their future tasks, and workers find tasks from their highly-rated requesters at the top of their task feed. Field experiments verify that Boomerang causes both workers and requesters to provide feedback that is more closely aligned with their private opinions. Inspired by a game-theoretic notion of incentive-compatibility, Boomerang opens opportunities for interaction design to incentivize honest reporting over strategic dishonesty. △ Less

Submitted 14 April, 2019; originally announced April 2019.

ACM Class: H.5.3; H.1.2; J.4; K.4.4; K.4.3

Journal ref: Proceedings of the 29th Annual Symposium on User Interface Software and Technology, 2016

arXiv:1903.12652 [pdf, other]

doi 10.1103/PhysRevE.101.022103

Novel effective ergodicity breaking phase transition in a driven-dissipative system

Authors: Sakib Matin, Chon-Kit Pun, Harvey Gould, W. Klein

Abstract: We show that the Olami-Feder-Christensen model exhibits an effective ergodicity breaking transition as the noise is varied. Above the critical noise, the average stress on each site converges to the global average. Below the critical noise, the stress on individual sites becomes trapped in different limit cycles. We use ideas from the study of dynamical systems and compute recurrence plots and the… ▽ More We show that the Olami-Feder-Christensen model exhibits an effective ergodicity breaking transition as the noise is varied. Above the critical noise, the average stress on each site converges to the global average. Below the critical noise, the stress on individual sites becomes trapped in different limit cycles. We use ideas from the study of dynamical systems and compute recurrence plots and the recurrence rate. We identify the order parameter as the recurrence rate averaged over all sites and find numerical evidence that the transition can be characterized by exponents that are consistent with hyperscaling. △ Less

Submitted 28 July, 2019; v1 submitted 29 March, 2019; originally announced March 2019.

Journal ref: Phys. Rev. E 101, 022103 (2020)

arXiv:1903.11627 [pdf, ps, other]

Genetic drift in range expansions is very sensitive to density feedback in dispersal and growth

Authors: Gabriel Birzu, Sakib Matin, Oskar Hallatschek, Kirill S. Korolev

Abstract: Theory predicts rapid genetic drift during invasions, yet many expanding populations maintain high genetic diversity. We find that genetic drift is dramatically suppressed when dispersal rates increase with the population density because many more migrants from the diverse, high-density regions arrive at the expansion edge. When density-dependence is weak or negative, the effective population size… ▽ More Theory predicts rapid genetic drift during invasions, yet many expanding populations maintain high genetic diversity. We find that genetic drift is dramatically suppressed when dispersal rates increase with the population density because many more migrants from the diverse, high-density regions arrive at the expansion edge. When density-dependence is weak or negative, the effective population size of the front scales only logarithmically with the carrying capacity. The dependence, however, switches to a sublinear power law and then to a linear increase as the density-dependence becomes strongly positive. We develop a unified framework revealing that the transitions between different regimes of diversity loss are controlled by a single, universal parameter: the ratio of the expansion velocity to the geometric mean of dispersal and growth rates at expansion edge. Our results suggest that positive density-dependence could dramatically alter evolution in expanding populations even when its contributions to the expansion velocity is small. △ Less

Submitted 27 March, 2019; originally announced March 2019.

Comments: 36 pages, 5 figures, and 1 table

arXiv:1806.02480 [pdf, other]

Pinned, locked, pushed, and pulled traveling waves in structured environments

Authors: Ching-Hao Wang, Sakib Matin, Ashish B. George, Kirill S. Korolev

Abstract: Traveling fronts describe the transition between two alternative states in a great number of physical and biological systems. Examples include the spread of beneficial mutations, chemical reactions, and the invasions by foreign species. In homogeneous environments, the alternative states are separated by a smooth front moving at a constant velocity. This simple picture can break down in structured… ▽ More Traveling fronts describe the transition between two alternative states in a great number of physical and biological systems. Examples include the spread of beneficial mutations, chemical reactions, and the invasions by foreign species. In homogeneous environments, the alternative states are separated by a smooth front moving at a constant velocity. This simple picture can break down in structured environments such as tissues, patchy landscapes, and microfluidic devices. Habitat fragmentation can pin the front at a particular location or lock invasion velocities into specific values. Locked velocities are not sensitive to moderate changes in dispersal or growth and are determined by the spatial and temporal periodicity of the environment. The synchronization with the environment results in discontinuous fronts that propagate as periodic pulses. We characterize the transition from continuous to locked invasions and show that it is controlled by positive density-dependence in dispersal or growth. We also demonstrate that velocity locking is robust to demographic and environmental fluctuations and examine stochastic dynamics and evolution in locked invasions. △ Less

Submitted 10 July, 2020; v1 submitted 6 June, 2018; originally announced June 2018.

Comments: Equal contribution from first 3 authors

arXiv:1712.02003 [pdf, other]

doi 10.1038/s41598-018-38088-z

Universal fluctuations in growth dynamics of economic systems

Authors: Nathan C. Frey, Sakib Matin, H. Eugene Stanley, Michael Salinger

Abstract: The growth of business firms is an example of a system of complex interacting units that resembles complex interacting systems in nature such as earthquakes. Remarkably, work in econophysics has provided evidence that the statistical properties of the growth of business firms follow the same sorts of power laws that characterize physical systems near their critical points. Given how economies chan… ▽ More The growth of business firms is an example of a system of complex interacting units that resembles complex interacting systems in nature such as earthquakes. Remarkably, work in econophysics has provided evidence that the statistical properties of the growth of business firms follow the same sorts of power laws that characterize physical systems near their critical points. Given how economies change over time, whether these statistical properties are persistent, robust, and universal like those of physical systems remains an open question. Here, we show that the scaling properties of firm growth previously demonstrated for publicly-traded U.S. manufacturing firms from 1974 to 1993 apply to the same sorts of firms from 1993 to 2015, to firms in other broad sectors (such as materials), and to firms in new sectors (such as Internet services). We measure virtually the same scaling exponent for manufacturing for the 1993 to 2015 period as for the 1974 to 1993 period and virtually the same scaling exponent for other sectors as for manufacturing. Furthermore, we show that fluctuations of the growth rate for new industries self-organize into a power law over relatively short time scales. △ Less

Submitted 21 May, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

Comments: 15 pages, 7 figures

Journal ref: Scientific Reports 9, 713 (2019)

arXiv:1707.05645 [pdf, other]

Prototype Tasks: Improving Crowdsourcing Results through Rapid, Iterative Task Design

Authors: Snehalkumar "Neil" S. Gaikwad, Nalin Chhibber, Vibhor Sehgal, Alipta Ballav, Catherine Mullings, Ahmed Nasser, Angela Richmond-Fuller, Aaron Gilbee, Dilrukshi Gamage, Mark Whiting, Sharon Zhou, Sekandar Matin, Senadhipathige Niranga, Shirish Goyal, Dinesh Majeti, Preethi Srinivas, Adam Ginzberg, Kamila Mananova, Karolina Ziulkoski, Jeff Regino, Tejas Sarma, Akshansh Sinha, Abhratanu Paul, Christopher Diemert, Mahesh Murag , et al. (4 additional authors not shown)

Abstract: Low-quality results have been a long-standing problem on microtask crowdsourcing platforms, driving away requesters and justifying low wages for workers. To date, workers have been blamed for low-quality results: they are said to make as little effort as possible, do not pay attention to detail, and lack expertise. In this paper, we hypothesize that requesters may also be responsible for low-quali… ▽ More Low-quality results have been a long-standing problem on microtask crowdsourcing platforms, driving away requesters and justifying low wages for workers. To date, workers have been blamed for low-quality results: they are said to make as little effort as possible, do not pay attention to detail, and lack expertise. In this paper, we hypothesize that requesters may also be responsible for low-quality work: they launch unclear task designs that confuse even earnest workers, under-specify edge cases, and neglect to include examples. We introduce prototype tasks, a crowdsourcing strategy requiring all new task designs to launch a small number of sample tasks. Workers attempt these tasks and leave feedback, enabling the re- quester to iterate on the design before publishing it. We report a field experiment in which tasks that underwent prototype task iteration produced higher-quality work results than the original task designs. With this research, we suggest that a simple and rapid iteration cycle can improve crowd work, and we provide empirical evidence that requester "quality" directly impacts result quality. △ Less

Submitted 18 July, 2017; originally announced July 2017.

Comments: 2 pages (with 2 pages references, 2 pages Appx), HCOMP 2017, Association for the Advancement of Artificial Intelligence (www.aaai.org)

Report number: 1952894A

arXiv:1611.01572 [pdf, other]

doi 10.1145/2998181.2998234

Crowd Guilds: Worker-led Reputation and Feedback on Crowdsourcing Platforms

Authors: Mark E. Whiting, Dilrukshi Gamage, Snehalkumar S. Gaikwad, Aaron Gilbee, Shirish Goyal, Alipta Ballav, Dinesh Majeti, Nalin Chhibber, Angela Richmond-Fuller, Freddie Vargus, Tejas Seshadri Sarma, Varshine Chandrakanthan, Teogenes Moura, Mohamed Hashim Salih, Gabriel Bayomi Tinoco Kalejaiye, Adam Ginzberg, Catherine A. Mullings, Yoni Dayan, Kristy Milland, Henrique Orefice, Jeff Regino, Sayna Parsi, Kunz Mainali, Vibhor Sehgal, Sekandar Matin , et al. (3 additional authors not shown)

Abstract: Crowd workers are distributed and decentralized. While decentralization is designed to utilize independent judgment to promote high-quality results, it paradoxically undercuts behaviors and institutions that are critical to high-quality work. Reputation is one central example: crowdsourcing systems depend on reputation scores from decentralized workers and requesters, but these scores are notoriou… ▽ More Crowd workers are distributed and decentralized. While decentralization is designed to utilize independent judgment to promote high-quality results, it paradoxically undercuts behaviors and institutions that are critical to high-quality work. Reputation is one central example: crowdsourcing systems depend on reputation scores from decentralized workers and requesters, but these scores are notoriously inflated and uninformative. In this paper, we draw inspiration from historical worker guilds (e.g., in the silk trade) to design and implement crowd guilds: centralized groups of crowd workers who collectively certify each other's quality through double-blind peer assessment. A two-week field experiment compared crowd guilds to a traditional decentralized crowd work model. Crowd guilds produced reputation signals more strongly correlated with ground-truth worker quality than signals available on current crowd working platforms, and more accurate than in the traditional model. △ Less

Submitted 28 February, 2017; v1 submitted 4 November, 2016; originally announced November 2016.

Comments: 12 pages, 6 figures, 1 table. To be presented at CSCW2017

ACM Class: H.5.3

Journal ref: ACM Conference on Computer Supported Cooperative Work and Social Computing. ACM, New York, NY, USA, 1902-1913

Showing 1–13 of 13 results for author: Matin, S