Search | arXiv e-print repository

Evidential Deep Learning: Enhancing Predictive Uncertainty Estimation for Earth System Science Applications

Authors: John S. Schreck, David John Gagne II, Charlie Becker, William E. Chapman, Kim Elmore, Da Fan, Gabrielle Gantos, Eliot Kim, Dhamma Kimpara, Thomas Martin, Maria J. Molina, Vanessa M. Pryzbylo, Jacob Radford, Belen Saavedra, Justin Willson, Christopher Wirz

Abstract: Robust quantification of predictive uncertainty is critical for understanding factors that drive weather and climate outcomes. Ensembles provide predictive uncertainty estimates and can be decomposed physically, but both physics and machine learning ensembles are computationally expensive. Parametric deep learning can estimate uncertainty with one model by predicting the parameters of a probability distribution but does not account for epistemic uncertainty. Evidential deep learning, a technique that extends parametric deep learning to higher-order distributions, can account for both aleatoric and epistemic uncertainty with one model. This study compares the uncertainty derived from evidential neural networks to that obtained from ensembles. Through applications of classification of winter precipitation type and regression of surface layer fluxes, we show evidential deep learning models attaining predictive accuracy rivaling standard methods, while robustly quantifying both sources of uncertainty. We evaluate the uncertainty in terms of how well the predictions are calibrated and how well the uncertainty correlates with prediction error. Analyses of uncertainty in the context of the inputs reveal sensitivities to underlying meteorological processes, facilitating interpretation of the models. The conceptual simplicity, interpretability, and computational efficiency of evidential neural networks make them highly extensible, offering a promising approach for reliable and practical uncertainty quantification in Earth system science modeling.
In order to encourage broader adoption of evidential deep learning in Earth System Science, we have developed a new Python package, MILES-GUESS (https://github.com/ai2es/miles-guess), that enables users to train and evaluate both evidential and ensemble deep learning.

Submitted 19 February, 2024; v1 submitted 22 September, 2023; originally announced September 2023.
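
Illustrative note: the abstract above says an evidential model separates aleatoric from epistemic uncertainty in a single forward pass. A minimal sketch of that split for regression follows, assuming the Normal-Inverse-Gamma parameterization common in the evidential regression literature. The abstract does not state which parameterization the paper uses, the function name `nig_uncertainties` is hypothetical, and this is not the MILES-GUESS API.

```python
def nig_uncertainties(gamma, nu, alpha, beta):
    """Split the predictive variance of a Normal-Inverse-Gamma output.

    Assumed (not taken from the abstract): the network emits four values
    per target, (gamma, nu, alpha, beta), with nu > 0, alpha > 1, beta > 0.
    """
    if nu <= 0 or alpha <= 1 or beta <= 0:
        raise ValueError("need nu > 0, alpha > 1, beta > 0")
    prediction = gamma                        # E[mu]
    aleatoric = beta / (alpha - 1.0)          # E[sigma^2], noise in the data
    epistemic = beta / (nu * (alpha - 1.0))   # Var[mu], uncertainty in the model
    return prediction, aleatoric, epistemic


# Example with made-up parameter values.
pred, alea, epis = nig_uncertainties(gamma=1.0, nu=2.0, alpha=3.0, beta=4.0)
print(pred, alea, epis)
```

Under this assumed parameterization, a larger `nu` (more "virtual evidence") shrinks the epistemic term while leaving the aleatoric term unchanged.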

arXiv:2305.11910 [pdf, other]

Machine Learning and VIIRS Satellite Retrievals for Skillful Fuel Moisture Content Monitoring in Wildfire Management

Authors: John S. Schreck, William Petzke, Pedro A. Jimenez, Thomas Brummet, Jason C. Knievel, Eric James, Branko Kosovic, David John Gagne

Abstract: Monitoring the fuel moisture content (FMC) of vegetation is crucial for managing and mitigating the impact of wildland fires. The combination of in situ FMC observations with numerical weather prediction (NWP) models and satellite retrievals has enabled the development of machine learning (ML) models to estimate dead FMC retrievals over the contiguous US (CONUS). In this study, ML models were trained using variables from the National Water Model and the High-Resolution Rapid Refresh (HRRR) NWP models, and static variables characterizing the surface properties, as well as surface reflectances and land surface temperature (LST) retrievals from the VIIRS instrument on board the Suomi-NPP satellite system. Extensive hyper-parameter optimization yielded skillful FMC models compared to a daily climatography RMSE (+44%) and to an hourly climatography RMSE (+24%). Furthermore, VIIRS retrievals were important predictors for estimating FMC, contributing significantly as a group due to their high band-correlation. In contrast, individual predictors in the HRRR group had relatively high importance according to the explainability techniques used. When both HRRR and VIIRS retrievals were not used as model inputs, the performance dropped significantly. If VIIRS retrievals were not used, the RMSE performance was worse. This highlights the importance of VIIRS retrievals in modeling FMC, which yielded better models compared to MODIS.
Overall, the importance of the VIIRS group of predictors corroborates the dynamic relationship between the 10-h fuel and the atmosphere and soil moisture. These findings emphasize the significance of selecting appropriate data sources for predicting FMC with ML models, with VIIRS retrievals and selected HRRR variables being critical components in producing skillful FMC estimates.

Submitted 17 May, 2023; originally announced May 2023.

arXiv:2301.02757 [pdf, other]

doi 10.1364/OE.486741

Mimicking non-ideal instrument behavior for hologram processing using neural style translation

Authors: John S. Schreck, Matthew Hayman, Gabrielle Gantos, Aaron Bansemer, David John Gagne

Abstract: Holographic cloud probes provide unprecedented information on cloud particle density, size and position. Each laser shot captures particles within a large volume, where images can be computationally refocused to determine particle size and shape. However, processing these holograms, either with standard methods or with machine learning (ML) models, requires considerable computational resources, time and occasional human intervention. ML models are trained on simulated holograms obtained from the physical model of the probe since real holograms have no absolute truth labels. Using another processing method to produce labels would be subject to errors that the ML model would subsequently inherit. Models perform well on real holograms only when image corruption is performed on the simulated images during training, thereby mimicking non-ideal conditions in the actual probe (Schreck et al., 2022). Optimizing image corruption requires a cumbersome manual labeling effort. Here we demonstrate the application of the neural style translation approach (Gatys et al., 2016) to the simulated holograms. With a pre-trained convolutional neural network (VGG-19), the simulated holograms are "stylized" to resemble the real ones obtained from the probe, while at the same time preserving the simulated image "content" (e.g. the particle locations and sizes). Two image similarity metrics concur that the stylized images are more like real holograms than the synthetic ones.
With an ML model trained to predict particle locations and shapes on the stylized data sets, we observed comparable performance on both simulated and real holograms, obviating the need to perform manual labeling. The described approach is not specific to hologram images and could be applied in other domains for capturing noise and imperfections in observational instruments to make simulated data more like real world observations.

Submitted 6 January, 2023; originally announced January 2023.

Comments: 23 pages, 9 figures

arXiv:2209.10630 [pdf, other]

Stochastic Kinetic Study of Protein Aggregation and Molecular Crowding Effects of Ab40 and Ab42

Authors: John Bridstrup, Jian-Min Yuan, John S. Schreck

Abstract: Two isoforms of beta amyloid peptides, Ab40 and Ab42, differ from each other only in the last two amino acids, IA, at the end of Ab42. They, however, differ significantly in their ability to induce Alzheimer's disease (AD). The rate curves of fibril growth of Ab40 and Ab42 and the effects of molecular crowding have been measured in in vitro experiments. These experimental curves, on the other hand, have been fitted in terms of rate constants for elementary reaction steps using rate equation approaches. Several sets of such rate parameters have been reported in the literature. Employing a recently developed stochastic kinetic method, implemented in a browser-based simulator, popsim, we aim to reveal the differences in the kinetic behaviors implied by these sets of rate parameters. In particular, the stochastic method is used to distinguish the kinetic behaviors of the Ab40 and Ab42 isoforms. As a result, we make general comments on the usefulness of these sets of rate parameters.

Submitted 21 September, 2022; originally announced September 2022.

Comments: To appear in the Journal of the Chinese Chemical Society

arXiv:2203.08898 [pdf, other]

Neural network processing of holographic images

Authors: John S. Schreck, Gabrielle Gantos, Matthew Hayman, Aaron Bansemer, David John Gagne

Abstract: HOLODEC, an airborne cloud particle imager, captures holographic images of a fixed volume of cloud to characterize the types and sizes of cloud particles, such as water droplets and ice crystals. Cloud particle properties include position, diameter, and shape. We present a hologram processing algorithm, HolodecML, that utilizes a neural segmentation model, GPUs, and computational parallelization. HolodecML is trained using synthetically generated holograms based on a model of the instrument, and predicts masks around particles found within reconstructed images. From these masks, the position and size of the detected particles can be characterized in three dimensions. In order to successfully process real holograms, we find we must apply a series of image corrupting transformations and noise to the synthetic images used in training. In this evaluation, HolodecML had comparable position and size estimation performance to the standard processing method, but improved particle detection by nearly 20% on several thousand manually labeled HOLODEC images. However, the improvement only occurred when image corruption was performed on the simulated images during training, thereby mimicking non-ideal conditions in the actual probe. The trained model also learned to differentiate artifacts and other impurities in the HOLODEC images from the particles, even though no such objects were present in the training data set, while the standard processing method struggled to separate particles from artifacts.
The novelty of the training approach, which leveraged noise as a means for parameterizing non-ideal aspects of the HOLODEC detector, could be applied in other domains where the theoretical model is incapable of fully describing the real-world operation of the instrument and accurate truth data required for supervised learning cannot be obtained from real-world observations.

Submitted 18 March, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: 38 pages, 15 figures. Submitted to Atmospheric Measurement Techniques

arXiv:2111.10166 [pdf, other]

doi 10.1016/B978-0-12-824406-7.00016-6

Free-energy landscapes of DNA and its assemblies: Perspectives from coarse-grained modelling

Authors: Jonathan P. K. Doye, Ard A. Louis, John S. Schreck, Flavio Romano, Ryan M. Harrison, Majid Mosayebi, Megan C. Engel, Thomas E. Ouldridge

Abstract: This chapter will provide an overview of how characterizing free-energy landscapes can provide insights into the biophysical properties of DNA, as well as into the behaviour of the DNA assemblies used in the field of DNA nanotechnology. The landscapes for these complex systems are accessible through the use of accurate coarse-grained descriptions of DNA. Particular foci will be the landscapes associated with DNA self-assembly and mechanical deformation, where the latter can arise from either externally imposed forces or internal stresses.

Submitted 19 November, 2021; originally announced November 2021.

Comments: 20 pages, 5 figures

Journal ref: Frontiers of Nanoscience, Vol. 21, Ch. 9, pp 195-210 (2022)

arXiv:2108.06517 [pdf, other]

doi 10.1039/D1NR05716B

Characterizing the free-energy landscapes of DNA origamis

Authors: Chak Kui Wong, Chuyan Tang, John S. Schreck, Jonathan P. K. Doye

Abstract: We show how coarse-grained modelling combined with umbrella sampling using distance-based order parameters can be applied to compute the free-energy landscapes associated with mechanical deformations of large DNA nanostructures. We illustrate this approach for the strong bending of DNA nanotubes and the potentially bistable landscape of twisted DNA origami sheets. The homogeneous bending of the DNA nanotubes is well described by the worm-like chain model; for more extreme bending the nanotubes reversibly buckle with the bending deformations localized at one or two "kinks". For a twisted one-layer DNA origami, the twist is coupled to the bending of the sheet giving rise to a free-energy landscape that has two nearly-degenerate minima that have opposite curvatures. By contrast, for a two-layer origami, the increased stiffness with respect to bending leads to a landscape with a single free-energy minimum that has a saddle-like geometry. The ability to compute such landscapes is likely to be particularly useful for DNA mechanotechnology and for understanding stress accumulation during the self-assembly of origamis into higher-order structures.

Submitted 14 August, 2021; originally announced August 2021.

Comments: main (17 pages, 7 figures); Supporting Information (14 pages, 11 figures)

Journal ref: Nanoscale 14, 2638-2648 (2022)

arXiv:2102.01569 [pdf, other]

Stochastic kinetic treatment of protein aggregation and the effects of macromolecular crowding

Authors: John Bridstrup, John S Schreck, Jesse L Jorgenson, Jian-Min Yuan

Abstract: Investigation of protein self-assembly processes is important for the understanding of the growth processes of functional proteins as well as disease-causing amyloids. Inside cells, intrinsic molecular fluctuations are so high that they cast doubt on the validity of the deterministic rate equation approach. Furthermore, the protein environments inside cells are often crowded with other macromolecules, with volume fractions of the crowders as high as 40%. We study protein self-aggregation at the cellular level using Gillespie's stochastic algorithm and investigate the effects of macromolecular crowding using models built on scaled-particle and transition-state theories. The stochastic kinetic method can be formulated to provide information on the dominating aggregation mechanisms in a method called reaction frequency (or propensity) analysis. This method reveals that the change of scaling laws related to the lag time can be directly related to the change in the frequencies of reaction mechanisms. Further examination of the time evolution of the fibril mass and length quantities unveils that maximal fluctuations occur in the periods of rapid fibril growth and the fluctuations of both quantities can be sensitive functions of rate constants. The presence of crowders often amplifies the roles of primary and secondary nucleation and causes shifting in the relative importance of elongation, shrinking, fragmentation and coagulation of linear aggregates.
Comparison of the results of stochastic simulations with those of rate equations gives us information on the convergence relation between them and how the roles of reaction mechanisms change as the system volume is varied.

Submitted 2 February, 2021; originally announced February 2021.
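
Illustrative note: the abstract above relies on Gillespie's stochastic algorithm. The sketch below shows the direct method on a deliberately tiny toy system, reversible dimerization (2A <-> B), rather than the paper's full aggregation scheme with nucleation, elongation, fragmentation and crowding. The function name, the rate constants and the reaction set are all assumptions made for illustration.

```python
import random


def gillespie_dimerization(n_monomer, k_assoc, k_dissoc, t_end, seed=0):
    """Gillespie direct method for the toy reaction set 2A <-> B.

    Returns a list of (time, monomer_count, dimer_count) tuples.
    """
    rng = random.Random(seed)
    t, a, b = 0.0, n_monomer, 0
    trajectory = [(t, a, b)]
    while True:
        # Propensities: number of distinct monomer pairs, and number of dimers.
        rate_assoc = k_assoc * a * (a - 1) / 2.0
        rate_dissoc = k_dissoc * b
        total = rate_assoc + rate_dissoc
        if total == 0.0:
            break  # no reaction can fire any more
        # Waiting time to the next event is exponential with rate `total`.
        t += rng.expovariate(total)
        if t > t_end:
            break
        # Choose which reaction fires, in proportion to its propensity.
        if rng.random() * total < rate_assoc:
            a, b = a - 2, b + 1
        else:
            a, b = a + 2, b - 1
        trajectory.append((t, a, b))
    return trajectory


traj = gillespie_dimerization(n_monomer=100, k_assoc=0.01, k_dissoc=0.1, t_end=5.0)
print(len(traj), traj[-1])
```

Counting how often each branch fires along a trajectory is the kind of bookkeeping that a reaction frequency analysis, as named in the abstract, would build on.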

arXiv:2004.05052 [pdf]

The oxDNA coarse-grained model as a tool to simulate DNA origami

Authors: Jonathan P. K. Doye, Hannah Fowler, Domen Prešern, Joakim Bohlin, Lorenzo Rovigatti, Flavio Romano, Petr Šulc, Chak Kui Wong, Ard A. Louis, John S. Schreck, Megan C. Engel, Michael Matthies, Erik Benson, Erik Poppleton, Benedict E. K. Snodin

Abstract: This chapter introduces how to run molecular dynamics simulations for DNA origami using the oxDNA coarse-grained model.

Submitted 10 April, 2020; originally announced April 2020.

Comments: 17 pages, 5 figures

arXiv:1901.06569 [pdf, other]

Learning retrosynthetic planning through self-play

Authors: John S. Schreck, Connor W. Coley, Kyle J. M. Bishop

Abstract: The problem of retrosynthetic planning can be framed as a one-player game, in which the chemist (or a computer program) works backwards from a molecular target to simpler starting materials through a series of choices regarding which reactions to perform. This game is challenging as the combinatorial space of possible choices is astronomical, and the value of each choice remains uncertain until the synthesis plan is completed and its cost evaluated. Here, we address this problem using deep reinforcement learning to identify policies that make (near) optimal reaction choices during each step of retrosynthetic planning. Using simulated experience or self-play, we train neural networks to estimate the expected synthesis cost or value of any given molecule based on a representation of its molecular structure. We show that learned policies based on this value network outperform heuristic approaches in synthesizing unfamiliar molecules from available starting materials using the fewest number of reactions. We discuss how the learned policies described here can be incorporated into existing synthesis planning tools and how they can be adapted to changes in the synthesis cost objective or material availability.

Submitted 19 January, 2019; originally announced January 2019.

arXiv:1809.08430 [pdf, other]

doi 10.1093/nar/gky1304

Coarse-grained modelling of the structural properties of DNA origami

Authors: Benedict E. K. Snodin, John S. Schreck, Flavio Romano, Ard A. Louis, Jonathan P. K. Doye

Abstract: We use the oxDNA coarse-grained model to provide a detailed characterization of the fundamental structural properties of DNA origamis, focussing on archetypal 2D and 3D origamis. The model reproduces well the characteristic pattern of helix bending in a 2D origami, showing that it stems from the intrinsic tendency of anti-parallel four-way junctions to splay apart, a tendency that is enhanced both by less screened electrostatic interactions and by increased thermal motion. We also compare to the structure of a 3D origami whose structure has been determined by cryo-electron microscopy. The oxDNA average structure has a root-mean-square deviation from the experimental structure of 8.4 Angstrom, which is of the order of the experimental resolution. These results illustrate that the oxDNA model is capable of providing detailed and accurate insights into the structure of DNA origamis, and has the potential to be used to routinely pre-screen putative origami designs.

Submitted 22 September, 2018; originally announced September 2018.

Comments: 14 pages, 10 figures

Journal ref: Nucleic Acids Res. 47, 1585-1597 (2019)

arXiv:1712.02161 [pdf, other]

doi 10.1063/1.5019344

Multi-scale coarse-graining for the study of assembly pathways in DNA-brick self assembly

Authors: Pedro Fonseca, Flavio Romano, John S. Schreck, Thomas E. Ouldridge, Jonathan P. K. Doye, Ard A. Louis

Abstract: Inspired by recent successes using single-stranded DNA tiles to produce complex structures, we develop a two-step coarse-graining approach that uses detailed thermodynamic calculations with oxDNA, a nucleotide-based model of DNA, to parametrize a coarser kinetic model that can reach the time and length scales needed to study the assembly mechanisms of these structures. We test the model by performing a detailed study of the assembly pathways for a two-dimensional target structure made up of 334 unique strands, each of which is 42 nucleotides long. Without adjustable parameters, the model reproduces a critical temperature for the formation of the assembly that is close to the temperature at which assembly first occurs in experiments. Furthermore, the model allows us to investigate in detail the nucleation barriers and the distribution of critical nucleus shapes for the assembly of a single target structure. The assembly intermediates are compact and highly connected (although not maximally so) and classical nucleation theory provides a good fit to the height and shape of the nucleation barrier at temperatures close to where assembly first occurs.

Submitted 28 March, 2018; v1 submitted 6 December, 2017; originally announced December 2017.

Journal ref: J. Chem. Phys 148, 134910 (2018)

arXiv:1607.06626 [pdf, other]

doi 10.1088/0953-8984/29/1/014006

Self-assembly of two-dimensional binary quasicrystals: A possible route to a DNA quasicrystal

Authors: Aleks Reinhardt, John S. Schreck, Flavio Romano, Jonathan P. K. Doye

Abstract: We use Monte Carlo simulations and free-energy techniques to show that binary solutions of penta- and hexavalent two-dimensional patchy particles can form thermodynamically stable quasicrystals even at very narrow patch widths, provided their patch interactions are chosen in an appropriate way. Such patchy particles can be thought of as a coarse-grained representation of DNA multi-arm 'star' motifs, which can be chosen to bond with one another very specifically by tuning the DNA sequences of the protruding arms. We explore several possible design strategies and conclude that DNA star tiles that are designed to interact with one another in a specific but not overly constrained way could potentially be used to construct soft quasicrystals in experiment. We verify that such star tiles can form stable dodecagonal motifs using oxDNA, a realistic coarse-grained model of DNA.

Submitted 22 July, 2016; originally announced July 2016.

Journal ref: J. Phys.: Condens. Matter 29, 014006 (2017)

arXiv:1504.00821 [pdf, other]

doi 10.1063/1.4921957

Introducing Improved Structural Properties and Salt Dependence into a Coarse-Grained Model of DNA

Authors: Benedict E. K. Snodin, Ferdinando Randisi, Majid Mosayebi, Petr Sulc, John S. Schreck, Flavio Romano, Thomas E. Ouldridge, Roman Tsukanov, Eyal Nir, Ard A. Louis, Jonathan P. K. Doye

Abstract: We introduce an extended version of oxDNA, a coarse-grained model of DNA designed to capture the thermodynamic, structural and mechanical properties of single- and double-stranded DNA. By including explicit major and minor grooves, and by slightly modifying the coaxial stacking and backbone-backbone interactions, we improve the ability of the model to treat large (kilobase-pair) structures such as DNA origami which are sensitive to these geometric features. Further, we extend the model, which was previously parameterised to just one salt concentration ([Na$^+$]=0.5M), so that it can be used for a range of salt concentrations including those corresponding to physiological conditions. Finally, we use new experimental data to parameterise the oxDNA potential so that consecutive adenine bases stack with a different strength to consecutive thymine bases, a feature which allows a more accurate treatment of systems where the flexibility of single-stranded regions is important. We illustrate the new possibilities opened up by the updated model, oxDNA2, by presenting results from simulations of the structure of large DNA objects and by using the model to investigate some salt-dependent properties of DNA.

Submitted 19 May, 2015; v1 submitted 3 April, 2015; originally announced April 2015.

Journal ref: J. Chem. Phys. 142, 234901 (2015)

arXiv:1412.6309 [pdf, other]

doi 10.1063/1.4917199

Characterizing the bending and flexibility induced by bulges in DNA duplexes

Authors: John S. Schreck, Thomas E. Ouldridge, Flavio Romano, Ard A. Louis, Jonathan P. K. Doye

Abstract: Advances in DNA nanotechnology have stimulated the search for simple motifs that can be used to control the properties of DNA nanostructures. One such motif, which has been used extensively in structures such as polyhedral cages, two-dimensional arrays, and ribbons, is a bulged duplex, that is two helical segments that connect at a bulge loop. We use a coarse-grained model of DNA to characterize such bulged duplexes. We find that this motif can adopt structures belonging to two main classes: one where the stacking of the helices at the center of the system is preserved, the geometry is roughly straight and the bulge is on one side of the duplex, and the other where the stacking at the center is broken, thus allowing this junction to act as a hinge and increasing flexibility. Small loops favor states where stacking at the center of the duplex is preserved, with loop bases either flipped out or incorporated into the duplex. Duplexes with longer loops show more of a tendency to unstack at the bulge and adopt an open structure. The unstacking probability, however, is highest for loops of intermediate lengths, when the rigidity of single-stranded DNA is significant and the loop resists compression. The properties of this basic structural motif clearly correlate with the structural behavior of certain nano-scale objects, where the enhanced flexibility associated with larger bulges has been used to tune the self-assembly product as well as the detailed geometry of the resulting nanostructures.

Submitted 19 December, 2014; originally announced December 2014.

Comments: 12 pages + 4 pages of supplemental materials

Journal ref: J. Chem. Phys. 142, 165101 (2015)

arXiv:1408.4401 [pdf, other]

doi 10.1093/nar/gkv582

DNA hairpins primarily promote duplex melting rather than inhibiting hybridization

Authors: John S. Schreck, Thomas E. Ouldridge, Flavio Romano, Petr Sulc, Liam Shaw, Ard A. Louis, Jonathan P. K. Doye

Abstract: The effect of secondary structure on DNA duplex formation is poorly understood. We use a coarse-grained model of DNA to show that specific 3- and 4-base pair hairpins reduce hybridization rates by factors of 2 and 10 respectively, in good agreement with experiment. By contrast, melting rates are accelerated by factors of ~100 and ~2000. This surprisingly large speed-up occurs because hairpins form during the melting process, stabilizing partially melted states, and facilitating dissociation. These results may help guide the design of DNA devices that use hairpins to modulate hybridization and dissociation pathways and rates.

Submitted 19 August, 2014; originally announced August 2014.

Comments: 5 pages + 14 pages of appendices

Journal ref: Nucl. Acids Res. 43, 6181-6190 (2015)

arXiv:1308.5161 [pdf, other]

doi 10.1021/jp401586p

A Kinetic Study of Amyloid Formation: Fibril Growth and Length Distributions

Authors: John S. Schreck, Jian-Min Yuan

Abstract: We propose a kinetic model for the self-aggregation of amyloid proteins. By extending several well-known models for protein aggregation, the time evolution of aggregate concentrations containing $r$ proteins, denoted $c_r(t)$, can be written in terms of generalized Smoluchowski kinetics. With this approach we take into account all possible aggregation and fragmentation reactions involving clusters of any size. Correspondingly, an aggregate of size x+y could be formed from, or break up into, two smaller constituent aggregates of sizes x and y. The rates of each aggregation or fragmentation reaction, called kernels, are specified in terms of the aggregate size, and we solve $c_r(t)$ for large cluster sizes using numerical techniques. We show that by using Smoluchowski kinetics many pathways to fibrillation are possible and quantities, such as the aggregate length distribution at an arbitrary time, can be calculated. We show that the predicted results of the model are in agreement with the experimental observations.

Submitted 23 August, 2013; originally announced August 2013.

Journal ref: J Phys Chem B. 2013 May 30;117(21):6574-83

arXiv:1308.5132 [pdf, other]

doi 10.3390/ijms140917420

Statistical Mechanical Treatments of Protein Amyloid Formation

Authors: John S. Schreck, Jian-Min Yuan

Abstract: Protein aggregation is an important field of investigation because it is closely related to the problem of neurodegenerative diseases, to the development of biomaterials, and to the growth of cellular structures such as the cytoskeleton. Self-aggregation of protein amyloids, for example, is a complicated process involving many species and levels of structures. This complexity, however, can be dealt with using statistical mechanical tools, such as free energies, partition functions, and transfer matrices. In this article, we review general strategies for studying protein aggregation using statistical mechanical approaches and show that canonical and grand canonical ensembles can be used in such approaches. The grand canonical approach is particularly convenient since competing pathways of assembly and disassembly can be considered simultaneously. Another advantage of using statistical mechanics is that numerically exact solutions can be obtained for all of the thermodynamic properties of fibrils, such as the amount of fibrils formed, as a function of initial protein concentration. Furthermore, statistical mechanics models can be used to fit experimental data when they are available for comparison.

Submitted 23 August, 2013; originally announced August 2013.

Comments: Accepted to IJMS

Journal ref: Int. J. Mol. Sci. 2013, 14, 17420-17452

arXiv:1308.3843 [pdf, other]

doi 10.1039/C3CP53545B

Coarse-graining DNA for simulations of DNA nanotechnology

Authors: Jonathan P. K. Doye, Thomas E. Ouldridge, Ard A. Louis, Flavio Romano, Petr Sulc, Christian Matek, Benedict E. K. Snodin, Lorenzo Rovigatti, John S. Schreck, Ryan M. Harrison, William P. J. Smith

Abstract: To simulate long time and length scale processes involving DNA it is necessary to use a coarse-grained description. Here we provide an overview of different approaches to such coarse graining, focussing on those at the nucleotide level that allow the self-assembly processes associated with DNA nanotechnology to be studied. OxDNA, our recently-developed coarse-grained DNA model, is particularly suited to this task, and has opened up this field to systematic study by simulations. We illustrate some of the range of DNA nanotechnology systems to which the model is being applied, as well as the insights it can provide into fundamental biophysical properties of DNA.

Submitted 18 August, 2013; originally announced August 2013.

Comments: 20 pages, 9 figures

Journal ref: Phys. Chem. Chem. Phys. 15, 20395-20414 (2013)

arXiv:1111.2323 [pdf, ps, other]

doi 10.1016/j.bpj.2011.11.1400

A Statistical Mechanical Approach to Protein Aggregation

Authors: John S. Schreck, Jian-Min Yuan

Abstract: We develop a theory of aggregation using statistical mechanical methods. An example of a complicated aggregation system with several levels of structures is peptide/protein self-assembly. The problem of protein aggregation is important for the understanding and treatment of neurodegenerative diseases and also for the development of bio-macromolecules as new materials. We write the effective Hamiltonian in terms of interaction energies between protein monomers, protein and solvent, as well as between protein filaments. The grand partition function can be expressed in terms of a Zimm-Bragg-like transfer matrix, which is calculated exactly and all thermodynamic properties can be obtained. We start with two-state and three-state descriptions of protein monomers using Potts models that can be generalized to include q-states, for which the exactly solvable feature of the model remains. We focus on n X N lattice systems, corresponding to the ordered structures observed in some real fibrils. We have obtained results on nucleation processes and phase diagrams, in which a protein property such as the sheet content of aggregates is expressed as a function of the number of proteins on the lattice and inter-protein or interfacial interaction energies. We have applied our methods to Aβ(1-40) and Curli fibrils and obtained results in good agreement with experiments.

Submitted 9 November, 2011; originally announced November 2011.

Comments: 13 pages, 8 figures, accepted to J. Chem. Phys

Journal ref: J. Chem. Phys. 135, 235102 (2011)

arXiv:1005.4919 [pdf, ps, other]

doi 10.1103/PhysRevE.81.061919

Exactly Solvable Model for Helix-Coil-Sheet Transitions in Protein Systems

Authors: John S. Schreck, Jian-Min Yuan

Abstract: In view of the important role helix-sheet transitions play in protein aggregation, we introduce a simple model to study secondary structural transitions of helix-coil-sheet systems using a Potts model starting with an effective Hamiltonian. This energy function depends on four parameters that approximately describe entropic and enthalpic contributions to the stability of a polypeptide in helical and sheet conformations. The sheet structures involve long-range interactions between residues which are far in sequence, but are in contact in real space. Such contacts are included in the Hamiltonian. Using standard statistical mechanical techniques, the partition function is solved exactly using transfer matrices. Based on this model, we study thermodynamic properties of polypeptides, including phase transitions between helix, sheet, and coil structures.

Submitted 8 November, 2011; v1 submitted 26 May, 2010; originally announced May 2010.

Comments: Updated version with corrections

Journal ref: Phys. Rev. E 81, 061919 (2010)
