-
C-A test of DNA force fields
Authors:
Ivan A. Strelnikov,
Natalya A. Kovaleva,
Artem P. Klinov,
Elena A. Zubova
Abstract:
The DNA duplex may be locally strongly bent in complexes with proteins, for example, with polymerases or in a nucleosome. At such bends, the DNA helix is locally in the non-canonical forms A (with a narrow major groove and a large amount of north sugars) or C (with a narrow minor groove and a large share of BII phosphates). To model the formation of such complexes by molecular dynamics methods, th…
▽ More
The DNA duplex may be locally strongly bent in complexes with proteins, for example, with polymerases or in a nucleosome. At such bends, the DNA helix is locally in the non-canonical forms A (with a narrow major groove and a large amount of north sugars) or C (with a narrow minor groove and a large share of BII phosphates). To model the formation of such complexes by molecular dynamics methods, the force field is required to reproduce these conformational transitions for a naked DNA. We analyzed the available experimental data on the B-C and B-A transitions under the conditions easily implemented in modeling: in an aqueous NaCl solution. We selected six DNA duplexes which conformations at different salt concentrations are known reliably enough. At low salt concentrations, poly(GC) and poly(A) are in the B-form, classical and slightly shifted to the A-form, respectively. The duplexes ATAT and GGTATACC have a strong and salt concentration dependent bias toward the A-form. The polymers poly(AC) and poly(G) take the C- and A-forms, respectively, at high salt concentrations. The reproduction of the behavior of these oligomers can serve as a test for the balance of interactions between the base stacking and the conformational flexibility of the sugar-phosphate backbone in a DNA force field. We tested the AMBER bsc1 and CHARMM36 force fields and their hybrids, and we failed to reproduce the experiment. In all the force fields, the salt concentration dependence is very weak. The known B-philicity of the AMBER force field proved to result from the B-philicity of its excessively strong base stacking. In the CHARMM force field, the B-form is a result of a fragile balance between the A-philic base stacking (especially for G:C pairs) and the C-philic backbone. Finally, we analyzed some recent simulations of the LacI-, SOX-4-, and Sac7d-DNA complex formation in the framework of the AMBER force field.
△ Less
Submitted 19 November, 2022;
originally announced November 2022.
-
FOLIO: Natural Language Reasoning with First-Order Logic
Authors:
Simeng Han,
Hailey Schoelkopf,
Yilun Zhao,
Zhenting Qi,
Martin Riddell,
Wenfei Zhou,
James Coady,
David Peng,
Yujie Qiao,
Luke Benson,
Lucy Sun,
Alex Wardle-Solano,
Hannah Szabo,
Ekaterina Zubova,
Matthew Burtell,
Jonathan Fan,
Yixin Liu,
Brian Wong,
Malcolm Sailor,
Ansong Ni,
Linyong Nan,
Jungo Kasai,
Tao Yu,
Rui Zhang,
Alexander R. Fabbri
, et al. (10 additional authors not shown)
Abstract:
Large language models (LLMs) have achieved remarkable performance on a variety of natural language understanding tasks. However, existing benchmarks are inadequate in measuring the complex logical reasoning capabilities of a model. We present FOLIO, a human-annotated, logically complex and diverse dataset for reasoning in natural language (NL), equipped with first-order logic (FOL) annotations. FO…
▽ More
Large language models (LLMs) have achieved remarkable performance on a variety of natural language understanding tasks. However, existing benchmarks are inadequate in measuring the complex logical reasoning capabilities of a model. We present FOLIO, a human-annotated, logically complex and diverse dataset for reasoning in natural language (NL), equipped with first-order logic (FOL) annotations. FOLIO consists of 1,430 examples (unique conclusions), each paired with one of 487 sets of premises used to deductively reason for the validity of each conclusion. The logical correctness of the premises and conclusions is ensured by their FOL annotations, which are automatically verified by an FOL inference engine. In addition to the main NL reasoning task, NL-FOL pairs in FOLIO constitute a new NL-FOL translation dataset. Our experiments on FOLIO systematically evaluate the FOL reasoning ability of supervised fine-tuning on medium-sized language models. For both NL reasoning and NL-FOL translation, we benchmark multiple state-of-the-art language models. Our results show that a subset of FOLIO presents a challenge for one of the most capable {Large Language Model (LLM)} publicly available, GPT-4.
△ Less
Submitted 17 May, 2024; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Shear induced martensitic transformations in crystalline polyethylene: direct MD simulation
Authors:
Ivan A. Strelnikov,
Elena A. Zubova
Abstract:
For the first time, we carry out molecular dynamics (MD) simulation of shear-induced martensitic phase transitions between the orthorhombic and non-orthorhombic (triclinic and monoclinic) phases of crystalline polyethylene (PE) in the framework of a realistic all atom model of the polymer. We show that the variation of the shear rate allows observing on a nano-sample both a strongly nonequilibrium…
▽ More
For the first time, we carry out molecular dynamics (MD) simulation of shear-induced martensitic phase transitions between the orthorhombic and non-orthorhombic (triclinic and monoclinic) phases of crystalline polyethylene (PE) in the framework of a realistic all atom model of the polymer. We show that the variation of the shear rate allows observing on a nano-sample both a strongly nonequilibrium phase transition occurring by random nucleation and irregular growth of a new phase ('civilian' way, for rapid deformations) and the coherent, or 'military', kinetics (generally considered as usual for martensitic transformations). We induce transitions from the orthorhombic to the triclinic phase according to two transformation modes observed in experiment on PE single crystals. Rapid deformation favors the transition directly to the triclinic phase, slow deformation - first to the intermediate monoclinic, and only then - to the triclinic phase. The second way corresponds to the experiment on extended chain PE. We explain this result and analyze the competition between different transformation and plastic deformation modes. Rotations of PE chains around their axes necessary for the transition between the orthorhombic and non-orthorhombic phases are executed by short twist defects diffusing along the chains. The transition between the monoclinic and triclinic phases occurs through half-chain-period translations of the chains along their axes, mostly collectively, as crystallographic slips.
△ Less
Submitted 29 March, 2019; v1 submitted 17 March, 2019;
originally announced March 2019.
-
The "sugar" coarse-grained DNA model
Authors:
Natalia A. Kovaleva,
Irina P. Kikot,
Mikhail A. Mazo,
Elena A. Zubova
Abstract:
More than twenty coarse-grained (CG) DNA models have been developed for simulating the behavior of this molecule under various conditions, including those required for nanotechnology. However, none of these models reproduces the DNA polymorphism associated with conformational changes in the ribose rings of the DNA backbone. These changes make an essential contribution to the DNA local deformabilit…
▽ More
More than twenty coarse-grained (CG) DNA models have been developed for simulating the behavior of this molecule under various conditions, including those required for nanotechnology. However, none of these models reproduces the DNA polymorphism associated with conformational changes in the ribose rings of the DNA backbone. These changes make an essential contribution to the DNA local deformability and provide the possibility of the transition of the DNA double helix from the B-form to the A-form during interactions with biological molecules. We propose a CG representation of the ribose conformational flexibility. We substantiate the choice of the CG sites (6 per nucleotide) needed for the "sugar" GC DNA model, and obtain the potentials of the CG interactions between the sites by the "bottom-up" approach using the all-atom AMBER force field. We show that the representation of the ribose flexibility requires one non-harmonic and one three-particle potential, the forms of both the potentials being different from the ones generally used. The model also includes (i) explicit representation of ions (in an implicit solvent) and (ii) sequence dependence. With these features, the sugar CG DNA model reproduces (with the same parameters) both the B- and A- stable forms under corresponding conditions and demonstrates both the A to B and the B to A phase transitions.
△ Less
Submitted 7 July, 2016; v1 submitted 13 January, 2014;
originally announced January 2014.
-
The simplest model of polymer crystal exhibiting polymorphism
Authors:
Elena A. Zubova,
Alexander V. Savin,
Nikolay K. Balabaev
Abstract:
Almost all the polymer crystals have several polymorphic modifications. Their structure and existence conditions, as well as transitions between them are not understood even in the case of the 'model' polymer polyethylene (PE). For analysis of polymorphism in polymer crystals, we consider the simplest possible model of polymer chain: an extended flat zigzag made of 'united' atoms (replacing CH2-gr…
▽ More
Almost all the polymer crystals have several polymorphic modifications. Their structure and existence conditions, as well as transitions between them are not understood even in the case of the 'model' polymer polyethylene (PE). For analysis of polymorphism in polymer crystals, we consider the simplest possible model of polymer chain: an extended flat zigzag made of 'united' atoms (replacing CH2-groups in PE chain); the united atoms belonging to different zigzags interact via Lennard-Jones potential. Analysis of potential of interaction between such zigzags allowed to predict the structure of five possible equilibrium lattices in polymer crystal built out of such zigzags. Molecular dynamics simulation of the crystal built out of flexible zigzags showed that, depending on model parameters (dimensions of the zigzag and equilibrium distance of Lennard-Jones potential), one to three of these lattices are stable in bulk at low temperatures. We have determined the model parameters at which the existing stable lattices are analogous to the ones observed in real PE and linear alkanes. The triclinic lattice has the lowest potential energy, then follows the monoclinic lattice, and the orthorhombic lattice has the highest energy, exactly as in full atomic (with hydrogens) molecular dynamics models.
△ Less
Submitted 5 September, 2011;
originally announced September 2011.
-
Can low-frequency breathers exist in a quasi-1D crystal?
Authors:
A. V. Savin,
E. A. Zubova,
L. I. Manevitch
Abstract:
We investigate a quasi-1D crystal: 2D system of coupled linear chains of particles with strong intra-chain and weak inter-chain interactions. Nonlinear dynamics of one of these chains when the rest of them are fixed is reduced to the well known Frenkel-Kontorova (FK) model. Its continuum limit, sine-Gordon (sG) equation, predicts two types of soliton solutions: topological solitons and breathers…
▽ More
We investigate a quasi-1D crystal: 2D system of coupled linear chains of particles with strong intra-chain and weak inter-chain interactions. Nonlinear dynamics of one of these chains when the rest of them are fixed is reduced to the well known Frenkel-Kontorova (FK) model. Its continuum limit, sine-Gordon (sG) equation, predicts two types of soliton solutions: topological solitons and breathers. It is known that the quasi-1D topological solitons exist also in a 2D system of coupled chains and even in a 3D model of a polymer crystal. Numerical simulation shows that the breathers inherent to the FK model do not exist in the system of chains. The effect changes scenario of kink-antikink collision at small velocities: it always results only in intensive phonon radiation while kink-antikink recombination in the FK model results in long-living low-frequency sG breather creation. The reason of the difference is that the model of coupled chains catches 'acoustic' part of phonon spectrum of a crystal while the FK model does not. Low-frequency SG breathers in a crystal intensively emit resonant 'acoustic' phonons and come to ruin.
△ Less
Submitted 28 September, 2004;
originally announced September 2004.