-
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
Authors:
Robert Kirk,
Amy Zhang,
Edward Grefenstette,
Tim Rocktäschel
Abstract:
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to produce RL algorithms whose policies generalise well to novel unseen situations at deployment time, avoiding overfitting to their training environments. Tackling this is vital if we are to deploy reinforcement learning algorithms in real world scenarios, where the environment will be diverse, dynamic and unpred…
▽ More
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to produce RL algorithms whose policies generalise well to novel unseen situations at deployment time, avoiding overfitting to their training environments. Tackling this is vital if we are to deploy reinforcement learning algorithms in real world scenarios, where the environment will be diverse, dynamic and unpredictable. This survey is an overview of this nascent field. We rely on a unifying formalism and terminology for discussing different ZSG problems, building upon previous works. We go on to categorise existing benchmarks for ZSG, as well as current methods for tackling these problems. Finally, we provide a critical discussion of the current state of the field, including recommendations for future work. Among other conclusions, we argue that taking a purely procedural content generation approach to benchmark design is not conducive to progress in ZSG, we suggest fast online adaptation and tackling RL-specific problems as some areas for future work on methods for ZSG, and we recommend building benchmarks in underexplored problem settings such as offline RL ZSG and reward-function variation.
△ Less
Submitted 19 January, 2023; v1 submitted 18 November, 2021;
originally announced November 2021.
-
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Authors:
Mikayel Samvelyan,
Robert Kirk,
Vitaly Kurin,
Jack Parker-Holder,
Minqi Jiang,
Eric Hambro,
Fabio Petroni,
Heinrich Küttler,
Edward Grefenstette,
Tim Rocktäschel
Abstract:
Progress in deep reinforcement learning (RL) is heavily driven by the availability of challenging benchmarks used for training agents. However, benchmarks that are widely adopted by the community are not explicitly designed for evaluating specific capabilities of RL methods. While there exist environments for assessing particular open problems in RL (such as exploration, transfer learning, unsuper…
▽ More
Progress in deep reinforcement learning (RL) is heavily driven by the availability of challenging benchmarks used for training agents. However, benchmarks that are widely adopted by the community are not explicitly designed for evaluating specific capabilities of RL methods. While there exist environments for assessing particular open problems in RL (such as exploration, transfer learning, unsupervised environment design, or even language-assisted RL), it is generally difficult to extend these to richer, more complex environments once research goes beyond proof-of-concept results. We present MiniHack, a powerful sandbox framework for easily designing novel RL environments. MiniHack is a one-stop shop for RL experiments with environments ranging from small rooms to complex, procedurally generated worlds. By leveraging the full set of entities and environment dynamics from NetHack, one of the richest grid-based video games, MiniHack allows designing custom RL testbeds that are fast and convenient to use. With this sandbox framework, novel environments can be designed easily, either using a human-readable description language or a simple Python interface. In addition to a variety of RL tasks and baselines, MiniHack can wrap existing RL benchmarks and provide ways to seamlessly add additional complexity.
△ Less
Submitted 16 November, 2021; v1 submitted 27 September, 2021;
originally announced September 2021.
-
Language Models as a Knowledge Source for Cognitive Agents
Authors:
Robert E. Wray, III,
James R. Kirk,
John E. Laird
Abstract:
Language models (LMs) are sentence-completion engines trained on massive corpora. LMs have emerged as a significant breakthrough in natural-language processing, providing capabilities that go far beyond sentence completion including question answering, summarization, and natural-language inference. While many of these capabilities have potential application to cognitive systems, exploiting languag…
▽ More
Language models (LMs) are sentence-completion engines trained on massive corpora. LMs have emerged as a significant breakthrough in natural-language processing, providing capabilities that go far beyond sentence completion including question answering, summarization, and natural-language inference. While many of these capabilities have potential application to cognitive systems, exploiting language models as a source of task knowledge, especially for task learning, offers significant, near-term benefits. We introduce language models and the various tasks to which they have been applied and then review methods of knowledge extraction from language models. The resulting analysis outlines both the challenges and opportunities for using language models as a new knowledge source for cognitive systems. It also identifies possible ways to improve knowledge extraction from language models using the capabilities provided by cognitive systems. Central to success will be the ability of a cognitive agent to itself learn an abstract model of the knowledge implicit in the LM as well as methods to extract high-quality knowledge effectively and efficiently. To illustrate, we introduce a hypothetical robot agent and describe how language models could extend its task knowledge and improve its performance and the kinds of knowledge and methods the agent can use to exploit the knowledge within a language model.
△ Less
Submitted 23 October, 2021; v1 submitted 16 September, 2021;
originally announced September 2021.
-
Piecewise circular curves and positivity
Authors:
Jean-Philippe Burelle,
Ryan Kirk
Abstract:
We introduce the moduli space of generic piecewise circular $n$-gons in the Riemann sphere and relate it to a moduli space of Legendrian polygons. We prove that when $n=2k$, this moduli space contains a connected component homeomorphic to the Fock-Goncharov space of $k$-tuples of positive flags for $\mathsf{PSp}(4,\mathbb{R})$ and hence is a topological ball. We characterize this component geometr…
▽ More
We introduce the moduli space of generic piecewise circular $n$-gons in the Riemann sphere and relate it to a moduli space of Legendrian polygons. We prove that when $n=2k$, this moduli space contains a connected component homeomorphic to the Fock-Goncharov space of $k$-tuples of positive flags for $\mathsf{PSp}(4,\mathbb{R})$ and hence is a topological ball. We characterize this component geometrically as the space of simple piecewise circular curves with decreasing curvature.
△ Less
Submitted 19 August, 2021;
originally announced August 2021.
-
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate
Authors:
Hannah Rose Kirk,
Bertram Vidgen,
Paul Röttger,
Tristan Thrush,
Scott A. Hale
Abstract:
Detecting online hate is a complex task, and low-performing models have harmful consequences when used for sensitive applications such as content moderation. Emoji-based hate is an emerging challenge for automated detection. We present HatemojiCheck, a test suite of 3,930 short-form statements that allows us to evaluate performance on hateful language expressed with emoji. Using the test suite, we…
▽ More
Detecting online hate is a complex task, and low-performing models have harmful consequences when used for sensitive applications such as content moderation. Emoji-based hate is an emerging challenge for automated detection. We present HatemojiCheck, a test suite of 3,930 short-form statements that allows us to evaluate performance on hateful language expressed with emoji. Using the test suite, we expose weaknesses in existing hate detection models. To address these weaknesses, we create the HatemojiBuild dataset using a human-and-model-in-the-loop approach. Models built with these 5,912 adversarial examples perform substantially better at detecting emoji-based hate, while retaining strong performance on text-only hate. Both HatemojiCheck and HatemojiBuild are made publicly available. See our Github Repository (https://github.com/HannahKirk/Hatemoji). HatemojiCheck, HatemojiBuild, and the final Hatemoji Model are also available on HuggingFace (https://huggingface.co/datasets/HannahRoseKirk/).
△ Less
Submitted 6 May, 2022; v1 submitted 12 August, 2021;
originally announced August 2021.
-
Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset
Authors:
Hannah Rose Kirk,
Yennie Jun,
Paulius Rauba,
Gal Wachtel,
Ruining Li,
Xingjian Bai,
Noah Broestl,
Martin Doff-Sotta,
Aleksandar Shtedritski,
Yuki M. Asano
Abstract:
Hateful memes pose a unique challenge for current machine learning systems because their message is derived from both text- and visual-modalities. To this effect, Facebook released the Hateful Memes Challenge, a dataset of memes with pre-extracted text captions, but it is unclear whether these synthetic examples generalize to `memes in the wild'. In this paper, we collect hateful and non-hateful m…
▽ More
Hateful memes pose a unique challenge for current machine learning systems because their message is derived from both text- and visual-modalities. To this effect, Facebook released the Hateful Memes Challenge, a dataset of memes with pre-extracted text captions, but it is unclear whether these synthetic examples generalize to `memes in the wild'. In this paper, we collect hateful and non-hateful memes from Pinterest to evaluate out-of-sample performance on models pre-trained on the Facebook dataset. We find that memes in the wild differ in two key aspects: 1) Captions must be extracted via OCR, injecting noise and diminishing performance of multimodal models, and 2) Memes are more diverse than `traditional memes', including screenshots of conversations or text on a plain background. This paper thus serves as a reality check for the current benchmark of hateful meme detection and its applicability for detecting real world hate.
△ Less
Submitted 9 July, 2021;
originally announced July 2021.
-
The 300 "Correlators" Suggests 4D, $\cal N$ = 1 SUSY Is a Solution to a Set of Sudoku Puzzles
Authors:
Aleksander J. Cianciara,
S. James Gates Jr,
Yangrui Hu,
Renee Kirk
Abstract:
A conjecture is made that the weight space for 4D, $\cal N$-extended supersymmetrical representations is embedded within the permutahedra associated with permutation groups ${\mathbb{S}}{}_{d}$. Adinkras and Coxeter Groups associated with minimal representations of 4D, $\cal N$ = 1 supersymmetry provide evidence supporting this conjecture. It is shown the appearance of the mathematics of 4D,…
▽ More
A conjecture is made that the weight space for 4D, $\cal N$-extended supersymmetrical representations is embedded within the permutahedra associated with permutation groups ${\mathbb{S}}{}_{d}$. Adinkras and Coxeter Groups associated with minimal representations of 4D, $\cal N$ = 1 supersymmetry provide evidence supporting this conjecture. It is shown the appearance of the mathematics of 4D, $\cal N$ = 1 minimal off-shell supersymmetry representations is equivalent to solving a four color problem on the truncated octahedron. This observation suggest an entirely new way to approach the off-shell SUSY auxiliary field problem based on IT algorithms probing the properties of ${\mathbb{S}}{}_{d}$.
△ Less
Submitted 13 April, 2021; v1 submitted 24 December, 2020;
originally announced December 2020.
-
Accurate Automatic Segmentation of Amygdala Subnuclei and Modeling of Uncertainty via Bayesian Fully Convolutional Neural Network
Authors:
Yilin Liu,
Gengyan Zhao,
Brendon M. Nacewicz,
Nagesh Adluru,
Gregory R. Kirk,
Peter A Ferrazzano,
Martin Styner,
Andrew L. Alexander
Abstract:
Recent advances in deep learning have improved the segmentation accuracy of subcortical brain structures, which would be useful in neuroimaging studies of many neurological disorders. However, most of the previous deep learning work does not investigate the specific difficulties that exist in segmenting extremely small but important brain regions such as the amygdala and its subregions. To tackle…
▽ More
Recent advances in deep learning have improved the segmentation accuracy of subcortical brain structures, which would be useful in neuroimaging studies of many neurological disorders. However, most of the previous deep learning work does not investigate the specific difficulties that exist in segmenting extremely small but important brain regions such as the amygdala and its subregions. To tackle this challenging task, a novel 3D Bayesian fully convolutional neural network was developed to apply a dilated dualpathway approach that retains fine details and utilizes both local and more global contextual information to automatically segment the amygdala and its subregions at high precision. The proposed method provides insights on network design and sampling strategy that target segmentations of small 3D structures. In particular, this study confirms that a large context, enabled by a large field of view, is beneficial for segmenting small objects; furthermore, precise contextual information enabled by dilated convolutions allows for better boundary localization, which is critical for examining the morphology of the structure. In addition, it is demonstrated that the uncertainty information estimated from our network may be leveraged to identify atypicality in data. Our method was compared with two state-of-the-art deep learning models and a traditional multi-atlas approach, and exhibited excellent performance as measured both by Dice overlap as well as average symmetric surface distance. To the best of our knowledge, this work is the first deep learning-based approach that targets the subregions of the amygdala.
△ Less
Submitted 19 February, 2019;
originally announced February 2019.
-
Next-Generation Quantum Theory of Atoms in Molecules for the Ground and Excited State of the Ring-Opening of Cyclohexadiene (CHD)
Authors:
Tian Tian,
Tianlv Xu,
Steven R. Kirk,
Michael Filatov,
Samantha Jenkins
Abstract:
The factors underlying the experimentally observed branching ratio (70:30) of the (1,3-cyclohexadiene) CHD$\rightarrow$HT (1,3,5-hexatriene) photochemical ring-opening reaction are investigated. The ring-opening reaction path is optimized by a high-level multi-reference DFT method and the density along the path is analyzed by the QTAIM and stress tensor methods. The performed density analysis sugg…
▽ More
The factors underlying the experimentally observed branching ratio (70:30) of the (1,3-cyclohexadiene) CHD$\rightarrow$HT (1,3,5-hexatriene) photochemical ring-opening reaction are investigated. The ring-opening reaction path is optimized by a high-level multi-reference DFT method and the density along the path is analyzed by the QTAIM and stress tensor methods. The performed density analysis suggests that, in both $S_{1}$ and $S_{0}$ electronic states, there exists an attractive interaction between the ends of the fissile $σ$ -bond of CHD that steers the ring-opening reaction predominantly in the direction of restoration of the ring. It is suggested that opening of the ring and formation of the reaction product (HT) can only be achieved when there is a sufficient persistent nuclear momentum in the direction of stretching of the fissile bond. As this orientation of the nuclear momentum vector can be expected to be relatively rare during the dynamics, this explains the observed low quantum yield of the ring-opening reaction.
△ Less
Submitted 31 October, 2018;
originally announced October 2018.
-
Stress Tensor Eigenvector Following with Next-Generation Quantum Theory of Atoms in Molecules
Authors:
Jia Hui Li,
Wei Jie Huang,
Tianlv Xu,
Steven R. Kirk,
Samantha Jenkins
Abstract:
The eigenvectors of the electronic stress tensor have been identified as useful for the prediction of chemical reactivity because they determine the most preferred directions to move the bonds that correspond to a qualitative change in the molecular electronic structure. A new 3-D vector based interpretation of the chemical bond that we refer to as the bond-path framework set…
▽ More
The eigenvectors of the electronic stress tensor have been identified as useful for the prediction of chemical reactivity because they determine the most preferred directions to move the bonds that correspond to a qualitative change in the molecular electronic structure. A new 3-D vector based interpretation of the chemical bond that we refer to as the bond-path framework set $\mathbb{B} = \{p,q,r\}$ provides a version of the quantum theory of atoms in molecules (QTAIM) beyond the minimum definition for bonding that is particularly suitable for understanding changes in molecular electronic structure that occur during reactions. The bond-path framework set $\mathbb{B}$ is straightforwardly constructed and visualized from the eigenvalues and eigenvectors of QTAIM. This approach is applied to the structural deformations of ethene that occur during applied torsion $θ$, -180.0° $\leq$ $θ$ $\leq$ +180.0°. The corresponding stress tensor version is readily constructed as $\mathbb{B}_σ = \{p_σ,q_σ,r\}$ within the QTAIM partitioning making it possible to compare experimentally and computationally determined electronic charge densities. The bond-path framework set $\mathbb{B}$ or $\mathbb{B}_σ$ are the networks that comprise three strands: the least preferred ($p, p_σ$), most preferred ($q, q_σ$) and $r$ is the familiar QTAIM bond-path. We demonstrate that the most preferred direction for bond motion using the stress tensor corresponds to the most compressible direction and not to the least compressible direction as previously reported. We show the necessity for a directional approach constructed using the eigenvectors along the entire bond length and demonstrate the insufficiency of the sole use of scalar measures for capturing the nature of the stress tensor within the QTAIM partitioning.
△ Less
Submitted 16 September, 2018;
originally announced September 2018.
-
Automatic Identification of Twin Zygosity in Resting-State Functional MRI
Authors:
Andrey Gritsenko,
Martin A. Lindquist,
Gregory R. Kirk,
Moo K. Chung
Abstract:
A key strength of twin studies arises from the fact that there are two types of twins, monozygotic and dizygotic, that share differing amounts of genetic information. Accurate differentiation of twin types allows efficient inference on genetic influences in a population. However, identification of zygosity is often prone to errors without genotying. In this study, we propose a novel pairwise featu…
▽ More
A key strength of twin studies arises from the fact that there are two types of twins, monozygotic and dizygotic, that share differing amounts of genetic information. Accurate differentiation of twin types allows efficient inference on genetic influences in a population. However, identification of zygosity is often prone to errors without genotying. In this study, we propose a novel pairwise feature representation to classify the zygosity of twin pairs of resting state functional magnetic resonance images (rs-fMRI). For this, we project an fMRI signal to a set of basis functions and use the projection coefficients as the compact and discriminative feature representation of noisy fMRI. We encode the relationship between twins as the correlation between the new feature representations across brain regions. We employ hill climbing variable selection to identify brain regions that are the most genetically affected. The proposed framework was applied to 208 twin pairs and achieved 94.19% classification accuracy in automatically identifying the zygosity of paired images.
△ Less
Submitted 26 October, 2018; v1 submitted 30 June, 2018;
originally announced July 2018.
-
A Vector-Based Representation of the Chemical Bond for the Substituted Torsion of Biphenyl
Authors:
Jiahui Li,
Weijie Huang,
Tianlv Xu,
Steven R. Kirk,
Samantha Jenkins
Abstract:
We use a new interpretation of the chemical bond within QTAIM. The bond-path framework set $\mathbb{B} = \{p, q, r\}$ with associated linkages with lengths $\mathbb{H}^*,m\mathbb{H}$ and the familiar bond-path length is used to describe a torsion $θ$, $0.0^{\circ} \leq θ\lt 22.0^{\circ}$ of \emph{para}-substituted biphenyl, $\mathrm{C}_{12}\mathrm{H}_{9-x}$, $x = \mathrm{N}(\mathrm{CH}_3)_2$,…
▽ More
We use a new interpretation of the chemical bond within QTAIM. The bond-path framework set $\mathbb{B} = \{p, q, r\}$ with associated linkages with lengths $\mathbb{H}^*,m\mathbb{H}$ and the familiar bond-path length is used to describe a torsion $θ$, $0.0^{\circ} \leq θ\lt 22.0^{\circ}$ of \emph{para}-substituted biphenyl, $\mathrm{C}_{12}\mathrm{H}_{9-x}$, $x = \mathrm{N}(\mathrm{CH}_3)_2$, $\mathrm{NH}_2$, $\mathrm{CH}_3$, CHO, CN, $\mathrm{NO}_{2}$. We include consideration of the H---H bonding interactions and find that the lengths $\mathbb{H} \gt \mathbb{H}^{*}$ that we explain in terms of the most and least preferred directions of charge density accumulation. We also consider the fractional eigenvector-following path lengths $\mathbb{H}_f$ and $\mathbb{H}_{f{θ_{\rm min}}}$.
△ Less
Submitted 4 April, 2018;
originally announced April 2018.
-
A Vector-Based Representation of the Chemical Bond for the Normal Modes of Benzene
Authors:
Wei Jie Huang,
Alireza Azizi,
Tianlv Xu,
Steven R. Kirk,
Samantha Jenkins
Abstract:
We introduce a vector-based interpretation of the chemical bond within the quantum theory of atoms in molecules (QTAIM), the bond-path framework set $\mathbb{B} = \{p, q, r\}$, to follow variations in the 3-D morphology of all bonds for the four infra-red (IR) active normal modes of benzene. The bond-path framework set comprises three unique paths $p$, $q$ and $r$ where $r$ is the familiar QTAIM b…
▽ More
We introduce a vector-based interpretation of the chemical bond within the quantum theory of atoms in molecules (QTAIM), the bond-path framework set $\mathbb{B} = \{p, q, r\}$, to follow variations in the 3-D morphology of all bonds for the four infra-red (IR) active normal modes of benzene. The bond-path framework set comprises three unique paths $p$, $q$ and $r$ where $r$ is the familiar QTAIM bond concept of bond-path ($r$) while the two new paths $p$ and $q$ are formulated from the least and most preferred directions of electron density accumulation respectively. We find 3-D distortions including bond stretching/compression, torsion and curving. We introduce two fractional measures to quantify these variations away from linearity of the bond.
△ Less
Submitted 7 May, 2018; v1 submitted 3 April, 2018;
originally announced April 2018.
-
Quinone-based Switches for Candidate Building Blocks of Molecular Junctions with QTAIM and the Stress Tensor
Authors:
Tianlv Xu,
Lingling Wang,
Yang **,
Tanja van Mourik,
Herbert Früchtl,
Steven R. Kirk,
Samantha Jenkins
Abstract:
The current work investigates candidate building blocks based on molecular junctions from hydrogen transfer tautomerization in the benzoquinone-like core of an azophenine molecule with QTAIM and the recently-introduced stress tensor trajectory analysis. We find that in particular the stress tensor trajectories are well suited to describe the mechanism of the switching process. The effects of an Fe…
▽ More
The current work investigates candidate building blocks based on molecular junctions from hydrogen transfer tautomerization in the benzoquinone-like core of an azophenine molecule with QTAIM and the recently-introduced stress tensor trajectory analysis. We find that in particular the stress tensor trajectories are well suited to describe the mechanism of the switching process. The effects of an Fe-dopant atom coordinated to the quinone ring, as well as F and Cl substitution of different ring-hydrogens, are investigated and the new QTAIM and stress tensor analysis is used to draw conclusions on the effectiveness of such molecules as molecular switches in nano-sized electronic circuits. We find that the coordinated Fe-dopant greatly improves the switching properties, both in terms of the tautomerisation barrier that has to be crossed in the switching process and the expected conductance behavior, while the effects of hydrogen substitution are more subtle. The absence of the Fe-dopant atom led to impaired functioning of the switch 'OFF' mechanism as well as coinciding with the formation of closed-shell H---H bond critical points that indicated a strained or electron deficient environment. Our analysis demonstrates promise for future use in design of molecular electronic devices.
△ Less
Submitted 2 April, 2018;
originally announced April 2018.
-
Predicting Competitive and Non-Competitive Torquoselectivity in Ring-Opening Reactions using QTAIM and the Stress Tensor
Authors:
Alireza Azizi,
Roya Momen,
Alejandro Morales-Bayuelo,
Tianlv Xu,
Steven R. Kirk,
Samantha Jenkins
Abstract:
We present a new vector-based representation of the chemical bond referred to as the bond-path frame-work set $\mathbb{B} = {p, q, r}$, where $p$, $q$ and $r$ represent three paths with corresponding eigenvector-following path lengths $\mathbb{H}^{*},\mathbb{H}$ and the bond-path length from the quantum theory of atoms in molecules (QTAIM). We find that longer path lengths $\mathbb{H}$ of the ring…
▽ More
We present a new vector-based representation of the chemical bond referred to as the bond-path frame-work set $\mathbb{B} = {p, q, r}$, where $p$, $q$ and $r$ represent three paths with corresponding eigenvector-following path lengths $\mathbb{H}^{*},\mathbb{H}$ and the bond-path length from the quantum theory of atoms in molecules (QTAIM). We find that longer path lengths $\mathbb{H}$ of the ring-opening bonds predict the preference for the transition state inward (\textbf{TSIC}) or transition state outward (\textbf{TSOC}) ring opening reactions in agreement with experiment for all five reactions \textbf{R1-R5}. Competitiveness and non-competitiveness have traditionally been considered using activation energies. The activation energy however, for \textbf{R3} does not satisfactorily determine competitiveness or provide consistent agreement with experimental yields. We choose a selection of five competitive and non-competitive reactions; methyl-cyclobutene (\textbf{R1}), ethyl-methyl-cyclobutene (\textbf{R2}), iso-propyl-methyl-cyclobutene (\textbf{R3}), ter-butyl-methyl-cyclobutene (\textbf{R4}) and phenyl-methyl-cyclobutene (R5). Therefore, in this investigation we provide a new criterion, within the QTAIM framework, to determine whether the reactions \textbf{R1-R5} are competitive or non-competitive. We that find \textbf{R2}, \textbf{R3} and \textbf{R5} are competitive and \textbf{R1} and \textbf{R4} are non-competitive reactions in contrast to the results from the activation energies, calling into question the reliability of activation energies.
△ Less
Submitted 2 April, 2018;
originally announced April 2018.
-
Next-Generation Quantum Theory of Atoms in Molecules for the Ground and Excited States of Fulvene
Authors:
Wei Jie Huang,
Roya Momen,
Alireza Azizi,
Tianlv Xu,
Steven R. Kirk,
Michael Filatov,
Samantha Jenkins
Abstract:
A vector-based representation of the chemical bond is introduced that we refer to as the bond-path frame-work set = $\mathbb{B} = \{p, q, r\}$, where $p$, $q$ and $r$ represent three paths with corresponding eigenvector-following path lengths $\mathbb{H}^{*},\mathbb{H}$ and the familiar quantum theory of atoms in molecules (QTAIM) bond-path length. The eigenvector-following path lengths…
▽ More
A vector-based representation of the chemical bond is introduced that we refer to as the bond-path frame-work set = $\mathbb{B} = \{p, q, r\}$, where $p$, $q$ and $r$ represent three paths with corresponding eigenvector-following path lengths $\mathbb{H}^{*},\mathbb{H}$ and the familiar quantum theory of atoms in molecules (QTAIM) bond-path length. The eigenvector-following path lengths $\mathbb{H}^{*}$ and $\mathbb{H}$ are constructed along the bond-path from the $\underline{\mathbf{\mathit{e}}}_{1}$ and $\underline{\mathbf{\mathit{e}}}_{2}$ Hessian eigenvectors respectively, which correspond to the least and most preferred directions of charge density accumulation. In particular, the paths $p$ and $q$ provide a vector representation of the scalar QTAIM ellipticity ε. The bond-path frame-work set $\mathbb{B}$ is applied to the excited state deactivation of fulvene that involves distortions along various intramolecular degrees of freedom, such as the bond stretching/compression of bond length alternation (BLA) and bond torsion distortions. We find that the $\mathbb{H}^{*}$ and $\mathbb{H}$ lengths can differentiate between the ground and excited electronic states, in contrast to the QTAIM bond-path length. In particular, the eigenvector-following path lengths $\mathbb{H}^{*}$ and $\mathbb{H}$ are found to be shorter for the excited state than the ground state for both the BLA and bond torsion distortions indicating that distortions resulting in lower $\mathbb{H}^{*}$ and $\mathbb{H}$ values are easier to perform.
△ Less
Submitted 6 May, 2018; v1 submitted 2 April, 2018;
originally announced April 2018.
-
The Role of Weak Interactions in Characterizing Peptide Folding Preferences using a QTAIM Interpretation of the Ramachandran Plot (φ-ψ)
Authors:
Roya Momen,
Alireza Azizi,
Lingling Wang,
Yang **,
Tianlv Xu,
Steven R. Kirk,
Wenxuan Li,
Sergei Manzhos,
Samantha Jenkins
Abstract:
The Ramachandran plot is a potent way to understand structures of biomolecules, however, the original formulation of the Ramachandran plot only considers backbone conformations. We formulate a new interpretation of the original Ramachandran plot ($φ-ψ$) that can include a description of the weaker interactions including both the hydrogen bonds and H$---$H bonds as a new way to derive insights into…
▽ More
The Ramachandran plot is a potent way to understand structures of biomolecules, however, the original formulation of the Ramachandran plot only considers backbone conformations. We formulate a new interpretation of the original Ramachandran plot ($φ-ψ$) that can include a description of the weaker interactions including both the hydrogen bonds and H$---$H bonds as a new way to derive insights into the phenomenon of peptide folding. We use QTAIM (quantum theory of atoms in molecules) to interpret the Ramachandran plot. Specifically, we show that QTAIM analysis permits identifying key regions of the Ramachandran plot without the need for massive data sets. A highly non-linear relationship is found between the QTAIM vector-derived interpreted Ramachandran plot and the conventional Ramachandran plot ($φ-ψ$) demonstrating that this new approach is not a trivial coordinate transformation. An investigation of both the backbone and the weaker bonds within the framework of the QTAIM interpreted Ramachandran plot was found to be in line with physical intuition. The least-preferred directions calculated for the hydrogen bonds and H$---$H bonds were found to coincide with the 'unlikely' regions of the Ramachandran plot.
△ Less
Submitted 18 June, 2017;
originally announced June 2017.
-
Can Tonne-Scale Direct Detection Experiments Discover Nuclear Dark Matter?
Authors:
A. Butcher,
R. Kirk,
J. Monroe,
S. M. West
Abstract:
Models of nuclear dark matter propose that the dark sector contains large composite states consisting of dark nucleons in analogy to Standard Model nuclei. We examine the direct detection phenomenology of a particular class of nuclear dark matter model at the current generation of tonne-scale liquid noble experiments, in particular DEAP-3600 and XENON1T. In our chosen nuclear dark matter scenario…
▽ More
Models of nuclear dark matter propose that the dark sector contains large composite states consisting of dark nucleons in analogy to Standard Model nuclei. We examine the direct detection phenomenology of a particular class of nuclear dark matter model at the current generation of tonne-scale liquid noble experiments, in particular DEAP-3600 and XENON1T. In our chosen nuclear dark matter scenario distinctive features arise in the recoil energy spectra due to the non-point-like nature of the composite dark matter state. We calculate the number of events required to distinguish these spectra from those of a standard point-like WIMP state with a decaying exponential recoil spectrum. In the most favourable regions of nuclear dark matter parameter space, we find that a few tens of events are needed to distinguish nuclear dark matter from WIMPs at the $3\,σ$ level in a single experiment. Given the total exposure time of DEAP-3600 and XENON1T we find that at best a $2\,σ$ distinction is possible by these experiments individually, while $3\,σ$ sensitivity is reached for a range of parameters by the combination of the two experiments. We show that future upgrades of these experiments have potential to distinguish a large range of nuclear dark matter models from that of a WIMP at greater than $3\,σ$.
△ Less
Submitted 6 October, 2016;
originally announced October 2016.
-
Geomorphologic Map** of Titan's Polar Terrains: Constraining Surface Processes and Landscape Evolution
Authors:
Samuel Birch,
Alexander Hayes,
William Dietrich,
Alan Howard,
Charlie Bristow,
Michael Malaska,
Jeff Moore,
Marco Mastrogiuseppe,
Jason Hofgartner,
David Williams,
Oliver White,
Jason Soderblom,
Jason Barnes,
Elizabeth Turtle,
Jonathan Lunine,
Charles Wood,
Catherine Neish,
Randy Kirk,
Ellen Stofan,
Ralph Lorenz,
Rosaly Lopes
Abstract:
We present a geomorphologic map of Titan's polar terrains. The map was generated from a combination of Cassini Synthetic Aperture Radar (SAR) and Imaging Science Subsystem imaging products, as well as altimetry, SARTopo and radargrammetry topographic datasets. In combining imagery with topographic data, our geomorphologic map reveals a stratigraphic sequence from which we infer process interaction…
▽ More
We present a geomorphologic map of Titan's polar terrains. The map was generated from a combination of Cassini Synthetic Aperture Radar (SAR) and Imaging Science Subsystem imaging products, as well as altimetry, SARTopo and radargrammetry topographic datasets. In combining imagery with topographic data, our geomorphologic map reveals a stratigraphic sequence from which we infer process interactions between units. In map** both polar regions with the same geomorphologic units, we conclude that processes that formed the terrains of the north polar region also acted to form the landscape we observe at the south. Uniform, SAR-dark plains are interpreted as sedimentary deposits, and are bounded by moderately dissected uplands. These plains contain the highest density of filled and empty lake depressions, and canyons. These units unconformably overlay a basement rock that outcrops as mountains and SAR-bright dissected terrains at various elevations across both poles. All these units are then superposed by surficial units that slope towards the seas, suggestive of subsequent overland transport of sediment. From estimates of the depths of the embedded empty depressions and canyons that drain into the seas, the SAR-dark plains must be >600 m thick in places, though the thickness may vary across the poles. At the lowest elevations of each polar region, there are large seas, which are currently liquid methane/ethane filled at the north and empty at the south. The large plains deposits and the surrounding hillslopes may represent remnant landforms that are a result of previously vast polar oceans, where larger liquid bodies may have allowed for a sustained accumulation of soluble and insoluble sediments, potentially forming layered sedimentary deposits. Coupled with vertical crustal movements, the resulting layers would be of varying solubilities and erosional resistances.
△ Less
Submitted 3 August, 2016;
originally announced August 2016.
-
Simplified Models for Dark Matter Searches at the LHC
Authors:
Jalal Abdallah,
Henrique Araujo,
Alexandre Arbey,
Adi Ashkenazi,
Alexander Belyaev,
Joshua Berger,
Celine Boehm,
Antonio Boveia,
Amelia Brennan,
Jim Brooke,
Oliver Buchmueller,
Matthew Buckley,
Giorgio Busoni,
Lorenzo Calibbi,
Sushil Chauhan,
Nadir Daci,
Gavin Davies,
Isabelle De Bruyn,
Paul De Jong,
Albert De Roeck,
Kees de Vries,
Daniele Del Re,
Andrea De Simone,
Andrea Di Simone,
Caterina Doglioni
, et al. (72 additional authors not shown)
Abstract:
This document outlines a set of simplified models for dark matter and its interactions with Standard Model particles. It is intended to summarize the main characteristics that these simplified models have when applied to dark matter searches at the LHC, and to provide a number of useful expressions for reference. The list of models includes both s-channel and t-channel scenarios. For s-channel, sp…
▽ More
This document outlines a set of simplified models for dark matter and its interactions with Standard Model particles. It is intended to summarize the main characteristics that these simplified models have when applied to dark matter searches at the LHC, and to provide a number of useful expressions for reference. The list of models includes both s-channel and t-channel scenarios. For s-channel, spin-0 and spin-1 mediation is discussed, and also realizations where the Higgs particle provides a portal between the dark and visible sectors. The guiding principles underpinning the proposed simplified models are spelled out, and some suggestions for implementation are presented.
△ Less
Submitted 23 March, 2016; v1 submitted 9 June, 2015;
originally announced June 2015.
-
Dark Matter with Topological Defects in the Inert Doublet Model
Authors:
Mark Hindmarsh,
Russell Kirk,
Jose Miguel No,
Stephen M. West
Abstract:
We examine the production of dark matter by decaying topological defects in the high mass region $m_{\mathrm{DM}} \gg m_W$ of the Inert Doublet Model, extended with an extra U(1) gauge symmetry. The density of dark matter states (the neutral Higgs states of the inert doublet) is determined by the interplay of the freeze-out mechanism and the additional production of dark matter states from the dec…
▽ More
We examine the production of dark matter by decaying topological defects in the high mass region $m_{\mathrm{DM}} \gg m_W$ of the Inert Doublet Model, extended with an extra U(1) gauge symmetry. The density of dark matter states (the neutral Higgs states of the inert doublet) is determined by the interplay of the freeze-out mechanism and the additional production of dark matter states from the decays of topological defects, in this case cosmic strings. These decays increase the predicted relic abundance compared to the standard freeze-out only case, and as a consequence the viable parameter space of the Inert Doublet Model can be widened substantially. In particular, for a given dark matter annihilation rate lower dark matter masses become viable. We investigate the allowed mass range taking into account constraints on the energy injection rate from the diffuse $γ$-ray background and Big Bang Nucleosynthesis, together with constraints on the dark matter properties coming from direct and indirect detection limits. For the Inert Doublet Model high-mass region, an inert Higgs mass as low as $\sim 200$ GeV is permitted. There is also an upper limit on string mass per unit length, and hence the symmetry breaking scale, from the relic abundance in this scenario. Depending on assumptions made about the string decays, the limits are in the range $10^{12}$ GeV to $10^{13}$ GeV.
△ Less
Submitted 29 July, 2015; v1 submitted 15 December, 2014;
originally announced December 2014.
-
Dark Matter from Decaying Topological Defects
Authors:
Mark Hindmarsh,
Russell Kirk,
Stephen M. West
Abstract:
We study dark matter production by decaying topological defects, in particular cosmic strings. In topological defect or "top-down" (TD) scenarios, the dark matter injection rate varies as a power law with time with exponent $p-4$. We find a formula in closed form for the yield for all $p < 3/2$, which accurately reproduces the solution of the Boltzmann equation. We investigate two scenarios (…
▽ More
We study dark matter production by decaying topological defects, in particular cosmic strings. In topological defect or "top-down" (TD) scenarios, the dark matter injection rate varies as a power law with time with exponent $p-4$. We find a formula in closed form for the yield for all $p < 3/2$, which accurately reproduces the solution of the Boltzmann equation. We investigate two scenarios ($p=1$, $p=7/6$) motivated by cosmic strings which decay into TeV-scale states with a high branching fraction into dark matter particles. For dark matter models annihilating either by s-wave or p-wave, we find the regions of parameter space where the TD model can account for the dark matter relic density as measured by Planck. We find that topological defects can be the principal source of dark matter, even when the standard freeze-out calculation under-predicts the relic density and hence can lead to potentially large "boost factor" enhancements in the dark matter annihilation rate. We examine dark matter model-independent limits on this scenario arising from unitarity and discuss example model-dependent limits coming from indirect dark matter search experiments. In the four cases studied, the upper bound on $Gμ$ for strings with an appreciable channel into TeV-scale states is significantly more stringent than the current Cosmic Microwave Background limits.
△ Less
Submitted 13 February, 2014; v1 submitted 7 November, 2013;
originally announced November 2013.
-
Continuity properties of vectors realizing points in the classical field of values
Authors:
Dan Corey,
Charles R. Johnson,
Ryan Kirk,
Brian Lins,
Ilya Spitkovsky
Abstract:
For an $n$-by-$n$ matrix $A$, let $f_A$ be its "field of values generating function" defined as $f_A\colon x\mapsto x^*Ax$. We consider two natural versions of the continuity, which we call strong and weak, of $f_A^{-1}$ (which is of course multi-valued) on the field of values $F(A)$. The strong continuity holds, in particular, on the interior of $F(A)$, and at such points $z \in \partial F(A)$ wh…
▽ More
For an $n$-by-$n$ matrix $A$, let $f_A$ be its "field of values generating function" defined as $f_A\colon x\mapsto x^*Ax$. We consider two natural versions of the continuity, which we call strong and weak, of $f_A^{-1}$ (which is of course multi-valued) on the field of values $F(A)$. The strong continuity holds, in particular, on the interior of $F(A)$, and at such points $z \in \partial F(A)$ which are either corner points, belong to the relative interior of flat portions of $\partial F(A)$, or whose preimage under $f_A$ is contained in a one-dimensional set. Consequently, $f_A^{-1}$ is continuous in this sense on the whole $F(A)$ for all normal, 2-by-2, and unitarily irreducible 3-by-3 matrices. Nevertheless, we show by example that the strong continuity of $f_A^{-1}$ fails at certain points of $\partial F(A)$ for some (unitarily reducible) 3-by-3 and (unitarily irreducible) 4-by-4 matrices. The weak continuity, in its turn, fails for some unitarily reducible 4-by-4 and untiarily irreducible 6-by-6 matrices.
△ Less
Submitted 18 July, 2013;
originally announced July 2013.
-
Tiling for Performance Tuning on Different Models of GPUs
Authors:
Chang Xu,
Steven R. Kirk,
Samantha Jenkins
Abstract:
The strategy of using CUDA-compatible GPUs as a parallel computation solution to improve the performance of programs has been more and more widely approved during the last two years since the CUDA platform was released. Its benefit extends from the graphic domain to many other computationally intensive domains. Tiling, as the most general and important technique, is widely used for optimization…
▽ More
The strategy of using CUDA-compatible GPUs as a parallel computation solution to improve the performance of programs has been more and more widely approved during the last two years since the CUDA platform was released. Its benefit extends from the graphic domain to many other computationally intensive domains. Tiling, as the most general and important technique, is widely used for optimization in CUDA programs. New models of GPUs with better compute capabilities have, however, been released, new versions of CUDA SDKs were also released. These updated compute capabilities must to be considered when optimizing using the tiling technique. In this paper, we implement image interpolation algorithms as a test case to discuss how different tiling strategies affect the program's performance. We especially focus on how the different models of GPUs affect the tiling's effectiveness by executing the same program on two different models of GPUs equipped testing platforms. The results demonstrate that an optimized tiling strategy on one GPU model is not always a good solution when execute on other GPU models, especially when some external conditions were changed.
△ Less
Submitted 11 January, 2010;
originally announced January 2010.
-
Molecular dynamics simulation of nanocolloidal amorphous silica particles: Part II
Authors:
S. Jenkins,
S. R. Kirk,
M. Persson,
J. Carlen,
Z. Abbas
Abstract:
Explicit molecular dynamics simulations were applied to a pair of amorphous silica nanoparticles of diameter 3.2 nm immersed in a background electrolyte. Mean forces acting between the pair of silica nanoparticles were extracted at four different background electrolyte concentrations. Dependence of the inter-particle potential of mean force on the separation and the silicon to sodium ratio, as w…
▽ More
Explicit molecular dynamics simulations were applied to a pair of amorphous silica nanoparticles of diameter 3.2 nm immersed in a background electrolyte. Mean forces acting between the pair of silica nanoparticles were extracted at four different background electrolyte concentrations. Dependence of the inter-particle potential of mean force on the separation and the silicon to sodium ratio, as well as on the background electrolyte concentration, are demonstrated. The pH was indirectly accounted for via the ratio of silicon to sodium used in the simulations. The nature of the interaction of the counter-ions with charged silica surface sites (deprotonated silanols) was also investigated. The effect of the sodium double layer on the water ordering was investigated for three Si:Na+ ratios. The number of water molecules trapped inside the nanoparticles was investigated as the Si:Na+ ratio was varied. Differences in this number between the two nanoparticles in the simulations are attributed to differences in the calculated electric dipole moment. The implications of the form of the potentials for aggregation are also discussed.
△ Less
Submitted 11 September, 2007; v1 submitted 19 August, 2007;
originally announced August 2007.
-
Molecular dynamics simulation of nanocolloidal amorphous silica particles: Part I
Authors:
S. Jenkins,
S. R. Kirk,
M. Persson,
J. Carlen,
Z. Abbas
Abstract:
Explicit molecular dynamics simulations were applied to a pair of amorphous silica nanoparticles in aqueous solution, of diameter 4.4 nm with four different background electrolyte concentrations, to extract the mean force acting between the pair of silica nanoparticles. Dependences of the interparticle forces with separation and the background electrolyte concentration were demonstrated. The nat…
▽ More
Explicit molecular dynamics simulations were applied to a pair of amorphous silica nanoparticles in aqueous solution, of diameter 4.4 nm with four different background electrolyte concentrations, to extract the mean force acting between the pair of silica nanoparticles. Dependences of the interparticle forces with separation and the background electrolyte concentration were demonstrated. The nature of the interaction of the counter-ions with charged silica surface sites (deprotonated silanols) was investigated. A 'patchy' double layer of adsorbed sodium counter-ions. was observed. Dependences of the interparticle potential of mean force with separation and the background electrolyte concentration were demonstrated. Direct evidence of the solvation forces is presented in terms of changes of the water ordering at the surfaces of the isolated and double nanoparticles. The nature of the interaction of the counter-ions with charged silica surface sites (deprotonated silanols) was investigated in terms of quantifying the effects of the number of water molecules separately inside each of the pair of nanoparticles by defining an impermeability measure. A direct correlation was found between impermeability (related to the silica surface 'hairiness') and the disruption of water ordering. Differences in the impermeability between the two nanoparticles are attributed to differences in the calculated electric dipole moment.
△ Less
Submitted 16 September, 2007; v1 submitted 19 August, 2007;
originally announced August 2007.
-
Identification of phases in scale-free networks
Authors:
Samantha Jenkins,
Steven R. Kirk
Abstract:
There is a pressing need for a description of complex systems that includes considerations of the underlying network of interactions, for a diverse range of biological, technological and other networks. In this work relationships between second-order phase transitions and the power laws associated with scale-free networks are directly quantified. A unique unbiased partitioning of complex network…
▽ More
There is a pressing need for a description of complex systems that includes considerations of the underlying network of interactions, for a diverse range of biological, technological and other networks. In this work relationships between second-order phase transitions and the power laws associated with scale-free networks are directly quantified. A unique unbiased partitioning of complex networks (exemplified in this work by software architectures) into high- and low-connectivity regions can be made. Other applications to finance and aerogels are outlined.
△ Less
Submitted 21 October, 2004;
originally announced October 2004.