Search | arXiv e-print repository

doi 10.1613/jair.1.14174

A Survey of Zero-shot Generalisation in Deep Reinforcement Learning

Authors: Robert Kirk, Amy Zhang, Edward Grefenstette, Tim Rocktäschel

Abstract: The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to produce RL algorithms whose policies generalise well to novel unseen situations at deployment time, avoiding overfitting to their training environments. Tackling this is vital if we are to deploy reinforcement learning algorithms in real world scenarios, where the environment will be diverse, dynamic and unpred… ▽ More The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to produce RL algorithms whose policies generalise well to novel unseen situations at deployment time, avoiding overfitting to their training environments. Tackling this is vital if we are to deploy reinforcement learning algorithms in real world scenarios, where the environment will be diverse, dynamic and unpredictable. This survey is an overview of this nascent field. We rely on a unifying formalism and terminology for discussing different ZSG problems, building upon previous works. We go on to categorise existing benchmarks for ZSG, as well as current methods for tackling these problems. Finally, we provide a critical discussion of the current state of the field, including recommendations for future work. Among other conclusions, we argue that taking a purely procedural content generation approach to benchmark design is not conducive to progress in ZSG, we suggest fast online adaptation and tackling RL-specific problems as some areas for future work on methods for ZSG, and we recommend building benchmarks in underexplored problem settings such as offline RL ZSG and reward-function variation. △ Less

Submitted 19 January, 2023; v1 submitted 18 November, 2021; originally announced November 2021.

Comments: JAIR version. Added formal definitions of ZSPT and related concepts, JAIR formatting, other small rewrites; https://www.jair.org/index.php/jair/article/view/14174

Journal ref: Journal of Artificial Intelligence Research (JAIR), 76:201-264, 2023

arXiv:2109.13202 [pdf, other]

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Authors: Mikayel Samvelyan, Robert Kirk, Vitaly Kurin, Jack Parker-Holder, Minqi Jiang, Eric Hambro, Fabio Petroni, Heinrich Küttler, Edward Grefenstette, Tim Rocktäschel

Abstract: Progress in deep reinforcement learning (RL) is heavily driven by the availability of challenging benchmarks used for training agents. However, benchmarks that are widely adopted by the community are not explicitly designed for evaluating specific capabilities of RL methods. While there exist environments for assessing particular open problems in RL (such as exploration, transfer learning, unsuper… ▽ More Progress in deep reinforcement learning (RL) is heavily driven by the availability of challenging benchmarks used for training agents. However, benchmarks that are widely adopted by the community are not explicitly designed for evaluating specific capabilities of RL methods. While there exist environments for assessing particular open problems in RL (such as exploration, transfer learning, unsupervised environment design, or even language-assisted RL), it is generally difficult to extend these to richer, more complex environments once research goes beyond proof-of-concept results. We present MiniHack, a powerful sandbox framework for easily designing novel RL environments. MiniHack is a one-stop shop for RL experiments with environments ranging from small rooms to complex, procedurally generated worlds. By leveraging the full set of entities and environment dynamics from NetHack, one of the richest grid-based video games, MiniHack allows designing custom RL testbeds that are fast and convenient to use. With this sandbox framework, novel environments can be designed easily, either using a human-readable description language or a simple Python interface. In addition to a variety of RL tasks and baselines, MiniHack can wrap existing RL benchmarks and provide ways to seamlessly add additional complexity. △ Less

Submitted 16 November, 2021; v1 submitted 27 September, 2021; originally announced September 2021.

Comments: NeurIPS 2021: Datasets and Benchmarks Track

arXiv:2109.08270 [pdf]

Language Models as a Knowledge Source for Cognitive Agents

Authors: Robert E. Wray, III, James R. Kirk, John E. Laird

Abstract: Language models (LMs) are sentence-completion engines trained on massive corpora. LMs have emerged as a significant breakthrough in natural-language processing, providing capabilities that go far beyond sentence completion including question answering, summarization, and natural-language inference. While many of these capabilities have potential application to cognitive systems, exploiting languag… ▽ More Language models (LMs) are sentence-completion engines trained on massive corpora. LMs have emerged as a significant breakthrough in natural-language processing, providing capabilities that go far beyond sentence completion including question answering, summarization, and natural-language inference. While many of these capabilities have potential application to cognitive systems, exploiting language models as a source of task knowledge, especially for task learning, offers significant, near-term benefits. We introduce language models and the various tasks to which they have been applied and then review methods of knowledge extraction from language models. The resulting analysis outlines both the challenges and opportunities for using language models as a new knowledge source for cognitive systems. It also identifies possible ways to improve knowledge extraction from language models using the capabilities provided by cognitive systems. Central to success will be the ability of a cognitive agent to itself learn an abstract model of the knowledge implicit in the LM as well as methods to extract high-quality knowledge effectively and efficiently. To illustrate, we introduce a hypothetical robot agent and describe how language models could extend its task knowledge and improve its performance and the kinds of knowledge and methods the agent can use to exploit the knowledge within a language model. △ Less

Submitted 23 October, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

Comments: 16 pages, 2 figures; accepted for 2021 Advances in Cognitive Systems Conference (revised based on reviews)

ACM Class: I.2.7; I.2.11

arXiv:2108.08680 [pdf, other]

Piecewise circular curves and positivity

Authors: Jean-Philippe Burelle, Ryan Kirk

Abstract: We introduce the moduli space of generic piecewise circular $n$-gons in the Riemann sphere and relate it to a moduli space of Legendrian polygons. We prove that when $n=2k$, this moduli space contains a connected component homeomorphic to the Fock-Goncharov space of $k$-tuples of positive flags for $\mathsf{PSp}(4,\mathbb{R})$ and hence is a topological ball. We characterize this component geometr… ▽ More We introduce the moduli space of generic piecewise circular $n$-gons in the Riemann sphere and relate it to a moduli space of Legendrian polygons. We prove that when $n=2k$, this moduli space contains a connected component homeomorphic to the Fock-Goncharov space of $k$-tuples of positive flags for $\mathsf{PSp}(4,\mathbb{R})$ and hence is a topological ball. We characterize this component geometrically as the space of simple piecewise circular curves with decreasing curvature. △ Less

Submitted 19 August, 2021; originally announced August 2021.

Comments: 28 pages, 8 figures

MSC Class: 51M99; 53D10; 22F30

arXiv:2108.05921 [pdf, other]

Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate

Authors: Hannah Rose Kirk, Bertram Vidgen, Paul Röttger, Tristan Thrush, Scott A. Hale

Abstract: Detecting online hate is a complex task, and low-performing models have harmful consequences when used for sensitive applications such as content moderation. Emoji-based hate is an emerging challenge for automated detection. We present HatemojiCheck, a test suite of 3,930 short-form statements that allows us to evaluate performance on hateful language expressed with emoji. Using the test suite, we… ▽ More Detecting online hate is a complex task, and low-performing models have harmful consequences when used for sensitive applications such as content moderation. Emoji-based hate is an emerging challenge for automated detection. We present HatemojiCheck, a test suite of 3,930 short-form statements that allows us to evaluate performance on hateful language expressed with emoji. Using the test suite, we expose weaknesses in existing hate detection models. To address these weaknesses, we create the HatemojiBuild dataset using a human-and-model-in-the-loop approach. Models built with these 5,912 adversarial examples perform substantially better at detecting emoji-based hate, while retaining strong performance on text-only hate. Both HatemojiCheck and HatemojiBuild are made publicly available. See our Github Repository (https://github.com/HannahKirk/Hatemoji). HatemojiCheck, HatemojiBuild, and the final Hatemoji Model are also available on HuggingFace (https://huggingface.co/datasets/HannahRoseKirk/). △ Less

Submitted 6 May, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

Journal ref: 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2022)

arXiv:2107.04313 [pdf, other]

Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset

Authors: Hannah Rose Kirk, Yennie Jun, Paulius Rauba, Gal Wachtel, Ruining Li, Xingjian Bai, Noah Broestl, Martin Doff-Sotta, Aleksandar Shtedritski, Yuki M. Asano

Abstract: Hateful memes pose a unique challenge for current machine learning systems because their message is derived from both text- and visual-modalities. To this effect, Facebook released the Hateful Memes Challenge, a dataset of memes with pre-extracted text captions, but it is unclear whether these synthetic examples generalize to `memes in the wild'. In this paper, we collect hateful and non-hateful m… ▽ More Hateful memes pose a unique challenge for current machine learning systems because their message is derived from both text- and visual-modalities. To this effect, Facebook released the Hateful Memes Challenge, a dataset of memes with pre-extracted text captions, but it is unclear whether these synthetic examples generalize to `memes in the wild'. In this paper, we collect hateful and non-hateful memes from Pinterest to evaluate out-of-sample performance on models pre-trained on the Facebook dataset. We find that memes in the wild differ in two key aspects: 1) Captions must be extracted via OCR, injecting noise and diminishing performance of multimodal models, and 2) Memes are more diverse than `traditional memes', including screenshots of conversations or text on a plain background. This paper thus serves as a reality check for the current benchmark of hateful meme detection and its applicability for detecting real world hate. △ Less

Submitted 9 July, 2021; originally announced July 2021.

Comments: Accepted paper at ACL WOAH 2021

arXiv:2012.13308 [pdf, other]

doi 10.1007/JHEP05(2021)077

The 300 "Correlators" Suggests 4D, $\cal N$ = 1 SUSY Is a Solution to a Set of Sudoku Puzzles

Authors: Aleksander J. Cianciara, S. James Gates Jr, Yangrui Hu, Renee Kirk

Abstract: A conjecture is made that the weight space for 4D, $\cal N$-extended supersymmetrical representations is embedded within the permutahedra associated with permutation groups ${\mathbb{S}}{}_{d}$. Adinkras and Coxeter Groups associated with minimal representations of 4D, $\cal N$ = 1 supersymmetry provide evidence supporting this conjecture. It is shown the appearance of the mathematics of 4D,… ▽ More A conjecture is made that the weight space for 4D, $\cal N$-extended supersymmetrical representations is embedded within the permutahedra associated with permutation groups ${\mathbb{S}}{}_{d}$. Adinkras and Coxeter Groups associated with minimal representations of 4D, $\cal N$ = 1 supersymmetry provide evidence supporting this conjecture. It is shown the appearance of the mathematics of 4D, $\cal N$ = 1 minimal off-shell supersymmetry representations is equivalent to solving a four color problem on the truncated octahedron. This observation suggest an entirely new way to approach the off-shell SUSY auxiliary field problem based on IT algorithms probing the properties of ${\mathbb{S}}{}_{d}$. △ Less

Submitted 13 April, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

Comments: LaTeX twice, 39 pages, 14 figures, 28 tables (v2 added one table and associated content, v3 note added in proof comment)

arXiv:1902.07289 [pdf]

Accurate Automatic Segmentation of Amygdala Subnuclei and Modeling of Uncertainty via Bayesian Fully Convolutional Neural Network

Authors: Yilin Liu, Gengyan Zhao, Brendon M. Nacewicz, Nagesh Adluru, Gregory R. Kirk, Peter A Ferrazzano, Martin Styner, Andrew L. Alexander

Abstract: Recent advances in deep learning have improved the segmentation accuracy of subcortical brain structures, which would be useful in neuroimaging studies of many neurological disorders. However, most of the previous deep learning work does not investigate the specific difficulties that exist in segmenting extremely small but important brain regions such as the amygdala and its subregions. To tackle… ▽ More Recent advances in deep learning have improved the segmentation accuracy of subcortical brain structures, which would be useful in neuroimaging studies of many neurological disorders. However, most of the previous deep learning work does not investigate the specific difficulties that exist in segmenting extremely small but important brain regions such as the amygdala and its subregions. To tackle this challenging task, a novel 3D Bayesian fully convolutional neural network was developed to apply a dilated dualpathway approach that retains fine details and utilizes both local and more global contextual information to automatically segment the amygdala and its subregions at high precision. The proposed method provides insights on network design and sampling strategy that target segmentations of small 3D structures. In particular, this study confirms that a large context, enabled by a large field of view, is beneficial for segmenting small objects; furthermore, precise contextual information enabled by dilated convolutions allows for better boundary localization, which is critical for examining the morphology of the structure. In addition, it is demonstrated that the uncertainty information estimated from our network may be leveraged to identify atypicality in data. Our method was compared with two state-of-the-art deep learning models and a traditional multi-atlas approach, and exhibited excellent performance as measured both by Dice overlap as well as average symmetric surface distance. To the best of our knowledge, this work is the first deep learning-based approach that targets the subregions of the amygdala. △ Less

Submitted 19 February, 2019; originally announced February 2019.

arXiv:1810.13157 [pdf]

doi 10.1016/j.cplett.2019.01.016

Next-Generation Quantum Theory of Atoms in Molecules for the Ground and Excited State of the Ring-Opening of Cyclohexadiene (CHD)

Authors: Tian Tian, Tianlv Xu, Steven R. Kirk, Michael Filatov, Samantha Jenkins

Abstract: The factors underlying the experimentally observed branching ratio (70:30) of the (1,3-cyclohexadiene) CHD$\rightarrow$HT (1,3,5-hexatriene) photochemical ring-opening reaction are investigated. The ring-opening reaction path is optimized by a high-level multi-reference DFT method and the density along the path is analyzed by the QTAIM and stress tensor methods. The performed density analysis sugg… ▽ More The factors underlying the experimentally observed branching ratio (70:30) of the (1,3-cyclohexadiene) CHD$\rightarrow$HT (1,3,5-hexatriene) photochemical ring-opening reaction are investigated. The ring-opening reaction path is optimized by a high-level multi-reference DFT method and the density along the path is analyzed by the QTAIM and stress tensor methods. The performed density analysis suggests that, in both $S_{1}$ and $S_{0}$ electronic states, there exists an attractive interaction between the ends of the fissile $σ$ -bond of CHD that steers the ring-opening reaction predominantly in the direction of restoration of the ring. It is suggested that opening of the ring and formation of the reaction product (HT) can only be achieved when there is a sufficient persistent nuclear momentum in the direction of stretching of the fissile bond. As this orientation of the nuclear momentum vector can be expected to be relatively rare during the dynamics, this explains the observed low quantum yield of the ring-opening reaction. △ Less

Submitted 31 October, 2018; originally announced October 2018.

Comments: 30 pages, manuscript + supplementary materials

arXiv:1809.06732 [pdf]

Stress Tensor Eigenvector Following with Next-Generation Quantum Theory of Atoms in Molecules

Authors: Jia Hui Li, Wei Jie Huang, Tianlv Xu, Steven R. Kirk, Samantha Jenkins

Abstract: The eigenvectors of the electronic stress tensor have been identified as useful for the prediction of chemical reactivity because they determine the most preferred directions to move the bonds that correspond to a qualitative change in the molecular electronic structure. A new 3-D vector based interpretation of the chemical bond that we refer to as the bond-path framework set… ▽ More The eigenvectors of the electronic stress tensor have been identified as useful for the prediction of chemical reactivity because they determine the most preferred directions to move the bonds that correspond to a qualitative change in the molecular electronic structure. A new 3-D vector based interpretation of the chemical bond that we refer to as the bond-path framework set $\mathbb{B} = \{p,q,r\}$ provides a version of the quantum theory of atoms in molecules (QTAIM) beyond the minimum definition for bonding that is particularly suitable for understanding changes in molecular electronic structure that occur during reactions. The bond-path framework set $\mathbb{B}$ is straightforwardly constructed and visualized from the eigenvalues and eigenvectors of QTAIM. This approach is applied to the structural deformations of ethene that occur during applied torsion $θ$, -180.0° $\leq$ $θ$ $\leq$ +180.0°. The corresponding stress tensor version is readily constructed as $\mathbb{B}_σ = \{p_σ,q_σ,r\}$ within the QTAIM partitioning making it possible to compare experimentally and computationally determined electronic charge densities. The bond-path framework set $\mathbb{B}$ or $\mathbb{B}_σ$ are the networks that comprise three strands: the least preferred ($p, p_σ$), most preferred ($q, q_σ$) and $r$ is the familiar QTAIM bond-path. We demonstrate that the most preferred direction for bond motion using the stress tensor corresponds to the most compressible direction and not to the least compressible direction as previously reported. We show the necessity for a directional approach constructed using the eigenvectors along the entire bond length and demonstrate the insufficiency of the sole use of scalar measures for capturing the nature of the stress tensor within the QTAIM partitioning. △ Less

Submitted 16 September, 2018; originally announced September 2018.

Comments: 44 pages, manuscript + supplementary materials. arXiv admin note: text overlap with arXiv:1804.00776

arXiv:1807.00244 [pdf, other]

Automatic Identification of Twin Zygosity in Resting-State Functional MRI

Authors: Andrey Gritsenko, Martin A. Lindquist, Gregory R. Kirk, Moo K. Chung

Abstract: A key strength of twin studies arises from the fact that there are two types of twins, monozygotic and dizygotic, that share differing amounts of genetic information. Accurate differentiation of twin types allows efficient inference on genetic influences in a population. However, identification of zygosity is often prone to errors without genotying. In this study, we propose a novel pairwise featu… ▽ More A key strength of twin studies arises from the fact that there are two types of twins, monozygotic and dizygotic, that share differing amounts of genetic information. Accurate differentiation of twin types allows efficient inference on genetic influences in a population. However, identification of zygosity is often prone to errors without genotying. In this study, we propose a novel pairwise feature representation to classify the zygosity of twin pairs of resting state functional magnetic resonance images (rs-fMRI). For this, we project an fMRI signal to a set of basis functions and use the projection coefficients as the compact and discriminative feature representation of noisy fMRI. We encode the relationship between twins as the correlation between the new feature representations across brain regions. We employ hill climbing variable selection to identify brain regions that are the most genetically affected. The proposed framework was applied to 208 twin pairs and achieved 94.19% classification accuracy in automatically identifying the zygosity of paired images. △ Less

Submitted 26 October, 2018; v1 submitted 30 June, 2018; originally announced July 2018.

arXiv:1804.01892 [pdf]

doi 10.1016/j.cplett.2018.04.059

A Vector-Based Representation of the Chemical Bond for the Substituted Torsion of Biphenyl

Authors: Jiahui Li, Weijie Huang, Tianlv Xu, Steven R. Kirk, Samantha Jenkins

Abstract: We use a new interpretation of the chemical bond within QTAIM. The bond-path framework set $\mathbb{B} = \{p, q, r\}$ with associated linkages with lengths $\mathbb{H}^*,m\mathbb{H}$ and the familiar bond-path length is used to describe a torsion $θ$, $0.0^{\circ} \leq θ\lt 22.0^{\circ}$ of \emph{para}-substituted biphenyl, $\mathrm{C}_{12}\mathrm{H}_{9-x}$, $x = \mathrm{N}(\mathrm{CH}_3)_2$,… ▽ More We use a new interpretation of the chemical bond within QTAIM. The bond-path framework set $\mathbb{B} = \{p, q, r\}$ with associated linkages with lengths $\mathbb{H}^*,m\mathbb{H}$ and the familiar bond-path length is used to describe a torsion $θ$, $0.0^{\circ} \leq θ\lt 22.0^{\circ}$ of \emph{para}-substituted biphenyl, $\mathrm{C}_{12}\mathrm{H}_{9-x}$, $x = \mathrm{N}(\mathrm{CH}_3)_2$, $\mathrm{NH}_2$, $\mathrm{CH}_3$, CHO, CN, $\mathrm{NO}_{2}$. We include consideration of the H---H bonding interactions and find that the lengths $\mathbb{H} \gt \mathbb{H}^{*}$ that we explain in terms of the most and least preferred directions of charge density accumulation. We also consider the fractional eigenvector-following path lengths $\mathbb{H}_f$ and $\mathbb{H}_{f{θ_{\rm min}}}$. △ Less

Submitted 4 April, 2018; originally announced April 2018.

Comments: manuscript+supplementary materials. arXiv admin note: text overlap with arXiv:1804.00776, arXiv:1804.00780, arXiv:1804.01525

arXiv:1804.01525 [pdf]

A Vector-Based Representation of the Chemical Bond for the Normal Modes of Benzene

Authors: Wei Jie Huang, Alireza Azizi, Tianlv Xu, Steven R. Kirk, Samantha Jenkins

Abstract: We introduce a vector-based interpretation of the chemical bond within the quantum theory of atoms in molecules (QTAIM), the bond-path framework set $\mathbb{B} = \{p, q, r\}$, to follow variations in the 3-D morphology of all bonds for the four infra-red (IR) active normal modes of benzene. The bond-path framework set comprises three unique paths $p$, $q$ and $r$ where $r$ is the familiar QTAIM b… ▽ More We introduce a vector-based interpretation of the chemical bond within the quantum theory of atoms in molecules (QTAIM), the bond-path framework set $\mathbb{B} = \{p, q, r\}$, to follow variations in the 3-D morphology of all bonds for the four infra-red (IR) active normal modes of benzene. The bond-path framework set comprises three unique paths $p$, $q$ and $r$ where $r$ is the familiar QTAIM bond concept of bond-path ($r$) while the two new paths $p$ and $q$ are formulated from the least and most preferred directions of electron density accumulation respectively. We find 3-D distortions including bond stretching/compression, torsion and curving. We introduce two fractional measures to quantify these variations away from linearity of the bond. △ Less

Submitted 7 May, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

Comments: Manuscript. Supplementary materials available on https://www.beaconresearch.org/ .arXiv admin note: text overlap with arXiv:1804.00776. v2. minor correction

arXiv:1804.00784 [pdf]

Quinone-based Switches for Candidate Building Blocks of Molecular Junctions with QTAIM and the Stress Tensor

Authors: Tianlv Xu, Lingling Wang, Yang **, Tanja van Mourik, Herbert Früchtl, Steven R. Kirk, Samantha Jenkins

Abstract: The current work investigates candidate building blocks based on molecular junctions from hydrogen transfer tautomerization in the benzoquinone-like core of an azophenine molecule with QTAIM and the recently-introduced stress tensor trajectory analysis. We find that in particular the stress tensor trajectories are well suited to describe the mechanism of the switching process. The effects of an Fe… ▽ More The current work investigates candidate building blocks based on molecular junctions from hydrogen transfer tautomerization in the benzoquinone-like core of an azophenine molecule with QTAIM and the recently-introduced stress tensor trajectory analysis. We find that in particular the stress tensor trajectories are well suited to describe the mechanism of the switching process. The effects of an Fe-dopant atom coordinated to the quinone ring, as well as F and Cl substitution of different ring-hydrogens, are investigated and the new QTAIM and stress tensor analysis is used to draw conclusions on the effectiveness of such molecules as molecular switches in nano-sized electronic circuits. We find that the coordinated Fe-dopant greatly improves the switching properties, both in terms of the tautomerisation barrier that has to be crossed in the switching process and the expected conductance behavior, while the effects of hydrogen substitution are more subtle. The absence of the Fe-dopant atom led to impaired functioning of the switch 'OFF' mechanism as well as coinciding with the formation of closed-shell H---H bond critical points that indicated a strained or electron deficient environment. Our analysis demonstrates promise for future use in design of molecular electronic devices. △ Less

Submitted 2 April, 2018; originally announced April 2018.

Comments: manuscript. Supplementary materials available for download on https://www.beaconresearch.org

arXiv:1804.00780 [pdf]

Predicting Competitive and Non-Competitive Torquoselectivity in Ring-Opening Reactions using QTAIM and the Stress Tensor

Authors: Alireza Azizi, Roya Momen, Alejandro Morales-Bayuelo, Tianlv Xu, Steven R. Kirk, Samantha Jenkins

Abstract: We present a new vector-based representation of the chemical bond referred to as the bond-path frame-work set $\mathbb{B} = {p, q, r}$, where $p$, $q$ and $r$ represent three paths with corresponding eigenvector-following path lengths $\mathbb{H}^{*},\mathbb{H}$ and the bond-path length from the quantum theory of atoms in molecules (QTAIM). We find that longer path lengths $\mathbb{H}$ of the ring… ▽ More We present a new vector-based representation of the chemical bond referred to as the bond-path frame-work set $\mathbb{B} = {p, q, r}$, where $p$, $q$ and $r$ represent three paths with corresponding eigenvector-following path lengths $\mathbb{H}^{*},\mathbb{H}$ and the bond-path length from the quantum theory of atoms in molecules (QTAIM). We find that longer path lengths $\mathbb{H}$ of the ring-opening bonds predict the preference for the transition state inward (\textbf{TSIC}) or transition state outward (\textbf{TSOC}) ring opening reactions in agreement with experiment for all five reactions \textbf{R1-R5}. Competitiveness and non-competitiveness have traditionally been considered using activation energies. The activation energy however, for \textbf{R3} does not satisfactorily determine competitiveness or provide consistent agreement with experimental yields. We choose a selection of five competitive and non-competitive reactions; methyl-cyclobutene (\textbf{R1}), ethyl-methyl-cyclobutene (\textbf{R2}), iso-propyl-methyl-cyclobutene (\textbf{R3}), ter-butyl-methyl-cyclobutene (\textbf{R4}) and phenyl-methyl-cyclobutene (R5). Therefore, in this investigation we provide a new criterion, within the QTAIM framework, to determine whether the reactions \textbf{R1-R5} are competitive or non-competitive. We that find \textbf{R2}, \textbf{R3} and \textbf{R5} are competitive and \textbf{R1} and \textbf{R4} are non-competitive reactions in contrast to the results from the activation energies, calling into question the reliability of activation energies. △ Less

Submitted 2 April, 2018; originally announced April 2018.

Comments: manuscript and supplementary materials. arXiv admin note: text overlap with arXiv:1804.00776

arXiv:1804.00776 [pdf]

Next-Generation Quantum Theory of Atoms in Molecules for the Ground and Excited States of Fulvene

Authors: Wei Jie Huang, Roya Momen, Alireza Azizi, Tianlv Xu, Steven R. Kirk, Michael Filatov, Samantha Jenkins

Abstract: A vector-based representation of the chemical bond is introduced that we refer to as the bond-path frame-work set = $\mathbb{B} = \{p, q, r\}$, where $p$, $q$ and $r$ represent three paths with corresponding eigenvector-following path lengths $\mathbb{H}^{*},\mathbb{H}$ and the familiar quantum theory of atoms in molecules (QTAIM) bond-path length. The eigenvector-following path lengths… ▽ More A vector-based representation of the chemical bond is introduced that we refer to as the bond-path frame-work set = $\mathbb{B} = \{p, q, r\}$, where $p$, $q$ and $r$ represent three paths with corresponding eigenvector-following path lengths $\mathbb{H}^{*},\mathbb{H}$ and the familiar quantum theory of atoms in molecules (QTAIM) bond-path length. The eigenvector-following path lengths $\mathbb{H}^{*}$ and $\mathbb{H}$ are constructed along the bond-path from the $\underline{\mathbf{\mathit{e}}}_{1}$ and $\underline{\mathbf{\mathit{e}}}_{2}$ Hessian eigenvectors respectively, which correspond to the least and most preferred directions of charge density accumulation. In particular, the paths $p$ and $q$ provide a vector representation of the scalar QTAIM ellipticity ε. The bond-path frame-work set $\mathbb{B}$ is applied to the excited state deactivation of fulvene that involves distortions along various intramolecular degrees of freedom, such as the bond stretching/compression of bond length alternation (BLA) and bond torsion distortions. We find that the $\mathbb{H}^{*}$ and $\mathbb{H}$ lengths can differentiate between the ground and excited electronic states, in contrast to the QTAIM bond-path length. In particular, the eigenvector-following path lengths $\mathbb{H}^{*}$ and $\mathbb{H}$ are found to be shorter for the excited state than the ground state for both the BLA and bond torsion distortions indicating that distortions resulting in lower $\mathbb{H}^{*}$ and $\mathbb{H}$ values are easier to perform. △ Less

Submitted 6 May, 2018; v1 submitted 2 April, 2018; originally announced April 2018.

Comments: Manuscript and supplementary materials Title change, some corrections, new figure added

arXiv:1706.05658 [pdf]

doi 10.1002/qua.25456

The Role of Weak Interactions in Characterizing Peptide Folding Preferences using a QTAIM Interpretation of the Ramachandran Plot (φ-ψ)

Authors: Roya Momen, Alireza Azizi, Lingling Wang, Yang **, Tianlv Xu, Steven R. Kirk, Wenxuan Li, Sergei Manzhos, Samantha Jenkins

Abstract: The Ramachandran plot is a potent way to understand structures of biomolecules, however, the original formulation of the Ramachandran plot only considers backbone conformations. We formulate a new interpretation of the original Ramachandran plot ($φ-ψ$) that can include a description of the weaker interactions including both the hydrogen bonds and H$---$H bonds as a new way to derive insights into… ▽ More The Ramachandran plot is a potent way to understand structures of biomolecules, however, the original formulation of the Ramachandran plot only considers backbone conformations. We formulate a new interpretation of the original Ramachandran plot ($φ-ψ$) that can include a description of the weaker interactions including both the hydrogen bonds and H$---$H bonds as a new way to derive insights into the phenomenon of peptide folding. We use QTAIM (quantum theory of atoms in molecules) to interpret the Ramachandran plot. Specifically, we show that QTAIM analysis permits identifying key regions of the Ramachandran plot without the need for massive data sets. A highly non-linear relationship is found between the QTAIM vector-derived interpreted Ramachandran plot and the conventional Ramachandran plot ($φ-ψ$) demonstrating that this new approach is not a trivial coordinate transformation. An investigation of both the backbone and the weaker bonds within the framework of the QTAIM interpreted Ramachandran plot was found to be in line with physical intuition. The least-preferred directions calculated for the hydrogen bonds and H$---$H bonds were found to coincide with the 'unlikely' regions of the Ramachandran plot. △ Less

Submitted 18 June, 2017; originally announced June 2017.

Comments: Submitted to IJQC

Journal ref: International Journal of Quantum Chemistry, Volume118, Issue2, e25456 (2018)

arXiv:1610.01840 [pdf, other]

doi 10.1088/1475-7516/2017/10/035

Can Tonne-Scale Direct Detection Experiments Discover Nuclear Dark Matter?

Authors: A. Butcher, R. Kirk, J. Monroe, S. M. West

Abstract: Models of nuclear dark matter propose that the dark sector contains large composite states consisting of dark nucleons in analogy to Standard Model nuclei. We examine the direct detection phenomenology of a particular class of nuclear dark matter model at the current generation of tonne-scale liquid noble experiments, in particular DEAP-3600 and XENON1T. In our chosen nuclear dark matter scenario… ▽ More Models of nuclear dark matter propose that the dark sector contains large composite states consisting of dark nucleons in analogy to Standard Model nuclei. We examine the direct detection phenomenology of a particular class of nuclear dark matter model at the current generation of tonne-scale liquid noble experiments, in particular DEAP-3600 and XENON1T. In our chosen nuclear dark matter scenario distinctive features arise in the recoil energy spectra due to the non-point-like nature of the composite dark matter state. We calculate the number of events required to distinguish these spectra from those of a standard point-like WIMP state with a decaying exponential recoil spectrum. In the most favourable regions of nuclear dark matter parameter space, we find that a few tens of events are needed to distinguish nuclear dark matter from WIMPs at the $3\,σ$ level in a single experiment. Given the total exposure time of DEAP-3600 and XENON1T we find that at best a $2\,σ$ distinction is possible by these experiments individually, while $3\,σ$ sensitivity is reached for a range of parameters by the combination of the two experiments. We show that future upgrades of these experiments have potential to distinguish a large range of nuclear dark matter models from that of a WIMP at greater than $3\,σ$. △ Less

Submitted 6 October, 2016; originally announced October 2016.

Comments: 23 pages, 7 multipanel figures

arXiv:1608.01340 [pdf]

doi 10.1016/j.icarus.2016.08.003

Geomorphologic Map** of Titan's Polar Terrains: Constraining Surface Processes and Landscape Evolution

Authors: Samuel Birch, Alexander Hayes, William Dietrich, Alan Howard, Charlie Bristow, Michael Malaska, Jeff Moore, Marco Mastrogiuseppe, Jason Hofgartner, David Williams, Oliver White, Jason Soderblom, Jason Barnes, Elizabeth Turtle, Jonathan Lunine, Charles Wood, Catherine Neish, Randy Kirk, Ellen Stofan, Ralph Lorenz, Rosaly Lopes

Abstract: We present a geomorphologic map of Titan's polar terrains. The map was generated from a combination of Cassini Synthetic Aperture Radar (SAR) and Imaging Science Subsystem imaging products, as well as altimetry, SARTopo and radargrammetry topographic datasets. In combining imagery with topographic data, our geomorphologic map reveals a stratigraphic sequence from which we infer process interaction… ▽ More We present a geomorphologic map of Titan's polar terrains. The map was generated from a combination of Cassini Synthetic Aperture Radar (SAR) and Imaging Science Subsystem imaging products, as well as altimetry, SARTopo and radargrammetry topographic datasets. In combining imagery with topographic data, our geomorphologic map reveals a stratigraphic sequence from which we infer process interactions between units. In map** both polar regions with the same geomorphologic units, we conclude that processes that formed the terrains of the north polar region also acted to form the landscape we observe at the south. Uniform, SAR-dark plains are interpreted as sedimentary deposits, and are bounded by moderately dissected uplands. These plains contain the highest density of filled and empty lake depressions, and canyons. These units unconformably overlay a basement rock that outcrops as mountains and SAR-bright dissected terrains at various elevations across both poles. All these units are then superposed by surficial units that slope towards the seas, suggestive of subsequent overland transport of sediment. From estimates of the depths of the embedded empty depressions and canyons that drain into the seas, the SAR-dark plains must be >600 m thick in places, though the thickness may vary across the poles. At the lowest elevations of each polar region, there are large seas, which are currently liquid methane/ethane filled at the north and empty at the south. The large plains deposits and the surrounding hillslopes may represent remnant landforms that are a result of previously vast polar oceans, where larger liquid bodies may have allowed for a sustained accumulation of soluble and insoluble sediments, potentially forming layered sedimentary deposits. Coupled with vertical crustal movements, the resulting layers would be of varying solubilities and erosional resistances. △ Less

Submitted 3 August, 2016; originally announced August 2016.

Comments: 45 pages, 16 figures, 1 table, accepted in Icarus

arXiv:1506.03116 [pdf, other]

doi 10.1016/j.dark.2015.08.001

Simplified Models for Dark Matter Searches at the LHC

Authors: Jalal Abdallah, Henrique Araujo, Alexandre Arbey, Adi Ashkenazi, Alexander Belyaev, Joshua Berger, Celine Boehm, Antonio Boveia, Amelia Brennan, Jim Brooke, Oliver Buchmueller, Matthew Buckley, Giorgio Busoni, Lorenzo Calibbi, Sushil Chauhan, Nadir Daci, Gavin Davies, Isabelle De Bruyn, Paul De Jong, Albert De Roeck, Kees de Vries, Daniele Del Re, Andrea De Simone, Andrea Di Simone, Caterina Doglioni , et al. (72 additional authors not shown)

Abstract: This document outlines a set of simplified models for dark matter and its interactions with Standard Model particles. It is intended to summarize the main characteristics that these simplified models have when applied to dark matter searches at the LHC, and to provide a number of useful expressions for reference. The list of models includes both s-channel and t-channel scenarios. For s-channel, sp… ▽ More This document outlines a set of simplified models for dark matter and its interactions with Standard Model particles. It is intended to summarize the main characteristics that these simplified models have when applied to dark matter searches at the LHC, and to provide a number of useful expressions for reference. The list of models includes both s-channel and t-channel scenarios. For s-channel, spin-0 and spin-1 mediation is discussed, and also realizations where the Higgs particle provides a portal between the dark and visible sectors. The guiding principles underpinning the proposed simplified models are spelled out, and some suggestions for implementation are presented. △ Less

Submitted 23 March, 2016; v1 submitted 9 June, 2015; originally announced June 2015.

Comments: v3: Fixed typo in eqns 14 & 15

Report number: CERN-PH-TH/2015-139, FERMILAB-PUB-15-283-CD

Journal ref: Phys. Dark Univ. 9-10 (2015) 8-23

arXiv:1412.4821 [pdf, other]

doi 10.1088/1475-7516/2015/05/048

Dark Matter with Topological Defects in the Inert Doublet Model

Authors: Mark Hindmarsh, Russell Kirk, Jose Miguel No, Stephen M. West

Abstract: We examine the production of dark matter by decaying topological defects in the high mass region $m_{\mathrm{DM}} \gg m_W$ of the Inert Doublet Model, extended with an extra U(1) gauge symmetry. The density of dark matter states (the neutral Higgs states of the inert doublet) is determined by the interplay of the freeze-out mechanism and the additional production of dark matter states from the dec… ▽ More We examine the production of dark matter by decaying topological defects in the high mass region $m_{\mathrm{DM}} \gg m_W$ of the Inert Doublet Model, extended with an extra U(1) gauge symmetry. The density of dark matter states (the neutral Higgs states of the inert doublet) is determined by the interplay of the freeze-out mechanism and the additional production of dark matter states from the decays of topological defects, in this case cosmic strings. These decays increase the predicted relic abundance compared to the standard freeze-out only case, and as a consequence the viable parameter space of the Inert Doublet Model can be widened substantially. In particular, for a given dark matter annihilation rate lower dark matter masses become viable. We investigate the allowed mass range taking into account constraints on the energy injection rate from the diffuse $γ$-ray background and Big Bang Nucleosynthesis, together with constraints on the dark matter properties coming from direct and indirect detection limits. For the Inert Doublet Model high-mass region, an inert Higgs mass as low as $\sim 200$ GeV is permitted. There is also an upper limit on string mass per unit length, and hence the symmetry breaking scale, from the relic abundance in this scenario. Depending on assumptions made about the string decays, the limits are in the range $10^{12}$ GeV to $10^{13}$ GeV. △ Less

Submitted 29 July, 2015; v1 submitted 15 December, 2014; originally announced December 2014.

Comments: 27 pages, 3 figures. V2: Published version with references added

arXiv:1311.1637 [pdf, other]

doi 10.1088/1475-7516/2014/03/037

Dark Matter from Decaying Topological Defects

Authors: Mark Hindmarsh, Russell Kirk, Stephen M. West

Abstract: We study dark matter production by decaying topological defects, in particular cosmic strings. In topological defect or "top-down" (TD) scenarios, the dark matter injection rate varies as a power law with time with exponent $p-4$. We find a formula in closed form for the yield for all $p < 3/2$, which accurately reproduces the solution of the Boltzmann equation. We investigate two scenarios (… ▽ More We study dark matter production by decaying topological defects, in particular cosmic strings. In topological defect or "top-down" (TD) scenarios, the dark matter injection rate varies as a power law with time with exponent $p-4$. We find a formula in closed form for the yield for all $p < 3/2$, which accurately reproduces the solution of the Boltzmann equation. We investigate two scenarios ($p=1$, $p=7/6$) motivated by cosmic strings which decay into TeV-scale states with a high branching fraction into dark matter particles. For dark matter models annihilating either by s-wave or p-wave, we find the regions of parameter space where the TD model can account for the dark matter relic density as measured by Planck. We find that topological defects can be the principal source of dark matter, even when the standard freeze-out calculation under-predicts the relic density and hence can lead to potentially large "boost factor" enhancements in the dark matter annihilation rate. We examine dark matter model-independent limits on this scenario arising from unitarity and discuss example model-dependent limits coming from indirect dark matter search experiments. In the four cases studied, the upper bound on $Gμ$ for strings with an appreciable channel into TeV-scale states is significantly more stringent than the current Cosmic Microwave Background limits. △ Less

Submitted 13 February, 2014; v1 submitted 7 November, 2013; originally announced November 2013.

Comments: 22 pages, 10 figures

arXiv:1307.5033 [pdf, other]

Continuity properties of vectors realizing points in the classical field of values

Authors: Dan Corey, Charles R. Johnson, Ryan Kirk, Brian Lins, Ilya Spitkovsky

Abstract: For an $n$-by-$n$ matrix $A$, let $f_A$ be its "field of values generating function" defined as $f_A\colon x\mapsto x^*Ax$. We consider two natural versions of the continuity, which we call strong and weak, of $f_A^{-1}$ (which is of course multi-valued) on the field of values $F(A)$. The strong continuity holds, in particular, on the interior of $F(A)$, and at such points $z \in \partial F(A)$ wh… ▽ More For an $n$-by-$n$ matrix $A$, let $f_A$ be its "field of values generating function" defined as $f_A\colon x\mapsto x^*Ax$. We consider two natural versions of the continuity, which we call strong and weak, of $f_A^{-1}$ (which is of course multi-valued) on the field of values $F(A)$. The strong continuity holds, in particular, on the interior of $F(A)$, and at such points $z \in \partial F(A)$ which are either corner points, belong to the relative interior of flat portions of $\partial F(A)$, or whose preimage under $f_A$ is contained in a one-dimensional set. Consequently, $f_A^{-1}$ is continuous in this sense on the whole $F(A)$ for all normal, 2-by-2, and unitarily irreducible 3-by-3 matrices. Nevertheless, we show by example that the strong continuity of $f_A^{-1}$ fails at certain points of $\partial F(A)$ for some (unitarily reducible) 3-by-3 and (unitarily irreducible) 4-by-4 matrices. The weak continuity, in its turn, fails for some unitarily reducible 4-by-4 and untiarily irreducible 6-by-6 matrices. △ Less

Submitted 18 July, 2013; originally announced July 2013.

Comments: 9 pages, 2 figures. Linear and Multilinear Algebra 2013

MSC Class: Primary 15A60; 47A12; Secondary 54C08

arXiv:1001.1718 [pdf]

Tiling for Performance Tuning on Different Models of GPUs

Authors: Chang Xu, Steven R. Kirk, Samantha Jenkins

Abstract: The strategy of using CUDA-compatible GPUs as a parallel computation solution to improve the performance of programs has been more and more widely approved during the last two years since the CUDA platform was released. Its benefit extends from the graphic domain to many other computationally intensive domains. Tiling, as the most general and important technique, is widely used for optimization… ▽ More The strategy of using CUDA-compatible GPUs as a parallel computation solution to improve the performance of programs has been more and more widely approved during the last two years since the CUDA platform was released. Its benefit extends from the graphic domain to many other computationally intensive domains. Tiling, as the most general and important technique, is widely used for optimization in CUDA programs. New models of GPUs with better compute capabilities have, however, been released, new versions of CUDA SDKs were also released. These updated compute capabilities must to be considered when optimizing using the tiling technique. In this paper, we implement image interpolation algorithms as a test case to discuss how different tiling strategies affect the program's performance. We especially focus on how the different models of GPUs affect the tiling's effectiveness by executing the same program on two different models of GPUs equipped testing platforms. The results demonstrate that an optimized tiling strategy on one GPU model is not always a good solution when execute on other GPU models, especially when some external conditions were changed. △ Less

Submitted 11 January, 2010; originally announced January 2010.

Comments: Accepted to ISISE2009 (Second International Symposium on Information Science and Engineering, 26 - 28,Dec. 2009, Shanghai, China)

arXiv:0708.2531 [pdf]

Molecular dynamics simulation of nanocolloidal amorphous silica particles: Part II

Authors: S. Jenkins, S. R. Kirk, M. Persson, J. Carlen, Z. Abbas

Abstract: Explicit molecular dynamics simulations were applied to a pair of amorphous silica nanoparticles of diameter 3.2 nm immersed in a background electrolyte. Mean forces acting between the pair of silica nanoparticles were extracted at four different background electrolyte concentrations. Dependence of the inter-particle potential of mean force on the separation and the silicon to sodium ratio, as w… ▽ More Explicit molecular dynamics simulations were applied to a pair of amorphous silica nanoparticles of diameter 3.2 nm immersed in a background electrolyte. Mean forces acting between the pair of silica nanoparticles were extracted at four different background electrolyte concentrations. Dependence of the inter-particle potential of mean force on the separation and the silicon to sodium ratio, as well as on the background electrolyte concentration, are demonstrated. The pH was indirectly accounted for via the ratio of silicon to sodium used in the simulations. The nature of the interaction of the counter-ions with charged silica surface sites (deprotonated silanols) was also investigated. The effect of the sodium double layer on the water ordering was investigated for three Si:Na+ ratios. The number of water molecules trapped inside the nanoparticles was investigated as the Si:Na+ ratio was varied. Differences in this number between the two nanoparticles in the simulations are attributed to differences in the calculated electric dipole moment. The implications of the form of the potentials for aggregation are also discussed. △ Less

Submitted 11 September, 2007; v1 submitted 19 August, 2007; originally announced August 2007.

Comments: v1. 33 pages, 7 figures (screen-quality PDF), submitted to J. Chem. Phys v2. 15 pages, 4 tables, 6 figures. Content, author list and title changed; single spaced

arXiv:0708.2529 [pdf]

doi 10.1063/1.2803897

Molecular dynamics simulation of nanocolloidal amorphous silica particles: Part I

Authors: S. Jenkins, S. R. Kirk, M. Persson, J. Carlen, Z. Abbas

Abstract: Explicit molecular dynamics simulations were applied to a pair of amorphous silica nanoparticles in aqueous solution, of diameter 4.4 nm with four different background electrolyte concentrations, to extract the mean force acting between the pair of silica nanoparticles. Dependences of the interparticle forces with separation and the background electrolyte concentration were demonstrated. The nat… ▽ More Explicit molecular dynamics simulations were applied to a pair of amorphous silica nanoparticles in aqueous solution, of diameter 4.4 nm with four different background electrolyte concentrations, to extract the mean force acting between the pair of silica nanoparticles. Dependences of the interparticle forces with separation and the background electrolyte concentration were demonstrated. The nature of the interaction of the counter-ions with charged silica surface sites (deprotonated silanols) was investigated. A 'patchy' double layer of adsorbed sodium counter-ions. was observed. Dependences of the interparticle potential of mean force with separation and the background electrolyte concentration were demonstrated. Direct evidence of the solvation forces is presented in terms of changes of the water ordering at the surfaces of the isolated and double nanoparticles. The nature of the interaction of the counter-ions with charged silica surface sites (deprotonated silanols) was investigated in terms of quantifying the effects of the number of water molecules separately inside each of the pair of nanoparticles by defining an impermeability measure. A direct correlation was found between impermeability (related to the silica surface 'hairiness') and the disruption of water ordering. Differences in the impermeability between the two nanoparticles are attributed to differences in the calculated electric dipole moment. △ Less

Submitted 16 September, 2007; v1 submitted 19 August, 2007; originally announced August 2007.

Comments: v1. 37 pages, 9 figures (screen-quality PDF), submitted to J. Chem. Phys v2. 13 pages, 3 tables, 9 figures. Content and author list changed, single-spaced v3. References dislocation fixed

arXiv:cond-mat/0410528 [pdf]

Identification of phases in scale-free networks

Authors: Samantha Jenkins, Steven R. Kirk

Abstract: There is a pressing need for a description of complex systems that includes considerations of the underlying network of interactions, for a diverse range of biological, technological and other networks. In this work relationships between second-order phase transitions and the power laws associated with scale-free networks are directly quantified. A unique unbiased partitioning of complex network… ▽ More There is a pressing need for a description of complex systems that includes considerations of the underlying network of interactions, for a diverse range of biological, technological and other networks. In this work relationships between second-order phase transitions and the power laws associated with scale-free networks are directly quantified. A unique unbiased partitioning of complex networks (exemplified in this work by software architectures) into high- and low-connectivity regions can be made. Other applications to finance and aerogels are outlined. △ Less

Submitted 21 October, 2004; originally announced October 2004.

Comments: 12 pages, 5 figures

Showing 51–77 of 77 results for author: Kirk, R