-
Decoding the Molecular Universe -- Workshop Report
Authors:
Thomas O. Metz,
Joshua N. Adkins,
Peter B. Armentrout,
Patrick Chain,
Fanny Chu,
Courtney D Corley,
John R. Cort,
Elizabeth Denis,
Daniel Drell,
Katherine R. Duncan,
Robert G. Ewing,
Facundo M. Fernandez,
Oliver Fiehn,
Neha Garg,
Stefan Grimme,
Christopher Henry,
Robert L. Hettich,
Tobias Kind,
Roger G. Linington,
Gary W. Miller,
Trent Northen,
Kirsten Overdahl,
Ari Patrinos,
Daniel Raftery,
Paul Rigor
, et al. (8 additional authors not shown)
Abstract:
On August 9-10, 2023, a workshop was convened at the Pacific Northwest National Laboratory (PNNL) in Richland, WA that brought together a group of internationally recognized experts in metabolomics, natural products discovery, chemical ecology, chemical and biological threat assessment, cheminformatics, computational chemistry, cloud computing, artificial intelligence, and novel technology develop…
▽ More
On August 9-10, 2023, a workshop was convened at the Pacific Northwest National Laboratory (PNNL) in Richland, WA that brought together a group of internationally recognized experts in metabolomics, natural products discovery, chemical ecology, chemical and biological threat assessment, cheminformatics, computational chemistry, cloud computing, artificial intelligence, and novel technology development. These experts were invited to assess the value and feasibility of a grand-scale project to create new technologies that would allow the identification and quantification of all small molecules, or to decode the molecular universe. The Decoding the Molecular Universe project would extend and complement the success of the Human Genome Project by develo** new capabilities and technologies to measure small molecules (defined as non-protein, non-polymer molecules less than 1500 Daltons) of any origin and generated in biological systems or produced abiotically. Workshop attendees 1) explored what new understanding of biological and environmental systems could be revealed through the lens of small molecules; 2) characterized the similarities in current needs and technical challenges between each science or mission area for unambiguous and comprehensive determination of the composition and quantities of small molecules of any sample; 3) determined the extent to which technologies or methods currently exist for unambiguously and comprehensively determining the small molecule composition of any sample and in a reasonable time; and 4) identified the attributes of the ideal technology or approach for universal small molecule measurement and identification. The workshop concluded with a discussion of how a project of this scale could be undertaken, possible thrusts for the project, early proof-of-principle applications, and similar efforts upon which the project could be modeled.
△ Less
Submitted 19 November, 2023;
originally announced November 2023.
-
Adaptive Transfer Learning: a simple but effective transfer learning
Authors:
Jung H Lee,
Henry J Kvinge,
Scott Howland,
Zachary New,
John Buckheit,
Lauren A. Phillips,
Elliott Skomski,
Jessica Hibler,
Courtney D. Corley,
Nathan O. Hodas
Abstract:
Transfer learning (TL) leverages previously obtained knowledge to learn new tasks efficiently and has been used to train deep learning (DL) models with limited amount of data. When TL is applied to DL, pretrained (teacher) models are fine-tuned to build domain specific (student) models. This fine-tuning relies on the fact that DL model can be decomposed to classifiers and feature extractors, and a…
▽ More
Transfer learning (TL) leverages previously obtained knowledge to learn new tasks efficiently and has been used to train deep learning (DL) models with limited amount of data. When TL is applied to DL, pretrained (teacher) models are fine-tuned to build domain specific (student) models. This fine-tuning relies on the fact that DL model can be decomposed to classifiers and feature extractors, and a line of studies showed that the same feature extractors can be used to train classifiers on multiple tasks. Furthermore, recent studies proposed multiple algorithms that can fine-tune teacher models' feature extractors to train student models more efficiently. We note that regardless of the fine-tuning of feature extractors, the classifiers of student models are trained with final outputs of feature extractors (i.e., the outputs of penultimate layers). However, a recent study suggested that feature maps in ResNets across layers could be functionally equivalent, raising the possibility that feature maps inside the feature extractors can also be used to train student models' classifiers. Inspired by this study, we tested if feature maps in the hidden layers of the teacher models can be used to improve the student models' accuracy (i.e., TL's efficiency). Specifically, we developed 'adaptive transfer learning (ATL)', which can choose an optimal set of feature maps for TL, and tested it in the few-shot learning setting. Our empirical evaluations suggest that ATL can help DL models learn more efficiently, especially when available examples are limited.
△ Less
Submitted 21 November, 2021;
originally announced November 2021.
-
The complex zeros of random orthogonal polynomials
Authors:
Christopher Corley,
Andrew Ledoan,
Aaron Yeager
Abstract:
We utilize Cauchy's argument principle in combination with the Jacobian of a holomorphic function in several complex variables and the first moment of a ratio of two correlated complex normal random variables to prove explicit formulas for the density and the mean distribution of complex zeros of random polynomials spanned by orthogonal polynomials on the unit circle and on the unit disk. We then…
▽ More
We utilize Cauchy's argument principle in combination with the Jacobian of a holomorphic function in several complex variables and the first moment of a ratio of two correlated complex normal random variables to prove explicit formulas for the density and the mean distribution of complex zeros of random polynomials spanned by orthogonal polynomials on the unit circle and on the unit disk. We then inquire into the consequences of their asymptotical evaluations.
△ Less
Submitted 7 January, 2022; v1 submitted 22 August, 2021;
originally announced August 2021.
-
One Representation to Rule Them All: Identifying Out-of-Support Examples in Few-shot Learning with Generic Representations
Authors:
Henry Kvinge,
Scott Howland,
Nico Courts,
Lauren A. Phillips,
John Buckheit,
Zachary New,
Elliott Skomski,
Jung H. Lee,
Sandeep Tiwari,
Jessica Hibler,
Courtney D. Corley,
Nathan O. Hodas
Abstract:
The field of few-shot learning has made remarkable strides in develo** powerful models that can operate in the small data regime. Nearly all of these methods assume every unlabeled instance encountered will belong to a handful of known classes for which one has examples. This can be problematic for real-world use cases where one routinely finds 'none-of-the-above' examples. In this paper we desc…
▽ More
The field of few-shot learning has made remarkable strides in develo** powerful models that can operate in the small data regime. Nearly all of these methods assume every unlabeled instance encountered will belong to a handful of known classes for which one has examples. This can be problematic for real-world use cases where one routinely finds 'none-of-the-above' examples. In this paper we describe this challenge of identifying what we term 'out-of-support' (OOS) examples. We describe how this problem is subtly different from out-of-distribution detection and describe a new method of identifying OOS examples within the Prototypical Networks framework using a fixed point which we call the generic representation. We show that our method outperforms other existing approaches in the literature as well as other approaches that we propose in this paper. Finally, we investigate how the use of such a generic point affects the geometry of a model's feature space.
△ Less
Submitted 2 June, 2021;
originally announced June 2021.
-
Prototypical Region Proposal Networks for Few-Shot Localization and Classification
Authors:
Elliott Skomski,
Aaron Tuor,
Andrew Avila,
Lauren Phillips,
Zachary New,
Henry Kvinge,
Courtney D. Corley,
Nathan Hodas
Abstract:
Recently proposed few-shot image classification methods have generally focused on use cases where the objects to be classified are the central subject of images. Despite success on benchmark vision datasets aligned with this use case, these methods typically fail on use cases involving densely-annotated, busy images: images common in the wild where objects of relevance are not the central subject,…
▽ More
Recently proposed few-shot image classification methods have generally focused on use cases where the objects to be classified are the central subject of images. Despite success on benchmark vision datasets aligned with this use case, these methods typically fail on use cases involving densely-annotated, busy images: images common in the wild where objects of relevance are not the central subject, instead appearing potentially occluded, small, or among other incidental objects belonging to other classes of potential interest. To localize relevant objects, we employ a prototype-based few-shot segmentation model which compares the encoded features of unlabeled query images with support class centroids to produce region proposals indicating the presence and location of support set classes in a query image. These region proposals are then used as additional conditioning input to few-shot image classifiers. We develop a framework to unify the two stages (segmentation and classification) into an end-to-end classification model -- PRoPnet -- and empirically demonstrate that our methods improve accuracy on image datasets with natural scenes containing multiple object classes.
△ Less
Submitted 8 April, 2021;
originally announced April 2021.
-
THz intersubband electroluminescence from n-type Ge/SiGe quantum cascade structures
Authors:
David Stark,
Muhammad Mirza,
Luca Persichetti,
Michele Montanari,
Sergej Markmann,
Mattias Beck,
Thomas Grange,
Stefan Birner,
Michele Virgilio,
Chiara Ciano,
Michele Ortolani,
Cedric Corley,
Giovanni Capellini,
Luciana Di Gaspare,
Monica De Seta,
Douglas J. Paul,
Jérôme Faist,
Giacomo Scalari
Abstract:
We report electroluminescence originating from L-valley transitions in n-type Ge/Si$_{0.15}$Ge$_{0.85}$ quantum cascade structures centered at 3.4 and 4.9 THz with a line broadening of $Δf/f \approx 0.2$. Three strain-compensated heterostructures, grown on a Si substrate by ultrahigh vacuum chemical vapor deposition, have been investigated. The design is based on a single quantum well active regio…
▽ More
We report electroluminescence originating from L-valley transitions in n-type Ge/Si$_{0.15}$Ge$_{0.85}$ quantum cascade structures centered at 3.4 and 4.9 THz with a line broadening of $Δf/f \approx 0.2$. Three strain-compensated heterostructures, grown on a Si substrate by ultrahigh vacuum chemical vapor deposition, have been investigated. The design is based on a single quantum well active region employing a vertical optical transition and the observed spectral features are well described by non-equilibrium Green's function calculations. The presence of two peaks highlights a suboptimal injection in the upper state of the radiative transition. Comparison of the electroluminescence spectra with similar GaAs/AlGaAs structure yields one order of magnitude lower emission efficiency.
△ Less
Submitted 14 January, 2021;
originally announced January 2021.
-
The level crossings of random sums
Authors:
Christopher Corley,
Andrew Ledoan
Abstract:
Let $\{η_{j}\}_{j = 0}^{N}$ be a sequence of independent and identically distributed complex normal random variables with mean zero and variances $\{σ_{j}^{2}\}_{j = 0}^{N}$. Let $\{f_{j} (z)\}_{j = 0}^{N}$ be a sequence of holomorphic functions that are real-valued on the real line. The purpose of the present study is that of examining the number of times that the random sum…
▽ More
Let $\{η_{j}\}_{j = 0}^{N}$ be a sequence of independent and identically distributed complex normal random variables with mean zero and variances $\{σ_{j}^{2}\}_{j = 0}^{N}$. Let $\{f_{j} (z)\}_{j = 0}^{N}$ be a sequence of holomorphic functions that are real-valued on the real line. The purpose of the present study is that of examining the number of times that the random sum $\sum_{j = 0}^{N} η_{j} f_{j} (z)$ crosses the complex level $\boldsymbol{K} = K_{1} + i K_{2}$, where $K_{1}$ and $K_{2}$ are constants independent of $z$. More specifically, we establish an exact formula for the expected density function for the complex zeros. We then reformulate the problem in terms of successive observations of a Brownian motion. We further answer the basic question about the expected number of complex zeros for coefficients of nonvanishing mean values.
△ Less
Submitted 6 April, 2021; v1 submitted 18 December, 2020;
originally announced December 2020.
-
Fuzzy Simplicial Networks: A Topology-Inspired Model to Improve Task Generalization in Few-shot Learning
Authors:
Henry Kvinge,
Zachary New,
Nico Courts,
Jung H. Lee,
Lauren A. Phillips,
Courtney D. Corley,
Aaron Tuor,
Andrew Avila,
Nathan O. Hodas
Abstract:
Deep learning has shown great success in settings with massive amounts of data but has struggled when data is limited. Few-shot learning algorithms, which seek to address this limitation, are designed to generalize well to new tasks with limited data. Typically, models are evaluated on unseen classes and datasets that are defined by the same fundamental task as they are trained for (e.g. category…
▽ More
Deep learning has shown great success in settings with massive amounts of data but has struggled when data is limited. Few-shot learning algorithms, which seek to address this limitation, are designed to generalize well to new tasks with limited data. Typically, models are evaluated on unseen classes and datasets that are defined by the same fundamental task as they are trained for (e.g. category membership). One can also ask how well a model can generalize to fundamentally different tasks within a fixed dataset (for example: moving from category membership to tasks that involve detecting object orientation or quantity). To formalize this kind of shift we define a notion of "independence of tasks" and identify three new sets of labels for established computer vision datasets that test a model's ability to generalize to tasks which draw on orthogonal attributes in the data. We use these datasets to investigate the failure modes of metric-based few-shot models. Based on our findings, we introduce a new few-shot model called Fuzzy Simplicial Networks (FSN) which leverages a construction from topology to more flexibly represent each class from limited data. In particular, FSN models can not only form multiple representations for a given class but can also begin to capture the low-dimensional structure which characterizes class manifolds in the encoded space of deep networks. We show that FSN outperforms state-of-the-art models on the challenging tasks we introduce in this paper while remaining competitive on standard few-shot benchmarks.
△ Less
Submitted 23 September, 2020;
originally announced September 2020.
-
The density of complex zeros of random sums
Authors:
Christopher Corley,
Andrew Ledoan
Abstract:
Let $\{η_{j}\}_{j = 0}^{N}$ be a sequence of independent, identically distributed random complex Gaussian variables, and let $\{f_{j} (z)\}_{j = 0}^{N}$ be a sequence of given analytic functions that are real-valued on the real number line. We prove an exact formula for the expected density of the distribution of complex zeros of the random equation $\sum_{j = 0}^{N} η_{j} f_{j} (z) = \mathbf{K}$,…
▽ More
Let $\{η_{j}\}_{j = 0}^{N}$ be a sequence of independent, identically distributed random complex Gaussian variables, and let $\{f_{j} (z)\}_{j = 0}^{N}$ be a sequence of given analytic functions that are real-valued on the real number line. We prove an exact formula for the expected density of the distribution of complex zeros of the random equation $\sum_{j = 0}^{N} η_{j} f_{j} (z) = \mathbf{K}$, where $\mathbf{K} \in \mathds{C}$. The method of proof employs a formula for the expected absolute value of quadratic forms of Gaussian random variables. We then obtain the limiting behaviour of the density function as $N$ tends to infinity and provide numerical computations for the density function and empirical distributions for random sums with certain functions $f_{j} (z)$. Finally, we study the case when the $f_{j} (z)$ are polynomials orthogonal on the real line and the unit circle.
△ Less
Submitted 11 October, 2019;
originally announced October 2019.
-
Deep learning to generate in silico chemical property libraries and candidate molecules for small molecule identification in complex samples
Authors:
Sean M. Colby,
Jamie R. Nuñez,
Nathan O. Hodas,
Courtney D. Corley,
Ryan R. Renslow
Abstract:
Comprehensive and unambiguous identification of small molecules in complex samples will revolutionize our understanding of the role of metabolites in biological systems. Existing and emerging technologies have enabled measurement of chemical properties of molecules in complex mixtures and, in concert, are sensitive enough to resolve even stereoisomers. Despite these experimental advances, small mo…
▽ More
Comprehensive and unambiguous identification of small molecules in complex samples will revolutionize our understanding of the role of metabolites in biological systems. Existing and emerging technologies have enabled measurement of chemical properties of molecules in complex mixtures and, in concert, are sensitive enough to resolve even stereoisomers. Despite these experimental advances, small molecule identification is inhibited by (i) chemical reference libraries representing <1% of known molecules, limiting the number of possible identifications, and (ii) the lack of a method to generate candidate matches directly from experimental features (i.e. without a library). To this end, we developed a variational autoencoder (VAE) to learn a continuous numerical, or latent, representation of molecular structure to expand reference libraries for small molecule identification. We extended the VAE to include a chemical property decoder, trained as a multitask network, in order to shape the latent representation such that it assembles according to desired chemical properties. The approach is unique in its application to small molecule identification, with its focus on m/z and CCS, paired with its training paradigm, which involved a cascade of transfer learning iterations. This allows the network to learn as much as possible at each stage, enabling success with progressively smaller datasets without overfitting. Once trained, the network can rapidly predict chemical properties directly from structure, as well as generate candidate structures with desired chemical properties. Additionally, the ability to generate novel molecules along manifolds, defined by chemical property analogues, positions DarkChem as highly useful in a number of application areas, including metabolomics and small molecule identification, drug discovery and design, chemical forensics, and beyond.
△ Less
Submitted 20 May, 2019;
originally announced May 2019.
-
Sharkzor: Interactive Deep Learning for Image Triage, Sort and Summary
Authors:
Meg Pirrung,
Nathan Hilliard,
Artëm Yankov,
Nancy O'Brien,
Paul Weidert,
Courtney D Corley,
Nathan O Hodas
Abstract:
Sharkzor is a web application for machine-learning assisted image sort and summary. Deep learning algorithms are leveraged to infer, augment, and automate the user's mental model. Initially, images uploaded by the user are spread out on a canvas. The user then interacts with the images to impute their mental model into the application's algorithmic underpinnings. Methods of interaction within Shar…
▽ More
Sharkzor is a web application for machine-learning assisted image sort and summary. Deep learning algorithms are leveraged to infer, augment, and automate the user's mental model. Initially, images uploaded by the user are spread out on a canvas. The user then interacts with the images to impute their mental model into the application's algorithmic underpinnings. Methods of interaction within Sharkzor's user interface and user experience support three primary user tasks; triage, organize and automate. The user triages the large pile of overlap** images by moving images of interest into proximity. The user then organizes said images into meaningful groups. After interacting with the images and groups, deep learning helps to automate the user's interactions. The loop of interaction, automation, and response by the user allows the system to quickly make sense of large amounts of data.
△ Less
Submitted 14 February, 2018;
originally announced February 2018.
-
Few-Shot Learning with Metric-Agnostic Conditional Embeddings
Authors:
Nathan Hilliard,
Lawrence Phillips,
Scott Howland,
Artëm Yankov,
Courtney D. Corley,
Nathan O. Hodas
Abstract:
Learning high quality class representations from few examples is a key problem in metric-learning approaches to few-shot learning. To accomplish this, we introduce a novel architecture where class representations are conditioned for each few-shot trial based on a target image. We also deviate from traditional metric-learning approaches by training a network to perform comparisons between classes r…
▽ More
Learning high quality class representations from few examples is a key problem in metric-learning approaches to few-shot learning. To accomplish this, we introduce a novel architecture where class representations are conditioned for each few-shot trial based on a target image. We also deviate from traditional metric-learning approaches by training a network to perform comparisons between classes rather than relying on a static metric comparison. This allows the network to decide what aspects of each class are important for the comparison at hand. We find that this flexible architecture works well in practice, achieving state-of-the-art performance on the Caltech-UCSD birds fine-grained classification task.
△ Less
Submitted 12 February, 2018;
originally announced February 2018.
-
Dynamic Input Structure and Network Assembly for Few-Shot Learning
Authors:
Nathan Hilliard,
Nathan O. Hodas,
Courtney D. Corley
Abstract:
The ability to learn from a small number of examples has been a difficult problem in machine learning since its inception. While methods have succeeded with large amounts of training data, research has been underway in how to accomplish similar performance with fewer examples, known as one-shot or more generally few-shot learning. This technique has been shown to have promising performance, but in…
▽ More
The ability to learn from a small number of examples has been a difficult problem in machine learning since its inception. While methods have succeeded with large amounts of training data, research has been underway in how to accomplish similar performance with fewer examples, known as one-shot or more generally few-shot learning. This technique has been shown to have promising performance, but in practice requires fixed-size inputs making it impractical for production systems where class sizes can vary. This impedes training and the final utility of few-shot learning systems. This paper describes an approach to constructing and training a network that can handle arbitrary example sizes dynamically as the system is used.
△ Less
Submitted 22 August, 2017;
originally announced August 2017.
-
Beyond Fine Tuning: A Modular Approach to Learning on Small Data
Authors:
Ark Anderson,
Kyle Shaffer,
Artem Yankov,
Court D. Corley,
Nathan O. Hodas
Abstract:
In this paper we present a technique to train neural network models on small amounts of data. Current methods for training neural networks on small amounts of rich data typically rely on strategies such as fine-tuning a pre-trained neural network or the use of domain-specific hand-engineered features. Here we take the approach of treating network layers, or entire networks, as modules and combine…
▽ More
In this paper we present a technique to train neural network models on small amounts of data. Current methods for training neural networks on small amounts of rich data typically rely on strategies such as fine-tuning a pre-trained neural network or the use of domain-specific hand-engineered features. Here we take the approach of treating network layers, or entire networks, as modules and combine pre-trained modules with untrained modules, to learn the shift in distributions between data sets. The central impact of using a modular approach comes from adding new representations to a network, as opposed to replacing representations via fine-tuning. Using this technique, we are able surpass results using standard fine-tuning transfer learning approaches, and we are also able to significantly increase performance over such approaches when using smaller amounts of data.
△ Less
Submitted 5 November, 2016;
originally announced November 2016.
-
How a user's personality influences content engagement in social media
Authors:
Nathan O. Hodas,
Ryan Butner,
Court Corley
Abstract:
Social media presents an opportunity for people to share content that they find to be significant, funny, or notable. No single piece of content will appeal to all users, but are there systematic variations between users that can help us better understand information propagation? We conducted an experiment exploring social media usage during disaster scenarios, combining electroencephalogram (EEG)…
▽ More
Social media presents an opportunity for people to share content that they find to be significant, funny, or notable. No single piece of content will appeal to all users, but are there systematic variations between users that can help us better understand information propagation? We conducted an experiment exploring social media usage during disaster scenarios, combining electroencephalogram (EEG), personality surveys, and prompts to share social media, we show how personality not only drives willingness to engage with social media but also helps to determine what type of content users find compelling. As expected, extroverts are more likely to share content. In contrast, one of our central results is that individuals with depressive personalities are the most likely cohort to share informative content, like news or alerts. Because personality and mood will generally be highly correlated between friends via homophily, our results may be an import factor in understanding social contagion.
△ Less
Submitted 1 September, 2016;
originally announced September 2016.