-
Higher-Order Equivariant Neural Networks for Charge Density Prediction in Materials
Authors:
Teddy Koker,
Keegan Quigley,
Eric Taw,
Kevin Tibbetts,
Lin Li
Abstract:
The calculation of electron density distribution using density functional theory (DFT) in materials and molecules is central to the study of their quantum and macro-scale properties, yet accurate and efficient calculation remains a long-standing challenge. We introduce ChargE3Net, an E(3)-equivariant graph neural network for predicting electron density in atomic systems. ChargE3Net enables the lea…
▽ More
The calculation of electron density distribution using density functional theory (DFT) in materials and molecules is central to the study of their quantum and macro-scale properties, yet accurate and efficient calculation remains a long-standing challenge. We introduce ChargE3Net, an E(3)-equivariant graph neural network for predicting electron density in atomic systems. ChargE3Net enables the learning of higher-order equivariant feature to achieve high predictive accuracy and model expressivity. We show that ChargE3Net exceeds the performance of prior work on diverse sets of molecules and materials. When trained on the massive dataset of over 100K materials in the Materials Project database, our model is able to capture the complexity and variability in the data, leading to a significant 26.7% reduction in self-consistent iterations when used to initialize DFT calculations on unseen materials. Furthermore, we show that non-self-consistent DFT calculations using our predicted charge densities yield near-DFT performance on electronic and thermodynamic property prediction at a fraction of the computational cost. Further analysis attributes the greater predictive accuracy to improved modeling of systems with high angular variations. These results illuminate a pathway towards a machine learning-accelerated ab initio calculations for materials discovery.
△ Less
Submitted 14 May, 2024; v1 submitted 8 December, 2023;
originally announced December 2023.
-
Bidirectional Captioning for Clinically Accurate and Interpretable Models
Authors:
Keegan Quigley,
Miriam Cha,
Josh Barua,
Geeticka Chauhan,
Seth Berkowitz,
Steven Horng,
Polina Golland
Abstract:
Vision-language pretraining has been shown to produce high-quality visual encoders which transfer efficiently to downstream computer vision tasks. While generative language models have gained widespread attention, image captioning has thus far been mostly overlooked as a form of cross-modal pretraining in favor of contrastive learning, especially in medical image analysis. In this paper, we experi…
▽ More
Vision-language pretraining has been shown to produce high-quality visual encoders which transfer efficiently to downstream computer vision tasks. While generative language models have gained widespread attention, image captioning has thus far been mostly overlooked as a form of cross-modal pretraining in favor of contrastive learning, especially in medical image analysis. In this paper, we experiment with bidirectional captioning of radiology reports as a form of pretraining and compare the quality and utility of learned embeddings with those from contrastive pretraining methods. We optimize a CNN encoder, transformer decoder architecture named RadTex for the radiology domain. Results show that not only does captioning pretraining yield visual encoders that are competitive with contrastive pretraining (CheXpert competition multi-label AUC of 89.4%), but also that our transformer decoder is capable of generating clinically relevant reports (captioning macro-F1 score of 0.349 using CheXpert labeler) and responding to prompts with targeted, interactive outputs.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Preliminary Examination of Guardian Cap Head Impact Data Using Instrumented Mouthguards
Authors:
Kristen G. Quigley,
Dustin Hopfe,
Madison R. Taylor,
Philip Pavilionis,
Vincentia Owusu-Amankonah,
Arthur Islas,
Nicholas G. Murray
Abstract:
Purpose The objective of this study is to present preliminary on-field head kinematics data for NCAA Division I American football players through closely matched pre-season workouts both with and without Guardian Caps (GCs). Methods 42 NCAA Division I American football players wore instrumented mouthguards (iMMs) for 6 closely matched workouts, 3 in traditional helmets (PRE) and 3 with GCs (POST)…
▽ More
Purpose The objective of this study is to present preliminary on-field head kinematics data for NCAA Division I American football players through closely matched pre-season workouts both with and without Guardian Caps (GCs). Methods 42 NCAA Division I American football players wore instrumented mouthguards (iMMs) for 6 closely matched workouts, 3 in traditional helmets (PRE) and 3 with GCs (POST) affixed to the exterior of their helmets. This includes 7 players who had consistent data through all workouts. Results There was no significant difference between the collapsed mean values for the entire sample between PRE and POST for peak linear acceleration (PLA) (PRE=16.3, POST=17.2Gs; p=0.20), Peak Angular Acceleration (PAA) (PRE=992.1, POST=1029.4rad/s2; p=0.51 and the total amount of impacts (PRE=9.3, POST=9.7; p=0.72). Similarly, no difference was observed between PRE and POST for PLA (PRE=16.1, POST=17.2Gs; p=0.32), PAA (PRE=951.2, POST=1038.0rad/s2; p=0.29 and total impacts (PRE=9.6, POST=9.7; p=0.32) between sessions for the 7 repeated players. Conclusion These data suggest no difference in head kinematics data (PLA, PAA and total impacts) when GCs are worn. This study suggests GCs are not effective in reducing the magnitude of head impacts experienced by NCAA Division I American football players.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
Graph Contrastive Learning for Materials
Authors:
Teddy Koker,
Keegan Quigley,
Will Spaeth,
Nathan C. Frey,
Lin Li
Abstract:
Recent work has shown the potential of graph neural networks to efficiently predict material properties, enabling high-throughput screening of materials. Training these models, however, often requires large quantities of labelled data, obtained via costly methods such as ab initio calculations or experimental evaluation. By leveraging a series of material-specific transformations, we introduce Cry…
▽ More
Recent work has shown the potential of graph neural networks to efficiently predict material properties, enabling high-throughput screening of materials. Training these models, however, often requires large quantities of labelled data, obtained via costly methods such as ab initio calculations or experimental evaluation. By leveraging a series of material-specific transformations, we introduce CrystalCLR, a framework for constrastive learning of representations with crystal graph neural networks. With the addition of a novel loss function, our framework is able to learn representations competitive with engineered fingerprinting methods. We also demonstrate that via model finetuning, contrastive pretraining can improve the performance of graph neural networks for prediction of material properties and significantly outperform traditional ML models that use engineered fingerprints. Lastly, we observe that CrystalCLR produces material representations that form clusters by compound class.
△ Less
Submitted 23 November, 2022;
originally announced November 2022.
-
Develo** Modular Autonomous Capabilities for sUAS Operations
Authors:
Keegan Quigley,
Virginia Goodwin,
Luis Alvarez,
Justin Yao,
Yousef Salaman Maclara
Abstract:
Small teams in the field can benefit from the capabilities provided by small Uncrewed Aerial Systems (sUAS) for missions such as reconnaissance, hostile attribution, remote emplacement, and search and rescue. The mobility, communications, and flexible payload capacity of sUAS can offer teams new levels of situational awareness and enable more highly coordinated missions than previously possible. H…
▽ More
Small teams in the field can benefit from the capabilities provided by small Uncrewed Aerial Systems (sUAS) for missions such as reconnaissance, hostile attribution, remote emplacement, and search and rescue. The mobility, communications, and flexible payload capacity of sUAS can offer teams new levels of situational awareness and enable more highly coordinated missions than previously possible. However, piloting such aircraft for specific missions draws personnel away from other mission-critical tasks, increasing the load on remaining personnel while also increasing complexity of operations. For wider adoption and use of sUAS for security and humanitarian missions, safe and robust autonomy must be employed to reduce this burden on small teams. In this paper, we present the development of the Collaborative-UAS for Hostile Attribution, Surveillance, Emplacement, and Reconnaissance (CHASER) testbed, for rapidly prototy** capabilities that will reduce strain on small teams through sensor-guided autonomous control. We attempt to address autonomy needs unfilled by commercial sUAS platforms by creating and testing a series of composable modules that can be configured to support multiple missions. Methods implemented and presented here include radar track correlation, on-board computer vision target detection, target position estimation, closed-loop relative position control, and efficient search of a 3D volume for target acquisition. We configure and test a series of these modules in an example mission, executing a fully autonomous chase of an intruding sUAS in live flight, and demonstrating the success of the modularized autonomy approach. We present performance results from simulation or live flight tests for each module. Lastly, we describe the software architecture that we have developed for flexible controls and comment on how the capabilities presented may enable additional missions.
△ Less
Submitted 27 January, 2023; v1 submitted 1 November, 2022;
originally announced November 2022.
-
Deriving time-averaged active inference from control principles
Authors:
Eli Sennesh,
Jordan Theriault,
Jan-Willem van de Meent,
Lisa Feldman Barrett,
Karen Quigley
Abstract:
Active inference offers a principled account of behavior as minimizing average sensory surprise over time. Applications of active inference to control problems have heretofore tended to focus on finite-horizon or discounted-surprise problems, despite deriving from the infinite-horizon, average-surprise imperative of the free-energy principle. Here we derive an infinite-horizon, average-surprise fo…
▽ More
Active inference offers a principled account of behavior as minimizing average sensory surprise over time. Applications of active inference to control problems have heretofore tended to focus on finite-horizon or discounted-surprise problems, despite deriving from the infinite-horizon, average-surprise imperative of the free-energy principle. Here we derive an infinite-horizon, average-surprise formulation of active inference from optimal control principles. Our formulation returns to the roots of active inference in neuroanatomy and neurophysiology, formally reconnecting active inference to optimal feedback control. Our formulation provides a unified objective functional for sensorimotor control and allows for reference states to vary over time.
△ Less
Submitted 22 August, 2022;
originally announced August 2022.
-
RadTex: Learning Efficient Radiograph Representations from Text Reports
Authors:
Keegan Quigley,
Miriam Cha,
Ruizhi Liao,
Geeticka Chauhan,
Steven Horng,
Seth Berkowitz,
Polina Golland
Abstract:
Automated analysis of chest radiography using deep learning has tremendous potential to enhance the clinical diagnosis of diseases in patients. However, deep learning models typically require large amounts of annotated data to achieve high performance -- often an obstacle to medical domain adaptation. In this paper, we build a data-efficient learning framework that utilizes radiology reports to im…
▽ More
Automated analysis of chest radiography using deep learning has tremendous potential to enhance the clinical diagnosis of diseases in patients. However, deep learning models typically require large amounts of annotated data to achieve high performance -- often an obstacle to medical domain adaptation. In this paper, we build a data-efficient learning framework that utilizes radiology reports to improve medical image classification performance with limited labeled data (fewer than 1000 examples). Specifically, we examine image-captioning pretraining to learn high-quality medical image representations that train on fewer examples. Following joint pretraining of a convolutional encoder and transformer decoder, we transfer the learned encoder to various classification tasks. Averaged over 9 pathologies, we find that our model achieves higher classification performance than ImageNet-supervised and in-domain supervised pretraining when labeled training data is limited.
△ Less
Submitted 7 April, 2023; v1 submitted 5 August, 2022;
originally announced August 2022.
-
Deep Learning nearby galaxy peculiar velocities
Authors:
Kevin M. Quigley,
Samuel Hori,
Rupert A. C. Croft
Abstract:
We explore how information in images of nearby galaxies can be used to estimate their distance. We train a convolutional Neural Network (NN) to do this, using galaxy images from the Illustris simulation. We show that if the NN is trained on data with random errors added to the true distance (representing training using spectroscopic redshift instead of actual distance), then the NN can predict dis…
▽ More
We explore how information in images of nearby galaxies can be used to estimate their distance. We train a convolutional Neural Network (NN) to do this, using galaxy images from the Illustris simulation. We show that if the NN is trained on data with random errors added to the true distance (representing training using spectroscopic redshift instead of actual distance), then the NN can predict distances in a test dataset with greater accuracy than it was given in the training set. This is not unusual, as often NNs are trained on data with added noise, in order to increase robustness. In this case, however, it offers a route to estimating peculiar velocities of nearby galaxies. Given a galaxy with a known spectroscopic redshift one can use the NN-predicted distance to make an estimate of the peculiar velocity. Trying this using relatively low resolution (1.4 arcsec per pixel) simulated galaxy images we find fractional RMS distance errors of 7.7% for galaxies at a mean distance of 75 Mpc from the observer, leading to RMS peculiar velocity errors of 440 km/s. In a companion paper we apply the technique to 145,115 nearby galaxies from the NASA Sloan Atlas.
△ Less
Submitted 19 April, 2022;
originally announced April 2022.
-
Optical variability of ICRF3 quasars in the Pan-STARRS 3Pi survey with functional principal components analysis
Authors:
C. T. Berghea,
V. V. Makarov,
K. Quigley,
B. Goldman
Abstract:
We make use of individual (epoch) detection data from the Pan-STARRS 3Pi survey for 2863 optical ICRF3 counterparts in the five wavelength bands g, r, i, z, and y, published as part of the Data Release 2. A dedicated method based on the Functional Principal Component Analysis is developed for these sparse and irregularly sampled data. With certain regularization and normalization constraints, it a…
▽ More
We make use of individual (epoch) detection data from the Pan-STARRS 3Pi survey for 2863 optical ICRF3 counterparts in the five wavelength bands g, r, i, z, and y, published as part of the Data Release 2. A dedicated method based on the Functional Principal Component Analysis is developed for these sparse and irregularly sampled data. With certain regularization and normalization constraints, it allows us to obtain uniform and compatible estimates of the variability amplitudes and average magnitudes between the passbands and objects. We find that the starting assumption of affinity of the light curves for a given object at different wavelengths is violated for several percent of the sample. The distributions of root-mean-square variability amplitudes are strongly skewed toward small values, peaking at ~0.1 mag with tails stretching to 2 mag. Statistically, the lowest variability is found for the r band and the largest for the reddest y band. A small "brighter-redder" effect is present, with amplitudes in y greater than amplitudes in g in 57% of the sample. The variability versus redshift dependence shows a strong decline with z toward redshift 3, which we interpret as the time dilation of the dominant time frequencies. The colors of radio-loud ICRF3 quasars are correlated with redshift in a complicated, wavy pattern governed by the emergence of brightest emission lines within the five passbands.
△ Less
Submitted 9 June, 2021;
originally announced June 2021.
-
Multimodal Representation Learning via Maximization of Local Mutual Information
Authors:
Ruizhi Liao,
Daniel Moyer,
Miriam Cha,
Keegan Quigley,
Seth Berkowitz,
Steven Horng,
Polina Golland,
William M. Wells
Abstract:
We propose and demonstrate a representation learning approach by maximizing the mutual information between local features of images and text. The goal of this approach is to learn useful image representations by taking advantage of the rich information contained in the free text that describes the findings in the image. Our method trains image and text encoders by encouraging the resulting represe…
▽ More
We propose and demonstrate a representation learning approach by maximizing the mutual information between local features of images and text. The goal of this approach is to learn useful image representations by taking advantage of the rich information contained in the free text that describes the findings in the image. Our method trains image and text encoders by encouraging the resulting representations to exhibit high local mutual information. We make use of recent advances in mutual information estimation with neural network discriminators. We argue that the sum of local mutual information is typically a lower bound on the global mutual information. Our experimental results in the downstream image classification tasks demonstrate the advantages of using local features for image-text representation learning.
△ Less
Submitted 14 December, 2021; v1 submitted 7 March, 2021;
originally announced March 2021.
-
Accessibility of Virtual Reality Locomotion Modalities to Adults and Minors
Authors:
Zhijiong Huang,
Yu Zhang,
Kathryn C. Quigley,
Ramya Sankar,
Clemence Wormser,
Xinxin Mo,
Allen Y. Yang
Abstract:
Virtual reality (VR) is an important new technology that is fun-damentally changing the way people experience entertainment and education content. Due to the fact that most currently available VR products are one size fits all, the accessibility of the content design and user interface design, even for healthy children is not well understood. It requires more research to ensure that children can h…
▽ More
Virtual reality (VR) is an important new technology that is fun-damentally changing the way people experience entertainment and education content. Due to the fact that most currently available VR products are one size fits all, the accessibility of the content design and user interface design, even for healthy children is not well understood. It requires more research to ensure that children can have equally good user compared to adults in VR. In our study, we seek to explore accessibility of locomotion in VR between healthy adults and minors along both objective and subjective dimensions. We performed a user experience experiment where subjects completed a simple task of moving and touching underwater animals in VR using one of four different locomotion modalities, as well as real-world walking without wearing VR headsets as the baseline. Our results show that physical body movement that mirrors real-world movement exclusively is the least preferred by both adults and minors. However, within the different modalities of controller assisted locomotion there are variations between adults and minors for preference and challenge levels.
△ Less
Submitted 16 April, 2019;
originally announced April 2019.