-
Asteroid (101955) Bennu in the Laboratory: Properties of the Sample Collected by OSIRIS-REx
Authors:
Dante S. Lauretta,
Harold C. Connolly, Jr.,
Joseph E. Aebersold,
Conel M. O. D. Alexander,
Ronald-L. Ballouz,
Jessica J. Barnes,
Helena C. Bates,
Carina A. Bennett,
Laurinne Blanche,
Erika H. Blumenfeld,
Simon J. Clemett,
George D. Cody,
Daniella N. DellaGiustina,
Jason P. Dworkin,
Scott A. Eckley,
Dionysis I. Foustoukos,
Ian A. Franchi,
Daniel P. Glavin,
Richard C. Greenwood,
Pierre Haenecour,
Victoria E. Hamilton,
Dolores H. Hill,
Takahiro Hiroi,
Kana Ishimaru,
Fred Jourdan
, et al. (28 additional authors not shown)
Abstract:
On 24 September 2023, the NASA OSIRIS-REx mission dropped a capsule to Earth containing approximately 120 g of pristine carbonaceous regolith from Bennu. We describe the delivery and initial allocation of this asteroid sample and introduce its bulk physical, chemical, and mineralogical properties from early analyses. The regolith is very dark overall, with higher-reflectance inclusions and particl…
▽ More
On 24 September 2023, the NASA OSIRIS-REx mission dropped a capsule to Earth containing approximately 120 g of pristine carbonaceous regolith from Bennu. We describe the delivery and initial allocation of this asteroid sample and introduce its bulk physical, chemical, and mineralogical properties from early analyses. The regolith is very dark overall, with higher-reflectance inclusions and particles interspersed. Particle sizes range from sub-micron dust to a stone about 3.5 cm long. Millimeter-scale and larger stones typically have hummocky or angular morphologies. A subset of the stones appears mottled by brighter material that occurs as veins and crusts. Hummocky stones have the lowest densities and mottled stones have the highest. Remote sensing of the surface of Bennu detected hydrated phyllosilicates, magnetite, organic compounds, carbonates, and scarce anhydrous silicates, all of which the sample confirms. We also find sulfides, presolar grains, and, less expectedly, Na-rich phosphates, as well as other trace phases. The sample composition and mineralogy indicate substantial aqueous alteration and resemble those of Ryugu and the most chemically primitive, low-petrologic-type carbonaceous chondrites. Nevertheless, we find distinct hydrogen, nitrogen, and oxygen isotopic compositions, and some of the material we analyzed is enriched in fluid-mobile elements. Our findings underscore the value of sample return, especially for low-density material that may not readily survive atmospheric entry, and lay the groundwork for more comprehensive analyses.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
An Exponential Reduction in Training Data Sizes for Machine Learning Derived Entanglement Witnesses
Authors:
Aiden R. Rosebush,
Alexander C. B. Greenwood,
Brian T. Kirby,
Li Qian
Abstract:
We propose a support vector machine (SVM) based approach for generating an entanglement witness that requires exponentially less training data than previously proposed methods. SVMs generate hyperplanes represented by a weighted sum of expectation values of local observables whose coefficients are optimized to sum to a positive number for all separable states and a negative number for as many enta…
▽ More
We propose a support vector machine (SVM) based approach for generating an entanglement witness that requires exponentially less training data than previously proposed methods. SVMs generate hyperplanes represented by a weighted sum of expectation values of local observables whose coefficients are optimized to sum to a positive number for all separable states and a negative number for as many entangled states as possible near a specific target state. Previous SVM-based approaches for entanglement witness generation used large amounts of randomly generated separable states to perform training, a task with considerable computational overhead. Here, we propose a method for orienting the witness hyperplane using only the significantly smaller set of states consisting of the eigenstates of the generalized Pauli matrices and a set of entangled states near the target entangled states. With the orientation of the witness hyperplane set by the SVM, we tune the plane's placement using a differential program that ensures perfect classification accuracy on a limited test set as well as maximal noise tolerance. For $N$ qubits, the SVM portion of this approach requires only $O(6^N)$ training states, whereas an existing method needs $O(2^{4^N})$. We use this method to construct witnesses of 4 and 5 qubit GHZ states with coefficients agreeing with stabilizer formalism witnesses to within 6.5 percent and 1 percent, respectively. We also use the same training states to generate novel 4 and 5 qubit W state witnesses. Finally, we computationally verify these witnesses on small test sets and propose methods for further verification.
△ Less
Submitted 27 February, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
On-demand Container Loading in AWS Lambda
Authors:
Marc Brooker,
Mike Danilov,
Chris Greenwood,
Phil Piwonka
Abstract:
AWS Lambda is a serverless event-driven compute service, part of a category of cloud compute offerings sometimes called Function-as-a-service (FaaS). When we first released AWS Lambda, functions were limited to 250MB of code and dependencies, packaged as a simple compressed archive. In 2020, we released support for deploying container images as large as 10GiB as Lambda functions, allowing customer…
▽ More
AWS Lambda is a serverless event-driven compute service, part of a category of cloud compute offerings sometimes called Function-as-a-service (FaaS). When we first released AWS Lambda, functions were limited to 250MB of code and dependencies, packaged as a simple compressed archive. In 2020, we released support for deploying container images as large as 10GiB as Lambda functions, allowing customers to bring much larger code bases and sets of dependencies to Lambda. Supporting larger packages, while still meeting Lambda's goals of rapid scale (adding up to 15,000 new containers per second for a single customer, and much more in aggregate), high request rate (millions of requests per second), high scale (millions of unique workloads), and low start-up times (as low as 50ms) presented a significant challenge.
We describe the storage and caching system we built, optimized for delivering container images on-demand, and our experiences designing, building, and operating it at scale. We focus on challenges around security, efficiency, latency, and cost, and how we addressed these challenges in a system that combines caching, deduplication, convergent encryption, erasure coding, and block-level demand loading.
Since building this system, it has reliably processed hundreds of trillions of Lambda invocations for over a million AWS customers, and has shown excellent resilience to load and infrastructure failures.
△ Less
Submitted 24 May, 2023; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Machine-Learning-Derived Entanglement Witnesses
Authors:
Alexander C. B. Greenwood,
Larry T. H. Wu,
Eric Y. Zhu,
Brian T. Kirby,
Li Qian
Abstract:
In this work, we show a correspondence between linear support vector machines (SVMs) and entanglement witnesses, and use this correspondence to generate entanglement witnesses for bipartite and tripartite qubit (and qudit) target entangled states. An SVM allows for the construction of a hyperplane that clearly delineates between separable states and the target entangled state; this hyperplane is a…
▽ More
In this work, we show a correspondence between linear support vector machines (SVMs) and entanglement witnesses, and use this correspondence to generate entanglement witnesses for bipartite and tripartite qubit (and qudit) target entangled states. An SVM allows for the construction of a hyperplane that clearly delineates between separable states and the target entangled state; this hyperplane is a weighted sum of observables ('features') whose coefficients are optimized during the training of the SVM. We demonstrate with this method the ability to obtain witnesses that require only local measurements even when the target state is a non-stabilizer state. Furthermore, we show that SVMs are flexible enough to allow us to rank features, and to reduce the number of features systematically while bounding the inference error. This allows us to derive W state witnesses capable of detecting entanglement with fewer measurement terms than the fidelity method dominant in today's literature. The utility of this approach is demonstrated on quantum hardware furnished through the IBM Quantum Experience.
△ Less
Submitted 22 March, 2023; v1 submitted 5 July, 2021;
originally announced July 2021.
-
An algorithm-based multiple detection influence measure for high dimensional regression using expectile
Authors:
Amadou Barry,
Nikhil Bhagwat,
Bratislav Misic,
Jean-Baptiste Poline,
Celia M. T. Greenwood
Abstract:
The identification of influential observations is an important part of data analysis that can prevent erroneous conclusions drawn from biased estimators. However, in high dimensional data, this identification is challenging. Classical and recently-developed methods often perform poorly when there are multiple influential observations in the same dataset. In particular, current methods can fail whe…
▽ More
The identification of influential observations is an important part of data analysis that can prevent erroneous conclusions drawn from biased estimators. However, in high dimensional data, this identification is challenging. Classical and recently-developed methods often perform poorly when there are multiple influential observations in the same dataset. In particular, current methods can fail when there is masking several influential observations with similar characteristics, or swam** when the influential observations are near the boundary of the space spanned by well-behaved observations. Therefore, we propose an algorithm-based, multi-step, multiple detection procedure to identify influential observations that addresses current limitations. Our three-step algorithm to identify and capture undesirable variability in the data, $\asymMIP,$ is based on two complementary statistics, inspired by asymmetric correlations, and built on expectiles. Simulations demonstrate higher detection power than competing methods. Use of the resulting asymptotic distribution leads to detection of influential observations without the need for computationally demanding procedures such as the bootstrap. The application of our method to the Autism Brain Imaging Data Exchange neuroimaging dataset resulted in a more balanced and accurate prediction of brain maturity based on cortical thickness. See our GitHub for a free R package that implements our algorithm: \texttt{asymMIP} (\url{github.com/AmBarry/hidetify}).
△ Less
Submitted 25 May, 2021;
originally announced May 2021.
-
Detecting differentially methylated regions in bisulfite sequencing data using quasi-binomial mixed models with smooth covariate effect estimates
Authors:
Kaiqiong Zhao,
Karim Oualkacha,
Lajmi Lakhal-Chaieb,
Aurélie Labbe,
Kathleen Klein,
Sasha Bernatsky,
Marie Hudson,
Inés Colmegna,
Celia M. T. Greenwood
Abstract:
Identifying disease-associated changes in DNA methylation can help to gain a better understanding of disease etiology. Bisulfite sequencing technology allows the generation of methylation profiles at single base of DNA. We previously developed a method for estimating smooth covariate effects and identifying differentially methylated regions (DMRs) from bisulfite sequencing data, which copes with e…
▽ More
Identifying disease-associated changes in DNA methylation can help to gain a better understanding of disease etiology. Bisulfite sequencing technology allows the generation of methylation profiles at single base of DNA. We previously developed a method for estimating smooth covariate effects and identifying differentially methylated regions (DMRs) from bisulfite sequencing data, which copes with experimental errors and variable read depths; this method utilizes the binomial distribution to characterize the variability in the methylated counts. However, bisulfite sequencing data frequently include low-count integers and can exhibit over or under dispersion relative to the binomial distribution. We present a substantial improvement to our previous work by proposing a quasi-likelihood-based regional testing approach which accounts for multiplicative and additive sources of dispersion. We demonstrate the theoretical properties of the resulting tests, as well as their marginal and conditional interpretations. Simulations show that the proposed method provides correct inference for smooth covariate effects and captures the major methylation patterns with excellent power.
△ Less
Submitted 18 January, 2021;
originally announced January 2021.
-
A Tracy-Widom Empirical Estimator For Valid P-values With High-Dimensional Datasets
Authors:
Maxime Turgeon,
Celia MT Greenwood,
Aurelie Labbe
Abstract:
Recent technological advances in many domains including both genomics and brain imaging have led to an abundance of high-dimensional and correlated data being routinely collected. Classical multivariate approaches like Multivariate Analysis of Variance (MANOVA) and Canonical Correlation Analysis (CCA) can be used to study relationships between such multivariate datasets. Yet, special care is requi…
▽ More
Recent technological advances in many domains including both genomics and brain imaging have led to an abundance of high-dimensional and correlated data being routinely collected. Classical multivariate approaches like Multivariate Analysis of Variance (MANOVA) and Canonical Correlation Analysis (CCA) can be used to study relationships between such multivariate datasets. Yet, special care is required with high-dimensional data, as the test statistics may be ill-defined and classical inference procedures break down.
In this work, we explain how valid p-values can be derived for these multivariate methods even in high dimensional datasets. Our main contribution is an empirical estimator for the largest root distribution of a singular double Wishart problem; this general framework underlies many common multivariate analysis approaches. From a small number of permutations of the data, we estimate the location and scale parameters of a parametric Tracy-Widom family that provides a good approximation of this distribution. Through simulations, we show that this estimated distribution also leads to valid p-values that can be used for high-dimensional inference. We then apply our approach to a pathway-based analysis of the association between DNA methylation and disease type in patients with systemic auto-immune rheumatic diseases.
△ Less
Submitted 18 November, 2018;
originally announced November 2018.
-
Distinguishing differential susceptibility, diathesis-stress and vantage sensitivity: beyond the single gene and environment model
Authors:
Alexia Jolicoeur-Martineau,
Jay Belsky,
Eszter Szekely,
Keith F. Widaman,
Michael Pluess,
Celia Greenwood,
Ashley Wazana
Abstract:
Currently, two main approaches exist to distinguish differential susceptibility from diathesis-stress and vantage sensitivity in genotype x environment interaction (GxE) research: Regions of significance (RoS) and competitive-confirmatory approaches. Each is limited by their single-gene/single-environment foci given that most phenotypes are the product of multiple interacting genetic and environme…
▽ More
Currently, two main approaches exist to distinguish differential susceptibility from diathesis-stress and vantage sensitivity in genotype x environment interaction (GxE) research: Regions of significance (RoS) and competitive-confirmatory approaches. Each is limited by their single-gene/single-environment foci given that most phenotypes are the product of multiple interacting genetic and environmental factors. We thus addressed these two concerns in a recently developed R package (LEGIT) for constructing GxE interaction models with latent genetic and environmental scores using alternating optimization. Herein we test, by means of computer simulation, diverse GxE models in the context of both single and multiple genes and environments. Results indicate that the RoS and competitive-confirmatory approaches were highly accurate when the sample size was large, whereas the latter performed better in small samples and for small effect sizes. The confirmatory approach generally had good accuracy (a) when effect size was moderate and N >= 500 and (b) when effect size was large and N >= 250, whereas RoS performed poorly. Computational tools to determine the type of GxE of multiple genes and environments are provided as extensions in our LEGIT R package.
△ Less
Submitted 21 August, 2018; v1 submitted 11 December, 2017;
originally announced December 2017.
-
Alternating optimization for GxE modelling with weighted genetic and environmental scores: examples from the MAVAN study
Authors:
Alexia Jolicoeur-Martineau,
Ashley Wazana,
Eszter Szekely,
Meir Steiner,
Alison S. Fleming,
James L. Kennedy,
Michael J. Meaney,
Celia M. T. Greenwood
Abstract:
Motivated by the goal of expanding currently existing genotype x environment interaction (GxE) models to simultaneously include multiple genetic variants and environmental exposures in a parsimonious way, we developed a novel method to estimate the parameters in a GxE model, where G is a weighted sum of genetic variants (genetic score) and E is a weighted sum of environments (environmental score).…
▽ More
Motivated by the goal of expanding currently existing genotype x environment interaction (GxE) models to simultaneously include multiple genetic variants and environmental exposures in a parsimonious way, we developed a novel method to estimate the parameters in a GxE model, where G is a weighted sum of genetic variants (genetic score) and E is a weighted sum of environments (environmental score). The approach uses alternating optimization to estimate the parameters of the GxE model. This is an iterative process where the genetic score weights, the environmental score weights, and the main model parameters are estimated in turn assuming the other parameters to be constant. This technique can be used to construct relatively complex interaction models that are constrained to a particular structure, and hence contain fewer parameters.
We present the model as a two-way interaction longitudinal mixed model, for which ordinary linear regression is a special case, but it can easily be extended to be compatible with k-way interaction models and generalized linear mixed models. The model is implemented in R (LEGIT package) and using SAS macros (LEGIT_SAS). Here we present examples from the Maternal Adversity, Vulnerability, and Neurodevelopment (MAVAN) study where we improve significantly upon already existing models using alternating optimization. Furthermore, through simulations, we demonstrate the power and validity of this approach even with small sample sizes.
△ Less
Submitted 31 August, 2017; v1 submitted 23 March, 2017;
originally announced March 2017.
-
Light Curves and Period Changes of Type II Cepheids in the Globular Clusters M3 and M5
Authors:
Katie Rabidoux,
Horace A. Smith,
Barton J. Pritzl,
Wayne Osborn,
Charles Kuehn,
Jill Randall,
R. Lustig,
K. Wells,
Lisa Taylor,
Nathan De Lee,
K. Kinemuchi,
Aaron LaCluyzé,
D. Hartley,
C. Greenwood,
M. Ingber,
M. Ireland,
E. Pellegrini,
Mary Anderson,
Gene Purdum,
J. Lacy,
M. Curtis,
Jason Smolinski,
Stephen Danford
Abstract:
Light curves in the B, V, and I_c passbands have been obtained for the type II Cepheids V154 in M3 and V42 and V84 in M5. Alternating cycle behavior, similar to that seen among RV Tauri variables, is confirmed for V84. Old and new observations, spanning more than a century, show that V154 has increased in period while V42 has decreased in period. V84, on the other hand, has shown large, erratic…
▽ More
Light curves in the B, V, and I_c passbands have been obtained for the type II Cepheids V154 in M3 and V42 and V84 in M5. Alternating cycle behavior, similar to that seen among RV Tauri variables, is confirmed for V84. Old and new observations, spanning more than a century, show that V154 has increased in period while V42 has decreased in period. V84, on the other hand, has shown large, erratic changes in period that do not appear to reflect the long term evolution of V84 through the HR diagram.
△ Less
Submitted 30 March, 2010;
originally announced March 2010.
-
The Role of ESLEA in the development of eVLBI
Authors:
R. E. Spencer,
R. Hughes-Jones,
M. Strong,
S. Casey,
A. Rushton,
P. Burgess,
S. Kershaw,
C. Greenwood
Abstract:
The internet has been used for data transfer in radio astronomy ever since its inception; however it is only recently that network bandwidth capability means that the internet becomes competitive with traditional forms of data storage. Very Long Baseline Interferometry (VLBI) uses widely separated telescopes between which high bandwidth direct connections have not been feasible until recently. T…
▽ More
The internet has been used for data transfer in radio astronomy ever since its inception; however it is only recently that network bandwidth capability means that the internet becomes competitive with traditional forms of data storage. Very Long Baseline Interferometry (VLBI) uses widely separated telescopes between which high bandwidth direct connections have not been feasible until recently. The academic networks now allow us to connect at high data rates (~1Gbps) in "eVLBI". The ESLEA project (Exploitation of Switched Lightpaths for E-science Applications) has played a major role in the development of eVLBI. We outline this development in this paper.
△ Less
Submitted 9 October, 2009;
originally announced October 2009.
-
Supermassive Black Holes at the Center of Galaxies
Authors:
Christopher J. Greenwood
Abstract:
This was my final paper for the AST 308 Galaxies class at Michigan State University. Using many sources I was able to compile a moderate amount of information concerning the evidence for, and the formation of Supermassive Black Holes.
This was my final paper for the AST 308 Galaxies class at Michigan State University. Using many sources I was able to compile a moderate amount of information concerning the evidence for, and the formation of Supermassive Black Holes.
△ Less
Submitted 13 December, 2005;
originally announced December 2005.