-
LaTable: Towards Large Tabular Models
Authors:
Boris van Breugel,
Jonathan Crabbé,
Rob Davis,
Mihaela van der Schaar
Abstract:
Tabular data is one of the most ubiquitous modalities, yet the literature on tabular generative foundation models is lagging far behind its text and vision counterparts. Creating such a model is hard, due to the heterogeneous feature spaces of different tabular datasets, tabular metadata (e.g. dataset description and feature headers), and tables lacking prior knowledge (e.g. feature order). In this work we propose LaTable: a novel tabular diffusion model that addresses these challenges and can be trained across different datasets. Through extensive experiments we find that LaTable outperforms baselines on in-distribution generation, and that finetuning LaTable can generate out-of-distribution datasets better with fewer samples. On the other hand, we explore the poor zero-shot performance of LaTable, and what it may teach us about building generative tabular foundation models with better zero- and few-shot generation capabilities.
Submitted 25 June, 2024;
originally announced June 2024.
-
Thermal stability and phase transformation of $α$-, $κ(ε)$-, and $γ$-Ga$_2$O$_3$ thin films to $β$-Ga$_2$O$_3$ under various ambient conditions
Authors:
J. Tang,
K. Jiang,
P. Tseng,
R. C. Kurchin,
L. M. Porter,
R. F. Davis
Abstract:
Phase transitions in metastable $α$-, $κ(ε)$-, and $γ$-Ga$_2$O$_3$ films to thermodynamically stable $β$-Ga$_2$O$_3$ during annealing in air, N$_2$, and vacuum have been systematically investigated via in-situ high-temperature X-ray diffraction and scanning electron microscopy. These respective polymorphs exhibited thermal stability to around 471-525$^\circ$C, 773-825$^\circ$C, and 490-575$^\circ$C before transforming into $β$-Ga$_2$O$_3$, across all tested ambient conditions. Particular crystallographic orientation relationships were observed before and after the phase transitions, i.e., (0006) $α$-Ga$_2$O$_3$ $\parallel$ $(\overline{4}02)$ $β$-Ga$_2$O$_3$, (004) $κ(ε)$-Ga$_2$O$_3$ $\parallel$ (310) and $(\overline{4}02)$ $β$-Ga$_2$O$_3$, and (400) $γ$-Ga$_2$O$_3$ $\parallel$ (400) $β$-Ga$_2$O$_3$. The phase transition of $α$-Ga$_2$O$_3$ to $β$-Ga$_2$O$_3$ resulted in catastrophic damage to the film and upheaval of the surface. The respective primary and possibly secondary causes of this damage are the +8.6% volume expansion and the dual displacive and reconstructive transformations that occur during this transition. The $κ(ε)$- and $γ$-Ga$_2$O$_3$ films converted to $β$-Ga$_2$O$_3$ via singular reconstructive transformations with small changes in volume and unchanged surface microstructures.
Submitted 30 April, 2024;
originally announced May 2024.
-
Helical Phononic Modes Induced by a Screw Dislocation
Authors:
Yun Zhou,
Robert Davis,
Li Chen,
Erda Wen,
Prabhakar Bandaru,
Daniel Sievenpiper
Abstract:
In this study, we investigate a one-dimensional (1D) unidirectional phononic waveguide embedded within a three-dimensional (3D) hexagonal close-packed phononic crystal, achieved by the introduction of a screw dislocation. This approach does not rely on the non-trivial topological characteristics of the 3D crystal. We discover that this dislocation induces a pair of helical modes, characterized by their orthogonal $x$- and $y$-directional displacements being out of phase by 90 degrees, which results in a distinctive rotational motion. These helical modes demonstrate directional propagation, tightly linked to the helicity of the screw dislocation. Through considerations of symmetry, we reveal that the emergence of these helical modes is governed by the symmetry of the screw dislocation itself. Our findings not only provide insights into the interplay between dislocation-induced symmetry and wave propagation in phononic systems but also open new avenues for designing directionally selective waveguides without relying on the crystal's topological properties.
Submitted 28 April, 2024;
originally announced April 2024.
-
Pwyll and Manannán Craters as a Laboratory for Constraining Irradiation Timescales on Europa
Authors:
M. Ryleigh Davis,
Michael E. Brown
Abstract:
We examine high spatial resolution Galileo/NIMS observations of the young (~1 My - 20 My) impact features, Pwyll and Manannán craters, on Europa's trailing hemisphere in an effort to constrain irradiation timescales. We characterize their composition using a linear spectral modeling analysis and find that both craters and their ejecta are depleted in hydrated sulfuric acid relative to nearby older terrain. This suggests that the radiolytic sulfur cycle has not yet had enough time to build up an equilibrium concentration of H2SO4, and places a strong lower limit, equal to the age of the craters, on the equilibrium timescale of the radiolytic sulfur cycle on Europa's trailing hemisphere. Additionally, we find that the dark and red material seen in the craters and proximal ejecta of Pwyll and Manannán shows the spectroscopic signature of hydrated, presumably endogenic salts. This suggests that the irradiation-induced darkening and reddening of endogenic salts thought to occur on Europa's trailing hemisphere has already happened at Pwyll and Manannán, thereby placing an upper limit on the timescale by which salts are irradiation reddened.
Submitted 23 April, 2024;
originally announced April 2024.
-
JWST Spectrophotometry of the Small Satellites of Uranus and Neptune
Authors:
Matthew Belyakov,
M. Ryleigh Davis,
Zachariah Milby,
Ian Wong,
Michael E. Brown
Abstract:
We use 1.4-4.6 micron multi-band photometry of the small inner Uranian and Neptunian satellites obtained with the James Webb Space Telescope's near-infrared imager NIRCam to characterize their surface compositions. We find that the satellites of the ice giants have, to first order, similar compositions to one another, with a 3.0 micron absorption feature possibly associated with an O-H stretch, indicative of water ice or hydrated minerals. Additionally, the spectrophotometry for the small ice giant satellites matches spectra of some Neptune Trojans and excited Kuiper belt objects, suggesting shared properties. Future spectroscopy of these small satellites is necessary to identify and better constrain their specific surface compositions.
Submitted 9 April, 2024;
originally announced April 2024.
-
Effectiveness of Self-Assessment Software to Evaluate Preclinical Operative Procedures
Authors:
Qi Dai,
Ryan Davis,
Houlin Hong,
Ying Gu
Abstract:
Objectives: To assess the effectiveness of digital scanning techniques for self-assessment of preparations and restorations in preclinical dental education when compared to traditional faculty grading. Methods: Forty-four separate Class I (#30-O) preparations, Class II (#30-MO) preparations, and Class II amalgam restorations (#31-MO) were generated, respectively, under a preclinical assessment setting. Calibrated faculty evaluated the preparations and restorations using a standard rubric from the preclinical operative class. The same teeth were scanned using a Planmeca PlanScan intraoral scanner and graded using the Romexis E4D Compare Software. Each tooth was compared against a corresponding gold standard tooth with tolerance intervals ranging from 100μm to 500μm. These scores were compared to traditional faculty grades using a linear mixed model to estimate the mean differences with 95% confidence intervals for each tolerance level. Results: The average Compare Software grade of Class I preparation at 300μm tolerance had the smallest mean difference, 1.64 points on a 100-point scale, compared to the average faculty grade. Class II preparation at 400μm tolerance had the smallest mean difference of 0.41 points. Finally, Class II restoration at 300μm tolerance had the smallest mean difference at 0.20 points. Conclusion: In this study, tolerance levels that best correlated the Compare Software grades with the faculty grades were determined for three operative procedures: Class I preparation, Class II preparation, and Class II restoration. The Compare Software can serve as a useful adjunct method for more objective grading. It can also be used by students as a self-assessment tool.
Submitted 8 April, 2024;
originally announced April 2024.
-
DE-HNN: An effective neural model for Circuit Netlist representation
Authors:
Zhishang Luo,
Truong Son Hy,
Puoya Tabaghi,
Donghyeon Koh,
Michael Defferrard,
Elahe Rezaei,
Ryan Carey,
Rhett Davis,
Rajeev Jain,
Yusu Wang
Abstract:
The run-time for optimization tools used in chip design has grown with the complexity of designs to the point where it can take several days to go through one design cycle, which has become a bottleneck. Designers want fast tools that can quickly give feedback on a design. Using the input and output data of the tools from past designs, one can attempt to build a machine learning model that predicts the outcome of a design in significantly shorter time than running the tool. The accuracy of such models is affected by the representation of the design data, which is usually a netlist that describes the elements of the digital circuit and how they are connected. Graph representations for the netlist together with graph neural networks have been investigated for such models. However, the characteristics of netlists pose several challenges for existing graph learning frameworks, due to the large number of nodes and the importance of long-range interactions between nodes. To address these challenges, we represent the netlist as a directed hypergraph and propose a Directional Equivariant Hypergraph Neural Network (DE-HNN) for the effective learning of (directed) hypergraphs. Theoretically, we show that our DE-HNN can universally approximate any node or hyperedge based function that satisfies certain permutation equivariant and invariant properties natural for directed hypergraphs. We compare the proposed DE-HNN with several state-of-the-art (SOTA) machine learning models for (hyper)graphs and netlists, and show that the DE-HNN significantly outperforms them in predicting the outcome of optimized place-and-route tools directly from the input netlists. Our source code and the netlist data used are publicly available at https://github.com/YusuLab/chips.git
Submitted 16 April, 2024; v1 submitted 30 March, 2024;
originally announced April 2024.
-
Towards Modeling Learner Performance with Large Language Models
Authors:
Seyed Parsa Neshaei,
Richard Lee Davis,
Adam Hazimeh,
Bojan Lazarevski,
Pierre Dillenbourg,
Tanja Käser
Abstract:
Recent work exploring the capabilities of pre-trained large language models (LLMs) has demonstrated their ability to act as general pattern machines by completing complex token sequences representing a wide array of tasks, including time-series prediction and robot control. This paper investigates whether the pattern recognition and sequence modeling capabilities of LLMs can be extended to the domain of knowledge tracing, a critical component in the development of intelligent tutoring systems (ITSs) that tailor educational experiences by predicting learner performance over time. In an empirical evaluation across multiple real-world datasets, we compare two approaches to using LLMs for this task, zero-shot prompting and model fine-tuning, with existing, non-LLM approaches to knowledge tracing. While LLM-based approaches do not achieve state-of-the-art performance, fine-tuned LLMs surpass the performance of naive baseline models and perform on par with standard Bayesian Knowledge Tracing approaches across multiple metrics. These findings suggest that the pattern recognition capabilities of LLMs can be used to model complex learning trajectories, opening a novel avenue for applying LLMs to educational contexts. The paper concludes with a discussion of the implications of these findings for future research, suggesting that further refinements and a deeper understanding of LLMs' predictive mechanisms could lead to enhanced performance in knowledge tracing tasks.
Submitted 29 February, 2024;
originally announced March 2024.
-
Sample Splitting and Assessing Goodness-of-fit of Time Series
Authors:
Richard A. Davis,
Leon Fernandes
Abstract:
A fundamental and often final step in time series modeling is to assess the quality of fit of a proposed model to the data. Since the underlying distribution of the innovations that generate a model is often not prescribed, goodness-of-fit tests typically take the form of testing the fitted residuals for serial independence. However, these fitted residuals are inherently dependent since they are based on the same parameter estimates, and thus standard tests of serial independence, such as those based on the autocorrelation function (ACF) or distance correlation function (ADCF) of the fitted residuals, need to be adjusted. The sample splitting procedure in Pfister et al. (2018) is one such fix for the case of models for independent data, but fails to work in the dependent setting. In this paper sample splitting is leveraged in the time series setting to perform tests of serial dependence of fitted residuals using the ACF and ADCF. Here the first $f_n$ of the data points are used to estimate the parameters of the model and then, using these parameter estimates, the last $l_n$ of the data points are used to compute the estimated residuals. Tests for serial independence are then based on these $l_n$ residuals. As long as the overlap between the $f_n$ and $l_n$ data splits is asymptotically 1/2, the ACF and ADCF tests of serial independence often have the same limit distributions as though the underlying residuals are indeed iid. In particular, if the first half of the data is used to estimate the parameters and the estimated residuals are computed for the entire data set based on these parameter estimates, then the ACF and ADCF can have the same limit distributions as though the residuals were iid. This procedure ameliorates the need for adjustment in the construction of confidence bounds for both the ACF and ADCF in goodness-of-fit testing.
Submitted 11 March, 2024;
originally announced March 2024.
-
Wavelet Based Periodic Autoregressive Moving Average Models
Authors:
Rhea Davis,
N. Balakrishna
Abstract:
This paper proposes a wavelet-based method for analysing periodic autoregressive moving average (PARMA) time series. Even though Fourier analysis provides an effective method for analysing periodic time series, it requires the estimation of a large number of Fourier parameters when the PARMA parameters do not vary smoothly. The wavelet-based analysis helps us to obtain a parsimonious model with a reduced number of parameters. We have illustrated this with simulated and actual data sets.
Submitted 29 February, 2024;
originally announced March 2024.
-
On the Ehrhart Theory of Generalized Symmetric Edge Polytopes
Authors:
Robert Davis,
Akihiro Higashitani,
Hidefumi Ohsugi
Abstract:
The symmetric edge polytope (SEP) of a (finite, undirected) graph is a centrally symmetric lattice polytope whose vertices are defined by the edges of the graph. SEPs have been studied extensively in the past twenty years. Recently, Tóthmérész and, independently, D'Alí, Juhnke-Kubitzke, and Koch generalized the definition of an SEP to regular matroids, as these are the matroids that can be characterized by totally unimodular matrices. Generalized SEPs are known to have symmetric Ehrhart $h^*$-polynomials, and Ohsugi and Tsuchiya conjectured that (ordinary) SEPs have nonnegative $γ$-vectors.
In this article, we use combinatorial and Gröbner basis techniques to extend additional known properties of SEPs to generalized SEPs. Along the way, we show that generalized SEPs are not necessarily $γ$-nonnegative by providing explicit examples. We prove these polytopes to be "nearly" $γ$-nonnegative in the sense that, by deleting exactly two elements from the matroid, one obtains SEPs for graphs that are $γ$-nonnegative. This provides further evidence that Ohsugi and Tsuchiya's conjecture holds in the ordinary case.
Submitted 6 March, 2024; v1 submitted 6 January, 2024;
originally announced January 2024.
-
Characterization of the Repeating FRB 20220912A with the Allen Telescope Array
Authors:
Sofia Z. Sheikh,
Wael Farah,
Alexander W. Pollak,
Andrew P. V. Siemion,
Mohammed A. Chamma,
Luigi F. Cruz,
Roy H. Davis,
David R. DeBoer,
Vishal Gajjar,
Phil Karn,
Jamar Kittling,
Wenbin Lu,
Mark Masters,
Pranav Premnath,
Sarah Schoultz,
Carol Shumaker,
Gurmehar Singh,
Michael Snodgrass
Abstract:
FRB 20220912A is a repeating Fast Radio Burst (FRB) that was discovered in Fall 2022 and remained highly active for several months. We report the detection of 35 FRBs from 541 hours of follow-up observations of this source using the recently refurbished Allen Telescope Array, covering 1344 MHz of bandwidth primarily centered at 1572 MHz. All 35 FRBs were detected in the lower half of the band with non-detections in the upper half and covered fluences from 4-431 Jy-ms (median$=$48.27 Jy-ms). We find consistency with previous repeater studies for a range of spectrotemporal features including: bursts with downward frequency drifting over time; a positive correlation between bandwidth and center frequency; and a decrease in sub-burst duration over time. We report an apparent decrease in the center frequency of observed bursts over the 2 months of the observing campaign (corresponding to a drop of $6.21\pm 0.76$ MHz per day). We predict a cut-off fluence for FRB 20220912A of $F_\textrm{max}\lesssim 10^4$ Jy-ms, for this source to be consistent with the all-sky rate, and find that FRB 20220912A significantly contributed to the all-sky FRB rate at a level of a few percent for fluences of $\sim$100 Jy-ms. Finally, we investigate characteristic timescales and sub-burst periodicities and find a) a median inter-subburst timescale of 5.82$\pm$1.16 ms in the multi-component bursts and b) no evidence of strict periodicity even in the most evenly-spaced multi-component burst in the sample. Our results demonstrate the importance of wideband observations of FRBs, and provide an important set of observational parameters against which to compare FRB progenitor and emission mechanism models.
Submitted 12 December, 2023;
originally announced December 2023.
-
Tightening QC Relaxations of AC Optimal Power Flow through Improved Linear Convex Envelopes
Authors:
Mohammad Rasoul Narimani,
Daniel K. Molzahn,
Katherine R. Davis,
Mariesa L. Crow
Abstract:
AC optimal power flow (AC OPF) is a fundamental problem in power system operations. Accurately modeling the network physics via the AC power flow equations makes AC OPF a challenging nonconvex problem. To search for global optima, recent research has developed a variety of convex relaxations that bound the optimal objective values of AC OPF problems. The well-known QC relaxation convexifies the AC OPF problem by enclosing the non-convex terms (trigonometric functions and products) within convex envelopes. The accuracy of this method strongly depends on the tightness of these envelopes. This paper proposes two improvements for tightening QC relaxations of OPF problems. We first consider a particular nonlinear function whose projections are the nonlinear expressions appearing in the polar representation of the power flow equations. We construct a convex envelope around this nonlinear function that takes the form of a polytope and then use projections of this envelope to obtain convex expressions for the nonlinear terms. Second, we use certain characteristics of the sine and cosine expressions along with the changes in their curvature to tighten this convex envelope. We also propose a coordinate transformation that rotates the power flow equations by an angle specific to each bus in order to obtain a tighter envelope. We demonstrate these improvements relative to a state-of-the-art QC relaxation implementation using the PGLib-OPF test cases. The results show improved optimality gaps in 68% of these cases.
Submitted 6 April, 2024; v1 submitted 22 October, 2023;
originally announced October 2023.
-
Atomic-scale investigation of $γ$-Ga$_2$O$_3$ deposited on MgAl$_2$O$_4$ and its relationship with $β$-Ga$_2$O$_3$
Authors:
J. Tang,
K. Jiang,
C. Xu,
M. J. Cabral,
K. Xiao,
L. M. Porter,
R. F. Davis
Abstract:
Nominally phase-pure $γ$-$Ga_2O_3$ was deposited on (100) $MgAl_2O_4$ within a narrow temperature window centered at $\sim$470 $^{\circ}$C using metal-organic chemical vapor deposition (MOCVD). The film deposited at 440 $^{\circ}$C exhibited either poor crystallization or an amorphous structure; the film grown at 500 $^{\circ}$C contained both $β$-$Ga_2O_3$ and $γ$-$Ga_2O_3$. A nominally phase-pure $β$-$Ga_2O_3$ film was obtained at 530 $^{\circ}$C. Atomic-resolution scanning transmission electron microscopy (STEM) investigations of the $γ$-$Ga_2O_3$ film grown at 470 $^{\circ}$C revealed a high density of antiphase boundaries. A planar defect model developed for $γ$-$Al_2O_3$ was extended to explain the stacking sequences of the Ga sublattice observed in the STEM images of $γ$-$Ga_2O_3$. The presence of the 180$^{\circ}$ rotational domains and 90$^{\circ}$ rotational domains of $β$-$Ga_2O_3$ inclusions within the $γ$-$Ga_2O_3$ matrix is discussed within the context of a comprehensive investigation of the epitaxial relationship between those two phases in the as-grown film at 470 $^{\circ}$C and the same film annealed at 600 $^{\circ}$C. The results led to the hypotheses that (i) incorporation of certain dopants, including Si, Ge, Sn, Mg, Al, and Sc, into $β$-$Ga_2O_3$ locally stabilizes the "$γ$-phase" and (ii) the site preference(s) for these dopants promotes the formation of the "$γ$-phase" and/or $γ$-$Ga_2O_3$ solid solutions. However, in the absence of such dopants, pure $γ$-$Ga_2O_3$ remains the least stable $Ga_2O_3$ polymorph, as indicated by its very narrow growth window, lower growth temperatures relative to other $Ga_2O_3$ polymorphs, and a calculated difference in Helmholtz free energy per formula unit between $γ$-$Ga_2O_3$ and $β$-$Ga_2O_3$ that is larger than that of all other polymorphs.
Submitted 20 October, 2023; v1 submitted 19 October, 2023;
originally announced October 2023.
-
The Spatial Distribution of the Unidentified 2.07 μm Absorption Feature on Europa and Implications for its Origin
Authors:
M. Ryleigh Davis,
Michael E. Brown,
Samantha K. Trumbo
Abstract:
A weak absorption feature at 2.07 μm on Europa's trailing hemisphere has been suggested to arise from radiolytic processing of an endogenic salt, possibly sourced from the interior ocean. However, if the genesis of this feature requires endogenic material to be present, one might expect to find a correlation between its spatial distribution and the recently disrupted chaos terrains. Using archived near-infrared observations from Very Large Telescope/SINFONI with a $\sim$1 nm spectral resolution and a linear spatial resolution $\sim$130 km, we examine the spatial distribution of this feature in an effort to explore this endogenic formation hypothesis. We find that while the presence of the 2.07 μm feature is strongly associated with the irradiation pattern on Europa's trailing hemisphere, there is no apparent association between the presence or depth of the absorption feature and Europa's large-scale chaos terrain. This spatial distribution suggests that the formation pathway of the 2.07 μm feature on Europa is independent of any endogenous salts within the recent geology. Instead, we propose that the source of this feature may simply be a product of the radiolytic sulfur cycle or arise from some unidentified parallel irradiation process. Notably, the 2.07 μm absorption band is absent from the Pwyll crater ejecta blanket, suggesting that radiolytic processing has not had enough time to form the species responsible and placing a lower limit on the irradiation timescale. We are unable to find a plausible spectral match to the 2.07 μm feature within the available laboratory data.
Submitted 26 August, 2023;
originally announced August 2023.
-
Federated Learning Based Distributed Localization of False Data Injection Attacks on Smart Grids
Authors:
Cihat Keçeci,
Katherine R. Davis,
Erchin Serpedin
Abstract:
Data analysis and monitoring on smart grids are jeopardized by attacks on cyber-physical systems. The false data injection attack (FDIA) is one class of such attacks, targeting smart measurement devices by injecting malicious data. Machine learning techniques have been shown to be effective in the detection and localization of FDIAs. However, training such models requires centralized processing of sensitive user data, which may not be feasible in a practical scenario. By employing federated learning, it is possible to train a model for the detection and localization of the attacks while preserving the privacy of sensitive user data. However, federated learning introduces new problems such as the personalization of the detectors in each node. In this paper, we propose a federated learning-based scheme combined with a hybrid deep neural network architecture that exploits the local correlations between the connected power buses by employing graph neural networks, as well as the temporal patterns in the data by using LSTM layers. The proposed mechanism offers flexible and efficient training of an FDIA detector in a distributed setup while preserving the privacy of the clients. We validate the proposed architecture by extensive simulations on the IEEE 57, 118, and 300 bus systems and real electricity load data.
Submitted 17 June, 2023;
originally announced June 2023.
-
TOI-4010: A System of Three Large Short-Period Planets With a Massive Long-Period Companion
Authors:
Michelle Kunimoto,
Andrew Vanderburg,
Chelsea X. Huang,
M. Ryleigh Davis,
Laura Affer,
Andrew Collier Cameron,
David Charbonneau,
Rosario Cosentino,
Mario Damasso,
Xavier Dumusque,
A. F. Martínez Fiorenzano,
Adriano Ghedina,
R. D. Haywood,
Florian Lienhard,
Mercedes López-Morales,
Michel Mayor,
Francesco Pepe,
Matteo Pinamonti,
Ennio Poretti,
Jesús Maldonado,
Ken Rice,
Alessandro Sozzetti,
Thomas G. Wilson,
Stéphane Udry,
Jay Baptista
, et al. (31 additional authors not shown)
Abstract:
We report the confirmation of three exoplanets transiting TOI-4010 (TIC-352682207), a metal-rich K dwarf observed by TESS in Sectors 24, 25, 52, and 58. We confirm these planets with HARPS-N radial velocity observations and measure their masses with 8 - 12% precision. TOI-4010 b is a sub-Neptune ($P = 1.3$ days, $R_{p} = 3.02_{-0.08}^{+0.08}~R_{\oplus}$, $M_{p} = 11.00_{-1.27}^{+1.29}~M_{\oplus}$) in the hot Neptune desert, and is one of the few such planets with known companions. Meanwhile, TOI-4010 c ($P = 5.4$ days, $R_{p} = 5.93_{-0.12}^{+0.11}~R_{\oplus}$, $M_{p} = 20.31_{-2.11}^{+2.13}~M_{\oplus}$) and TOI-4010 d ($P = 14.7$ days, $R_{p} = 6.18_{-0.14}^{+0.15}~R_{\oplus}$, $M_{p} = 38.15_{-3.22}^{+3.27}~M_{\oplus}$) are similarly-sized sub-Saturns on short-period orbits. Radial velocity observations also reveal a super-Jupiter-mass companion called TOI-4010 e in a long-period, eccentric orbit ($P \sim 762$ days and $e \sim 0.26$ based on available observations). TOI-4010 is one of the few systems with multiple short-period sub-Saturns to be discovered so far.
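The quoted 8 - 12% mass precision can be checked directly from the values in the abstract. The sketch below is only an illustration: averaging the lower and upper uncertainties into a single error bar is an assumption made here, not necessarily how the authors computed the figure, and the helper name `relative_precision` is hypothetical.

```python
# Mass and lower/upper uncertainties in Earth masses, copied from the abstract.
masses = {
    "TOI-4010 b": (11.00, 1.27, 1.29),
    "TOI-4010 c": (20.31, 2.11, 2.13),
    "TOI-4010 d": (38.15, 3.22, 3.27),
}


def relative_precision(value, err_minus, err_plus):
    """Percent precision, using the mean of the two error bars (an assumption)."""
    return 100.0 * 0.5 * (err_minus + err_plus) / value


precision = {name: relative_precision(*m) for name, m in masses.items()}
for name, pct in precision.items():
    print(f"{name}: {pct:.1f}%")
```

Under that assumption the three planets come out at roughly 11.6%, 10.4%, and 8.5%, consistent with the stated range.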
Submitted 19 June, 2023; v1 submitted 8 June, 2023;
originally announced June 2023.
-
Mitigating Molecular Aggregation in Drug Discovery with Predictive Insights from Explainable AI
Authors:
Hunter Sturm,
Jonas Teufel,
Kaitlin A. Isfeld,
Pascal Friederich,
Rebecca L. Davis
Abstract:
As the importance of high-throughput screening (HTS) continues to grow due to its value in early stage drug discovery and data generation for training machine learning models, there is a growing need for robust methods for pre-screening compounds to identify and prevent false-positive hits. Small, colloidally aggregating molecules are one of the primary sources of false-positive hits in high-throughput screens, making them an ideal candidate to target for removal from libraries using predictive pre-screening tools. However, a lack of understanding of the causes of molecular aggregation introduces difficulty in the development of predictive tools for detecting aggregating molecules. Herein, we present an examination of the molecular features differentiating datasets of aggregating and non-aggregating molecules, as well as a machine learning approach to predicting molecular aggregation. Our method uses explainable graph neural networks and counterfactuals to reliably predict and explain aggregation, giving additional insights and design rules for future screening. The integration of this method in HTS approaches will help combat false positives, providing better lead molecules more rapidly and thus accelerating drug discovery cycles.
Submitted 3 June, 2023;
originally announced June 2023.
-
What is missing in autonomous discovery: Open challenges for the community
Authors:
Phillip M. Maffettone,
Pascal Friederich,
Sterling G. Baird,
Ben Blaiszik,
Keith A. Brown,
Stuart I. Campbell,
Orion A. Cohen,
Tantum Collins,
Rebecca L. Davis,
Ian T. Foster,
Navid Haghmoradi,
Mark Hereld,
Nicole Jung,
Ha-Kyung Kwon,
Gabriella Pizzuto,
Jacob Rintamaki,
Casper Steinmann,
Luca Torresi,
Shi**g Sun
Abstract:
Self-driving labs (SDLs) leverage combinations of artificial intelligence, automation, and advanced computing to accelerate scientific discovery. The promise of this field has given rise to a rich community of passionate scientists, engineers, and social scientists, as evidenced by the development of the Acceleration Consortium and recent Accelerate Conference. Despite its strengths, this rapidly developing field presents numerous opportunities for growth, challenges to overcome, and potential risks of which to remain aware. This community perspective builds on a discourse instantiated during the first Accelerate Conference, and looks to the future of self-driving labs with a tempered optimism. Incorporating input from academia, government, and industry, we briefly describe the current status of self-driving labs, then turn our attention to barriers, opportunities, and a vision for what is possible. Our field is delivering solutions in technology and infrastructure, artificial intelligence and knowledge generation, and education and workforce development. In the spirit of community, we intend for this work to foster discussion and drive best practices as our field grows.
Submitted 2 May, 2023; v1 submitted 21 April, 2023;
originally announced April 2023.
-
Exploring celebrity influence on public attitude towards the COVID-19 pandemic: social media shared sentiment analysis
Authors:
Brianna M White,
Chad A Melton,
Parya Zareie,
Robert L Davis,
Robert A Bednarczyk,
Arash Shaban-Nejad
Abstract:
The COVID-19 pandemic has introduced new opportunities for health communication, including an increase in the public use of online outlets for health-related emotions. People have turned to social media networks to share sentiments related to the impacts of the COVID-19 pandemic. In this paper we examine the role of social messaging shared by Persons in the Public Eye (i.e. athletes, politicians, news personnel) in determining overall public discourse direction. We harvested approximately 13 million tweets ranging from 1 January 2020 to 1 March 2022. The sentiment was calculated for each tweet using a fine-tuned DistilRoBERTa model, which was used to compare COVID-19 vaccine-related Twitter posts (tweets) that co-occurred with mentions of People in the Public Eye. Our findings suggest the presence of consistent patterns of emotional content co-occurring with messaging shared by Persons in the Public Eye for the first two years of the COVID-19 pandemic influenced public opinion and largely stimulated online public discourse. We demonstrate that as the pandemic progressed, public sentiment shared on social networks was shaped by risk perceptions, political ideologies and health-protective behaviours shared by Persons in the Public Eye, often in a negative light.
Submitted 23 February, 2023;
originally announced March 2023.
-
Clustering Multivariate Time Series using Energy Distance
Authors:
Richard A. Davis,
Leon Fernandes,
Konstantinos Fokianos
Abstract:
A novel methodology is proposed for clustering multivariate time series data using energy distance defined in Székely and Rizzo (2013). Specifically, a dissimilarity matrix is formed using the energy distance statistic to measure separation between the finite dimensional distributions for the component time series. Once the pairwise dissimilarity matrix is calculated, a hierarchical clustering method is then applied to obtain the dendrogram. This procedure is completely nonparametric as the dissimilarities between stationary distributions are directly calculated without making any model assumptions. In order to justify this procedure, asymptotic properties of the energy distance estimates are derived for general stationary and ergodic time series. The method is illustrated in a simulation study for various component time series that are either linear or nonlinear. Finally, the methodology is applied to two examples: one involves GDP of selected countries and the other is population size of various states in the U.S.A. in the years 1900-1999.
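The two-step procedure described above (pairwise energy distances, then hierarchical clustering) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses the standard sample form 2E|X-Y| - E|X-X'| - E|Y-Y'| on the observed vectors of each series, substitutes a simple average-linkage routine for whatever linkage the paper uses, and runs on synthetic i.i.d. Gaussian data rather than real time series. All function names are hypothetical.

```python
import itertools
import math
import random


def mean_pairwise_distance(xs, ys):
    """Average Euclidean distance between every point in xs and every point in ys."""
    total = sum(math.dist(x, y) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def energy_distance(xs, ys):
    """Sample (V-statistic) energy distance: 2 E|X-Y| - E|X-X'| - E|Y-Y'|."""
    return (2.0 * mean_pairwise_distance(xs, ys)
            - mean_pairwise_distance(xs, xs)
            - mean_pairwise_distance(ys, ys))


def dissimilarity_matrix(series):
    """Symmetric matrix of pairwise energy distances between the series."""
    n = len(series)
    d = [[0.0] * n for _ in range(n)]
    for i, j in itertools.combinations(range(n), 2):
        d[i][j] = d[j][i] = energy_distance(series[i], series[j])
    return d


def average_linkage(d):
    """Agglomerative clustering; returns the merges in the order they happen."""
    clusters = [(i,) for i in range(len(d))]
    merges = []
    while len(clusters) > 1:
        def linkage(pair):
            a, b = pair
            return sum(d[i][j] for i in a for j in b) / (len(a) * len(b))
        a, b = min(itertools.combinations(clusters, 2), key=linkage)
        merges.append((a, b, linkage((a, b))))
        clusters = [c for c in clusters if c not in (a, b)] + [tuple(sorted(a + b))]
    return merges


random.seed(0)
# Four toy bivariate series: two centred at 0, two centred at 3 (synthetic data).
series = [[(random.gauss(mu, 1.0), random.gauss(mu, 1.0)) for _ in range(60)]
          for mu in (0.0, 0.0, 3.0, 3.0)]
D = dissimilarity_matrix(series)
merges = average_linkage(D)
```

The list `merges` plays the role of the dendrogram: series drawn from the same distribution are joined first, and the two groups are joined last. Comparing higher-order finite dimensional distributions would amount to passing lagged blocks of observations instead of single time points.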
Submitted 24 March, 2023;
originally announced March 2023.
-
Weighted Ehrhart Theory: Extending Stanley's nonnegativity theorem
Authors:
Esme Bajo,
Robert Davis,
Jesús A. De Loera,
Alexey Garber,
Sofía Garzón Mora,
Katharina Jochemko,
Josephine Yu
Abstract:
We generalize R. P. Stanley's celebrated theorem that the $h^\ast$-polynomial of the Ehrhart series of a rational polytope has nonnegative coefficients and is monotone under containment of polytopes. We show that these results continue to hold for weighted Ehrhart series where lattice points are counted with polynomial weights, as long as the weights are homogeneous polynomials decomposable as sums of products of linear forms that are nonnegative on the polytope. We also show nonnegativity of the $h^\ast$-polynomial as a real-valued function for a larger family of weights.
We then target the case when the weight function is the square of a single (arbitrary) linear form. We show stronger results for two-dimensional convex lattice polygons and give concrete examples showing tightness of the hypotheses. As an application, we construct a counterexample to a conjecture by Berg, Jochemko, and Silverstein on Ehrhart tensor polynomials.
Submitted 11 March, 2024; v1 submitted 16 March, 2023;
originally announced March 2023.
-
An Extended Model for Ecological Robustness to Capture Power System Resilience
Authors:
Hao Huang,
Katherine R. Davis,
H. Vincent Poor
Abstract:
The long-term resilient property of ecosystems has been quantified as ecological robustness (RECO) in terms of the energy transfer over food webs. The RECO of resilient ecosystems favors a balance of food webs' network efficiency and redundancy. By integrating RECO with power system constraints, the authors are able to optimize power systems' inherent resilience as ecosystems through network design and system operation. A previous model used on real power flows and aggregated redundant components for a rigorous mapping between ecosystems and power systems. However, the reactive power flows also determine power systems' resilience, and the power components' redundancy is part of the global network redundancy. These characteristics should be considered for RECO-oriented evaluation and optimization for power systems. Thus, this paper extends the model for quantifying RECO in power systems using real, reactive, and apparent power flows with the consideration of redundant placement of generators. Recalling the performance of RECO-oriented optimal power flows under N-x contingencies, the analyses suggest reactive power flows and redundant components should be included for RECO to capture power systems' inherent resilience.
Submitted 1 October, 2023; v1 submitted 7 March, 2023;
originally announced March 2023.
-
Ising Meson Spectroscopy on a Noisy Digital Quantum Simulator
Authors:
Christopher Lamb,
Yicheng Tang,
Robert Davis,
Ananda Roy
Abstract:
Quantum simulation has the potential to be an indispensable technique for the investigation of non-perturbative phenomena in strongly-interacting quantum field theories (QFTs). In the modern quantum era, with Noisy Intermediate Scale Quantum (NISQ) simulators widely available and larger-scale quantum machines on the horizon, it is natural to ask: what non-perturbative QFT problems can be solved with the existing quantum hardware? We show that existing noisy quantum machines can be used to analyze the energy spectrum of a large family of strongly-interacting 1+1D QFTs. The latter exhibit a wide range of non-perturbative effects like 'quark confinement' and 'false vacuum decay' which are typically associated with higher-dimensional QFTs of elementary particles. We perform quench experiments on IBM's ibmq_mumbai quantum simulator to compute the energy spectrum of the 1+1D quantum Ising model with a longitudinal field. The latter model is particularly interesting due to the formation of mesonic bound states arising from a confining potential for the Ising domain-walls, reminiscent of 't Hooft's model of two-dimensional quantum chromodynamics. Our results demonstrate that digital quantum simulation in the NISQ era has the potential to be a viable alternative to numerical techniques such as density matrix renormalization group or the truncated conformal space methods for analyzing QFTs.
Submitted 10 June, 2024; v1 submitted 6 March, 2023;
originally announced March 2023.
-
Initial validation of a soil-based mass-balance approach for empirical monitoring of enhanced rock weathering rates
Authors:
Tom Reershemius,
Mike E. Kelland,
Jacob S. Jordan,
Isabelle R. Davis,
Rocco D'Ascanio,
Boriana Kalderon-Asael,
Dan Asael,
T. Jesper Suhrhoff,
Dimitar Z. Epihov,
David J. Beerling,
Christopher T. Reinhard,
Noah J. Planavsky
Abstract:
Enhanced Rock Weathering (ERW) is a promising scalable and cost-effective Carbon Dioxide Removal (CDR) strategy with significant environmental and agronomic co-benefits. A major barrier to large-scale implementation of ERW is a robust Monitoring, Reporting, and Verification (MRV) framework. To successfully quantify the amount of carbon dioxide removed by ERW, MRV must be accurate, precise, and cost-effective. Here, we outline a mass-balance-based method where analysis of the chemical composition of soil samples is used to track in-situ silicate rock weathering. We show that signal-to-noise issues of in-situ soil analysis can be mitigated by using isotope-dilution mass spectrometry to reduce analytical error. We implement a proof-of-concept experiment demonstrating the method in controlled mesocosms. In our experiment, basalt rock feedstock is added to soil columns containing the cereal crop Sorghum bicolor at a rate equivalent to 50 t ha$^{-1}$. Using our approach, we calculate rock weathering corresponding to an average initial CDR value of 1.44 +/- 0.27 tCO$_2$eq ha$^{-1}$ from our experiments after 235 days, within error of an independent estimate calculated using conventional elemental budgeting of reaction products. Our method provides a robust time-integrated estimate of initial CDR, to feed into models that track and validate large-scale carbon removal through ERW.
Submitted 22 October, 2023; v1 submitted 9 February, 2023;
originally announced February 2023.
-
The James Webb Space Telescope Mission: Optical Telescope Element Design, Development, and Performance
Authors:
Michael W. McElwain,
Lee D. Feinberg,
Marshall D. Perrin,
Mark Clampin,
C. Matt Mountain,
Matthew D. Lallo,
Charles-Philippe Lajoie,
Randy A. Kimble,
Charles W. Bowers,
Christopher C. Stark,
D. Scott Acton,
Ken Aiello,
Charles Atkinson,
Beth Barinek,
Allison Barto,
Scott Basinger,
Tracy Beck,
Matthew D. Bergkoetter,
Marcel Bluth,
Rene A. Boucarut,
Gregory R. Brady,
Keira J. Brooks,
Bob Brown,
John Byard,
Larkin Carey
, et al. (104 additional authors not shown)
Abstract:
The James Webb Space Telescope (JWST) is a large, infrared space telescope that has recently started its science program which will enable breakthroughs in astrophysics and planetary science. Notably, JWST will provide the very first observations of the earliest luminous objects in the Universe and start a new era of exoplanet atmospheric characterization. This transformative science is enabled by a 6.6 m telescope that is passively cooled with a 5-layer sunshield. The primary mirror is comprised of 18 controllable, low areal density hexagonal segments, that were aligned and phased relative to each other in orbit using innovative image-based wavefront sensing and control algorithms. This revolutionary telescope took more than two decades to develop with a widely distributed team across engineering disciplines. We present an overview of the telescope requirements, architecture, development, superb on-orbit performance, and lessons learned. JWST successfully demonstrates a segmented aperture space telescope and establishes a path to building even larger space telescopes.
Submitted 4 January, 2023;
originally announced January 2023.
-
The Risks of Ranking: Revisiting Graphical Perception to Model Individual Differences in Visualization Performance
Authors:
Russell Davis,
Xiaoying Pu,
Yiren Ding,
Brian D. Hall,
Karen Bonilla,
Mi Feng,
Matthew Kay,
Lane Harrison
Abstract:
Graphical perception studies typically measure visualization encoding effectiveness using the error of an "average observer", leading to canonical rankings of encodings for numerical attributes: e.g., position > area > angle > volume. Yet different people may vary in their ability to read different visualization types, leading to variance in this ranking across individuals not captured by population-level metrics using "average observer" models. One way we can bridge this gap is by recasting classic visual perception tasks as tools for assessing individual performance, in addition to overall visualization performance. In this paper we replicate and extend Cleveland and McGill's graphical comparison experiment using Bayesian multilevel regression, using these models to explore individual differences in visualization skill from multiple perspectives. The results from experiments and modeling indicate that some people show patterns of accuracy that credibly deviate from the canonical rankings of visualization effectiveness. We discuss implications of these findings, such as a need for new ways to communicate visualization effectiveness to designers, how patterns in individuals' responses may show systematic biases and strategies in visualization judgment, and how recasting classic visual perception tasks as tools for assessing individual performance may offer new ways to quantify aspects of visualization literacy. Experiment data, source code, and analysis scripts are available at the following repository: https://osf.io/8ub7t/?view_only=9be4798797404a4397be3c6fc2a68cc0.
Submitted 21 December, 2022; v1 submitted 20 December, 2022;
originally announced December 2022.
-
Protected Attributes Tell Us Who, Behavior Tells Us How: A Comparison of Demographic and Behavioral Oversampling for Fair Student Success Modeling
Authors:
Jade Maï Cock,
Muhammad Bilal,
Richard Davis,
Mirko Marras,
Tanja Käser
Abstract:
Algorithms deployed in education can shape the learning experience and success of a student. It is therefore important to understand whether and how such algorithms might create inequalities or amplify existing biases. In this paper, we analyze the fairness of models which use behavioral data to identify at-risk students and suggest two novel pre-processing approaches for bias mitigation. Based on the concept of intersectionality, the first approach involves intelligent oversampling on combinations of demographic attributes. The second approach does not require any knowledge of demographic attributes and is based on the assumption that such attributes are a (noisy) proxy for student behavior. We hence propose to directly oversample different types of behaviors identified in a cluster analysis. We evaluate our approaches on data from (i) an open-ended learning environment and (ii) a flipped classroom course. Our results show that both approaches can mitigate model bias. Directly oversampling on behavior is a valuable alternative when demographic metadata is not available. Source code and extended results are provided at https://github.com/epfl-ml4ed/behavioral-oversampling.
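The idea of oversampling on behavior clusters rather than demographic attributes can be illustrated with a small sketch. It is not the authors' method: plain random oversampling with replacement is substituted here for their approach, the cluster labels are assumed to be already available from some prior cluster analysis, and the function name `oversample_by_group` and the toy records are hypothetical.

```python
import random
from collections import Counter, defaultdict


def oversample_by_group(rows, group_of, seed=0):
    """Duplicate rows at random until every group is as large as the biggest one."""
    rng = random.Random(seed)
    groups = defaultdict(list)
    for row in rows:
        groups[group_of(row)].append(row)
    target = max(len(members) for members in groups.values())
    balanced = []
    for members in groups.values():
        balanced.extend(members)
        # Draw the shortfall with replacement from the group's own members.
        balanced.extend(rng.choices(members, k=target - len(members)))
    return balanced


# Toy student records with a precomputed behavior-cluster label (hypothetical data).
rows = [{"id": i, "cluster": c} for i, c in enumerate("AAAAAABBC")]
balanced = oversample_by_group(rows, lambda r: r["cluster"])
counts = Counter(r["cluster"] for r in balanced)
```

Swapping the `group_of` function for one that returns a tuple of demographic attributes gives the analogous intersectional variant, again only as a rough approximation of what the abstract describes.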
Submitted 20 December, 2022;
originally announced December 2022.
-
Spectroscopic mapping of Io's surface with HST/STIS: SO$_2$ frost, sulfur allotropes, and large-scale compositional patterns
Authors:
Samantha K. Trumbo,
M. Ryleigh Davis,
Benjamin Cassese,
Michael E. Brown
Abstract:
Io's intense volcanic activity results in one of the most colorful surfaces in the solar system. Ultraviolet and visible-wavelength observations of Io are critical to uncovering the chemistry behind its volcanic hues. Here, we present global, spatially resolved UV-visible spectra of Io from the Space Telescope Imaging Spectrograph on the Hubble Space Telescope (HST), which bridge the gap between previous highly resolved imagery and disk-integrated spectroscopy, to provide an unprecedented combination of spatial and spectral detail. We use this comprehensive dataset to investigate spectral endmembers, map observed spectral features associated with SO$_2$ frost and other sulfur species, and explore possible compositions in the context of Io surface processes. In agreement with past observations, our results are consistent with extensive equatorial SO$_2$ frost deposits that are stable over multi-decade timescales, widespread sulfur-rich plains surrounding the SO$_2$ deposits, and the enrichment of Pele's pyroclastic ring and the high-latitude regions in metastable short-chain sulfur allotropes.
Submitted 16 December, 2022;
originally announced December 2022.
-
3UCubed: The IMAP Student Collaboration CubeSat Project
Authors:
Marcus Alfred,
Sonya Smith,
Charles Kim,
Carissma McGee,
Ruth Davis,
Myles Pope,
Taran Richardson,
Trinity Sager,
Avery Williams,
Matthew Gales,
Wilson Jean Baptiste,
Tyrese Kierstdet,
Oluwatamilore Ogunbanjo,
Laura Peticolas,
Lynn Cominsky,
Garrett Jernigan,
Jeffrey Reedy,
Doug Clarke,
Sabrina Blais,
Erik Castellanos-Vasquez,
Jack Dawson,
Erika Diaz Ramirez,
Walter Foster,
Cristopher Gopar Carreno,
Haley Joerger
, et al. (17 additional authors not shown)
Abstract:
The 3UCubed project is a 3U CubeSat being jointly developed by the University of New Hampshire, Sonoma State University, and Howard University as a part of the NASA Interstellar Mapping and Acceleration Probe, IMAP, student collaboration. This project comprises a multidisciplinary team of undergraduate students from all three universities. The mission goal of the 3UCubed is to understand how Earth's polar upper atmosphere, the thermosphere in Earth's auroral regions, responds to particle precipitation, solar wind forcing, and internal magnetospheric processes.
3UCubed includes two instruments with rocket heritage to achieve the science mission: an ultraviolet photomultiplier tube, UVPMT, and an electron retarding potential analyzer, ERPA. The spacecraft bus consists of the following subsystems: Attitude Determination and Control, Command and Data Handling, Power, Communication, Structural, and Thermal.
Currently, the project is in the post-PDR stage, starting to build and test engineering models to develop a FlatSat prior to critical design review in 2023. The goal is to launch at least one 3U CubeSat to collect science data close to the anticipated peak of Solar Cycle 25 around July 2025. Our mother mission, IMAP, is also projected to launch in 2025, which will let us jointly analyze the science data of the main mission, providing the solar wind measurements and inputs to the magnetosphere, with that of 3UCubed, providing the response of Earth's cusp to these inputs.
Submitted 28 November, 2022;
originally announced November 2022.
-
Fine-tuned Sentiment Analysis of COVID-19 Vaccine-Related Social Media Data: Comparative Study
Authors:
Chad A Melton,
Brianna M White,
Robert L Davis,
Robert A Bednarczyk,
Arash Shaban-Nejad
Abstract:
This study investigated and compared public sentiment related to COVID-19 vaccines expressed on two popular social media platforms, Reddit and Twitter, harvested from January 1, 2020, to March 1, 2022. To accomplish this task, we created a fine-tuned DistilRoBERTa model to predict sentiments of approximately 9.5 million Tweets and 70 thousand Reddit comments. To fine-tune our model, our team manually labeled the sentiment of 3600 Tweets and then augmented our dataset by the method of back-translation. Text sentiment for each social media platform was then classified with our fine-tuned model using Python and the Huggingface sentiment analysis pipeline. Our results determined that the average sentiment expressed on Twitter was more negative (52% positive) than positive and the sentiment expressed on Reddit was more positive than negative (53% positive). Though average sentiment was found to vary between these social media platforms, both displayed similar behavior related to sentiment shared at key vaccine-related developments during the pandemic. Considering this similar trend in shared sentiment demonstrated across social media platforms, Twitter and Reddit continue to be valuable data sources that public health officials can utilize to strengthen vaccine confidence and combat misinformation. As the spread of misinformation poses a range of psychological and psychosocial risks (anxiety, fear, etc.), there is an urgency in understanding the public perspective and attitude toward shared falsities. Comprehensive educational delivery systems tailored to the population's expressed sentiments that facilitate digital literacy, health information-seeking behavior, and precision health promotion could aid in clarifying such misinformation.
Submitted 17 October, 2022;
originally announced November 2022.
-
Kernel PCA for multivariate extremes
Authors:
Marco Avella-Medina,
Richard A. Davis,
Gennady Samorodnitsky
Abstract:
We propose kernel PCA as a method for analyzing the dependence structure of multivariate extremes and demonstrate that it can be a powerful tool for clustering and dimension reduction. Our work provides some theoretical insight into the preimages obtained by kernel PCA, demonstrating that under certain conditions they can effectively identify clusters in the data. We build on these new insights to characterize rigorously the performance of kernel PCA based on an extremal sample, i.e., the angular part of random vectors for which the radius exceeds a large threshold. More specifically, we focus on the asymptotic dependence of multivariate extremes characterized by the angular or spectral measure in extreme value theory and provide a careful analysis in the case where the extremes are generated from a linear factor model. We give theoretical guarantees on the performance of kernel PCA preimages of such extremes by leveraging their asymptotic distribution together with Davis-Kahan perturbation bounds. Our theoretical findings are complemented with numerical experiments illustrating the finite sample performance of our methods.
Submitted 23 November, 2022; v1 submitted 23 November, 2022;
originally announced November 2022.
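The "extremal sample" described in the abstract above (the angular part of vectors whose radius exceeds a large threshold) can be sketched as follows. This is a minimal illustration, not the authors' code: the Euclidean norm, the empirical-quantile threshold, and the function name `extremal_sample` are all assumptions made for the example.

```python
import math


def extremal_sample(points, quantile=0.9):
    """Return the angular parts x/||x|| of the points whose radius is extreme.

    Assumptions (not taken from the paper): the Euclidean norm is used as the
    radius, and the threshold is an empirical quantile of the observed radii.
    """
    radii = [math.sqrt(sum(c * c for c in p)) for p in points]
    ranked = sorted(radii)
    # Index of the empirical quantile, clamped to the last element.
    threshold = ranked[min(int(quantile * len(ranked)), len(ranked) - 1)]
    # Keep only points strictly beyond the threshold and project to the sphere.
    return [
        tuple(c / r for c in p)
        for p, r in zip(points, radii)
        if r > threshold
    ]


if __name__ == "__main__":
    pts = [(3.0, 4.0), (0.1, 0.1), (0.0, 10.0), (1.0, 0.0)]
    print(extremal_sample(pts, quantile=0.25))
```

The resulting unit vectors are what a method such as kernel PCA or clustering would then be applied to.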
-
Bead-Droplet Reactor for High-Fidelity Solid-Phase Enzymatic DNA Synthesis
Authors:
Punnag Padhy,
Mohammad Asif Zaman,
Michael Anthony Jensen,
Yao-Te Cheng,
Yogi Huang,
Ludwig Galambos,
Ronald Wayne Davis,
Lambertus Hesselink
Abstract:
Solid-phase synthesis techniques underpin the synthesis of DNA, oligopeptides, oligosaccharides, and combinatorial libraries for drug discovery. State-of-the-art solid-phase synthesizers can produce oligonucleotides up to 200-300 nucleotides while using excess reagents. Accumulated errors over multiple reaction cycles prevent the synthesis of longer oligonucleotides for the genome-scale engineering of synthetic biological systems. The sources of these errors in synthesis columns remain poorly understood. Here we show that bead-bead stacking significantly contributes to reaction errors in columns by analyzing enzymatic coupling of fluorescently labelled nucleotides onto the initiated beads along with porosity, particle tracking and diffusion calculations. To circumvent stacking, we introduce the dielectrophoretic bead-droplet reactor (DBDR), a novel approach to synthesize on individual microbeads within microdroplets. Dielectrophoretic force overcomes the droplet-medium interfacial tension to encapsulate and eject individual beads from microdroplets in a droplet microfluidic device. Faster reagent diffusion in droplets, and non-uniform electric field induced enhancement in reagent concentration at its surface can improve reaction fidelities in DBDR. Fluorescence comparisons suggest around 3-fold enhancement of reaction fidelity compared to columns. DBDR can potentially enable the high-purity synthesis of arbitrarily long strands of DNA to meet the emerging demands in healthcare, environment, agriculture, materials, and computing.
Submitted 1 June, 2023; v1 submitted 12 November, 2022;
originally announced November 2022.
-
Design and CT imaging of Casper, an anthropomorphic breathing thorax phantom
Authors:
Josie Laidlaw,
Nicolas Earl,
Nihal Shavdia,
Rayna Davis,
Sarah Mayer,
Dmitri Karaman,
Devon Richtsmeier,
Pierre-Antoine Rodesch,
Magdalena Bazalova-Carter
Abstract:
The goal of this work was to build an anthropomorphic thorax phantom capable of breathing motion with materials mimicking human tissues in x-ray imaging applications. The thorax phantom, named Casper, was composed of resin (body), foam (lungs), glow polylactic acid (bones) and natural polylactic acid (tumours placed in the lungs). X-ray attenuation properties of all materials prior to manufacturing were evaluated by means of photon-counting computed tomography (CT) imaging on a table-top system. Breathing motion was achieved by a scotch-yoke mechanism with diaphragm motion frequencies of 10 to 20 rpm and displacements of 1 to 2 cm. Casper was manufactured by means of 3D printing of moulds and ribs and assembled in a complex process. The final phantom was then scanned using a clinical CT scanner to evaluate material CT numbers and the extent of tumour motion. Casper CT numbers were close to human CT numbers for soft tissue (46 HU), ribs (125 HU), lungs (-840 HU) and tumours (-45 HU). For a 2 cm diaphragm displacement the largest tumour displacement was 0.7 cm. The five tumour volumes were accurately assessed in the static CT images with a mean absolute error of 4.3%. Tumour sizes were either underestimated for smaller tumours or overestimated for larger tumours in dynamic CT images due to motion blurring with a mean absolute difference from true volumes of 10.3%.
Submitted 28 September, 2022;
originally announced September 2022.
-
Perfectly Matchable Set Polynomials and $h^*$-polynomials for Stable Set Polytopes of Complements of Graphs
Authors:
Robert Davis,
Florian Kohl
Abstract:
A subset $S$ of vertices of a graph $G$ is called a perfectly matchable set of $G$ if the subgraph induced by $S$ contains a perfect matching. The perfectly matchable set polynomial of $G$, first made explicit by Ohsugi and Tsuchiya, is the (ordinary) generating function $p(G; z)$ for the number of perfectly matchable sets of $G$.
In this work, we provide explicit recurrences for computing $p(G; z)$ for an arbitrary (simple) graph and use these to compute the Ehrhart $h^*$-polynomials for certain lattice polytopes. Namely, we show that $p(G; z)$ is the $h^*$-polynomial for certain classes of stable set polytopes, whose vertices correspond to stable sets of $G$.
Submitted 29 July, 2022;
originally announced July 2022.
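The definition of a perfectly matchable set in the abstract above can be checked by brute force on small graphs. The sketch below is only an illustration and is not the recurrence from the paper: it enumerates vertex subsets directly, counts the empty set as perfectly matchable, and reports counts indexed by subset size (how these counts map onto powers of $z$ in $p(G; z)$ is left open here, since the abstract does not state the exponent convention). The function names are hypothetical.

```python
from itertools import combinations


def has_perfect_matching(vertices, edges):
    """True if the subgraph induced on `vertices` contains a perfect matching."""
    if not vertices:
        return True
    v = min(vertices)
    # The smallest vertex must be matched to one of its neighbours in the set.
    for u in vertices:
        if u != v and frozenset((u, v)) in edges:
            if has_perfect_matching(vertices - {u, v}, edges):
                return True
    return False


def matchable_set_counts(n, edge_list):
    """Map subset size -> number of perfectly matchable sets of that size.

    Vertices are 0..n-1. Exponential time; intended for tiny graphs only.
    """
    edges = {frozenset(e) for e in edge_list}
    counts = {}
    for size in range(0, n + 1, 2):  # odd-sized sets can never be matched
        total = sum(
            has_perfect_matching(frozenset(s), edges)
            for s in combinations(range(n), size)
        )
        if total:
            counts[size] = total
    return counts


if __name__ == "__main__":
    # Path on four vertices: 0 - 1 - 2 - 3
    print(matchable_set_counts(4, [(0, 1), (1, 2), (2, 3)]))
```

For the path on four vertices, the perfectly matchable sets are the empty set, the three edges, and the full vertex set.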
-
RF-Photonic Deep Learning Processor with Shannon-Limited Data Movement
Authors:
Ronald Davis III,
Zaijun Chen,
Ryan Hamerly,
Dirk Englund
Abstract:
Edholm's Law predicts exponential growth in data rate and spectrum bandwidth for communications and is forecasted to remain true for the upcoming deployment of 6G. Compounding this issue is the exponentially increasing demand for deep neural network (DNN) compute, including DNNs for signal processing. However, the slowing of Moore's Law due to the limitations of transistor-based electronics means that completely new paradigms for computing will be required to meet these increasing demands for advanced communications. Optical neural networks (ONNs) are promising DNN accelerators with ultra-low latency and energy consumption. Yet state-of-the-art ONNs struggle with scalability and implementing linear with in-line nonlinear operations. Here we introduce our multiplicative analog frequency transform ONN (MAFT-ONN) that encodes the data in the frequency domain, achieves matrix-vector products in a single shot using photoelectric multiplication, and uses a single electro-optic modulator for the nonlinear activation of all neurons in each layer. We experimentally demonstrate the first hardware accelerator that computes fully-analog deep learning on raw RF signals, performing single-shot modulation classification with 85% accuracy, where a 'majority vote' multi-measurement scheme can boost the accuracy to 95% within 5 consecutive measurements. In addition, we demonstrate frequency-domain finite impulse response (FIR) linear-time-invariant (LTI) operations, enabling a powerful combination of traditional and AI signal processing. We also demonstrate the scalability of our architecture by computing nearly 4 million fully-analog multiplies-and-accumulates for MNIST digit classification. Our latency estimation model shows that due to the Shannon capacity-limited analog data movement, MAFT-ONN is hundreds of times faster than traditional RF receivers operating at their theoretical peak performance.
Submitted 6 June, 2024; v1 submitted 8 July, 2022;
originally announced July 2022.
-
Deep Learning with Coherent VCSEL Neural Networks
Authors:
Zaijun Chen,
Alexander Sludds,
Ronald Davis,
Ian Christen,
Liane Bernstein,
Tobias Heuser,
Niels Heermeier,
James A. Lott,
Stephan Reitzenstein,
Ryan Hamerly,
Dirk Englund
Abstract:
Deep neural networks (DNNs) are reshaping the field of information processing. With their exponential growth challenging existing electronic hardware, optical neural networks (ONNs) are emerging to process DNN tasks in the optical domain with high clock rates, parallelism and low-loss data transmission. However, to explore the potential of ONNs, it is necessary to investigate the full-system performance incorporating the major DNN elements, including matrix algebra and nonlinear activation. Existing challenges to ONNs are high energy consumption due to low electro-optic (EO) conversion efficiency, low compute density due to large device footprint and channel crosstalk, and long latency due to the lack of inline nonlinearity. Here we experimentally demonstrate an ONN system that simultaneously overcomes all these challenges. We exploit neuron encoding with volume-manufactured micron-scale vertical-cavity surface-emitting laser (VCSEL) transmitter arrays that exhibit high EO conversion (<5 attojoule/symbol with $V_π$=4 mV), high operation bandwidth (up to 25 GS/s), and compact footprint (<0.01 mm$^2$ per device). Photoelectric multiplication allows low-energy matrix operations at the shot-noise quantum limit. Homodyne detection-based nonlinearity enables nonlinear activation with instantaneous response. The full-system energy efficiency and compute density reach 7 femtojoules per operation (fJ/OP) and 25 TeraOP/(mm$^2\cdot$ s), both representing a >100-fold improvement over state-of-the-art digital computers, with room for several more orders of magnitude of future improvement. Beyond neural network inference, its feature of rapid weight updating is crucial for training deep learning models. Our technique opens an avenue to large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices.
Submitted 12 July, 2022;
originally announced July 2022.
-
Electromagnetic Nonreciprocity in a Magnetized Plasma Circulator
Authors:
Feng Li,
Robert J. Davis,
Sara M. Kandil,
Daniel F. Sievenpiper
Abstract:
Nonreciprocal transport of electromagnetic waves within magnetized plasma is a powerful building block towards understanding and exploiting the properties of more general topological systems. Much recent attention has been paid to the theoretical issues of wave interaction within such a medium, but there is a lack of experimental verification that such systems can be viable in a lab or industrial setting. This work provides an experimental proof-of-concept by demonstrating nonreciprocity in a unit component, a microwave plasma circulator. We design an E-plane Y junction plasma circulator operating in the range of 4 to 6 GHz using standardized waveguide specifications. From both simulations and experiments, we observe wide band isolation for the power transmission through the circulator. The performance and the frequency band of the circulator can be easily tuned by changing the plasma density and the magnetic field strength. By linking simulations and experimental results, we estimate the plasma density for the device.
Submitted 29 November, 2022; v1 submitted 24 June, 2022;
originally announced June 2022.
-
Finding the ET Signal from the Cosmic Noise
Authors:
Ross Davis
Abstract:
This paper highlights a methodological approach designed to enhance the search for extraterrestrial intelligence (SETI) by hypothesizing that a transmission technosignature would likely have two features: 1) be wideband in the microwave or higher frequency range that originates from a hub within a supposed ET interplanetary navigation/communication (nav/comm) network, and 2) contain x-ray pulsar-based navigation (XNAV) metadata.
Potential contributions to the field include improved accuracy in finding transmission technosignatures and other technosignatures in the electromagnetic spectrum, a common standard in reaching a Schelling Point (a mutual realization of how we and ETs can find each other), and operationalizing models such as the Drake Equation.
Submitted 18 February, 2024; v1 submitted 9 April, 2022;
originally announced April 2022.
-
Maximization of Mathai's Entropy under the Constraints of Generalized Gini and Gini mean difference indices and its Applications in Insurance
Authors:
Rhea Davis,
Nicy Sebastian
Abstract:
Statistical Physics, Diffusion Entropy Analysis and Information Theory commonly use Mathai's entropy, which measures the randomness of probability laws, whereas welfare economics and the Social Sciences commonly use the Gini index, which measures the evenness of probability laws. Motivated by the principle of maximal entropy, we explore the maximization of Mathai's entropy subject to the conditions in the following scenarios: (i) the conditions of a density function and fixed mean; (ii) the conditions of a density function and fixed Generalized Gini index. We also maximize Mathai's entropy subject to the constraints of a given Gini mean difference index and the conditions of a density function. The obtained maximum entropy distribution is fitted to the loss ratios (yearly data) for earthquake insurance in California from 1971 through 1994, and its performance is compared with that of some one-parameter distributions.
Submitted 12 March, 2022;
originally announced March 2022.
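For readers unfamiliar with the constraints named in the abstract above, the classical sample versions of the Gini mean difference and the Gini index can be sketched as below. This shows only the textbook quantities, not the Generalized Gini index or the entropy maximization studied in the paper. The normalization by $n^2$ (rather than $n(n-1)$) and the function names are assumptions made for the example.

```python
def gini_mean_difference(xs):
    """Average absolute difference over all ordered pairs (i, j).

    Assumption: pairs with i == j are included, so the sum is divided by n**2.
    """
    n = len(xs)
    total = sum(abs(a - b) for a in xs for b in xs)
    return total / (n * n)


def gini_index(xs):
    """Classical Gini index: mean difference divided by twice the mean."""
    mean = sum(xs) / len(xs)
    return gini_mean_difference(xs) / (2 * mean)


if __name__ == "__main__":
    sample = [1.0, 2.0, 3.0, 4.0]
    print(gini_mean_difference(sample), gini_index(sample))
```

A perfectly even sample (all values equal) gives a Gini index of zero under this convention.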
-
Normalized Volumes of Type-PQ Adjacency Polytopes for Certain Classes of Graphs
Authors:
Robert Davis,
Joakim Jakovleski,
Qizhe Pan
Abstract:
The type-PQ adjacency polytope associated to a simple graph is a $0/1$-polytope containing valuable information about an underlying power network. Chen and the first author have recently demonstrated that, when the underlying graph $G$ is connected, the normalized volumes of the adjacency polytopes can be computed by counting sequences of nonnegative integers satisfying restrictions determined by $G$. This article builds upon their work, namely by showing that one of their main results -- the so-called "triangle recurrence" -- applies in a more general setting. Formulas for the normalized volumes when $G$ is obtained by deleting a path or a cycle from a complete graph are also established.
Submitted 20 February, 2022;
originally announced February 2022.
-
Designing Topological Defect Lines Protected by Gauge-dependent Symmetry Indicators
Authors:
Erda Wen,
Dia'aaldin J. Bisharat,
Robert J. Davis,
Xiaozhen Yang,
Daniel F. Sievenpiper
Abstract:
Symmetry indicators are a modern tool for characterizing topological phases that require only minimal computational expense but provide an elegant means of designing practical devices. This paper demonstrates how a rotational symmetry indicator can be used to construct and characterize a topologically robust waveguide, which is then verified experimentally on a printed circuit board (PCB) platform. The design takes advantage of the real-space gauge-dependency of the symmetry indicators and adopts a $C_6$ lattice with simple shifts, forming a defect line supporting topological edge modes. It is shown that the modes can realize the same features as previous topological waveguides, but in addition possess a greater degree of reconfigurability and the unique ability to form a one-way termination. Moreover, the design illustrates the critical role real space information plays in determining the topological properties of photonic crystals, enabling a wider range of possible realizations.
Submitted 15 January, 2022;
originally announced January 2022.
-
Topologically Protected Edge States in Triangular Lattices
Authors:
Robert J. Davis,
Yun Zhou,
Dia'aaldin J. Bisharat,
Prabhakar R. Bandaru,
Daniel F. Sievenpiper
Abstract:
We describe the possibility for topologically robust edge states existing on interfaces of triangular lattices which are supported by rotational symmetries that are sensitive to boundary conditions. Such states are trivial from the perspective of Berry curvature, but result instead from an interplay between crystalline symmetries and finite boundary effects. Regardless, we show such states are in a distinct topological phase, provided the gauge-dependent symmetries are maintained. Such a model describes a number of recent bosonic experimental demonstrations on triangular lattices, the physics for which has thus far eluded explanation.
Submitted 30 September, 2022; v1 submitted 10 January, 2022;
originally announced January 2022.
-
Cauchy, normal and correlations versus heavy tails
Authors:
Hui Xu,
Joel Cohen,
Richard Davis,
Gennady Samorodnitsky
Abstract:
A surprising result of Pillai and Meng (2016) showed that a transformation $\sum_{j=1}^n w_j X_j/Y_j$ of two iid centered normal random vectors, $(X_1,\ldots, X_n)$ and $(Y_1,\ldots, Y_n)$, $n>1$, for any weights $0\leq w_j\leq 1$, $ j=1,\ldots, n$, $\sum_{j=1}^n w_j=1$, has a Cauchy distribution regardless of any correlations within the normal vectors. The correlations appear to lose out in the competition with the heavy tails. To clarify how extensive this phenomenon is, we analyze two other transformations of two iid centered normal random vectors. These transformations are similar in spirit to the transformation considered by Pillai and Meng (2016). One transformation involves absolute values: $\sum_{j=1}^n w_j X_j/|Y_j|$. The second involves randomly stopped Brownian motions: $\sum_{j=1}^n w_j X_j\bigl(Y_j^{-2}\bigr)$, where $\bigl\{\bigl( X_1(t),\ldots, X_n(t)\bigr), \, t\geq 0\bigr\},\ n>1,$ is a Brownian motion with positive variances; $(Y_1,\ldots, Y_n)$ is a centered normal random vector with the same law as $( X_1(1),\ldots, X_n(1))$ and independent of it; and $X(Y^{-2})$ is the value of the Brownian motion $X(t)$ evaluated at the random time $t=Y^{-2}$. All three transformations result in a Cauchy distribution if the covariance matrix of the normal components is diagonal, or if all the correlations implied by the covariance matrix equal 1. However, while the transformation Pillai and Meng (2016) considered produces a Cauchy distribution regardless of the normal covariance matrix, the transformations we consider here do not always produce a Cauchy distribution. The correlations between jointly normal random variables are not always overwhelmed by the heaviness of the marginal tails. The mysteries of the connections between normal and Cauchy laws remain to be understood.
Submitted 7 January, 2022;
originally announced January 2022.
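The Pillai and Meng (2016) result quoted in the abstract above can be probed with a small Monte Carlo check. The sketch below is an illustration under stated assumptions, not the authors' experiment: it takes $n=2$, unit variances, a single correlation `rho`, and weights $(w_1, 1-w_1)$, then compares the empirical quartiles of $w_1 X_1/Y_1 + (1-w_1) X_2/Y_2$ with the standard Cauchy quartiles $-1$, $0$, $1$. The function name and defaults are hypothetical.

```python
import math
import random


def weighted_ratio_quartiles(rho, w1, n_draws=100_000, seed=0):
    """Empirical quartiles of w1*X1/Y1 + (1-w1)*X2/Y2.

    X = (X1, X2) and Y = (Y1, Y2) are independent bivariate normal vectors
    with zero means, unit variances and correlation `rho` (illustrative setup).
    """
    rng = random.Random(seed)
    c = math.sqrt(1.0 - rho * rho)

    def correlated_pair():
        # Cholesky-style construction of a correlated standard normal pair.
        z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        return z1, rho * z1 + c * z2

    draws = []
    for _ in range(n_draws):
        x1, x2 = correlated_pair()
        y1, y2 = correlated_pair()
        draws.append(w1 * x1 / y1 + (1.0 - w1) * x2 / y2)
    draws.sort()
    return draws[n_draws // 4], draws[n_draws // 2], draws[3 * n_draws // 4]


if __name__ == "__main__":
    for rho in (0.0, 0.5, 0.9):
        print(rho, weighted_ratio_quartiles(rho, w1=0.3))
```

If the quoted result holds, the three printed quartiles should stay close to $-1$, $0$ and $1$ for every value of `rho`, up to Monte Carlo error.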
-
On-chip unidirectional waveguiding for surface acoustic waves along a defect line in a triangular lattice
Authors:
Yun Zhou,
Naiqing Zhang,
Dia'aaldin J. Bisharat,
Robert J. Davis,
Zichen Zhang,
James Friend,
Prabhakar R. Bandaru,
Daniel F. Sievenpiper
Abstract:
The latest advances in topological physics have yielded a rich toolset to design highly robust wave transfer systems, for overcoming issues like beam steering and lateral diffraction in surface acoustic waves (SAWs). However, presently used designs for topologically protected SAWs have been largely limited to spin or valley-polarized phases, which rely on non-zero Berry curvature effects. Here we propose and experimentally demonstrate a highly robust SAW waveguide on lithium niobate (LiNbO3), based on a line defect within a true triangular phononic lattice, which instead employs an intrinsic chirality of phase vortices and maintains a zero Berry curvature. The guided SAW mode spans a wide bandwidth and shows confinement in the lateral direction with 3 dB attenuation within half of the unit-cell length. SAW routing around sharp bends has been demonstrated in such a waveguide, with less than ~4% reflection per bend. The waveguide has also been found robust for defect lines with different configurations. The fully on-chip system permits unidirectional SAW modes that are tightly bound to the waveguide, which provides a compact footprint ideal for miniaturization of practical applications and offers insight into the possibility of manipulating highly focused SAW propagation.
Submitted 23 November, 2021;
originally announced November 2021.
-
Spectral learning of multivariate extremes
Authors:
Marco Avella Medina,
Richard A. Davis,
Gennady Samorodnitsky
Abstract:
We propose a spectral clustering algorithm for analyzing the dependence structure of multivariate extremes. More specifically, we focus on the asymptotic dependence of multivariate extremes characterized by the angular or spectral measure in extreme value theory. Our work studies the theoretical performance of spectral clustering based on a random $k$-nearest neighbor graph constructed from an extremal sample, i.e., the angular part of random vectors for which the radius exceeds a large threshold. In particular, we derive the asymptotic distribution of extremes arising from a linear factor model and prove that, under certain conditions, spectral clustering can consistently identify the clusters of extremes arising in this model. Leveraging this result we propose a simple consistent estimation strategy for learning the angular measure. Our theoretical findings are complemented with numerical experiments illustrating the finite sample performance of our methods.
Submitted 1 August, 2023; v1 submitted 15 November, 2021;
originally announced November 2021.
-
Modeling Item Response Theory with Stochastic Variational Inference
Authors:
Mike Wu,
Richard L. Davis,
Benjamin W. Domingue,
Chris Piech,
Noah Goodman
Abstract:
Item Response Theory (IRT) is a ubiquitous model for understanding human behaviors and attitudes based on their responses to questions. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially improving psychometric modeling leading to improved scientific understanding and public policy. However, while larger datasets allow for more flexible approaches, many contemporary algorithms for fitting IRT models may also have massive computational demands that forbid real-world application. To address this bottleneck, we introduce a variational Bayesian inference algorithm for IRT, and show that it is fast and scalable without sacrificing accuracy. Applying this method to five large-scale item response datasets from cognitive science and education yields higher log likelihoods and higher accuracy in imputing missing data than alternative inference algorithms. Using this new inference approach we then generalize IRT with expressive Bayesian models of responses, leveraging recent advances in deep learning to capture nonlinear item characteristic curves (ICC) with neural networks. Using an eighth-grade mathematics test from TIMSS, we show our nonlinear IRT models can capture interesting asymmetric ICCs. The algorithm implementation is open-source, and easily usable.
Submitted 28 July, 2022; v1 submitted 26 August, 2021;
originally announced August 2021.
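As background for the item characteristic curves (ICCs) mentioned in the abstract above, the textbook two-parameter logistic (2PL) ICC can be written in a few lines. This is the standard symmetric baseline, shown only for orientation; it is not the neural-network ICC or the variational inference algorithm from the paper. The function and parameter names are chosen for this example.

```python
import math


def icc_2pl(theta, a, b):
    """Probability of a correct response under the standard 2PL model.

    theta: respondent ability; a: item discrimination; b: item difficulty.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


if __name__ == "__main__":
    # The curve is symmetric about theta == b, where the probability is 0.5.
    for theta in (-2.0, 0.0, 2.0):
        print(theta, icc_2pl(theta, a=1.5, b=0.0))
```

Because this curve satisfies $P(b+d) + P(b-d) = 1$ for every $d$, it cannot represent the asymmetric ICCs the abstract refers to, which is one motivation for more flexible response models.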
-
Facets and facet subgraphs of symmetric edge polytopes
Authors:
Tianran Chen,
Robert Davis,
Evgeniia Korchevskaia
Abstract:
Symmetric edge polytopes, a.k.a. PV-type adjacency polytopes, associated with undirected graphs have been defined and studied in several seemingly independent areas including number theory, discrete geometry, and dynamical systems. In particular, the authors are motivated by the study of the algebraic Kuramoto equations of unmixed form whose Newton polytopes are the symmetric edge polytopes.
The interplay between the geometric structure of symmetric edge polytopes and the topological structure of the underlying graphs has been a recurring theme in recent studies. In particular, ``facet/face subgraphs'' have emerged as one of the central concepts in describing this symmetry. Continuing along this line of inquiry we provide a complete description of the correspondence between facets/faces of a symmetric edge polytope and maximal bipartite subgraphs of the underlying connected graph.
Submitted 1 December, 2022; v1 submitted 26 July, 2021;
originally announced July 2021.
-
Lower-Luminosity Obscured AGN Host Galaxies are Not Predominantly in Major-Merging Systems at Cosmic Noon
Authors:
Erini Lambrides,
Marco Chiaberge,
Timothy Heckman,
Allison Kirkpatrick,
Eileen T. Meyer,
Andreea Petric,
Kirsten Hall,
Arianna Long,
Duncan J. Watts,
Roberto Gilli,
Raymond Simons,
Kirill Tchernyshyov,
Vicente Rodriguez-Gomez,
Fabio Vito,
Alexander De La Vega,
Jeffrey R. Davis,
Dale D Kocevski,
Colin Norman
Abstract:
For over 60 years, the scientific community has studied actively growing central super-massive black holes (active galactic nuclei -- AGN) but fundamental questions on their genesis remain unanswered. Numerical simulations and theoretical arguments show that black hole growth occurs during short-lived periods ($\sim$ 10$^{7}$ -10$^{8}$ yr) of powerful accretion. Major mergers are commonly invoked as the most likely dissipative process to trigger the rapid fueling of AGN. If the AGN-merger paradigm is true, we expect galaxy mergers to coincide with black hole accretion during a heavily obscured AGN phase (N$_H$ $ > 10^{23}$ cm$^{-2}$). Starting from one of the largest samples of obscured AGN at 0.5 $<$ $z$ $<$ 3.1, we select 40 non-starbursting lower-luminosity obscured AGN. We then construct a one-to-one matched redshift- and near-IR magnitude-matched non-starbursting inactive galaxy control sample. Combining deep color \textit{Hubble Space Telescope} imaging and a novel method of human classification, we test the merger-AGN paradigm prediction that heavily obscured AGN are strongly associated with galaxies undergoing a major merger. On the total sample of 80 galaxies, we estimate each individual classifier's accuracy at identifying merging galaxies/post-merging systems and isolated galaxies. We calculate the probability of each galaxy being in either a major merger or isolated system, given the accuracy of the human classifiers and the individual classifications of each galaxy. We do not find statistically significant evidence that obscured AGN at cosmic noon are predominately found in systems with evidence of significant merging/post-merging features.
Submitted 15 July, 2021;
originally announced July 2021.
-
Time Series Estimation of the Dynamic Effects of Disaster-Type Shock
Authors:
Richard Davis,
Serena Ng
Abstract:
This paper provides three results for SVARs under the assumption that the primitive shocks are mutually independent. First, a framework is proposed to accommodate a disaster-type variable with infinite variance into a SVAR. We show that the least squares estimates of the SVAR are consistent but have non-standard asymptotics. Second, the disaster shock is identified as the component with the largest kurtosis and whose impact effect is negative. An estimator that is robust to infinite variance is used to recover the mutually independent components. Third, an independence test on the residuals pre-whitened by the Choleski decomposition is proposed to test the restrictions imposed on a SVAR. The test can be applied whether the data have fat or thin tails, and to over as well as exactly identified models. Three applications are considered. In the first, the independence test is used to shed light on the conflicting evidence regarding the role of uncertainty in economic fluctuations. In the second, disaster shocks are shown to have short term economic impact arising mostly from feedback dynamics. The third uses the framework to study the dynamic effects of economic shocks post-covid.
Submitted 8 March, 2022; v1 submitted 14 July, 2021;
originally announced July 2021.