Search | arXiv e-print repository

LaTable: Towards Large Tabular Models

Authors: Boris van Breugel, Jonathan Crabbé, Rob Davis, Mihaela van der Schaar

Abstract: Tabular data is one of the most ubiquitous modalities, yet the literature on tabular generative foundation models is lagging far behind its text and vision counterparts. Creating such a model is hard, due to the heterogeneous feature spaces of different tabular datasets, tabular metadata (e.g. dataset description and feature headers), and tables lacking prior knowledge (e.g. feature order). In this work we propose LaTable: a novel tabular diffusion model that addresses these challenges and can be trained across different datasets. Through extensive experiments we find that LaTable outperforms baselines on in-distribution generation, and that finetuning LaTable can generate out-of-distribution datasets better with fewer samples. On the other hand, we explore the poor zero-shot performance of LaTable, and what it may teach us about building generative tabular foundation models with better zero- and few-shot generation capabilities.

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2405.00299 [pdf]

Thermal stability and phase transformation of $α$-, $κ(ε)$-, and $γ$-Ga$_2$O$_3$ thin films to $β$-Ga$_2$O$_3$ under various ambient conditions

Authors: J. Tang, K. Jiang, P. Tseng, R. C. Kurchin, L. M. Porter, R. F. Davis

Abstract: Phase transitions in metastable $α$-, $κ(ε)$-, and $γ$-Ga$_2$O$_3$ films to thermodynamically stable $β$-Ga$_2$O$_3$ during annealing in air, N$_2$, and vacuum have been systematically investigated via in-situ high-temperature X-ray diffraction and scanning electron microscopy. These respective polymorphs exhibited thermal stability to around 471-525$^\circ$C, 773-825$^\circ$C, and 490-575$^\circ$C before transforming into $β$-Ga$_2$O$_3$, across all tested ambient conditions. Particular crystallographic orientation relationships were observed before and after the phase transitions, i.e., (0006) $α$-Ga$_2$O$_3$ $\parallel$ $(\overline{4}02)$ $β$-Ga$_2$O$_3$, (004) $κ(ε)$-Ga$_2$O$_3$ $\parallel$ (310) and $(\overline{4}02)$ $β$-Ga$_2$O$_3$, and (400) $γ$-Ga$_2$O$_3$ $\parallel$ (400) $β$-Ga$_2$O$_3$. The phase transition of $α$-Ga$_2$O$_3$ to $β$-Ga$_2$O$_3$ resulted in catastrophic damage to the film and upheaval of the surface. The respective primary and possibly secondary causes of this damage are the +8.6% volume expansion and the dual displacive and reconstructive transformations that occur during this transition. The $κ(ε)$- and $γ$-Ga$_2$O$_3$ films converted to $β$-Ga$_2$O$_3$ via singular reconstructive transformations with small changes in volume and unchanged surface microstructures.

Submitted 30 April, 2024; originally announced May 2024.

Comments: 15 pages, 6 figures, 1 table

arXiv:2404.18347 [pdf, other]

Helical Phononic Modes Induced by a Screw Dislocation

Authors: Yun Zhou, Robert Davis, Li Chen, Erda Wen, Prabhakar Bandaru, Daniel Sievenpiper

Abstract: In this study, we investigate a one-dimensional (1D) unidirectional phononic waveguide embedded within a three-dimensional (3D) hexagonal close-packed phononic crystal, achieved by the introduction of a screw dislocation. This approach does not rely on the non-trivial topological characteristics of the 3D crystal. We discover that this dislocation induces a pair of helical modes, characterized by their orthogonal $x$- and $y$-directional displacements being out of phase by 90 degrees, which results in a distinctive rotational motion. These helical modes demonstrate directional propagation, tightly linked to the helicity of the screw dislocation. Through considerations of symmetry, we reveal that the emergence of these helical modes is governed by the symmetry of the screw dislocation itself. Our findings not only provide insights into the interplay between dislocation-induced symmetry and wave propagation in phononic systems but also open new avenues for designing directionally selective waveguides without relying on the crystal's topological properties.

Submitted 28 April, 2024; originally announced April 2024.

Comments: 13 pages, 4 figures

arXiv:2404.15474 [pdf, other]

doi 10.3847/PSJ/ad3944

Pwyll and Manannán Craters as a Laboratory for Constraining Irradiation Timescales on Europa

Authors: M. Ryleigh Davis, Michael E. Brown

Abstract: We examine high spatial resolution Galileo/NIMS observations of the young (~1 My - 20 My) impact features, Pwyll and Manannán craters, on Europa's trailing hemisphere in an effort to constrain irradiation timescales. We characterize their composition using a linear spectral modeling analysis and find that both craters and their ejecta are depleted in hydrated sulfuric acid relative to nearby older terrain. This suggests that the radiolytic sulfur cycle has not yet had enough time to build up an equilibrium concentration of H2SO4, and places a strong lower limit, set by the age of the craters, on the equilibrium timescale of the radiolytic sulfur cycle on Europa's trailing hemisphere. Additionally, we find that the dark and red material seen in the craters and proximal ejecta of Pwyll and Manannán shows the spectroscopic signature of hydrated, presumably endogenic salts. This suggests that the irradiation-induced darkening and reddening of endogenic salts thought to occur on Europa's trailing hemisphere has already happened at Pwyll and Manannán, thereby placing an upper limit on the timescale by which salts are irradiation reddened.

Submitted 23 April, 2024; originally announced April 2024.

Comments: 13 pages, 6 figures

Journal ref: The Planetary Science Journal (2024)

arXiv:2404.06660 [pdf, other]

JWST Spectrophotometry of the Small Satellites of Uranus and Neptune

Authors: Matthew Belyakov, M. Ryleigh Davis, Zachariah Milby, Ian Wong, Michael E. Brown

Abstract: We use 1.4-4.6 micron multi-band photometry of the small inner Uranian and Neptunian satellites obtained with the James Webb Space Telescope's near-infrared imager NIRCam to characterize their surface compositions. We find that the satellites of the ice giants have, to first-order, similar compositions to one another, with a 3.0 micron absorption feature possibly associated with an O-H stretch, indicative of water ice or hydrated minerals. Additionally, the spectrophotometry for the small ice giant satellites matches spectra of some Neptune Trojans and excited Kuiper belt objects, suggesting shared properties. Future spectroscopy of these small satellites is necessary to identify and better constrain their specific surface compositions.

Submitted 9 April, 2024; originally announced April 2024.

Comments: 14 pages, 5 figures, Accepted to the Planetary Science Journal

arXiv:2404.05865 [pdf]

Effectiveness of Self-Assessment Software to Evaluate Preclinical Operative Procedures

Authors: Qi Dai, Ryan Davis, Houlin Hong, Ying Gu

Abstract: Objectives: To assess the effectiveness of digital scanning techniques for self-assessment of preparations and restorations in preclinical dental education when compared to traditional faculty grading. Methods: Forty-four separate Class I (#30-O) preparations, Class II (#30-MO) preparations, and Class II amalgam restorations (#31-MO) were generated respectively under a preclinical assessment setting. Calibrated faculty evaluated the preparations and restorations using a standard rubric from the preclinical operative class. The same teeth were scanned using a Planmeca PlanScan intraoral scanner and graded using the Romexis E4D Compare Software. Each tooth was compared against a corresponding gold standard tooth with tolerance intervals ranging from 100μm to 500μm. These scores were compared to traditional faculty grades using a linear mixed model to estimate the mean differences, with 95% confidence intervals, for each tolerance level. Results: The average Compare Software grade of Class I preparation at 300μm tolerance had the smallest mean difference of 1.64 points on a 100-point scale compared to the average faculty grade. Class II preparation at 400μm tolerance had the smallest mean difference of 0.41 points. Finally, Class II restoration at 300μm tolerance had the smallest mean difference at 0.20 points. Conclusion: In this study, tolerance levels that best correlated the Compare Software grades with the faculty grades were determined for three operative procedures: Class I preparation, Class II preparation and Class II restoration. The Compare Software can be used as an adjunct method for more objective grading. It can also be used by students as a self-assessment tool.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.00477 [pdf, other]

DE-HNN: An effective neural model for Circuit Netlist representation

Authors: Zhishang Luo, Truong Son Hy, Puoya Tabaghi, Donghyeon Koh, Michael Defferrard, Elahe Rezaei, Ryan Carey, Rhett Davis, Rajeev Jain, Yusu Wang

Abstract: The run-time for optimization tools used in chip design has grown with the complexity of designs to the point where it can take several days to go through one design cycle, which has become a bottleneck. Designers want fast tools that can quickly give feedback on a design. Using the input and output data of the tools from past designs, one can attempt to build a machine learning model that predicts the outcome of a design in significantly shorter time than running the tool. The accuracy of such models is affected by the representation of the design data, which is usually a netlist that describes the elements of the digital circuit and how they are connected. Graph representations for the netlist together with graph neural networks have been investigated for such models. However, the characteristics of netlists pose several challenges for existing graph learning frameworks, due to the large number of nodes and the importance of long-range interactions between nodes. To address these challenges, we represent the netlist as a directed hypergraph and propose a Directional Equivariant Hypergraph Neural Network (DE-HNN) for the effective learning of (directed) hypergraphs. Theoretically, we show that our DE-HNN can universally approximate any node or hyperedge based function that satisfies certain permutation equivariant and invariant properties natural for directed hypergraphs. We compare the proposed DE-HNN with several state-of-the-art (SOTA) machine learning models for (hyper)graphs and netlists, and show that the DE-HNN significantly outperforms them in predicting the outcome of optimized place-and-route tools directly from the input netlists. Our source code and the netlists data used are publicly available at https://github.com/YusuLab/chips.git

Submitted 16 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

arXiv:2403.14661 [pdf, other]

Towards Modeling Learner Performance with Large Language Models

Authors: Seyed Parsa Neshaei, Richard Lee Davis, Adam Hazimeh, Bojan Lazarevski, Pierre Dillenbourg, Tanja Käser

Abstract: Recent work exploring the capabilities of pre-trained large language models (LLMs) has demonstrated their ability to act as general pattern machines by completing complex token sequences representing a wide array of tasks, including time-series prediction and robot control. This paper investigates whether the pattern recognition and sequence modeling capabilities of LLMs can be extended to the domain of knowledge tracing, a critical component in the development of intelligent tutoring systems (ITSs) that tailor educational experiences by predicting learner performance over time. In an empirical evaluation across multiple real-world datasets, we compare two approaches to using LLMs for this task, zero-shot prompting and model fine-tuning, with existing, non-LLM approaches to knowledge tracing. While LLM-based approaches do not achieve state-of-the-art performance, fine-tuned LLMs surpass the performance of naive baseline models and perform on par with standard Bayesian Knowledge Tracing approaches across multiple metrics. These findings suggest that the pattern recognition capabilities of LLMs can be used to model complex learning trajectories, opening a novel avenue for applying LLMs to educational contexts. The paper concludes with a discussion of the implications of these findings for future research, suggesting that further refinements and a deeper understanding of LLMs' predictive mechanisms could lead to enhanced performance in knowledge tracing tasks.

Submitted 29 February, 2024; originally announced March 2024.

Comments: 12 pages, 4 figures

arXiv:2403.07158 [pdf, ps, other]

Sample Splitting and Assessing Goodness-of-fit of Time Series

Authors: Richard A. Davis, Leon Fernandes

Abstract: A fundamental and often final step in time series modeling is to assess the quality of fit of a proposed model to the data. Since the underlying distribution of the innovations that generate a model is often not prescribed, goodness-of-fit tests typically take the form of testing the fitted residuals for serial independence. However, these fitted residuals are inherently dependent since they are based on the same parameter estimates, and thus standard tests of serial independence, such as those based on the autocorrelation function (ACF) or distance correlation function (ADCF) of the fitted residuals, need to be adjusted. The sample splitting procedure in Pfister et al. (2018) is one such fix for the case of models for independent data, but fails to work in the dependent setting. In this paper sample splitting is leveraged in the time series setting to perform tests of serial dependence of fitted residuals using the ACF and ADCF. Here the first $f_n$ of the data points are used to estimate the parameters of the model and then, using these parameter estimates, the last $l_n$ of the data points are used to compute the estimated residuals. Tests for serial independence are then based on these $l_n$ residuals. As long as the overlap between the $f_n$ and $l_n$ data splits is asymptotically 1/2, the ACF and ADCF tests of serial independence often have the same limit distributions as though the underlying residuals are indeed iid. In particular, if the first half of the data is used to estimate the parameters and the estimated residuals are computed for the entire data set based on these parameter estimates, then the ACF and ADCF can have the same limit distributions as though the residuals were iid. This procedure ameliorates the need for adjustment in the construction of confidence bounds for both the ACF and ADCF in goodness-of-fit testing.

Submitted 11 March, 2024; originally announced March 2024.

Comments: 31 pages, 4 figures, 1 table

MSC Class: 62M10; 62H20
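
The sample-splitting procedure described in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the AR(1) model, the least-squares estimator, the helper names (`fit_ar1`, `sample_acf`, `split_residual_acf`), and the usual iid-style bound 1.96/sqrt(m). Only the ACF is covered; the ADCF is not.

```python
import numpy as np

def fit_ar1(x):
    """Least-squares estimate of phi in X_t = phi * X_{t-1} + Z_t (assumed model)."""
    return float(np.dot(x[1:], x[:-1]) / np.dot(x[:-1], x[:-1]))

def sample_acf(r, max_lag):
    """Sample autocorrelations of r at lags 1..max_lag."""
    r = r - r.mean()
    denom = np.dot(r, r)
    return np.array([np.dot(r[h:], r[:-h]) / denom for h in range(1, max_lag + 1)])

def split_residual_acf(x, f_n, l_n, max_lag=10):
    """Estimate on the first f_n points, compute residuals on the last l_n points."""
    phi_hat = fit_ar1(x[:f_n])                 # parameters from the first f_n points
    tail = x[-l_n:]                            # last l_n points
    resid = tail[1:] - phi_hat * tail[:-1]     # estimated residuals
    acf = sample_acf(resid, max_lag)
    bound = 1.96 / np.sqrt(len(resid))         # iid-style 95% bound (assumption)
    return phi_hat, acf, bound

# Simulate an AR(1) series with phi = 0.6 (hypothetical test data).
rng = np.random.default_rng(0)
n = 2000
z = rng.standard_normal(n)
x = np.empty(n)
x[0] = z[0]
for t in range(1, n):
    x[t] = 0.6 * x[t - 1] + z[t]

# First half for estimation, residuals on the whole series (overlap of 1/2).
phi_hat, acf, bound = split_residual_acf(x, f_n=n // 2, l_n=n)
print(phi_hat, np.sum(np.abs(acf) > bound))
```

The call at the end uses the half-sample configuration mentioned in the abstract; the printed count is the number of lags whose residual autocorrelation falls outside the assumed bound.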

arXiv:2403.00281 [pdf, other]

Wavelet Based Periodic Autoregressive Moving Average Models

Authors: Rhea Davis, N. Balakrishna

Abstract: This paper proposes a wavelet-based method for analysing periodic autoregressive moving average (PARMA) time series. Even though Fourier analysis provides an effective method for analysing periodic time series, it requires the estimation of a large number of Fourier parameters when the PARMA parameters do not vary smoothly. The wavelet-based analysis helps us to obtain a parsimonious model with a reduced number of parameters. We have illustrated this with simulated and actual data sets.

Submitted 29 February, 2024; originally announced March 2024.

MSC Class: 62M10

arXiv:2401.03383 [pdf, other]

On the Ehrhart Theory of Generalized Symmetric Edge Polytopes

Authors: Robert Davis, Akihiro Higashitani, Hidefumi Ohsugi

Abstract: The symmetric edge polytope (SEP) of a (finite, undirected) graph is a centrally symmetric lattice polytope whose vertices are defined by the edges of the graph. SEPs have been studied extensively in the past twenty years. Recently, Tóthmérész and, independently, D'Alí, Juhnke-Kubitzke, and Koch generalized the definition of an SEP to regular matroids, as these are the matroids that can be characterized by totally unimodular matrices. Generalized SEPs are known to have symmetric Ehrhart $h^*$-polynomials, and Ohsugi and Tsuchiya conjectured that (ordinary) SEPs have nonnegative $γ$-vectors. In this article, we use combinatorial and Gröbner basis techniques to extend additional known properties of SEPs to generalized SEPs. Along the way, we show that generalized SEPs are not necessarily $γ$-nonnegative by providing explicit examples. We prove these polytopes to be "nearly" $γ$-nonnegative in the sense that, by deleting exactly two elements from the matroid, one obtains SEPs for graphs that are $γ$-nonnegative. This provides further evidence that Ohsugi and Tsuchiya's conjecture holds in the ordinary case.

Submitted 6 March, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

Comments: 20 pages. Proof of formula for gamma-vector added

MSC Class: 52B20; 52B40; 05A15

arXiv:2312.07756 [pdf, other]

Characterization of the Repeating FRB 20220912A with the Allen Telescope Array

Authors: Sofia Z. Sheikh, Wael Farah, Alexander W. Pollak, Andrew P. V. Siemion, Mohammed A. Chamma, Luigi F. Cruz, Roy H. Davis, David R. DeBoer, Vishal Gajjar, Phil Karn, Jamar Kittling, Wenbin Lu, Mark Masters, Pranav Premnath, Sarah Schoultz, Carol Shumaker, Gurmehar Singh, Michael Snodgrass

Abstract: FRB 20220912A is a repeating Fast Radio Burst (FRB) that was discovered in Fall 2022 and remained highly active for several months. We report the detection of 35 FRBs from 541 hours of follow-up observations of this source using the recently refurbished Allen Telescope Array, covering 1344 MHz of bandwidth primarily centered at 1572 MHz. All 35 FRBs were detected in the lower half of the band with non-detections in the upper half and covered fluences from 4-431 Jy-ms (median$=$48.27 Jy-ms). We find consistency with previous repeater studies for a range of spectrotemporal features including: bursts with downward frequency drifting over time; a positive correlation between bandwidth and center frequency; and a decrease in sub-burst duration over time. We report an apparent decrease in the center frequency of observed bursts over the 2 months of the observing campaign (corresponding to a drop of $6.21\pm 0.76$ MHz per day). We predict a cut-off fluence for FRB 20220912A of $F_\textrm{max}\lesssim 10^4$ Jy-ms, for this source to be consistent with the all-sky rate, and find that FRB 20220912A significantly contributed to the all-sky FRB rate at a level of a few percent for fluences of $\sim$100 Jy-ms. Finally, we investigate characteristic timescales and sub-burst periodicities and find a) a median inter-subburst timescale of 5.82$\pm$1.16 ms in the multi-component bursts and b) no evidence of strict periodicity even in the most evenly-spaced multi-component burst in the sample. Our results demonstrate the importance of wideband observations of FRBs, and provide an important set of observational parameters against which to compare FRB progenitor and emission mechanism models.

Submitted 12 December, 2023; originally announced December 2023.

Comments: 17 pages, 8 figures, 2 tables, accepted to MNRAS

arXiv:2310.14463 [pdf, other]

Tightening QC Relaxations of AC Optimal Power Flow through Improved Linear Convex Envelopes

Authors: Mohammad Rasoul Narimani, Daniel K. Molzahn, Katherine R. Davis, Mariesa L. Crow

Abstract: AC optimal power flow (AC OPF) is a fundamental problem in power system operations. Accurately modeling the network physics via the AC power flow equations makes AC OPF a challenging nonconvex problem. To search for global optima, recent research has developed a variety of convex relaxations that bound the optimal objective values of AC OPF problems. The well-known QC relaxation convexifies the AC OPF problem by enclosing the non-convex terms (trigonometric functions and products) within convex envelopes. The accuracy of this method strongly depends on the tightness of these envelopes. This paper proposes two improvements for tightening QC relaxations of OPF problems. We first consider a particular nonlinear function whose projections are the nonlinear expressions appearing in the polar representation of the power flow equations. We construct a convex envelope around this nonlinear function that takes the form of a polytope and then use projections of this envelope to obtain convex expressions for the nonlinear terms. Second, we use certain characteristics of the sine and cosine expressions along with the changes in their curvature to tighten this convex envelope. We also propose a coordinate transformation that rotates the power flow equations by an angle specific to each bus in order to obtain a tighter envelope. We demonstrate these improvements relative to a state-of-the-art QC relaxation implementation using the PGLib-OPF test cases. The results show improved optimality gaps in 68% of these cases.

Submitted 6 April, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

arXiv:2310.12493 [pdf]

Atomic-scale investigation of $γ$-Ga$_2$O$_3$ deposited on MgAl$_2$O$_4$ and its relationship with $β$-Ga$_2$O$_3$

Authors: J. Tang, K. Jiang, C. Xu, M. J. Cabral, K. Xiao, L. M. Porter, R. F. Davis

Abstract: Nominally phase-pure $γ$-$Ga_2O_3$ was deposited on (100) $MgAl_2O_4$ within a narrow temperature window centered at $\sim$470 $^{\circ}$C using metal-organic chemical vapor deposition (MOCVD). The film deposited at 440 $^{\circ}$C exhibited either poor crystallization or an amorphous structure; the film grown at 500 $^{\circ}$C contained both $β$-$Ga_2O_3$ and $γ$-$Ga_2O_3$. A nominally phase-pure $β$-$Ga_2O_3$ film was obtained at 530 $^{\circ}$C. Atomic-resolution scanning transmission electron microscopy (STEM) investigations of the $γ$-$Ga_2O_3$ film grown at 470 $^{\circ}$C revealed a high density of antiphase boundaries. A planar defect model developed for $γ$-$Al_2O_3$ was extended to explain the stacking sequences of the Ga sublattice observed in the STEM images of $γ$-$Ga_2O_3$. The presence of the 180$^{\circ}$ rotational domains and 90$^{\circ}$ rotational domains of $β$-$Ga_2O_3$ inclusions within the $γ$-$Ga_2O_3$ matrix is discussed within the context of a comprehensive investigation of the epitaxial relationship between those two phases in the as-grown film at 470 $^{\circ}$C and the same film annealed at 600 $^{\circ}$C. The results led to the hypotheses that (i) incorporation of certain dopants, including Si, Ge, Sn, Mg, Al, and Sc, into $β$-$Ga_2O_3$ locally stabilizes the "$γ$-phase" and (ii) the site preference(s) for these dopants promotes the formation of the "$γ$-phase" and/or $γ$-$Ga_2O_3$ solid solutions. However, in the absence of such dopants, pure $γ$-$Ga_2O_3$ remains the least stable $Ga_2O_3$ polymorph, as indicated by its very narrow growth window, its lower growth temperatures relative to other $Ga_2O_3$ polymorphs, and a calculated difference in Helmholtz free energy per formula unit between $γ$-$Ga_2O_3$ and $β$-$Ga_2O_3$ that is larger than for any other polymorph.

Submitted 20 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

Comments: The following article has been submitted to APL Materials

arXiv:2308.13787 [pdf, other]

doi 10.3847/PSJ/aced96

The Spatial Distribution of the Unidentified 2.07 μm Absorption Feature on Europa and Implications for its Origin

Authors: M. Ryleigh Davis, Michael E. Brown, Samantha K. Trumbo

Abstract: A weak absorption feature at 2.07 μm on Europa's trailing hemisphere has been suggested to arise from radiolytic processing of an endogenic salt, possibly sourced from the interior ocean. However, if the genesis of this feature requires endogenic material to be present, one might expect to find a correlation between its spatial distribution and the recently disrupted chaos terrains. Using archived near-infrared observations from Very Large Telescope/SINFONI with a $\sim$1 nm spectral resolution and a linear spatial resolution $\sim$130 km, we examine the spatial distribution of this feature in an effort to explore this endogenic formation hypothesis. We find that while the presence of the 2.07 μm feature is strongly associated with the irradiation pattern on Europa's trailing hemisphere, there is no apparent association between the presence or depth of the absorption feature and Europa's large-scale chaos terrain. This spatial distribution suggests that the formation pathway of the 2.07 μm feature on Europa is independent of any endogenous salts within the recent geology. Instead, we propose that the source of this feature may simply be a product of the radiolytic sulfur cycle or arise from some unidentified parallel irradiation process. Notably, the 2.07 μm absorption band is absent from the Pwyll crater ejecta blanket, suggesting that radiolytic processing has not had enough time to form the species responsible and placing a lower limit on the irradiation timescale. We are unable to find a plausible spectral match to the 2.07 μm feature within the available laboratory data.

Submitted 26 August, 2023; originally announced August 2023.

Comments: 12 pages, 3 figures, published in PSJ

Journal ref: The Planetary Science Journal, 4, 148 (2023)

arXiv:2306.10420 [pdf, other]

Federated Learning Based Distributed Localization of False Data Injection Attacks on Smart Grids

Authors: Cihat Keçeci, Katherine R. Davis, Erchin Serpedin

Abstract: Data analysis and monitoring on smart grids are jeopardized by attacks on cyber-physical systems. False data injection attack (FDIA) is one of the classes of those attacks that target the smart measurement devices by injecting malicious data. The employment of machine learning techniques in the detection and localization of FDIA is proven to provide effective results. Training of such models requires centralized processing of sensitive user data that may not be plausible in a practical scenario. By employing federated learning for the detection of FDIA attacks, it is possible to train a model for the detection and localization of the attacks while preserving the privacy of sensitive user data. However, federated learning introduces new problems such as the personalization of the detectors in each node. In this paper, we propose a federated learning-based scheme combined with a hybrid deep neural network architecture that exploits the local correlations between the connected power buses by employing graph neural networks as well as the temporal patterns in the data by using LSTM layers. The proposed mechanism offers flexible and efficient training of an FDIA detector in a distributed setup while preserving the privacy of the clients. We validate the proposed architecture by extensive simulations on the IEEE 57, 118, and 300 bus systems and real electricity load data.

Submitted 17 June, 2023; originally announced June 2023.

Comments: 9 pages, 6 figures

arXiv:2306.05308 [pdf, other]

doi 10.3847/1538-3881/acd537

TOI-4010: A System of Three Large Short-Period Planets With a Massive Long-Period Companion

Authors: Michelle Kunimoto, Andrew Vanderburg, Chelsea X. Huang, M. Ryleigh Davis, Laura Affer, Andrew Collier Cameron, David Charbonneau, Rosario Cosentino, Mario Damasso, Xavier Dumusque, A. F. Martínez Fiorenzano, Adriano Ghedina, R. D. Haywood, Florian Lienhard, Mercedes López-Morales, Michel Mayor, Francesco Pepe, Matteo Pinamonti, Ennio Poretti, Jesús Maldonado, Ken Rice, Alessandro Sozzetti, Thomas G. Wilson, Stéphane Udry, Jay Baptista, et al. (31 additional authors not shown)

Abstract: We report the confirmation of three exoplanets transiting TOI-4010 (TIC-352682207), a metal-rich K dwarf observed by TESS in Sectors 24, 25, 52, and 58. We confirm these planets with HARPS-N radial velocity observations and measure their masses with 8 - 12% precision. TOI-4010 b is a sub-Neptune ($P = 1.3$ days, $R_{p} = 3.02_{-0.08}^{+0.08}~R_{\oplus}$, $M_{p} = 11.00_{-1.27}^{+1.29}~M_{\oplus}$) in the hot Neptune desert, and is one of the few such planets with known companions. Meanwhile, TOI-4010 c ($P = 5.4$ days, $R_{p} = 5.93_{-0.12}^{+0.11}~R_{\oplus}$, $M_{p} = 20.31_{-2.11}^{+2.13}~M_{\oplus}$) and TOI-4010 d ($P = 14.7$ days, $R_{p} = 6.18_{-0.14}^{+0.15}~R_{\oplus}$, $M_{p} = 38.15_{-3.22}^{+3.27}~M_{\oplus}$) are similarly-sized sub-Saturns on short-period orbits. Radial velocity observations also reveal a super-Jupiter-mass companion called TOI-4010 e in a long-period, eccentric orbit ($P \sim 762$ days and $e \sim 0.26$ based on available observations). TOI-4010 is one of the few systems with multiple short-period sub-Saturns to be discovered so far.

Submitted 19 June, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

Comments: 26 pages, 16 figures, published in AJ; (v3) added missing citation

Journal ref: AJ, 166, 7 (2023)

arXiv:2306.02206 [pdf]

Mitigating Molecular Aggregation in Drug Discovery with Predictive Insights from Explainable AI

Authors: Hunter Sturm, Jonas Teufel, Kaitlin A. Isfeld, Pascal Friederich, Rebecca L. Davis

Abstract: As the importance of high-throughput screening (HTS) continues to grow due to its value in early stage drug discovery and data generation for training machine learning models, there is a growing need for robust methods for pre-screening compounds to identify and prevent false-positive hits. Small, colloidally aggregating molecules are one of the primary sources of false-positive hits in high-throughput screens, making them an ideal candidate to target for removal from libraries using predictive pre-screening tools. However, a lack of understanding of the causes of molecular aggregation introduces difficulty in the development of predictive tools for detecting aggregating molecules. Herein, we present an examination of the molecular features differentiating datasets of aggregating and non-aggregating molecules, as well as a machine learning approach to predicting molecular aggregation. Our method uses explainable graph neural networks and counterfactuals to reliably predict and explain aggregation, giving additional insights and design rules for future screening. The integration of this method in HTS approaches will help combat false positives, providing better lead molecules more rapidly and thus accelerating drug discovery cycles.

Submitted 3 June, 2023; originally announced June 2023.

Comments: 17 pages, plus SI

arXiv:2304.11120 [pdf, other]

What is missing in autonomous discovery: Open challenges for the community

Authors: Phillip M. Maffettone, Pascal Friederich, Sterling G. Baird, Ben Blaiszik, Keith A. Brown, Stuart I. Campbell, Orion A. Cohen, Tantum Collins, Rebecca L. Davis, Ian T. Foster, Navid Haghmoradi, Mark Hereld, Nicole Jung, Ha-Kyung Kwon, Gabriella Pizzuto, Jacob Rintamaki, Casper Steinmann, Luca Torresi, Shijing Sun

Abstract: Self-driving labs (SDLs) leverage combinations of artificial intelligence, automation, and advanced computing to accelerate scientific discovery. The promise of this field has given rise to a rich community of passionate scientists, engineers, and social scientists, as evidenced by the development of the Acceleration Consortium and recent Accelerate Conference. Despite its strengths, this rapidly developing field presents numerous opportunities for growth, challenges to overcome, and potential risks of which to remain aware. This community perspective builds on a discourse instantiated during the first Accelerate Conference, and looks to the future of self-driving labs with a tempered optimism. Incorporating input from academia, government, and industry, we briefly describe the current status of self-driving labs, then turn our attention to barriers, opportunities, and a vision for what is possible. Our field is delivering solutions in technology and infrastructure, artificial intelligence and knowledge generation, and education and workforce development. In the spirit of community, we intend for this work to foster discussion and drive best practices as our field grows.

Submitted 2 May, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

arXiv:2303.16759 [pdf]

doi 10.1136/bmjhci-2022-100665

Exploring celebrity influence on public attitude towards the COVID-19 pandemic: social media shared sentiment analysis

Authors: Brianna M White, Chad A Melton, Parya Zareie, Robert L Davis, Robert A Bednarczyk, Arash Shaban-Nejad

Abstract: The COVID-19 pandemic has introduced new opportunities for health communication, including an increase in the public use of online outlets for health-related emotions. People have turned to social media networks to share sentiments related to the impacts of the COVID-19 pandemic. In this paper we examine the role of social messaging shared by Persons in the Public Eye (i.e. athletes, politicians, news personnel) in determining overall public discourse direction. We harvested approximately 13 million tweets ranging from 1 January 2020 to 1 March 2022. The sentiment was calculated for each tweet using a fine-tuned DistilRoBERTa model, which was used to compare COVID-19 vaccine-related Twitter posts (tweets) that co-occurred with mentions of People in the Public Eye. Our findings suggest the presence of consistent patterns of emotional content co-occurring with messaging shared by Persons in the Public Eye for the first two years of the COVID-19 pandemic influenced public opinion and largely stimulated online public discourse. We demonstrate that as the pandemic progressed, public sentiment shared on social networks was shaped by risk perceptions, political ideologies and health-protective behaviours shared by Persons in the Public Eye, often in a negative light.

Submitted 23 February, 2023; originally announced March 2023.

Comments: 7 Pages, 4 Figures

ACM Class: I.2.7

Journal ref: BMJ Health & Care Informatics 2023;30:e100665

arXiv:2303.14295 [pdf, other]

doi 10.1111/jtsa.12688

Clustering Multivariate Time Series using Energy Distance

Authors: Richard A. Davis, Leon Fernandes, Konstantinos Fokianos

Abstract: A novel methodology is proposed for clustering multivariate time series data using energy distance defined in Székely and Rizzo (2013). Specifically, a dissimilarity matrix is formed using the energy distance statistic to measure separation between the finite dimensional distributions for the component time series. Once the pairwise dissimilarity matrix is calculated, a hierarchical clustering method is then applied to obtain the dendrogram. This procedure is completely nonparametric as the dissimilarities between stationary distributions are directly calculated without making any model assumptions. In order to justify this procedure, asymptotic properties of the energy distance estimates are derived for general stationary and ergodic time series. The method is illustrated in a simulation study for various component time series that are either linear or nonlinear. Finally the methodology is applied to two examples; one involves GDP of selected countries and the other is population size of various states in the U.S.A. in the years 1900-1999.

Submitted 24 March, 2023; originally announced March 2023.

Comments: 26 pages, 7 figures, to be published in Journal of Time Series Analysis

MSC Class: 62M10; 62H30 (Primary) 62H20; 62H12 (Secondary)

Journal ref: Journal of Time Series Analysis 44 (2023) 487-504

arXiv:2303.09614 [pdf, ps, other]

Weighted Ehrhart Theory: Extending Stanley's nonnegativity theorem

Authors: Esme Bajo, Robert Davis, Jesús A. De Loera, Alexey Garber, Sofía Garzón Mora, Katharina Jochemko, Josephine Yu

Abstract: We generalize R. P. Stanley's celebrated theorem that the $h^\ast$-polynomial of the Ehrhart series of a rational polytope has nonnegative coefficients and is monotone under containment of polytopes. We show that these results continue to hold for weighted Ehrhart series where lattice points are counted with polynomial weights, as long as the weights are homogeneous polynomials decomposable as sums of products of linear forms that are nonnegative on the polytope. We also show nonnegativity of the $h^\ast$-polynomial as a real-valued function for a larger family of weights. We then target the case when the weight function is the square of a single (arbitrary) linear form. We show stronger results for two-dimensional convex lattice polygons and give concrete examples showing tightness of the hypotheses. As an application, we construct a counterexample to a conjecture by Berg, Jochemko, and Silverstein on Ehrhart tensor polynomials.

Submitted 11 March, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

Comments: 26 pages, 3 figures

MSC Class: 52B20; 05A15; 52B45

arXiv:2303.04281 [pdf, other]

doi 10.1109/PESGM52003.2023.10253322

An Extended Model for Ecological Robustness to Capture Power System Resilience

Authors: Hao Huang, Katherine R. Davis, H. Vincent Poor

Abstract: The long-term resilient property of ecosystems has been quantified as ecological robustness (RECO) in terms of the energy transfer over food webs. The RECO of resilient ecosystems favors a balance of food webs' network efficiency and redundancy. By integrating RECO with power system constraints, the authors are able to optimize power systems' inherent resilience as ecosystems through network design and system operation. A previous model used on real power flows and aggregated redundant components for a rigorous mapping between ecosystems and power systems. However, the reactive power flows also determine power systems resilience; and the power components' redundancy is part of the global network redundancy. These characteristics should be considered for RECO-oriented evaluation and optimization for power systems. Thus, this paper extends the model for quantifying RECO in power systems using real, reactive, and apparent power flows with the consideration of redundant placement of generators. Recalling the performance of RECO-oriented optimal power flows under N-x contingencies, the analyses suggest reactive power flows and redundant components should be included for RECO to capture power systems' inherent resilience.

Submitted 1 October, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: Accepted By IEEE PES General Meeting 2023

Journal ref: 2023 IEEE Power & Energy Society General Meeting (PESGM)

arXiv:2303.03311 [pdf, other]

Ising Meson Spectroscopy on a Noisy Digital Quantum Simulator

Authors: Christopher Lamb, Yicheng Tang, Robert Davis, Ananda Roy

Abstract: Quantum simulation has the potential to be an indispensable technique for the investigation of non-perturbative phenomena in strongly-interacting quantum field theories (QFTs). In the modern quantum era, with Noisy Intermediate Scale Quantum (NISQ) simulators widely available and larger-scale quantum machines on the horizon, it is natural to ask: what non-perturbative QFT problems can be solved with the existing quantum hardware? We show that existing noisy quantum machines can be used to analyze the energy spectrum of a large family of strongly-interacting 1+1D QFTs. The latter exhibit a wide range of non-perturbative effects like 'quark confinement' and 'false vacuum decay' which are typically associated with higher-dimensional QFTs of elementary particles. We perform quench experiments on IBM's ibmq_mumbai quantum simulator to compute the energy spectrum of 1+1D quantum Ising model with a longitudinal field. The latter model is particularly interesting due to the formation of mesonic bound states arising from a confining potential for the Ising domain-walls, reminiscent of 't Hooft's model of two-dimensional quantum chromodynamics. Our results demonstrate that digital quantum simulation in the NISQ era has the potential to be a viable alternative to numerical techniques such as density matrix renormalization group or the truncated conformal space methods for analyzing QFTs.

Submitted 10 June, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

Comments: 4 figures, version accepted in Nature Communications

arXiv:2302.05004 [pdf]

doi 10.1021/acs.est.3c03609

Initial validation of a soil-based mass-balance approach for empirical monitoring of enhanced rock weathering rates

Authors: Tom Reershemius, Mike E. Kelland, Jacob S. Jordan, Isabelle R. Davis, Rocco D'Ascanio, Boriana Kalderon-Asael, Dan Asael, T. Jesper Suhrhoff, Dimitar Z. Epihov, David J. Beerling, Christopher T. Reinhard, Noah J. Planavsky

Abstract: Enhanced Rock Weathering (ERW) is a promising scalable and cost-effective Carbon Dioxide Removal (CDR) strategy with significant environmental and agronomic co-benefits. A major barrier to large-scale implementation of ERW is a robust Monitoring, Reporting, and Verification (MRV) framework. To successfully quantify the amount of carbon dioxide removed by ERW, MRV must be accurate, precise, and cost-effective. Here, we outline a mass-balance-based method where analysis of the chemical composition of soil samples is used to track in-situ silicate rock weathering. We show that signal-to-noise issues of in-situ soil analysis can be mitigated by using isotope-dilution mass spectrometry to reduce analytical error. We implement a proof-of-concept experiment demonstrating the method in controlled mesocosms. In our experiment, basalt rock feedstock is added to soil columns containing the cereal crop Sorghum bicolor at a rate equivalent to 50 t ha$^{-1}$. Using our approach, we calculate rock weathering corresponding to an average initial CDR value of 1.44 +/- 0.27 tCO$_2$eq ha$^{-1}$ from our experiments after 235 days, within error of an independent estimate calculated using conventional elemental budgeting of reaction products. Our method provides a robust time-integrated estimate of initial CDR, to feed into models that track and validate large-scale carbon removal through ERW.

Submitted 22 October, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

Comments: Environmental Science & Technology (2023)

arXiv:2301.01779 [pdf, other]

doi 10.1088/1538-3873/acada0

The James Webb Space Telescope Mission: Optical Telescope Element Design, Development, and Performance

Authors: Michael W. McElwain, Lee D. Feinberg, Marshall D. Perrin, Mark Clampin, C. Matt Mountain, Matthew D. Lallo, Charles-Philippe Lajoie, Randy A. Kimble, Charles W. Bowers, Christopher C. Stark, D. Scott Acton, Ken Aiello, Charles Atkinson, Beth Barinek, Allison Barto, Scott Basinger, Tracy Beck, Matthew D. Bergkoetter, Marcel Bluth, Rene A. Boucarut, Gregory R. Brady, Keira J. Brooks, Bob Brown, John Byard, Larkin Carey, et al. (104 additional authors not shown)

Abstract: The James Webb Space Telescope (JWST) is a large, infrared space telescope that has recently started its science program which will enable breakthroughs in astrophysics and planetary science. Notably, JWST will provide the very first observations of the earliest luminous objects in the Universe and start a new era of exoplanet atmospheric characterization. This transformative science is enabled by a 6.6 m telescope that is passively cooled with a 5-layer sunshield. The primary mirror is comprised of 18 controllable, low areal density hexagonal segments, that were aligned and phased relative to each other in orbit using innovative image-based wavefront sensing and control algorithms. This revolutionary telescope took more than two decades to develop with a widely distributed team across engineering disciplines. We present an overview of the telescope requirements, architecture, development, superb on-orbit performance, and lessons learned. JWST successfully demonstrates a segmented aperture space telescope and establishes a path to building even larger space telescopes.

Submitted 4 January, 2023; originally announced January 2023.

Comments: accepted by PASP for JWST Overview Special Issue; 34 pages, 25 figures

arXiv:2212.10533 [pdf, other]

doi 10.1109/TVCG.2022.3226463

The Risks of Ranking: Revisiting Graphical Perception to Model Individual Differences in Visualization Performance

Authors: Russell Davis, Xiaoying Pu, Yiren Ding, Brian D. Hall, Karen Bonilla, Mi Feng, Matthew Kay, Lane Harrison

Abstract: Graphical perception studies typically measure visualization encoding effectiveness using the error of an "average observer", leading to canonical rankings of encodings for numerical attributes: e.g., position > area > angle > volume. Yet different people may vary in their ability to read different visualization types, leading to variance in this ranking across individuals not captured by population-level metrics using "average observer" models. One way we can bridge this gap is by recasting classic visual perception tasks as tools for assessing individual performance, in addition to overall visualization performance. In this paper we replicate and extend Cleveland and McGill's graphical comparison experiment using Bayesian multilevel regression, using these models to explore individual differences in visualization skill from multiple perspectives. The results from experiments and modeling indicate that some people show patterns of accuracy that credibly deviate from the canonical rankings of visualization effectiveness. We discuss implications of these findings, such as a need for new ways to communicate visualization effectiveness to designers, how patterns in individuals' responses may show systematic biases and strategies in visualization judgment, and how recasting classic visual perception tasks as tools for assessing individual performance may offer new ways to quantify aspects of visualization literacy.
Experiment data, source code, and analysis scripts are available at the following repository: https://osf.io/8ub7t/?view_only=9be4798797404a4397be3c6fc2a68cc0.

Submitted 21 December, 2022; v1 submitted 20 December, 2022; originally announced December 2022.

Comments: 16 pages, 9 figures

ACM Class: H.5.0

Journal ref: IEEE Transactions on Visualization and Computer Graphics 2022

arXiv:2212.10166 [pdf, other]

doi 10.1145/3576050.3576149

Protected Attributes Tell Us Who, Behavior Tells Us How: A Comparison of Demographic and Behavioral Oversampling for Fair Student Success Modeling

Authors: Jade Maï Cock, Muhammad Bilal, Richard Davis, Mirko Marras, Tanja Käser

Abstract: Algorithms deployed in education can shape the learning experience and success of a student. It is therefore important to understand whether and how such algorithms might create inequalities or amplify existing biases. In this paper, we analyze the fairness of models which use behavioral data to identify at-risk students and suggest two novel pre-processing approaches for bias mitigation. Based on the concept of intersectionality, the first approach involves intelligent oversampling on combinations of demographic attributes. The second approach does not require any knowledge of demographic attributes and is based on the assumption that such attributes are a (noisy) proxy for student behavior. We hence propose to directly oversample different types of behaviors identified in a cluster analysis. We evaluate our approaches on data from (i) an open-ended learning environment and (ii) a flipped classroom course. Our results show that both approaches can mitigate model bias. Directly oversampling on behavior is a valuable alternative, when demographic metadata is not available. Source code and extended results are provided at https://github.com/epfl-ml4ed/behavioral-oversampling.

Submitted 20 December, 2022; originally announced December 2022.

Comments: Accepted as a full paper at LAK 2023: The 13th International Learning Analytics and Knowledge Conference, 13-17 of March 2023, Arlington

arXiv:2212.08783 [pdf, other]

doi 10.3847/PSJ/aca46d

Spectroscopic mapping of Io's surface with HST/STIS: SO$_2$ frost, sulfur allotropes, and large-scale compositional patterns

Authors: Samantha K. Trumbo, M. Ryleigh Davis, Benjamin Cassese, Michael E. Brown

Abstract: Io's intense volcanic activity results in one of the most colorful surfaces in the solar system. Ultraviolet and visible-wavelength observations of Io are critical to uncovering the chemistry behind its volcanic hues. Here, we present global, spatially resolved UV-visible spectra of Io from the Space Telescope Imaging Spectrograph on the Hubble Space Telescope (HST), which bridge the gap between previous highly resolved imagery and disk-integrated spectroscopy, to provide an unprecedented combination of spatial and spectral detail. We use this comprehensive dataset to investigate spectral endmembers, map observed spectral features associated with SO$_2$ frost and other sulfur species, and explore possible compositions in the context of Io surface processes. In agreement with past observations, our results are consistent with extensive equatorial SO$_2$ frost deposits that are stable over multi-decade timescales, widespread sulfur-rich plains surrounding the SO$_2$ deposits, and the enrichment of Pele's pyroclastic ring and the high-latitude regions in metastable short-chain sulfur allotropes.

Submitted 16 December, 2022; originally announced December 2022.

Comments: 15 pages, 8 figures, published in PSJ

Journal ref: The Planetary Science Journal, 3, 272 (2022)

arXiv:2211.15781 [pdf]

3UCubed: The IMAP Student Collaboration CubeSat Project

Authors: Marcus Alfred, Sonya Smith, Charles Kim, Carissma McGee, Ruth Davis, Myles Pope, Taran Richardson, Trinity Sager, Avery Williams, Matthew Gales, Wilson Jean Baptiste, Tyrese Kierstdet, Oluwatamilore Ogunbanjo, Laura Peticolas, Lynn Cominsky, Garrett Jernigan, Jeffrey Reedy, Doug Clarke, Sabrina Blais, Erik Castellanos-Vasquez, Jack Dawson, Erika Diaz Ramirez, Walter Foster, Cristopher Gopar Carreno, Haley Joerger, et al. (17 additional authors not shown)

Abstract: The 3UCubed project is a 3U CubeSat being jointly developed by the University of New Hampshire, Sonoma State University, and Howard University as a part of the NASA Interstellar Mapping and Acceleration Probe, IMAP, student collaboration. This project comprises a multidisciplinary team of undergraduate students from all three universities. The mission goal of 3UCubed is to understand how Earth's polar upper atmosphere, the thermosphere in Earth's auroral regions, responds to particle precipitation, solar wind forcing, and internal magnetospheric processes. 3UCubed includes two instruments with rocket heritage to achieve the science mission: an ultraviolet photomultiplier tube, UVPMT, and an electron retarding potential analyzer, ERPA. The spacecraft bus consists of the following subsystems: Attitude Determination and Control, Command and Data Handling, Power, Communication, Structural, and Thermal. Currently, the project is in the post-PDR stage, starting to build and test engineering models to develop a FlatSat prior to critical design review in 2023. The goal is to launch at least one 3U CubeSat to collect science data close to the anticipated peak of Solar Cycle 25 around July 2025. Our mother mission, IMAP, is also projected to launch in 2025, which will let us jointly analyze the science data of the main mission, providing the solar wind measurements and inputs to the magnetosphere, with that of 3UCubed, providing the response of Earth's cusp to these inputs.

Submitted 28 November, 2022; originally announced November 2022.

arXiv:2211.15407 [pdf]

doi 10.2196/40408

Fine-tuned Sentiment Analysis of COVID-19 Vaccine-Related Social Media Data: Comparative Study

Authors: Chad A Melton, Brianna M White, Robert L Davis, Robert A Bednarczyk, Arash Shaban-Nejad

Abstract: This study investigated and compared public sentiment related to COVID-19 vaccines expressed on two popular social media platforms, Reddit and Twitter, harvested from January 1, 2020, to March 1, 2022. To accomplish this task, we created a fine-tuned DistilRoBERTa model to predict sentiments of approximately 9.5 million Tweets and 70 thousand Reddit comments. To fine-tune our model, our team manually labeled the sentiment of 3600 Tweets and then augmented our dataset by the method of back-translation. Text sentiment for each social media platform was then classified with our fine-tuned model using Python and the Huggingface sentiment analysis pipeline. Our results determined that the average sentiment expressed on Twitter was more negative (52% positive) than positive and the sentiment expressed on Reddit was more positive than negative (53% positive). Though average sentiment was found to vary between these social media platforms, both displayed similar behavior related to sentiment shared at key vaccine-related developments during the pandemic. Considering this similar trend in shared sentiment demonstrated across social media platforms, Twitter and Reddit continue to be valuable data sources that public health officials can utilize to strengthen vaccine confidence and combat misinformation. As the spread of misinformation poses a range of psychological and psychosocial risks (anxiety, fear, etc.), there is an urgency in understanding the public perspective and attitude toward shared falsities.
Comprehensive educational delivery systems tailored to the population's expressed sentiments that facilitate digital literacy, health information-seeking behavior, and precision health promotion could aid in clarifying such misinformation.

Submitted 17 October, 2022; originally announced November 2022.

Comments: 11 Pages, 5 Figures, and 1 Table

MSC Class: 92-11 ACM Class: I.2.7

Journal ref: Journal of Medical Internet Research (JMIR) 2022;24(10):e40408

arXiv:2211.13172 [pdf, other]

Kernel PCA for multivariate extremes

Authors: Marco Avella-Medina, Richard A. Davis, Gennady Samorodnitsky

Abstract: We propose kernel PCA as a method for analyzing the dependence structure of multivariate extremes and demonstrate that it can be a powerful tool for clustering and dimension reduction. Our work provides some theoretical insight into the preimages obtained by kernel PCA, demonstrating that under certain conditions they can effectively identify clusters in the data. We build on these new insights to characterize rigorously the performance of kernel PCA based on an extremal sample, i.e., the angular part of random vectors for which the radius exceeds a large threshold. More specifically, we focus on the asymptotic dependence of multivariate extremes characterized by the angular or spectral measure in extreme value theory and provide a careful analysis in the case where the extremes are generated from a linear factor model. We give theoretical guarantees on the performance of kernel PCA preimages of such extremes by leveraging their asymptotic distribution together with Davis-Kahan perturbation bounds. Our theoretical findings are complemented with numerical experiments illustrating the finite sample performance of our methods.

Submitted 23 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.
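
The "extremal sample" mentioned in the abstract above (the angular part of the vectors whose radius exceeds a large threshold) can be illustrated with a small sketch. This is only an illustration under stated assumptions, not the authors' code: the Euclidean norm, the 95% empirical quantile used as threshold, the function name `extremal_sample`, and the toy Pareto data are all hypothetical choices. The kernel PCA step itself is not shown.

```python
import math
import random

def extremal_sample(points, quantile=0.95):
    """Return the angular parts x/||x|| of the points whose Euclidean radius
    exceeds the empirical `quantile` of all radii (assumed threshold rule)."""
    radii = [math.sqrt(sum(c * c for c in p)) for p in points]
    threshold = sorted(radii)[int(quantile * (len(radii) - 1))]
    return [
        tuple(c / r for c in p)          # project onto the unit sphere
        for p, r in zip(points, radii)
        if r > threshold                 # keep only the largest observations
    ]

# Hypothetical heavy-tailed toy data: independent Pareto coordinates (all >= 1).
rng = random.Random(0)
data = [(rng.paretovariate(2.0), rng.paretovariate(2.0)) for _ in range(1000)]
angles = extremal_sample(data, quantile=0.95)
print(len(angles), "extreme points kept; first angular part:", angles[0])
```

The returned unit vectors are what a clustering or dimension-reduction method would then take as input; a different norm or threshold rule would change which points are kept.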

arXiv:2211.06799 [pdf]

Bead-Droplet Reactor for High-Fidelity Solid-Phase Enzymatic DNA Synthesis

Authors: Punnag Padhy, Mohammad Asif Zaman, Michael Anthony Jensen, Yao-Te Cheng, Yogi Huang, Ludwig Galambos, Ronald Wayne Davis, Lambertus Hesselink

Abstract: Solid-phase synthesis techniques underpin the synthesis of DNA, oligopeptides, oligosaccharides, and combinatorial libraries for drug discovery. State-of-the-art solid-phase synthesizers can produce oligonucleotides up to 200-300 nucleotides while using excess reagents. Accumulated errors over multiple reaction cycles prevent the synthesis of longer oligonucleotides for the genome scale engineering of synthetic biological systems. The sources of these errors in synthesis columns remain poorly understood. Here we show that bead-bead stacking significantly contributes to reaction errors in columns by analyzing enzymatic coupling of fluorescently labelled nucleotides onto the initiated beads along with porosity, particle tracking and diffusion calculations. To circumvent stacking, we introduce the dielectrophoretic bead-droplet reactor (DBDR), a novel approach to synthesize on individual microbeads within microdroplets. Dielectrophoretic force overcomes the droplet-medium interfacial tension to encapsulate and eject individual beads from microdroplets in a droplet microfluidic device. Faster reagent diffusion in droplets, and non-uniform electric field induced enhancement in reagent concentration at its surface can improve reaction fidelities in DBDR. Fluorescence comparisons suggest around 3-fold enhancement of reaction fidelity compared to columns. DBDR can potentially enable the high-purity synthesis of arbitrarily long strands of DNA to meet the emerging demands in healthcare, environment, agriculture, materials, and computing.

Submitted 1 June, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

arXiv:2209.14392 [pdf]

Design and CT imaging of Casper, an anthropomorphic breathing thorax phantom

Authors: Josie Laidlaw, Nicolas Earl, Nihal Shavdia, Rayna Davis, Sarah Mayer, Dmitri Karaman, Devon Richtsmeier, Pierre-Antoine Rodesch, Magdalena Bazalova-Carter

Abstract: The goal of this work was to build an anthropomorphic thorax phantom capable of breathing motion with materials mimicking human tissues in x-ray imaging applications. The thorax phantom, named Casper, was composed of resin (body), foam (lungs), glow polylactic acid (bones) and natural polylactic acid (tumours placed in the lungs). X-ray attenuation properties of all materials prior to manufacturing were evaluated by means of photon-counting computed tomography (CT) imaging on a table-top system. Breathing motion was achieved by a scotch-yoke mechanism with diaphragm motion frequencies of 10 - 20 rpm and displacements of 1 to 2 cm. Casper was manufactured by means of 3D printing of moulds and ribs and assembled in a complex process. The final phantom was then scanned using a clinical CT scanner to evaluate material CT numbers and the extent of tumour motion. Casper CT numbers were close to human CT numbers for soft tissue (46 HU), ribs (125 HU), lungs (-840 HU) and tumours (-45 HU). For a 2 cm diaphragm displacement the largest tumour displacement was 0.7 cm. The five tumour volumes were accurately assessed in the static CT images with a mean absolute error of 4.3%. Tumour sizes were either underestimated for smaller tumours or overestimated for larger tumours in dynamic CT images due to motion blurring with a mean absolute difference from true volumes of 10.3%.

Submitted 28 September, 2022; originally announced September 2022.

Comments: This article was submitted to Physics in Medicine & Biology on 21 September 2022

arXiv:2207.14759 [pdf, other]

Perfectly Matchable Set Polynomials and $h^*$-polynomials for Stable Set Polytopes of Complements of Graphs

Authors: Robert Davis, Florian Kohl

Abstract: A subset $S$ of vertices of a graph $G$ is called a perfectly matchable set of $G$ if the subgraph induced by $S$ contains a perfect matching. The perfectly matchable set polynomial of $G$, first made explicit by Ohsugi and Tsuchiya, is the (ordinary) generating function $p(G; z)$ for the number of perfectly matchable sets of $G$. In this work, we provide explicit recurrences for computing $p(G; z)$ for an arbitrary (simple) graph and use these to compute the Ehrhart $h^*$-polynomials for certain lattice polytopes. Namely, we show that $p(G; z)$ is the $h^*$-polynomial for certain classes of stable set polytopes, whose vertices correspond to stable sets of $G$.

Submitted 29 July, 2022; originally announced July 2022.

Comments: 15 pages
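
To make the definition in the abstract above concrete, here is a brute-force sketch that counts the perfectly matchable sets of a small graph by size. It is not the recurrence from the paper, only an exhaustive check suitable for tiny graphs. Two conventions are assumptions of this sketch: the empty set is counted as perfectly matchable, and counts are indexed by $|S|$ (the exponent convention of $p(G; z)$ itself is not given in the abstract). The function names are hypothetical.

```python
from itertools import combinations

def has_perfect_matching(vertices, adj):
    """True if the subgraph induced by `vertices` contains a perfect matching."""
    if not vertices:
        return True                      # assumption: empty set counts
    if len(vertices) % 2:
        return False                     # odd sets can never be perfectly matched
    v = min(vertices)
    rest = vertices - {v}
    # v must be matched to one of its neighbours inside the set; try each.
    return any(has_perfect_matching(rest - {u}, adj) for u in adj[v] & rest)

def matchable_set_counts(n, edges):
    """Map each size k to the number of perfectly matchable k-subsets of {0..n-1}."""
    adj = {v: set() for v in range(n)}
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    counts = {}
    for k in range(0, n + 1, 2):
        c = sum(
            has_perfect_matching(frozenset(s), adj)
            for s in combinations(range(n), k)
        )
        if c:
            counts[k] = c
    return counts

# 4-cycle: the empty set, the four edges, and the full vertex set qualify.
cycle4 = [(0, 1), (1, 2), (2, 3), (3, 0)]
print(matchable_set_counts(4, cycle4))  # {0: 1, 2: 4, 4: 1}
```

The enumeration is exponential in the number of vertices, which is why explicit recurrences such as those the abstract describes matter for larger graphs.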

arXiv:2207.06883 [pdf, other]

RF-Photonic Deep Learning Processor with Shannon-Limited Data Movement

Authors: Ronald Davis III, Zaijun Chen, Ryan Hamerly, Dirk Englund

Abstract: Edholm's Law predicts exponential growth in data rate and spectrum bandwidth for communications and is forecasted to remain true for the upcoming deployment of 6G. Compounding this issue is the exponentially increasing demand for deep neural network (DNN) compute, including DNNs for signal processing. However, the slowing of Moore's Law due to the limitations of transistor-based electronics means that completely new paradigms for computing will be required to meet these increasing demands for advanced communications. Optical neural networks (ONNs) are promising DNN accelerators with ultra-low latency and energy consumption. Yet state-of-the-art ONNs struggle with scalability and implementing linear with in-line nonlinear operations. Here we introduce our multiplicative analog frequency transform ONN (MAFT-ONN) that encodes the data in the frequency domain, achieves matrix-vector products in a single shot using photoelectric multiplication, and uses a single electro-optic modulator for the nonlinear activation of all neurons in each layer. We experimentally demonstrate the first hardware accelerator that computes fully-analog deep learning on raw RF signals, performing single-shot modulation classification with 85% accuracy, where a 'majority vote' multi-measurement scheme can boost the accuracy to 95% within 5 consecutive measurements. In addition, we demonstrate frequency-domain finite impulse response (FIR) linear-time-invariant (LTI) operations, enabling a powerful combination of traditional and AI signal processing. We also demonstrate the scalability of our architecture by computing nearly 4 million fully-analog multiplies-and-accumulates for MNIST digit classification. Our latency estimation model shows that due to the Shannon capacity-limited analog data movement, MAFT-ONN is hundreds of times faster than traditional RF receivers operating at their theoretical peak performance.

Submitted 6 June, 2024; v1 submitted 8 July, 2022; originally announced July 2022.

Comments: This is a substantial improvement to our initial manuscript titled "Frequency-Encoded Deep Learning with Speed-of-Light Dominated Latency," adding both new experiments and analyses. In this new work we explicitly expand and demonstrate the practical applications of our processing architecture regarding RF signal processing for advanced communications

arXiv:2207.05329 [pdf, other]

doi 10.1038/s41566-023-01233-w

Deep Learning with Coherent VCSEL Neural Networks

Authors: Zaijun Chen, Alexander Sludds, Ronald Davis, Ian Christen, Liane Bernstein, Tobias Heuser, Niels Heermeier, James A. Lott, Stephan Reitzenstein, Ryan Hamerly, Dirk Englund

Abstract: Deep neural networks (DNNs) are reshaping the field of information processing. With their exponential growth challenging existing electronic hardware, optical neural networks (ONNs) are emerging to process DNN tasks in the optical domain with high clock rates, parallelism and low-loss data transmission. However, to explore the potential of ONNs, it is necessary to investigate the full-system performance incorporating the major DNN elements, including matrix algebra and nonlinear activation. Existing challenges to ONNs are high energy consumption due to low electro-optic (EO) conversion efficiency, low compute density due to large device footprint and channel crosstalk, and long latency due to the lack of inline nonlinearity. Here we experimentally demonstrate an ONN system that simultaneously overcomes all these challenges. We exploit neuron encoding with volume-manufactured micron-scale vertical-cavity surface-emitting laser (VCSEL) transmitter arrays that exhibit high EO conversion (<5 attojoule/symbol with $V_π$=4 mV), high operation bandwidth (up to 25 GS/s), and compact footprint (<0.01 mm$^2$ per device). Photoelectric multiplication allows low-energy matrix operations at the shot-noise quantum limit. Homodyne detection-based nonlinearity enables nonlinear activation with instantaneous response. The full-system energy efficiency and compute density reach 7 femtojoules per operation (fJ/OP) and 25 TeraOP/(mm$^2\cdot$ s), both representing a >100-fold improvement over state-of-the-art digital computers, with substantially several more orders of magnitude for future improvement. Beyond neural network inference, its feature of rapid weight updating is crucial for training deep learning models. Our technique opens an avenue to large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices.

Submitted 12 July, 2022; originally announced July 2022.

Comments: 10 pages, 5 figures

arXiv:2206.12507 [pdf, other]

doi 10.1103/PhysRevApplied.18.064066

Electromagnetic Nonreciprocity in a Magnetized Plasma Circulator

Authors: Feng Li, Robert J. Davis, Sara M. Kandil, Daniel F. Sievenpiper

Abstract: Nonreciprocal transport of electromagnetic waves within magnetized plasma is a powerful building block towards understanding and exploiting the properties of more general topological systems. Much recent attention has been paid to the theoretical issues of wave interaction within such a medium, but there is a lack of experimental verification that such systems can be viable in a lab or industrial setting. This work provides an experimental proof-of-concept by demonstrating nonreciprocity in a unit component, a microwave plasma circulator. We design an E-plane Y junction plasma circulator operating in the range of 4 to 6 GHz using standardized waveguide specifications. From both simulations and experiments, we observe wide band isolation for the power transmission through the circulator. The performance and the frequency band of the circulator can be easily tuned by changing the plasma density and the magnetic field strength. By linking simulations and experimental results, we estimate the plasma density for the device.

Submitted 29 November, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

Comments: Revision 2: Added a section to introduce the scattering matrix in nonreciprocal microwave systems, with additional references. Fixed typos in Greek letters. Swapped Fig. 1 and Fig. 2 for clarity. Defined the technical terms in Section II to avoid confusion. Corrected the value of the minimum normalized isolated power when a positive magnetic field was applied

arXiv:2204.04405 [pdf]

Finding the ET Signal from the Cosmic Noise

Authors: Ross Davis

Abstract: This paper highlights a methodological approach designed to enhance the search for extraterrestrial intelligence (SETI) by hypothesizing that a transmission technosignature would likely have two features: 1) be wideband in the microwave or higher frequency range that originates from a hub within a supposed ET interplanetary navigation/communication (nav/comm) network, and 2) contain x-ray pulsar-based navigation (XNAV) metadata. Potential contributions to the field include improved accuracy in finding transmission technosignatures and other technosignatures in the electromagnetic spectrum, a common standard in reaching a Schelling Point (a mutual realization of how we and ETs can find each other), and operationalizing models such as the Drake Equation.

Submitted 18 February, 2024; v1 submitted 9 April, 2022; originally announced April 2022.

Comments: 13 pages. v2 incorporates a few grammatical corrections

arXiv:2203.06436 [pdf, ps, other]

Maximization of Mathai's Entropy under the Constraints of Generalized Gini and Gini mean difference indices and its Applications in Insurance

Authors: Rhea Davis, Nicy Sebastian

Abstract: Statistical Physics, Diffusion Entropy Analysis and Information Theory commonly use Mathai's entropy, which measures the randomness of probability laws, whereas welfare economics and the Social Sciences commonly use the Gini index, which measures the evenness of probability laws. Motivated by the principle of maximal entropy, we explore the maximization of Mathai's entropy subject to the conditions in the following scenarios: (i) the conditions of a density function and fixed mean; (ii) the conditions of a density function and fixed Generalized Gini index. We also maximize Mathai's entropy subject to the constraints of a given Gini mean difference index and the conditions of a density function. The obtained maximum entropy distribution is fitted to the loss ratios (yearly data) for earthquake insurance in California from 1971 through 1994, and its performance is compared with that of some one-parameter distributions.

Submitted 12 March, 2022; originally announced March 2022.

arXiv:2202.09853 [pdf, ps, other]

Normalized Volumes of Type-PQ Adjacency Polytopes for Certain Classes of Graphs

Authors: Robert Davis, Joakim Jakovleski, Qizhe Pan

Abstract: The type-PQ adjacency polytope associated to a simple graph is a $0/1$-polytope containing valuable information about an underlying power network. Chen and the first author have recently demonstrated that, when the underlying graph $G$ is connected, the normalized volumes of the adjacency polytopes can be computed by counting sequences of nonnegative integers satisfying restrictions determined by $G$. This article builds upon their work, namely by showing that one of their main results -- the so-called "triangle recurrence" -- applies in a more general setting. Formulas for the normalized volumes when $G$ is obtained by deleting a path or a cycle from a complete graph are also established.

Submitted 20 February, 2022; originally announced February 2022.

arXiv:2201.05907 [pdf, ps, other]

Designing Topological Defect Lines Protected by Gauge-dependent Symmetry Indicators

Authors: Erda Wen, Dia'aaldin J. Bisharat, Robert J. Davis, Xiaozhen Yang, Daniel F. Sievenpiper

Abstract: Symmetry indicators are a modern tool for characterizing topological phases that require only minimal computational expense but provide an elegant means of designing practical devices. This paper demonstrates how a rotational symmetry indicator can be used to construct and characterize a topologically robust waveguide, which is then verified experimentally on a printed circuit board (PCB) platform. The design takes advantage of the real-space gauge-dependency of the symmetry indicators and adopts a $C_6$ lattice with simple shifts, forming a defect line supporting topological edge modes. It is shown that the modes can realize the same features as previous topological waveguides, but in addition possess a greater degree of reconfigurability and the unique ability to form a one-way termination. Moreover, the design illustrates the critical role real space information plays in determining the topological properties of photonic crystals, enabling a wider range of possible realizations.

Submitted 15 January, 2022; originally announced January 2022.

arXiv:2201.03495 [pdf, other]

doi 10.1103/PhysRevB.106.165403

Topologically Protected Edge States in Triangular Lattices

Authors: Robert J. Davis, Yun Zhou, Dia'aaldin J. Bisharat, Prabhakar R. Bandaru, Daniel F. Sievenpiper

Abstract: We describe the possibility for topologically robust edge states existing on interfaces of triangular lattices which are supported by rotational symmetries that are sensitive to boundary conditions. Such states are trivial from the perspective of Berry curvature, but result instead from an interplay between crystalline symmetries and finite boundary effects. Regardless, we show such states are in a distinct topological phase, provided the gauge-dependent symmetries are maintained. Such a model describes a number of recent bosonic experimental demonstrations on triangular lattices, the physics for which has thus far eluded explanation.

Submitted 30 September, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

Comments: 12 pages, 10 figures. Accepted to Physical Review B

arXiv:2201.02498 [pdf, ps, other]

Cauchy, normal and correlations versus heavy tails

Authors: Hui Xu, Joel Cohen, Richard Davis, Gennady Samorodnitsky

Abstract: A surprising result of Pillai and Meng (2016) showed that a transformation $\sum_{j=1}^n w_j X_j/Y_j$ of two iid centered normal random vectors, $(X_1,\ldots, X_n)$ and $(Y_1,\ldots, Y_n)$, $n>1$, for any weights $0\leq w_j\leq 1$, $ j=1,\ldots, n$, $\sum_{j=1}^n w_j=1$, has a Cauchy distribution regardless of any correlations within the normal vectors. The correlations appear to lose out in the competition with the heavy tails. To clarify how extensive this phenomenon is, we analyze two other transformations of two iid centered normal random vectors. These transformations are similar in spirit to the transformation considered by Pillai and Meng (2016). One transformation involves absolute values: $\sum_{j=1}^n w_j X_j/|Y_j|$. The second involves randomly stopped Brownian motions: $\sum_{j=1}^n w_j X_j\bigl(Y_j^{-2}\bigr)$, where $\bigl\{\bigl( X_1(t),\ldots, X_n(t)\bigr), \, t\geq 0\bigr\},\ n>1,$ is a Brownian motion with positive variances; $(Y_1,\ldots, Y_n)$ is a centered normal random vector with the same law as $( X_1(1),\ldots, X_n(1))$ and independent of it; and $X(Y^{-2})$ is the value of the Brownian motion $X(t)$ evaluated at the random time $t=Y^{-2}$. All three transformations result in a Cauchy distribution if the covariance matrix of the normal components is diagonal, or if all the correlations implied by the covariance matrix equal 1. However, while the transformation Pillai and Meng (2016) considered produces a Cauchy distribution regardless of the normal covariance matrix, the transformations we consider here do not always produce a Cauchy distribution. The correlations between jointly normal random variables are not always overwhelmed by the heaviness of the marginal tails. The mysteries of the connections between normal and Cauchy laws remain to be understood.

Submitted 7 January, 2022; originally announced January 2022.

MSC Class: 60E05; 60E07
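
The Pillai and Meng (2016) result quoted in the abstract above can be checked numerically with a small Monte Carlo sketch. This is an illustration only, for the special case $n=2$ with unit variances; the correlation `rho=0.8`, the weight `w=0.3`, the seed, and the function names are hypothetical choices. It compares the empirical quartiles of $w X_1/Y_1 + (1-w) X_2/Y_2$ with those of a standard Cauchy law, which are $-1$, $0$ and $+1$.

```python
import math
import random

def pillai_meng_draw(rng, rho, w):
    """One draw of w*X1/Y1 + (1-w)*X2/Y2, where X and Y are independent
    bivariate normal vectors with unit variances and correlation rho."""
    def correlated_pair():
        z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        # Cholesky construction of a correlated standard normal pair.
        return z1, rho * z1 + math.sqrt(1.0 - rho * rho) * z2
    x1, x2 = correlated_pair()
    y1, y2 = correlated_pair()
    return w * x1 / y1 + (1.0 - w) * x2 / y2

def empirical_quartiles(samples):
    """Return the (25%, 50%, 75%) order statistics of the sample."""
    s = sorted(samples)
    n = len(s)
    return s[n // 4], s[n // 2], s[(3 * n) // 4]

rng = random.Random(2016)
draws = [pillai_meng_draw(rng, rho=0.8, w=0.3) for _ in range(20000)]
q1, med, q3 = empirical_quartiles(draws)
# Quartiles are used because a Cauchy law has no mean or variance to compare.
print(f"empirical quartiles: {q1:.3f}, {med:.3f}, {q3:.3f} (standard Cauchy: -1, 0, 1)")
```

Replacing `y1` and `y2` by their absolute values in the last line of `pillai_meng_draw` gives the first of the two alternative transformations discussed in the abstract, for which the abstract states that the Cauchy law need not hold.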

arXiv:2111.12249 [pdf]

doi 10.1103/PhysRevApplied.19.024053

On-chip unidirectional waveguiding for surface acoustic waves along a defect line in a triangular lattice

Authors: Yun Zhou, Naiqing Zhang, Dia'aaldin J. Bisharat, Robert J. Davis, Zichen Zhang, James Friend, Prabhakar R. Bandaru, Daniel F. Sievenpiper

Abstract: The latest advances in topological physics have yielded a rich toolset to design highly robust wave transfer systems, for overcoming issues like beam steering and lateral diffraction in surface acoustic waves (SAWs). However, presently used designs for topologically protected SAWs have been largely limited to spin or valley-polarized phases, which rely on non-zero Berry curvature effects. Here we propose and experimentally demonstrate a highly robust SAW waveguide on lithium niobate (LiNbO3), based on a line defect within a true triangular phononic lattice, which instead employs an intrinsic chirality of phase vortices and maintains a zero Berry curvature. The guided SAW mode spans a wide bandwidth and shows confinement in the lateral direction with 3 dB attenuation within half of the unit-cell length. SAW routing around sharp bends has been demonstrated in such a waveguide, with less than ~4% reflection per bend. The waveguide has also been found robust for defect lines with different configurations. The fully on-chip system permits unidirectional SAW modes that are tightly bound to the waveguide, which provides a compact footprint ideal for miniaturization of practical applications and offers insight into the possibility of manipulating highly focused SAW propagation.

Submitted 23 November, 2021; originally announced November 2021.

arXiv:2111.07799 [pdf, other]

Spectral learning of multivariate extremes

Authors: Marco Avella Medina, Richard A. Davis, Gennady Samorodnitsky

Abstract: We propose a spectral clustering algorithm for analyzing the dependence structure of multivariate extremes. More specifically, we focus on the asymptotic dependence of multivariate extremes characterized by the angular or spectral measure in extreme value theory. Our work studies the theoretical performance of spectral clustering based on a random $k$-nearest neighbor graph constructed from an extremal sample, i.e., the angular part of random vectors for which the radius exceeds a large threshold. In particular, we derive the asymptotic distribution of extremes arising from a linear factor model and prove that, under certain conditions, spectral clustering can consistently identify the clusters of extremes arising in this model. Leveraging this result we propose a simple consistent estimation strategy for learning the angular measure. Our theoretical findings are complemented with numerical experiments illustrating the finite sample performance of our methods.

Submitted 1 August, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

arXiv:2108.11579 [pdf, other]

Modeling Item Response Theory with Stochastic Variational Inference

Authors: Mike Wu, Richard L. Davis, Benjamin W. Domingue, Chris Piech, Noah Goodman

Abstract: Item Response Theory (IRT) is a ubiquitous model for understanding human behaviors and attitudes based on their responses to questions. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially improving psychometric modeling leading to improved scientific understanding and public policy. However, while larger datasets allow for more flexible approaches, many contemporary algorithms for fitting IRT models may also have massive computational demands that forbid real-world application. To address this bottleneck, we introduce a variational Bayesian inference algorithm for IRT, and show that it is fast and scalable without sacrificing accuracy. Applying this method to five large-scale item response datasets from cognitive science and education yields higher log likelihoods and higher accuracy in imputing missing data than alternative inference algorithms. Using this new inference approach we then generalize IRT with expressive Bayesian models of responses, leveraging recent advances in deep learning to capture nonlinear item characteristic curves (ICC) with neural networks. Using an eighth-grade mathematics test from TIMSS, we show our nonlinear IRT models can capture interesting asymmetric ICCs. The algorithm implementation is open-source, and easily usable.

Submitted 28 July, 2022; v1 submitted 26 August, 2021; originally announced August 2021.

Comments: version two includes added experiments; 33 pages of content; 6 pages appendix; figures at the bottom. arXiv admin note: text overlap with arXiv:2002.00276

arXiv:2107.12315 [pdf, ps, other]

Facets and facet subgraphs of symmetric edge polytopes

Authors: Tianran Chen, Robert Davis, Evgeniia Korchevskaia

Abstract: Symmetric edge polytopes, a.k.a. PV-type adjacency polytopes, associated with undirected graphs have been defined and studied in several seemingly independent areas including number theory, discrete geometry, and dynamical systems. In particular, the authors are motivated by the study of the algebraic Kuramoto equations of unmixed form whose Newton polytopes are the symmetric edge polytopes. The interplay between the geometric structure of symmetric edge polytopes and the topological structure of the underlying graphs has been a recurring theme in recent studies. In particular, ``facet/face subgraphs'' have emerged as one of the central concepts in describing this symmetry. Continuing along this line of inquiry we provide a complete description of the correspondence between facets/faces of a symmetric edge polytope and maximal bipartite subgraphs of the underlying connected graph.

Submitted 1 December, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

MSC Class: 52B20; 52B40 (Primary) 34C15 (Secondary) ACM Class: G.2.1; G.2.2

arXiv:2107.07533 [pdf, other]

doi 10.3847/1538-4357/ac12c8

Lower-Luminosity Obscured AGN Host Galaxies are Not Predominantly in Major-Merging Systems at Cosmic Noon

Authors: Erini Lambrides, Marco Chiaberge, Timothy Heckman, Allison Kirkpatrick, Eileen T. Meyer, Andreea Petric, Kirsten Hall, Arianna Long, Duncan J. Watts, Roberto Gilli, Raymond Simons, Kirill Tchernyshyov, Vicente Rodriguez-Gomez, Fabio Vito, Alexander De La Vega, Jeffrey R. Davis, Dale D Kocevski, Colin Norman

Abstract: For over 60 years, the scientific community has studied actively growing central super-massive black holes (active galactic nuclei -- AGN) but fundamental questions on their genesis remain unanswered. Numerical simulations and theoretical arguments show that black hole growth occurs during short-lived periods ($\sim$ 10$^{7}$ -10$^{8}$ yr) of powerful accretion. Major mergers are commonly invoked as the most likely dissipative process to trigger the rapid fueling of AGN. If the AGN-merger paradigm is true, we expect galaxy mergers to coincide with black hole accretion during a heavily obscured AGN phase (N$_H$ $ > 10^{23}$ cm$^{-2}$). Starting from one of the largest samples of obscured AGN at 0.5 $<$ $z$ $<$ 3.1, we select 40 non-starbursting lower-luminosity obscured AGN. We then construct a one-to-one matched redshift- and near-IR magnitude-matched non-starbursting inactive galaxy control sample. Combining deep color \textit{Hubble Space Telescope} imaging and a novel method of human classification, we test the merger-AGN paradigm prediction that heavily obscured AGN are strongly associated with galaxies undergoing a major merger. On the total sample of 80 galaxies, we estimate each individual classifier's accuracy at identifying merging galaxies/post-merging systems and isolated galaxies. We calculate the probability of each galaxy being in either a major merger or isolated system, given the accuracy of the human classifiers and the individual classifications of each galaxy. We do not find statistically significant evidence that obscured AGN at cosmic noon are predominately found in systems with evidence of significant merging/post-merging features.

Submitted 15 July, 2021; originally announced July 2021.

Comments: 19 pages, 11 figures, accepted in ApJ

arXiv:2107.06663 [pdf, other]

Time Series Estimation of the Dynamic Effects of Disaster-Type Shock

Authors: Richard Davis, Serena Ng

Abstract: This paper provides three results for SVARs under the assumption that the primitive shocks are mutually independent. First, a framework is proposed to accommodate a disaster-type variable with infinite variance into a SVAR. We show that the least squares estimates of the SVAR are consistent but have non-standard asymptotics. Second, the disaster shock is identified as the component with the largest kurtosis and whose impact effect is negative. An estimator that is robust to infinite variance is used to recover the mutually independent components. Third, an independence test on the residuals pre-whitened by the Choleski decomposition is proposed to test the restrictions imposed on a SVAR. The test can be applied whether the data have fat or thin tails, and to over as well as exactly identified models. Three applications are considered. In the first, the independence test is used to shed light on the conflicting evidence regarding the role of uncertainty in economic fluctuations. In the second, disaster shocks are shown to have short term economic impact arising mostly from feedback dynamics. The third uses the framework to study the dynamic effects of economic shocks post-covid.

Submitted 8 March, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

Showing 1–50 of 391 results for author: Davis, R