-
Geophysical Observations of the 24 September 2023 OSIRIS-REx Sample Return Capsule Re-Entry
Authors:
Elizabeth A. Silber,
Daniel C. Bowman,
Chris G. Carr,
David P. Eisenberg,
Brian R. Elbing,
Benjamin Fernando,
Milton A. Garcés,
Robert Haaser,
Siddharth Krishnamoorthy,
Charles A. Langston,
Yasuhiro Nishikawa,
Jeremy Webster,
Jacob F. Anderson,
Stephen Arrowsmith,
Sonia Bazargan,
Luke Beardslee,
Brant Beck,
Jordan W. Bishop,
Philip Blom,
Grant Bracht,
David L. Chichester,
Anthony Christe,
Kenneth Cummins,
James Cutts,
Lisa Danielson
, et al. (57 additional authors not shown)
Abstract:
Sample Return Capsules (SRCs) entering Earth's atmosphere at hypervelocity from interplanetary space are a valuable resource for studying meteor phenomena. The 24 September 2023 arrival of the OSIRIS-REx (Origins, Spectral Interpretation, Resource Identification, and Security-Regolith Explorer) SRC provided an unprecedented chance for geophysical observations of a well-characterized source with kn…
▽ More
Sample Return Capsules (SRCs) entering Earth's atmosphere at hypervelocity from interplanetary space are a valuable resource for studying meteor phenomena. The 24 September 2023 arrival of the OSIRIS-REx (Origins, Spectral Interpretation, Resource Identification, and Security-Regolith Explorer) SRC provided an unprecedented chance for geophysical observations of a well-characterized source with known parameters, including timing and trajectory. A collaborative effort involving researchers from 16 institutions executed a carefully planned geophysical observational campaign at strategically chosen locations, deploying over 400 ground-based sensors encompassing infrasound, seismic, distributed acoustic sensing (DAS), and GPS technologies. Additionally, balloons equipped with infrasound sensors were launched to capture signals at higher altitudes. This campaign (the largest of its kind so far) yielded a wealth of invaluable data anticipated to fuel scientific inquiry for years to come. The success of the observational campaign is evidenced by the near-universal detection of signals across instruments, both proximal and distal. This paper presents a comprehensive overview of the collective scientific effort, field deployment, and preliminary findings. The early findings have the potential to inform future space missions and terrestrial campaigns, contributing to our understanding of meteoroid interactions with planetary atmospheres. Furthermore, the dataset collected during this campaign will improve entry and propagation models as well as augment the study of atmospheric dynamics and shock phenomena generated by meteoroids and similar sources.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Captioning Visualizations with Large Language Models (CVLLM): A Tutorial
Authors:
Giuseppe Carenini,
Jordon Johnson,
Ali Salamatian
Abstract:
Automatically captioning visualizations is not new, but recent advances in large language models(LLMs) open exciting new possibilities. In this tutorial, after providing a brief review of Information Visualization (InfoVis) principles and past work in captioning, we introduce neural models and the transformer architecture used in generic LLMs. We then discuss their recent applications in InfoVis,…
▽ More
Automatically captioning visualizations is not new, but recent advances in large language models(LLMs) open exciting new possibilities. In this tutorial, after providing a brief review of Information Visualization (InfoVis) principles and past work in captioning, we introduce neural models and the transformer architecture used in generic LLMs. We then discuss their recent applications in InfoVis, with a focus on captioning. Additionally, we explore promising future directions in this field.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Equilibria in a Hypercube Spatial Voting Model
Authors:
A. Nicholas Day,
J. Robert Johnson
Abstract:
We give conditions for equilibria in the following Voronoi game on the discrete hypercube. Two players position themselves in $\{0,1\}^d$ and each receives payoff equal to the measure (under some probability distribution) of their Voronoi cell (the set of all points which are closer to them than to the other player). This game can be thought of as a discrete analogue of the Hotelling--Downs spatia…
▽ More
We give conditions for equilibria in the following Voronoi game on the discrete hypercube. Two players position themselves in $\{0,1\}^d$ and each receives payoff equal to the measure (under some probability distribution) of their Voronoi cell (the set of all points which are closer to them than to the other player). This game can be thought of as a discrete analogue of the Hotelling--Downs spatial voting model in which the political spectrum is determined by $d$ binary issues rather than a continuous interval.
We observe that if an equilibrium does exist then it must involve the two players co-locating at the majority point (ie the point representing majority opinion on each separate issue). Our main result is that a sufficient condition for an equilibrium is that on each issue the majority option is held by at least $\frac{3}{4}$ of voters. The value $\frac{3}{4}$ can be improved slightly in a way that depends on $d$ and with this improvement the result is best possible. We give similar sufficient conditions for the existence of a local equilibrium.
We also analyse the situation where the distribution is a mix of two product measures. We show that either there is an equilibrium or the best response to the majority point is its antipode.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
The Design, Implementation, and Performance of the LZ Calibration Systems
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (179 additional authors not shown)
Abstract:
LUX-ZEPLIN (LZ) is a tonne-scale experiment searching for direct dark matter interactions and other rare events. It is located at the Sanford Underground Research Facility (SURF) in Lead, South Dakota, USA. The core of the LZ detector is a dual-phase xenon time projection chamber (TPC), designed with the primary goal of detecting Weakly Interacting Massive Particles (WIMPs) via their induced low e…
▽ More
LUX-ZEPLIN (LZ) is a tonne-scale experiment searching for direct dark matter interactions and other rare events. It is located at the Sanford Underground Research Facility (SURF) in Lead, South Dakota, USA. The core of the LZ detector is a dual-phase xenon time projection chamber (TPC), designed with the primary goal of detecting Weakly Interacting Massive Particles (WIMPs) via their induced low energy nuclear recoils. Surrounding the TPC, two veto detectors immersed in an ultra-pure water tank enable reducing background events to enhance the discovery potential. Intricate calibration systems are purposely designed to precisely understand the responses of these three detector volumes to various types of particle interactions and to demonstrate LZ's ability to discriminate between signals and backgrounds. In this paper, we present a comprehensive discussion of the key features, requirements, and performance of the LZ calibration systems, which play a crucial role in enabling LZ's WIMP-search and its broad science program. The thorough description of these calibration systems, with an emphasis on their novel aspects, is valuable for future calibration efforts in direct dark matter and other rare-event search experiments.
△ Less
Submitted 20 June, 2024; v1 submitted 2 May, 2024;
originally announced June 2024.
-
Trials and Tribulations in the Reanalysis of KELT-24 b: a Case Study for the Importance of Stellar Modeling
Authors:
Mark R. Giovinazzi,
Bryson Cale,
Jason D. Eastman,
Joseph E. Rodriguez,
Cullen H. Blake,
Keivan G. Stassun,
Thomas G. Beatty,
Nate McCrady,
Andrew Vanderburg,
Michelle Kunimoto,
Adam L. Kraus,
Joseph Twicken,
Cayla M. Dedrick,
Jonathan Horner,
John A. Johnson,
Samson A. Johnson,
Peter Plavchan,
David H. Sliski,
Maurice L. Wilson,
Robert A. Wittenmyer,
Jason T. Wright,
Marshall C. Johnson,
Mark E. Rose,
Matthew Cornachione
Abstract:
We present a new analysis of the KELT-24 system, comprising a well-aligned hot Jupiter, KELT-24~b, and a bright ($V=8.3$), nearby ($d=96.9~\mathrm{pc}$) F-type host star. KELT-24~b was independently discovered by two groups in 2019, with each reporting best-fit stellar parameters that were notably inconsistent. Here, we present three independent analyses of the KELT-24 system, each incorporating a…
▽ More
We present a new analysis of the KELT-24 system, comprising a well-aligned hot Jupiter, KELT-24~b, and a bright ($V=8.3$), nearby ($d=96.9~\mathrm{pc}$) F-type host star. KELT-24~b was independently discovered by two groups in 2019, with each reporting best-fit stellar parameters that were notably inconsistent. Here, we present three independent analyses of the KELT-24 system, each incorporating a broad range of photometric and spectroscopic data, including eight sectors of TESS photometry and more than 200 new radial velocities (RVs) from MINERVA. Two of these analyses use KELT-24's observed spectral energy distribution (SED) through a direct comparison to stellar evolutionary models, while our third analysis assumes an unknown additional body contributing to the observed broadband photometry and excludes the SED. Ultimately, we find that the models that include the SED are a poor fit to the available data, so we adopt the system parameters derived without it. We also highlight a single transit-like event observed by TESS, deemed likely to be an eclipsing binary bound to KELT-24, that will require follow-up observations to confirm. We discuss the potential of these additional bodies in the KELT-24 system as a possible explanation for the discrepancies between the results of the different modeling approaches, and explore the system for longer-period planets that may be weakly evident in the RV observations. The comprehensive investigations that we present not only increase the fidelity of our understanding of the KELT-24 system, but also serve as a blueprint for future stellar modeling in global analyses of exoplanet systems.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Probing the Scalar WIMP-Pion Coupling with the first LUX-ZEPLIN data
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. J. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (178 additional authors not shown)
Abstract:
Weakly interacting massive particles (WIMPs) may interact with a virtual pion that is exchanged between nucleons. This interaction channel is important to consider in models where the spin-independent isoscalar channel is suppressed. Using data from the first science run of the LUX-ZEPLIN dark matter experiment, containing 60 live days of data in a 5.5~tonne fiducial mass of liquid xenon, we repor…
▽ More
Weakly interacting massive particles (WIMPs) may interact with a virtual pion that is exchanged between nucleons. This interaction channel is important to consider in models where the spin-independent isoscalar channel is suppressed. Using data from the first science run of the LUX-ZEPLIN dark matter experiment, containing 60 live days of data in a 5.5~tonne fiducial mass of liquid xenon, we report the results on a search for WIMP-pion interactions. We observe no significant excess and set an upper limit of $1.5\times10^{-46}$~cm$^2$ at a 90\% confidence level for a WIMP mass of 33~GeV/c$^2$ for this interaction.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
The SemGuS Toolkit
Authors:
Keith J. C. Johnson,
Andrew Reynolds,
Thomas Reps,
Loris D'Antoni
Abstract:
Semantics-Guided Synthesis (SemGuS) is a programmable framework for defining synthesis problems in a domain- and solver-agnostic way. This paper presents the standardized SemGuS format, together with an open-source toolkit that provides a parser, a verifier, and enumerative SemGuS solvers. The paper also describes an initial set of SemGuS benchmarks, which form the basis for comparing SemGuS solve…
▽ More
Semantics-Guided Synthesis (SemGuS) is a programmable framework for defining synthesis problems in a domain- and solver-agnostic way. This paper presents the standardized SemGuS format, together with an open-source toolkit that provides a parser, a verifier, and enumerative SemGuS solvers. The paper also describes an initial set of SemGuS benchmarks, which form the basis for comparing SemGuS solvers, and presents an evaluation of the baseline enumerative solvers.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Modeling the distribution of insulin in pancreas
Authors:
Changbing Hu,
Junyuan Yang,
James D. Johnson,
Jiaxu Li
Abstract:
Maintenance of adequate physical and functional pancreatic $β$-cell mass is critical for the prevention or delay of diabetes mellitus. It is well established that insulin potently activates mitogenic and anti-apoptotic signaling cascades in cultured $β$-cells. Loss of $β$-cell insulin receptors is sufficient to induce type 2 diabetes in mice. However, it remains unclear whether the {\em in vitro}…
▽ More
Maintenance of adequate physical and functional pancreatic $β$-cell mass is critical for the prevention or delay of diabetes mellitus. It is well established that insulin potently activates mitogenic and anti-apoptotic signaling cascades in cultured $β$-cells. Loss of $β$-cell insulin receptors is sufficient to induce type 2 diabetes in mice. However, it remains unclear whether the {\em in vitro} effect in human islets and the {\em in vivo} effects in mice can be applied to human physiology. The major obstacle to a complete understanding of the effects of insulin's feedback in human pancreas is the absence of technology to measure the concentrations of insulin inside of pancreas. To contextualize recent {\em in vitro} data, it is essential to know the local concentration and distribution of insulin in pancreas. To this end, we continue to estimate the local insulin concentration within pancreas. In this paper, we investigate the distribution of insulin concentration along the pancreatic vein through a novel mathematical modeling approach using existing physiological data and islet imaging data, in contrast to our previous work focusing on the insulin level within an islet. Our studies suggest that, in response to an increase in glucose, the insulin concentration along the pancreatic vein increases nearly linearly in the fashion of increasing quicker in tail area but slower in head area depending of the initial distribution.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
MANTA: A Negative-Triangularity NASEM-Compliant Fusion Pilot Plant
Authors:
MANTA Collaboration,
G. Rutherford,
H. S. Wilson,
A. Saltzman,
D. Arnold,
J. L. Ball,
S. Benjamin,
R. Bielajew,
N. de Boucaud,
M. Calvo-Carrera,
R. Chandra,
H. Choudhury,
C. Cummings,
L. Corsaro,
N. DaSilva,
R. Diab,
A. R. Devitre,
S. Ferry,
S. J. Frank,
C. J. Hansen,
J. Jerkins,
J. D. Johnson,
P. Lunia,
J. van de Lindt,
S. Mackie
, et al. (16 additional authors not shown)
Abstract:
The MANTA (Modular Adjustable Negative Triangularity ARC-class) design study investigated how negative-triangularity (NT) may be leveraged in a compact, fusion pilot plant (FPP) to take a ``power-handling first" approach. The result is a pulsed, radiative, ELM-free tokamak that satisfies and exceeds the FPP requirements described in the 2021 National Academies of Sciences, Engineering, and Medicin…
▽ More
The MANTA (Modular Adjustable Negative Triangularity ARC-class) design study investigated how negative-triangularity (NT) may be leveraged in a compact, fusion pilot plant (FPP) to take a ``power-handling first" approach. The result is a pulsed, radiative, ELM-free tokamak that satisfies and exceeds the FPP requirements described in the 2021 National Academies of Sciences, Engineering, and Medicine report ``Bringing Fusion to the U.S. Grid". A self-consistent integrated modeling workflow predicts a fusion power of 450 MW and a plasma gain of 11.5 with only 23.5 MW of power to the scrape-off layer (SOL). This low $P_\text{SOL}$ together with impurity seeding and high density at the separatrix results in a peak heat flux of just 2.8 MW/m$^{2}$. MANTA's high aspect ratio provides space for a large central solenoid (CS), resulting in ${\sim}$15 minute inductive pulses. In spite of the high B fields on the CS and the other REBCO-based magnets, the electromagnetic stresses remain below structural and critical current density limits. Iterative optimization of neutron shielding and tritium breeding blanket yield tritium self-sufficiency with a breeding ratio of 1.15, a blanket power multiplication factor of 1.11, toroidal field coil lifetimes of $3100 \pm 400$ MW-yr, and poloidal field coil lifetimes of at least $890 \pm 40$ MW-yr. Following balance of plant modeling, MANTA is projected to generate 90 MW of net electricity at an electricity gain factor of ${\sim}2.4$. Systems-level economic analysis estimates an overnight cost of US\$3.4 billion, meeting the NASEM FPP requirement that this first-of-a-kind be less than US\$5 billion. The toroidal field coil cost and replacement time are the most critical upfront and lifetime cost drivers, respectively.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
The Data Acquisition System of the LZ Dark Matter Detector: FADR
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (190 additional authors not shown)
Abstract:
The Data Acquisition System (DAQ) for the LUX-ZEPLIN (LZ) dark matter detector is described. The signals from 745 PMTs, distributed across three subsystems, are sampled with 100-MHz 32-channel digitizers (DDC-32s). A basic waveform analysis is carried out on the on-board Field Programmable Gate Arrays (FPGAs) to extract information about the observed scintillation and electroluminescence signals.…
▽ More
The Data Acquisition System (DAQ) for the LUX-ZEPLIN (LZ) dark matter detector is described. The signals from 745 PMTs, distributed across three subsystems, are sampled with 100-MHz 32-channel digitizers (DDC-32s). A basic waveform analysis is carried out on the on-board Field Programmable Gate Arrays (FPGAs) to extract information about the observed scintillation and electroluminescence signals. This information is used to determine if the digitized waveforms should be preserved for offline analysis.
The system is designed around the Kintex-7 FPGA. In addition to digitizing the PMT signals and providing basic event selection in real time, the flexibility provided by the use of FPGAs allows us to monitor the performance of the detector and the DAQ in parallel to normal data acquisition.
The hardware and software/firmware of this FPGA-based Architecture for Data acquisition and Realtime monitoring (FADR) are discussed and performance measurements are described.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Is Flash Attention Stable?
Authors:
Alicia Golden,
Samuel Hsia,
Fei Sun,
Bilge Acun,
Basil Hosmer,
Ye** Lee,
Zachary DeVito,
Jeff Johnson,
Gu-Yeon Wei,
David Brooks,
Carole-Jean Wu
Abstract:
Training large-scale machine learning models poses distinct system challenges, given both the size and complexity of today's workloads. Recently, many organizations training state-of-the-art Generative AI models have reported cases of instability during training, often taking the form of loss spikes. Numeric deviation has emerged as a potential cause of this training instability, although quantify…
▽ More
Training large-scale machine learning models poses distinct system challenges, given both the size and complexity of today's workloads. Recently, many organizations training state-of-the-art Generative AI models have reported cases of instability during training, often taking the form of loss spikes. Numeric deviation has emerged as a potential cause of this training instability, although quantifying this is especially challenging given the costly nature of training runs. In this work, we develop a principled approach to understanding the effects of numeric deviation, and construct proxies to put observations into context when downstream effects are difficult to quantify. As a case study, we apply this framework to analyze the widely-adopted Flash Attention optimization. We find that Flash Attention sees roughly an order of magnitude more numeric deviation as compared to Baseline Attention at BF16 when measured during an isolated forward pass. We then use a data-driven analysis based on the Wasserstein Distance to provide upper bounds on how this numeric deviation impacts model weights during training, finding that the numerical deviation present in Flash Attention is 2-5 times less significant than low-precision training.
△ Less
Submitted 4 May, 2024;
originally announced May 2024.
-
Lightplane: Highly-Scalable Components for Neural 3D Fields
Authors:
Ang Cao,
Justin Johnson,
Andrea Vedaldi,
David Novotny
Abstract:
Contemporary 3D research, particularly in reconstruction and generation, heavily relies on 2D images for inputs or supervision. However, current designs for these 2D-3D map** are memory-intensive, posing a significant bottleneck for existing methods and hindering new applications. In response, we propose a pair of highly scalable components for 3D neural fields: Lightplane Render and Splatter, w…
▽ More
Contemporary 3D research, particularly in reconstruction and generation, heavily relies on 2D images for inputs or supervision. However, current designs for these 2D-3D map** are memory-intensive, posing a significant bottleneck for existing methods and hindering new applications. In response, we propose a pair of highly scalable components for 3D neural fields: Lightplane Render and Splatter, which significantly reduce memory usage in 2D-3D map**. These innovations enable the processing of vastly more and higher resolution images with small memory and computational costs. We demonstrate their utility in various applications, from benefiting single-scene optimization with image-level losses to realizing a versatile pipeline for dramatically scaling 3D reconstruction and generation. Code: \url{https://github.com/facebookresearch/lightplane}.
△ Less
Submitted 30 April, 2024;
originally announced April 2024.
-
Hyperplane Representations of Interventional Characteristic Imset Polytopes
Authors:
Benjamin Hollering,
Joseph Johnson,
Liam Solus
Abstract:
Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal disovery, which aims to infer…
▽ More
Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal disovery, which aims to infer a cause-effect structure from data. Linear optimization methods typically require a hyperplane representation of the feasible region, which has proven difficult to compute for CIM polytopes despite continued efforts. We solve this problem for CIM polytopes that are the convex hull of imsets associated to DAGs whose underlying graph of adjacencies is a tree. Our methods use the theory of toric fiber products as well as the novel notion of interventional CIM polytopes. Our solution is obtained as a corollary of a more general result for interventional CIM polytopes. The identified hyperplanes are applied to yield a linear optimization-based causal discovery algorithm for learning polytree causal networks from a combination of observational and interventional data.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Constraints On Covariant WIMP-Nucleon Effective Field Theory Interactions from the First Science Run of the LUX-ZEPLIN Experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. J. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (179 additional authors not shown)
Abstract:
The first science run of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time project chamber operating in the Sanford Underground Research Facility in South Dakota, USA, has reported leading limits on spin-independent WIMP-nucleon interactions and interactions described from a non-relativistic effective field theory (NREFT). Using the same 5.5~t fiducial mass and 60 live days of exposure we re…
▽ More
The first science run of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time project chamber operating in the Sanford Underground Research Facility in South Dakota, USA, has reported leading limits on spin-independent WIMP-nucleon interactions and interactions described from a non-relativistic effective field theory (NREFT). Using the same 5.5~t fiducial mass and 60 live days of exposure we report on the results of a relativistic extension to the NREFT. We present constraints on couplings from covariant interactions arising from the coupling of vector, axial currents, and electric dipole moments of the nucleon to the magnetic and electric dipole moments of the WIMP which cannot be described by recasting previous results described by an NREFT. Using a profile-likelihood ratio analysis, in an energy region between 0~keV$_\text{nr}$ to 270~keV$_\text{nr}$, we report 90% confidence level exclusion limits on the coupling strength of five interactions in both the isoscalar and isovector bases.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Resolving the size and charge of small particles: a predictive model of nanopore mechanics
Authors:
Samuel Bearden,
Tigran M. Abramyan,
Dmitry Gil,
Jessica Johnson,
Anton Murashko,
Sergei Makaev,
David Mai,
Alexander Baranchikov,
Vladimir Ivanov,
Vladimir Reukov,
Guigen Zhang
Abstract:
The movement of small particles and molecules through membranes is widespread and has far-reaching implications. Consequently, the development of mathematical models is essential for understanding these processes on a micro level, leading to deeper insights. In this endeavour, we suggested a model based on a set of empirical equations to predict the transport of substances through a solid-state na…
▽ More
The movement of small particles and molecules through membranes is widespread and has far-reaching implications. Consequently, the development of mathematical models is essential for understanding these processes on a micro level, leading to deeper insights. In this endeavour, we suggested a model based on a set of empirical equations to predict the transport of substances through a solid-state nanopore and the associated signals generated during their translocation. This model establishes analytical relationships between the ionic current and electrical double-layer potential observed during ana-lyte translocation and their size, charge, and mobility in an electrolyte solution. This framework allows for rapid interpretation and prediction of the nanopore system's behaviour and provides a means for quantitatively determining the physical properties of molecular analytes. To illustrate the analyt-ical capability of this model, ceria nanoparticles were investigated while undergoing oxidation or reduction within an original nanopore device. The re-sults obtained were found to be in good agreement with predictions from physicochemical methods. This developed approach and model possess transfer-able utility to various porous materials, thereby expediting research efforts in membrane characterization and the advancement of nano- and ultrafiltra-tion or electrodialysis technologies.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Investigating Resource-efficient Neutron/Gamma Classification ML Models Targeting eFPGAs
Authors:
Jyothisraj Johnson,
Billy Boxer,
Tarun Prakash,
Carl Grace,
Peter Sorensen,
Mani Tripathi
Abstract:
There has been considerable interest and resulting progress in implementing machine learning (ML) models in hardware over the last several years from the particle and nuclear physics communities. A big driver has been the release of the Python package, hls4ml, which has enabled porting models specified and trained using Python ML libraries to register transfer level (RTL) code. So far, the primary…
▽ More
There has been considerable interest and resulting progress in implementing machine learning (ML) models in hardware over the last several years from the particle and nuclear physics communities. A big driver has been the release of the Python package, hls4ml, which has enabled porting models specified and trained using Python ML libraries to register transfer level (RTL) code. So far, the primary end targets have been commercial FPGAs or synthesized custom blocks on ASICs. However, recent developments in open-source embedded FPGA (eFPGA) frameworks now provide an alternate, more flexible pathway for implementing ML models in hardware. These customized eFPGA fabrics can be integrated as part of an overall chip design. In general, the decision between a fully custom, eFPGA, or commercial FPGA ML implementation will depend on the details of the end-use application. In this work, we explored the parameter space for eFPGA implementations of fully-connected neural network (fcNN) and boosted decision tree (BDT) models using the task of neutron/gamma classification with a specific focus on resource efficiency. We used data collected using an AmBe sealed source incident on Stilbene, which was optically coupled to an OnSemi J-series SiPM to generate training and test data for this study. We investigated relevant input features and the effects of bit-resolution and sampling rate as well as trade-offs in hyperparameters for both ML architectures while tracking total resource usage. The performance metric used to track model performance was the calculated neutron efficiency at a gamma leakage of 10$^{-3}$. The results of the study will be used to aid the specification of an eFPGA fabric, which will be integrated as part of a test chip.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Service Weaver: A Promising Direction for Cloud-native Systems?
Authors:
Jacoby Johnson,
Subash Kharel,
Alan Mannamplackal,
Amr S. Abdelfattah,
Tomas Cerny
Abstract:
Cloud-native and microservice architectures have taken over the development world by storm. While being incredibly scalable and resilient, microservice architectures also come at the cost of increased overhead to build and maintain. Google's Service Weaver aims to simplify the complexities associated with implementing cloud-native systems by introducing the concept of a single modular binary compo…
▽ More
Cloud-native and microservice architectures have taken over the development world by storm. While being incredibly scalable and resilient, microservice architectures also come at the cost of increased overhead to build and maintain. Google's Service Weaver aims to simplify the complexities associated with implementing cloud-native systems by introducing the concept of a single modular binary composed of agent-like components, thereby abstracting away the microservice architecture notion of individual services. While Service Weaver presents a promising approach to streamline the development of cloud-native applications and addresses nearly all significant aspects of conventional cloud-native systems, there are existing tradeoffs affecting the overall functionality of the system. Notably, Service Weaver's straightforward implementation and deployment of components alleviate the overhead of constructing a complex microservice architecture. However, it is important to acknowledge that certain features, including separate code bases, routing mechanisms, resiliency, and security, are presently lacking in the framework.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
Modeling the Galactic Chemical Evolution of Helium
Authors:
Miqaela K. Weller,
David H. Weinberg,
James W. Johnson
Abstract:
We examine the galactic chemical evolution (GCE) of $^4$He in one-zone and multi-zone models, with particular attention to theoretical predictions and empirical constraints on IMF-averaged yields. Published models of massive star winds and core collapse supernovae span a factor of 2 -- 3 in the IMF-averaged $^4$He yield, $y\mathrm{_{He}^{CC}}$. Published models of intermediate mass, asymptotic gia…
▽ More
We examine the galactic chemical evolution (GCE) of $^4$He in one-zone and multi-zone models, with particular attention to theoretical predictions and empirical constraints on IMF-averaged yields. Published models of massive star winds and core collapse supernovae span a factor of 2 -- 3 in the IMF-averaged $^4$He yield, $y\mathrm{_{He}^{CC}}$. Published models of intermediate mass, asymptotic giant branch (AGB) stars show better agreement on the IMF-averaged yield, $y\mathrm{_{He}^{AGB}}$, and they predict that more than half of this yield comes from stars with $M=4-8 M_\odot$, making AGB $^4$He enrichment rapid compared to Fe enrichment from Type Ia supernovae. Although our GCE models include many potentially complicating effects, the short enrichment time delay and mild metallicity dependence of the predicted yields makes the results quite simple: across a wide range of metallicity and age, the non-primordial $^4$He mass fraction $ΔY = Y-Y_{\mathrm{P}}$ is proportional to the abundance of promptly produced $α$-elements, like oxygen, with $ΔY/Z_{\mathrm{O}} \approx (y\mathrm{_{He}^{CC}}+y\mathrm{_{He}^{AGB}})/y\mathrm{_{O}^{CC}}$. Reproducing solar abundances with our fiducial choice of the oxygen yield $y\mathrm{_{O}^{CC}}=0.0071$ implies $y\mathrm{_{He}^{CC}}+y\mathrm{_{He}^{AGB}} \approx 0.022$, i.e., $0.022M_\odot$ of net $^4$He production per solar mass of star formation. Our GCE models with this yield normalization are consistent with most available observations, though the implied $y\mathrm{_{He}^{CC}}$ is low compared to most of the published massive star models. More precise measurements of $ΔY$ in stars and gas across a wide range of metallicity and [$α$/Fe] ratio could test our models more stringently, either confirming the simple picture suggested by our calculations or revealing surprises in the evolution of the second most abundant element.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Probing the 3D Awareness of Visual Foundation Models
Authors:
Mohamed El Banani,
Amit Raj,
Kevis-Kokitsi Maninis,
Abhishek Kar,
Yuanzhen Li,
Michael Rubinstein,
Deqing Sun,
Leonidas Guibas,
Justin Johnson,
Varun Jampani
Abstract:
Recent advances in large-scale pretraining have yielded visual foundation models with strong capabilities. Not only can recent models generalize to arbitrary images for their training task, their intermediate representations are useful for other visual tasks such as detection and segmentation. Given that such models can classify, delineate, and localize objects in 2D, we ask whether they also repr…
▽ More
Recent advances in large-scale pretraining have yielded visual foundation models with strong capabilities. Not only can recent models generalize to arbitrary images for their training task, their intermediate representations are useful for other visual tasks such as detection and segmentation. Given that such models can classify, delineate, and localize objects in 2D, we ask whether they also represent their 3D structure? In this work, we analyze the 3D awareness of visual foundation models. We posit that 3D awareness implies that representations (1) encode the 3D structure of the scene and (2) consistently represent the surface across views. We conduct a series of experiments using task-specific probes and zero-shot inference procedures on frozen features. Our experiments reveal several limitations of the current models. Our code and analysis can be found at https://github.com/mbanani/probe3d.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Galactic Chemical Evolution Models Favor an Extended Type Ia Supernova Delay-Time Distribution
Authors:
Liam O. Dubay,
Jennifer A. Johnson,
James W. Johnson
Abstract:
Type Ia supernovae (SNe Ia) produce most of the Fe-peak elements in the Universe and therefore are a crucial ingredient in galactic chemical evolution models. SNe Ia do not explode immediately after star formation, and the delay-time distribution (DTD) has not been definitively determined by supernova surveys or theoretical models. Because the DTD also affects the relationship among age, [Fe/H], a…
▽ More
Type Ia supernovae (SNe Ia) produce most of the Fe-peak elements in the Universe and therefore are a crucial ingredient in galactic chemical evolution models. SNe Ia do not explode immediately after star formation, and the delay-time distribution (DTD) has not been definitively determined by supernova surveys or theoretical models. Because the DTD also affects the relationship among age, [Fe/H], and [$α$/Fe] in chemical evolution models, comparison with observations of stars in the Milky Way is an important consistency check for any proposed DTD. We implement several popular forms of the DTD in combination with multiple star formation histories for the Milky Way in multi-zone chemical evolution models which include radial stellar migration. We compare our predicted interstellar medium abundance tracks, stellar abundance distributions, and stellar age distributions to the final data release of the Apache Point Observatory Galactic Evolution Experiment (APOGEE). We find that the DTD has the largest effect on the [$α$/Fe] distribution: a DTD with more prompt SNe Ia produces a stellar abundance distribution that is skewed toward a lower [$α$/Fe] ratio. While the DTD alone cannot explain the observed bimodality in the [$α$/Fe] distribution, in combination with an appropriate star formation history it affects the goodness of fit between the predicted and observed high-$α$ sequence. Our model results favor an extended DTD with fewer prompt SNe Ia than the fiducial $t^{-1}$ power law.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
View Selection for 3D Captioning via Diffusion Ranking
Authors:
Tiange Luo,
Justin Johnson,
Honglak Lee
Abstract:
Scalable annotation approaches are crucial for constructing extensive 3D-text datasets, facilitating a broader range of applications. However, existing methods sometimes lead to the generation of hallucinated captions, compromising caption quality. This paper explores the issue of hallucination in 3D object captioning, with a focus on Cap3D method, which renders 3D objects into 2D views for captio…
▽ More
Scalable annotation approaches are crucial for constructing extensive 3D-text datasets, facilitating a broader range of applications. However, existing methods sometimes lead to the generation of hallucinated captions, compromising caption quality. This paper explores the issue of hallucination in 3D object captioning, with a focus on Cap3D method, which renders 3D objects into 2D views for captioning using pre-trained models. We pinpoint a major challenge: certain rendered views of 3D objects are atypical, deviating from the training data of standard image captioning models and causing hallucinations. To tackle this, we present DiffuRank, a method that leverages a pre-trained text-to-3D model to assess the alignment between 3D objects and their 2D rendered views, where the view with high alignment closely represent the object's characteristics. By ranking all rendered views and feeding the top-ranked ones into GPT4-Vision, we enhance the accuracy and detail of captions, enabling the correction of 200k captions in the Cap3D dataset and extending it to 1 million captions across Objaverse and Objaverse-XL datasets. Additionally, we showcase the adaptability of DiffuRank by applying it to pre-trained text-to-image models for a Visual Question Answering task, where it outperforms the CLIP model.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
PointInfinity: Resolution-Invariant Point Diffusion Models
Authors:
Zixuan Huang,
Justin Johnson,
Shoubhik Debnath,
James M. Rehg,
Chao-Yuan Wu
Abstract:
We present PointInfinity, an efficient family of point cloud diffusion models. Our core idea is to use a transformer-based architecture with a fixed-size, resolution-invariant latent representation. This enables efficient training with low-resolution point clouds, while allowing high-resolution point clouds to be generated during inference. More importantly, we show that scaling the test-time reso…
▽ More
We present PointInfinity, an efficient family of point cloud diffusion models. Our core idea is to use a transformer-based architecture with a fixed-size, resolution-invariant latent representation. This enables efficient training with low-resolution point clouds, while allowing high-resolution point clouds to be generated during inference. More importantly, we show that scaling the test-time resolution beyond the training resolution improves the fidelity of generated point clouds and surfaces. We analyze this phenomenon and draw a link to classifier-free guidance commonly used in diffusion models, demonstrating that both allow trading off fidelity and variability during inference. Experiments on CO3D show that PointInfinity can efficiently generate high-resolution point clouds (up to 131k points, 31 times more than Point-E) with state-of-the-art quality.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
Benchmarking Object Detectors with COCO: A New Path Forward
Authors:
Shweta Singh,
Aayan Yadav,
Jitesh Jain,
Humphrey Shi,
Justin Johnson,
Karan Desai
Abstract:
The Common Objects in Context (COCO) dataset has been instrumental in benchmarking object detectors over the past decade. Like every dataset, COCO contains subtle errors and imperfections stemming from its annotation procedure. With the advent of high-performing models, we ask whether these errors of COCO are hindering its utility in reliably benchmarking further progress. In search for an answer,…
▽ More
The Common Objects in Context (COCO) dataset has been instrumental in benchmarking object detectors over the past decade. Like every dataset, COCO contains subtle errors and imperfections stemming from its annotation procedure. With the advent of high-performing models, we ask whether these errors of COCO are hindering its utility in reliably benchmarking further progress. In search for an answer, we inspect thousands of masks from COCO (2017 version) and uncover different types of errors such as imprecise mask boundaries, non-exhaustively annotated instances, and mislabeled masks. Due to the prevalence of COCO, we choose to correct these errors to maintain continuity with prior research. We develop COCO-ReM (Refined Masks), a cleaner set of annotations with visibly better mask quality than COCO-2017. We evaluate fifty object detectors and find that models that predict visually sharper masks score higher on COCO-ReM, affirming that they were being incorrectly penalized due to errors in COCO-2017. Moreover, our models trained using COCO-ReM converge faster and score higher than their larger variants trained using COCO-2017, highlighting the importance of data quality in improving object detectors. With these findings, we advocate using COCO-ReM for future object detection research. Our dataset is available at https://cocorem.xyz
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Divide, Conquer, Combine Bayesian Decision Tree Sampling
Authors:
Jodie A. Cochrane,
Adrian Wills,
Sarah J. Johnson
Abstract:
Decision trees are commonly used predictive models due to their flexibility and interpretability. This paper is directed at quantifying the uncertainty of decision tree predictions by employing a Bayesian inference approach. This is challenging because these approaches need to explore both the tree structure space and the space of decision parameters associated with each tree structure. This has b…
▽ More
Decision trees are commonly used predictive models due to their flexibility and interpretability. This paper is directed at quantifying the uncertainty of decision tree predictions by employing a Bayesian inference approach. This is challenging because these approaches need to explore both the tree structure space and the space of decision parameters associated with each tree structure. This has been handled by using Markov Chain Monte Carlo (MCMC) methods, where a Markov Chain is constructed to provide samples from the desired Bayesian estimate. Importantly, the structure and the decision parameters are tightly coupled; small changes in the tree structure can demand vastly different decision parameters to provide accurate predictions. A challenge for existing MCMC approaches is proposing joint changes in both the tree structure and the decision parameters that result in efficient sampling. This paper takes a different approach, where each distinct tree structure is associated with a unique set of decision parameters. The proposed approach, entitled DCC-Tree, is inspired by the work in Zhou et al. [23] for probabilistic programs and Cochrane et al. [4] for Hamiltonian Monte Carlo (HMC) based sampling for decision trees. Results show that DCC-Tree performs comparably to other HMC-based methods and better than existing Bayesian tree methods while improving on consistency and reducing the per-proposal complexity.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
The APO-K2 Catalog. II. Accurate Stellar Ages for Red Giant Branch Stars across the Milky Way
Authors:
Jack T. Warfield,
Joel C. Zinn,
Jessica Schonhut-Stasik,
James W. Johnson,
Marc H. Pinsonneault,
Jennifer A. Johnson,
Dennis Stello,
Rachael L. Beaton,
Yvonne Elsworth,
Rafael A. García,
Savita Mathur,
Benoît Mosser,
Aldo Serenelli,
Jamie Tayar
Abstract:
We present stellar age determinations for 4661 red giant branch stars in the APO-K2 catalog, derived using mass estimates from K2 asteroseismology from the K2 Galactic Archaeology Program and elemental abundances from the Apache Point Galactic Evolution Experiment survey. Our sample includes 17 of the 19 fields observed by K2, making it one of the most comprehensive catalogs of accurate stellar ag…
▽ More
We present stellar age determinations for 4661 red giant branch stars in the APO-K2 catalog, derived using mass estimates from K2 asteroseismology from the K2 Galactic Archaeology Program and elemental abundances from the Apache Point Galactic Evolution Experiment survey. Our sample includes 17 of the 19 fields observed by K2, making it one of the most comprehensive catalogs of accurate stellar ages across the Galaxy in terms of the wide range of populations spanned by its stars, enabling rigorous tests of Galactic chemical evolution models. Taking into account the selection functions of the K2 sample, the data appear to support the age-chemistry morphology of stellar populations predicted by both inside-out and late-burst scenarios. We also investigate trends in age versus stellar chemistry and Galactic position, which are consistent with previous findings. Comparisons against APOKASC-3 asteroseismic ages show agreement to within ~3%. We also discuss offsets between our ages and spectroscopic ages. Finally, we note that ignoring the effects of $α$-enhancement on stellar opacity (either directly or with the Salaris metallicity correction) results in an ~10% offset in age estimates for the most $α$-enhanced stars, which is an important consideration for continued tests of Galactic models with this and other asteroseismic age samples.
△ Less
Submitted 15 April, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
A Gap in the Densities of Small Planets Orbiting M Dwarfs: Rigorous Statistical Confirmation Using the Open-source Code RhoPop
Authors:
J. G. Schulze,
Ji Wang,
J. A. Johnson,
B. S. Gaudi,
R. Rodriguez Martinez,
C. T. Unterborn,
W. R. Panero
Abstract:
Using mass-radius-composition models, small planets ($\mathrm{R}\lesssim 2 \mathrm{R_\oplus}$) are typically classified into three types: iron-rich, nominally Earth-like, and those with solid/liquid water and/or atmosphere. These classes are generally expected to be variations within a compositional continuum. Recently, however, Luque & Pallé observed that potentially Earth-like planets around M d…
▽ More
Using mass-radius-composition models, small planets ($\mathrm{R}\lesssim 2 \mathrm{R_\oplus}$) are typically classified into three types: iron-rich, nominally Earth-like, and those with solid/liquid water and/or atmosphere. These classes are generally expected to be variations within a compositional continuum. Recently, however, Luque & Pallé observed that potentially Earth-like planets around M dwarfs are separated from a lower-density population by a density gap. Meanwhile, the results of Adibekyan et al. hint that iron-rich planets around FGK stars are also a distinct population. It therefore remains unclear whether small planets represent a continuum or multiple distinct populations. Differentiating the nature of these populations will help constrain potential formation mechanisms. We present the RhoPop software for identifying small-planet populations. RhoPop employs mixture models in a hierarchical framework and a nested sampler for parameter and evidence estimates. Using RhoPop, we confirm the two populations of Luque & Pallé with $>4σ$ significance. The intrinsic scatter in the Earth-like subpopulation is roughly half that expected based on stellar abundance variations in local FGK stars, perhaps implying M dwarfs have a smaller spread in the major rock-building elements (Fe, Mg, Si) than FGK stars. We apply RhoPop to the Adibekyan et al. sample and find no evidence of more than one population. We estimate the sample size required to resolve a population of planets with Mercury-like compositions from those with Earth-like compositions for various mass-radius precisions. Only 16 planets are needed when $σ_{M_p} = 5\%$ and $σ_{R_p} = 1\%$. At $σ_{M_p} = 10\%$ and $σ_{R_p} = 2.5\%$, however, over 154 planets are needed, an order of magnitude increase.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Fusing Climate Data Products using a Spatially Varying Autoencoder
Authors:
Jacob A. Johnson,
Matthew J. Heaton,
William F. Christensen,
Lynsie R. Warr,
Summer B. Rupper
Abstract:
Autoencoders are powerful machine learning models used to compress information from multiple data sources. However, autoencoders, like all artificial neural networks, are often unidentifiable and uninterpretable. This research focuses on creating an identifiable and interpretable autoencoder that can be used to meld and combine climate data products. The proposed autoencoder utilizes a Bayesian st…
▽ More
Autoencoders are powerful machine learning models used to compress information from multiple data sources. However, autoencoders, like all artificial neural networks, are often unidentifiable and uninterpretable. This research focuses on creating an identifiable and interpretable autoencoder that can be used to meld and combine climate data products. The proposed autoencoder utilizes a Bayesian statistical framework, allowing for probabilistic interpretations while also varying spatially to capture useful spatial patterns across the various data products. Constraints are placed on the autoencoder as it learns patterns in the data, creating an interpretable consensus that includes the important features from each input. We demonstrate the utility of the autoencoder by combining information from multiple precipitation products in High Mountain Asia.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Nature vs. Nurture: Distinguishing Effects from Stellar Processing and Chemical Evolution on Carbon and Nitrogen in Red Giant Stars
Authors:
John D. Roberts,
Marc H. Pinsonneault,
Jennifer A. Johnson,
Joel C. Zinn,
David H. Weinberg,
Mathieu Vrard,
Jamie Tayar,
Dennis Stello,
Benoît Mosser,
James W. Johnson,
Kaili Cao,
Keivan G. Stassun,
Guy S. Stringfellow,
Aldo Serenelli,
Savita Mathur,
Saskia Hekker,
Rafael A. García,
Yvonne P. Elsworth,
Enrico Corsaro
Abstract:
The surface [C/N] ratios of evolved giants are strongly affected by the first dredge-up (FDU) of nuclear-processed material from stellar cores. C and N also have distinct nucleosynthetic origins and serve as diagnostics of mixing and mass loss. We use subgiants to find strong trends in the birth [C/N] with [Fe/H], which differ between the low-$α$ and high-$α$ populations. We demonstrate that these…
▽ More
The surface [C/N] ratios of evolved giants are strongly affected by the first dredge-up (FDU) of nuclear-processed material from stellar cores. C and N also have distinct nucleosynthetic origins and serve as diagnostics of mixing and mass loss. We use subgiants to find strong trends in the birth [C/N] with [Fe/H], which differ between the low-$α$ and high-$α$ populations. We demonstrate that these birth trends have a strong impact on the surface abundances after the FDU. This effect is neglected in current stellar models, which use solar-scaled C and N. We map out the FDU as a function of evolutionary state, mass, and composition using a large and precisely measured asteroseismic dataset in first-ascent red giant branch (RGB) and core He-burning, or red clump (RC), stars. We describe the domains where [C/N] is a useful mass diagnostic and find that the RC complements the RGB and extends the range of validity to higher mass. We find evidence for extra mixing on the RGB below [Fe/H]= -0.4, matching literature results, for high-$α$ giants, but there is no clear evidence of mixing in the low-$α$ giants. The predicted signal of mass loss is weak and difficult to detect in our sample. We discuss implications for stellar physics and stellar population applications.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
FAR: Flexible, Accurate and Robust 6DoF Relative Camera Pose Estimation
Authors:
Chris Rockwell,
Nilesh Kulkarni,
Linyi **,
Jeong Joon Park,
Justin Johnson,
David F. Fouhey
Abstract:
Estimating relative camera poses between images has been a central problem in computer vision. Methods that find correspondences and solve for the fundamental matrix offer high precision in most cases. Conversely, methods predicting pose directly using neural networks are more robust to limited overlap and can infer absolute translation scale, but at the expense of reduced precision. We show how t…
▽ More
Estimating relative camera poses between images has been a central problem in computer vision. Methods that find correspondences and solve for the fundamental matrix offer high precision in most cases. Conversely, methods predicting pose directly using neural networks are more robust to limited overlap and can infer absolute translation scale, but at the expense of reduced precision. We show how to combine the best of both methods; our approach yields results that are both precise and robust, while also accurately inferring translation scales. At the heart of our model lies a Transformer that (1) learns to balance between solved and learned pose estimations, and (2) provides a prior to guide a solver. A comprehensive analysis supports our design choices and demonstrates that our method adapts flexibly to various feature extractors and correspondence estimators, showing state-of-the-art performance in 6DoF pose estimation on Matterport3D, InteriorNet, StreetLearn, and Map-free Relocalization.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
A Dynamic Model of Integration
Authors:
Joseph D. Johnson,
Marisa C. Eisenberg
Abstract:
Thomas Schelling introduced his agent-based model of segregation in 1971 and concluded that even when there is a low amount of intolerance within society that segregation will develop if people follow their individual preferences. A large body of literature building of this framework has been built and has bolstered this claim. This paper aims to take the same framework but instead look for ways t…
▽ More
Thomas Schelling introduced his agent-based model of segregation in 1971 and concluded that even when there is a low amount of intolerance within society that segregation will develop if people follow their individual preferences. A large body of literature building of this framework has been built and has bolstered this claim. This paper aims to take the same framework but instead look for ways to get to an integrated state. We focus on Allport's contact hypothesis that states that if there is equal status among groups, common goals among groups, and an institutional mechanism supporting intergroup contact then intergroup contact can reduce prejudice. We incorporate the contact hypothesis by having individuals adjust their intolerance based on their current neighborhood composition and the ease of conforming to their surroundings. Furthermore, we add in positive and negative media effects, as individuals are likely to get information about an outgroup from the media (e.g., news, TV, movies, etc.) that they consume. We find that having a society composed of individuals who do not easily conform to their surroundings and displaying positive examples of both groups in media promote integration within society.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
A perspective on the Milky Way Bulge-Bar as seen from the neutron-capture elements Cerium and Neodymium with APOGEE
Authors:
J. V. Sales-Silva,
K. Cunha,
V. V. Smith,
S. Daflon,
D. Souto,
R. Guerço,
A. Queiroz,
C. Chiappini,
C. R. Hayes,
T. Masseron,
Sten Hasselquist,
D. Horta,
N. Prantzos,
M. Zoccali,
C. Allende Prieto,
B. Barbuy,
R. Beaton,
D. Bizyaev,
J. G. Fernández-Trincado,
P. M. Frinchaboy,
J. A. Holtzman,
J. A. Johnson,
Henrik Jönsson,
S. R. Majewski,
D. Minniti
, et al. (6 additional authors not shown)
Abstract:
This study probes the chemical abundances of the neutron-capture elements cerium and neodymium in the inner Milky Way from an analysis of a sample of $\sim$2000 stars in the Galactic Bulge/bar spatially contained within $|X_{Gal}|<$5 kpc, $|Y_{Gal}|<$3.5 kpc, and $|Z_{Gal}|<$1 kpc, and spanning metallicities between $-$2.0$\lesssim$[Fe/H]$\lesssim$+0.5. We classify the sample stars into low- or hi…
▽ More
This study probes the chemical abundances of the neutron-capture elements cerium and neodymium in the inner Milky Way from an analysis of a sample of $\sim$2000 stars in the Galactic Bulge/bar spatially contained within $|X_{Gal}|<$5 kpc, $|Y_{Gal}|<$3.5 kpc, and $|Z_{Gal}|<$1 kpc, and spanning metallicities between $-$2.0$\lesssim$[Fe/H]$\lesssim$+0.5. We classify the sample stars into low- or high-[Mg/Fe] populations and find that, in general, values of [Ce/Fe] and [Nd/Fe] increase as the metallicity decreases for the low- and high-[Mg/Fe] populations. Ce abundances show a more complex variation across the metallicity range of our Bulge-bar sample when compared to Nd, with the r-process dominating the production of neutron-capture elements in the high-[Mg/Fe] population ([Ce/Nd]$<$0.0). We find a spatial chemical dependence of Ce and Nd abundances for our sample of Bulge-bar stars, with low- and high-[Mg/Fe] populations displaying a distinct abundance distribution. In the region close to the center of the MW, the low-[Mg/Fe] population is dominated by stars with low [Ce/Fe], [Ce/Mg], [Nd/Mg], [Nd/Fe], and [Ce/Nd] ratios. The low [Ce/Nd] ratio indicates a significant contribution in this central region from r-process yields for the low-[Mg/Fe] population. The chemical pattern of the most metal-poor stars in our sample suggests an early chemical enrichment of the Bulge dominated by yields from core-collapse supernovae and r-process astrophysical sites, such as magneto-rotational supernovae.
△ Less
Submitted 19 April, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
New directions in algebraic statistics: Three challenges from 2023
Authors:
Yulia Alexandr,
Miles Bakenhus,
Mark Curiel,
Sameer K. Deshpande,
Elizabeth Gross,
Yuqi Gu,
Max Hill,
Joseph Johnson,
Bryson Kagy,
Vishesh Karwa,
Jiayi Li,
Hanbaek Lyu,
Sonja Petrović,
Jose Israel Rodriguez
Abstract:
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences.
Naturally…
▽ More
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences.
Naturally, new connections continue to be made with other areas of mathematics and statistics. This paper outlines three such connections: to statistical models used in educational testing, to a classification problem for a family of nonparametric regression models, and to phase transition phenomena under uniform sampling of contingency tables. We illustrate the motivating problems, each of which is for algebraic statistics a new direction, and demonstrate an enhancement of related methodologies.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
A 350-MHz Green Bank Telescope Survey of Unassociated Fermi LAT Sources: Discovery and Timing of Ten Millisecond Pulsars
Authors:
P. Bangale,
B. Bhattacharyya,
F. Camilo,
C. J. Clark,
I. Cognard,
M. E. DeCesar,
E. C. Ferrara,
P. Gentile,
L. Guillemot,
J. W. T. Hessels,
T. J. Johnson,
M. Kerr,
M. A. McLaughlin,
L. Nieder,
S. M. Ransom,
P. S. Ray,
M. S. E. Roberts,
J. Roy,
S. Sanpa-Arsa,
G. Theureau,
M. T. Wolff
Abstract:
We have searched for radio pulsations towards 49 Fermi Large Area Telescope (LAT) 1FGL Catalog $γ$-ray sources using the Green Bank Telescope at 350 MHz. We detected 18 millisecond pulsars (MSPs) in blind searches of the data; 10 of these were discoveries unique to our survey. Sixteen are binaries, with eight having short orbital periods $P_B < 1$ day. No radio pulsations from young pulsars were d…
▽ More
We have searched for radio pulsations towards 49 Fermi Large Area Telescope (LAT) 1FGL Catalog $γ$-ray sources using the Green Bank Telescope at 350 MHz. We detected 18 millisecond pulsars (MSPs) in blind searches of the data; 10 of these were discoveries unique to our survey. Sixteen are binaries, with eight having short orbital periods $P_B < 1$ day. No radio pulsations from young pulsars were detected, although three targets are coincident with apparently radio-quiet $γ$-ray pulsars discovered in LAT data. Here, we give an overview of the survey and present radio and $γ$-ray timing results for the 10 MSPs discovered. These include the only isolated MSP discovered in our survey and six short-$P_B$ binary MSPs. Of these, three have very low-mass companions ($M_c$ $\ll$ 0.1M$_{\odot}$) and hence belong to the class of black widow pulsars. Two have more massive, non-degenerate companions with extensive radio eclipses and orbitally modulated X-ray emission consistent with the redback class. Significant $γ$-ray pulsations have been detected from nine of the discoveries. This survey and similar efforts suggest that the majority of Galactic $γ$-ray sources at high Galactic latitudes are either MSPs or relatively nearby non-recycled pulsars, with the latter having on average a much smaller radio/$γ$-ray beaming ratio as compared to MSPs. It also confirms that past surveys suffered from an observational bias against finding short-$P_B$ MSP systems.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
New constraints on ultraheavy dark matter from the LZ experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer,
C. A. J. Brew
, et al. (174 additional authors not shown)
Abstract:
Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal f…
▽ More
Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal freeze-out, but they have generally been less explored experimentally. In this work, we present a re-analysis of the first science run (SR1) of the LZ experiment, with an exposure of $0.9$ tonne$\times$year, to search for ultraheavy particle dark matter. The signal topology consists of multiple energy deposits in the active region of the detector forming a straight line, from which the velocity of the incoming particle can be reconstructed on an event-by-event basis. Zero events with this topology were observed after applying the data selection calibrated on a simulated sample of signal-like events. New experimental constraints are derived, which rule out previously unexplored regions of the dark matter parameter space of spin-independent interactions beyond a mass of 10$^{17}$ GeV/$c^2$.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
PhosNetVis: a web-based tool for kinase enrichment analysis and interactive 2D/3D network visualizations of phosphoproteomics data
Authors:
Osho Rawal,
Berk Turhan,
Irene Font Peradejordi,
Shreya Chandrasekar,
Selim Kalayci,
Jeffrey Johnson,
Mehdi Bouhaddou,
Zeynep H. Gümüş
Abstract:
Protein phosphorylation is a vital process in cellular signaling that involves the reversible modification of a protein (substrate) residue by another protein (kinase). Advances in liquid chromatography-mass spectrometry have enabled the rapid generation of massive protein phosphorylation datasets across multiple conditions by many research groups. Researchers are then tasked with inferring kinase…
▽ More
Protein phosphorylation is a vital process in cellular signaling that involves the reversible modification of a protein (substrate) residue by another protein (kinase). Advances in liquid chromatography-mass spectrometry have enabled the rapid generation of massive protein phosphorylation datasets across multiple conditions by many research groups. Researchers are then tasked with inferring kinases responsible for changes in phosphorylation sites of each substrate. Despite the recent explosion of tools to infer kinase-substrate interactions (KSIs) from such datasets, these are not optimized for the interactive exploration of the resulting large and complex KSI networks together with significant phosphorylation sites and states. There are also no dedicated tools that streamline kinase inferences together with interactive visualizations of the resulting networks. There is thus an unmet need for a tool that facilitates uster-intuitive analysis, interactive exploration, visualization, and communication of datasets from phosphoproteomics experiments. Here, we present PhosNetVis, a freely available web-based tool for researchers of all computational skill levels to easily infer, generate and interactively explore KSI networks in 2D or 3D by streamlining multiple phosphoproteomics data analysis steps within one single tool. PhostNetVis significantly lowers the barriers for researchers in rapidly generating high-quality visualizations to translate their rich phosphoproteomics datasets into biological and clinical insights.
△ Less
Submitted 8 February, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
Bluesky and the AT Protocol: Usable Decentralized Social Media
Authors:
Martin Kleppmann,
Paul Frazee,
Jake Gold,
Jay Graber,
Daniel Holmgren,
Devin Ivy,
Jeromy Johnson,
Bryan Newbold,
Jaz Volpert
Abstract:
Bluesky is a new social network built upon the AT Protocol, a decentralized foundation for public social media. It was launched in private beta in February 2023, and has grown to over 3 million registered users in the following year. In this paper we introduce the architecture of Bluesky and the AT Protocol, which is inspired by the web itself, but modernized to include streams of real-time update…
▽ More
Bluesky is a new social network built upon the AT Protocol, a decentralized foundation for public social media. It was launched in private beta in February 2023, and has grown to over 3 million registered users in the following year. In this paper we introduce the architecture of Bluesky and the AT Protocol, which is inspired by the web itself, but modernized to include streams of real-time updates and cryptographic authentication. We explain how the technical design of Bluesky is informed by our goals: to enable decentralization by having multiple interoperable providers for every part of the system; to make it easy for users to switch providers; to give users agency over the content they see; and to provide a simple user experience that does not burden users with complexity arising from the system's decentralized nature. The system's openness allows anybody to contribute to content moderation and community management, and we invite the research community to use Bluesky as a dataset and testing ground for new approaches in social media moderation.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
SPDE priors for uncertainty quantification of end-to-end neural data assimilation schemes
Authors:
Maxime Beauchamp,
Nicolas Desassis,
J. Emmanuel Johnson,
Simon Benaichouche,
Pierre Tandeo,
Ronan Fablet
Abstract:
The spatio-temporal interpolation of large geophysical datasets has historically been adressed by Optimal Interpolation (OI) and more sophisticated model-based or data-driven DA techniques. In the last ten years, the link established between Stochastic Partial Differential Equations (SPDE) and Gaussian Markov Random Fields (GMRF) opened a new way of handling both large datasets and physically-indu…
▽ More
The spatio-temporal interpolation of large geophysical datasets has historically been adressed by Optimal Interpolation (OI) and more sophisticated model-based or data-driven DA techniques. In the last ten years, the link established between Stochastic Partial Differential Equations (SPDE) and Gaussian Markov Random Fields (GMRF) opened a new way of handling both large datasets and physically-induced covariance matrix in Optimal Interpolation. Recent advances in the deep learning community also enables to adress this problem as neural architecture embedding data assimilation variational framework. The reconstruction task is seen as a joint learning problem of the prior involved in the variational inner cost and the gradient-based minimization of the latter: both prior models and solvers are stated as neural networks with automatic differentiation which can be trained by minimizing a loss function, typically stated as the mean squared error between some ground truth and the reconstruction. In this work, we draw from the SPDE-based Gaussian Processes to estimate complex prior models able to handle non-stationary covariances in both space and time and provide a stochastic framework for interpretability and uncertainty quantification. Our neural variational scheme is modified to embed an augmented state formulation with both state and SPDE parametrization to estimate. Instead of a neural prior, we use a stochastic PDE as surrogate model along the data assimilation window. The training involves a loss function for both reconstruction task and SPDE prior model, where the likelihood of the SPDE parameters given the true states is involved in the training. Because the prior is stochastic, we can easily draw samples in the prior distribution before conditioning to provide a flexible way to estimate the posterior distribution based on thousands of members.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Continuous-time Trajectory Estimation: A Comparative Study Between Gaussian Process and Spline-based Approaches
Authors:
Jacob Johnson,
Joshua Mangelson,
Timothy Barfoot,
Randal Beard
Abstract:
Continuous-time trajectory estimation is an attractive alternative to discrete-time batch estimation due to the ability to incorporate high-frequency measurements from asynchronous sensors while kee** the number of optimization parameters bounded. Two types of continuous-time estimation have become prevalent in the literature: Gaussian process regression and spline-based estimation. In this pape…
▽ More
Continuous-time trajectory estimation is an attractive alternative to discrete-time batch estimation due to the ability to incorporate high-frequency measurements from asynchronous sensors while kee** the number of optimization parameters bounded. Two types of continuous-time estimation have become prevalent in the literature: Gaussian process regression and spline-based estimation. In this paper, we present a direct comparison between these two methods. We first compare them using a simple linear system, and then compare them in a camera and IMU sensor fusion scenario on SE(3) in both simulation and hardware. Our results show that if the same measurements and motion model are used, the two methods achieve similar trajectory accuracy. In addition, if the spline order is chosen so that the degree-of-differentiability of the two trajectory representations match, then they achieve similar solve times as well.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
The Faiss library
Authors:
Matthijs Douze,
Alexandr Guzhva,
Chengqi Deng,
Jeff Johnson,
Gergely Szilvasy,
Pierre-Emmanuel Mazaré,
Maria Lomeli,
Lucas Hosseini,
Hervé Jégou
Abstract:
Vector databases manage large collections of embedding vectors. As AI applications are growing rapidly, so are the number of embeddings that need to be stored and indexed. The Faiss library is dedicated to vector similarity search, a core functionality of vector databases. Faiss is a toolkit of indexing methods and related primitives used to search, cluster, compress and transform vectors. This pa…
▽ More
Vector databases manage large collections of embedding vectors. As AI applications are growing rapidly, so are the number of embeddings that need to be stored and indexed. The Faiss library is dedicated to vector similarity search, a core functionality of vector databases. Faiss is a toolkit of indexing methods and related primitives used to search, cluster, compress and transform vectors. This paper first describes the tradeoff space of vector search, then the design principles of Faiss in terms of structure, approach to optimization and interfacing. We benchmark key features of the library and discuss a few selected applications to highlight its broad applicability.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Characterizing the Gamma-ray Emission Properties of the Globular Cluster M5 with the Fermi-LAT
Authors:
X. Hou,
W. Zhang,
P. C. C. Freire,
D. F. Torres,
J. Ballet,
D. A. Smith,
T. J. Johnson,
M. Kerr,
C. C. Cheung,
L. Guillemot,
J. Li,
L. Zhang,
A. Ridolfi,
P. Wang,
D. Li,
J. Yuan,
N. Wang
Abstract:
We analyzed the globular cluster M5 (NGC 5904) using 15 years of gamma-ray data from the Fermi Large Area Telescope (LAT). Using rotation ephemerides generated from Arecibo and FAST radio telescope observations, we searched for gamma-ray pulsations from the seven millisecond pulsars (MSPs) identified in M5. We detected no significant pulsations from any of the individual pulsars. Also, we searched…
▽ More
We analyzed the globular cluster M5 (NGC 5904) using 15 years of gamma-ray data from the Fermi Large Area Telescope (LAT). Using rotation ephemerides generated from Arecibo and FAST radio telescope observations, we searched for gamma-ray pulsations from the seven millisecond pulsars (MSPs) identified in M5. We detected no significant pulsations from any of the individual pulsars. Also, we searched for possible variations of the gamma-ray emission as a function of orbital phase for all the six MSPs in binary systems, but did not detect any significant modulations. The gamma-ray emission from the direction of M5 is well described by an exponentially cutoff power-law spectral model, although other models cannot be excluded. The phase-averaged emission is consistent with being steady on a time scale of a few months. We estimate the number of MSPs in M5 to be between 1 and 10, using the gamma-ray conversion efficiencies for well-characterized gamma-ray MSPs in the Third Fermi Large Area Telescope Catalog of Gamma-ray Pulsars, suggesting that the sample of known MSPs in M5 is (nearly) complete, even if it is not currently possible to rule out a diffuse component of the observed gamma rays from the cluster.
△ Less
Submitted 23 March, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Spectacular nucleosynthesis from early massive stars
Authors:
Alexander P. Ji,
Sanjana Curtis,
Nicholas Storm,
Vedant Chandra,
Kevin C. Schlaufman,
Keivan G. Stassun,
Alexander Heger,
Marco Pignatari,
Adrian M. Price-Whelan,
Maria Bergemann,
Guy S. Stringfellow,
Carla Frohlich,
Henrique Reggiani,
Erika M. Holmbeck,
Jamie Tayar,
Shivani P. Shah,
Emily J. Griffith,
Chervin F. P. Laporte,
Andrew R. Casey,
Keith Hawkins,
Danny Horta,
William Cerny,
Pierre Thibodeaux,
Sam A. Usman,
Joao A. S. Amarante
, et al. (17 additional authors not shown)
Abstract:
Stars formed with initial mass over 50 Msun are very rare today, but they are thought to be more common in the early universe. The fates of those early, metal-poor, massive stars are highly uncertain. Most are expected to directly collapse to black holes, while some may explode as a result of rotationally powered engines or the pair-creation instability. We present the chemical abundances of J0931…
▽ More
Stars formed with initial mass over 50 Msun are very rare today, but they are thought to be more common in the early universe. The fates of those early, metal-poor, massive stars are highly uncertain. Most are expected to directly collapse to black holes, while some may explode as a result of rotationally powered engines or the pair-creation instability. We present the chemical abundances of J0931+0038, a nearby low-mass star identified in early followup of SDSS-V Milky Way Mapper, which preserves the signature of unusual nucleosynthesis from a massive star in the early universe. J0931+0038 has relatively high metallicity ([Fe/H] = -1.76 +/- 0.13) but an extreme odd-even abundance pattern, with some of the lowest known abundance ratios of [N/Fe], [Na/Fe], [K/Fe], [Sc/Fe], and [Ba/Fe]. The implication is that a majority of its metals originated in a single extremely metal-poor nucleosynthetic source. An extensive search through nucleosynthesis predictions finds a clear preference for progenitors with initial mass > 50 Msun, making J0931+0038 one of the first observational constraints on nucleosynthesis in this mass range. However the full abundance pattern is not matched by any models in the literature. J0931+0038 thus presents a challenge for the next generation of nucleosynthesis models and motivates study of high-mass progenitor stars impacted by convection, rotation, jets, and/or binary companions. Though rare, more examples of unusual early nucleosynthesis in metal-poor stars should be found in upcoming large spectroscopic surveys.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Authors:
Alicia Golden,
Samuel Hsia,
Fei Sun,
Bilge Acun,
Basil Hosmer,
Ye** Lee,
Zachary DeVito,
Jeff Johnson,
Gu-Yeon Wei,
David Brooks,
Carole-Jean Wu
Abstract:
As the development of large-scale Generative AI models evolve beyond text (1D) generation to include image (2D) and video (3D) generation, processing spatial and temporal information presents unique challenges to quality, performance, and efficiency. We present the first work towards understanding this new system design space for multi-modal text-to-image (TTI) and text-to-video (TTV) generation m…
▽ More
As the development of large-scale Generative AI models evolve beyond text (1D) generation to include image (2D) and video (3D) generation, processing spatial and temporal information presents unique challenges to quality, performance, and efficiency. We present the first work towards understanding this new system design space for multi-modal text-to-image (TTI) and text-to-video (TTV) generation models. Current model architecture designs are bifurcated into 2 categories: Diffusion- and Transformer-based models. Our systematic performance characterization on a suite of eight representative TTI/TTV models shows that after state-of-the-art optimization techniques such as Flash Attention are applied, Convolution accounts for up to 44% of execution time for Diffusion-based TTI models, while Linear layers consume up to 49% of execution time for Transformer-based models. We additionally observe that Diffusion-based TTI models resemble the Prefill stage of LLM inference, and benefit from 1.1-2.5x greater speedup from Flash Attention than Transformer-based TTI models that resemble the Decode phase. Since optimizations designed for LLMs do not map directly onto TTI/TTV models, we must conduct a thorough characterization of these workloads to gain insights for new optimization opportunities. In doing so, we define sequence length in the context of TTI/TTV models and observe sequence length can vary up to 4x in Diffusion model inference. We additionally observe temporal aspects of TTV workloads pose unique system bottlenecks, with Temporal Attention accounting for over 60% of total Attention time. Overall, our in-depth system performance characterization is a critical first step towards designing efficient and deployable systems for emerging TTI/TTV workloads.
△ Less
Submitted 5 May, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
An Estimate of the Impact of Reionization on Supermassive Black Hole Growth
Authors:
Phoebe R. Upton Sanderbeck,
Jarrett L. Johnson,
Madeline A. Marshall
Abstract:
The supermassive black holes (SMBHs) that power active galactic nuclei found at $z\geq 6$ were formed during the epoch of reionization. Because reionization is a spatially inhomogeneous process, where different regions of the Universe become reionized at different times, the physical properties of SMBH host galaxy environments will vary spatially during reionization. We construct a semi-analytic m…
▽ More
The supermassive black holes (SMBHs) that power active galactic nuclei found at $z\geq 6$ were formed during the epoch of reionization. Because reionization is a spatially inhomogeneous process, where different regions of the Universe become reionized at different times, the physical properties of SMBH host galaxy environments will vary spatially during reionization. We construct a semi-analytic model to estimate the impact of reionization on SMBH growth due to reduced gas accretion onto dark matter halos. Using a series of merger trees, reionization models, and black hole growth models, we find that early reionization can reduce an SMBH's mass by up to [50, 70, 90] % within dark matter halos of mass [$10^{12}$, $10^{11}$, $10^{10}$] M$_{\odot}$ by $z$ = 6. Our findings also suggest that the redshift range in which black hole growth is impacted by reionization is strongly dependent on whether the Eddington accretion rate can be exceeded. If so, we find that black hole masses are significantly suppressed principally during the early phases of reionization ($z$ $\gtrsim$ 10), while they are more readily suppressed across the full redshift range if super-Eddington growth is not allowed. We find that the global average impact of reionization may be to reduce the masses of black holes residing in $\lesssim$ 10$^{11}$ M$_{\odot}$ halos by a factor of $\gtrsim$ 2. The census of SMBHs that the James Webb Space Telescope is uncovering provides a promising means by which to test these predictions.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
Optimal Routes to Ultrafast Polarization Reversal in Ferroelectric LiNbO3
Authors:
R. Tanner Hardy,
Conrad Rosenbrock,
Gus L. W. Hart,
Jeremy A. Johnson
Abstract:
We use the frozen phonon method to calculate the anharmonic potential energy surface and to model the ultrafast ferroelectric polarization reversal in LiNbO3 driven by intense pulses of THz light. Before stable switching of the polarization occurs, there exists a region of excitation field-strengths where transient switching can occur, as observed experimentally [Physical Review Letters 118, 19760…
▽ More
We use the frozen phonon method to calculate the anharmonic potential energy surface and to model the ultrafast ferroelectric polarization reversal in LiNbO3 driven by intense pulses of THz light. Before stable switching of the polarization occurs, there exists a region of excitation field-strengths where transient switching can occur, as observed experimentally [Physical Review Letters 118, 197601 (2017)]. By varying the excitation frequency from 4 to 20 THz, our modeling suggests that more efficient, permanent polarization switching can occur by directly exciting the soft mode at 7 THz, compared to nonlinear phononic-induced switching driven by exciting a high frequency mode at 18 THz. We also show that neglecting anharmonic coupling pathways in the modeled experiment can lead to significant differences in the modeled switching field strengths.
△ Less
Submitted 15 December, 2023; v1 submitted 7 December, 2023;
originally announced December 2023.
-
First Constraints on WIMP-Nucleon Effective Field Theory Couplings in an Extended Energy Region From LUX-ZEPLIN
Authors:
LZ Collaboration,
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger
, et al. (175 additional authors not shown)
Abstract:
Following the first science results of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time projection chamber operating from the Sanford Underground Research Facility in Lead, South Dakota, USA, we report the initial limits on a model-independent non-relativistic effective field theory describing the complete set of possible interactions of a weakly interacting massive particle (WIMP) with a n…
▽ More
Following the first science results of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time projection chamber operating from the Sanford Underground Research Facility in Lead, South Dakota, USA, we report the initial limits on a model-independent non-relativistic effective field theory describing the complete set of possible interactions of a weakly interacting massive particle (WIMP) with a nucleon. These results utilize the same 5.5 t fiducial mass and 60 live days of exposure collected for the LZ spin-independent and spin-dependent analyses while extending the upper limit of the energy region of interest by a factor of 7.5 to 270 keVnr. No significant excess in this high energy region is observed. Using a profile-likelihood ratio analysis, we report 90% confidence level exclusion limits on the coupling of each individual non-relativistic WIMP-nucleon operator for both elastic and inelastic interactions in the isoscalar and isovector bases.
△ Less
Submitted 26 February, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
RJHMC-Tree for Exploration of the Bayesian Decision Tree Posterior
Authors:
Jodie A. Cochrane,
Adrian G. Wills,
Sarah J. Johnson
Abstract:
Decision trees have found widespread application within the machine learning community due to their flexibility and interpretability. This paper is directed towards learning decision trees from data using a Bayesian approach, which is challenging due to the potentially enormous parameter space required to span all tree models. Several approaches have been proposed to combat this challenge, with on…
▽ More
Decision trees have found widespread application within the machine learning community due to their flexibility and interpretability. This paper is directed towards learning decision trees from data using a Bayesian approach, which is challenging due to the potentially enormous parameter space required to span all tree models. Several approaches have been proposed to combat this challenge, with one of the more successful being Markov chain Monte Carlo (MCMC) methods. The efficacy and efficiency of MCMC methods fundamentally rely on the quality of the so-called proposals, which is the focus of this paper. In particular, this paper investigates using a Hamiltonian Monte Carlo (HMC) approach to explore the posterior of Bayesian decision trees more efficiently by exploiting the geometry of the likelihood within a global update scheme. Two implementations of the novel algorithm are developed and compared to existing methods by testing against standard datasets in the machine learning and Bayesian decision tree literature. HMC-based methods are shown to perform favourably with respect to predictive test accuracy, acceptance rate, and tree complexity.
△ Less
Submitted 3 December, 2023;
originally announced December 2023.
-
Interval and $\ell$-interval Rational Parking Functions
Authors:
Tomás Aguilar-Fraga,
Jennifer Elder,
Rebecca E. Garcia,
Kimberly P. Hadaway,
Pamela E. Harris,
Kimberly J. Harry,
Imhotep B. Hogan,
Jakeyl Johnson,
Jan Kretschmann,
Kobe Lawson-Chavanu,
J. Carlos Martínez Mori,
Casandra D. Monroe,
Daniel Quiñonez,
Dirk Tolson III,
Dwight Anderson Williams II
Abstract:
Interval parking functions are a generalization of parking functions in which cars have an interval preference for their parking. We generalize this definition to parking functions with $n$ cars and $m\geq n$ parking spots, which we call interval rational parking functions and provide a formula for their enumeration. By specifying an integer parameter $\ell\geq 0$, we then consider the subset of i…
▽ More
Interval parking functions are a generalization of parking functions in which cars have an interval preference for their parking. We generalize this definition to parking functions with $n$ cars and $m\geq n$ parking spots, which we call interval rational parking functions and provide a formula for their enumeration. By specifying an integer parameter $\ell\geq 0$, we then consider the subset of interval rational parking functions in which each car parks at most $\ell$ spots away from their initial preference. We call these $\ell$-interval rational parking functions and provide recursive formulas to enumerate this set for all positive integers $m\geq n$ and $\ell$. We also establish formulas for the number of nondecreasing $\ell$-interval rational parking functions via the outcome map on rational parking functions. We also consider the intersection between $\ell$-interval parking functions and Fubini rankings and show the enumeration of these sets is given by generalized Fibonacci numbers. We conclude by specializing $\ell=1$, and establish that the set of $1$-interval rational parking functions with $n$ cars and $m$ spots are in bijection with the set of barred preferential arrangements of $[n]$ with $m-n$ bars. This readily implies enumerative formulas. Further, in the case where $\ell=1$, we recover the results of Hadaway and Harris that unit interval parking functions are in bijection with the set of Fubini rankings, which are enumerated by the Fubini numbers.
△ Less
Submitted 24 May, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
Learning Realistic Joint Space Boundaries for Range of Motion Analysis of Healthy and Impaired Human Arms
Authors:
Shafagh Keyvanian,
Michelle J. Johnson,
Nadia Figueroa
Abstract:
A realistic human kinematic model that satisfies anatomical constraints is essential for human-robot interaction, biomechanics and robot-assisted rehabilitation. Modeling realistic joint constraints, however, is challenging as human arm motion is constrained by joint limits, inter- and intra-joint dependencies, self-collisions, individual capabilities and muscular or neurological constraints which…
▽ More
A realistic human kinematic model that satisfies anatomical constraints is essential for human-robot interaction, biomechanics and robot-assisted rehabilitation. Modeling realistic joint constraints, however, is challenging as human arm motion is constrained by joint limits, inter- and intra-joint dependencies, self-collisions, individual capabilities and muscular or neurological constraints which are difficult to represent. Hence, physicians and researchers have relied on simple box-constraints, ignoring important anatomical factors. In this paper, we propose a data-driven method to learn realistic anatomically constrained upper-limb range of motion (RoM) boundaries from motion capture data. This is achieved by fitting a one-class support vector machine to a dataset of upper-limb joint space exploration motions with an efficient hyper-parameter tuning scheme. Our approach outperforms similar works focused on valid RoM learning. Further, we propose an impairment index (II) metric that offers a quantitative assessment of capability/impairment when comparing healthy and impaired arms. We validate the metric on healthy subjects physically constrained to emulate hemiplegia and different disability levels as stroke patients.
△ Less
Submitted 17 November, 2023;
originally announced November 2023.
-
Plane partitions and rowmotion on rectangular and trapezoidal posets
Authors:
Joseph Johnson,
Ricky Ini Liu
Abstract:
We define a birational map between labelings of a rectangular poset and its associated trapezoidal poset. This map tropicalizes to a bijection between the plane partitions of these posets of fixed height, giving a new bijective proof of a result by Proctor. We also show that this map is equivariant with respect to birational rowmotion, resolving a conjecture of Williams and implying that birationa…
▽ More
We define a birational map between labelings of a rectangular poset and its associated trapezoidal poset. This map tropicalizes to a bijection between the plane partitions of these posets of fixed height, giving a new bijective proof of a result by Proctor. We also show that this map is equivariant with respect to birational rowmotion, resolving a conjecture of Williams and implying that birational rowmotion on trapezoidal posets has finite order.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Thriving in a Pandemic: Lessons Learned from a Resilient University Program Seen Through the CoI Lens
Authors:
Zihui Ma,
Lingyao Li,
John C. E. Johnson
Abstract:
In March 2020, college campuses underwent a sudden transformation to online learning due to the COVID-19 outbreak. To understand the impact of COVID-19 on students' expectations, this study conducted a three-year survey from ten core courses within the Project Management Center for Excellence at the University of Maryland. The study involved two main steps: 1) a statistical analysis to evaluate st…
▽ More
In March 2020, college campuses underwent a sudden transformation to online learning due to the COVID-19 outbreak. To understand the impact of COVID-19 on students' expectations, this study conducted a three-year survey from ten core courses within the Project Management Center for Excellence at the University of Maryland. The study involved two main steps: 1) a statistical analysis to evaluate students' expectations regarding "student," "class," "instructor," and "effort;" and 2) a lexical salience-valence analysis (LSVA) through the lens of the Community of Inquiry (CoI) framework to show the changes of students' expectations. The results revealed that students' overall evaluations maintained relatively consistent amid the COVID-19 teaching period. However, there were significant shifts of the student expectations toward Cognitive, Social and Teaching Presence course elements based on LSVA results. Also, clear differences emerged between under-graduates and graduates in their expectations and preferences in course design and delivery. These insights provide practical recommendations for course instructors in designing effective online courses.
△ Less
Submitted 31 October, 2023;
originally announced October 2023.