-
Cosmological constraints from the cross-correlation of DESI Luminous Red Galaxies with CMB lensing from Planck PR4 and ACT DR6
Authors:
Noah Sailer,
Joshua Kim,
Simone Ferraro,
Mathew S. Madhavacheril,
Martin White,
Irene Abril-Cabezas,
Jessica Nicole Aguilar,
Steven Ahlen,
J. Richard Bond,
David Brooks,
Etienne Burtin,
Erminia Calabrese,
Shi-Fan Chen,
Steve K. Choi,
Todd Claybaugh,
Kyle Dawson,
Axel de la Macorra,
Joseph DeRose,
Arjun Dey,
Biprateep Dey,
Peter Doel,
Jo Dunkley,
Carmen Embil-Villagra,
Gerrit S. Farren,
Andreu Font-Ribera
, et al. (41 additional authors not shown)
Abstract:
We infer the growth of large scale structure over the redshift range $0.4\lesssim z \lesssim 1$ from the cross-correlation of spectroscopically calibrated Luminous Red Galaxies (LRGs) selected from the Dark Energy Spectroscopic Instrument (DESI) legacy imaging survey with CMB lensing maps reconstructed from the latest Planck and ACT data. We adopt a hybrid effective field theory (HEFT) model that…
▽ More
We infer the growth of large scale structure over the redshift range $0.4\lesssim z \lesssim 1$ from the cross-correlation of spectroscopically calibrated Luminous Red Galaxies (LRGs) selected from the Dark Energy Spectroscopic Instrument (DESI) legacy imaging survey with CMB lensing maps reconstructed from the latest Planck and ACT data. We adopt a hybrid effective field theory (HEFT) model that robustly regulates the cosmological information obtainable from smaller scales, such that our cosmological constraints are reliably derived from the (predominantly) linear regime. We perform an extensive set of bandpower- and parameter-level systematics checks to ensure the robustness of our results and to characterize the uniformity of the LRG sample. We demonstrate that our results are stable to a wide range of modeling assumptions, finding excellent agreement with a linear theory analysis performed on a restricted range of scales. From a tomographic analysis of the four LRG photometric redshift bins we find that the rate of structure growth is consistent with $Λ$CDM with an overall amplitude that is $\simeq5-7\%$ lower than predicted by primary CMB measurements with modest $(\sim2σ)$ statistical significance. From the combined analysis of all four bins and their cross-correlations with Planck we obtain $S_8 = 0.765\pm0.023$, which is less discrepant with primary CMB measurements than previous DESI LRG cross Planck CMB lensing results. From the cross-correlation with ACT we obtain $S_8 = 0.790^{+0.024}_{-0.027}$, while when jointly analyzing Planck and ACT we find $S_8 = 0.775^{+0.019}_{-0.022}$ from our data alone and $σ_8 = 0.772^{+0.020}_{-0.023}$ with the addition of BAO data. These constraints are consistent with the latest Planck primary CMB analyses at the $\simeq 1.6-2.2σ$ level, and are in excellent agreement with galaxy lensing surveys.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
The Atacama Cosmology Telescope DR6 and DESI: Structure formation over cosmic time with a measurement of the cross-correlation of CMB Lensing and Luminous Red Galaxies
Authors:
Joshua Kim,
Noah Sailer,
Mathew S. Madhavacheril,
Simone Ferraro,
Irene Abril-Cabezas,
Jessica Nicole Aguilar,
Steven Ahlen,
J. Richard Bond,
David Brooks,
Etienne Burtin,
Erminia Calabrese,
Shi-Fan Chen,
Steve K. Choi,
Todd Claybaugh,
Omar Darwish,
Axel de la Macorra,
Joseph DeRose,
Mark Devlin,
Arjun Dey,
Peter Doel,
Jo Dunkley,
Carmen Embil-Villagra,
Gerrit S. Farren,
Andreu Font-Ribera,
Jaime E. Forero-Romero
, et al. (48 additional authors not shown)
Abstract:
We present a high-significance cross-correlation of CMB lensing maps from the Atacama Cosmology Telescope (ACT) Data Release 6 (DR6) with spectroscopically calibrated luminous red galaxies (LRGs) from the Dark Energy Spectroscopic Instrument (DESI). We detect this cross-correlation at a significance of 38$σ$; combining our measurement with the Planck Public Release 4 (PR4) lensing map, we detect t…
▽ More
We present a high-significance cross-correlation of CMB lensing maps from the Atacama Cosmology Telescope (ACT) Data Release 6 (DR6) with spectroscopically calibrated luminous red galaxies (LRGs) from the Dark Energy Spectroscopic Instrument (DESI). We detect this cross-correlation at a significance of 38$σ$; combining our measurement with the Planck Public Release 4 (PR4) lensing map, we detect the cross-correlation at 50$σ$. Fitting this jointly with the galaxy auto-correlation power spectrum to break the galaxy bias degeneracy with $σ_8$, we perform a tomographic analysis in four LRG redshift bins spanning $0.4 \le z \le 1.0$ to constrain the amplitude of matter density fluctuations through the parameter combination $S_8^\times = σ_8 \left(Ω_m / 0.3\right)^{0.4}$. Prior to unblinding, we confirm with extragalactic simulations that foreground biases are negligible and carry out a comprehensive suite of null and consistency tests. Using a hybrid effective field theory (HEFT) model that allows scales as small as $k_{\rm max}=0.6$ $h/{\rm Mpc}$, we obtain a 3.3% constraint on $S_8^\times = σ_8 \left(Ω_m / 0.3\right)^{0.4} = 0.792^{+0.024}_{-0.028}$ from ACT data, as well as constraints on $S_8^\times(z)$ that probe structure formation over cosmic time. Our result is consistent with the early-universe extrapolation from primary CMB anisotropies measured by Planck PR4 within 1.2$σ$. Jointly fitting ACT and Planck lensing cross-correlations we obtain a 2.7% constraint of $S_8^\times = 0.776^{+0.019}_{-0.021}$, which is consistent with the Planck early-universe extrapolation within 2.1$σ$, with the lowest redshift bin showing the largest difference in mean. The latter may motivate further CMB lensing tomography analyses at $z<0.6$ to assess the impact of potential systematics or the consistency of the $Λ$CDM model over cosmic time.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Entropy Computing: A Paradigm for Optimization in an Open Quantum System
Authors:
Lac Nguyen,
Mohammad-Ali Miri,
R. Joseph Rupert,
Wesley Dyk,
Sam Wu,
Nick Vrahoretis,
Irwin Huang,
Milan Begliarbekov,
Nicholas Chancellor,
Uchenna Chukwu,
Pranav Mahamuni,
Cesar Martinez-Delgado,
David Haycraft,
Carrie Spear,
Mark Campanelli,
Russell Huffman,
Yong Meng Sua,
Yu** Huang
Abstract:
Modern quantum technologies using matter are designed as closed quantum systems to isolate them from interactions with the environment. This design paradigm greatly constrains the scalability and limits practical implementation of such systems. Here, we introduce a novel computing paradigm, entropy computing, that works by conditioning a quantum reservoir thereby enabling the stabilization of a gr…
▽ More
Modern quantum technologies using matter are designed as closed quantum systems to isolate them from interactions with the environment. This design paradigm greatly constrains the scalability and limits practical implementation of such systems. Here, we introduce a novel computing paradigm, entropy computing, that works by conditioning a quantum reservoir thereby enabling the stabilization of a ground state. In this work, we experimentally demonstrate the feasibility of entropy computing by building a hybrid photonic-electronic computer that uses measurement-based feedback to solve non-convex optimization problems. The system functions by using temporal photonic modes to create qudits in order to encode probability amplitudes in the time-frequency degree of freedom of a photon. This scheme, when coupled with electronic interconnects, allows us to encode an arbitrary Hamiltonian into the system and solve non-convex continuous variables and combinatorial optimization problems. We show that the proposed entropy computing paradigm can act as a scalable and versatile platform for tackling a large range of NP-hard optimization problems.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
EventChat: Implementation and user-centric evaluation of a large language model-driven conversational recommender system for exploring leisure events in an SME context
Authors:
Hannes Kunstmann,
Joseph Ollier,
Joel Persson,
Florian von Wangenheim
Abstract:
Large language models (LLMs) present an enormous evolution in the strategic potential of conversational recommender systems (CRS). Yet to date, research has predominantly focused upon technical frameworks to implement LLM-driven CRS, rather than end-user evaluations or strategic implications for firms, particularly from the perspective of a small to medium enterprises (SME) that makeup the bedrock…
▽ More
Large language models (LLMs) present an enormous evolution in the strategic potential of conversational recommender systems (CRS). Yet to date, research has predominantly focused upon technical frameworks to implement LLM-driven CRS, rather than end-user evaluations or strategic implications for firms, particularly from the perspective of a small to medium enterprises (SME) that makeup the bedrock of the global economy. In the current paper, we detail the design of an LLM-driven CRS in an SME setting, and its subsequent performance in the field using both objective system metrics and subjective user evaluations. While doing so, we additionally outline a short-form revised ResQue model for evaluating LLM-driven CRS, enabling replicability in a rapidly evolving field. Our results reveal good system performance from a user experience perspective (85.5% recommendation accuracy) but underscore latency, cost, and quality issues challenging business viability. Notably, with a median cost of $0.04 per interaction and a latency of 5.7s, cost-effectiveness and response time emerge as crucial areas for achieving a more user-friendly and economically viable LLM-driven CRS for SME settings. One major driver of these costs is the use of an advanced LLM as a ranker within the retrieval-augmented generation (RAG) technique. Our results additionally indicate that relying solely on approaches such as Prompt-based learning with ChatGPT as the underlying LLM makes it challenging to achieve satisfying quality in a production environment. Strategic considerations for SMEs deploying an LLM-driven CRS are outlined, particularly considering trade-offs in the current technical landscape.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Towards Generalized On-Chip Communication for Programmable Accelerators in Heterogeneous Architectures
Authors:
Joseph Zuckerman,
John-David Wellman,
Ajay Vanamali,
Manish Shankar,
Gabriele Tombesi,
Karthik Swaminathan,
Kevin Lee,
Mohit Kapur,
Robert Philhower,
Pradip Bose,
Luca P. Carloni
Abstract:
We present several enhancements to the open-source ESP platform to support flexible and efficient on-chip communication for programmable accelerators in heterogeneous SoCs. These enhancements include 1) a flexible point-to-point communication mechanism between accelerators, 2) a multicast NoC that supports data forwarding to multiple accelerators simultaneously, 3) accelerator synchronization leve…
▽ More
We present several enhancements to the open-source ESP platform to support flexible and efficient on-chip communication for programmable accelerators in heterogeneous SoCs. These enhancements include 1) a flexible point-to-point communication mechanism between accelerators, 2) a multicast NoC that supports data forwarding to multiple accelerators simultaneously, 3) accelerator synchronization leveraging the SoC's coherence protocol, 4) an accelerator interface that offers fine-grained control over the communication mode used, and 5) an example ISA extension to support our enhancements. Our solution adds negligible area to the SoC architecture and requires minimal changes to the accelerators themselves. We have validated most of these features in complex FPGA prototypes and plan to include them in the open-source release of ESP in the coming months.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
A Quantum Information Perspective on Many-Body Dispersive Forces
Authors:
Christopher Willby,
Martin Kiffner,
Joseph Tindall,
Jason Crain,
Dieter Jaksch
Abstract:
Despite its ubiquity, many-body dispersion remains poorly understood. Here we investigate the distribution of entanglement in quantum Drude oscillator assemblies, minimal models for dispersion bound systems. We analytically determine a relation between entanglement and energy, showing how the entanglement distribution governs dispersive bonding. This suggests that the monogamy of entanglement expl…
▽ More
Despite its ubiquity, many-body dispersion remains poorly understood. Here we investigate the distribution of entanglement in quantum Drude oscillator assemblies, minimal models for dispersion bound systems. We analytically determine a relation between entanglement and energy, showing how the entanglement distribution governs dispersive bonding. This suggests that the monogamy of entanglement explains deviations of multipartite dispersive binding energies compared to the commonly used pairwise prediction. We illustrate our findings using examples of a trimer and extended crystal lattices.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
A Critical Assessment of Interpretable and Explainable Machine Learning for Intrusion Detection
Authors:
Omer Subasi,
Johnathan Cree,
Joseph Manzano,
Elena Peterson
Abstract:
There has been a large number of studies in interpretable and explainable ML for cybersecurity, in particular, for intrusion detection. Many of these studies have significant amount of overlap** and repeated evaluations and analysis. At the same time, these studies overlook crucial model, data, learning process, and utility related issues and many times completely disregard them. These issues in…
▽ More
There has been a large number of studies in interpretable and explainable ML for cybersecurity, in particular, for intrusion detection. Many of these studies have significant amount of overlap** and repeated evaluations and analysis. At the same time, these studies overlook crucial model, data, learning process, and utility related issues and many times completely disregard them. These issues include the use of overly complex and opaque ML models, unaccounted data imbalances and correlated features, inconsistent influential features across different explanation methods, the inconsistencies stemming from the constituents of a learning process, and the implausible utility of explanations. In this work, we empirically demonstrate these issues, analyze them and propose practical solutions in the context of feature-based model explanations. Specifically, we advise avoiding complex opaque models such as Deep Neural Networks and instead using interpretable ML models such as Decision Trees as the available intrusion datasets are not difficult for such interpretable models to classify successfully. Then, we bring attention to the binary classification metrics such as Matthews Correlation Coefficient (which are well-suited for imbalanced datasets. Moreover, we find that feature-based model explanations are most often inconsistent across different settings. In this respect, to further gauge the extent of inconsistencies, we introduce the notion of cross explanations which corroborates that the features that are determined to be impactful by one explanation method most often differ from those by another method. Furthermore, we show that strongly correlated data features and the constituents of a learning process, such as hyper-parameters and the optimization routine, become yet another source of inconsistent explanations. Finally, we discuss the utility of feature-based explanations.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Deep learning architectures for data-driven damage detection in nonlinear dynamic systems
Authors:
Harrish Joseph,
Giuseppe Quaranta,
Biagio Carboni,
Walter Lacarbonara
Abstract:
The primary goal of structural health monitoring is to detect damage at its onset before it reaches a critical level. The in-depth investigation in the present work addresses deep learning applied to data-driven damage detection in nonlinear dynamic systems. In particular, autoencoders (AEs) and generative adversarial networks (GANs) are implemented leveraging on 1D convolutional neural networks.…
▽ More
The primary goal of structural health monitoring is to detect damage at its onset before it reaches a critical level. The in-depth investigation in the present work addresses deep learning applied to data-driven damage detection in nonlinear dynamic systems. In particular, autoencoders (AEs) and generative adversarial networks (GANs) are implemented leveraging on 1D convolutional neural networks. The onset of damage is detected in the investigated nonlinear dynamic systems by exciting random vibrations of varying intensity, without prior knowledge of the system or the excitation and in unsupervised manner. The comprehensive numerical study is conducted on dynamic systems exhibiting different types of nonlinear behavior. An experimental application related to a magneto-elastic nonlinear system is also presented to corroborate the conclusions.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Elementary Formulas for Greatest Common Divisors and Semiprime Factors
Authors:
Joseph M. Shunia
Abstract:
We present new formulas for computing greatest common divisors (GCDs) and extracting the prime factors of semiprimes using only elementary arithmetic operations: addition, subtraction, multiplication, floored division, and exponentiation. Our GCD formula simplifies a result of Mazzanti, and is derived using Kronecker substitution techniques from our previous work. We utilize the GCD formula, along…
▽ More
We present new formulas for computing greatest common divisors (GCDs) and extracting the prime factors of semiprimes using only elementary arithmetic operations: addition, subtraction, multiplication, floored division, and exponentiation. Our GCD formula simplifies a result of Mazzanti, and is derived using Kronecker substitution techniques from our previous work. We utilize the GCD formula, along with recent developments on arithmetic terms for square roots and factorials, to derive explicit expressions for the prime factors of a semiprime $n=pq$.
△ Less
Submitted 22 June, 2024;
originally announced July 2024.
-
Accelerating quantum imaginary-time evolution with random measurements
Authors:
Ioannis Kolotouros,
David Joseph,
Anand Kumar Narayanan
Abstract:
Quantum imaginary-time evolution (QITE) is a promising tool to prepare thermal or ground states of Hamiltonians, as convergence is guaranteed when the evolved state overlaps with the ground state. However, its implementation using a parameterized quantum circuit is impractical as the number of parameters $m$ increases, since each step in the evolution takes $Θ(m^2)$ state preparations to calculate…
▽ More
Quantum imaginary-time evolution (QITE) is a promising tool to prepare thermal or ground states of Hamiltonians, as convergence is guaranteed when the evolved state overlaps with the ground state. However, its implementation using a parameterized quantum circuit is impractical as the number of parameters $m$ increases, since each step in the evolution takes $Θ(m^2)$ state preparations to calculate the quantum Fisher information matrix (QFIM). In this work, we accelerate QITE by rapid estimation of the QFIM, while conserving the convergence guarantees to the extent possible. To this end, we prove that if a parameterized state is rotated by a 2-design and measured in the computational basis, then the QFIM can be inferred from partial derivative cross correlations of the probability outcomes. One sample estimate costs only $Θ(m)$ state preparations, leading to rapid QFIM estimation when a few samples suffice. The second family of estimators take greater liberties and replace QFIMs with averaged classical Fisher information matrices (CFIMs). In an extreme special case optimized for rapid (over accurate) descent, just one CFIM sample is drawn. We justify the second estimator family by proving rapid descent. Guided by these results, we propose the random-measurement imaginary-time evolution (RMITE) algorithm, which we showcase and test in several molecular systems, with the goal of preparing ground states.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Implementation and Analysis of GPU Algorithms for Vecchia Approximation
Authors:
Zachary James,
Joseph Guinness
Abstract:
Gaussian Processes have become an indispensable part of the spatial statistician's toolbox but are unsuitable for analyzing large dataset because of the significant time and memory needed to fit the associated model exactly. Vecchia Approximation is widely used to reduce the computational complexity and can be calculated with embarrassingly parallel algorithms. While multi-core software has been d…
▽ More
Gaussian Processes have become an indispensable part of the spatial statistician's toolbox but are unsuitable for analyzing large dataset because of the significant time and memory needed to fit the associated model exactly. Vecchia Approximation is widely used to reduce the computational complexity and can be calculated with embarrassingly parallel algorithms. While multi-core software has been developed for Vecchia Approximation, such as the GpGp R package, software designed to run on graphics processing units (GPU) is lacking, despite the tremendous success GPUs have had in statistics and machine learning. We compare three different ways to implement Vecchia Approximation on a GPU: two of which are similar to methods used for other Gaussian Process approximations and one that is new. The impact of memory type on performance is investigated and the final method is optimized accordingly. We show that our new method outperforms the other two and then present it in the GpGpU R package. We compare GpGpU to existing multi-core and GPU-accelerated software by fitting Gaussian Process models on various datasets, including a large spatial-temporal dataset of $n>10^6$ points collected from an earth-observing satellite. Our results show that GpGpU achieves faster runtimes and better predictive accuracy.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Large-scale quantum reservoir learning with an analog quantum computer
Authors:
Milan Kornjača,
Hong-Ye Hu,
Chen Zhao,
Jonathan Wurtz,
Phillip Weinberg,
Majd Hamdan,
Andrii Zhdanov,
Sergio H. Cantu,
Hengyun Zhou,
Rodrigo Araiza Bravo,
Kevin Bagnall,
James I. Basham,
Joseph Campo,
Adam Choukri,
Robert DeAngelo,
Paige Frederick,
David Haines,
Julian Hammett,
Ning Hsu,
Ming-Guang Hu,
Florian Huber,
Paul Niklas Jepsen,
Ningyuan Jia,
Thomas Karolyshyn,
Minho Kwon
, et al. (28 additional authors not shown)
Abstract:
Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lac…
▽ More
Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lack potential for quantum advantage. To address this, we develop a general-purpose, gradient-free, and scalable quantum reservoir learning algorithm that harnesses the quantum dynamics of neutral-atom analog quantum computers to process data. We experimentally implement the algorithm, achieving competitive performance across various categories of machine learning tasks, including binary and multi-class classification, as well as timeseries prediction. Effective and improving learning is observed with increasing system sizes of up to 108 qubits, demonstrating the largest quantum machine learning experiment to date. We further observe comparative quantum kernel advantage in learning tasks by constructing synthetic datasets based on the geometric differences between generated quantum and classical data kernels. Our findings demonstrate the potential of utilizing classically intractable quantum correlations for effective machine learning. We expect these results to stimulate further extensions to different quantum hardware and machine learning paradigms, including early fault-tolerant hardware and generative machine learning tasks.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
N-body linear force law allowing analytic solutions
Authors:
Joseph West,
Sean P. Bartz
Abstract:
We present a pair-wise force law in a system of N particles that produces analytic solutions for arbitrary number of particles, masses, and initial conditions. Each pair of particles interacts via a force that is proportional to the product of their masses and their separation distance, with the force directed radially. We show that, despite the N-body interaction, each particle behaves as if it i…
▽ More
We present a pair-wise force law in a system of N particles that produces analytic solutions for arbitrary number of particles, masses, and initial conditions. Each pair of particles interacts via a force that is proportional to the product of their masses and their separation distance, with the force directed radially. We show that, despite the N-body interaction, each particle behaves as if it interacts only with the center of mass of the system. This effective two-body interaction behaves as Hooke's Law with a common frequency for all particles, with the familiar analytic solutions for the trajectories. With these analytic solutions, it is possible to efficiently simulate a collection of these particles and incorporate other external forces. As an example, we simulate the particles within an adiabatically expanding container and calculate pressure and temperature in both the attractive and repulsive cases.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Learning Bipedal Walking on a Quadruped Robot via Adversarial Motion Priors
Authors:
Tianhu Peng,
Lingfan Bao,
Joseph Humphreys,
Andromachi Maria Delfaki,
Dimitrios Kanoulas,
Chengxu Zhou
Abstract:
Previous studies have successfully demonstrated agile and robust locomotion in challenging terrains for quadrupedal robots. However, the bipedal locomotion mode for quadruped robots remains unverified. This paper explores the adaptation of a learning framework originally designed for quadrupedal robots to operate blind locomotion in biped mode. We leverage a framework that incorporates Adversarial…
▽ More
Previous studies have successfully demonstrated agile and robust locomotion in challenging terrains for quadrupedal robots. However, the bipedal locomotion mode for quadruped robots remains unverified. This paper explores the adaptation of a learning framework originally designed for quadrupedal robots to operate blind locomotion in biped mode. We leverage a framework that incorporates Adversarial Motion Priors with a teacher-student policy to enable imitation of a reference trajectory and navigation on tough terrain. Our work involves transferring and evaluating a similar learning framework on a quadruped robot in biped mode, aiming to achieve stable walking on both flat and complicated terrains. Our simulation results demonstrate that the trained policy enables the quadruped robot to navigate both flat and challenging terrains, including stairs and uneven surfaces.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?
Authors:
Nicy Scaria,
Silvester John Joseph Kennedy,
Deepak Subramani
Abstract:
Small Language Models (SLMs) are generally considered to be more compact versions of large language models (LLMs), typically having fewer than 7 billion parameters. This study investigates the ability of small language models to learn, retain, and subsequently eliminate noise that is typically not found on the internet, where most pretraining datasets are sourced. For this, four pre-trained SLMs w…
▽ More
Small Language Models (SLMs) are generally considered to be more compact versions of large language models (LLMs), typically having fewer than 7 billion parameters. This study investigates the ability of small language models to learn, retain, and subsequently eliminate noise that is typically not found on the internet, where most pretraining datasets are sourced. For this, four pre-trained SLMs were utilized: Olmo 1B, Qwen1.5 1.8B, Gemma 2B, and Phi2 2.7B. The models were instruction-tuned without noise and tested for task execution with in-context learning. Afterward, noise patterns were introduced to evaluate the models' learning and unlearning capabilities. We evaluated the models' performance at various training levels. Phi consistently excelled with word-level noise but performed the worst with character-level noise. Despite being the smallest with approximately 1 billion parameters, Olmo performed consistently well on tasks.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Scaling Technology Acceptance Analysis with Large Language Model (LLM) Annotation Systems
Authors:
Pawel Robert Smolinski,
Joseph Januszewicz,
Jacek Winiarski
Abstract:
Technology acceptance models effectively predict how users will adopt new technology products. Traditional surveys, often expensive and cumbersome, are commonly used for this assessment. As an alternative to surveys, we explore the use of large language models for annotating online user-generated content, like digital reviews and comments. Our research involved designing an LLM annotation system t…
▽ More
Technology acceptance models effectively predict how users will adopt new technology products. Traditional surveys, often expensive and cumbersome, are commonly used for this assessment. As an alternative to surveys, we explore the use of large language models for annotating online user-generated content, like digital reviews and comments. Our research involved designing an LLM annotation system that transform reviews into structured data based on the Unified Theory of Acceptance and Use of Technology model. We conducted two studies to validate the consistency and accuracy of the annotations. Results showed moderate-to-strong consistency of LLM annotation systems, improving further by lowering the model temperature. LLM annotations achieved close agreement with human expert annotations and outperformed the agreement between experts for UTAUT variables. These results suggest that LLMs can be an effective tool for analyzing user sentiment, offering a practical alternative to traditional survey methods and enabling deeper insights into technology design and adoption.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
MnRhBi3: A Cleavable Antiferromagnetic Metal
Authors:
Eleanor M. Clements,
Dmitry Ovchinnikov,
Parul R. Raghuvanshi,
Valentino R. Cooper,
Satoshi Okamoto,
Andrew D. Christianson,
Joseph A. M. Paddison,
Brenden R. Ortiz,
Stuart Calder,
Andrew F. May,
Xiaodong Xu,
Jiaqiang Yan,
Michael A. McGuire
Abstract:
Cleavable metallic antiferromagnets may be of use for low-dissipation spintronic devices; however, few are currently known. Here we present orthorhombic MnRhBi3 as one such compound and present a thorough study of its physical properties. Exfoliation is demonstrated experimentally, and the cleavage energy and electronic structure are examined by density functional theory calculations. It is conclu…
▽ More
Cleavable metallic antiferromagnets may be of use for low-dissipation spintronic devices; however, few are currently known. Here we present orthorhombic MnRhBi3 as one such compound and present a thorough study of its physical properties. Exfoliation is demonstrated experimentally, and the cleavage energy and electronic structure are examined by density functional theory calculations. It is concluded that MnRhBi3 is a van der Waals layered material that cleaves easily between neighboring Bi layers, and that the Bi atoms have lone pairs extending into the van der Waals gaps. A series of four phase transitions are observed below room temperature, and neutron diffraction shows that at least two of the transitions involve the formation of antiferromagnetic order. Anomalous thermal expansion points to a crystallographic phase transition and/or strong magnetoelastic coupling. This work reveals a complex phase evolution in MnRhBi3 and establishes this cleavable antiferromagnetic metal as an interesting material for studying the interplay of structure, magnetism, and transport in the bulk and ultrathin limits as well as the role of lone pair electrons in interface chemistry and proximity effects in van der Waals heterostructures.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Closure invariants for polarised radio interferometric observations: a graph theoretical approach
Authors:
Vinay Kumar,
Rajaram Nityananda,
Joseph Samuel
Abstract:
Aperture synthesis observations with full polarisation have long been used to study the magnetic fields of synchrotron emitting sources. Recently proposed closure invariants give us a powerful method for extracting information from measured visibilities which are corrupted by antenna and polarisation dependent gains. In this paper, a formalism developed earlier for complete graphs (where all visib…
▽ More
Aperture synthesis observations with full polarisation have long been used to study the magnetic fields of synchrotron emitting sources. Recently proposed closure invariants give us a powerful method for extracting information from measured visibilities which are corrupted by antenna and polarisation dependent gains. In this paper, a formalism developed earlier for complete graphs (where all visibilities are available) is extended to incomplete graphs. The formalism provides a complete and independent set of closure invariants from the measured visibilities in a general situation where not all visibilities are available. We then show in a simulated, quasi-realistic case that the invariants developed here contain usable information even in the presence of noise.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Sparse Actuator Scheduling for Discrete-Time Linear Dynamical Systems
Authors:
Krishna Praveen V. S. Kondapi,
Chandrasekhar Sriram,
Geethu Joseph,
Chandra R. Murthy
Abstract:
We consider the control of discrete-time linear dynamical systems using sparse inputs where we limit the number of active actuators at every time step. We develop an algorithm for determining a sparse actuator schedule that ensures the existence of a sparse control input sequence, following the schedule, that takes the system from any given initial state to any desired final state. Since such an a…
▽ More
We consider the control of discrete-time linear dynamical systems using sparse inputs where we limit the number of active actuators at every time step. We develop an algorithm for determining a sparse actuator schedule that ensures the existence of a sparse control input sequence, following the schedule, that takes the system from any given initial state to any desired final state. Since such an actuator schedule is not unique, we look for a schedule that minimizes the energy of sparse inputs. For this, we optimize the trace of the inverse of the resulting controllability Gramian, which is an approximate measure of the average energy of the inputs. We present a greedy algorithm along with its theoretical guarantees. Finally, we empirically show that our greedy algorithm ensures the controllability of the linear system with a small number of active actuators per time step without a significant average energy expenditure compared to the fully actuated system.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Towards Universal Mesh Movement Networks
Authors:
Mingrui Zhang,
Chunyang Wang,
Stephan Kramer,
Joseph G. Wallwork,
Siyi Li,
Jiancheng Liu,
Xiang Chen,
Matthew D. Piggott
Abstract:
Solving complex Partial Differential Equations (PDEs) accurately and efficiently is an essential and challenging problem in all scientific and engineering disciplines. Mesh movement methods provide the capability to improve the accuracy of the numerical solution without increasing the overall mesh degree of freedom count. Conventional sophisticated mesh movement methods are extremely expensive and…
▽ More
Solving complex Partial Differential Equations (PDEs) accurately and efficiently is an essential and challenging problem in all scientific and engineering disciplines. Mesh movement methods provide the capability to improve the accuracy of the numerical solution without increasing the overall mesh degree of freedom count. Conventional sophisticated mesh movement methods are extremely expensive and struggle to handle scenarios with complex boundary geometries. However, existing learning-based methods require re-training from scratch given a different PDE type or boundary geometry, which limits their applicability, and also often suffer from robustness issues in the form of inverted elements. In this paper, we introduce the Universal Mesh Movement Network (UM2N), which -- once trained -- can be applied in a non-intrusive, zero-shot manner to move meshes with different size distributions and structures, for solvers applicable to different PDE types and boundary geometries. UM2N consists of a Graph Transformer (GT) encoder for extracting features and a Graph Attention Network (GAT) based decoder for moving the mesh. We evaluate our method on advection and Navier-Stokes based examples, as well as a real-world tsunami simulation case. Our method outperforms existing learning-based mesh movement methods in terms of the benchmarks described above. In comparison to the conventional sophisticated Monge-Ampère PDE-solver based method, our approach not only significantly accelerates mesh movement, but also proves effective in scenarios where the conventional method fails. Our project page is at https://erizmr.github.io/UM2N/.
△ Less
Submitted 1 July, 2024; v1 submitted 29 June, 2024;
originally announced July 2024.
-
Observations of the 2024 May 14 X8.7 Solar Flare with the Goldstone-Apple Valley Radio Telescope (GAVRT)
Authors:
Thangasamy Velusamy,
Ryan Dorcey,
Nancy Kreuser-Jenkins,
Lisa Nichole Lamb,
Erica Pagano,
Marin M. Anderson,
Joseph Lazio,
Steven Levin
Abstract:
The Goldstone-Apple Valley Radio Telescope (GAVRT) project conducts a regular monitoring program of the Sun. The GAVRT Solar Patrol project uses a 34 m diameter antenna to produce raster-scan maps of the Sun simultaneously at 4 frequencies ranging from approximately 3 GHz to 14 GHz. On 2024 May 14, as part of regular GAVRT Solar Patrol observations, raster maps were produced when an X8.7 solar fla…
▽ More
The Goldstone-Apple Valley Radio Telescope (GAVRT) project conducts a regular monitoring program of the Sun. The GAVRT Solar Patrol project uses a 34 m diameter antenna to produce raster-scan maps of the Sun simultaneously at 4 frequencies ranging from approximately 3 GHz to 14 GHz. On 2024 May 14, as part of regular GAVRT Solar Patrol observations, raster maps were produced when an X8.7 solar flare occurred in active region AR13664. Here we present the GAVRT maps of the May 14 flare along with microwave flux density spectra showing the non-thermal microwave burst emission from mildly relativistic electrons produced in this largest flare of Solar Cycle 25 to date. AR13664 reappeared as AR13697 and continued to be very active, producing X flares while GAVRT monitored its activity. GAVRT microwave data provide a powerful complement to the energetic electrons tracked by X-ray, millimeter-wave and γ-ray emissions.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Effects of resonant dipole-dipole interactions in the spin noise of atomic vapors
Authors:
J. Delpy,
N. Fayard,
F. Bretenaker,
F. Goldfarb
Abstract:
We perform spin noise spectroscopy close to resonance in a 1-mm-thick cell containing a dense Rubidium vapor. A laser is used to excite optical dipoles in the vapor while probing the Faraday rotation noise. We report unusual lineshapes of the spin noise spectra with a strong density dependence, which we attribute to interactions arising between particles in the system. Introducing a two-body model…
▽ More
We perform spin noise spectroscopy close to resonance in a 1-mm-thick cell containing a dense Rubidium vapor. A laser is used to excite optical dipoles in the vapor while probing the Faraday rotation noise. We report unusual lineshapes of the spin noise spectra with a strong density dependence, which we attribute to interactions arising between particles in the system. Introducing a two-body model and simulations, we show that these features are the hallmark of a strong dipole-dipole interaction between binaries within the ensemble. A precise fit of the experimental spectra allows to extract the strength and the duration of the dipole-dipole interaction. We unveil its impact on the spin noise frequency and investigate the role of the atomic motion in the unexpected lineshapes. This work demonstrates the potential of spin noise spectroscopy to observe and quantify strong interactions occurring within a particle ensemble.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Long-range interactions revealed by collective spin noise spectra in atomic vapors
Authors:
J. Delpy,
N. Fayard,
F. Bretenaker,
F. Goldfarb
Abstract:
We report anomalous features in the spin noise spectroscopy (SNS) of a thin cell of a dense vapor of alkali atoms. At high densities and close to resonance, we observe a dramatic broadening of the spin noise spectra as well as an unexpected extra low-frequency noise component. With the help of a two-body model and simulations, we show that these features are the hallmark of a strong, long-range di…
▽ More
We report anomalous features in the spin noise spectroscopy (SNS) of a thin cell of a dense vapor of alkali atoms. At high densities and close to resonance, we observe a dramatic broadening of the spin noise spectra as well as an unexpected extra low-frequency noise component. With the help of a two-body model and simulations, we show that these features are the hallmark of a strong, long-range dipole-dipole interaction within the ensemble. The additional low-frequency noise reveals the correlated evolution of pair of atoms beyond the impact approximation. In this regime, we demonstrate that spin noise can no longer be obtained from one-body dynamics, opening the way for the characterization of many-body spin noise, atomic entanglement or higher order spin correlators in atomic vapors using SNS.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Approximate Solutions for Multi-Trip Route Planning in Time-Sensitive Situations
Authors:
Bahar Cavdar,
Joseph Geunes,
Xiaofeng Nie,
Yue Wang
Abstract:
We consider emergent situations that require transporting individuals from their locations to a facility using a single capacitated vehicle, where transportation duration has a negative impact on the individuals. A dispatcher determines routes to maximize total satisfaction. We call this problem the Ambulance Bus Routing Problem. We develop efficient approximate policies for the dispatcher to allo…
▽ More
We consider emergent situations that require transporting individuals from their locations to a facility using a single capacitated vehicle, where transportation duration has a negative impact on the individuals. A dispatcher determines routes to maximize total satisfaction. We call this problem the Ambulance Bus Routing Problem. We develop efficient approximate policies for the dispatcher to allocate individuals to multiple routes, characterize an optimal solution of the relaxed approximate model, and devise a heuristic to obtain a near-optimal integer solution quickly.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Extended sample size calculations for evaluation of prediction models using a threshold for classification
Authors:
Rebecca Whittle,
Joie Ensor,
Lucinda Archer,
Gary S. Collins,
Paula Dhiman,
Alastair Denniston,
Joseph Alderman,
Amardeep Legha,
Maarten van Smeden,
Karel G. Moons,
Jean-Baptiste Cazier,
Richard D. Riley,
Kym I. E. Snell
Abstract:
When evaluating the performance of a model for individualised risk prediction, the sample size needs to be large enough to precisely estimate the performance measures of interest. Current sample size guidance is based on precisely estimating calibration, discrimination, and net benefit, which should be the first stage of calculating the minimum required sample size. However, when a clinically impo…
▽ More
When evaluating the performance of a model for individualised risk prediction, the sample size needs to be large enough to precisely estimate the performance measures of interest. Current sample size guidance is based on precisely estimating calibration, discrimination, and net benefit, which should be the first stage of calculating the minimum required sample size. However, when a clinically important threshold is used for classification, other performance measures can also be used. We extend the previously published guidance to precisely estimate threshold-based performance measures. We have developed closed-form solutions to estimate the sample size required to target sufficiently precise estimates of accuracy, specificity, sensitivity, PPV, NPV, and F1-score in an external evaluation study of a prediction model with a binary outcome. This approach requires the user to pre-specify the target standard error and the expected value for each performance measure. We describe how the sample size formulae were derived and demonstrate their use in an example. Extension to time-to-event outcomes is also considered. In our examples, the minimum sample size required was lower than that required to precisely estimate the calibration slope, and we expect this would most often be the case. Our formulae, along with corresponding Python code and updated R and Stata commands (pmvalsampsize), enable researchers to calculate the minimum sample size needed to precisely estimate threshold-based performance measures in an external evaluation study. These criteria should be used alongside previously published criteria to precisely estimate the calibration, discrimination, and net-benefit.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Equi-isoclinic subspaces from symmetry
Authors:
Matthew Fickus,
Joseph W. Iverson,
John Jasper,
Dustin G. Mixon
Abstract:
We describe a flexible technique that constructs tight fusion frames with prescribed transitive symmetry. Applying this technique with representations of the symmetric and alternating groups, we obtain several new infinite families of equi-isoclinic tight fusion frames, each with the remarkable property that its automorphism group is either $S_n$ or $A_n$. These ensembles are optimal packings for…
▽ More
We describe a flexible technique that constructs tight fusion frames with prescribed transitive symmetry. Applying this technique with representations of the symmetric and alternating groups, we obtain several new infinite families of equi-isoclinic tight fusion frames, each with the remarkable property that its automorphism group is either $S_n$ or $A_n$. These ensembles are optimal packings for Grassmannian space equipped with spectral distance, and as such, they find applications in block compressed sensing.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Tradition or Innovation: A Comparison of Modern ASR Methods for Forced Alignment
Authors:
Rotem Rousso,
Eyal Cohen,
Joseph Keshet,
Eleanor Chodroff
Abstract:
Forced alignment (FA) plays a key role in speech research through the automatic time alignment of speech signals with corresponding text transcriptions. Despite the move towards end-to-end architectures for speech technology, FA is still dominantly achieved through a classic GMM-HMM acoustic model. This work directly compares alignment performance from leading automatic speech recognition (ASR) me…
▽ More
Forced alignment (FA) plays a key role in speech research through the automatic time alignment of speech signals with corresponding text transcriptions. Despite the move towards end-to-end architectures for speech technology, FA is still dominantly achieved through a classic GMM-HMM acoustic model. This work directly compares alignment performance from leading automatic speech recognition (ASR) methods, WhisperX and Massively Multilingual Speech Recognition (MMS), against a Kaldi-based GMM-HMM system, the Montreal Forced Aligner (MFA). Performance was assessed on the manually aligned TIMIT and Buckeye datasets, with comparisons conducted only on words correctly recognized by WhisperX and MMS. The MFA outperformed both WhisperX and MMS, revealing a shortcoming of modern ASR systems. These findings highlight the need for advancements in forced alignment and emphasize the importance of integrating traditional expertise with modern innovation to foster progress. Index Terms: forced alignment, phoneme alignment, word alignment
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network
Authors:
Yehoshua Dissen,
Shiry Yonash,
Israel Cohen,
Joseph Keshet
Abstract:
In the realm of automatic speech recognition (ASR), robustness in noisy environments remains a significant challenge. Recent ASR models, such as Whisper, have shown promise, but their efficacy in noisy conditions can be further enhanced. This study is focused on recovering from packet loss to improve the word error rate (WER) of ASR models. We propose using a front-end adaptation network connected…
▽ More
In the realm of automatic speech recognition (ASR), robustness in noisy environments remains a significant challenge. Recent ASR models, such as Whisper, have shown promise, but their efficacy in noisy conditions can be further enhanced. This study is focused on recovering from packet loss to improve the word error rate (WER) of ASR models. We propose using a front-end adaptation network connected to a frozen ASR model. The adaptation network is trained to modify the corrupted input spectrum by minimizing the criteria of the ASR model in addition to an enhancement loss function. Our experiments demonstrate that the adaptation network, trained on Whisper's criteria, notably reduces word error rates across domains and languages in packet-loss scenarios. This improvement is achieved with minimal affect to Whisper model's foundational performance, underscoring our method's practicality and potential in enhancing ASR models in challenging acoustic environments.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Direct measurement of forces in air-based acoustic levitation systems
Authors:
Nina M. Brown,
Bryan VanSaders,
Jason M. Kronenfeld,
Joseph M. DeSimone,
Heinrich M. Jaeger
Abstract:
Acoustic levitation is frequently used for non-contact manipulation of objects and to study the impact of microgravity on physical and biological processes. While the force field produced by sound pressure lifts particles against gravity (primary acoustic force), multiple levitating objects in the same acoustic cavity interact via forces that arise from scattered sound (secondary acoustic forces).…
▽ More
Acoustic levitation is frequently used for non-contact manipulation of objects and to study the impact of microgravity on physical and biological processes. While the force field produced by sound pressure lifts particles against gravity (primary acoustic force), multiple levitating objects in the same acoustic cavity interact via forces that arise from scattered sound (secondary acoustic forces). Current experimental techniques for obtaining these force fields are not well-suited for map** the primary force field at high spatial resolution and cannot directly measure the secondary scattering force. Here we introduce a method that can measure both acoustic forces in situ, including secondary forces in the near-field limit between arbitrarily shaped, closely spaced objects. Operating similarly to an atomic force microscope, the method inserts into the acoustic cavity a suitably shaped probe tip at the end of a long, flexible cantilever and optically detects its deflection. This makes it possible to measure forces with a resolution better than 50 nN, and also to apply stress or strain in a controlled manner to manipulate levitated objects. We demonstrate this by extracting the acoustic potential present in a levitation cavity, directly measuring the acoustic scattering force between two objects, and applying tension to a levitated granular raft of acoustically-bound particles in order to obtain the force-displacement curve for its deformation.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
RouteLLM: Learning to Route LLMs with Preference Data
Authors:
Isaac Ong,
Amjad Almahairi,
Vincent Wu,
Wei-Lin Chiang,
Tianhao Wu,
Joseph E. Gonzalez,
M Waleed Kadous,
Ion Stoica
Abstract:
Large language models (LLMs) exhibit impressive capabilities across a wide range of tasks, yet the choice of which model to use often involves a trade-off between performance and cost. More powerful models, though effective, come with higher expenses, while less capable models are more cost-effective. To address this dilemma, we propose several efficient router models that dynamically select betwe…
▽ More
Large language models (LLMs) exhibit impressive capabilities across a wide range of tasks, yet the choice of which model to use often involves a trade-off between performance and cost. More powerful models, though effective, come with higher expenses, while less capable models are more cost-effective. To address this dilemma, we propose several efficient router models that dynamically select between a stronger and a weaker LLM during inference, aiming to optimize the balance between cost and response quality. We develop a training framework for these routers leveraging human preference data and data augmentation techniques to enhance performance. Our evaluation on widely-recognized benchmarks shows that our approach significantly reduces costs-by over 2 times in certain cases-without compromising the quality of responses. Interestingly, our router models also demonstrate significant transfer learning capabilities, maintaining their performance even when the strong and weak models are changed at test time. This highlights the potential of these routers to provide a cost-effective yet high-performance solution for deploying LLMs.
△ Less
Submitted 1 July, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Using 3.4-$μ$m Variability towards White Dwarfs as a Signpost of Remnant Planetary Systems
Authors:
Joseph A. Guidry,
J. J. Hermes,
Kishalay De,
Lou Baya Ould Rouis,
Brison B. Ewing,
B. C. Kaiser
Abstract:
Roughly 2% of white dwarfs harbor planetary debris disks detectable via infrared excesses, but only a few percent of these disks show a gaseous component, distinguished by their double-peaked emission at the near-infrared calcium triplet. Previous studies found most debris disks around white dwarfs are variable at 3.4 and 4.5 $μ$m, but they analyzed only a few of the now 21 published disks showing…
▽ More
Roughly 2% of white dwarfs harbor planetary debris disks detectable via infrared excesses, but only a few percent of these disks show a gaseous component, distinguished by their double-peaked emission at the near-infrared calcium triplet. Previous studies found most debris disks around white dwarfs are variable at 3.4 and 4.5 $μ$m, but they analyzed only a few of the now 21 published disks showing calcium emission. To test if most published calcium emission disks exhibit large-amplitude stochastic variability in the near-infrared, we use light curves generated from the unWISE images at 3.4 $μ$m that are corrected for proper motion to characterize the near-infrared variability of these disks against samples of disks without calcium emission, highly variable cataclysmic variables, and 3215 isolated white dwarfs. We find most calcium emission disks are extremely variable: 6/11 with sufficient signal-to-noise show high-amplitude variability in their 3.4-$μ$m light curves. These results lend further credence to the notion that disks showing gaseous debris in emission are the most collisionally active. Under the assumption that 3.4-$μ$m variability is characteristic of white dwarfs with dusty debris disks, we generate a catalog of 104 high-confidence near-infrared variable white dwarfs, 84 of which are published as variable for the first time. We do near-infrared spectroscopic follow-up of seven new candidate 3.4-$μ$m variables, confirming at least one new remnant planetary system, and posit that empirical near-infrared variability can be a discovery engine for debris disks showing gaseous emission.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Gauge Freedom and Objective Rates in the Morphodynamics of Fluid Deformable Surfaces: the Jaumann Rate vs. the Material Derivative
Authors:
Joseph Pollard,
Sami Al-Izzi,
Richard G. Morris
Abstract:
Morphodynamic descriptions of fluid deformable surfaces are relevant for a range of biological and soft matter phenomena, spanning materials that can be passive or active, as well as ordered or topological. However, a principled, geometric formulation of the correct hydrodynamic equations has remained opaque, with objective rates proving a central, contentious issue. We argue that this is due to a…
▽ More
Morphodynamic descriptions of fluid deformable surfaces are relevant for a range of biological and soft matter phenomena, spanning materials that can be passive or active, as well as ordered or topological. However, a principled, geometric formulation of the correct hydrodynamic equations has remained opaque, with objective rates proving a central, contentious issue. We argue that this is due to a conflation of several important notions that must be disambiguated when describing fluid deformable surfaces. These are the Eulerian and Lagrangian perspectives on fluid motion, and three different types of gauge freedom: in the ambient space; in the parameterisation of the surface, and; in the choice of frame field on the surface. We clarify these ideas, and show that objective rates in fluid deformable surfaces are time derivatives that are invariant under the first of these gauge freedoms, and which also preserve the structure of the ambient metric. The latter condition reduces a potentially infinite number of possible objective rates to only two: the material derivative and the Jaumann rate. The material derivative is invariant under the Galilean group, and therefore applies to velocities, whose rate captures the conservation of momentum. The Jaumann derivative is invariant under all time-dependent isometries, and therefore applies to local order parameters, or symmetry-broken variables, such as the nematic $Q$-tensor. We provide examples of material and Jaumann rates in two different frame fields that are pertinent to the current applications of the fluid mechanics of deformable surfaces.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
NEOWISE-R Caught the Luminous SN 2023ixf in Messier 101
Authors:
Schuyler D. Van Dyk,
Tamas Szalai,
Roc M. Cutri,
J. Davy Kirkpatrick,
Carl J. Grillmair,
Sergio B. Fajardo-Acosta,
Joseph R. Masiero,
Amy K. Mainzer,
Christopher R. Gelino,
Jozsef Vinko,
Andras Peter Joo,
Andras Pal,
Reka Konyves-Toth,
Levente Kriskovics,
Robert Szakats,
Krisztian Vida,
WeiKang Zheng,
Thomas G. Brink,
Alexei V. Filippenko
Abstract:
The reactivated Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE-R) serendipitously caught the Type II supernova SN 2023ixf in Messier 101 on the rise, starting day 3.6 through day 10.9, and on the late-time decline from days 211 through 213 and days 370 through 372. We have considered these mid-infrared (mid-IR) data together with observations from the ultraviolet (UV) through the n…
▽ More
The reactivated Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE-R) serendipitously caught the Type II supernova SN 2023ixf in Messier 101 on the rise, starting day 3.6 through day 10.9, and on the late-time decline from days 211 through 213 and days 370 through 372. We have considered these mid-infrared (mid-IR) data together with observations from the ultraviolet (UV) through the near-IR, when possible. At day 3.6 we approximated the optical emission with a hot, ~26,630 K blackbody, with a notable UV excess likely from strong SN shock interaction with circumstellar matter (CSM). In the IR, however, a clear excess is also obvious, and we fit it with a cooler, ~1,620 K blackbody with radius of ~2.6 x 10^{15} cm, consistent with dust in the progenitor's circumstellar shell likely heated by the UV emission from the CSM interaction. On day 10.8, the light detected was consistent with SN ejecta-dominated emission. At late times we also observed a clear NEOWISE-R excess, which could arise either from newly formed dust in the inner ejecta or in the contact discontinuity between the forward and reverse shocks, or from more distant pre-existing dust grains in the SN environment. Furthermore, the large 4.6 micron excess at late times can also be explained by the emergence of the carbon monoxide 1--0 vibrational band. SN 2023ixf is the best-observed SN IIP in the mid-IR during the first several days after explosion and one of the most luminous such SNe ever seen.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Domain Adaptation of Echocardiography Segmentation Via Reinforcement Learning
Authors:
Arnaud Judge,
Thierry Judge,
Nicolas Duchateau,
Roman A. Sandler,
Joseph Z. Sokol,
Olivier Bernard,
Pierre-Marc Jodoin
Abstract:
Performance of deep learning segmentation models is significantly challenged in its transferability across different medical imaging domains, particularly when aiming to adapt these models to a target domain with insufficient annotated data for effective fine-tuning. While existing domain adaptation (DA) methods propose strategies to alleviate this problem, these methods do not explicitly incorpor…
▽ More
Performance of deep learning segmentation models is significantly challenged in its transferability across different medical imaging domains, particularly when aiming to adapt these models to a target domain with insufficient annotated data for effective fine-tuning. While existing domain adaptation (DA) methods propose strategies to alleviate this problem, these methods do not explicitly incorporate human-verified segmentation priors, compromising the potential of a model to produce anatomically plausible segmentations. We introduce RL4Seg, an innovative reinforcement learning framework that reduces the need to otherwise incorporate large expertly annotated datasets in the target domain, and eliminates the need for lengthy manual human review. Using a target dataset of 10,000 unannotated 2D echocardiographic images, RL4Seg not only outperforms existing state-of-the-art DA methods in accuracy but also achieves 99% anatomical validity on a subset of 220 expert-validated subjects from the target domain. Furthermore, our framework's reward network offers uncertainty estimates comparable with dedicated state-of-the-art uncertainty methods, demonstrating the utility and effectiveness of RL4Seg in overcoming domain adaptation challenges in medical image segmentation.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data
Authors:
Jesse Zhang,
Minho Heo,
Zuxin Liu,
Erdem Biyik,
Joseph J Lim,
Yao Liu,
Rasool Fakoor
Abstract:
Most reinforcement learning (RL) methods focus on learning optimal policies over low-level action spaces. While these methods can perform well in their training environments, they lack the flexibility to transfer to new tasks. Instead, RL agents that can act over useful, temporally extended skills rather than low-level actions can learn new tasks more easily. Prior work in skill-based RL either re…
▽ More
Most reinforcement learning (RL) methods focus on learning optimal policies over low-level action spaces. While these methods can perform well in their training environments, they lack the flexibility to transfer to new tasks. Instead, RL agents that can act over useful, temporally extended skills rather than low-level actions can learn new tasks more easily. Prior work in skill-based RL either requires expert supervision to define useful skills, which is hard to scale, or learns a skill-space from offline data with heuristics that limit the adaptability of the skills, making them difficult to transfer during downstream RL. Our approach, EXTRACT, instead utilizes pre-trained vision language models to extract a discrete set of semantically meaningful skills from offline data, each of which is parameterized by continuous arguments, without human supervision. This skill parameterization allows robots to learn new tasks by only needing to learn when to select a specific skill and how to modify its arguments for the specific task. We demonstrate through experiments in sparse-reward, image-based, robot manipulation environments that EXTRACT can more quickly learn new tasks than prior works, with major gains in sample efficiency and performance over prior skill-based RL. Website at https://www.jessezhang.net/projects/extract/.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Interpreting Attention Layer Outputs with Sparse Autoencoders
Authors:
Connor Kissane,
Robert Krzyzanowski,
Joseph Isaac Bloom,
Arthur Conmy,
Neel Nanda
Abstract:
Decomposing model activations into interpretable components is a key open problem in mechanistic interpretability. Sparse autoencoders (SAEs) are a popular method for decomposing the internal activations of trained transformers into sparse, interpretable features, and have been applied to MLP layers and the residual stream. In this work we train SAEs on attention layer outputs and show that also h…
▽ More
Decomposing model activations into interpretable components is a key open problem in mechanistic interpretability. Sparse autoencoders (SAEs) are a popular method for decomposing the internal activations of trained transformers into sparse, interpretable features, and have been applied to MLP layers and the residual stream. In this work we train SAEs on attention layer outputs and show that also here SAEs find a sparse, interpretable decomposition. We demonstrate this on transformers from several model families and up to 2B parameters.
We perform a qualitative study of the features computed by attention layers, and find multiple families: long-range context, short-range context and induction features. We qualitatively study the role of every head in GPT-2 Small, and estimate that at least 90% of the heads are polysemantic, i.e. have multiple unrelated roles.
Further, we show that Sparse Autoencoders are a useful tool that enable researchers to explain model behavior in greater detail than prior work. For example, we explore the mystery of why models have so many seemingly redundant induction heads, use SAEs to motivate the hypothesis that some are long-prefix whereas others are short-prefix, and confirm this with more rigorous analysis. We use our SAEs to analyze the computation performed by the Indirect Object Identification circuit (Wang et al.), validating that the SAEs find causally meaningful intermediate variables, and deepening our understanding of the semantics of the circuit. We open-source the trained SAEs and a tool for exploring arbitrary prompts through the lens of Attention Output SAEs.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Counting of surfaces and computational complexity in column sums of symmetric group character tables
Authors:
Joseph Ben Geloun,
Sanjaye Ramgoolam
Abstract:
The character table of the symmetric group $S_n$, of permutations of $n$ objects, is of fundamental interest in theoretical physics, combinatorics as well as computational complexity theory. We investigate the implications of an identity, which has a geometrical interpretation in combinatorial topological field theories, relating the column sum of normalised central characters of $S_n$ to a sum of…
▽ More
The character table of the symmetric group $S_n$, of permutations of $n$ objects, is of fundamental interest in theoretical physics, combinatorics as well as computational complexity theory. We investigate the implications of an identity, which has a geometrical interpretation in combinatorial topological field theories, relating the column sum of normalised central characters of $S_n$ to a sum of structure constants of multiplication in the centre of the group algebra of $S_n$. The identity leads to the proof that a combinatorial computation of the column sum belongs to complexity class \shP. The sum of structure constants has an interpretation in terms of the counting of branched covers of the sphere. This allows the identification of a tractable subset of the structure constants related to genus zero covers. We use this subset to prove that the column sum for a conjugacy class labelled by partition $λ$ is non-vanishing if and only if the permutations in the conjugacy class are even. This leads to the result that the determination of the vanishing or otherwise of the column sum is in complexity class \pP. The subset gives a positive lower bound on the column sum for any even $ λ$. For any disjoint decomposition of $ λ$ as $λ_1 \sqcup λ_2 $ we obtain a lower bound for the column sum at $ λ$ in terms of the product of the column sums for $ λ_1$ and$λ_2$. This can be expressed as a super-additivity property for the logarithms of column sums of normalized characters.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Low-Crosstalk, Silicon-Fabricated Optical Waveguides for Laser Delivery to Matter Qubits
Authors:
Clayton L. Craft,
Nicholas J. Barton,
Andrew C. Klug,
Kenneth Scalzi,
Ian Wildemann,
Pramod Asagodu,
Joseph D. Broz,
Nikola L. Porto,
Michael Macalik,
Anthony Rizzo,
Garrett Percevault,
Christopher C. Tison,
A. Matthew Smith,
Michael L. Fanto,
James Schneeloch,
Erin Sheridan,
Dylan Heberle,
Andrew Brownell,
Vijay S. S. Sundaram,
Venkatesh Deenadayalan,
Matthew van Niekerk,
Evan Manfreda-Schulz,
Gregory A. Howland,
Stefan F. Preble,
Daniel Coleman
, et al. (8 additional authors not shown)
Abstract:
Reliable control of quantum information in matter-based qubits requires precisely applied external fields, and unaccounted for spatial cross-talk of these fields between adjacent qubits leads to loss of fidelity. We report a CMOS foundry-produced, micro-fabricated silicon nitride (Si3N4) optical waveguide for addressing a chain of eight, unequally-spaced trapped barium ions with crosstalk compatib…
▽ More
Reliable control of quantum information in matter-based qubits requires precisely applied external fields, and unaccounted for spatial cross-talk of these fields between adjacent qubits leads to loss of fidelity. We report a CMOS foundry-produced, micro-fabricated silicon nitride (Si3N4) optical waveguide for addressing a chain of eight, unequally-spaced trapped barium ions with crosstalk compatible with scalable quantum information processing. The crosstalk mitigation techniques incorporated into the chip design result in a reduction of the measured optical field by at least 50.8(1.3) dB between adjacent waveguide outputs near 650 nm and similar behavior for devices designed for 493 nm and 585 nm. The waveguide outputs near 650 nm, along with a global laser near 493 nm were used to laser-cool a chain of eight barium-138 ions, and a camera imaged the resulting fluorescence at 493 nm.
△ Less
Submitted 27 June, 2024; v1 submitted 25 June, 2024;
originally announced June 2024.
-
Development of a digital tool for monitoring the behaviour of pre-weaned calves using accelerometer neck-collars
Authors:
Oshana Dissanayake,
Sarah E. Mcpherson,
Joseph Allyndrée,
Emer Kennedy,
Pádraig Cunningham,
Lucile Riaboff
Abstract:
Automatic monitoring of calf behaviour is a promising way of assessing animal welfare from their first week on farms. This study aims to (i) develop machine learning models from accelerometer data to classify the main behaviours of pre-weaned calves and (ii) set up a digital tool for monitoring the behaviour of pre-weaned calves from the models' prediction. Thirty pre-weaned calves were equipped w…
▽ More
Automatic monitoring of calf behaviour is a promising way of assessing animal welfare from their first week on farms. This study aims to (i) develop machine learning models from accelerometer data to classify the main behaviours of pre-weaned calves and (ii) set up a digital tool for monitoring the behaviour of pre-weaned calves from the models' prediction. Thirty pre-weaned calves were equipped with a 3-D accelerometer attached to a neck-collar for two months and filmed simultaneously. The behaviours were annotated, resulting in 27.4 hours of observation aligned with the accelerometer data. The time-series were then split into 3 seconds windows. Two machine learning models were tuned using data from 80% of the calves: (i) a Random Forest model to classify between active and inactive behaviours using a set of 11 hand-craft features [model 1] and (ii) a RidgeClassifierCV model to classify between lying, running, drinking milk and other behaviours using ROCKET features [model 2]. The performance of the models was tested using data from the remaining 20% of the calves. Model 1 achieved a balanced accuracy of 0.92. Model 2 achieved a balanced accuracy of 0.84. Behavioural metrics such as daily activity ratio and episodes of running, lying, drinking milk, and other behaviours expressed over time were deduced from the predictions. All the development was finally embedded into a Python dashboard so that the individual calf metrics could be displayed directly from the raw accelerometer files.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
The California Legacy Survey V. Chromospheric Activity Cycles in Main Sequence Stars
Authors:
Howard Isaacson,
Andrew W. Howard,
Benjamin Fulton,
Erik A. Petigura,
Lauren M. Weiss,
Stephen R. Kane,
Brad Carter,
Corey Beard,
Steven Giacalone,
Judah Van Zandt,
Joseph M. Akana Murphy,
Fei Dai,
Ashley Chontos,
Alex S. Polanski,
Malena Rice,
Jack Lubin,
Casey Brinkman,
Ryan A. Rubenzahl,
Sarah Blunt,
Samuel W. Yee,
Mason G. MacDougall,
Paul A. Dalba,
Dakotah Tyler,
Aida Behmard,
Isabel Angelo
, et al. (9 additional authors not shown)
Abstract:
We present optical spectroscopy of 710 solar neighborhood stars collected over twenty years to catalog chromospheric activity and search for stellar activity cycles. The California Legacy Survey stars are amenable to exoplanet detection using precise radial velocities, and we present their Ca II H and K time series as a proxy for stellar and chromospheric activity. Using the HIRES spectrometer at…
▽ More
We present optical spectroscopy of 710 solar neighborhood stars collected over twenty years to catalog chromospheric activity and search for stellar activity cycles. The California Legacy Survey stars are amenable to exoplanet detection using precise radial velocities, and we present their Ca II H and K time series as a proxy for stellar and chromospheric activity. Using the HIRES spectrometer at Keck Observatory, we measured stellar flux in the cores of the Ca II H and K lines to determine S-values on the Mt. Wilson scale and the log(R'HK) metric, which is comparable across a wide range of spectral types. From the 710 stars, with 52,372 observations, 285 stars are sufficiently sampled to search for stellar activity cycles with periods of 2-25 years, and 138 stars show stellar cycles of varying length and amplitude. S-values can be used to mitigate stellar activity in the detection and characterization of exoplanets. We use them to probe stellar dynamos and to place the Sun's magnetic activity into context among solar neighborhood stars. Using precise stellar parameters and time-averaged activity measurements, we find tightly constrained cycle periods as a function of stellar temperature between log(R'HK) of -4.7 and -4.9, a range of activity in which nearly every star has a periodic cycle. These observations present the largest sample of spectroscopically determined stellar activity cycles to date.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Cascade Generalization-based Classifiers for Software Defect Prediction
Authors:
Aminat Bashir,
Abdullateef Balogun,
Matthew Adigun,
Sunday Ajagbe,
Luiz Fernando Capretz,
Joseph Awotunde,
Hammed Mojeed
Abstract:
The process of software defect prediction (SDP) involves predicting which software system modules or components pose the highest risk of being defective. The projections and discernments derived from SDP can then assist the software development team in effectively allocating its finite resources toward potentially susceptible defective modules. Because of this, SDP models need to be improved and r…
▽ More
The process of software defect prediction (SDP) involves predicting which software system modules or components pose the highest risk of being defective. The projections and discernments derived from SDP can then assist the software development team in effectively allocating its finite resources toward potentially susceptible defective modules. Because of this, SDP models need to be improved and refined continuously. Hence, this research proposes the deployment of a cascade generalization (CG) function to enhance the predictive performances of machine learning (ML)-based SDP models. The CG function extends the initial sample space by introducing new samples into the neighbourhood of the distribution function generated by the base classification algorithm, subsequently mitigating its bias. Experiments were conducted to investigate the effectiveness of CG-based Naïve Bayes (NB), Decision Tree (DT), and k-Nearest Neighbor (kNN) models on NASA software defect datasets. Based on the experimental results, the CG-based models (CG-NB, CG-DT, CG-kNN) were superior in prediction performance when compared with the baseline NB, DT, and kNN models respectively. Accordingly, the average accuracy value of CG-NB, CG-DT, and CG-kNN models increased by +11.06%, +3.91%, and +5.14%, respectively, over baseline NB, DT, and kNN models. A similar performance was observed for the area under the curve (AUC) value with CG-NB, CG-DT, and CG-kNN recording an average AUC value of +7.98%, +26%, and +24.9% improvement over the baseline NB, DT, and kNN respectively. In addition, the suggested CG-based models outperformed the Bagging and Boosting ensemble variants of the NB, DT, and kNN models as well as existing computationally diverse SDP models.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Some relations in topological dynamics
Authors:
Joseph Auslander,
Anima Nagar
Abstract:
Relations always play an important role in the study of topological dynamics. Proximal, distal and almost periodic relations are well studied in literature. We further this direction and analogously study the strongly proximal and weakly distal relations.
This gives a new class of flows - the weakly distal flows. We observe that the well known Morse-Thue substitution flows and Chacon transformat…
▽ More
Relations always play an important role in the study of topological dynamics. Proximal, distal and almost periodic relations are well studied in literature. We further this direction and analogously study the strongly proximal and weakly distal relations.
This gives a new class of flows - the weakly distal flows. We observe that the well known Morse-Thue substitution flows and Chacon transformations are weakly distal.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
The $L^p$ Poisson-Neumann problem and its relation to the Neumann problem
Authors:
Joseph Feneuil,
Linhan Li
Abstract:
We introduce the $L^p$ Poisson-Neumann problem for an uniformly elliptic operator $L=-\rm{div }A\nabla$ in divergence form in a bounded 1-sided Chord Arc Domain $Ω$, which considers solutions to $Lu=h-\rm{div}\vec{F}$ in $Ω$ with zero Neumann data on the boundary for $h$ and $\vec F$ in some tent spaces. We give different characterizations of solvability of the $L^p$ Poisson-Neumann problem and it…
▽ More
We introduce the $L^p$ Poisson-Neumann problem for an uniformly elliptic operator $L=-\rm{div }A\nabla$ in divergence form in a bounded 1-sided Chord Arc Domain $Ω$, which considers solutions to $Lu=h-\rm{div}\vec{F}$ in $Ω$ with zero Neumann data on the boundary for $h$ and $\vec F$ in some tent spaces. We give different characterizations of solvability of the $L^p$ Poisson-Neumann problem and its weaker variants, and in particular, we show that solvability of the weak $L^p$ Poisson-Neumann probelm is equivalent to a weak reverse Hölder inequality. We show that the Poisson-Neumman problem is closely related to the $L^p$ Neumann problem, whose solvability is a long-standing open problem. We are able to improve the extrapolation of the $L^p$ Neumann problem from Kenig and Pipher by obtaining an extrapolation result on the Poisson-Neumann problem.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Dedicated beam position monitor pair for model-independent lattice characterization at NSLS-II
Authors:
Yongjun Li,
Kiman Ha,
Danny Padrazo,
Bernard Kosciuk,
Belkacem Bacha,
Michael Seegitz,
Robert Rainer,
Joseph Mead,
Xi Yang,
Yuke Tian,
Robert Todd,
Victor Smaluk,
Weixing Cheng
Abstract:
This paper reports recent lattice characterization results obtained at the National Synchrotron Light Source II (NSLS-II) storage ring, conducted without reliance on a lattice model. A pair of beam position monitors (BPMs) with bunch-by-bunch (B$\times$B) resolution, were recently installed in a section of the storage ring free of magnetic fields. The new BPM pair measured the beam, or bunch's tra…
▽ More
This paper reports recent lattice characterization results obtained at the National Synchrotron Light Source II (NSLS-II) storage ring, conducted without reliance on a lattice model. A pair of beam position monitors (BPMs) with bunch-by-bunch (B$\times$B) resolution, were recently installed in a section of the storage ring free of magnetic fields. The new BPM pair measured the beam, or bunch's transverse Poincaré map precisely after the beam was excited. Linear one-turn-matrices (OTM) were then derived, and from these, the 4-dimensional coupled Twiss parameters were extracted at the locations of the BPM pair. By normalizing beam oscillation amplitudes with the Twiss parameters, the global action-variables were obtained. These action-variables facilitated the measurement of the local Twiss parameters observed by other BPMs independent on lattice model. This method is general, and particularly useful in certain scenarios such as a round beam mode in a diffraction-limited light source ring. We applied it to assess both weakly and strongly coupled lattices at the NSLS-II ring. Through analysis of the strongly coupled lattice, the quadrupole tilt errors were estimated to be less than 400 \siμrad. Utilizing the BPMs' B$\times$B resolution, for the first time we observed the variations of the linear lattice along a long bunch-train.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
The CRS: a scalable full-stack control system for Microwave Kinetic Inductance Detectors
Authors:
Joshua Montgomery,
Wellington Avelino,
Matt Dobbs,
Joseph Letang,
Maclean Rouble,
Sofiia Savchyn,
Graeme Smecher
Abstract:
The t0.technology Control and Readout System (CRS) is a modular microwave control and readout system for mm-wave and radio astronomy, THz imaging, noise radar, and superconducting qubit control. The configuration discussed in this work implements firmware for readout of microwave Kinetic Inductance Detector (KID) arrays. The CRS can operate 4,096 KIDs over 2.5 GHz of complex bandwidth between 0-10…
▽ More
The t0.technology Control and Readout System (CRS) is a modular microwave control and readout system for mm-wave and radio astronomy, THz imaging, noise radar, and superconducting qubit control. The configuration discussed in this work implements firmware for readout of microwave Kinetic Inductance Detector (KID) arrays. The CRS can operate 4,096 KIDs over 2.5 GHz of complex bandwidth between 0-10 GHz, typically allocated across four independent RF chains at 1,024x multiplexing and 625 MHz of complex bandwidth each. Every CRS can operate as a standalone unit or collectively within one or more backplane-enabled subracks that distribute power, clocking, and synchronization, scaling to an arbitrary number of channels. Each fully populated subrack supports arrays of more than 65,000 KIDs. The signal processing and control software supports recent innovations in multi-probe measurements and dynamic feedback modes, which are described in (Rouble et al. 2024). The CRS has recently been selected as the new baseline readout system for the proposed South Pole Telescope instrument, SPT-3G+. We present the hardware design, firmware capabilities, open-source control and data acquisition software, and the first laboratory characterization measurements.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
Source anisotropies and pulsar timing arrays
Authors:
Bruce Allen,
Deepali Agarwal,
Joseph D. Romano,
Serena Valtolina
Abstract:
Pulsar timing arrays (PTA) hunt for gravitational waves (GW) by searching for the correlations that GWs induce in the time-of-arrival residuals from different pulsars. If the GW sources are of astrophysical origin, then they are located at discrete points on the sky. However, PTA data are often modeled, and subsequently analyzed, via a "standard Gaussian ensemble". That ensemble is obtained in the…
▽ More
Pulsar timing arrays (PTA) hunt for gravitational waves (GW) by searching for the correlations that GWs induce in the time-of-arrival residuals from different pulsars. If the GW sources are of astrophysical origin, then they are located at discrete points on the sky. However, PTA data are often modeled, and subsequently analyzed, via a "standard Gaussian ensemble". That ensemble is obtained in the limit of an infinite density of vanishingly weak, Poisson-distributed sources. In this paper, we move away from that ensemble, to study the effects of two types of "source anisotropy". The first (a), which is often called "shot noise", arises because there are $N$ discrete GW sources at specific sky locations. The second (b) arises because the GW source positions are not a Poisson process, for example, because galaxy locations are clustered. Here, we quantify the impact of (a) and (b) on the mean and variance of the pulsar-averaged Hellings and Downs correlation. For conventional PTA sources, we show that the effects of shot noise (a) are much larger than the effects of clustering (b).
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
Completely Multipolar Model as a General Framework for Many-Body Interactions as Illustrated for Water
Authors:
Joseph P. Heindel,
Selim Sami,
Teresa Head-Gordon
Abstract:
We introduce a general framework for many-body force field models, the Completely Multipolar Model (CMM), that utilizes multipolar electrical moments modulated by exponential decay of electron density as a common functional form for all piecewise terms of an energy decomposition analysis of intermolecular interactions. With this common functional form the CMM model establishes well-formulated damp…
▽ More
We introduce a general framework for many-body force field models, the Completely Multipolar Model (CMM), that utilizes multipolar electrical moments modulated by exponential decay of electron density as a common functional form for all piecewise terms of an energy decomposition analysis of intermolecular interactions. With this common functional form the CMM model establishes well-formulated damped tensors that reach the correct asymptotes at both long- and short-range while formally ensuring no short-range catastrophes. The CMM describes the separable EDA terms of dispersion, exchange polarization, and Pauli repulsion with short-ranged anisotropy, polarization as intramolecular charge fluctuations and induced dipoles, while charge transfer describes explicit movement of charge between molecules, and naturally describes many-body charge transfer by coupling into the polarization equations. We also utilize a new one-body potential that accounts for intramolecular polarization by including an electric field-dependent correction to the Morse potential to ensure that the CMM reproduces all physically relevant monomer properties including the dipole moment, molecular polarizability, and dipole and polarizability derivatives. The quality of the CMM is illustrated through agreement of individual terms of the EDA and excellent extrapolation to energies and geometries of an extensive validation set of water cluster data.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
Active and inactive contributions to the wall pressure and wall-shear stress in turbulent boundary layers
Authors:
Rahul Deshpande,
Ricardo Vinuesa,
Joseph Klewicki,
Ivan Marusic
Abstract:
A phenomenological description is presented to explain the low-frequency/large-scale contributions to the wall-shear-stress ($τ_w$) and wall-pressure (${p}_w$) spectra of canonical turbulent boundary layers, which are well known to increase with Reynolds number. The explanation is based on the concept of active and inactive motions (Townsend, J. Fluid Mech., vol. 11, 1961) associated with the atta…
▽ More
A phenomenological description is presented to explain the low-frequency/large-scale contributions to the wall-shear-stress ($τ_w$) and wall-pressure (${p}_w$) spectra of canonical turbulent boundary layers, which are well known to increase with Reynolds number. The explanation is based on the concept of active and inactive motions (Townsend, J. Fluid Mech., vol. 11, 1961) associated with the attached-eddy hypothesis. Unique data sets of simultaneously acquired $τ_w$, ${p}_w$ and velocity fluctuation time series in the log region are considered, across friction-Reynolds-number ($Re_τ$) range of $\mathcal{O}$($10^3$) $\lesssim$ $Re_τ$ $\lesssim$ $\mathcal{O}$($10^6$). A recently proposed energy-decomposition methodology (Deshpande et al., J. Fluid Mech., vol. 914, 2021) is implemented to reveal the active and inactive contributions to the $τ_w$- and $p_w$-spectra. Empirical evidence is provided in support of Bradshaw's (J. Fluid Mech., vol. 30, 1967) hypothesis that the inactive motions are responsible for the non-local wall-ward transport of the large-scale inertia-dominated energy, which is produced in the log region by active motions. This explains the large-scale signatures in the $τ_w$-spectrum that are noted, despite negligible large-scale turbulence production near the wall. For wall pressure, active and inactive motions contribute to the $p_w$-spectra at intermediate and large scales, respectively. Their contributions are found to increase with increasing $Re_τ$ due to the broadening and energization of the wall-scaled (attached) eddy hierarchy.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
Hidden Variables unseen by Random Forests
Authors:
Ricardo Blum,
Munir Hiabu,
Enno Mammen,
Joseph Theo Meyer
Abstract:
Random Forests are widely claimed to capture interactions well. However, some simple examples suggest that they perform poorly in the presence of certain pure interactions that the conventional CART criterion struggles to capture during tree construction. We argue that simple alternative partitioning schemes used in the tree growing procedure can enhance identification of these interactions. In a…
▽ More
Random Forests are widely claimed to capture interactions well. However, some simple examples suggest that they perform poorly in the presence of certain pure interactions that the conventional CART criterion struggles to capture during tree construction. We argue that simple alternative partitioning schemes used in the tree growing procedure can enhance identification of these interactions. In a simulation study we compare these variants to conventional Random Forests and Extremely Randomized trees. Our results validate that the modifications considered enhance the model's fitting ability in scenarios where pure interactions play a crucial role.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
The ACT-DR5 MCMF Galaxy Cluster Catalog
Authors:
Matthias Klein,
Joseph J. Mohr,
Christopher T. Davies
Abstract:
Galaxy clusters are useful cosmological probes and interesting astrophysical laboratories. With growing cluster samples, a deeper understanding of the sample characteristics and improved control of systematics becomes more crucial. In this analysis we create a new and larger ACT-DR5-based thermal Sunyaev-Zel'dovich Effect (tSZE) selected galaxy cluster catalog with improved control over sample pur…
▽ More
Galaxy clusters are useful cosmological probes and interesting astrophysical laboratories. With growing cluster samples, a deeper understanding of the sample characteristics and improved control of systematics becomes more crucial. In this analysis we create a new and larger ACT-DR5-based thermal Sunyaev-Zel'dovich Effect (tSZE) selected galaxy cluster catalog with improved control over sample purity and completeness. We employ the red sequence based cluster redshift and confirmation tool MCMF together with optical imaging data from the Legacy Survey DR-10 and infrared data from the WISE satellite to systematicallyidentify true clusters from a new cluster candidate detection run on the ACT-DR5 dataset. The resulting ACT-DR5 MCMF sample contains 6,237 clusters with a residual contamination of 10.7%. This is an increase of 51% compared to the previous ACT-DR5 cluster catalog, making this catalog the largest tSZE-selected cluster catalog to date. The z>1 subsample contains 703 clusters, three times more than in the previous ACT-DR5 catalog. Matching the ACT-DR5 MCMF cluster catalog with a deeper tSZE sample from SPTpol 500d allows us to confirm the completeness and purity of the new ACT-DR5 MCMF sample. Cross-matching to the two largest X-ray selected cluster samples, the all-sky RASS MCMF and the half-sky eRASS1, confirms the sample purity of the RASS MCMF sample and in the case of eRASS1 reveals that 43% of the matched clusters are designated in eRASS1 as X-ray point sources rather than clusters. Cross-correlating the ACT-DR5 MCMF cluster catalog with ACT-DR6 lensing maps results in a 16.4σdetection of CMB lensing around the clusters, corresponding to the strongest signal found so far for a galaxy cluster sample. Repeating the measurement for the z>1 cluster subsample yields a significance of 4.3σ, which is the strongest CMB lensing detection in a z>1 cluster sample to date.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.