-
Dynamical Measure Transport and Neural PDE Solvers for Sampling
Authors:
**gtong Sun,
Julius Berner,
Lorenz Richter,
Marius Zeinhofer,
Johannes Müller,
Kamyar Azizzadenesheli,
Anima Anandkumar
Abstract:
The task of sampling from a probability density can be approached as transporting a tractable density function to the target, known as dynamical measure transport. In this work, we tackle it through a principled unified framework using deterministic or stochastic evolutions described by partial differential equations (PDEs). This framework incorporates prior trajectory-based sampling methods, such…
▽ More
The task of sampling from a probability density can be approached as transporting a tractable density function to the target, known as dynamical measure transport. In this work, we tackle it through a principled unified framework using deterministic or stochastic evolutions described by partial differential equations (PDEs). This framework incorporates prior trajectory-based sampling methods, such as diffusion models or Schrödinger bridges, without relying on the concept of time-reversals. Moreover, it allows us to propose novel numerical methods for solving the transport task and thus sampling from complicated targets without the need for the normalization constant or data samples. We employ physics-informed neural networks (PINNs) to approximate the respective PDE solutions, implying both conceptional and computational advantages. In particular, PINNs allow for simulation- and discretization-free optimization and can be trained very efficiently, leading to significantly better mode coverage in the sampling task compared to alternative methods. Moreover, they can readily be fine-tuned with Gauss-Newton methods to achieve high accuracy in sampling.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
AI Driven Laser Parameter Search: Inverse Design of Photonic Surfaces using Greedy Surrogate-based Optimization
Authors:
Luka Grbcic,
Minok Park,
Juliane Müller,
Vassilia Zorba,
Wibe Albert de Jong
Abstract:
Photonic surfaces designed with specific optical characteristics are becoming increasingly important for use in in various energy harvesting and storage systems. , In this study, we develop a surrogate-based optimization approach for designing such surfaces. The surrogate-based optimization framework employs the Random Forest algorithm and uses a greedy, prediction-based exploration strategy to id…
▽ More
Photonic surfaces designed with specific optical characteristics are becoming increasingly important for use in in various energy harvesting and storage systems. , In this study, we develop a surrogate-based optimization approach for designing such surfaces. The surrogate-based optimization framework employs the Random Forest algorithm and uses a greedy, prediction-based exploration strategy to identify the laser fabrication parameters that minimize the discrepancy relative to a user-defined target optical characteristics. We demonstrate the approach on two synthetic benchmarks and two specific cases of photonic surface inverse design targets. It exhibits superior performance when compared to other optimization algorithms across all benchmarks. Additionally, we demonstrate a technique of inverse design warm starting for changed target optical characteristics which enhances the performance of the introduced approach.
△ Less
Submitted 20 June, 2024;
originally announced July 2024.
-
1D $Z_2$ lattice gauge theory in periodic Gauss law sectors
Authors:
Vaibhav Sharma,
Erich J Mueller
Abstract:
We calculate the properties of a 1D $Z_2$ lattice gauge theory in different Gauss law sectors, corresponding to different configurations of static charges set by the orientations of the gauge spins. Importantly, in quantum simulator experiments these sectors can be accessed without adding any additional physical particles or changing the Hamiltonian: The Gauss law sectors are simply set by the ini…
▽ More
We calculate the properties of a 1D $Z_2$ lattice gauge theory in different Gauss law sectors, corresponding to different configurations of static charges set by the orientations of the gauge spins. Importantly, in quantum simulator experiments these sectors can be accessed without adding any additional physical particles or changing the Hamiltonian: The Gauss law sectors are simply set by the initial conditions. We study the interplay between conservation laws and interactions when the static charges are chosen to form periodic patterns. We classify the different Gauss law sectors and use the density matrix renormalization group to calculate the ground state compressibility, density profiles, charge density wave order parameters, and single particle correlation functions as a function of matter density. We find confined and deconfined phases, charge density waves, correlated insulators, and supersolids.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Gauge Invariance of Equilibrium Statistical Mechanics
Authors:
Johanna Müller,
Sophie Hermann,
Florian Sammüller,
Matthias Schmidt
Abstract:
We identify a recently proposed shifting operation on classical phase space as a gauge transformation for statistical mechanical microstates. The infinitesimal generators of the continuous gauge group form a non-commutative Lie algebra, which induces exact sum rules when thermally averaged. Gauge invariance with respect to finite shifting is demonstrated via Monte Carlo simulation in the transform…
▽ More
We identify a recently proposed shifting operation on classical phase space as a gauge transformation for statistical mechanical microstates. The infinitesimal generators of the continuous gauge group form a non-commutative Lie algebra, which induces exact sum rules when thermally averaged. Gauge invariance with respect to finite shifting is demonstrated via Monte Carlo simulation in the transformed phase space which generates identical equilibrium averages. Our results point towards a deeper basis of statistical mechanics than previously known and they offer avenues for systematic construction of exact identities and of sampling algorithms.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Meta-experiments: Improving experimentation through experimentation
Authors:
Melanie J. I. Müller
Abstract:
A/B testing is widexly used in the industry to optimize customer facing websites. Many companies employ experimentation specialists to facilitate and improve the process of A/B testing. Here, we present the application of A/B testing to this improvement effort itself, by running experiments on the experimentation process, which we call 'meta-experiments'. We discuss the challenges of this approach…
▽ More
A/B testing is widexly used in the industry to optimize customer facing websites. Many companies employ experimentation specialists to facilitate and improve the process of A/B testing. Here, we present the application of A/B testing to this improvement effort itself, by running experiments on the experimentation process, which we call 'meta-experiments'. We discuss the challenges of this approach using the example of one of our meta-experiments, which helped experimenters to run more sufficiently powered A/B tests. We also point out the benefits of 'dog fooding' for the experimentation specialists when running their own experiments.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
On acoustic space-time media that compute their own inverse
Authors:
Dirk-Jan van Manen,
Johannes Aichele,
Jonas Müller,
Marc Serra-Garcia,
Kees Wapenaar
Abstract:
We derive time reflection and transmission coefficients for 1D acoustic waves encountering a time boundary at which the properties of the medium change instantaneously. The time reflection and transmission coefficients are shown to be identical to so-called reverse-space reflection and transmission coefficients which appear in the recursive computation of focusing wavefields used in seismology. We…
▽ More
We derive time reflection and transmission coefficients for 1D acoustic waves encountering a time boundary at which the properties of the medium change instantaneously. The time reflection and transmission coefficients are shown to be identical to so-called reverse-space reflection and transmission coefficients which appear in the recursive computation of focusing wavefields used in seismology. We establish a bijectivity between the focusing wavefields and the wavefields produced by time scattering and show how this can be used to construct a space-time medium where the time scattering anticipates the space scattering and "computes" the exact inverse for the space scattering. The construction is shown to be independent of the boundary conditions chosen to compute the reflection and transmission coefficients. We demonstrate the construction with a simple numerical example of a single pulse encountering a series of time boundaries before reaching a spatial inhomogeneity. The time boundaries scatter the single pulse into a focusing wavefield that subsequently focuses through the spatial inhomogeneity. Under certain conditions, the transmitted wave has both the same wave shape and amplitude as the original pulse, yielding a transmission coefficient of unity. The reflection coefficient of the space-time medium is always non-zero however.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors
Authors:
Peter Lorenz,
Mario Fernandez,
Jens Müller,
Ullrich Köthe
Abstract:
Detecting out-of-distribution (OOD) inputs is critical for safely deploying deep learning models in real-world scenarios. In recent years, many OOD detectors have been developed, and even the benchmarking has been standardized, i.e. OpenOOD. The number of post-hoc detectors is growing fast and showing an option to protect a pre-trained classifier against natural distribution shifts, claiming to be…
▽ More
Detecting out-of-distribution (OOD) inputs is critical for safely deploying deep learning models in real-world scenarios. In recent years, many OOD detectors have been developed, and even the benchmarking has been standardized, i.e. OpenOOD. The number of post-hoc detectors is growing fast and showing an option to protect a pre-trained classifier against natural distribution shifts, claiming to be ready for real-world scenarios. However, its efficacy in handling adversarial examples has been neglected in the majority of studies. This paper investigates the adversarial robustness of the 16 post-hoc detectors on several evasion attacks and discuss a roadmap towards adversarial defense in OOD detectors.
△ Less
Submitted 28 June, 2024; v1 submitted 21 June, 2024;
originally announced June 2024.
-
An efficient singlet-triplet spin qubit to fiber interface assisted by a photonic crystal cavity
Authors:
Kui Wu,
Sebastian Kindel,
Thomas Descamps,
Tobias Hangleiter,
Jan Christoph Müller,
Rebecca Rodrigo,
Florian Merget,
Hendrik Bluhm,
Jeremy Witzens
Abstract:
We introduce a novel optical interface between a singlet-triplet spin qubit and a photonic qubit which would offer new prospects for future quantum communication applications. The interface is based on a 220 nm thick GaAs/Al-GaAs heterostructure membrane and features a gate-defined singlet-triplet qubit, a gate-defined optically active quantum dot, a photonic crystal cavity and a bot-tom gold refl…
▽ More
We introduce a novel optical interface between a singlet-triplet spin qubit and a photonic qubit which would offer new prospects for future quantum communication applications. The interface is based on a 220 nm thick GaAs/Al-GaAs heterostructure membrane and features a gate-defined singlet-triplet qubit, a gate-defined optically active quantum dot, a photonic crystal cavity and a bot-tom gold reflector. All essential components can be lithographically defined and deterministically fabricated, which greatly increases the scalability of on-chip in-tegration. According to our FDTD simulations, the interface provides an overall coupling efficiency of 28.7% into a free space Gaussian beam, assuming an SiO2 interlayer filling the space between the reflector and the membrane. The performance can be further increased to 48.5% by undercutting this SiO2 interlayer below the photonic crystal.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Resource-efficient Medical Image Analysis with Self-adapting Forward-Forward Networks
Authors:
Johanna P. Müller,
Bernhard Kainz
Abstract:
We introduce a fast Self-adapting Forward-Forward Network (SaFF-Net) for medical imaging analysis, mitigating power consumption and resource limitations, which currently primarily stem from the prevalent reliance on back-propagation for model training and fine-tuning. Building upon the recently proposed Forward-Forward Algorithm (FFA), we introduce the Convolutional Forward-Forward Algorithm (CFFA…
▽ More
We introduce a fast Self-adapting Forward-Forward Network (SaFF-Net) for medical imaging analysis, mitigating power consumption and resource limitations, which currently primarily stem from the prevalent reliance on back-propagation for model training and fine-tuning. Building upon the recently proposed Forward-Forward Algorithm (FFA), we introduce the Convolutional Forward-Forward Algorithm (CFFA), a parameter-efficient reformulation that is suitable for advanced image analysis and overcomes the speed and generalisation constraints of the original FFA. To address hyper-parameter sensitivity of FFAs we are also introducing a self-adapting framework SaFF-Net fine-tuning parameters during warmup and training in parallel. Our approach enables more effective model training and eliminates the previously essential requirement for an arbitrarily chosen Goodness function in FFA. We evaluate our approach on several benchmarking datasets in comparison with standard Back-Propagation (BP) neural networks showing that FFA-based networks with notably fewer parameters and function evaluations can compete with standard models, especially, in one-shot scenarios and large batch sizes. The code will be available at the time of the conference.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
XENONnT WIMP Search: Signal & Background Modeling and Statistical Inference
Authors:
XENON Collaboration,
E. Aprile,
J. Aalbers,
K. Abe,
S. Ahmed Maouloud,
L. Althueser,
B. Andrieu,
E. Angelino,
D. Antón Martin,
F. Arneodo,
L. Baudis,
M. Bazyk,
L. Bellagamba,
R. Biondi,
A. Bismark,
K. Boese,
A. Brown,
G. Bruno,
R. Budnik,
J. M. R. Cardoso,
A. P. Cimental Chávez,
A. P. Colijn,
J. Conrad,
J. J. Cuenca-García,
V. D'Andrea
, et al. (139 additional authors not shown)
Abstract:
The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 t…
▽ More
The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 tonne-years yielded no signal excess over background expectations, from which competitive exclusion limits were derived on WIMP-nucleon elastic scatter cross sections, for WIMP masses ranging from 6 GeV/$c^2$ up to the TeV/$c^2$ scale. This work details the modeling and statistical methods employed in this search. By means of calibration data, we model the detector response, which is then used to derive background and signal models. The construction and validation of these models is discussed, alongside additional purely data-driven backgrounds. We also describe the statistical inference framework, including the definition of the likelihood function and the construction of confidence intervals.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Demystifying Higher-Order Graph Neural Networks
Authors:
Maciej Besta,
Florian Scheidl,
Lukas Gianinazzi,
Shachar Klaiman,
Jürgen Müller,
Torsten Hoefler
Abstract:
Higher-order graph neural networks (HOGNNs) are an important class of GNN models that harness polyadic relations between vertices beyond plain edges. They have been used to eliminate issues such as over-smoothing or over-squashing, to significantly enhance the accuracy of GNN predictions, to improve the expressiveness of GNN architectures, and for numerous other goals. A plethora of HOGNN models h…
▽ More
Higher-order graph neural networks (HOGNNs) are an important class of GNN models that harness polyadic relations between vertices beyond plain edges. They have been used to eliminate issues such as over-smoothing or over-squashing, to significantly enhance the accuracy of GNN predictions, to improve the expressiveness of GNN architectures, and for numerous other goals. A plethora of HOGNN models have been introduced, and they come with diverse neural architectures, and even with different notions of what the "higher-order" means. This richness makes it very challenging to appropriately analyze and compare HOGNN models, and to decide in what scenario to use specific ones. To alleviate this, we first design an in-depth taxonomy and a blueprint for HOGNNs. This facilitates designing models that maximize performance. Then, we use our taxonomy to analyze and compare the available HOGNN models. The outcomes of our analysis are synthesized in a set of insights that help to select the most beneficial GNN model in a given scenario, and a comprehensive list of challenges and opportunities for further research into more powerful HOGNNs.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
A formation pathway for terrestrial planets with moderate water content involving atmospheric-volatile recycling
Authors:
Jonas Müller,
Bertram Bitsch,
Aaron David Schneider
Abstract:
Of the many recently discovered terrestrial exoplanets, some are expected to harbor moderate water mass fractions of a few percent. The formation pathways that can produce planets with these water mass fractions are not fully understood. Here, we use the code chemcomp, which consists of a semi-analytical 1D protoplanetary disk model harboring a migrating and accreting planet, to model the growth a…
▽ More
Of the many recently discovered terrestrial exoplanets, some are expected to harbor moderate water mass fractions of a few percent. The formation pathways that can produce planets with these water mass fractions are not fully understood. Here, we use the code chemcomp, which consists of a semi-analytical 1D protoplanetary disk model harboring a migrating and accreting planet, to model the growth and composition of planets with moderate water mass fractions by pebble accretion in a protoplanetary disk around a TRAPPIST-1 analog star. This star is accompanied by seven terrestrial planets, of which the outer four planets likely contain water mass fractions of between 1\% and 10\%. We adopt a published model that considers the evaporation of pebbles in the planetary envelope, from where recycling flows can transport the volatile vapor back into the disk. We find that with this model, the planetary water content depends on the influx rate of pebbles onto the planet. A decreasing pebble influx with time reduces the envelope temperature and consequently allows the formation of planets with moderate water mass fractions as inferred for the outer TRAPPIST-1 planets for a number of different simulation configurations. This is further evidence that the recycling of vapor is an important component of planet formation needed to explain the vast and diverse population of exoplanets.
△ Less
Submitted 30 June, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
Heavy-to-light form factors to three loops
Authors:
Matteo Fael,
Tobias Huber,
Fabian Lange,
Jakob Müller,
Kay Schönwald,
Matthias Steinhauser
Abstract:
We compute three-loop corrections of $\mathcal{O}(α_{s}^3)$ to form factors with one massive and one massless quark coupling to an external vector, axialvector, scalar, pseudoscalar, or tensor current. We obtain analytic results for the color-planar contributions, for the contributions of light-quark loops, and the contributions with two heavy-quark loops. For the computation of the remaining mast…
▽ More
We compute three-loop corrections of $\mathcal{O}(α_{s}^3)$ to form factors with one massive and one massless quark coupling to an external vector, axialvector, scalar, pseudoscalar, or tensor current. We obtain analytic results for the color-planar contributions, for the contributions of light-quark loops, and the contributions with two heavy-quark loops. For the computation of the remaining master integrals we use the "expand and match" approach which leads to semi-analytic results for the form factors. We implement our results in a {\tt Mathematica} and a {\tt Fortran} code which allows for fast and precise numerical evaluations in the physically relevant phase space. The form factors are used to compute the hard matching coefficients in Soft-Collinear Effective Theory for all currents. The tensor coefficients at light-like momentum transfer are used to extract the hard function in $\bar B \to X_s γ$ to three loops.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Multi-Head RAG: Solving Multi-Aspect Problems with LLMs
Authors:
Maciej Besta,
Ales Kubicek,
Roman Niggli,
Robert Gerstenberger,
Lucas Weitzendorf,
Mingyuan Chi,
Patrick Iff,
Joanna Gajda,
Piotr Nyczyk,
Jürgen Müller,
Hubert Niewiadomski,
Marcin Chrapek,
Michał Podstawski,
Torsten Hoefler
Abstract:
Retrieval Augmented Generation (RAG) enhances the abilities of Large Language Models (LLMs) by enabling the retrieval of documents into the LLM context to provide more accurate and relevant responses. Existing RAG solutions do not focus on queries that may require fetching multiple documents with substantially different contents. Such queries occur frequently, but are challenging because the embed…
▽ More
Retrieval Augmented Generation (RAG) enhances the abilities of Large Language Models (LLMs) by enabling the retrieval of documents into the LLM context to provide more accurate and relevant responses. Existing RAG solutions do not focus on queries that may require fetching multiple documents with substantially different contents. Such queries occur frequently, but are challenging because the embeddings of these documents may be distant in the embedding space, making it hard to retrieve them all. This paper introduces Multi-Head RAG (MRAG), a novel scheme designed to address this gap with a simple yet powerful idea: leveraging activations of Transformer's multi-head attention layer, instead of the decoder layer, as keys for fetching multi-aspect documents. The driving motivation is that different attention heads can learn to capture different data aspects. Harnessing the corresponding activations results in embeddings that represent various facets of data items and queries, improving the retrieval accuracy for complex queries. We provide an evaluation methodology and metrics, synthetic datasets, and real-world use cases to demonstrate MRAG's effectiveness, showing improvements of up to 20% in relevance over standard RAG baselines. MRAG can be seamlessly integrated with existing RAG frameworks and benchmarking tools like RAGAS as well as different classes of data stores.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Essentially Sharp Estimates on the Entropy Regularization Error in Discrete Discounted Markov Decision Processes
Authors:
Johannes Müller,
Semih Cayci
Abstract:
We study the error introduced by entropy regularization of infinite-horizon discrete discounted Markov decision processes. We show that this error decreases exponentially in the inverse regularization strength both in a weighted KL-divergence and in value with a problem-specific exponent. We provide a lower bound matching our upper bound up to a polynomial factor. Our proof relies on the correspon…
▽ More
We study the error introduced by entropy regularization of infinite-horizon discrete discounted Markov decision processes. We show that this error decreases exponentially in the inverse regularization strength both in a weighted KL-divergence and in value with a problem-specific exponent. We provide a lower bound matching our upper bound up to a polynomial factor. Our proof relies on the correspondence of the solutions of entropy-regularized Markov decision processes with gradient flows of the unregularized reward with respect to a Riemannian metric common in natural policy gradient methods. Further, this correspondence allows us to identify the limit of the gradient flow as the generalized maximum entropy optimal policy, thereby characterizing the implicit bias of the Kakade gradient flow which corresponds to a time-continuous version of the natural policy gradient method. We use this to show that for entropy-regularized natural policy gradient methods the overall error decays exponentially in the square root of the number of iterations improving existing sublinear guarantees.
△ Less
Submitted 25 June, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
Inverse design of photonic surfaces on Inconel via multi-fidelity machine learning ensemble framework and high throughput femtosecond laser processing
Authors:
Luka Grbcic,
Minok Park,
Mahmoud Elzouka,
Ravi Prasher,
Juliane Müller,
Costas P. Grigoropoulos,
Sean D. Lubner,
Vassilia Zorba,
Wibe Albert de Jong
Abstract:
We demonstrate a multi-fidelity (MF) machine learning ensemble framework for the inverse design of photonic surfaces, trained on a dataset of 11,759 samples that we fabricate using high throughput femtosecond laser processing. The MF ensemble combines an initial low fidelity model for generating design solutions, with a high fidelity model that refines these solutions through local optimization. T…
▽ More
We demonstrate a multi-fidelity (MF) machine learning ensemble framework for the inverse design of photonic surfaces, trained on a dataset of 11,759 samples that we fabricate using high throughput femtosecond laser processing. The MF ensemble combines an initial low fidelity model for generating design solutions, with a high fidelity model that refines these solutions through local optimization. The combined MF ensemble can generate multiple disparate sets of laser-processing parameters that can each produce the same target input spectral emissivity with high accuracy (root mean squared errors < 2%). SHapley Additive exPlanations analysis shows transparent model interpretability of the complex relationship between laser parameters and spectral emissivity. Finally, the MF ensemble is experimentally validated by fabricating and evaluating photonic surface designs that it generates for improved efficiency energy harvesting devices. Our approach provides a powerful tool for advancing the inverse design of photonic surfaces in energy harvesting applications.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
ADESSE: Advice Explanations in Complex Repeated Decision-Making Environments
Authors:
Sören Schleibaum,
Lu Feng,
Sarit Kraus,
Jörg P. Müller
Abstract:
In the evolving landscape of human-centered AI, fostering a synergistic relationship between humans and AI agents in decision-making processes stands as a paramount challenge. This work considers a problem setup where an intelligent agent comprising a neural network-based prediction component and a deep reinforcement learning component provides advice to a human decision-maker in complex repeated…
▽ More
In the evolving landscape of human-centered AI, fostering a synergistic relationship between humans and AI agents in decision-making processes stands as a paramount challenge. This work considers a problem setup where an intelligent agent comprising a neural network-based prediction component and a deep reinforcement learning component provides advice to a human decision-maker in complex repeated decision-making environments. Whether the human decision-maker would follow the agent's advice depends on their beliefs and trust in the agent and on their understanding of the advice itself. To this end, we developed an approach named ADESSE to generate explanations about the adviser agent to improve human trust and decision-making. Computational experiments on a range of environments with varying model sizes demonstrate the applicability and scalability of ADESSE. Furthermore, an interactive game-based user study shows that participants were significantly more satisfied, achieved a higher reward in the game, and took less time to select an action when presented with explanations generated by ADESSE. These findings illuminate the critical role of tailored, human-centered explanations in AI-assisted decision-making.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks
Authors:
Felix Dangel,
Johannes Müller,
Marius Zeinhofer
Abstract:
Physics-informed neural networks (PINNs) are infamous for being hard to train. Recently, second-order methods based on natural gradient and Gauss-Newton methods have shown promising performance, improving the accuracy achieved by first-order methods by several orders of magnitude. While promising, the proposed methods only scale to networks with a few thousand parameters due to the high computatio…
▽ More
Physics-informed neural networks (PINNs) are infamous for being hard to train. Recently, second-order methods based on natural gradient and Gauss-Newton methods have shown promising performance, improving the accuracy achieved by first-order methods by several orders of magnitude. While promising, the proposed methods only scale to networks with a few thousand parameters due to the high computational cost to evaluate, store, and invert the curvature matrix. We propose Kronecker-factored approximate curvature (KFAC) for PINN losses that greatly reduces the computational cost and allows scaling to much larger networks. Our approach goes beyond the established KFAC for traditional deep learning problems as it captures contributions from a PDE's differential operator that are crucial for optimization. To establish KFAC for such losses, we use Taylor-mode automatic differentiation to describe the differential operator's computation graph as a forward network with shared weights. This allows us to apply KFAC thanks to a recently-developed general formulation for networks with weight sharing. Empirically, we find that our KFAC-based optimizers are competitive with expensive second-order methods on small problems, scale more favorably to higher-dimensional neural networks and PDEs, and consistently outperform first-order methods and LBFGS.
△ Less
Submitted 27 May, 2024; v1 submitted 24 May, 2024;
originally announced May 2024.
-
Reentrant multiple-$\mathbf{q}$ magnetic order and a "spin-cholesteric" phase in Sr$_3$Fe$_2$O$_7$
Authors:
N. D. Andriushin,
J. Muller,
N. S. Pavlovskii,
J. Grumbach,
S. Granovsky,
Y. V. Tymoshenko,
O. Zaharko,
A. Ivanov,
J. Ollivier,
M. Doerr,
B. Keimer,
M. Mostovoy,
D. S. Inosov,
D. C. Peets
Abstract:
Spin-nematic and spin-smectic phases have been reported in magnetic materials, which break rotational symmetry while preserving translational symmetry along certain directions. However, until now the analogy to liquid crystals remained incomplete because no magnetic analog of cholesteric order was known. Here we show that the bilayer perovskite Sr$_3$Fe$_2$O$_7$, previously believed to adopt a sim…
▽ More
Spin-nematic and spin-smectic phases have been reported in magnetic materials, which break rotational symmetry while preserving translational symmetry along certain directions. However, until now the analogy to liquid crystals remained incomplete because no magnetic analog of cholesteric order was known. Here we show that the bilayer perovskite Sr$_3$Fe$_2$O$_7$, previously believed to adopt a simple single-$\mathbf{q}$ spin-helical order, hosts two distinct types of multi-$\mathbf{q}$ spin textures and the first "spin-cholesteric". Its ground state represents a novel multi-$\mathbf{q}$ spin texture with unequally intense spin modulations at the two ordering vectors. This is followed in temperature by the new "spin-cholesteric" phase with spontaneously broken chiral symmetry, in which the translational symmetry is broken only along one of the crystal directions while the weaker orthogonal modulation melts, giving rise to intense short-range dynamical fluctuations. Shortly before the transition to the paramagnetic state, vortex-crystal order spanned by two equivalent $\mathbf{q}$ vectors emerges. The "spin-cholesteric" phase completes the spin analogy with liquid crystals and renders Sr$_3$Fe$_2$O$_7$ a touchstone for studying transitions among multiple-$\mathbf{q}$ spin textures in a centrosymmetric host.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Detection of a 2.85 micrometer Feature on 5 Spinel-rich Asteroids from JWST
Authors:
Jonathan Gomez Barrientos,
Katherine de Kleer,
Bethany L. Ehlmann,
Francois L. H. Tissot,
Jessica Mueller
Abstract:
Ground-based observations of `Barbarian' L-type asteroids at 1 to 2.5-$μ$m indicate that their near-infrared spectra are dominated by the mineral spinel, which has been attributed to a high abundance of calcium-aluminum inclusions (CAIs) -- the first solids to condense out of the protoplanetary disk during the formation of the Solar System. However, the spectral properties of these asteroids from…
▽ More
Ground-based observations of `Barbarian' L-type asteroids at 1 to 2.5-$μ$m indicate that their near-infrared spectra are dominated by the mineral spinel, which has been attributed to a high abundance of calcium-aluminum inclusions (CAIs) -- the first solids to condense out of the protoplanetary disk during the formation of the Solar System. However, the spectral properties of these asteroids from 2.5 to 5-$μ$m, a wavelength region that covers signatures of hydrated minerals, water, and organics, have not yet been explored. Here, we present 2 to 5-$μ$m reflectance spectra of five spinel-rich asteroids obtained with the NIRSpec instrument on the James Webb Space Telescope. All five targets exhibit a $\sim$ 2.85-$μ$m absorption feature with a band depth of 3-6$\%$ that appears correlated in strength with that of the 2-$μ$m spinel absorption feature. The shape and position of the 2.85-$μ$m feature are not a good match to the 2.7-$μ$m feature commonly seen in carbonaceous CM meteorites or C-type asteroids. The closest spectral matches are to the Moon and Vesta, suggesting commonalities in aqueous alteration across silicate bodies, infall of hydrated material, and/or space weathering by solar wind H implantation. Lab spectra of CO/CV chondrites, CAIs, as well as the minerals cronstedtite and spinel, also show a similar feature, providing clues into the origin of the 2.85-$μ$m feature.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Combined Classical and Quantum Accelerometers For the Next Generation of Satellite Gravity Missions
Authors:
Alireza HosseiniArani,
Manuel Schilling,
Benjamin Tennstedt,
Alexey Kupriyanov,
Quentin Beaufils,
Annike Knabe,
Arpetha C. Sreekantaiah,
Franck Pereira dos Santos,
Steffen Schön,
Jürgen Müller
Abstract:
Cold atom interferometry (CAI)-based quantum accelerometers are very promising for future satellite gravity missions thanks to their strength in providing long-term stable and precise measurements of non-gravitational accelerations. However, their limitations due to the low measurement rate and the existence of ambiguities in the raw sensor measurements call for hybridization of the quantum accele…
▽ More
Cold atom interferometry (CAI)-based quantum accelerometers are very promising for future satellite gravity missions thanks to their strength in providing long-term stable and precise measurements of non-gravitational accelerations. However, their limitations due to the low measurement rate and the existence of ambiguities in the raw sensor measurements call for hybridization of the quantum accelerometer (Q-ACC) with a classical one (e.g., electrostatic) with higher bandwidth. While previous hybridization studies have so far considered simple noise models for the Q-ACC and neglected the impact of satellite rotation on the phase shift of the accelerometer, we perform here a more advanced hybridization simulation by implementing a comprehensive noise model for the satellite-based quantum accelerometers and considering the full impact of rotation, gravity gradient, and self-gravity on the instrument. We perform simulation studies for scenarios with different assumptions about quantum and classical sensors and satellite missions. The performance benefits of the hybrid solutions, taking the synergy of both classical and quantum accelerometers into account, will be quantified. We found that implementing a hybrid accelerometer onboard a future gravity mission improves the gravity solution by one to two orders in lower and higher degrees. In particular, the produced global gravity field maps show a drastic reduction in the instrumental contribution to the stri** effect after introducing measurements from the hybrid accelerometers.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
Simulating X-ray absorption spectroscopy of battery materials on a quantum computer
Authors:
Stepan Fomichev,
Kasra Hejazi,
Ignacio Loaiza,
Modjtaba Shokrian Zini,
Alain Delgado,
Arne-Christian Voigt,
Jonathan E. Mueller,
Juan Miguel Arrazola
Abstract:
X-ray absorption spectroscopy is a crucial experimental technique for elucidating the mechanisms of structural degradation in battery materials. However, extracting information from the measured spectrum is challenging without high-quality simulations. In this work, we propose simulating near-edge X-ray absorption spectra as a promising application for quantum computing. It is attractive due to th…
▽ More
X-ray absorption spectroscopy is a crucial experimental technique for elucidating the mechanisms of structural degradation in battery materials. However, extracting information from the measured spectrum is challenging without high-quality simulations. In this work, we propose simulating near-edge X-ray absorption spectra as a promising application for quantum computing. It is attractive due to the ultralocal nature of X-ray absorption that significantly reduces the sizes of problems to be simulated, and because of the classical hardness of simulating spectra. We describe three quantum algorithms to compute the X-ray absorption spectrum and provide their asymptotic cost. One of these is a Monte-Carlo based time-domain algorithm, which is cost-friendly to early fault-tolerant quantum computers. We then apply the framework to an industrially relevant example, a CAS(22e,18o) active space for an O-Mn cluster in a Li-excess battery cathode, showing that practically useful simulations could be obtained with much fewer qubits and gates than ground-state energy estimation of the same material.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Dimensionality reduction in bulk-boundary reaction-diffusion systems
Authors:
Tom Burkart,
Benedikt J. Müller,
Erwin Frey
Abstract:
Intracellular protein patterns regulate many vital cellular functions, such as the processing of spatiotemporal information or the control of shape deformations. To do so, pattern-forming systems can be sensitive to the cell geometry by means of coupling the protein dynamics on the cell membrane to dynamics in the cytosol. Recent studies demonstrated that modeling the cytosolic dynamics in terms o…
▽ More
Intracellular protein patterns regulate many vital cellular functions, such as the processing of spatiotemporal information or the control of shape deformations. To do so, pattern-forming systems can be sensitive to the cell geometry by means of coupling the protein dynamics on the cell membrane to dynamics in the cytosol. Recent studies demonstrated that modeling the cytosolic dynamics in terms of an averaged protein pool disregards possibly crucial aspects of the pattern formation, most importantly concentration gradients normal to the membrane. At the same time, the coupling of two domains (surface and volume) with different dimensions renders many standard tools for the numerical analysis of self-organizing systems inefficient. Here, we present a generic framework for projecting the cytosolic dynamics onto the lower-dimensional surface that respects the influence of cytosolic concentration gradients in static and evolving geometries. This method uses a priori physical information about the system to approximate the cytosolic dynamics by a small number of dominant characteristic concentration profiles (basis), akin to basis transformations of finite element methods. As a proof of concept, we apply our framework to a toy model for volume-dependent interrupted coarsening, evaluate the accuracy of the results for various basis choices, and discuss the optimal basis choice for biologically relevant systems. Our analysis presents an efficient yet accurate method for analysing pattern formation with surface-volume coupling in evolving geometries.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Effective Quadratic Error Bounds for Floating-Point Algorithms Computing the Hypotenuse Function
Authors:
Jean-Michel Muller,
Bruno Salvy
Abstract:
We provide tools to help automate the error analysis of algorithms that evaluate simple functions over the floating-point numbers. The aim is to obtain tight relative error bounds for these algorithms, expressed as a function of the unit round-off. Due to the discrete nature of the set of floating-point numbers, the largest errors are often intrinsically "arithmetic" in the sense that their appear…
▽ More
We provide tools to help automate the error analysis of algorithms that evaluate simple functions over the floating-point numbers. The aim is to obtain tight relative error bounds for these algorithms, expressed as a function of the unit round-off. Due to the discrete nature of the set of floating-point numbers, the largest errors are often intrinsically "arithmetic" in the sense that their appearance may depend on specific bit patterns in the binary representations of intermediate variables, which may be present only for some precisions. We focus on generic (i.e., parameterized by the precision) and analytic over-estimations that still capture the correlations between the errors made at each step of the algorithms. Using methods from computer algebra, which we adapt to the particular structure of the polynomial systems that encode the errors, we obtain bounds with a linear term in the unit round-off that is sharp in manycases. An explicit quadratic bound is given, rather than the $O()$-estimate that is more common in this area. This is particularly important when using low precision formats, which are increasingly common in modern processors. Using this approach, we compare five algorithms for computing the hypotenuse function, ranging from elementary to quite challenging.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
BSMPT v3 A Tool for Phase Transitions and Primordial Gravitational Waves in Extended Higgs Sectors
Authors:
Philipp Basler,
Lisa Biermann,
Margarete Mühlleitner,
Jonas Müller,
Rui Santos,
João Viana
Abstract:
Strong first-order phase transitions (SFOPT) during the evolution of the Higgs potential in the early universe not only allow for the dynamical generation of the observed matter-antimatter asymmetry, they can also source a stochastic gravitational wave (GW) background possibly detectable with future space-based gravitational waves interferometers. As SFOPTs are phenomenologically incompatible with…
▽ More
Strong first-order phase transitions (SFOPT) during the evolution of the Higgs potential in the early universe not only allow for the dynamical generation of the observed matter-antimatter asymmetry, they can also source a stochastic gravitational wave (GW) background possibly detectable with future space-based gravitational waves interferometers. As SFOPTs are phenomenologically incompatible with the Standard Model (SM) Higgs sector, the observation of GWs from SFOPTs provides an exciting interplay between cosmology and particle physics in the search for new physics. With the C++ code BSMPTv3, we present for the first time a tool that performs the whole chain from the particle physics model to the gravitational wave spectrum. Extending the previous versions BSMPTv1 and v2, it traces the phases of beyond-SM (BSM) Higgs potentials and is capable of treating multiple vacuum directions and multi-step phase transitions. During the tracing, it checks for discrete symmetries, flat directions, and electroweak symmetry restoration, and finally reports the transition history. The transition probability from the false to the true vacuum is obtained from the solution of the bounce equation which allows for the calculation of the nucleation, percolation and completion temperatures. The peak amplitude and frequency of the GWs originating from sound waves and turbulence, are evaluated after the calculation of the thermal parameters at the transition temperature, and finally the signal-to-noise ratio at LISA is provided. The code BSMPTv3 is a powerful self-contained tool that comes more than timely and will be of great benefit for investigations of the vacuum structure of the early universe of not only simple but also complicated Higgs potentials involving several vacuum directions, with exciting applications in the search for new physics.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
SIM2VR: Towards Automated Biomechanical Testing in VR
Authors:
Florian Fischer,
Aleksi Ikkala,
Markus Klar,
Arthur Fleig,
Miroslav Bachinski,
Roderick Murray-Smith,
Perttu Hämäläinen,
Antti Oulasvirta,
Jörg Müller
Abstract:
Automated biomechanical testing has great potential for the development of VR applications, as initial insights into user behaviour can be gained in silico early in the design process. In particular, it allows prediction of user movements and ergonomic variables, such as fatigue, prior to conducting user studies. However, there is a fundamental disconnect between simulators hosting state-of-the-ar…
▽ More
Automated biomechanical testing has great potential for the development of VR applications, as initial insights into user behaviour can be gained in silico early in the design process. In particular, it allows prediction of user movements and ergonomic variables, such as fatigue, prior to conducting user studies. However, there is a fundamental disconnect between simulators hosting state-of-the-art biomechanical user models and simulators used to develop and run VR applications. Existing user simulators often struggle to capture the intricacies and nuances of real-world VR applications, reducing ecological validity of user predictions. In this paper, we introduce SIM2VR, a system that aligns user simulation with a given VR application by establishing a continuous closed loop between the two processes. This, for the first time, enables training simulated users directly in the same VR application that real users interact with. We demonstrate that SIM2VR can predict differences in user performance, ergonomics and strategies in a fast-paced, dynamic arcade game. In order to expand the scope of automated biomechanical testing beyond simple visuomotor tasks, advances in cognitive models and reward function design will be needed.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Dynamics of spin helices in the diluted one-dimensional $XX$ model
Authors:
Darren Pereira,
Erich J. Mueller
Abstract:
Motivated by discrepancies between recent cold atom experiments and the associated theory, we explore the effect of immobile holes on the quantum dynamics of $x$-$z$ spin helices in the one-dimensional $XX$ model. We calculate the exact spin dynamics by map** onto a system of non-interacting fermions, averaging over the distribution of holes. At small hole densities we find that the helical spin…
▽ More
Motivated by discrepancies between recent cold atom experiments and the associated theory, we explore the effect of immobile holes on the quantum dynamics of $x$-$z$ spin helices in the one-dimensional $XX$ model. We calculate the exact spin dynamics by map** onto a system of non-interacting fermions, averaging over the distribution of holes. At small hole densities we find that the helical spin pattern decays exponentially, with a pitch dependence that agrees with the experiments. At large hole densities we instead find persistent oscillations. While our analytic approach does not generalize to the $XXZ$ model with arbitrary anisotropies, we validate a matrix product state technique which might be used to model the experiments in those settings.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Advances in Atom Interferometry and their Impacts on the Performance of Quantum Accelerometers On-board Future Satellite Gravity Missions
Authors:
Alireza HosseiniArania,
Manuel Schilling,
Quentin Beaufils,
Annike Knabe,
Benjamin Tennstedt,
Alexey Kupriyanov,
Steffen Schön,
Franck Pereira dos Santos,
Jürgen Müller
Abstract:
Recent advances in cold atom interferometry have cleared the path for space applications of quantum inertial sensors, whose level of stability is expected to increase dramatically with the longer interrogation times accessible in space. In this study, a comprehensive in-orbit model is developed for a Mach-Zehnder-type cold-atom accelerometer. Performance tests are realized under different assumpti…
▽ More
Recent advances in cold atom interferometry have cleared the path for space applications of quantum inertial sensors, whose level of stability is expected to increase dramatically with the longer interrogation times accessible in space. In this study, a comprehensive in-orbit model is developed for a Mach-Zehnder-type cold-atom accelerometer. Performance tests are realized under different assumptions, and the impact of various sources of errors on instrument stability is evaluated. Current and future advances for space-based atom interferometry are discussed, and their impact on the performance of quantum sensors on-board satellite gravity missions is investigated in three different scenarios: state-of-the-art scenario, near-future (between the next 5 and 10 years) and far-future scenarios (between the next 10 to 20 years). We show that one can achieve a sensitivity level close to 5E-10 with the current state-of-the-art technology. We also estimate that in the near and far-future, atom interferometry in space is expected to achieve sensitivity levels of 1E-11 and 1E-12, respectively. A roadmap for improvements in atom interferometry is provided that would maximize the performance of future CAI accelerometers, considering their technical capabilities. Finally, the possibility and challenges of having ultra-sensitive atom interferometry in space for future space missions are discussed.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
High temperature transport in the one dimensional mass-imbalanced Fermi-Hubbard model
Authors:
Thomas G. Kiely,
Erich J. Mueller
Abstract:
We study transport in the one-dimensional mass-imbalanced Fermi-Hubbard model at infinite temperature, focusing on the case of strong interactions. Prior theoretical and experimental investigations have revealed unconventionally long transport timescales, with complications due to strong finite size effects. We compute the dynamical current-current correlation function directly in the thermodynami…
▽ More
We study transport in the one-dimensional mass-imbalanced Fermi-Hubbard model at infinite temperature, focusing on the case of strong interactions. Prior theoretical and experimental investigations have revealed unconventionally long transport timescales, with complications due to strong finite size effects. We compute the dynamical current-current correlation function directly in the thermodynamic limit using infinite tensor network techniques. We show that transport in the strong-imbalance limit is dominated by AC resonances, which we compute with an analytic expansion. We study the dephasing of these resonances with mass imbalance, $η$. In the small-imbalance limit, the model is nearly integrable. We connect these unusual limits by computing the DC conductivity and transport decay time as a function of $η$ and the interaction strength $U/t$. We propose an experimental protocol to measure these correlation functions in cold atom experiments.
△ Less
Submitted 31 May, 2024; v1 submitted 11 April, 2024;
originally announced April 2024.
-
Adaptive Computing for Scale-up Problems
Authors:
Hilary Egan,
Kevin Patrick Griffin,
Marc T. Henry de Frahan,
Juliane Mueller,
Deepthi Vaidhynatha,
Dylan Wald,
Rohit Chintala,
Olga A. Doronina,
Ryan King,
Jibonananda Sanyal,
Marc Day
Abstract:
Adaptive Computing is an application-agnostic outer loop framework to strategically deploy simulations and experiments to guide decision making for scale-up analysis. Resources are allocated over successive batches, which makes the allocation adaptive to some objective such as optimization or model training. The framework enables the characterization and management of uncertainties associated with…
▽ More
Adaptive Computing is an application-agnostic outer loop framework to strategically deploy simulations and experiments to guide decision making for scale-up analysis. Resources are allocated over successive batches, which makes the allocation adaptive to some objective such as optimization or model training. The framework enables the characterization and management of uncertainties associated with predictive models of complex systems when scale-up questions lead to significant model extrapolation. A key feature of this framework is the ability to explicitly utilize user-specified uncertainty priors, which we call model-specific local trust estimates, that are provided directly together with the problem specification and exploited in adaptive sampling strategies. A multi-fidelity model hierarchy is supported to allow trade-offs in accuracy and data acquisition cost while exploring the search space given a specified budget of potentially distributed, heterogeneous resources. We discuss application of this framework to problems in the renewable energy space, including biofuels production, material synthesis, perovskite crystal growth, and building electrical loads.
△ Less
Submitted 25 March, 2024;
originally announced April 2024.
-
Implications of the AI Act for Non-Discrimination Law and Algorithmic Fairness
Authors:
Luca Deck,
Jan-Laurin Müller,
Conradin Braun,
Domenique Zipperling,
Niklas Kühl
Abstract:
The topic of fairness in AI, as debated in the FATE (Fairness, Accountability, Transparency, and Ethics in AI) communities, has sparked meaningful discussions in the past years. However, from a legal perspective, particularly from the perspective of European Union law, many open questions remain. Whereas algorithmic fairness aims to mitigate structural inequalities at design-level, European non-di…
▽ More
The topic of fairness in AI, as debated in the FATE (Fairness, Accountability, Transparency, and Ethics in AI) communities, has sparked meaningful discussions in the past years. However, from a legal perspective, particularly from the perspective of European Union law, many open questions remain. Whereas algorithmic fairness aims to mitigate structural inequalities at design-level, European non-discrimination law is tailored to individual cases of discrimination after an AI model has been deployed. The AI Act might present a tremendous step towards bridging these two approaches by shifting non-discrimination responsibilities into the design stage of AI models. Based on an integrative reading of the AI Act, we comment on legal as well as technical enforcement problems and propose practical implications on bias detection and bias correction in order to specify and comply with specific technical requirements.
△ Less
Submitted 26 June, 2024; v1 submitted 29 March, 2024;
originally announced March 2024.
-
Fisher-Rao Gradient Flows of Linear Programs and State-Action Natural Policy Gradients
Authors:
Johannes Müller,
Semih Çaycı,
Guido Montúfar
Abstract:
Kakade's natural policy gradient method has been studied extensively in the last years showing linear convergence with and without regularization. We study another natural gradient method which is based on the Fisher information matrix of the state-action distributions and has received little attention from the theoretical side. Here, the state-action distributions follow the Fisher-Rao gradient f…
▽ More
Kakade's natural policy gradient method has been studied extensively in the last years showing linear convergence with and without regularization. We study another natural gradient method which is based on the Fisher information matrix of the state-action distributions and has received little attention from the theoretical side. Here, the state-action distributions follow the Fisher-Rao gradient flow inside the state-action polytope with respect to a linear potential. Therefore, we study Fisher-Rao gradient flows of linear programs more generally and show linear convergence with a rate that depends on the geometry of the linear program. Equivalently, this yields an estimate on the error induced by entropic regularization of the linear program which improves existing results. We extend these results and show sublinear convergence for perturbed Fisher-Rao gradient flows and natural gradient flows up to an approximation error. In particular, these general results cover the case of state-action natural policy gradients.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Offline tagging of radon-induced backgrounds in XENON1T and applicability to other liquid xenon detectors
Authors:
E. Aprile,
J. Aalbers,
K. Abe,
S. Ahmed Maouloud,
L. Althueser,
B. Andrieu,
E. Angelino,
J. R. Angevaare,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
M. Bazyk,
L. Bellagamba,
R. Biondi,
A. Bismark,
E. J. Brookes,
A. Brown,
G. Bruno,
R. Budnik,
T. K. Bui,
J. M. R. Cardoso,
A. P. Cimental Chavez,
A. P. Colijn,
J. Conrad
, et al. (142 additional authors not shown)
Abstract:
This paper details the first application of a software tagging algorithm to reduce radon-induced backgrounds in liquid noble element time projection chambers, such as XENON1T and XENONnT. The convection velocity field in XENON1T was mapped out using $^{222}\text{Rn}$ and $^{218}\text{Po}$ events, and the root-mean-square convection speed was measured to be $0.30 \pm 0.01$ cm/s. Given this velocity…
▽ More
This paper details the first application of a software tagging algorithm to reduce radon-induced backgrounds in liquid noble element time projection chambers, such as XENON1T and XENONnT. The convection velocity field in XENON1T was mapped out using $^{222}\text{Rn}$ and $^{218}\text{Po}$ events, and the root-mean-square convection speed was measured to be $0.30 \pm 0.01$ cm/s. Given this velocity field, $^{214}\text{Pb}$ background events can be tagged when they are followed by $^{214}\text{Bi}$ and $^{214}\text{Po}$ decays, or preceded by $^{218}\text{Po}$ decays. This was achieved by evolving a point cloud in the direction of a measured convection velocity field, and searching for $^{214}\text{Bi}$ and $^{214}\text{Po}$ decays or $^{218}\text{Po}$ decays within a volume defined by the point cloud. In XENON1T, this tagging system achieved a $^{214}\text{Pb}$ background reduction of $6.2^{+0.4}_{-0.9}\%$ with an exposure loss of $1.8\pm 0.2 \%$, despite the timescales of convection being smaller than the relevant decay times. We show that the performance can be improved in XENONnT, and that the performance of such a software-tagging approach can be expected to be further improved in a diffusion-limited scenario. Finally, a similar method might be useful to tag the cosmogenic $^{137}\text{Xe}$ background, which is relevant to the search for neutrinoless double-beta decay.
△ Less
Submitted 19 June, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Automated Data Curation for Robust Language Model Fine-Tuning
Authors:
Jiuhai Chen,
Jonas Mueller
Abstract:
Large Language Models have become the de facto approach to sequence-to-sequence text generation tasks, but for specialized tasks/domains, a pretrained LLM lacks specific capabilities to produce accurate or well-formatted responses. Supervised fine-tuning specializes a LLM by training it on dataset of example prompts with target responses, but real-world data tends to be noisy. While many fine-tuni…
▽ More
Large Language Models have become the de facto approach to sequence-to-sequence text generation tasks, but for specialized tasks/domains, a pretrained LLM lacks specific capabilities to produce accurate or well-formatted responses. Supervised fine-tuning specializes a LLM by training it on dataset of example prompts with target responses, but real-world data tends to be noisy. While many fine-tuning algorithms exist, here we consider a \emph{data-centric AI} perspective on LLM fine-tuning, studying how to \emph{systematically} curate the training dataset to improve the LLM produced via \emph{any} fine-tuning algorithm.
We introduce an automated data curation pipeline CLEAR (Confidence-based LLM Evaluation And Rectification) for instruction tuning datasets, that can be used with any LLM and fine-tuning procedure. CLEAR estimates which training data is low-quality and either filters or corrects it. Automatically identifying which data to filter or correct is done via LLM-derived confidence estimates, to ensure only confident modifications to the dataset. Unlike existing data curation techniques, CLEAR is a comprehensive framework that can improve a dataset (and trained model outputs) without additional fine-tuning computations. We don't assume access to a stronger LLM than the model being fine-tuned (e.g.\ relying on GPT-4 when fine-tuning GPT-3.5), to see whether CLEAR can meaningfully improve the capabilities of any LLM. Experiments reveal that CLEAR consistently improves the performance of fine-tuned models across many datasets and models (like GPT-3.5 and Llama2).
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Laser Annealed SiO2/Si1-xGex Scaffolds for Nanoscaled Devices, Synergy of Experiment and Computation
Authors:
Damiano Ricciarelli,
Jonas Müller,
Guilhem Larrieu,
Ioannis Deretzis,
Gaetano Calogero,
Enrico Martello,
Giuseppe Fisicaro,
Jean-Michel Hartmann,
Sébastien Kerdilès,
Mathieu Opprecht,
Antonio Massimiliano Mio,
Richard Daubriac,
Fuccio Cristiano,
Antonino La Magna
Abstract:
Ultraviolet nanosecond laser annealing (UV-NLA) proves to be an important technique, particularly when tightly controlled heating and melting are necessary. In the realm of semiconductor technologies, the significance of nanosecond laser annealing (NLA) grows in tandem with the escalating intricacy of integration schemes in nano-scaled devices. Silicon-germanium alloys have been studied for decade…
▽ More
Ultraviolet nanosecond laser annealing (UV-NLA) proves to be an important technique, particularly when tightly controlled heating and melting are necessary. In the realm of semiconductor technologies, the significance of nanosecond laser annealing (NLA) grows in tandem with the escalating intricacy of integration schemes in nano-scaled devices. Silicon-germanium alloys have been studied for decades for their compatibility with silicon devices. Indeed, they enable the manipulation of properties like strain, carrier mobilities and bandgap. In this framework, they can for instance boost the performances of p-type MOSFETs but also enable near infra-red absorption and emission for applications in photo-detection and photonics. Laser melting on such type of layers, however results, up to now, in the development of extended defects and poor control over layer morphology and homogeneity. In our study, we investigate the laser melting of ~700 nm thick relaxed silicon-germanium samples coated with SiO2 nano-arrays, observing the resulting material to maintain an unaltered lattice. We found the geometrical parameters of the silicon oxide having an impact on the thermal budget samples see, influencing melt threshold, melt depth and germanium distribution.
△ Less
Submitted 6 May, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Coherent Acoustic Control of Defect Orbital States in the Strong-Driving Limit
Authors:
B. A. McCullian,
V. Sharma,
H. Y. Chen,
J. C. Crossman,
E. J. Mueller,
G. D. Fuchs
Abstract:
We use a bulk acoustic wave resonator to demonstrate coherent control of the excited orbital states in a diamond nitrogen-vacancy (NV) center at cryogenic temperature. Coherent quantum control is an essential tool for understanding and mitigating decoherence. Moreover, characterizing and controlling orbital states is a central challenge for quantum networking, where optical coherence is tied to or…
▽ More
We use a bulk acoustic wave resonator to demonstrate coherent control of the excited orbital states in a diamond nitrogen-vacancy (NV) center at cryogenic temperature. Coherent quantum control is an essential tool for understanding and mitigating decoherence. Moreover, characterizing and controlling orbital states is a central challenge for quantum networking, where optical coherence is tied to orbital coherence. We study resonant multi-phonon orbital Rabi oscillations in both the frequency and time domain, extracting the strength of the orbital-phonon interactions and the coherence of the acoustically driven orbital states. We reach the strong-driving limit, where the physics is dominated by the coupling induced by the acoustic waves. We find agreement between our measurements, quantum master equation simulations, and a Landau-Zener transition model in the strong-driving limit. Using perturbation theory, we derive an expression for the orbital Rabi frequency versus acoustic drive strength that is non-perturbative in the drive strength and agrees well with our measurements for all acoustic powers. Motivated by continuous wave spin resonance-based decoherence protection schemes, we model the orbital decoherence and find good agreement between our model and our measured few-to-several nanoseconds orbital decoherence times. We discuss the outlook for orbital decoherence protection.
△ Less
Submitted 16 March, 2024;
originally announced March 2024.
-
Performance of a modular ton-scale pixel-readout liquid argon time projection chamber
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1340 additional authors not shown)
Abstract:
The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmi…
▽ More
The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Authors:
Patrick Esser,
Sumith Kulal,
Andreas Blattmann,
Rahim Entezari,
Jonas Müller,
Harry Saini,
Yam Levi,
Dominik Lorenz,
Axel Sauer,
Frederic Boesel,
Dustin Podell,
Tim Dockhorn,
Zion English,
Kyle Lacey,
Alex Goodwin,
Yannik Marek,
Robin Rombach
Abstract:
Diffusion models create data from noise by inverting the forward paths of data towards noise and have emerged as a powerful generative modeling technique for high-dimensional, perceptual data such as images and videos. Rectified flow is a recent generative model formulation that connects data and noise in a straight line. Despite its better theoretical properties and conceptual simplicity, it is n…
▽ More
Diffusion models create data from noise by inverting the forward paths of data towards noise and have emerged as a powerful generative modeling technique for high-dimensional, perceptual data such as images and videos. Rectified flow is a recent generative model formulation that connects data and noise in a straight line. Despite its better theoretical properties and conceptual simplicity, it is not yet decisively established as standard practice. In this work, we improve existing noise sampling techniques for training rectified flow models by biasing them towards perceptually relevant scales. Through a large-scale study, we demonstrate the superior performance of this approach compared to established diffusion formulations for high-resolution text-to-image synthesis. Additionally, we present a novel transformer-based architecture for text-to-image generation that uses separate weights for the two modalities and enables a bidirectional flow of information between image and text tokens, improving text comprehension, typography, and human preference ratings. We demonstrate that this architecture follows predictable scaling trends and correlates lower validation loss to improved text-to-image synthesis as measured by various metrics and human evaluations. Our largest models outperform state-of-the-art models, and we will make our experimental data, code, and model weights publicly available.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Magnetism, heat capacity and electronic structure of EuCd$_2$P$_2$ in view of its colossal magnetoresistance
Authors:
Dmitry Yu. Usachov,
Sarah Krebber,
Kirill A. Bokai,
Artem V. Tarasov,
Marvin Kopp,
Charu Garg,
Alexander Virovets,
Jens Müller,
Max Mende,
Georg Poelchen,
Denis V. Vyalikh,
Cornelius Krellner,
Kristin Kliemt
Abstract:
The mechanism of the peculiar transport properties around the magnetic ordering temperature of semiconducting antiferromagnetic EuCd$_2$P$_2$ is not yet understood. With a huge peak in the resistivity observed above the Néel temperature, $T_{\rm N}=10.6\,\rm K$, it exhibits a colossal magnetoresistance effect. Recent reports on observations of ferromagnetic contributions above $T_{\rm N}$ as well…
▽ More
The mechanism of the peculiar transport properties around the magnetic ordering temperature of semiconducting antiferromagnetic EuCd$_2$P$_2$ is not yet understood. With a huge peak in the resistivity observed above the Néel temperature, $T_{\rm N}=10.6\,\rm K$, it exhibits a colossal magnetoresistance effect. Recent reports on observations of ferromagnetic contributions above $T_{\rm N}$ as well as metallic behavior below this temperature have motivated us to perform a comprehensive characterization of this material, including its resistivity, heat capacity, magnetic properties and electronic structure. Our transport measurements revealed quite different temperature dependence of resistivity with the maximum at $14\,\rm K$ instead of previously reported $18\,\rm K$. Low-field susceptibility data support the presence of static ferromagnetism above $T_{\rm N}$ and show a complex behavior of the material at small applied magnetic fields. Namely, signatures of reorientation of magnetic domains are observed up to $T=16\,\rm K$. Our magnetization measurements indicate a magnetocrystalline anisotropy which also leads to a preferred alignment of the magnetic clusters above $T_{\rm N}$. The momentum-resolved photoemission experiments at temperatures from $24\,\rm K$ down to $2.5\,\rm K$ indicate the permanent presence of a fundamental band gap without change of the electronic structure when going through $T_N$ that is in contradiction with previous results. We performed \textit{ab initio} band structure calculations which are in good agreement with the measured photoemission data when assuming an antiferromagnetic ground state. Calculations for the ferromagnetic phase show a much smaller bandgap, indicating the importance of possible ferromagnetic contributions for the explanation of the colossal magnetoresistance effect in the related EuZn$_2$P$_2$.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Origin of magnetic switching cascades in tetrahedral CoFe nanostructures
Authors:
Christian Schröder,
Bereket Ghebretinsae,
Martin Lonsky,
Mohanad Al Mamoori,
Fabrizio Porrati,
Michael Huth,
Jens Müller
Abstract:
We present a comprehensive study of small-scale three-dimensional (3D) tetrahedral CoFe nanostructure arrays prepared by focused electron beam-induced deposition (FEBID) and placed in two distinct orientations with respect to the direction of an external magnetic field. Using ultra-sensitive micro-Hall magnetometry we obtain angular-dependent magnetic stray field hysteresis loops that show charact…
▽ More
We present a comprehensive study of small-scale three-dimensional (3D) tetrahedral CoFe nanostructure arrays prepared by focused electron beam-induced deposition (FEBID) and placed in two distinct orientations with respect to the direction of an external magnetic field. Using ultra-sensitive micro-Hall magnetometry we obtain angular-dependent magnetic stray field hysteresis loops that show characteristic cascading magnetic switching close to zero magnetic field. By employing micromagnetic simulations we could reproduce the hysteresis loops and identify characteristic field dependent magnetic configurations including a vortex-type groundstate. From this we derive a coarse-graining macrospin model and show that the complex switching behavior can be explained by the reorientation dynamics of non-interacting uniaxial anisotropic magnetic grains modeled as a superposition of Stoner-Wohlfarth particles.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
The XENONnT Dark Matter Experiment
Authors:
XENON Collaboration,
E. Aprile,
J. Aalbers,
K. Abe,
S. Ahmed Maouloud,
L. Althueser,
B. Andrieu,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
M. Balata,
L. Baudis,
A. L. Baxter,
M. Bazyk,
L. Bellagamba,
R. Biondi,
A. Bismark,
E. J. Brookes,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
T. K. Bui
, et al. (170 additional authors not shown)
Abstract:
The multi-staged XENON program at INFN Laboratori Nazionali del Gran Sasso aims to detect dark matter with two-phase liquid xenon time projection chambers of increasing size and sensitivity. The XENONnT experiment is the latest detector in the program, planned to be an upgrade of its predecessor XENON1T. It features an active target of 5.9 tonnes of cryogenic liquid xenon (8.5 tonnes total mass in…
▽ More
The multi-staged XENON program at INFN Laboratori Nazionali del Gran Sasso aims to detect dark matter with two-phase liquid xenon time projection chambers of increasing size and sensitivity. The XENONnT experiment is the latest detector in the program, planned to be an upgrade of its predecessor XENON1T. It features an active target of 5.9 tonnes of cryogenic liquid xenon (8.5 tonnes total mass in cryostat). The experiment is expected to extend the sensitivity to WIMP dark matter by more than an order of magnitude compared to XENON1T, thanks to the larger active mass and the significantly reduced background, improved by novel systems such as a radon removal plant and a neutron veto. This article describes the XENONnT experiment and its sub-systems in detail and reports on the detector performance during the first science run.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Position: Optimization in SciML Should Employ the Function Space Geometry
Authors:
Johannes Müller,
Marius Zeinhofer
Abstract:
Scientific machine learning (SciML) is a relatively new field that aims to solve problems from different fields of natural sciences using machine learning tools. It is well-documented that the optimizers commonly used in other areas of machine learning perform poorly on many SciML problems. We provide an infinite-dimensional view on optimization problems encountered in scientific machine learning…
▽ More
Scientific machine learning (SciML) is a relatively new field that aims to solve problems from different fields of natural sciences using machine learning tools. It is well-documented that the optimizers commonly used in other areas of machine learning perform poorly on many SciML problems. We provide an infinite-dimensional view on optimization problems encountered in scientific machine learning and advocate for the paradigm first optimize, then discretize for their solution. This amounts to first choosing an appropriate infinite-dimensional algorithm which is then discretized in a second step. To illustrate this point, we show that recently proposed state-of-the-art algorithms for SciML applications can be derived within this framework. As the infinite-dimensional viewpoint is presently underdeveloped in scientific machine learning, we formalize it here and advocate for its use in SciML in the development of efficient optimization algorithms.
△ Less
Submitted 28 May, 2024; v1 submitted 11 February, 2024;
originally announced February 2024.
-
Opinion models, data, and politics
Authors:
Matthias Gsänger,
Volker Hösel,
Christoph Mohamad-Klotzbach,
Johannes Müller
Abstract:
We investigate the connection between Potts (Curie-Weiss) models and stochastic opinion models in the view of the Boltzmann distribution and stochastic Glauber dynamics. We particularly find that the q-voter model can be considered as a natural extension of the Zealot model which is adapted by Lagrangian parameters. We also discuss weak and strong effects continuum limits for the models. We then f…
▽ More
We investigate the connection between Potts (Curie-Weiss) models and stochastic opinion models in the view of the Boltzmann distribution and stochastic Glauber dynamics. We particularly find that the q-voter model can be considered as a natural extension of the Zealot model which is adapted by Lagrangian parameters. We also discuss weak and strong effects continuum limits for the models. We then fit four models (Curie-Weiss, strong and weak effects limit for the q-voter model, and the reinforcement model) to election data from United States, United Kingdom, France and Germany. We find that particularly the weak effects models are able to fit the data (Kolmogorov-Smirnov test), where the weak effects reinforcement model performs best (AIC). The resulting estimates are interpreted in the view of political sciences, and also the importance of this kind of model-based approaches to election data for the political sciences is discussed.
△ Less
Submitted 10 February, 2024;
originally announced February 2024.
-
Efficient and Interpretable Traffic Destination Prediction using Explainable Boosting Machines
Authors:
Yasin Yousif,
Jörg Müller
Abstract:
Develo** accurate models for traffic trajectory predictions is crucial for achieving fully autonomous driving. Various deep neural network models have been employed to address this challenge, but their black-box nature hinders transparency and debugging capabilities in a deployed system. Glass-box models offer a solution by providing full interpretability through methods like \ac{GAM}. In this s…
▽ More
Develo** accurate models for traffic trajectory predictions is crucial for achieving fully autonomous driving. Various deep neural network models have been employed to address this challenge, but their black-box nature hinders transparency and debugging capabilities in a deployed system. Glass-box models offer a solution by providing full interpretability through methods like \ac{GAM}. In this study, we evaluate an efficient additive model called \ac{EBM} for traffic prediction on three popular mixed traffic datasets: \ac{SDD}, \ac{InD}, and Argoverse. Our results show that the \ac{EBM} models perform competitively in predicting pedestrian destinations within \ac{SDD} and \ac{InD} while providing modest predictions for vehicle-dominant Argoverse dataset. Additionally, our transparent trained models allow us to analyse feature importance and interactions, as well as provide qualitative examples of predictions explanation. The full training code will be made public upon publication.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Creating a Synthesizer from Schrödinger's Equation
Authors:
Arthur Freye,
Jannis Müller
Abstract:
Our project offers an alternative approach to the sensory perception of the Schrödinger equation (an elementary model of quantum phenomena) by interpreting it as a sound wave. We are building a synthesizer plugin that simulates a quantum mechanical state that evolves over time. Thus, our tool allows the creation of unique sounds that are in motion and feel alive. These can be used in professional…
▽ More
Our project offers an alternative approach to the sensory perception of the Schrödinger equation (an elementary model of quantum phenomena) by interpreting it as a sound wave. We are building a synthesizer plugin that simulates a quantum mechanical state that evolves over time. Thus, our tool allows the creation of unique sounds that are in motion and feel alive. These can be used in professional music production without any knowledge of physics, while at the same time providing insight into a chapter of quantum mechanics. The goal is to lower the threshold for entering complex theory by first develo** an intuition for the subject; but the tool can also be used purely as a musical instrument. The user is encouraged, but not forced, to learn more about the underlying physics. Simulation parameters are adjustable in real-time, allowing intuitive experimentation. Despite the approximate calculations, real physical effects such as quantum tunneling can be observed acoustically and visually.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Do** Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
H. Amar Es-sghir,
P. Amedo,
J. Anderson,
D. A. Andrade,
C. Andreopoulos
, et al. (1300 additional authors not shown)
Abstract:
Do** of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first do** test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUN…
▽ More
Do** of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first do** test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 770 t of total liquid argon mass with 410 t of fiducial mass. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected. We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon do** can substantially recover light losses due to contamination of the liquid argon by nitrogen.
△ Less
Submitted 9 February, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Algorithms for $p$-adic Heights on Hyperelliptic Curves of Arbitrary Reduction
Authors:
Francesca Bianchi,
Enis Kaya,
J. Steffen Müller
Abstract:
In this paper, we develop an algorithm for computing Coleman--Gross (and hence Nekovář) $p$-adic heights on hyperelliptic curves over number fields with arbitrary reduction type above $p$. This height is defined as a sum of local heights at each finite place and we use algorithms for Vologodsky integrals, developed by Katz and the second-named author, to compute the local heights above $p$. We als…
▽ More
In this paper, we develop an algorithm for computing Coleman--Gross (and hence Nekovář) $p$-adic heights on hyperelliptic curves over number fields with arbitrary reduction type above $p$. This height is defined as a sum of local heights at each finite place and we use algorithms for Vologodsky integrals, developed by Katz and the second-named author, to compute the local heights above $p$. We also discuss an alternative method to compute these for odd degree genus 2 curves via $p$-adic sigma functions, via work of the first-named author. For both approaches one needs to choose a splitting of the Hodge filtration. A canonical choice for this is due to Blakestad in the case of an odd degree curve of genus $2$ that has semistable ordinary reduction at $p$. We provide an algorithm to compute Blakestad's splitting, which is conjecturally the unit root splitting for the action of Frobenius. We give several numerical examples, including the first worked quadratic Chabauty example in the literature for a curve with bad reduction.
△ Less
Submitted 31 January, 2024;
originally announced February 2024.
-
Two-Dimensional Phase-Fluctuating Superconductivity in Bulk-Crystalline NdO$_{0.5}$F$_{0.5}$BiS$_2$
Authors:
C. S. Chen,
J. Küspert,
I. Biało,
J. Mueller,
K. W. Chen,
M. Y. Zou,
D. G. Mazzone,
D. Bucher,
K. Tanaka,
O. Ivashko,
M. v. Zimmermann,
Qisi Wang,
Lei Shu,
J. Chang
Abstract:
We present a combined growth and transport study of superconducting single-crystalline NdO$_{0.5}$F$_{0.5}$BiS$_2$. Evidence of two-dimensional superconductivity with significant phase fluctuations of preformed Cooper pairs preceding the superconducting transition is reported. This result is based on three key observations. (1) The resistive superconducting transition temperature $T_c$ (defined by…
▽ More
We present a combined growth and transport study of superconducting single-crystalline NdO$_{0.5}$F$_{0.5}$BiS$_2$. Evidence of two-dimensional superconductivity with significant phase fluctuations of preformed Cooper pairs preceding the superconducting transition is reported. This result is based on three key observations. (1) The resistive superconducting transition temperature $T_c$ (defined by resistivity $ρ\rightarrow 0$) increases with increasing disorder. (2) As $T\rightarrow T_c$, the conductivity diverges significantly faster than what is expected from Gaussian fluctuations in two and three dimensions. (3) Non-Ohmic resistance behavior is observed in the superconducting state. Altogether, our observations are consistent with a temperature regime of phase-fluctuating superconductivity. The crystal structure with magnetic ordering tendencies in the NdO$_{0.5}$F$_{0.5}$ layers and (super)conductivity in the BiS$_2$ layers is likely responsible for the two-dimensional phase fluctuations. As such, NdO$_{0.5}$F$_{0.5}$BiS$_2$ falls into the class of unconventional ``laminar" bulk superconductors that include cuprate materials and 4Hb-TaS$_2$.
△ Less
Submitted 24 February, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
Hold Tight: Identifying Behavioral Patterns During Prolonged Work in VR through Video Analysis
Authors:
Verena Biener,
Forouzan Farzinnejad,
Rinaldo Schuster,
Seyedmasih Tabaei,
Leon Lindlein,
**ghui Hu,
Negar Nouri,
John J. Dudley,
Per Ola Kristensson,
Jörg Müller,
Jens Grubert
Abstract:
VR devices have recently been actively promoted as tools for knowledge workers and prior work has demonstrated that VR can support some knowledge worker tasks. However, only a few studies have explored the effects of prolonged use of VR such as a study observing 16 participant working in VR and a physical environment for one work-week each and reporting mainly on subjective feedback. As a nuanced…
▽ More
VR devices have recently been actively promoted as tools for knowledge workers and prior work has demonstrated that VR can support some knowledge worker tasks. However, only a few studies have explored the effects of prolonged use of VR such as a study observing 16 participant working in VR and a physical environment for one work-week each and reporting mainly on subjective feedback. As a nuanced understanding of participants' behavior in VR and how it evolves over time is still missing, we report on the results from an analysis of 559 hours of video material obtained in this prior study. Among other findings, we report that (1) the frequency of actions related to adjusting the headset reduced by 46% and the frequency of actions related to supporting the headset reduced by 42% over the five days; (2) the HMD was removed 31% less frequently over the five days but for 41% longer periods; (3) wearing an HMD is disruptive to normal patterns of eating and drinking, but not to social interactions, such as talking. The combined findings in this work demonstrate the value of long-term studies of deployed VR systems and can be used to inform the design of better, more ergonomic VR systems as tools for knowledge workers.
△ Less
Submitted 29 January, 2024; v1 submitted 26 January, 2024;
originally announced January 2024.
-
Demystifying Chains, Trees, and Graphs of Thoughts
Authors:
Maciej Besta,
Florim Memedi,
Zhenyu Zhang,
Robert Gerstenberger,
Guangyuan Piao,
Nils Blach,
Piotr Nyczyk,
Marcin Copik,
Grzegorz Kwaśniewski,
Jürgen Müller,
Lukas Gianinazzi,
Ales Kubicek,
Hubert Niewiadomski,
Aidan O'Mahony,
Onur Mutlu,
Torsten Hoefler
Abstract:
The field of natural language processing (NLP) has witnessed significant progress in recent years, with a notable focus on improving large language models' (LLM) performance through innovative prompting techniques. Among these, prompt engineering coupled with structures has emerged as a promising paradigm, with designs such as Chain-of-Thought, Tree of Thoughts, or Graph of Thoughts, in which the…
▽ More
The field of natural language processing (NLP) has witnessed significant progress in recent years, with a notable focus on improving large language models' (LLM) performance through innovative prompting techniques. Among these, prompt engineering coupled with structures has emerged as a promising paradigm, with designs such as Chain-of-Thought, Tree of Thoughts, or Graph of Thoughts, in which the overall LLM reasoning is guided by a structure such as a graph. As illustrated with numerous examples, this paradigm significantly enhances the LLM's capability to solve numerous tasks, ranging from logical or mathematical reasoning to planning or creative writing. To facilitate the understanding of this growing field and pave the way for future developments, we devise a general blueprint for effective and efficient LLM reasoning schemes. For this, we conduct an in-depth analysis of the prompt execution pipeline, clarifying and clearly defining different concepts. We then build the first taxonomy of structure-enhanced LLM reasoning schemes. We focus on identifying fundamental classes of harnessed structures, and we analyze the representations of these structures, algorithms executed with these structures, and many others. We refer to these structures as reasoning topologies, because their representation becomes to a degree spatial, as they are contained within the LLM context. Our study compares existing prompting schemes using the proposed taxonomy, discussing how certain design choices lead to different patterns in performance and cost. We also outline theoretical underpinnings, relationships between prompting and other parts of the LLM ecosystem such as knowledge bases, and the associated research challenges. Our work will help to advance future prompt engineering techniques.
△ Less
Submitted 5 April, 2024; v1 submitted 25 January, 2024;
originally announced January 2024.