-
Tsang's resolution enhancement method for imaging with focused illumination
Authors:
Alexander Duplinskiy,
Jernej Frank,
Kaden Bearne,
A. I. Lvovsky
Abstract:
A widely tested approach to overcoming the diffraction limit in microscopy without disturbing the sample relies on substituting widefield sample illumination with a structured light beam. This gives rise to confocal, image-scanning and structured-illumination microscopy methods. On the other hand, as shown recently by Tsang and others, subdiffractional resolution at the detection end of the micros…
▽ More
A widely tested approach to overcoming the diffraction limit in microscopy without disturbing the sample relies on substituting widefield sample illumination with a structured light beam. This gives rise to confocal, image-scanning and structured-illumination microscopy methods. On the other hand, as shown recently by Tsang and others, subdiffractional resolution at the detection end of the microscope can be achieved by replacing the intensity measurement in the image plane with spatial mode demultiplexing. In this work we study the combined action of Tsang's method with image scanning. We experimentally demonstrate superior lateral resolution and enhanced image quality compared to either method alone. This result paves the way for integrating spatial demultiplexing into existing microscopes, contributing to further pushing the boundaries of optical resolution.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
MANTA: A Negative-Triangularity NASEM-Compliant Fusion Pilot Plant
Authors:
MANTA Collaboration,
G. Rutherford,
H. S. Wilson,
A. Saltzman,
D. Arnold,
J. L. Ball,
S. Benjamin,
R. Bielajew,
N. de Boucaud,
M. Calvo-Carrera,
R. Chandra,
H. Choudhury,
C. Cummings,
L. Corsaro,
N. DaSilva,
R. Diab,
A. R. Devitre,
S. Ferry,
S. J. Frank,
C. J. Hansen,
J. Jerkins,
J. D. Johnson,
P. Lunia,
J. van de Lindt,
S. Mackie
, et al. (16 additional authors not shown)
Abstract:
The MANTA (Modular Adjustable Negative Triangularity ARC-class) design study investigated how negative-triangularity (NT) may be leveraged in a compact, fusion pilot plant (FPP) to take a ``power-handling first" approach. The result is a pulsed, radiative, ELM-free tokamak that satisfies and exceeds the FPP requirements described in the 2021 National Academies of Sciences, Engineering, and Medicin…
▽ More
The MANTA (Modular Adjustable Negative Triangularity ARC-class) design study investigated how negative-triangularity (NT) may be leveraged in a compact, fusion pilot plant (FPP) to take a ``power-handling first" approach. The result is a pulsed, radiative, ELM-free tokamak that satisfies and exceeds the FPP requirements described in the 2021 National Academies of Sciences, Engineering, and Medicine report ``Bringing Fusion to the U.S. Grid". A self-consistent integrated modeling workflow predicts a fusion power of 450 MW and a plasma gain of 11.5 with only 23.5 MW of power to the scrape-off layer (SOL). This low $P_\text{SOL}$ together with impurity seeding and high density at the separatrix results in a peak heat flux of just 2.8 MW/m$^{2}$. MANTA's high aspect ratio provides space for a large central solenoid (CS), resulting in ${\sim}$15 minute inductive pulses. In spite of the high B fields on the CS and the other REBCO-based magnets, the electromagnetic stresses remain below structural and critical current density limits. Iterative optimization of neutron shielding and tritium breeding blanket yield tritium self-sufficiency with a breeding ratio of 1.15, a blanket power multiplication factor of 1.11, toroidal field coil lifetimes of $3100 \pm 400$ MW-yr, and poloidal field coil lifetimes of at least $890 \pm 40$ MW-yr. Following balance of plant modeling, MANTA is projected to generate 90 MW of net electricity at an electricity gain factor of ${\sim}2.4$. Systems-level economic analysis estimates an overnight cost of US\$3.4 billion, meeting the NASEM FPP requirement that this first-of-a-kind be less than US\$5 billion. The toroidal field coil cost and replacement time are the most critical upfront and lifetime cost drivers, respectively.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Heuristic-free Verification-inspired Quantum Benchmarking
Authors:
Johannes Frank,
Elham Kashefi,
Dominik Leichtle,
Michael de Oliveira
Abstract:
In this paper, we introduce a new approach to quantum benchmarking inspired by quantum verification motivating new paradigms of quantum benchmarking. Our proposed benchmark not only serves as a robust indicator of computational capability but also offers scalability, customizability, and universality. By providing formal statements regarding the quality of quantum devices while assuming device con…
▽ More
In this paper, we introduce a new approach to quantum benchmarking inspired by quantum verification motivating new paradigms of quantum benchmarking. Our proposed benchmark not only serves as a robust indicator of computational capability but also offers scalability, customizability, and universality. By providing formal statements regarding the quality of quantum devices while assuming device consistency, we eliminate the reliance on heuristics. We establish a deep connection between quantum verification and quantum benchmarking. For practical application, we present a concrete benchmarking protocol derived from a quantum verification protocol, and prove it to match our redefined standards for quantum benchmarking.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Hydrodynamic simulations of WD-WD mergers and the origin of RCB stars
Authors:
Sagiv Shiber,
Orsola De Marco,
Patrick M. Motl,
Bradley Munson,
Dominic C. Marcello,
Juhan Frank,
Patrick Diehl,
Geoffrey C. Clayton,
Bennett N. Skinner,
Hartmut Kaiser,
Gregor Daiss,
Dirk Pfluger,
Jan E. Staff
Abstract:
We study the properties of double white dwarf (DWD) mergers by performing hydrodynamic simulations using the new and improved adaptive mesh refinement code Octo-Tiger. We follow the orbital evolution of DWD systems of mass ratio q=0.7 for tens of orbits until and after the merger to investigate them as a possible origin for R Coronae Borealis (RCB) type stars. We reproduce previous results, findin…
▽ More
We study the properties of double white dwarf (DWD) mergers by performing hydrodynamic simulations using the new and improved adaptive mesh refinement code Octo-Tiger. We follow the orbital evolution of DWD systems of mass ratio q=0.7 for tens of orbits until and after the merger to investigate them as a possible origin for R Coronae Borealis (RCB) type stars. We reproduce previous results, finding that during the merger, the Helium WD donor star is tidally disrupted within 20-80 minutes since the beginning of the simulation onto the accretor Carbon-Oxygen WD, creating a high temperature shell around the accretor. We investigate the possible Helium burning in this shell and the merged object's general structure. Specifically, we are interested in the amount of Oxygen-16 dredged-up to the hot shell and the amount of Oxygen-18 produced. This is critical as the discovery of very low Oxygen-16 to Oxygen-18 ratios in RCB stars pointed out the merger scenario as a favorable explanation for their origin. A small amount of hydrogen in the donor may help keep the Oxygen-16 to Oxygen-18 ratios within observational bounds, even if moderate dredge-up from the accretor occurs. In addition, we perform a resolution study to reconcile the difference found in the amount of Oxygen-16 dredge-up between smoothed-particle hydrodynamics and grid-based simulations.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Search for CP-violating Neutrino Non-Standard Interactions with the NOvA Experiment
Authors:
NOvA Collaboration,
M. A. Acero,
B. Acharya,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
N. Balashov,
P. Baldi,
B. A. Bambah,
A. Bat,
K. Bays,
R. Bernstein,
T. J. C. Bezerra,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
A. C. Booth,
R. Bowles,
B. Brahma
, et al. (182 additional authors not shown)
Abstract:
This Letter reports a search for charge-parity (CP) symmetry violating non-standard interactions (NSI) of neutrinos with matter using the NOvA Experiment, and examines their effects on the determination of the standard oscillation parameters. Data from $ν_μ(\barν_μ)\rightarrowν_μ(\barν_μ)$ and $ν_μ(\barν_μ)\rightarrowν_{e}(\barν_{e})$ oscillation channels are used to measure the effect of the NSI…
▽ More
This Letter reports a search for charge-parity (CP) symmetry violating non-standard interactions (NSI) of neutrinos with matter using the NOvA Experiment, and examines their effects on the determination of the standard oscillation parameters. Data from $ν_μ(\barν_μ)\rightarrowν_μ(\barν_μ)$ and $ν_μ(\barν_μ)\rightarrowν_{e}(\barν_{e})$ oscillation channels are used to measure the effect of the NSI parameters $\varepsilon_{eμ}$ and $\varepsilon_{eτ}$. With 90% C.L. the magnitudes of the NSI couplings are constrained to be $|\varepsilon_{eμ}| \, \lesssim 0.3$ and $|\varepsilon_{eτ}| \, \lesssim 0.4$. A degeneracy at $|\varepsilon_{eτ}| \, \approx 1.8$ is reported, and we observe that the presence of NSI limits sensitivity to the standard CP phase $δ_{\tiny\text{CP}}$.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Human Curriculum Effects Emerge with In-Context Learning in Neural Networks
Authors:
Jacob Russin,
Ellie Pavlick,
Michael J. Frank
Abstract:
Human learning is sensitive to rule-like structure and the curriculum of examples used for training. In tasks governed by succinct rules, learning is more robust when related examples are blocked across trials, but in the absence of such rules, interleaving is more effective. To date, no neural model has simultaneously captured these seemingly contradictory effects. Here we show that this same tra…
▽ More
Human learning is sensitive to rule-like structure and the curriculum of examples used for training. In tasks governed by succinct rules, learning is more robust when related examples are blocked across trials, but in the absence of such rules, interleaving is more effective. To date, no neural model has simultaneously captured these seemingly contradictory effects. Here we show that this same tradeoff spontaneously emerges with ``in-context learning'' (ICL) both in neural networks trained with metalearning and in large language models (LLMs). ICL is the ability to learn new tasks ``in context'' -- without weight changes -- via an inner-loop algorithm implemented in activation dynamics. Experiments with pretrained LLMs and metalearning transformers show that ICL exhibits the blocking advantage demonstrated in humans on a task involving rule-like structure, and conversely, that concurrent in-weight learning reproduces the interleaving advantage observed in humans on tasks lacking such structure.
△ Less
Submitted 12 May, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Transformer Mechanisms Mimic Frontostriatal Gating Operations When Trained on Human Working Memory Tasks
Authors:
Aaron Traylor,
Jack Merullo,
Michael J. Frank,
Ellie Pavlick
Abstract:
Models based on the Transformer neural network architecture have seen success on a wide variety of tasks that appear to require complex "cognitive branching" -- or the ability to maintain pursuit of one goal while accomplishing others. In cognitive neuroscience, success on such tasks is thought to rely on sophisticated frontostriatal mechanisms for selective \textit{gating}, which enable role-addr…
▽ More
Models based on the Transformer neural network architecture have seen success on a wide variety of tasks that appear to require complex "cognitive branching" -- or the ability to maintain pursuit of one goal while accomplishing others. In cognitive neuroscience, success on such tasks is thought to rely on sophisticated frontostriatal mechanisms for selective \textit{gating}, which enable role-addressable updating -- and later readout -- of information to and from distinct "addresses" of memory, in the form of clusters of neurons. However, Transformer models have no such mechanisms intentionally built-in. It is thus an open question how Transformers solve such tasks, and whether the mechanisms that emerge to help them to do so bear any resemblance to the gating mechanisms in the human brain. In this work, we analyze the mechanisms that emerge within a vanilla attention-only Transformer trained on a simple sequence modeling task inspired by a task explicitly designed to study working memory gating in computational cognitive neuroscience. We find that, as a result of training, the self-attention mechanism within the Transformer specializes in a way that mirrors the input and output gating mechanisms which were explicitly incorporated into earlier, more biologically-inspired architectures. These results suggest opportunities for future research on computational similarities between modern AI architectures and models of the human brain.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
The distribution of braid indices of 2-bridge knots
Authors:
Tobias Clark,
Jeremy Frank,
Adam M. Lowrance
Abstract:
In this article we study the braid indices of 2-bridge knots with a fixed crossing number $c$. We show that the average braid index of the set of $2$-bridge knots of crossing number $c$ is asymptotically linear, approaching $\frac{c}{3}+\frac{11}{9}$. Additionally, we show that the variance of the braid indices of the set of $2$-bridge knots of crossing number $c$ is also asymptotically linear, ap…
▽ More
In this article we study the braid indices of 2-bridge knots with a fixed crossing number $c$. We show that the average braid index of the set of $2$-bridge knots of crossing number $c$ is asymptotically linear, approaching $\frac{c}{3}+\frac{11}{9}$. Additionally, we show that the variance of the braid indices of the set of $2$-bridge knots of crossing number $c$ is also asymptotically linear, approaching $\frac{2c}{27} - \frac{10}{81}$. Finally, we find a formula for the number $k_{c,b}$ of $2$-bridge knots with crossing number $c$ and braid index $b$, and show that for any fixed $c$, the braid index where $k_{c,b}$ achieves its maximum is $b=\left\lceil \frac{c}{3}\right\rceil +1$.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
A Representative Study on Human Detection of Artificially Generated Media Across Countries
Authors:
Joel Frank,
Franziska Herbert,
Jonas Ricker,
Lea Schönherr,
Thorsten Eisenhofer,
Asja Fischer,
Markus Dürmuth,
Thorsten Holz
Abstract:
AI-generated media has become a threat to our digital society as we know it. These forgeries can be created automatically and on a large scale based on publicly available technology. Recognizing this challenge, academics and practitioners have proposed a multitude of automatic detection strategies to detect such artificial media. However, in contrast to these technical advances, the human percepti…
▽ More
AI-generated media has become a threat to our digital society as we know it. These forgeries can be created automatically and on a large scale based on publicly available technology. Recognizing this challenge, academics and practitioners have proposed a multitude of automatic detection strategies to detect such artificial media. However, in contrast to these technical advances, the human perception of generated media has not been thoroughly studied yet.
In this paper, we aim at closing this research gap. We perform the first comprehensive survey into people's ability to detect generated media, spanning three countries (USA, Germany, and China) with 3,002 participants across audio, image, and text media. Our results indicate that state-of-the-art forgeries are almost indistinguishable from "real" media, with the majority of participants simply guessing when asked to rate them as human- or machine-generated. In addition, AI-generated media receive is voted more human like across all media types and all countries. To further understand which factors influence people's ability to detect generated media, we include personal variables, chosen based on a literature review in the domains of deepfake and fake news research. In a regression analysis, we found that generalized trust, cognitive reflection, and self-reported familiarity with deepfakes significantly influence participant's decision across all media categories.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Expanding neutrino oscillation parameter measurements in NOvA using a Bayesian approach
Authors:
NOvA Collaboration,
M. A. Acero,
B. Acharya,
P. Adamson,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
N. Balashov,
P. Baldi,
B. A. Bambah,
A. Bat,
K. Bays,
R. Bernstein,
T. J. C. Bezerra,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
A. C. Booth,
R. Bowles,
B. Brahma,
C. Bromberg
, et al. (174 additional authors not shown)
Abstract:
NOvA is a long-baseline neutrino oscillation experiment that measures oscillations in charged-current $ν_μ \rightarrow ν_μ$ (disappearance) and $ν_μ \rightarrow ν_{e}$ (appearance) channels, and their antineutrino counterparts, using neutrinos of energies around 2 GeV over a distance of 810 km. In this work we reanalyze the dataset first examined in our previous paper [Phys. Rev. D 106, 032004 (20…
▽ More
NOvA is a long-baseline neutrino oscillation experiment that measures oscillations in charged-current $ν_μ \rightarrow ν_μ$ (disappearance) and $ν_μ \rightarrow ν_{e}$ (appearance) channels, and their antineutrino counterparts, using neutrinos of energies around 2 GeV over a distance of 810 km. In this work we reanalyze the dataset first examined in our previous paper [Phys. Rev. D 106, 032004 (2022)] using an alternative statistical approach based on Bayesian Markov Chain Monte Carlo. We measure oscillation parameters consistent with the previous results. We also extend our inferences to include the first NOvA measurements of the reactor mixing angle $θ_{13}$ and the Jarlskog invariant. We use these results to quantify the strength of our inferences about CP violation, as well as to examine the effects of constraints from short-baseline measurements of $θ_{13}$ using antineutrinos from nuclear reactors when making NOvA measurements of $θ_{23}$. Our long-baseline measurement of $θ_{13}$ is also shown to be consistent with the reactor measurements, supporting the general applicability and robustness of the PMNS framework for neutrino oscillations.
△ Less
Submitted 27 May, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Evidence for stellar mergers of evolved massive binaries: blue supergiants in the Large Magellanic Cloud
Authors:
Athira Menon,
Andrea Ercolino,
Miguel A. Urbaneja,
Daniel J. Lennon,
Artemio Herrero,
Ryosuke Hirai,
Norbert Langer,
Abel Schootemeijer,
Emmanouil Chatzopoulos,
Juhan Frank,
Sagiv Shiber
Abstract:
Blue supergiants are the brightest stars in their host galaxies and yet their evolutionary status has been a long-standing problem in stellar astrophysics. In this pioneering work, we present a large sample of 59 early B-type supergiants in the Large Magellanic Cloud with newly derived stellar parameters and identify the signatures of stars born from binary mergers among them. We simulate novel 1D…
▽ More
Blue supergiants are the brightest stars in their host galaxies and yet their evolutionary status has been a long-standing problem in stellar astrophysics. In this pioneering work, we present a large sample of 59 early B-type supergiants in the Large Magellanic Cloud with newly derived stellar parameters and identify the signatures of stars born from binary mergers among them. We simulate novel 1D merger models of binaries consisting of supergiants with hydrogen-free cores (primaries) and main-sequence companions (secondaries) and consider the effects of interaction of the secondary with the core of the primary. We follow the evolution of the new-born $16-40$ M$_{\odot}$ stars until core-carbon depletion, close to their final pre-explosion structure. Unlike stars which are born alone, stars born from such stellar mergers are blue throughout their core helium-burning phase and reproduce the surface gravities and Hertzsprung-Russel diagram positions of most of our sample. This indicates that the observed blue supergiants are structurally similar to merger-born stars. Moreover, the large nitrogen-to-carbon and oxygen ratios, and helium enhancements exhibited by at least half our data sample are uniquely consistent with our model predictions, leading us to conclude that a large fraction of blue supergiants are indeed products of binary mergers.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Betelgeuse as a Merger of a Massive Star with a Companion
Authors:
Sagiv Shiber,
Emmanouil Chatzopoulos,
Bradley Munson,
Juhan Frank
Abstract:
We investigate the merger between a 16 solar mass star, on its way to becoming a red supergiant (RSG), and a 4 solar mass main-sequence companion. Our study employs three-dimensional hydrodynamic simulations using the state-of-the-art adaptive mesh refinement code Octo-Tiger. The initially corotating binary undergoes interaction and mass transfer, resulting in the accumulation of mass around the c…
▽ More
We investigate the merger between a 16 solar mass star, on its way to becoming a red supergiant (RSG), and a 4 solar mass main-sequence companion. Our study employs three-dimensional hydrodynamic simulations using the state-of-the-art adaptive mesh refinement code Octo-Tiger. The initially corotating binary undergoes interaction and mass transfer, resulting in the accumulation of mass around the companion and its subsequent loss through the second Lagrangian point (L2). The companion eventually plunges into the envelope of the primary, leading to its spin-up and subsequent merger with the helium core. We examine the internal structural properties of the post-merger star, as well as the merger environment and the outflow driven by the merger. Our findings reveal the ejection of approximately 0.6 solar mass of material in an asymmetric and somewhat bipolar outflow. We import the post-merger stellar structure into the MESA stellar evolution code to model its long-term nuclear evolution. In certain cases, the post-merger star exhibits persistent rapid equatorial surface rotation as it evolves in the H-R diagram towards the observed location of Betelgeuse. These cases demonstrate surface rotation velocities of a similar magnitude to those observed in Betelgeuse, along with a chemical composition resembling that of Betelgeuse. In other cases, efficient rotationally-induced mixing leads to slower surface rotation. This pioneering study aims to model stellar mergers across critical timescales, encompassing dynamical, thermal, and nuclear evolutionary stages.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
From Peptides to Nanostructures: A Euclidean Transformer for Fast and Stable Machine Learned Force Fields
Authors:
J. Thorben Frank,
Oliver T. Unke,
Klaus-Robert Müller,
Stefan Chmiela
Abstract:
Recent years have seen vast progress in the development of machine learned force fields (MLFFs) based on ab-initio reference calculations. Despite achieving low test errors, the reliability of MLFFs in molecular dynamics (MD) simulations is facing growing scrutiny due to concerns about instability over extended simulation timescales. Our findings suggest a potential connection between robustness t…
▽ More
Recent years have seen vast progress in the development of machine learned force fields (MLFFs) based on ab-initio reference calculations. Despite achieving low test errors, the reliability of MLFFs in molecular dynamics (MD) simulations is facing growing scrutiny due to concerns about instability over extended simulation timescales. Our findings suggest a potential connection between robustness to cumulative inaccuracies and the use of equivariant representations in MLFFs, but the computational cost associated with these representations can limit this advantage in practice. To address this, we propose a transformer architecture called SO3krates that combines sparse equivariant representations (Euclidean variables) with a self-attention mechanism that separates invariant and equivariant information, eliminating the need for expensive tensor products. SO3krates achieves a unique combination of accuracy, stability, and speed that enables insightful analysis of quantum properties of matter on extended time and system size scales. To showcase this capability, we generate stable MD trajectories for flexible peptides and supra-molecular structures with hundreds of atoms. Furthermore, we investigate the PES topology for medium-sized chainlike molecules (e.g., small peptides) by exploring thousands of minima. Remarkably, SO3krates demonstrates the ability to strike a balance between the conflicting demands of stability and the emergence of new minimum-energy conformations beyond the training data, which is crucial for realistic exploration tasks in the field of biochemistry.
△ Less
Submitted 16 February, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
Authors:
Lakshmi Narasimhan Govindarajan,
Rex G Liu,
Drew Linsley,
Alekh Karkada Ashok,
Max Reuter,
Michael J Frank,
Thomas Serre
Abstract:
Humans learn by interacting with their environments and perceiving the outcomes of their actions. A landmark in artificial intelligence has been the development of deep reinforcement learning (dRL) algorithms capable of doing the same in video games, on par with or better than humans. However, it remains unclear whether the successes of dRL models reflect advances in visual representation learning…
▽ More
Humans learn by interacting with their environments and perceiving the outcomes of their actions. A landmark in artificial intelligence has been the development of deep reinforcement learning (dRL) algorithms capable of doing the same in video games, on par with or better than humans. However, it remains unclear whether the successes of dRL models reflect advances in visual representation learning, the effectiveness of reinforcement learning algorithms at discovering better policies, or both. To address this question, we introduce the Learning Challenge Diagnosticator (LCD), a tool that separately measures the perceptual and reinforcement learning demands of a task. We use LCD to discover a novel taxonomy of challenges in the Procgen benchmark, and demonstrate that these predictions are both highly reliable and can instruct algorithmic development. More broadly, the LCD reveals multiple failure cases that can occur when optimizing dRL algorithms over entire video game benchmarks like Procgen, and provides a pathway towards more efficient progress.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
On the cyclic homology of certain universal differential graded algebras
Authors:
Christopher Davis,
Julius Frank,
Irakli Patchkoria
Abstract:
Let $p$ be an odd prime and $R$ a $p$-torsion-free commutative $\mathbb{Z}_{(p)}$-algebra. We compute the periodic cyclic homology over $R$ of the universal differential graded algebra $R//p$ which is obtained from $R$ by universally killing $p$. We furthermore compute the cyclic and negative cyclic homologies of $R//p$ over $R$ in infinitely many degrees.
Let $p$ be an odd prime and $R$ a $p$-torsion-free commutative $\mathbb{Z}_{(p)}$-algebra. We compute the periodic cyclic homology over $R$ of the universal differential graded algebra $R//p$ which is obtained from $R$ by universally killing $p$. We furthermore compute the cyclic and negative cyclic homologies of $R//p$ over $R$ in infinitely many degrees.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
Cluster tomography in percolation
Authors:
Helen S. Ansell,
Samuel J. Frank,
István A. Kovács
Abstract:
In cluster tomography, we propose measuring the number of clusters $N$ intersected by a line segment of length $\ell$ across a finite sample. As expected, the leading order of $N(\ell)$ scales as $a\ell$, where $a$ depends on microscopic details of the system. However, at criticality, there is often an additional nonlinearity of the form $b\ln(\ell)$, originating from the endpoints of the line seg…
▽ More
In cluster tomography, we propose measuring the number of clusters $N$ intersected by a line segment of length $\ell$ across a finite sample. As expected, the leading order of $N(\ell)$ scales as $a\ell$, where $a$ depends on microscopic details of the system. However, at criticality, there is often an additional nonlinearity of the form $b\ln(\ell)$, originating from the endpoints of the line segment. By performing large scale Monte Carlo simulations of both 2$d$ and 3$d$ percolation, we find that $b$ is universal and depends only on the angles encountered at the endpoints of the line segment intersecting the sample. Our findings are further supported by analytic arguments in 2$d$, building on results in conformal field theory. Being broadly applicable, cluster tomography can be an efficient tool to detect phase transitions and to characterize the corresponding universality class in classical or quantum systems with a relevant cluster structure.
△ Less
Submitted 9 July, 2023;
originally announced July 2023.
-
Stress and heat flux via automatic differentiation
Authors:
Marcel F. Langer,
J. Thorben Frank,
Florian Knoop
Abstract:
Machine-learning potentials provide computationally efficient and accurate approximations of the Born-Oppenheimer potential energy surface. This potential determines many materials properties and simulation techniques usually require its gradients, in particular forces and stress for molecular dynamics, and heat flux for thermal transport properties. Recently developed potentials feature high body…
▽ More
Machine-learning potentials provide computationally efficient and accurate approximations of the Born-Oppenheimer potential energy surface. This potential determines many materials properties and simulation techniques usually require its gradients, in particular forces and stress for molecular dynamics, and heat flux for thermal transport properties. Recently developed potentials feature high body order and can include equivariant semi-local interactions through message-passing mechanisms. Due to their complex functional forms, they rely on automatic differentiation (AD), overcoming the need for manual implementations or finite-difference schemes to evaluate gradients. This study demonstrates a unified AD approach to obtain forces, stress, and heat flux for such potentials, and provides a model-independent implementation. The method is tested on the Lennard-Jones potential, and then applied to predict cohesive properties and thermal conductivity of tin selenide using an equivariant message-passing neural network potential.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
Passive superresolution imaging of incoherent objects
Authors:
Jernej Frank,
Alexander Duplinskiy,
Kaden Bearne,
A. I. Lvovsky
Abstract:
We investigate Hermite Gaussian Imaging (HGI) -- a novel passive super-resolution technique -- for complex 2D incoherent objects in the sub-Rayleigh regime. The method consists of measuring the field's spatial mode components in the image plane in the overcomplete basis of Hermite-Gaussian modes and their superpositions and subsequently using a deep neural network to reconstruct the object from th…
▽ More
We investigate Hermite Gaussian Imaging (HGI) -- a novel passive super-resolution technique -- for complex 2D incoherent objects in the sub-Rayleigh regime. The method consists of measuring the field's spatial mode components in the image plane in the overcomplete basis of Hermite-Gaussian modes and their superpositions and subsequently using a deep neural network to reconstruct the object from these measurements. We show a three-fold resolution improvement over direct imaging. Our HGI reconstruction retains its superiority even if the same neural network is applied to improve the resolution of direct imaging. This superiority is also preserved in the presence of shot noise. Our findings are the first step towards passive super-resolution imaging protocols in fluorescent microscopy and astronomy.
△ Less
Submitted 19 April, 2023;
originally announced April 2023.
-
Using Cryogenic CMOS Control Electronics To Enable A Two-Qubit Cross-Resonance Gate
Authors:
Devin L. Underwood,
Joseph A. Glick,
Ken Inoue,
David J. Frank,
John Timmerwilke,
Emily Pritchett,
Sudipto Chakraborty,
Kevin Tien,
Mark Yeck,
John F. Bulzacchelli,
Chris Baks,
Pat Rosno,
Raphael Robertazzi,
Matthew Beck,
Rajiv V. Joshi,
Dorothy Wisnieff,
Daniel Ramirez,
Jeff Ruedinger,
Scott Lekuch,
Brian P. Gaucher,
Daniel J. Friedman
Abstract:
Qubit control electronics composed of CMOS circuits are of critical interest for next generation quantum computing systems. A CMOS-based application specific integrated circuit (ASIC) fabricated in 14nm FinFET technology was used to generate and sequence qubit control waveforms and demonstrate a two-qubit cross resonance gate between fixed frequency transmons. The controller was thermally anchored…
▽ More
Qubit control electronics composed of CMOS circuits are of critical interest for next generation quantum computing systems. A CMOS-based application specific integrated circuit (ASIC) fabricated in 14nm FinFET technology was used to generate and sequence qubit control waveforms and demonstrate a two-qubit cross resonance gate between fixed frequency transmons. The controller was thermally anchored to the T = 4K stage of a dilution refrigerator and the measured power was 23 mW per qubit under active control. The chip generated single--side banded output frequencies between 4.5 and 5.5 GHz with a maximum power output of -18 dBm. Randomized benchmarking (RB) experiments revealed an average number of 1.71 instructions per Clifford (IPC) for single-qubit gates, and 17.51 IPC for two-qubit gates. A single-qubit error per gate of $ε_{\text{1Q}}$=8e-4 and two-qubit error per gate of $ε_\text{2Q}$=1.4e-2 is shown. A drive-induced Z-rotation is observed by way of a rotary echo experiment; this observation is consistent with expected qubit behavior given measured excess local oscillator (LO) leakage from the CMOS chip. The effect of spurious drive induced Z-errors is numerically evaluated with a two-qubit model Hamiltonian, and shown to be in good agreement with measured RB data. The modeling results suggest the Z-error varies linearly with pulse amplitude.
△ Less
Submitted 8 December, 2023; v1 submitted 22 February, 2023;
originally announced February 2023.
-
Intensity modulated proton arc therapy via geometry-based energy selection for ependymoma
Authors:
Wenhua Cao,
Yupeng Li,
Xiaodong Zhang,
Falk Poenisch,
Pablo Yepes,
Narayan Sahoo,
David Grosshans,
Susan McGovern,
G. Brandon Gunn,
Steven J. Frank,
Xiaorong R. Zhu
Abstract:
We developed a novel method of creating intensity modulated proton arc therapy (IMPAT) plans that uses computing resources efficiently and may offer a dosimetric benefit for patients with ependymoma or similar tumor geometries. Our IMPAT planning method consists of a geometry-based energy selection step with major scanning spot contributions as inputs computed using ray-tracing and single-Gaussian…
▽ More
We developed a novel method of creating intensity modulated proton arc therapy (IMPAT) plans that uses computing resources efficiently and may offer a dosimetric benefit for patients with ependymoma or similar tumor geometries. Our IMPAT planning method consists of a geometry-based energy selection step with major scanning spot contributions as inputs computed using ray-tracing and single-Gaussian approximation of lateral spot profiles. Based on the geometric relation of scanning spots and dose voxels, our energy selection module selects a minimum set of energy layers at each gantry angle such that each target voxel is covered by sufficient scanning spots as specified by the planner, with dose contributions above the specified threshold. Finally, IMPAT plans are generated by robustly optimizing scanning spots of the selected energy layers using a commercial proton treatment planning system. The IMPAT plan quality was assessed for four ependymoma patients. Reference three-field IMPT plans were created with similar planning objective functions and compared with the IMPAT plans. In all plans, the prescribed dose covered 95% of the clinical target volume (CTV) while maintaining similar maximum doses for the brainstem. While IMPAT and IMPT achieved comparable plan robustness, the IMPAT plans achieved better homogeneity and conformity than the IMPT plans. The IMPAT plans also exhibited higher relative biological effectiveness (RBE) enhancement than did the corresponding reference IMPT plans for the CTV in all four patients and brainstem in three of them. The proposed method demonstrated potential as an efficient technique for IMPAT planning and may offer a dosimetric benefit for patients with ependymoma or tumors in close proximity to critical organs. IMPAT plans created using this method had elevated RBE enhancement associated with increased linear energy transfer.
△ Less
Submitted 23 November, 2022;
originally announced November 2022.
-
Reward-Predictive Clustering
Authors:
Lucas Lehnert,
Michael J. Frank,
Michael L. Littman
Abstract:
Recent advances in reinforcement-learning research have demonstrated impressive results in building algorithms that can out-perform humans in complex tasks. Nevertheless, creating reinforcement-learning systems that can build abstractions of their experience to accelerate learning in new contexts still remains an active area of research. Previous work showed that reward-predictive state abstractio…
▽ More
Recent advances in reinforcement-learning research have demonstrated impressive results in building algorithms that can out-perform humans in complex tasks. Nevertheless, creating reinforcement-learning systems that can build abstractions of their experience to accelerate learning in new contexts still remains an active area of research. Previous work showed that reward-predictive state abstractions fulfill this goal, but have only be applied to tabular settings. Here, we provide a clustering algorithm that enables the application of such state abstractions to deep learning settings, providing compressed representations of an agent's inputs that preserve the ability to predict sequences of reward. A convergence theorem and simulations show that the resulting reward-predictive deep network maximally compresses the agent's inputs, significantly speeding up learning in high dimensional visual control tasks. Furthermore, we present different generalization experiments and analyze under which conditions a pre-trained reward-predictive representation network can be re-used without re-training to accelerate learning -- a form of systematic out-of-distribution transfer.
△ Less
Submitted 6 November, 2022;
originally announced November 2022.
-
A quasi-local inhomogeneous dielectric tensor for arbitrary distribution functions
Authors:
S. J. Frank,
J. C. Wright,
P. T. Bonoli
Abstract:
Treatments of plasma waves usually assume homogeneity, but the parallel gradients ubiquitous in plasmas can modify wave propagation and absorption. We derive a quasilocal inhomogeneous correction to the plasma dielectric for arbitrary distributions by expanding the phase correlation integral and develop a novel integration technique that allows our correction to be applied in many situations and h…
▽ More
Treatments of plasma waves usually assume homogeneity, but the parallel gradients ubiquitous in plasmas can modify wave propagation and absorption. We derive a quasilocal inhomogeneous correction to the plasma dielectric for arbitrary distributions by expanding the phase correlation integral and develop a novel integration technique that allows our correction to be applied in many situations and has greater accuracy than other inhomogeneous dielectric formulas found in the literature. We apply this dielectric tensor to the lower-hybrid current drive problem and demonstrate that inhomogeneous wave dam** does not affect the lower-hybrid wave's linear dam** condition, and in the non-Maxwellian problem dam** and propagation should remain unchanged except in the case of waves with very large phase velocities.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
Distributed, combined CPU and GPU profiling within HPX using APEX
Authors:
Patrick Diehl,
Gregor Daiss,
Kevin Huck,
Dominic Marcello,
Sagiv Shiber,
Hartmut Kaiser,
Juhan Frank,
Geoffrey C. Clayton,
Dirk Pflueger
Abstract:
Benchmarking and comparing performance of a scientific simulation across hardware platforms is a complex task. When the simulation in question is constructed with an asynchronous, many-task (AMT) runtime offloading work to GPUs, the task becomes even more complex. In this paper, we discuss the use of a uniquely suited performance measurement library, APEX, to capture the performance behavior of a…
▽ More
Benchmarking and comparing performance of a scientific simulation across hardware platforms is a complex task. When the simulation in question is constructed with an asynchronous, many-task (AMT) runtime offloading work to GPUs, the task becomes even more complex. In this paper, we discuss the use of a uniquely suited performance measurement library, APEX, to capture the performance behavior of a simulation built on HPX, a highly scalable, distributed AMT runtime. We examine the performance of the astrophysics simulation carried-out by Octo-Tiger on two different supercomputing architectures. We analyze the results of scaling and measurement overheads. In addition, we look in-depth at two similarly configured executions on the two systems to study how architectural differences affect performance and identify opportunities for optimization. As one such opportunity, we optimize the communication for the hydro solver and investigated its performance impact.
△ Less
Submitted 21 September, 2022;
originally announced October 2022.
-
Magnetic field driven dynamics in twisted bilayer artificial spin ice at superlattice angles
Authors:
Rehana Begum Popy,
Julia Frank,
Robert L. Stamps
Abstract:
Geometrical designs of interacting nanomagnets have been studied extensively in the form of two dimensional arrays called artificial spin ice. These systems are usually designed to create geometrical frustration and are of interest for the unusual and often surprising phenomena that can emerge. Advanced lithographic and element growth techniques have enabled the realization of complex designs that…
▽ More
Geometrical designs of interacting nanomagnets have been studied extensively in the form of two dimensional arrays called artificial spin ice. These systems are usually designed to create geometrical frustration and are of interest for the unusual and often surprising phenomena that can emerge. Advanced lithographic and element growth techniques have enabled the realization of complex designs that can involve elements arranged in three dimensions. Using numerical simulations employing the dumbbell approximation, we examine possible magnetic behaviours for bilayer artificial spin ice (BASI) in which the individual layers are rotated with respect to one another. The goal is to understand how magnetization dynamics are affected by long-range dipolar coupling that can be modified by varying the layer separation and layer alignment through rotation. We consider bilayers where the layers are both either square or pinwheel arrangements of islands. Magnetic reversal processes are studied and discussed in terms of domain and domain wall configurations of the magnetic islands. Unusual magnetic ordering is predicted for special angles which define lateral spin superlattices for the bilayer systems.
△ Less
Submitted 3 August, 2022;
originally announced August 2022.
-
The Profiled Feldman-Cousins technique for confidence interval construction in the presence of nuisance parameters
Authors:
M. A. Acero,
B. Acharya,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
A. Bat,
K. Bays,
R. Bernstein,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
A. C. Booth,
R. Bowles
, et al. (196 additional authors not shown)
Abstract:
Measuring observables to constrain models using maximum-likelihood estimation is fundamental to many physics experiments. The Profiled Feldman-Cousins method described here is a potential solution to common challenges faced in constructing accurate confidence intervals: small datasets, bounded parameters, and the need to properly handle nuisance parameters. This method achieves more accurate frequ…
▽ More
Measuring observables to constrain models using maximum-likelihood estimation is fundamental to many physics experiments. The Profiled Feldman-Cousins method described here is a potential solution to common challenges faced in constructing accurate confidence intervals: small datasets, bounded parameters, and the need to properly handle nuisance parameters. This method achieves more accurate frequentist coverage than other methods in use, and is generally applicable to the problem of parameter estimation in neutrino oscillations and similar measurements. We describe an implementation of this method in the context of the NOvA experiment.
△ Less
Submitted 1 August, 2022; v1 submitted 28 July, 2022;
originally announced July 2022.
-
Verifying raytracing/Fokker-Planck lower-hybrid current drive predictions with self-consistent full-wave/Fokker-Planck simulations
Authors:
S. J. Frank,
J. P. Lee,
J. C. Wright,
I. H. Hutchinson,
P. T. Bonoli
Abstract:
Raytracing/Fokker-Planck (FP) simulations used to model lower-hybrid current drive (LHCD) often fail to reproduce experimental results, particularly when LHCD is weakly damped. A proposed reason for this discrepancy is the lack of "full-wave" effects, such as diffraction and interference, in raytracing simulations and the breakdown of raytracing approximation. Previous studies of LHCD using non-Ma…
▽ More
Raytracing/Fokker-Planck (FP) simulations used to model lower-hybrid current drive (LHCD) often fail to reproduce experimental results, particularly when LHCD is weakly damped. A proposed reason for this discrepancy is the lack of "full-wave" effects, such as diffraction and interference, in raytracing simulations and the breakdown of raytracing approximation. Previous studies of LHCD using non-Maxwellian full-wave/FP simulations have been performed, but these simulations were not self-consistent and enforced power conservation between the FP and full-wave code using a numerical rescaling factor. Here we have created a fully-self consistent full-wave/FP model for LHCD that is automatically power conserving. This was accomplished by coupling an overhauled version of the non-Maxwellian TORLH full-wave solver and the CQL3D FP code using the Integrated Plasma Simulator. We performed converged full-wave/FP simulations of Alcator C-Mod discharges and compared them to raytracing. We found that excellent agreement in the power deposition profiles from raytracing and TORLH could be obtained, however, TORLH had somewhat lower current drive efficiency and broader power deposition profiles in some cases. This discrepancy appears to be a result of numerical limitations present in the TORLH model and a small amount of diffractional broadening of the TORLH wave spectrum. Our results suggest full-wave simulation of LHCD is likely not necessary as diffraction and interference represented only a small correction that could not account for the differences between simulations and experiment.
△ Less
Submitted 18 July, 2022;
originally announced July 2022.
-
Radiative pulsed L-mode operation in ARC-class reactors
Authors:
S. J. Frank,
C. J. Perks,
A. O. Nelson,
T. Qian,
S. **,
A. J. Cavallaro,
A. Rutkowski,
A. H. Reiman,
J. P. Freidberg,
P. Rodriguez-Fernandez,
D. G. Whyte
Abstract:
A new ARC-class, highly-radiative, pulsed, L-mode, burning plasma scenario is developed and evaluated as a candidate for future tokamak reactors. Pulsed inductive operation alleviates the stringent current drive requirements of steady-state reactors, and operation in L-mode affords ELM-free access to $\sim90\%$ core radiation fractions, significantly reducing the divertor power handling requiremen…
▽ More
A new ARC-class, highly-radiative, pulsed, L-mode, burning plasma scenario is developed and evaluated as a candidate for future tokamak reactors. Pulsed inductive operation alleviates the stringent current drive requirements of steady-state reactors, and operation in L-mode affords ELM-free access to $\sim90\%$ core radiation fractions, significantly reducing the divertor power handling requirements. In this configuration the fusion power density can be maximized despite L-mode confinement by utilizing high-field to increase plasma densities and current. This allows us to obtain high gain in robust scenarios in compact devices with $P_\mathrm{fus} > 1000\,$MW despite low confinement. We demonstrate the feasibility of such scenarios here; first by showing that they avoid violating 0-D tokamak limits, and then by performing self-consistent integrated simulations of flattop operation including neoclassical and turbulent transport, magnetic equilibrium, and RF current drive models. Finally we examine the potential effect of introducing negative triangularity with a 0-D model. Our results show high-field radiative pulsed L-mode scenarios are a promising alternative to the typical steady state advanced tokamak scenarios which have dominated tokamak reactor development.
△ Less
Submitted 9 September, 2022; v1 submitted 18 July, 2022;
originally announced July 2022.
-
Measurement of the $ν_e-$Nucleus Charged-Current Double-Differential Cross Section at $\left< E_ν \right> = $ 2.4 GeV using NOvA
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
A. C. Booth,
R. Bowles,
B. Brahma,
C. Bromberg
, et al. (190 additional authors not shown)
Abstract:
The inclusive electron neutrino charged-current cross section is measured in the NOvA near detector using $8.02\times10^{20}$ protons-on-target (POT) in the NuMI beam. The sample of GeV electron neutrino interactions is the largest analyzed to date and is limited by $\simeq$ 17\% systematic rather than the $\simeq$ 7.4\% statistical uncertainties. The double-differential cross section in final-sta…
▽ More
The inclusive electron neutrino charged-current cross section is measured in the NOvA near detector using $8.02\times10^{20}$ protons-on-target (POT) in the NuMI beam. The sample of GeV electron neutrino interactions is the largest analyzed to date and is limited by $\simeq$ 17\% systematic rather than the $\simeq$ 7.4\% statistical uncertainties. The double-differential cross section in final-state electron energy and angle is presented for the first time, together with the single-differential dependence on $Q^{2}$ (squared four-momentum transfer) and energy, in the range 1 GeV $ \leq E_ν < $6 GeV. Detailed comparisons are made to the predictions of the GENIE, GiBUU, NEUT, and NuWro neutrino event generators. The data do not strongly favor a model over the others consistently across all three cross sections measured, though some models have especially good or poor agreement in the single differential cross section vs. $Q^{2}$.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
An Assessment Of Full-Wave Effects On Maxwellian Lower-Hybrid Wave Dam**
Authors:
S J Frank,
J C Wright,
I H Hutchinson,
P T Bonoli
Abstract:
Lower-hybrid current drive (LHCD) actuators are important components of modern day fusion experiments as well as proposed fusion reactors. However, simulations of LHCD often differ substantially from experimental results, and from each other, especially in the inferred power deposition profile shape. Here we investigate some possible causes of this discrepancy; "full-wave" effects such as interfer…
▽ More
Lower-hybrid current drive (LHCD) actuators are important components of modern day fusion experiments as well as proposed fusion reactors. However, simulations of LHCD often differ substantially from experimental results, and from each other, especially in the inferred power deposition profile shape. Here we investigate some possible causes of this discrepancy; "full-wave" effects such as interference and diffraction, which are omitted from standard raytracing simulations and the breakdown of the raytracing near reflections and caustics. We compare raytracing simulations to state-of-the-art full-wave simulations using matched hot-plasma dielectric tensors in realistic tokamak scenarios for the first time. We show that differences between full-wave simulations and raytracing in previous work were primarily due to numerical and physical inconsistencies in the simulations, and we demonstrate that good agreement between raytracing and converged full-wave simulations can be obtained in reactor relevant-scenarios with large ray caustics and in situations with weak dam**.
△ Less
Submitted 13 July, 2022; v1 submitted 3 June, 2022;
originally announced June 2022.
-
So3krates: Equivariant attention for interactions on arbitrary length-scales in molecular systems
Authors:
J. Thorben Frank,
Oliver T. Unke,
Klaus-Robert Müller
Abstract:
The application of machine learning methods in quantum chemistry has enabled the study of numerous chemical phenomena, which are computationally intractable with traditional ab-initio methods. However, some quantum mechanical properties of molecules and materials depend on non-local electronic effects, which are often neglected due to the difficulty of modeling them efficiently. This work proposes…
▽ More
The application of machine learning methods in quantum chemistry has enabled the study of numerous chemical phenomena, which are computationally intractable with traditional ab-initio methods. However, some quantum mechanical properties of molecules and materials depend on non-local electronic effects, which are often neglected due to the difficulty of modeling them efficiently. This work proposes a modified attention mechanism adapted to the underlying physics, which allows to recover the relevant non-local effects. Namely, we introduce spherical harmonic coordinates (SPHCs) to reflect higher-order geometric information for each atom in a molecule, enabling a non-local formulation of attention in the SPHC space. Our proposed model So3krates - a self-attention based message passing neural network - uncouples geometric information from atomic features, making them independently amenable to attention mechanisms. Thereby we construct spherical filters, which extend the concept of continuous filters in Euclidean space to SPHC space and serve as foundation for a spherical self-attention mechanism. We show that in contrast to other published methods, So3krates is able to describe non-local quantum mechanical effects over arbitrary length scales. Further, we find evidence that the inclusion of higher-order geometric correlations increases data efficiency and improves generalization. So3krates matches or exceeds state-of-the-art performance on popular benchmarks, notably, requiring a significantly lower number of parameters (0.25 - 0.4x) while at the same time giving a substantial speedup (6 - 14x for training and 2 - 11x for inference) compared to other models.
△ Less
Submitted 9 January, 2023; v1 submitted 27 May, 2022;
originally announced May 2022.
-
Solving Disjunctive Temporal Networks with Uncertainty under Restricted Time-Based Controllability using Tree Search and Graph Neural Networks
Authors:
Kevin Osanlou,
Jeremy Frank,
Andrei Bursuc,
Tristan Cazenave,
Eric Jacopin,
Christophe Guettier,
J. Benton
Abstract:
Planning under uncertainty is an area of interest in artificial intelligence. We present a novel approach based on tree search and graph machine learning for the scheduling problem known as Disjunctive Temporal Networks with Uncertainty (DTNU). Dynamic Controllability (DC) of DTNUs seeks a reactive scheduling strategy to satisfy temporal constraints in response to uncontrollable action durations.…
▽ More
Planning under uncertainty is an area of interest in artificial intelligence. We present a novel approach based on tree search and graph machine learning for the scheduling problem known as Disjunctive Temporal Networks with Uncertainty (DTNU). Dynamic Controllability (DC) of DTNUs seeks a reactive scheduling strategy to satisfy temporal constraints in response to uncontrollable action durations. We introduce new semantics for reactive scheduling: Time-based Dynamic Controllability (TDC) and a restricted subset of TDC, R-TDC. We design a tree search algorithm to determine whether or not a DTNU is R-TDC. Moreover, we leverage a graph neural network as a heuristic for tree search guidance. Finally, we conduct experiments on a known benchmark on which we show R-TDC to retain significant completeness with regard to DC, while being faster to prove. This results in the tree search processing fifty percent more DTNU problems in R-TDC than the state-of-the-art DC solver does in DC with the same time budget. We also observe that graph neural network search guidance leads to substantial performance gains on benchmarks of more complex DTNUs, with up to eleven times more problems solved than the baseline tree search.
△ Less
Submitted 30 March, 2022; v1 submitted 28 March, 2022;
originally announced March 2022.
-
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
Authors:
Joel Frank,
Lea Schönherr
Abstract:
Deep generative modeling has the potential to cause significant harm to society. Recognizing this threat, a magnitude of research into detecting so-called "Deepfakes" has emerged. This research most often focuses on the image domain, while studies exploring generated audio signals have, so-far, been neglected. In this paper we make three key contributions to narrow this gap. First, we provide rese…
▽ More
Deep generative modeling has the potential to cause significant harm to society. Recognizing this threat, a magnitude of research into detecting so-called "Deepfakes" has emerged. This research most often focuses on the image domain, while studies exploring generated audio signals have, so-far, been neglected. In this paper we make three key contributions to narrow this gap. First, we provide researchers with an introduction to common signal processing techniques used for analyzing audio signals. Second, we present a novel data set, for which we collected nine sample sets from five different network architectures, spanning two languages. Finally, we supply practitioners with two baseline models, adopted from the signal processing community, to facilitate further research in this area.
△ Less
Submitted 4 November, 2021;
originally announced November 2021.
-
Measurement of the Double-Differential Muon-neutrino Charged-Current Inclusive Cross Section in the NOvA Near Detector
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
B. Behera,
R. Bernstein,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles
, et al. (181 additional authors not shown)
Abstract:
We report cross-section measurements of the final-state muon kinematics for \numu charged-current interactions in the NOvA near detector using an accumulated 8.09$\times10^{20}$ protons-on-target (POT) in the NuMI beam. We present the results as a double-differential cross section in the observed outgoing muon energy and angle, as well as single-differential cross sections in the derived neutrino…
▽ More
We report cross-section measurements of the final-state muon kinematics for \numu charged-current interactions in the NOvA near detector using an accumulated 8.09$\times10^{20}$ protons-on-target (POT) in the NuMI beam. We present the results as a double-differential cross section in the observed outgoing muon energy and angle, as well as single-differential cross sections in the derived neutrino energy, $E_ν$, and square of the four-momentum transfer, $Q^2$. We compare the results to inclusive cross-section predictions from various neutrino event generators via $χ^2$ calculations using a covariance matrix that accounts for bin-to-bin correlations of systematic uncertainties. These comparisons show a clear discrepancy between the data and each of the tested predictions at forward muon angle and low $Q^2$, indicating a missing suppression of the cross section in current neutrino-nucleus scattering models.
△ Less
Submitted 18 July, 2023; v1 submitted 24 September, 2021;
originally announced September 2021.
-
An Improved Measurement of Neutrino Oscillation Parameters by the NOvA Experiment
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles,
C. Bromberg
, et al. (180 additional authors not shown)
Abstract:
We present new $ν_μ\rightarrowν_e$, $ν_μ\rightarrowν_μ$, $\overlineν_μ\rightarrow\overlineν_e$, and $\overlineν_μ\rightarrow\overlineν_μ$ oscillation measurements by the NOvA experiment, with a 50% increase in neutrino-mode beam exposure over the previously reported results. The additional data, combined with previously published neutrino and antineutrino data, are all analyzed using improved tech…
▽ More
We present new $ν_μ\rightarrowν_e$, $ν_μ\rightarrowν_μ$, $\overlineν_μ\rightarrow\overlineν_e$, and $\overlineν_μ\rightarrow\overlineν_μ$ oscillation measurements by the NOvA experiment, with a 50% increase in neutrino-mode beam exposure over the previously reported results. The additional data, combined with previously published neutrino and antineutrino data, are all analyzed using improved techniques and simulations. A joint fit to the $ν_e$, $ν_μ$, $\overlineν_e$, and $\overlineν_μ$ candidate samples within the 3-flavor neutrino oscillation framework continues to yield a best-fit point in the normal mass ordering and the upper octant of the $θ_{23}$ mixing angle, with $Δm^{2}_{32} = (2.41\pm0.07)\times 10^{-3}$ eV$^2$ and $\sin^2θ_{23} = 0.57^{+0.03}_{-0.04}$. The data disfavor combinations of oscillation parameters that give rise to a large asymmetry in the rates of $ν_e$ and $\overlineν_e$ appearance. This includes values of the CP-violating phase in the vicinity of $δ_\text{CP} = π/2$ which are excluded by $>3σ$ for the inverted mass ordering, and values around $δ_\text{CP} = 3π/2$ in the normal ordering which are disfavored at 2$σ$ confidence.
△ Less
Submitted 8 August, 2022; v1 submitted 18 August, 2021;
originally announced August 2021.
-
Time-based Dynamic Controllability of Disjunctive Temporal Networks with Uncertainty: A Tree Search Approach with Graph Neural Network Guidance
Authors:
Kevin Osanlou,
Jeremy Frank,
J. Benton,
Andrei Bursuc,
Christophe Guettier,
Eric Jacopin,
Tristan Cazenave
Abstract:
Scheduling in the presence of uncertainty is an area of interest in artificial intelligence due to the large number of applications. We study the problem of dynamic controllability (DC) of disjunctive temporal networks with uncertainty (DTNU), which seeks a strategy to satisfy all constraints in response to uncontrollable action durations. We introduce a more restricted, stronger form of controlla…
▽ More
Scheduling in the presence of uncertainty is an area of interest in artificial intelligence due to the large number of applications. We study the problem of dynamic controllability (DC) of disjunctive temporal networks with uncertainty (DTNU), which seeks a strategy to satisfy all constraints in response to uncontrollable action durations. We introduce a more restricted, stronger form of controllability than DC for DTNUs, time-based dynamic controllability (TDC), and present a tree search approach to determine whether or not a DTNU is TDC. Moreover, we leverage the learning capability of a message passing neural network (MPNN) as a heuristic for tree search guidance. Finally, we conduct experiments for which the tree search shows superior results to state-of-the-art timed-game automata (TGA) based approaches. We observe that using an MPNN for tree search guidance leads to a significant increase in solving performance and scalability to harder DTNU problems.
△ Less
Submitted 2 August, 2021;
originally announced August 2021.
-
Octo-Tiger's New Hydro Module and Performance Using HPX+CUDA on ORNL's Summit
Authors:
Patrick Diehl,
Gregor Daiß,
Dominic Marcello,
Kevin Huck,
Sagiv Shiber,
Hartmut Kaiser,
Juhan Frank,
Dirk Pflüger
Abstract:
Octo-Tiger is a code for modeling three-dimensional self-gravitating astrophysical fluids. It was particularly designed for the study of dynamical mass transfer between interacting binary stars. Octo-Tiger is parallelized for distributed systems using the asynchronous many-task runtime system, the C++ standard library for parallelism and concurrency (HPX) and utilizes CUDA for its gravity solver.…
▽ More
Octo-Tiger is a code for modeling three-dimensional self-gravitating astrophysical fluids. It was particularly designed for the study of dynamical mass transfer between interacting binary stars. Octo-Tiger is parallelized for distributed systems using the asynchronous many-task runtime system, the C++ standard library for parallelism and concurrency (HPX) and utilizes CUDA for its gravity solver. Recently, we have remodeled Octo-Tiger's hydro solver to use a three-dimensional reconstruction scheme. In addition, we have ported the hydro solver to GPU using CUDA kernels. We present scaling results for the new hydro kernels on ORNL's Summit machine using a Sedov-Taylor blast wave problem. We also compare Octo-Tiger's new hydro scheme with its old hydro scheme, using a rotating star as a test problem.
△ Less
Submitted 26 July, 2021; v1 submitted 22 July, 2021;
originally announced July 2021.
-
Single-mode input squeezing and tripartite entanglement in three-mode ponderomotive optomechanics simulations
Authors:
Kahlil Y. Dixon,
Lior Cohen,
Narayan Bhusal,
Jesse Frank,
Jonathan P. Dowling,
Thomas Corbitt
Abstract:
Quantum entanglement is a crucial resource for a wide variety of quantum technologies. However, the current state-of-art methods to generate quantum entanglement in optomechanical systems are not as efficient as all-optical methods utilizing nonlinear crystals. This article proposes a new scheme in which two single-mode squeezed light fields are injected into an optomechanical cavity. We demonstra…
▽ More
Quantum entanglement is a crucial resource for a wide variety of quantum technologies. However, the current state-of-art methods to generate quantum entanglement in optomechanical systems are not as efficient as all-optical methods utilizing nonlinear crystals. This article proposes a new scheme in which two single-mode squeezed light fields are injected into an optomechanical cavity. We demonstrate through our numerical simulations that the quantum entanglement can be substantially enhanced with the careful selection of squeezing strength and squeezing angle of the two quadrature squeezed light fields. Our results represent a significant improvement in output bipartite photon-photon entanglement over the previously demonstrated schemes using two coherent light fields as inputs. These simulations predict a maximum increase in bipartite optical entanglement by a factor of about 6, as well as increases in the quantum noise of the output light. A perceived loss of quantum information at certain squeezing angles is attributed to tripartite entanglement between the two optical fields and the optomechanical oscillator (OMO). At particular squeezing angles, the bipartite (or tripartite) entanglement can be increased, thus introducing a method of optically controlling the intracavity entanglement. These mechanics can benefit various optical quantum technologies utilizing optomechanical entanglement and continuous variable quantum optics.
△ Less
Submitted 16 December, 2021; v1 submitted 14 July, 2021;
originally announced July 2021.
-
Extended search for supernova-like neutrinos in NOvA coincident with LIGO/Virgo detections
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles,
C. Bromberg,
N. Buchanan
, et al. (178 additional authors not shown)
Abstract:
A search is performed for supernova-like neutrino interactions coincident with 76 gravitational wave events detected by the LIGO/Virgo Collaboration. For 40 of these events, full readout of the time around the gravitational wave is available from the NOvA Far Detector. For these events, we set limits on the fluence of the sum of all neutrino flavors of $F < 7(4)\times 10^{10}\mathrm{cm}^{-2}$ at 9…
▽ More
A search is performed for supernova-like neutrino interactions coincident with 76 gravitational wave events detected by the LIGO/Virgo Collaboration. For 40 of these events, full readout of the time around the gravitational wave is available from the NOvA Far Detector. For these events, we set limits on the fluence of the sum of all neutrino flavors of $F < 7(4)\times 10^{10}\mathrm{cm}^{-2}$ at 90% C.L. assuming energy and time distributions corresponding to the Garching supernova models with masses 9.6(27)$\mathrm{M}_\odot$. Under the hypothesis that any given gravitational wave event was caused by a supernova, this corresponds to a distance of $r > 29(50)$kpc at 90% C.L. Weaker limits are set for other gravitational wave events with partial Far Detector data and/or Near Detector data.
△ Less
Submitted 23 August, 2021; v1 submitted 10 June, 2021;
originally announced June 2021.
-
Search for active-sterile antineutrino mixing using neutral-current interactions with the NOvA experiment
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles,
C. Bromberg,
N. Buchanan
, et al. (174 additional authors not shown)
Abstract:
This Letter reports results from the first long-baseline search for sterile antineutrinos mixing in an accelerator-based antineutrino-dominated beam. The rate of neutral-current interactions in the two NOvA detectors, at distances of 1 km and 810 km from the beam source, is analyzed using an exposure of $12.51\times10^{20}$ protons-on-target from the NuMI beam at Fermilab running in antineutrino m…
▽ More
This Letter reports results from the first long-baseline search for sterile antineutrinos mixing in an accelerator-based antineutrino-dominated beam. The rate of neutral-current interactions in the two NOvA detectors, at distances of 1 km and 810 km from the beam source, is analyzed using an exposure of $12.51\times10^{20}$ protons-on-target from the NuMI beam at Fermilab running in antineutrino mode. A total of $121$ of neutral-current candidates are observed at the Far Detector, compared to a prediction of $122\pm11$(stat.)$\pm15$(syst.) assuming mixing between three active flavors. No evidence for $\barν_μ\rightarrow\barν_{s}$ oscillation is observed. Interpreting this result within a 3+1 model, constraints are placed on the mixing angles $θ_{24} < 25^{\circ}$ and $θ_{34} < 32^{\circ}$ at the 90% C.L. for $0.05$eV$^{2} \leq Δm^{2}_{41} \leq 0.5$eV$^{2}$, the range of mass splittings that produces no significant oscillations at the Near Detector. These are the first 3+1 confidence limits set using long-baseline accelerator antineutrinos.
△ Less
Submitted 30 September, 2021; v1 submitted 8 June, 2021;
originally announced June 2021.
-
Machine learning on knowledge graphs for context-aware security monitoring
Authors:
Josep Soler Garrido,
Dominik Dold,
Johannes Frank
Abstract:
Machine learning techniques are gaining attention in the context of intrusion detection due to the increasing amounts of data generated by monitoring tools, as well as the sophistication displayed by attackers in hiding their activity. However, existing methods often exhibit important limitations in terms of the quantity and relevance of the generated alerts. Recently, knowledge graphs are finding…
▽ More
Machine learning techniques are gaining attention in the context of intrusion detection due to the increasing amounts of data generated by monitoring tools, as well as the sophistication displayed by attackers in hiding their activity. However, existing methods often exhibit important limitations in terms of the quantity and relevance of the generated alerts. Recently, knowledge graphs are finding application in the cybersecurity domain, showing the potential to alleviate some of these drawbacks thanks to their ability to seamlessly integrate data from multiple domains using human-understandable vocabularies. We discuss the application of machine learning on knowledge graphs for intrusion detection and experimentally evaluate a link-prediction method for scoring anomalous activity in industrial systems. After initial unsupervised training, the proposed method is shown to produce intuitively well-calibrated and interpretable alerts in a diverse range of scenarios, hinting at the potential benefits of relational machine learning on knowledge graphs for intrusion detection purposes.
△ Less
Submitted 18 May, 2021;
originally announced May 2021.
-
Seasonal Variation of Multiple-Muon Cosmic Ray Air Showers Observed in the NOvA Detector on the Surface
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles,
C. Bromberg,
N. Buchanan
, et al. (172 additional authors not shown)
Abstract:
We report the rate of cosmic ray air showers with multiplicities exceeding 15 muon tracks recorded in the NOvA Far Detector between May 2016 and May 2018. The detector is located on the surface under an overburden of 3.6 meters water equivalent. We observe a seasonal dependence in the rate of multiple-muon showers, which varies in magnitude with multiplicity and zenith angle. During this period, t…
▽ More
We report the rate of cosmic ray air showers with multiplicities exceeding 15 muon tracks recorded in the NOvA Far Detector between May 2016 and May 2018. The detector is located on the surface under an overburden of 3.6 meters water equivalent. We observe a seasonal dependence in the rate of multiple-muon showers, which varies in magnitude with multiplicity and zenith angle. During this period, the effective atmospheric temperature and surface pressure ranged between 210 K to 230 K and 940mbar to 990mbar, respectively; the shower rates are anti-correlated with the variation in the effective temperature. The variations are about 30% larger for the highest multiplicities than the lowest multiplicities and 20% larger for showers near the horizon than vertical showers.
△ Less
Submitted 13 July, 2021; v1 submitted 9 May, 2021;
originally announced May 2021.
-
Learning Neural Network Quantum States with the Linear Method
Authors:
J. Thorben Frank,
Michael J. Kastoryano
Abstract:
Due to the strong correlations present in quantum systems, classical machine learning algorithms like stochastic gradient descent are often insufficient for the training of neural network quantum states (NQSs). These difficulties can be overcome by using physically inspired learning algorithm, the most prominent of which is the stochastic reconfiguration (SR) which mimics imaginary time evolution.…
▽ More
Due to the strong correlations present in quantum systems, classical machine learning algorithms like stochastic gradient descent are often insufficient for the training of neural network quantum states (NQSs). These difficulties can be overcome by using physically inspired learning algorithm, the most prominent of which is the stochastic reconfiguration (SR) which mimics imaginary time evolution. Here we explore an alternative algorithms for the optimization of complex valued NQSs based on the linear method (LM), and present the explicit formulation in terms of complex valued parameters. Beyond the theoretical formulation, we present numerical evidence that the LM can be used successfully for the optimization of complex valued NQSs, to our knowledge for the first time. We compare the LM to the state-of-the-art SR algorithm and find that the LM requires up to an order of magnitude fewer iterations for convergence, albeit at a higher cost per epoch. We further demonstrate that the LM becomes the more efficient training algorithm whenever the cost of sampling is high. This advantage, however, comes at the price of a larger variance.
△ Less
Submitted 22 April, 2021;
originally announced April 2021.
-
Alpha Buckets in Longitudinal Phase Space: a Bifurcation Analysis
Authors:
Jernej Frank,
Tom Mertens,
Markus Ries
Abstract:
At HZB's BESSY II and PTB's Metrology Light Source (MLS) facilities we have the ability to tune the momentum compaction factor $α$ up to second non-linear order. The non-linear dependence $α(δ)$ brings qualitative changes to the longitudinal phase space and introduces new fix points $α(δ)=0$ which produce the so-called $α$-buckets. We present with this paper an analysis of this phenomena from the…
▽ More
At HZB's BESSY II and PTB's Metrology Light Source (MLS) facilities we have the ability to tune the momentum compaction factor $α$ up to second non-linear order. The non-linear dependence $α(δ)$ brings qualitative changes to the longitudinal phase space and introduces new fix points $α(δ)=0$ which produce the so-called $α$-buckets. We present with this paper an analysis of this phenomena from the standpoint of bifurcation theory. With this approach we were able to characterize the nature of the fix points and their position in direct dependence on the tunable parameters. Furthermore, we are able to place stringent conditions onto the tunable parameters to either create or destroy $α$-buckets.
△ Less
Submitted 27 April, 2021; v1 submitted 16 April, 2021;
originally announced April 2021.
-
[RE] CNN-generated images are surprisingly easy to spot...for now
Authors:
Joel Frank,
Thorsten Holz
Abstract:
This work evaluates the reproducibility of the paper "CNN-generated images are surprisingly easy to spot... for now" by Wang et al. published at CVPR 2020. The paper addresses the challenge of detecting CNN-generated imagery, which has reached the potential to even fool humans. The authors propose two methods which help an image classifier to generalize from being trained on one specific CNN to de…
▽ More
This work evaluates the reproducibility of the paper "CNN-generated images are surprisingly easy to spot... for now" by Wang et al. published at CVPR 2020. The paper addresses the challenge of detecting CNN-generated imagery, which has reached the potential to even fool humans. The authors propose two methods which help an image classifier to generalize from being trained on one specific CNN to detecting imagery produced by unseen architectures, training methods, or data sets. The paper proposes two methods to help a classifier generalize: (i) utilizing different kinds of data augmentations and (ii) using a diverse data set. This report focuses on assessing if these techniques indeed help the generalization process. Furthermore, we perform additional experiments to study the limitations of the proposed techniques.
△ Less
Submitted 7 April, 2021;
originally announced April 2021.
-
R Coronae Borealis Star Evolution: Simulating 3D Merger Events to 1D Stellar Evolution Including Large Scale Nucleosynthesis
Authors:
Bradley Munson,
Emmanouil Chatzopoulos,
Juhan Frank,
Geoffrey C. Clayton,
Courtney L. Crawford,
Pavel A. Denissenkov,
Falk Herwig
Abstract:
R Coronae Borealis (RCB) stars are rare hydrogen-deficient carbon-rich variable supergiants thought to be the result of dynamically unstable white dwarf mergers. We attempt to model RCBs through all the relevant timescales by simulating a merger event in Octo-tiger, a 3D adaptive mesh refinement (AMR) hydrodynamics code and map** the post-merger object into MESA, a 1D stellar evolution code. We…
▽ More
R Coronae Borealis (RCB) stars are rare hydrogen-deficient carbon-rich variable supergiants thought to be the result of dynamically unstable white dwarf mergers. We attempt to model RCBs through all the relevant timescales by simulating a merger event in Octo-tiger, a 3D adaptive mesh refinement (AMR) hydrodynamics code and map** the post-merger object into MESA, a 1D stellar evolution code. We then post-process the nucleosynthesis on a much larger nuclear reaction network to study the enhancement of s-process elements. We present models that match observations or previous studies in most surface abundances, isotopic ratios, early evolution and lifetimes. We also observe similar mixing behavior as previous modeling attempts which result in the partial He-burning products visible on the surface in observations. However, we do note that our sub-solar models lack any enhancement in s-process elements, which we attribute to a lack of hydrogen in the envelope. We also find that the Oxygen-16/Oxygen-18 isotopic ratio is very sensitive to initial hydrogen abundance and increases outside of the acceptable range with a hydrogen mass fraction greater than $10^{-4}$.
△ Less
Submitted 2 March, 2021;
originally announced March 2021.
-
Dompteur: Taming Audio Adversarial Examples
Authors:
Thorsten Eisenhofer,
Lea Schönherr,
Joel Frank,
Lars Speckemeier,
Dorothea Kolossa,
Thorsten Holz
Abstract:
Adversarial examples seem to be inevitable. These specifically crafted inputs allow attackers to arbitrarily manipulate machine learning systems. Even worse, they often seem harmless to human observers. In our digital society, this poses a significant threat. For example, Automatic Speech Recognition (ASR) systems, which serve as hands-free interfaces to many kinds of systems, can be attacked with…
▽ More
Adversarial examples seem to be inevitable. These specifically crafted inputs allow attackers to arbitrarily manipulate machine learning systems. Even worse, they often seem harmless to human observers. In our digital society, this poses a significant threat. For example, Automatic Speech Recognition (ASR) systems, which serve as hands-free interfaces to many kinds of systems, can be attacked with inputs incomprehensible for human listeners. The research community has unsuccessfully tried several approaches to tackle this problem. In this paper we propose a different perspective: We accept the presence of adversarial examples against ASR systems, but we require them to be perceivable by human listeners. By applying the principles of psychoacoustics, we can remove semantically irrelevant information from the ASR input and train a model that resembles human perception more closely. We implement our idea in a tool named DOMPTEUR and demonstrate that our augmented system, in contrast to an unmodified baseline, successfully focuses on perceptible ranges of the input signal. This change forces adversarial examples into the audible range, while using minimal computational overhead and preserving benign performance. To evaluate our approach, we construct an adaptive attacker that actively tries to avoid our augmentations and demonstrate that adversarial examples from this attacker remain clearly perceivable. Finally, we substantiate our claims by performing a hearing test with crowd-sourced human listeners.
△ Less
Submitted 3 June, 2021; v1 submitted 10 February, 2021;
originally announced February 2021.
-
Performance Measurements within Asynchronous Task-based Runtime Systems: A Double White Dwarf Merger as an Application
Authors:
Patrick Diehl,
Dominic Marcello,
Parsa Amini,
Hartmut Kaiser,
Sagiv Shiber,
Geoffrey C. Clayton,
Juhan Frank,
Gregor Daiß,
Dirk Pflüger,
David Eder,
Alice Koniges,
Kevin Huck
Abstract:
Analyzing performance within asynchronous many-task-based runtime systems is challenging because millions of tasks are launched concurrently. Especially for long-term runs the amount of data collected becomes overwhelming. We study HPX and its performance-counter framework and APEX to collect performance data and energy consumption. We added HPX application-specific performance counters to the Oct…
▽ More
Analyzing performance within asynchronous many-task-based runtime systems is challenging because millions of tasks are launched concurrently. Especially for long-term runs the amount of data collected becomes overwhelming. We study HPX and its performance-counter framework and APEX to collect performance data and energy consumption. We added HPX application-specific performance counters to the Octo-Tiger full 3D AMR astrophysics application. This enables the combined visualization of physical and performance data to highlight bottlenecks with respect to different solvers. We examine the overhead introduced by these measurements, which is around 1%, with respect to the overall application runtime. We perform a convergence study for four different levels of refinement and analyze the application's performance with respect to adaptive grid refinement. The measurements' overheads are small, enabling the combined use of performance data and physical properties with the goal of improving the code's performance. All of these measurements were obtained on NERSC's Cori, Louisiana Optical Network Infrastructure's QueenBee2, and Indiana University's Big Red 3.
△ Less
Submitted 9 June, 2021; v1 submitted 30 January, 2021;
originally announced February 2021.
-
The Work of Art in an Age of Mechanical Generation
Authors:
Steven J. Frank
Abstract:
Can we define what it means to be "creative," and if so, can our definition drive artificial intelligence (AI) systems to feats of creativity indistinguishable from human efforts? This mixed question is considered from technological and social perspectives. Beginning with an exploration of the value we attach to authenticity in works of art, the article considers the ability of AI to detect forger…
▽ More
Can we define what it means to be "creative," and if so, can our definition drive artificial intelligence (AI) systems to feats of creativity indistinguishable from human efforts? This mixed question is considered from technological and social perspectives. Beginning with an exploration of the value we attach to authenticity in works of art, the article considers the ability of AI to detect forgeries of renowned paintings and, in so doing, somehow reveal the quiddity of a work of art. We conclude by considering whether evolving technical capability can revise traditional relationships among art, artist, and the market.
△ Less
Submitted 10 August, 2022; v1 submitted 27 January, 2021;
originally announced January 2021.
-
Octo-Tiger: A New, 3D Hydrodynamic Code for Stellar Mergers that uses HPX Parallelisation
Authors:
Dominic C. Marcello,
Sagiv Shiber,
Orsola De Marco,
Juhan Frank,
Geoffrey C. Clayton,
Patrick M. Motl,
Patrick Diehl,
Hartmut Kaiser
Abstract:
OCTO-TIGER is an astrophysics code to simulate the evolution of self-gravitating and rotat-ing systems of arbitrary geometry based on the fast multipole method, using adaptive mesh refinement. OCTO-TIGER is currently optimised to simulate the merger of well-resolved stars that can be approximated by barotropic structures, such as white dwarfs or main sequence stars. The gravity solver conserves an…
▽ More
OCTO-TIGER is an astrophysics code to simulate the evolution of self-gravitating and rotat-ing systems of arbitrary geometry based on the fast multipole method, using adaptive mesh refinement. OCTO-TIGER is currently optimised to simulate the merger of well-resolved stars that can be approximated by barotropic structures, such as white dwarfs or main sequence stars. The gravity solver conserves angular momentum to machine precision, thanks to a correction algorithm. This code uses HPX parallelization, allowing the overlap of work and communication and leading to excellent scaling properties, allowing for the computation of large problems in reasonable wall-clock times. In this paper, we investigate the code performance and precision by running benchmarking tests. These include simple problems, such as the Sod shock tube, as well as sophisticated, full, white-dwarf binary simulations. Results are compared to analytic solutions, when known, and to other grid based codes such as FLASH. We also compute the interaction between two white dwarfs from the early mass transfer through to the merger and compare with past simulations of similar systems. We measure OCTO-TIGERs scaling properties up to a core count of 80,000, showing excellent performance for large problems. Finally, we outline the current and planned areas of development aimed at tackling a number of physical phenomena connected to observations of transients.
△ Less
Submitted 10 August, 2021; v1 submitted 20 January, 2021;
originally announced January 2021.
-
Disruption Avoidance via RF Current Condensation in Magnetic Islands Produced by Off-Normal Events
Authors:
A. H. Reiman,
N. Bertelli,
P. T. Bonoli,
N. J. Fisch,
S. J. Frank,
S. **,
R. Nies,
E. Rodriguez
Abstract:
As tokamaks are designed and built with increasing levels of stored energy in the plasma, disruptions become increasingly dangerous. It has been reported that 95% of the disruptions in the Joint European Torus (JET) tokamak with the ITER-like wall are preceded by the growth of large locked islands, and these large islands are mostly produced by off-normal events other than neoclassical tearing mod…
▽ More
As tokamaks are designed and built with increasing levels of stored energy in the plasma, disruptions become increasingly dangerous. It has been reported that 95% of the disruptions in the Joint European Torus (JET) tokamak with the ITER-like wall are preceded by the growth of large locked islands, and these large islands are mostly produced by off-normal events other than neoclassical tearing modes. This paper discusses the use of RF current drive to stabilize large islands, focusing on nonlinear effects that appear when relatively high powers are used to stabilize large islands. An RF current condensation effect can concentrate the RF driven current near the center of the island, increasing the efficiency of the stabilization. A nonlinear shadowing effect can hinder the stabilization of islands if the aiming of the ray trajectories does not properly consider the nonlinear effects.
△ Less
Submitted 30 December, 2020;
originally announced December 2020.