-
Optimal input excitations for suppressing nonlinear instabilities in multimode fibers
Authors:
Kabish Wisal,
Chun-Wei Chen,
Zeyu Kuang,
Owen D. Miller,
Hui Cao,
A. Douglas Stone
Abstract:
Wavefront shaping has become a powerful tool for manipulating light propagation in various complex media undergoing linear scattering. Controlling nonlinear optical interactions with spatial degrees of freedom is a relatively recent but growing area of research. A wavefront-shaping-based approach can be used to suppress nonlinear stimulated Brillouin scattering (SBS) and transverse mode instability (TMI), which are the two main limitations to power scaling in high-power narrowband fiber amplifiers. Here we formulate both SBS and TMI suppression as optimization problems with respect to coherent multimode input excitation in a given multimode fiber. We develop an efficient method for finding the globally optimal input excitation for SBS and TMI suppression using linear programming. We theoretically show that optimally exciting a standard multimode fiber leads to roughly an order of magnitude enhancement in output power limited by SBS and TMI, compared to fundamental-mode-only excitation. We find that the optimal mode content is robust to small perturbations and our approach works even in the presence of mode-dependent loss and gain. Optimal mode content can be excited in real experiments using spatial light modulators, creating a novel platform for instability-free ultrahigh-power fiber lasers.
Submitted 6 July, 2024;
originally announced July 2024.
-
Curved detectors for future X-ray astrophysics missions
Authors:
Eric D. Miller,
James A. Gregory,
Marshall W. Bautz,
Harry R. Clark,
Michael Cooper,
Kevan Donlon,
Richard F. Foster,
Catherine E. Grant,
Mallory Jensen,
Beverly LaMarr,
Renee Lambert,
Christopher Leitz,
Andrew Malonis,
Mo Neak,
Gregory Prigozhin,
Kevin Ryu,
Benjamin Schneider,
Keith Warner,
Douglas J. Young,
William W. Zhang
Abstract:
Future X-ray astrophysics missions will survey large areas of the sky with unparalleled sensitivity, enabled by lightweight, high-resolution optics. These optics inherently produce curved focal surfaces with radii as small as 2 m, requiring a large area detector system that closely conforms to the curved focal surface. We have embarked on a project using a curved charge-coupled device (CCD) detector technology developed at MIT Lincoln Laboratory to provide large-format, curved detectors for such missions, improving performance and simplifying design. We present the current status of this work, which aims to curve back-illuminated, large-format (5 cm x 4 cm) CCDs to 2.5-m radius and confirm X-ray performance. We detail the design of fixtures and the curving process, and present initial results on curving bare silicon samples and monitor devices and characterizing the surface geometric accuracy. The tests meet our accuracy requirement of <5 $μ$m RMS surface non-conformance for samples of similar thickness to the functional detectors. We finally show X-ray performance measurements of planar CCDs that will serve as a baseline to evaluate the curved detectors. The detectors exhibit low noise, good charge-transfer efficiency, and excellent, uniform spectroscopic performance, including in the important soft X-ray band.
Submitted 26 June, 2024;
originally announced June 2024.
-
Property-Based Testing by Elaborating Proof Outlines
Authors:
Dale Miller,
Alberto Momigliano
Abstract:
Property-based testing (PBT) is a technique for validating code against an executable specification by automatically generating test data. We present a proof-theoretical reconstruction of this style of testing for relational specifications and employ the Foundational Proof Certificate framework to describe test generators. We do this by encoding certain kinds of ``proof outlines'' as proof certificates that can describe various common generation strategies in the PBT literature, ranging from random to exhaustive, including their combination. We also address the shrinking of counterexamples as a first step toward their explanation. Once generation is accomplished, the testing phase is a standard logic programming search. After illustrating our techniques on simple, first-order (algebraic) data structures, we lift them to data structures containing bindings by using the $λ$-tree syntax approach to encode bindings. The $λ$Prolog programming language can perform both generating and checking of tests using this approach to syntax. We then further extend PBT to specifications in a fragment of linear logic. Under consideration in Theory and Practice of Logic Programming (TPLP).
Submitted 14 June, 2024;
originally announced June 2024.
-
On Trojans in Refined Language Models
Authors:
Jayaram Raghuram,
George Kesidis,
David J. Miller
Abstract:
A Trojan in a language model can be inserted when the model is refined for a particular application, such as determining the sentiment of product reviews. In this paper, we clarify and empirically explore variations of the data-poisoning threat model. We then empirically assess two simple defenses, each for a different defense scenario. Finally, we provide a brief survey of related attacks and defenses.
Submitted 11 June, 2024;
originally announced June 2024.
-
Open-Vocabulary Part-Based Grasping
Authors:
Tjeard van Oort,
Dimity Miller,
Will N. Browne,
Nicolas Marticorena,
Jesse Haviland,
Niko Suenderhauf
Abstract:
Many robotic applications require grasping objects not arbitrarily but at a very specific object part. This is especially important for manipulation tasks beyond simple pick-and-place scenarios or in robot-human interactions, such as object handovers. We propose AnyPart, a practical system that combines open-vocabulary object detection, open-vocabulary part segmentation and 6DOF grasp pose prediction to infer a grasp pose on a specific part of an object in 800 milliseconds. We contribute two new datasets for the task of open-vocabulary part-based grasping: a hand-segmented dataset containing 1014 object-part segmentations, and a dataset of real-world scenarios gathered during our robot trials for individual objects and table-clearing tasks. We evaluate AnyPart on a mobile manipulator robot using a set of 28 common household objects over 360 grasping trials. AnyPart is capable of producing successful grasps 69.52 % of the time; when ignoring robot-based grasp failures, AnyPart predicts a grasp location on the correct part 88.57 % of the time.
Submitted 9 June, 2024;
originally announced June 2024.
-
Testing Sign Congruence Between Two Parameters
Authors:
Douglas L. Miller,
Francesca Molinari,
Jörg Stoye
Abstract:
We test the null hypothesis that two parameters $(μ_1,μ_2)$ have the same sign, assuming that (asymptotically) normal estimators $(\hat{μ}_1,\hat{μ}_2)$ are available. Examples of this problem include the analysis of heterogeneous treatment effects, causal interpretation of reduced-form estimands, meta-studies, and mediation analysis. A number of tests were recently proposed. We recommend a test that is simple and rejects more often than many of these recent proposals. Like all other tests in the literature, it is conservative if the truth is near $(0,0)$ and therefore also biased. To clarify whether these features are avoidable, we also provide a test that is unbiased and has exact size control on the boundary of the null hypothesis, but which has counterintuitive properties and hence we do not recommend. We use the test to improve p-values in Kowalski (2022) from information contained in that paper's main text and to establish statistical significance of some key estimates in Dippel et al. (2021).
Submitted 12 June, 2024; v1 submitted 19 May, 2024;
originally announced May 2024.
-
Challenges and Opportunities for Large-Scale Exploration with Air-Ground Teams using Semantics
Authors:
Fernando Cladera,
Ian D. Miller,
Zachary Ravichandran,
Varun Murali,
Jason Hughes,
M. Ani Hsieh,
C. J. Taylor,
Vijay Kumar
Abstract:
One common and desirable application of robots is exploring potentially hazardous and unstructured environments. Air-ground collaboration offers a synergistic approach to addressing such exploration challenges. In this paper, we demonstrate a system for large-scale exploration using a team of aerial and ground robots. Our system uses semantics as a lingua franca, and relies on fully opportunistic communications. We highlight the unique challenges from this approach, explain our system architecture and showcase lessons learned during our experiments. All our code is open-source, encouraging researchers to use it and build upon it.
Submitted 12 May, 2024;
originally announced May 2024.
-
Advancing Precision Particle Background Estimation for Future X-ray Missions: Correlated Variability between AMS and Chandra/XMM-Newton
Authors:
Arnab Sarkar,
Catherine E. Grant,
Eric D. Miller,
Mark Bautz,
Benjamin Schneider,
Rick F. Foster,
Gerrit Schellenberger,
Steven Allen,
Ralph P. Kraft,
Dan Wilkins,
Abe Falcone,
Andrew Ptak
Abstract:
Galactic cosmic ray (GCR) particles have a significant impact on the particle-induced background of X-ray observatories, and their flux exhibits substantial temporal variability, potentially influencing background levels. In this study, we present one-day binned high-energy reject rates derived from the Chandra-ACIS and XMM-Newton EPIC-pn instruments, serving as proxies for GCR particle flux. We systematically analyze the ACIS and EPIC-pn reject rates and compare them with the AMS proton flux. Our analysis initially reveals robust correlations between the AMS proton flux and the ACIS/EPIC-pn reject rates when binned over 27-day intervals. However, a closer examination reveals substantial fluctuations within each 27-day bin, indicating shorter-term variability. Upon daily binning, we observe finer temporal structures in the datasets, demonstrating the presence of recurrent variations with periods of $\sim$25 days and 23 days in ACIS and EPIC-pn reject rates, respectively, spanning the years 2014 to 2018. Notably, during the 2016--2017 period, we additionally detect periodicities of $\sim$13.5 days and 9 days in the ACIS and EPIC-pn reject rates, respectively. Intriguingly, we observe a time lag of $\sim$6 days between the AMS proton flux and the ACIS/EPIC-pn reject rates during the second half of 2016. This time lag is not visible before 2016 and after 2017. The underlying physical mechanisms responsible for this time lag remain a subject of ongoing investigation.
Submitted 10 May, 2024;
originally announced May 2024.
-
The Simons Observatory: Design, integration, and testing of the small aperture telescopes
Authors:
Nicholas Galitzki,
Tran Tsan,
Jake Spisak,
Michael Randall,
Max Silva-Feaver,
Joseph Seibert,
Jacob Lashner,
Shunsuke Adachi,
Sean M. Adkins,
Thomas Alford,
Kam Arnold,
Peter C. Ashton,
Jason E. Austermann,
Carlo Baccigalupi,
Andrew Bazarko,
James A. Beall,
Sanah Bhimani,
Bryce Bixler,
Gabriele Coppi,
Lance Corbett,
Kevin D. Crowley,
Kevin T. Crowley,
Samuel Day-Weiss,
Simon Dicker,
Peter N. Dow
, et al. (55 additional authors not shown)
Abstract:
The Simons Observatory (SO) is a cosmic microwave background (CMB) survey experiment that includes small-aperture telescopes (SATs) observing from an altitude of 5,200 m in the Atacama Desert in Chile. The SO SATs will cover six spectral bands between 27 and 280 GHz to search for primordial B-modes to a sensitivity of $σ(r)=0.002$, with quantified systematic errors well below this value. Each SAT is a self-contained cryogenic telescope with a 35$^\circ$ field of view, 42 cm diameter optical aperture, 40 K half-wave plate, 1 K refractive optics, and $<0.1$ K focal plane that holds $>12,000$ TES detectors. We describe the nominal design of the SATs and present details about the integration and testing for one operating at 93 and 145 GHz.
Submitted 10 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Unlearning Backdoor Attacks through Gradient-Based Model Pruning
Authors:
Kealan Dunnett,
Reza Arablouei,
Dimity Miller,
Volkan Dedeoglu,
Raja Jurdak
Abstract:
In the era of increasing concerns over cybersecurity threats, defending against backdoor attacks is paramount in ensuring the integrity and reliability of machine learning models. However, many existing approaches require substantial amounts of data for effective mitigation, posing significant challenges in practical deployment. To address this, we propose a novel approach to counter backdoor attacks by treating their mitigation as an unlearning task. We tackle this challenge through a targeted model pruning strategy, leveraging unlearning loss gradients to identify and eliminate backdoor elements within the model. Built on solid theoretical insights, our approach offers simplicity and effectiveness, rendering it well-suited for scenarios with limited data availability. Our methodology includes formulating a suitable unlearning loss and devising a model-pruning technique tailored for convolutional neural networks. Comprehensive evaluations demonstrate the efficacy of our proposed approach compared to state-of-the-art approaches, particularly in realistic data settings.
Submitted 6 May, 2024;
originally announced May 2024.
-
An Interdisciplinary Perspective of the Built-Environment Microbiome
Authors:
John S. McAlister,
Michael J. Blum,
Yana Bromberg,
Nina H. Fefferman,
Qiang He,
Eric Lofgren,
Debra L. Miller,
Courtney Schreiner,
K. Selcuk Candan,
Heather Szabo-Rogers,
J. Michael Reed
Abstract:
The built environment provides an excellent setting for interdisciplinary research on the dynamics of microbial communities. The system is simplified compared to many natural settings, and to some extent the entire environment can be manipulated, from architectural design, to materials use, air flow, human traffic, and capacity to disrupt microbial communities through cleaning. Here we provide an overview of the ecology of the microbiome in the built environment. We address niche space and refugia, population and community (metagenomic) dynamics, spatial ecology within a building, including the major microbial transmission mechanisms, as well as evolution. We also address the landscape ecology connecting microbiomes between physically separated buildings. At each stage we pay particular attention to the actual and potential interface between disciplines, such as ecology, epidemiology, materials science, and human social behavior. We end by identifying some opportunities for future interdisciplinary research on the microbiome of the built environment.
Submitted 4 May, 2024;
originally announced May 2024.
-
Curvature of Gaussian quantum states
Authors:
Harry J. D. Miller
Abstract:
The space of quantum states can be endowed with a metric structure using the second order derivatives of the relative entropy, giving rise to the so-called Kubo-Mori-Bogoliubov inner product. We explore its geometric properties on the submanifold of faithful, zero-displacement Gaussian states parameterised by their covariance matrices, deriving expressions for the geodesic equations, curvature tensors and scalar curvature. Our analysis suggests that the curvature of the manifold is strictly monotonic with respect to the von Neumann entropy, and thus can be interpreted as a measure of state uncertainty. This provides supporting evidence for the Petz conjecture in continuous variable systems.
Submitted 15 April, 2024;
originally announced April 2024.
-
Maximal quantum interaction between free electrons and photons
Authors:
Zetao Xie,
Zeling Chen,
Hao Li,
Qinghui Yan,
Hongsheng Chen,
Xiao Lin,
Ido Kaminer,
Owen D. Miller,
Yi Yang
Abstract:
The emerging field of free-electron quantum optics enables electron-photon entanglement and holds the potential for generating nontrivial photon states for quantum information processing. Although recent experimental studies have entered the quantum regime, rapid theoretical developments predict that qualitatively unique phenomena only emerge beyond a certain interaction strength. It is thus pertinent to identify the maximal electron-photon interaction strength and the materials, geometries, and particle energies that enable one to approach it. We derive an upper limit to the quantum vacuum interaction strength between free electrons and single-mode photons, which illuminates the conditions for the strongest interaction. Crucially, we obtain an explicit energy selection recipe for electrons and photons to achieve maximal interaction at arbitrary separations and identify two optimal regimes favoring either fast or slow electrons over those with intermediate velocities. We validate the limit by analytical and numerical calculations on canonical geometries and provide near-optimal designs indicating the feasibility of strong quantum interactions. Our findings offer fundamental intuition for maximizing the quantum interaction between free electrons and photons and provide practical design rules for future experiments on electron-photon and electron-mediated photon-photon entanglement. They should also enable the evaluation of key metrics for applications such as the maximum power of free-electron radiation sources and the maximum acceleration gradient of dielectric laser accelerators.
Submitted 3 April, 2024; v1 submitted 30 March, 2024;
originally announced April 2024.
-
Open-Set Recognition in the Age of Vision-Language Models
Authors:
Dimity Miller,
Niko Sünderhauf,
Alex Kenna,
Keita Mason
Abstract:
Are vision-language models (VLMs) open-set models because they are trained on internet-scale datasets? We answer this question with a clear no - VLMs introduce closed-set assumptions via their finite query set, making them vulnerable to open-set conditions. We systematically evaluate VLMs for open-set recognition and find they frequently misclassify objects not contained in their query set, leading to alarmingly low precision when tuned for high recall and vice versa. We show that naively increasing the size of the query set to contain more and more classes does not mitigate this problem, but instead causes diminishing task performance and open-set performance. We establish a revised definition of the open-set problem for the age of VLMs, define a new benchmark and evaluation protocol to facilitate standardised evaluation and research in this important area, and evaluate promising baseline approaches based on predictive uncertainty and dedicated negative embeddings on a range of VLM classifiers and object detectors.
Submitted 25 March, 2024;
originally announced March 2024.
-
Quantum tomography of molecules using ultrafast electron diffraction
Authors:
Jiayang Jiang,
Ming Zhang,
Aosheng Gu,
R. J. Dwayne Miller,
Zheng Li
Abstract:
We propose a quantum tomography (QT) approach to retrieve the temporally evolving reduced density matrix in the electronic state basis, where the populations and coherence between the ground state and excited state are reconstructed from the ultrafast electron diffraction signal. In order to showcase the capability of the proposed QT approach, we simulate the nuclear wavepacket dynamics and ultrafast electron diffraction of photoexcited pyrrole molecules using the ab initio quantum chemical CASSCF method. From simulated time-resolved diffraction data, we retrieve the evolving density matrix in a crude diabatic representation basis and reveal the symmetry of the excited pyrrole wavepacket. Our QT approach opens a route to making a quantum version of the "molecular movie" that covers the electronic degree of freedom, and equips ultrafast electron diffraction with the power to reveal the coherence between electronic states, relaxation and dynamics of population transfer.
Submitted 8 March, 2024;
originally announced March 2024.
-
Methylation Operation Wizard (MeOW): Identification of differentially methylated regions in long-read sequencing data
Authors:
Miranda PG Zalusky,
Danny E Miller
Abstract:
Long-read sequencing (LRS) is able to simultaneously capture information about both DNA sequence and modifications, such as CpG methylation, in a single sequencing experiment. Here we present Methylation Operation Wizard (MeOW), a program to identify and prioritize differentially methylated regions (DMRs) genome-wide using LRS data. MeOW can be run using either a file containing counts of per-nucleotide methylated CpG sites or a BAM file containing modified base tags.
Submitted 26 February, 2024;
originally announced February 2024.
-
Approximate Analytical Solutions for the Circular Restricted Three-Body Problem Including Non-Hamiltonian Solar Radiation Pressure
Authors:
Hailee Hettrick,
David W. Miller,
Begum Cannataro
Abstract:
The circular restricted three-body problem (CR3BP) with solar radiation pressure (SRP) has often been analyzed with assumptions made on a spacecraft's attitude, such that the problem remains Hamiltonian. These assumptions are unsatisfactorily limiting for a starshade mission since the starshade's attitude will inherently vary from the configuration that corresponds to Hamiltonian dynamics. This paper presents the derivation of the equations of motion for CR3BP with SRP that permit the application of the Lindstedt-Poincare method, such that approximate solutions are produced, which may serve as invaluable trajectory design tools. Examples of periodic orbits and manifolds corresponding to three sets of attitude angles are shown and the accuracy of their seventh-order approximations is considered.
Submitted 12 February, 2024;
originally announced February 2024.
-
Universal Post-Training Reverse-Engineering Defense Against Backdoors in Deep Neural Networks
Authors:
Xi Li,
Hang Wang,
David J. Miller,
George Kesidis
Abstract:
A variety of defenses have been proposed against backdoor attacks on deep neural network (DNN) classifiers. Universal methods seek to reliably detect and/or mitigate backdoors irrespective of the incorporation mechanism used by the attacker, while reverse-engineering methods often explicitly assume one. In this paper, we describe a new detector that: relies on internal feature maps of the defended DNN to detect and reverse-engineer the backdoor and identify its target class; can operate post-training (without access to the training dataset); is highly effective for various incorporation mechanisms (i.e., is universal); and has low computational overhead and so is scalable. Our detection approach is evaluated for different attacks on benchmark CIFAR-10 and CIFAR-100 image classifiers.
Submitted 22 May, 2024; v1 submitted 3 February, 2024;
originally announced February 2024.
-
Measuring, processing, and generating partially coherent light with self-configuring optics
Authors:
Charles Roques-Carmes,
Shanhui Fan,
David Miller
Abstract:
Optical phenomena always display some degree of partial coherence between their respective degrees of freedom. Partial coherence is of particular interest in multimodal systems, where classical and quantum correlations between spatial, polarization, and spectral degrees of freedom can lead to fascinating phenomena (e.g., entanglement) and be leveraged for advanced imaging and sensing modalities (e.g., in hyperspectral, polarization, and ghost imaging). Here, we present a universal method to analyze, process, and generate spatially partially coherent light in multimode systems by using self-configuring optical networks. Our method relies on cascaded self-configuring layers whose average power outputs are sequentially optimized. Once optimized, the network separates the input light into its mutually incoherent components, which is formally equivalent to a diagonalization of the input density matrix. We illustrate our method with arrays of Mach-Zehnder interferometers and show how this method can be used to perform partially coherent environmental light sensing, generation of multimode partially coherent light with arbitrary coherency matrices, and unscrambling of quantum optical mixtures. We provide guidelines for the experimental realization of this method, paving the way for self-configuring photonic devices that can automatically learn optimal modal representations of partially coherent light fields.
Submitted 1 February, 2024;
originally announced February 2024.
-
Peano Arithmetic and $μ$MALL
Authors:
Matteo Manighetti,
Dale Miller
Abstract:
Formal theories of arithmetic have traditionally been based on either classical or intuitionistic logic, leading to the development of Peano and Heyting arithmetic, respectively. We propose to use $μ$MALL as a formal theory of arithmetic based on linear logic. This formal system is presented as a sequent calculus proof system that extends the standard proof system for multiplicative-additive linear logic (MALL) with the addition of the logical connectives universal and existential quantifiers (first-order quantifiers), term equality and non-equality, and the least and greatest fixed point operators. We first demonstrate how functions defined using $μ$MALL relational specifications can be computed using a simple proof search algorithm. By incorporating weakening and contraction into $μ$MALL, we obtain $μ$LK+, a natural candidate for a classical sequent calculus for arithmetic. While important proof theory results are still lacking for $μ$LK+ (including cut-elimination and the completeness of focusing), we prove that $μ$LK+ is consistent and that it contains Peano arithmetic. We also prove two conservativity results regarding $μ$LK+ over $μ$MALL.
Submitted 21 December, 2023;
originally announced December 2023.
-
Phenomenology of a Deconstructed Electroweak Force
Authors:
Joe Davighi,
Alastair Gosnay,
David J Miller,
Sophie Renner
Abstract:
We study an effective theory of flavour in which the $SU(2)_L$ interaction is `flavour-deconstructed' near the TeV scale. This arises, for example, in UV models that unify all three generations of left-handed fermions via an $Sp(6)_L$ symmetry. Flavour-universality of the electroweak force emerges accidentally (but naturally) from breaking the $\prod_{i=1}^3 SU(2)_{L,i}$ gauge group to its diagonal subgroup, delivering hierarchical fermion masses and left-handed mixing angles in the process. The heavy gauge bosons transform as two $SU(2)_L$ triplets that mediate new flavour non-universal forces. The lighter of these couples universally to the light generations, allowing consistency with flavour bounds even for a TeV scale mass. Constraints from flavour, high mass LHC searches, and electroweak precision are then highly complementary, excluding masses below 9 TeV. The heavier triplet must instead be hundreds of TeV to be consistent with meson mixing constraints. Because only the lighter triplet couples to the Higgs, we find radiative Higgs mass corrections of a few hundred GeV, meaning this model of flavour is arguably natural. The natural region will, however, be almost completely covered by the planned electroweak programme at FCC-ee. On shorter timescales, significant parameter space will be explored by the High-Luminosity LHC measurements at high-$p_T$, and upcoming lepton flavour violation experiments, principally Mu3e.
Submitted 11 April, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Gemini: A Family of Highly Capable Multimodal Models
Authors:
Gemini Team,
Rohan Anil,
Sebastian Borgeaud,
Jean-Baptiste Alayrac,
Jiahui Yu,
Radu Soricut,
Johan Schalkwyk,
Andrew M. Dai,
Anja Hauth,
Katie Millican,
David Silver,
Melvin Johnson,
Ioannis Antonoglou,
Julian Schrittwieser,
Amelia Glaese,
Jilin Chen,
Emily Pitler,
Timothy Lillicrap,
Angeliki Lazaridou,
Orhan Firat,
James Molloy,
Michael Isard,
Paul R. Barham,
Tom Hennigan,
Benjamin Lee
, et al. (1325 additional authors not shown)
Abstract:
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.
Submitted 17 June, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Tracing Position in the Regime of the Restricted Three-Body Problem to a Halo Orbit
Authors:
Hailee E. Hettrick,
Begum Cannataro,
David W. Miller
Abstract:
Driven by the desire to find positions that satisfy keepout constraints for a space-based telescope mission, this work develops a process for tracing a point in space in the regime of the restricted three-body problem to a halo orbit, characterized by its out-of-plane amplitude, and its position on that halo orbit, denoted by the halo orbit time. This process utilizes third-order solutions from the Lindstedt-Poincare method, which have been partially inverted to expect a point in space as an input. Three different methodologies that use these partially inverted expressions are presented. Results are produced for 1,000 randomly selected points using all three methods and are compared to truth. Ultimately, the method that employed two distinct accuracy metrics yielded the most accurate results for the dataset.
Submitted 18 December, 2023;
originally announced December 2023.
-
ZWCL 1856.8 : A rare double radio relic system captured within NuSTAR and Chandra field of view
Authors:
Ayşegül Tümer,
Daniel R. Wik,
Gerrit Schellenberger,
Eric D. Miller,
Marshall W. Bautz
Abstract:
Observations of galaxy cluster mergers provide insights on the particle acceleration and heating mechanisms taking place within the intracluster medium. Mergers form shocks that propagate through the plasma, which result in shock/cold fronts in the X-ray, and radio halos and/or relics in the radio regime. The connection between these tracers and the mechanisms driving non-thermal processes, such as inverse Compton, is not well understood. ZWCL 1856.8 is one of the few known double radio relic systems that originate from nearly head-on collisions observed close to the plane of the sky. For the first time, we study NuSTAR and Chandra observations of such a system that contains both relics within their field of view. The spectro-imaging analysis results for the system suggest weak shock fronts with $\mathcal{M}$ numbers within 2$σ$ of the radio derived values, and provide evidence of inverse Compton emission at both relic sites. Our findings have large uncertainties due to the shallow exposure times available. Deeper NuSTAR and Chandra data are crucial for studying the connection of the radio and X-ray emission features and for constraining the thermal vs. non-thermal emission contributions in this system. We also present methods and approaches on how to investigate X-ray properties of double relic systems by taking full advantage of the complementary properties of NuSTAR and Chandra missions.
Submitted 10 December, 2023;
originally announced December 2023.
-
Near-Infrared Observations of Outflows and YSOs in the Massive Star-Forming Region AFGL 5180
Authors:
S. Crowe,
R. Fedriani,
J. C. Tan,
M. Whittle,
Y. Zhang,
A. Caratti o Garatti,
J. P. Farias,
A. Gautam,
Z. Telkamp,
B. Rothberg,
M. Grudic,
M. Andersen,
G. Cosentino,
R. Garcia-Lopez,
V. Rosero,
K. Tanaka,
E. Pinna,
F. Rossi,
D. Miller,
G. Agapito,
C. Plantet,
E. Ghose,
J. Christou,
J. Power,
A. Puglisi
, et al. (8 additional authors not shown)
Abstract:
Methods: Broad- and narrow-band imaging of AFGL 5180 was made in the NIR with the LBT, in both seeing-limited ($\sim0.5\arcsec$) and high angular resolution ($\sim0.09\arcsec$) Adaptive Optics (AO) modes, as well as with HST. Archival ALMA continuum data was also utilized.
Results: At least 40 jet knots were identified via NIR emission from H$_2$ and [FeII] tracing shocked gas. Bright jet knots outflowing from the central most massive protostar, S4, are detected towards the east of the source and are resolved in fine detail with the AO imaging. Additional knots are distributed throughout the field, likely indicating the presence of multiple driving sources. Sub-millimeter sources detected by ALMA are shown to be grouped in two main complexes, AFGL 5180 M and a small cluster $\sim15\arcsec$ to the south, AFGL 5180 S. From our NIR continuum images we identify YSO candidates down to masses of $\sim 0.1\:M_\odot$. Combined with the sub-mm sources, this yields a surface number density of such YSOs of $N_* \sim 10^3 {\rm pc}^{-2}$ within a projected radius of about 0.1 pc. Such a value is similar to those predicted by models of both Core Accretion from a turbulent clump environment and Competitive Accretion. The radial profile of $N_*$ is relatively flat on scales out to 0.2~pc, with only modest enhancement around the massive protostar inside 0.05~pc.
Conclusions: This study demonstrates the utility of high-resolution NIR imaging, in particular with AO, for detecting outflow activity and YSOs in distant regions. The presented images reveal the complex morphology of outflow-shocked gas within the large-scale bipolar flow of a massive protostar, as well as clear evidence for several other outflow driving sources in the region. Finally, this work presents a novel approach to compare the observed YSO surface number density from our study against different models of massive star formation.
Submitted 20 November, 2023;
originally announced November 2023.
-
The evolution of galaxies and clusters at high spatial resolution with AXIS
Authors:
H. R. Russell,
L. A. Lopez,
S. W. Allen,
G. Chartas,
P. P. Choudhury,
R. A. Dupke,
A. C. Fabian,
A. M. Flores,
K. Garofali,
E. Hodges-Kluck,
M. J. Koss,
L. Lanz,
B. D. Lehmer,
J. -T. Li,
W. P. Maksym,
A. B. Mantz,
M. McDonald,
E. D. Miller,
R. F. Mushotzky,
Y. Qiu,
C. S. Reynolds,
F. Tombesi,
P. Tozzi,
A. Trindade-Falcao,
S. A. Walker
, et al. (3 additional authors not shown)
Abstract:
Stellar and black hole feedback heat and disperse surrounding cold gas clouds, launching gas flows off circumnuclear and galactic disks and producing a dynamic interstellar medium. On large scales bordering the cosmic web, feedback drives enriched gas out of galaxies and groups, seeding the intergalactic medium with heavy elements. In this way, feedback shapes galaxy evolution by shutting down star formation and ultimately curtailing the growth of structure after the peak at redshift 2-3. To understand the complex interplay between gravity and feedback, we must resolve both the key physics within galaxies and map the impact of these processes over large scales, out into the cosmic web. The Advanced X-ray Imaging Satellite (AXIS) is a proposed X-ray probe mission for the 2030s with arcsecond spatial resolution, large effective area, and low background. AXIS will untangle the interactions of winds, radiation, jets, and supernovae with the surrounding ISM across the wide range of mass scales and large volumes driving galaxy evolution and trace the establishment of feedback back to the main event at cosmic noon.
Submitted 13 November, 2023;
originally announced November 2023.
-
On Strong Zero-Dispersion Asymptotics for Benjamin-Ono Soliton Ensembles
Authors:
Elliot Blackstone,
Louise Gassot,
Peter D. Miller
Abstract:
A soliton ensemble is a particular kind of approximation of the solution of an initial-value problem for an integrable equation by a reflectionless potential that is well adapted to singular asymptotics like the small-dispersion limit. We show how soliton ensembles for the Benjamin-Ono equation can be analyzed in this limit via the construction of local approximations that capture highly oscillatory features of the solution and hence provide more information than weak convergence results that are easier to obtain. These local approximations are deduced from the distributions of eigenvalues of two related matrices, one Hermitian and another non-Hermitian. We perform careful numerical experiments to deduce the asymptotic behavior of the eigenvalues of these matrices in the small-dispersion limit, and formulate conjectures reflecting our observations. Then we apply the conjectures to construct the local approximations of slowly varying profiles and rapidly oscillating profiles as well. We show that the latter profiles are consistent with the predictions of Whitham modulation theory as originally developed for the Benjamin-Ono equation by Dobrokhotov and Krichever.
Submitted 9 November, 2023;
originally announced November 2023.
-
Bypassing thermalization timescales in temperature estimation using prethermal probes
Authors:
Nicholas Anto-Sztrikacs,
Harry J. D. Miller,
Ahsan Nazir,
Dvira Segal
Abstract:
We introduce prethermal temperature probes for sensitive, fast and robust temperature estimation. While equilibrium thermal probes with a manifold of quasidegenerate excited states have been previously recognized for their maximal sensitivity, they suffer from long thermalization timescales. When considering time as a critical resource in thermometry, it becomes evident that these equilibrium probes fall short of ideal performance. Here, we propose a different paradigm for thermometry, where setups originally suggested for optimal equilibrium thermometry should instead be employed as prethermal probes, by making use of their long-lived quasiequilibrium state. This transient state emerges from the buildup of quantum coherences among quasidegenerate levels. For a class of physically-motivated initial conditions, we find that energy measurements of the prethermal state exhibit a similar sensitivity as the equilibrium state. However, they offer the distinct benefit of orders of magnitude reduction in the time required for the estimation protocol. Upon introducing a figure-of-merit that accounts for the estimation protocol time, prethermal probes surpass the corresponding equilibrium probes in terms of effective thermal sensitivity, opening avenues for rapid thermometry by harnessing the long-lived prethermal states.
Submitted 9 November, 2023;
originally announced November 2023.
-
Tunneling escape of waves
Authors:
David A. B. Miller,
Zeyu Kuang,
Owen D. Miller
Abstract:
We solve a long-standing set of problems in optics and waves: why does a volume have only so many useful orthogonal wave channels in or out of it, why do coupling strengths fall off dramatically past this number, and, indeed, just what precisely defines that number? Increasingly in applications in communications, information processing, and sensing, in optics, acoustics, and electromagnetic waves generally, we need to understand this number. We can numerically find such channels for many problems, but more fundamentally, these questions have arguably never had a clear answer or physical explanation. We have found a simple and general result and intuition that lets us understand and bound this behavior for any volume. This is based on a tunneling that has been somewhat hidden in the mathematics of spherical waves: beyond a certain complexity of the wave, it must tunnel to escape the volume. By counting the number of waves that do not have to tunnel, we get a simple and precise number or bound for well-coupled channels, even for arbitrary volumes. The necessary tunneling for other waves explains the rapid fall-off in their coupling, and shows all such waves do escape to propagation to some degree after tunneling. This approach connects multipole expansions in electromagnetic antennas and nanophotonics smoothly to apparently evanescent waves in large optics. It works over all size scales, from nanophotonics, small radio-frequency antennas, or acoustic microphones and loudspeakers up to imaging optics with millions of channels, and gives a precise diffraction limit for any volume.
Submitted 6 November, 2023; v1 submitted 5 November, 2023;
originally announced November 2023.
-
The Case for Controls: Identifying outbreak risk factors through case-control comparisons
Authors:
Nina H. Fefferman,
Michael J. Blum,
Lydia Bourouiba,
Nathaniel L. Gibson,
Qiang He,
Debra L. Miller,
Monica Papes,
Dana K. Pasquale,
Connor Verheyen,
Sadie J. Ryan
Abstract:
Investigations of infectious disease outbreaks often focus on identifying place- and context-dependent factors responsible for emergence and spread, resulting in phenomenological narratives ill-suited to developing generalizable predictive and preventive measures. We contend that case-control hypothesis testing is a more powerful framework for epidemiological investigation. The approach, widely used in medical research, involves identifying counterfactuals, with case-control comparisons drawn to test hypotheses about the conditions that manifest outbreaks. Here we outline the merits of applying a case-control framework as epidemiological study design. We first describe a framework for iterative multidisciplinary interrogation to discover minimally sufficient sets of factors that can lead to disease outbreaks. We then lay out how case-control comparisons can respectively center on pathogen(s), factor(s), or landscape(s) with vignettes focusing on pathogen transmission. Finally, we consider how adopting case-control approaches can promote evidence-based decision making for responding to and preventing outbreaks.
Submitted 3 November, 2023;
originally announced November 2023.
-
Overview of the Advanced X-ray Imaging Satellite (AXIS)
Authors:
Christopher S. Reynolds,
Erin A. Kara,
Richard F. Mushotzky,
Andrew Ptak,
Michael J. Koss,
Brian J. Williams,
Steven W. Allen,
Franz E. Bauer,
Marshall Bautz,
Arash Bodaghee,
Kevin B. Burdge,
Nico Cappelluti,
Brad Cenko,
George Chartas,
Kai-Wing Chan,
Lía Corrales,
Tansu Daylan,
Abraham D. Falcone,
Adi Foord,
Catherine E. Grant,
Mélanie Habouzit,
Daryl Haggard,
Sven Herrmann,
Edmund Hodges-Kluck,
Oleg Kargaltsev
, et al. (18 additional authors not shown)
Abstract:
The Advanced X-ray Imaging Satellite (AXIS) is a Probe-class concept that will build on the legacy of the Chandra X-ray Observatory by providing low-background, arcsecond-resolution imaging in the 0.3-10 keV band across a 450 arcminute$^2$ field of view, with an order of magnitude improvement in sensitivity. AXIS utilizes breakthroughs in the construction of lightweight segmented X-ray optics using single-crystal silicon, and developments in the fabrication of large-format, small-pixel, high readout rate CCD detectors with good spectral resolution, allowing a robust and cost-effective design. Further, AXIS will be responsive to target-of-opportunity alerts and, with onboard transient detection, will be a powerful facility for studying the time-varying X-ray universe, following on from the legacy of the Neil Gehrels (Swift) X-ray observatory that revolutionized studies of the transient X-ray Universe. In this paper, we present an overview of AXIS, highlighting the prime science objectives driving the AXIS concept and how the observatory design will achieve these objectives.
Submitted 1 November, 2023;
originally announced November 2023.
-
SOUL at LBT: commissioning results, science and future
Authors:
Enrico Pinna,
Fabio Rossi,
Guido Agapito,
Alfio Puglisi,
Cédric Plantet,
Essna Ghose,
Matthieu Bec,
Marco Bonaglia,
Runa Briguglio,
Guido Brusa,
Luca Carbonaro,
Alessandro Cavallaro,
Julian Christou,
Olivier Durney,
Steve Ertel,
Simone Esposito,
Paolo Grani,
Juan Carlos Guerra,
Philip Hinz,
Michael Lefebvre,
Tommaso Mazzoni,
Brandon Mechtley,
Douglas L. Miller,
Manny Montoya,
Jennifer Power
, et al. (5 additional authors not shown)
Abstract:
The SOUL systems at the Large Binocular Telescope can be seen as precursors for the ELT SCAO systems, combining key technologies such as EMCCD, Pyramid WFS and adaptive telescopes. After the first light of the first upgraded system in September 2018, and after COVID and technical stops, we now have all 4 systems working on-sky. Here, we report on some key control improvements and the system performance characterized during the commissioning. The upgrade allows us to correct more modes (500) at the bright end and increases the sky coverage, providing SR(K)>20% with reference stars G$_{RP}$<17, opening up extragalactic targets to NGS systems. Finally, we review the first astrophysical results, looking forward to the next generation instruments (SHARK-NIR, SHARK-Vis and iLocater), to be fed by the SOUL AO correction.
Submitted 22 October, 2023;
originally announced October 2023.
-
First Results from a Broadband Search for Dark Photon Dark Matter in the $44$ to $52\,μ$eV range with a coaxial dish antenna
Authors:
Stefan Knirck,
Gabe Hoshino,
Mohamed H. Awida,
Gustavo I. Cancelo,
Martin Di Federico,
Benjamin Knepper,
Alex Lapuente,
Mira Littmann,
David W. Miller,
Donald V. Mitchell,
Derrick Rodriguez,
Mark K. Ruschman,
Matthew A. Sawtell,
Leandro Stefanazzi,
Andrew Sonnenschein,
Gary W. Teafoe,
Daniel Bowring,
G. Carosi,
Aaron Chou,
Clarence L. Chang,
Kristin Dona,
Rakshya Khatiwada,
Noah A. Kurinsky,
Jesse Liu,
Cristián Pena
, et al. (3 additional authors not shown)
Abstract:
We present first results from a dark photon dark matter search in the mass range from 44 to 52 $μ{\rm eV}$ ($10.7 - 12.5\,{\rm GHz}$) using a room-temperature dish antenna setup called GigaBREAD. Dark photon dark matter converts to ordinary photons on a cylindrical metallic emission surface with area $0.5\,{\rm m}^2$ and is focused by a novel parabolic reflector onto a horn antenna. Signals are read out with a low-noise receiver system. A first data taking run with 24 days of data does not show evidence for dark photon dark matter in this mass range, excluding dark photon - photon mixing parameters $χ\gtrsim 10^{-12}$ in this range at 90% confidence level. This surpasses existing constraints by about two orders of magnitude and is the most stringent bound on dark photons in this range below 49 $μ$eV.
Submitted 3 May, 2024; v1 submitted 20 October, 2023;
originally announced October 2023.
-
CrysFormer: Protein Structure Prediction via 3d Patterson Maps and Partial Structure Attention
Authors:
Chen Dun,
Qiutai Pan,
Shikai Jin,
Ria Stevens,
Mitchell D. Miller,
George N. Phillips, Jr.,
Anastasios Kyrillidis
Abstract:
Determining the structure of a protein has been a decades-long open question. A protein's three-dimensional structure often poses nontrivial computation costs, when classical simulation algorithms are utilized. Advances in the transformer neural network architecture -- such as AlphaFold2 -- achieve significant improvements for this problem, by learning from a large dataset of sequence information and corresponding protein structures. Yet, such methods only focus on sequence information; other available prior knowledge, such as protein crystallography and partial structure of amino acids, could be potentially utilized. To the best of our knowledge, we propose the first transformer-based model that directly utilizes protein crystallography and partial structure information to predict the electron density maps of proteins. Via two new datasets of peptide fragments (2-residue and 15-residue), we demonstrate our method, dubbed \texttt{CrysFormer}, can achieve accurate predictions, based on a much smaller dataset size and with reduced computation costs.
Submitted 5 October, 2023;
originally announced October 2023.
-
An Extremely Massive White Dwarf Escaped From the Hyades Star Cluster
Authors:
David R. Miller,
Ilaria Caiazzo,
Jeremy Heyl,
Harvey B. Richer,
Kareem El-Badry,
Antonio C. Rodriguez,
Zachary P. Vanderbosch,
Jan van Roestel
Abstract:
We searched the Gaia DR3 database for ultramassive white dwarfs with kinematics consistent with having escaped the nearby Hyades open cluster, identifying three such candidates. Two of these candidates have masses estimated from Gaia photometry of approximately 1.1 solar masses; their status as products of single stellar evolution that have escaped the cluster was deemed too questionable for immediate follow-up analysis. The remaining candidate has an expected mass >1.3 solar masses, significantly reducing the probability of it being an interloper. Analysis of follow-up Gemini GMOS spectroscopy for this source reveals a non-magnetized hydrogen atmosphere white dwarf with a mass and age consistent with having formed from a single star. Assuming a single-stellar evolution formation channel, we estimate a 97.8% chance that the candidate is a true escapee from the Hyades. With a determined mass of 1.317 solar masses, this is potentially the most massive known single-evolution white dwarf and is by far the most massive with a strong association with an open cluster.
Submitted 4 October, 2023;
originally announced October 2023.
-
Synthesis technique and electron beam damage study of nanometer-thin single-crystalline Thymine
Authors:
Hazem Daoud,
Sreelaja Pulleri Vadhyar,
Ehsan Nikbin,
Cheng Lu,
R. J. Dwayne Miller
Abstract:
Samples suitable for electron diffraction studies must satisfy certain characteristics such as having a thickness in the range of 10 - 100 nm. We report, to our knowledge, the first successful synthesis technique of nanometer-thin sheets of single-crystalline thymine suitable for electron diffraction and spectroscopy studies. This development provides a well defined system to explore issues related to UV photochemistry of DNA and high intrinsic stability essential to maintaining integrity of genetic information. The crystals are grown using the evaporation technique and the nanometer-thin sheets are obtained via microtoming. The sample is characterized via x-ray diffraction (XRD) and is subsequently studied using electron diffraction via a transmission electron microscope (TEM). Thymine is found to be more radiation resistant than similar molecular moieties (e.g., carbamazepine) by a factor of 5. This raises interesting questions about the role of the fast relaxation processes of electron scattering-induced excited states, extending the concept of radiation hardening beyond photoexcited states. The high stability of thymine in particular opens the door for further studies of these ultrafast relaxation processes giving rise to the high stability of DNA to UV radiation.
Submitted 12 January, 2024; v1 submitted 2 October, 2023;
originally announced October 2023.
-
Arcus X-ray telescope performance and alignment
Authors:
Hans Moritz Günther,
Peter Cheimets,
Eric D. Miller,
Casey DeRoo,
Randall K. Smith,
Andrew Ptak,
Ralf K. Heilmann
Abstract:
Arcus is a concept for a probe class mission to deliver high-resolution FUV and X-ray spectroscopy. For X-rays, it combines cost-effective silicon pore optics (SPO) with high-throughput critical-angle transmission (CAT) gratings to achieve $R> 3000$ in a bandpass from 12-50 Angstroem. We show in detail how the X-ray and the UV spectrographs (XRS and UVS) on Arcus will be aligned to each other. For XRS we present ray-tracing studies to derive performance characteristics such as the spectral resolving power and effective area, study the effect of misalignments on the performance, and conclude that most tolerances can be achieved with mechanical means alone. We also present an estimate of the expected on-orbit background.
Submitted 28 September, 2023;
originally announced September 2023.
-
Post-Training Overfitting Mitigation in DNN Classifiers
Authors:
Hang Wang,
David J. Miller,
George Kesidis
Abstract:
Well-known (non-malicious) sources of overfitting in deep neural net (DNN) classifiers include: i) large class imbalances; ii) insufficient training-set diversity; and iii) over-training. In recent work, it was shown that backdoor data-poisoning also induces overfitting, with unusually large classification margins to the attacker's target class, mediated particularly by (unbounded) ReLU activations that allow large signals to propagate in the DNN. Thus, an effective post-training (with no knowledge of the training set or training process) mitigation approach against backdoors was proposed, leveraging a small clean dataset, based on bounding neural activations. Improving upon that work, we threshold activations specifically to limit maximum margins (MMs), which yields performance gains in backdoor mitigation. We also provide some analytical support for this mitigation approach. Most importantly, we show that post-training MM-based regularization substantially mitigates non-malicious overfitting due to class imbalances and overtraining. Thus, unlike adversarial training, which provides some resilience against attacks but which harms clean (attack-free) generalization, we demonstrate an approach originating from adversarial learning that helps clean generalization accuracy. Experiments on CIFAR-10 and CIFAR-100, in comparison with peer methods, demonstrate strong performance of our methods.
Submitted 28 September, 2023;
originally announced September 2023.
-
Secondary Whistler and Ion-cyclotron Instabilities driven by Mirror Modes in Galaxy Clusters
Authors:
Francisco Ley,
Ellen G. Zweibel,
Drake Miller,
Mario Riquelme
Abstract:
Electron cyclotron waves (whistlers) are commonly observed in plasmas near Earth and in the solar wind. In the presence of nonlinear mirror modes, bursts of whistlers, usually called lion roars, have been observed within low magnetic field regions associated with these modes. In the intracluster medium (ICM) of galaxy clusters, the excitation of the mirror instability is expected, but it is not yet clear whether electron and ion cyclotron waves can also be present under conditions where gas pressure dominates over magnetic pressure (high $β$). In this work, we perform fully kinetic particle-in-cell (PIC) simulations of a plasma subject to a continuous amplification of the mean magnetic field $\textbf{B}(t)$ to study the nonlinear stages of the mirror instability and the ensuing excitation of whistler and ion cyclotron (IC) waves under ICM conditions. Once mirror modes reach nonlinear amplitudes, both whistler and IC waves start to emerge simultaneously, with sub-dominant amplitudes, propagating in low-$\textbf{B}$ regions, and quasi-parallel to $\textbf{B}(t)$. We show that the underlying source of excitation is the pressure anisotropy of electrons and ions trapped in mirror modes with loss-cone type distributions. We also observe that IC waves play an essential role in regulating the ion pressure anisotropy at nonlinear stages. We argue that whistler and IC waves are a concomitant feature at late stages of the mirror instability even at high $β$, and are therefore expected to be present in astrophysical environments like the ICM. We discuss the implications of our results for collisionless heating and dissipation of turbulence in the ICM.
Submitted 28 September, 2023;
originally announced September 2023.
-
Enabling Large-scale Heterogeneous Collaboration with Opportunistic Communications
Authors:
Fernando Cladera,
Zachary Ravichandran,
Ian D. Miller,
M. Ani Hsieh,
C. J. Taylor,
Vijay Kumar
Abstract:
Multi-robot collaboration in large-scale environments with limited-sized teams and without external infrastructure is challenging, since the software framework required to support complex tasks must be robust to unreliable and intermittent communication links. In this work, we present MOCHA (Multi-robot Opportunistic Communication for Heterogeneous Collaboration), a framework for resilient multi-robot collaboration that enables large-scale exploration in the absence of continuous communications. MOCHA is based on a gossip communication protocol that allows robots to interact opportunistically whenever communication links are available, propagating information on a peer-to-peer basis. We demonstrate the performance of MOCHA through real-world experiments with commercial-off-the-shelf (COTS) communication hardware. We further explore the system's scalability in simulation, evaluating the performance of our approach as the number of robots increases and communication ranges vary. Finally, we demonstrate how MOCHA can be tightly integrated with the planning stack of autonomous robots. We show a communication-aware planning algorithm for a high-altitude aerial robot executing a collaborative task while maximizing the amount of information shared with ground robots. The source code for MOCHA and the high-altitude UAV planning system is available open source: http://github.com/KumarRobotics/MOCHA, http://github.com/KumarRobotics/air_router.
Submitted 27 September, 2023;
originally announced September 2023.
-
Gas clumping in the outskirts of galaxy clusters, an assessment of the sensitivity of STAR-X
Authors:
Christian T. Norseth,
Daniel R. Wik,
John A. ZuHone,
Eric D. Miller,
Marshall W. Bautz,
Michael McDonald
Abstract:
In the outskirts of galaxy clusters, entropy profiles measured from X-ray observations of the hot intracluster medium (ICM) drop off unexpectedly. One possible explanation for this effect is gas clumping, where pockets of cooler and denser structures within the ICM are present. Current observatories are unable to directly detect these hypothetical gas clumps. One of the science drivers of the proposed STAR-X observatory is to resolve these or similar structures. Its high spatial resolution, large effective area, and low instrumental background make STAR-X ideal for directly detecting and characterizing clumps and diffuse emission in cluster outskirts. The aim of this work is to simulate observations of clumping in clusters to determine how well STAR-X will be able to detect clumps, as well as what clumping properties reproduce observed entropy profiles. This is achieved by using yt, pyXSIM, SOXS, and other tools to inject ideally modeled clumps into three-dimensional models derived from actual clusters using their observed profiles from other X-ray missions. Radial temperature and surface brightness profiles are then extracted from mock observations using concentric annuli. We find that in simulated observations for STAR-X, a parameter space of clump properties exists where gas clumps can be successfully identified using wavdetect and masked, allowing the true cluster profiles to be recovered. This demonstrates that STAR-X could be capable of detecting substructure in the outskirts of nearby clusters and that the properties of both the outskirts and the clumps will be revealed.
Submitted 4 October, 2023; v1 submitted 4 September, 2023;
originally announced September 2023.
-
The high-speed X-ray camera on AXIS
Authors:
Eric D. Miller,
Marshall W. Bautz,
Catherine E. Grant,
Richard F. Foster,
Beverly LaMarr,
Andrew Malonis,
Gregory Prigozhin,
Benjamin Schneider,
Christopher Leitz,
Sven Herrmann,
Steven W. Allen,
Tanmoy Chattopadhyay,
Peter Orel,
R. Glenn Morris,
Haley Stueber,
Abraham D. Falcone,
Andrew Ptak,
Christopher Reynolds
Abstract:
AXIS is a Probe-class mission concept that will provide high-throughput, high-spatial-resolution X-ray spectral imaging, enabling transformative studies of high-energy astrophysical phenomena. To take advantage of the advanced optics and avoid photon pile-up, the AXIS focal plane requires detectors with readout rates at least 20 times faster than previous soft X-ray imaging spectrometers flying aboard missions such as Chandra and Suzaku, while retaining the low noise, excellent spectral performance, and low power requirements of those instruments. We present the design of the AXIS high-speed X-ray camera, which baselines large-format MIT Lincoln Laboratory CCDs employing low-noise pJFET output amplifiers and a single-layer polysilicon gate structure that allows fast, low-power clocking. These detectors are combined with an integrated high-speed, low-noise ASIC readout chip from Stanford University that provides better performance than conventional discrete solutions at a fraction of their power consumption and footprint. Our complementary front-end electronics concept employs state of the art digital video waveform capture and advanced signal processing to deliver low noise at high speed. We review the current performance of this technology, highlighting recent improvements on prototype devices that achieve excellent noise characteristics at the required readout rate. We present measurements of the CCD spectral response across the AXIS energy band, augmenting lab measurements with detector simulations that help us understand sources of charge loss and evaluate the quality of the CCD backside passivation technique. We show that our technology is on a path that will meet our requirements and enable AXIS to achieve world-class science.
Submitted 1 September, 2023;
originally announced September 2023.
-
Differential Equations for Approximate Solutions of Painlevé Equations: Application to the Algebraic Solutions of the Painlevé-III $({\rm D}_7)$ Equation
Authors:
Robert J. Buckingham,
Peter D. Miller
Abstract:
It is well known that the Painlevé equations can formally degenerate to autonomous differential equations with elliptic function solutions in suitable scaling limits. A way to make this degeneration rigorous is to apply Deift-Zhou steepest-descent techniques to a Riemann-Hilbert representation of a family of solutions. This method leads to an explicit approximation formula in terms of theta functions and related algebro-geometric ingredients that is difficult to directly link to the expected limiting differential equation. However, the approximation arises from an outer parametrix that satisfies relatively simple conditions. By applying a method that we learned from Alexander Its, it is possible to use these simple conditions to directly obtain the limiting differential equation, bypassing the details of the algebro-geometric solution of the outer parametrix problem. In this paper, we illustrate the use of this method to relate an approximation of the algebraic solutions of the Painlevé-III (D$_7$) equation valid in the part of the complex plane where the poles and zeros of the solutions asymptotically reside to a form of the Weierstrass equation.
Submitted 20 January, 2024; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Backdoor Mitigation by Correcting the Distribution of Neural Activations
Authors:
Xi Li,
Zhen Xiang,
David J. Miller,
George Kesidis
Abstract:
Backdoor (Trojan) attacks are an important type of adversarial exploit against deep neural networks (DNNs), wherein a test instance is (mis)classified to the attacker's target class whenever the attacker's backdoor trigger is present. In this paper, we reveal and analyze an important property of backdoor attacks: a successful attack causes an alteration in the distribution of internal layer activations for backdoor-trigger instances, compared to that for clean instances. Even more importantly, we find that instances with the backdoor trigger will be correctly classified to their original source classes if this distribution alteration is corrected. Based on our observations, we propose an efficient and effective method that achieves post-training backdoor mitigation by correcting the distribution alteration using reverse-engineered triggers. Notably, our method does not change any trainable parameters of the DNN, but achieves generally better mitigation performance than existing methods that do require intensive DNN parameter tuning. It also efficiently detects test instances with the trigger, which may help to catch adversarial entities in the act of exploiting the backdoor.
Submitted 18 August, 2023;
originally announced August 2023.
-
Fullwave design of cm-scale cylindrical metasurfaces via fast direct solvers
Authors:
Wenjin Xue,
Hanwen Zhang,
Abinand Gopal,
Vladimir Rokhlin,
Owen D. Miller
Abstract:
Large-scale metasurfaces promise nanophotonic performance improvements to macroscopic optics functionality, for applications from imaging to analog computing. Yet the size scale mismatch of centimeter-scale chips versus micron-scale wavelengths prohibits use of conventional full-wave simulation techniques, and has necessitated dramatic approximations. Here, we show that tailoring "fast direct" integral-equation simulation techniques to the form factor of metasurfaces offers the possibility for accurate and efficient full-wave, large-scale metasurface simulations. For cylindrical (two-dimensional) metasurfaces, we demonstrate accurate simulations whose solution time scales linearly with the metasurface diameter. Moreover, the solver stores compressed information about the simulation domain that is reusable over many design iterations. We demonstrate the capabilities of our solver through two designs: first, a high-efficiency, high-numerical-aperture metalens that is 20,000 wavelengths in diameter; second, a high-efficiency, large-beam-width grating coupler. The latter corresponds to millimeter-scale beam design at standard telecommunications wavelengths, while the former, at a visible wavelength of 500 nm, corresponds to a design diameter of 1 cm, created through full simulations of Maxwell's equations.
Submitted 15 August, 2023;
originally announced August 2023.
-
Improved Activation Clipping for Universal Backdoor Mitigation and Test-Time Detection
Authors:
Hang Wang,
Zhen Xiang,
David J. Miller,
George Kesidis
Abstract:
Deep neural networks are vulnerable to backdoor attacks (Trojans), where an attacker poisons the training set with backdoor triggers so that the neural network learns to classify test-time triggers to the attacker's designated target class. Recent work shows that backdoor poisoning induces over-fitting (abnormally large activations) in the attacked model, which motivates a general, post-training clipping method for backdoor mitigation, i.e., with bounds on internal-layer activations learned using a small set of clean samples. We devise a new such approach, choosing the activation bounds to explicitly limit classification margins. This method gives superior performance against peer methods for CIFAR-10 image classification. We also show that this method has strong robustness against adaptive attacks, X2X attacks, and on different datasets. Finally, we demonstrate a method extension for test-time detection and correction based on the output differences between the original and activation-bounded networks. The code of our method is available online.
Submitted 8 August, 2023;
originally announced August 2023.
-
Explainable Equivariant Neural Networks for Particle Physics: PELICAN
Authors:
Alexander Bogatskiy,
Timothy Hoffman,
David W. Miller,
Jan T. Offermann,
Xiaoyang Liu
Abstract:
PELICAN is a novel permutation equivariant and Lorentz invariant or covariant aggregator network designed to overcome common limitations found in architectures applied to particle physics problems. Compared to many approaches that use non-specialized architectures that neglect underlying physics principles and require very large numbers of parameters, PELICAN employs a fundamentally symmetry group-based architecture that demonstrates benefits in terms of reduced complexity, increased interpretability, and raw performance. We present a comprehensive study of the PELICAN algorithm architecture in the context of both tagging (classification) and reconstructing (regression) Lorentz-boosted top quarks, including the difficult task of specifically identifying and measuring the $W$-boson inside the dense environment of the Lorentz-boosted top-quark hadronic final state. We also extend the application of PELICAN to the tasks of identifying quark-initiated vs. gluon-initiated jets, and a multi-class identification across five separate target categories of jets. When tested on the standard task of Lorentz-boosted top-quark tagging, PELICAN outperforms existing competitors with much lower model complexity and high sample efficiency. On the less common and more complex task of 4-momentum regression, PELICAN also outperforms hand-crafted, non-machine learning algorithms. We discuss the implications of symmetry-restricted architectures for the wider field of machine learning for physics.
Submitted 23 February, 2024; v1 submitted 31 July, 2023;
originally announced July 2023.
-
Unraveling Quantum Coherences Mediating Primary Charge Transfer Processes in Photosystem II Reaction Center
Authors:
Ajay Jha,
Pan-Pan Zhang,
Vandana Tiwari,
Lipeng Chen,
Michael Thorwart,
R. J. Dwayne Miller,
Hong-Guang Duan
Abstract:
The photosystem II (PSII) reaction center is a unique protein-chromophore complex that is capable of efficiently separating electronic charges across the membrane after photoexcitation. In the PSII reaction center, the primary energy- and charge-transfer (CT) processes occur on comparable ultrafast timescales, which makes it extremely challenging to understand the fundamental mechanism responsible for the near-unity quantum efficiency of the transfer. Here, we elucidate the role of quantum coherences in the ultrafast energy and CT in the PSII reaction center by performing two-dimensional (2D) electronic spectroscopy at the cryogenic temperature of 20 K, which captures the distinct underlying quantum coherences. Specifically, we uncover the electronic and vibrational coherences along with their lifetimes during the primary ultrafast processes of energy and CT. We also examine the functional role of the observed quantum coherences. To gather further insight, we construct a structure-based excitonic model that provides evidence for coherent energy and CT at low temperature in the 2D electronic spectra. The principles uncovered by this combination of experimental and theoretical analyses could provide valuable guidelines for creating artificial photosystems that exploit system-bath coupling and control of coherences to optimize the photon conversion efficiency for specific functions.
Submitted 24 July, 2023;
originally announced July 2023.
-
A system of inference based on proof search: an extended abstract
Authors:
Dale Miller
Abstract:
Gentzen designed his natural deduction proof system to "come as close as possible to actual reasoning." Indeed, natural deduction proofs closely resemble the static structure of logical reasoning in mathematical arguments. However, different features of inference are compelling to capture when one wants to support the process of searching for proofs. PSF (Proof Search Framework) attempts to capture these features naturally and directly. The design and metatheory of PSF are presented, and its ability to specify a range of proof systems for classical, intuitionistic, and linear logic is illustrated.
Submitted 24 July, 2023;
originally announced July 2023.
-
Painlevé-III Monodromy Maps Under the $D_6\to D_8$ Confluence and Applications to the Large-Parameter Asymptotics of Rational Solutions
Authors:
Ahmad Barhoumi,
Oleg Lisovyy,
Peter D. Miller,
Andrei Prokhorov
Abstract:
The third Painlevé equation in its generic form, often referred to as Painlevé-III($D_6$), is given by $$ \frac{{\rm d}^2u}{{\rm d}x^2} =\frac{1}{u}\left(\frac{{\rm d}u}{{\rm d}x}\right)^2-\frac{1}{x}\frac{{\rm d}u}{{\rm d}x}+\frac{αu^2+β}{x}+4u^3-\frac{4}{u}, \qquad α,β\in \mathbb C.$$ Starting from a generic initial solution $u_0(x)$ corresponding to parameters $α$, $β$, denoted as the triple $(u_0(x),α,β)$, we apply an explicit Bäcklund transformation to generate a family of solutions $(u_n(x),α+4n,β+4n)$ indexed by $n \in \mathbb N$. We study the large $n$ behavior of the solutions $(u_n(x),α+4n,β+4n)$ under the scaling $x=z/n$ in two different ways: (a) analyzing the convergence properties of series solutions to the equation, and (b) using a Riemann-Hilbert representation of the solution $u_n(z/n)$. Our main result is a proof that the limit of solutions $u_n(z/n)$ exists and is given by a solution of the degenerate Painlevé-III equation, known as Painlevé-III($D_8$), $$ \frac{{\rm d}^2U}{{\rm d}z^2} =\frac{1}{U}\left(\frac{{\rm d}U}{{\rm d}z}\right)^2-\frac{1}{z}\frac{{\rm d}U}{{\rm d}z}+\frac{4U^2+4}{z}.$$ A notable application of our result is to rational solutions of Painlevé-III($D_6$), which are constructed using the seed solution $(1,4m,-4m)$ where $m \in \mathbb C \setminus \big(\mathbb Z +\frac{1}{2}\big)$ and can be written as a particular ratio of Umemura polynomials. We identify the limiting solution in terms of both its initial condition at $z=0$ when it is well defined, and by its monodromy data in the general case. Furthermore, as a consequence of our analysis, we deduce the asymptotic behavior of generic solutions of Painlevé-III, both $D_6$ and $D_8$ at $z=0$. We also deduce the large $n$ behavior of the Umemura polynomials in a neighborhood of $z=0$.
Submitted 9 March, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.