-
Gauging The Diamond: Integrable Coset Models from Twistor Space
Authors:
Lewis T. Cole,
Ryan A. Cullinan,
Ben Hoare,
Joaquin Liniado,
Daniel C. Thompson
Abstract:
Recent work has shown that certain integrable and conformal field theories in two dimensions can be given a higher-dimensional origin from holomorphic Chern-Simons in six dimensions. Along with anti-self-dual Yang-Mills and four-dimensional Chern-Simons, this gives rise to a diamond correspondence of theories. In this work we extend this framework to incorporate models realised through gaugings. A…
▽ More
Recent work has shown that certain integrable and conformal field theories in two dimensions can be given a higher-dimensional origin from holomorphic Chern-Simons in six dimensions. Along with anti-self-dual Yang-Mills and four-dimensional Chern-Simons, this gives rise to a diamond correspondence of theories. In this work we extend this framework to incorporate models realised through gaugings. As well as describing a higher-dimensional origin of coset CFTs, by choosing the details of the reduction from higher dimensions, we obtain rich classes of two-dimensional integrable models including homogeneous sine-Gordon models and generalisations that are new to the literature.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Galaxy Mergers in the Epoch of Reionization I: A JWST Study of Pair Fractions, Merger Rates, and Stellar Mass Accretion Rates at $z = 4.5-11.5$
Authors:
Qiao Duan,
Christopher J. Conselice,
Qiong Li,
Duncan Austin,
Thomas Harvey,
Nathan J. Adams,
Kenneth J. Duncan,
James Trussler,
Leonardo Ferreira,
Lewi Westcott,
Honor Harris,
Rogier A. Windhorst,
Benne W. Holwerda,
Thomas J. Broadhurst,
Dan Coe,
Seth H. Cohen,
Simon P. Driver,
Brenda Frye,
Norman A. Grogin,
Nimish P. Hathi,
Rolf A. Jansen,
Anton M. Koekemoer,
Madeline A. Marshall,
Mario Nonino,
Rafael Ortiz III
, et al. (7 additional authors not shown)
Abstract:
We present a full analysis of galaxy major merger pair fractions, merger rates, and mass accretion rates, thus uncovering the role of mergers in galaxy formation at the earliest previously unexplored epoch of $4.5<z<11.5$. We target galaxies with masses $\log_{10}(\mathrm{M}_*/\mathrm{M}_\odot) = 8.0 - 10.0$, utilizing data from eight JWST Cycle-1 fields (CEERS, JADES GOODS-S, NEP-TDF, NGDEEP, GLA…
▽ More
We present a full analysis of galaxy major merger pair fractions, merger rates, and mass accretion rates, thus uncovering the role of mergers in galaxy formation at the earliest previously unexplored epoch of $4.5<z<11.5$. We target galaxies with masses $\log_{10}(\mathrm{M}_*/\mathrm{M}_\odot) = 8.0 - 10.0$, utilizing data from eight JWST Cycle-1 fields (CEERS, JADES GOODS-S, NEP-TDF, NGDEEP, GLASS, El-Gordo, SMACS-0723, MACS-0416), covering an unmasked area of 189.36 $\mathrm{arcmin}^2$. We develop a new probabilistic pair-counting methodology that integrates full photometric redshift posteriors and corrects for detection incompleteness to quantify close pairs with physical projected separations between 20 and 50 kpc. Our analysis reveals an increase in pair fractions up to $z = 8$, reaching $0.211 \pm 0.065$, followed by a statistically flat evolution to $z = 11.5$. We find that the galaxy merger rate increases from the local Universe up to $z = 6$ and then stabilizes at a value of $\sim 6$ Gyr$^{-1}$ up to $z = 11.5$. We fit both a power-law and a power-law + exponential model to our pair fraction and merger rate redshift evolution, finding that the latter model describes the trends more accurately, particularly at $z = 8.0 - 11.5$. In addition, we measure that the average galaxy increases its stellar mass due to mergers by a factor of $2.77 \pm 0.99$ from redshift $z = 10.5$ to $z = 5.0$. Lastly, we investigate the impact of mergers on galaxy stellar mass growth, revealing that mergers contribute $71 \pm 25\%$ as much to galaxy stellar mass increases as star formation from gas. This indicates that mergers drive about half of galaxy assembly at high redshift.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
A Versatile Side Entry Laser System for Scanning Transmission Electron Microscopy
Authors:
Ondrej Dyck,
Olugbenga Olunloyo,
Kai Xiao,
Benjamin Wolf,
Thomas M. Moore,
Andrew R. Lupini,
Stephen Jesse
Abstract:
We present the design and implementation of a side entry laser system designed for an ultra-high vacuum scanning transmission electron microscope. This system uses a versatile probe design enclosed in a vacuum envelope such that parts can be easily aligned, modified, or exchanged without disturbing the vacuum. The system uses a mirror mounted on the sample holder such that the sample can be illumi…
▽ More
We present the design and implementation of a side entry laser system designed for an ultra-high vacuum scanning transmission electron microscope. This system uses a versatile probe design enclosed in a vacuum envelope such that parts can be easily aligned, modified, or exchanged without disturbing the vacuum. The system uses a mirror mounted on the sample holder such that the sample can be illuminated without being tilted. Notably the mirror can be removed and replaced with an ablation target and a higher power laser used to ablate material directly onto the sample. We argue that new capabilities hold the potential to transform the electron microscope from an analysis tool towards a more flexible synthesis system, where atomic scale fabrication and atom-by-atom experiments can be performed.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
How coronal mass ejections are influenced by the morphology and toroidal flux of their source magnetic flux ropes?
Authors:
J. H. Guo,
L. Linan,
S. Poedts,
Y. Guo,
B. Schmieder,
A. Lani,
Y. W. Ni,
M. Brchnelova,
B. Perri,
T. Baratashvili,
S. T. Li,
P. F. Chen
Abstract:
Coronal mass ejections (CMEs) stand as intense eruptions of magnetized plasma from the Sun, playing a pivotal role in driving significant changes of the heliospheric environment. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space weather forecasting. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space…
▽ More
Coronal mass ejections (CMEs) stand as intense eruptions of magnetized plasma from the Sun, playing a pivotal role in driving significant changes of the heliospheric environment. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space weather forecasting. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space weather forecasting. The primary objective of this paper is to establish a connection between CMEs and their progenitors in solar source regions, enabling us to infer the magnetic structures of CMEs before their full development. To this end, we create a dataset comprising a magnetic flux rope series with varying projection shapes, sizes and toroidal fluxes, using the Regularized Biot-Savart Laws (RBSL). Thereafter, we simulate the propagation of these flux ropes from the solar surface to a distance of 25$R_{\odot}$ with our global coronal MHD model which is named COCONUT. Our parametric survey reveals significant impacts of source flux ropes on the consequent CMEs. We find that the projection shape can influence the magnetic structures of CMEs at 20$R_{\odot}$, albeit with minimal impacts on the propagation speed. However, these impacts diminish as source flux ropes become fat. In terms of toroidal flux, our simulation results demonstrate a pronounced correlation with the propagation speed of CMEs, as well as the successfulness in erupting. This work builds the bridge between the CMEs in the outer corona and their progenitors in solar source regions. Our parametric survey suggests that the projection shape, cross-section radius and toroidal flux of source flux ropes are crucial parameters in predicting magnetic structures and propagation speed of CMEs, providing valuable insights for space weather prediction.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts
Authors:
Amelia F. Hardy,
Houjun Liu,
Bernard Lange,
Mykel J. Kochenderfer
Abstract:
Typical schemes for automated red-teaming large language models (LLMs) focus on discovering prompts that trigger a frozen language model (the defender) to generate toxic text. This often results in the prompting model (the adversary) producing text that is unintelligible and unlikely to arise. Here, we propose a reinforcement learning formulation of the LLM red-teaming task which allows us to disc…
▽ More
Typical schemes for automated red-teaming large language models (LLMs) focus on discovering prompts that trigger a frozen language model (the defender) to generate toxic text. This often results in the prompting model (the adversary) producing text that is unintelligible and unlikely to arise. Here, we propose a reinforcement learning formulation of the LLM red-teaming task which allows us to discover prompts that both (1) trigger toxic outputs from a frozen defender and (2) have low perplexity as scored by the defender. We argue these cases are most pertinent in a red-teaming setting because of their likelihood to arise during normal use of the defender model. We solve this formulation through a novel online and weakly supervised variant of Identity Preference Optimization (IPO) on GPT-2 and GPT-2 XL defenders. We demonstrate that our policy is capable of generating likely prompts that also trigger toxicity. Finally, we qualitatively analyze learned strategies, trade-offs of likelihood and toxicity, and discuss implications. Source code is available for this project at: https://github.com/sisl/ASTPrompter/.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Intensive broadband reverberation map** of Fairall 9 with 1.8 years of daily Swift monitoring
Authors:
R. Edelson,
B. M. Peterson,
J. Gelbord,
K. Horne,
M. Goad,
I. McHardy,
S. Vaughan,
M. Vestergaard
Abstract:
We present 1.8 years of near-daily Swift monitoring of the bright, strongly variable Type 1 AGN Fairall 9. Totaling 575 successful visits, this is the largest such campaign reported to date. Variations within the UV/optical are well-correlated, with longer wavelengths lagging shorter wavelengths in the direction predicted by thin disk/lamp-post models. The correlations are improved by detrending;…
▽ More
We present 1.8 years of near-daily Swift monitoring of the bright, strongly variable Type 1 AGN Fairall 9. Totaling 575 successful visits, this is the largest such campaign reported to date. Variations within the UV/optical are well-correlated, with longer wavelengths lagging shorter wavelengths in the direction predicted by thin disk/lamp-post models. The correlations are improved by detrending; subtracting a second-order polynomial fit to the UV/optical light curves to remove long-term trends that are not of interest to this study. Extensive testing indicates detrending with higher-order polynomials removes too much intrinsic variability signal on reverberation timescales. These data provide the clearest detection to date of interband lags within the UV, indicating that neither emission from a large disk nor diffuse continuum emission from the broad-line region can independently explain the full observed lag spectrum. The observed X-ray flux variations are poorly correlated with those in the UV/optical. Further, subdivision of the data into four ~160 day light curves shows that the UV/optical lag spectrum is highly stable throughout the four periods, but the X-ray to UV lags are unstable, significantly changing magnitude and even direction from one period to the next. This indicates the X-ray to UV relationship is more complex than predicted by the simple reprocessing model often adopted for AGN. A bowl model (lamp-post irradiation and blackbody reprocessing on a disk with a steep rim) fit suggests the disk thickens at a distance (~10 lt-day) and temperature (~8000K) consistent with the inner edge of the BLR.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Addressing Confounding and Continuous Exposure Measurement Error Using Corrected Score Functions
Authors:
Brian D. Richardson,
Bryan S. Blette,
Peter B. Gilbert,
Michael G. Hudgens
Abstract:
Confounding and exposure measurement error can introduce bias when drawing inference about the marginal effect of an exposure on an outcome of interest. While there are broad methodologies for addressing each source of bias individually, confounding and exposure measurement error frequently co-occur and there is a need for methods that address them simultaneously. In this paper, corrected score me…
▽ More
Confounding and exposure measurement error can introduce bias when drawing inference about the marginal effect of an exposure on an outcome of interest. While there are broad methodologies for addressing each source of bias individually, confounding and exposure measurement error frequently co-occur and there is a need for methods that address them simultaneously. In this paper, corrected score methods are derived under classical additive measurement error to draw inference about marginal exposure effects using only measured variables. Three estimators are proposed based on g-formula, inverse probability weighting, and doubly-robust estimation techniques. The estimators are shown to be consistent and asymptotically normal, and the doubly-robust estimator is shown to exhibit its namesake property. The methods, which are implemented in the R package mismex, perform well in finite samples under both confounding and measurement error as demonstrated by simulation studies. The proposed doubly-robust estimator is applied to study the effects of two biomarkers on HIV-1 infection using data from the HVTN 505 preventative vaccine trial.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Neuroevolution of Decentralized Decision-Making in N-Bead Swimmers Leads to Scalable and Robust Collective Locomotion
Authors:
Benedikt Hartl,
Michael Levin,
Andreas Zöttl
Abstract:
Many microorganisms are capable of swimming through viscous fluids such as water in order to search for nutrients, swim toward oxygen or light, or to escape from predators. To navigate their environment they often perform large nonreciprocal periodic deformations of their shape, by waving appendages such as cilia or flagella, or by deforming their entire body. Even unicellular organisms are fundam…
▽ More
Many microorganisms are capable of swimming through viscous fluids such as water in order to search for nutrients, swim toward oxygen or light, or to escape from predators. To navigate their environment they often perform large nonreciprocal periodic deformations of their shape, by waving appendages such as cilia or flagella, or by deforming their entire body. Even unicellular organisms are fundamentally made of parts, which need to be cooperatively utilized to allow these creatures to navigate their environment, without using a centralized control mechanism. Here, we investigate the physical implications of decentralized decision-making of the actuators of a generalized N-bead Najafi Golestanian microswimmer, self-propelling via coordinated non-reciprocal swimming strokes. We treat each bead as an artificial neural network-based agent that perceives information about its neighbors and whose actions induce strokes of its adjacent arms. With neuroevolution techniques, we evolve optimal policies for the single-bead decision centers such that the N-bead collective efficiently self-propels as an individual, allowing us to investigate optimal locomotion policies for increasingly large microswimmer bodies. We demonstrate that such decentralized policies are robust and tolerant concerning morphological changes or defects and facilitate cargo transport or drug delivery applications "out of the box", without further optimization. Our approach allows us to train large swimmers ($N=100$ and more), and we show that long-wavelength solutions lead to surprisingly efficient swimming gaits. Our work is of relevance to understand robust locomotion of biological microswimmers, to develop robust artificial microswimmer navigation strategies, and, in a broader conceptional context, for Artificial Life< and in general emergent levels of individuality.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
A Perspective on Foundation Models for the Electric Power Grid
Authors:
Hendrik F. Hamann,
Thomas Brunschwiler,
Blazhe Gjorgiev,
Leonardo S. A. Martins,
Alban Puech,
Anna Varbella,
Jonas Weiss,
Juan Bernabe-Moreno,
Alexandre Blondin Massé,
Seong Choi,
Ian Foster,
Bri-Mathias Hodge,
Rishabh Jain,
Kibaek Kim,
Vincent Mai,
François Mirallès,
Martin De Montigny,
Octavio Ramos-Leaños,
Hussein Suprême,
Le Xie,
El-Nasser S. Youssef,
Arnaud Zinflou,
Alexander J. Belvi,
Ricardo J. Bessa,
Bishnu Prasad Bhattari
, et al. (2 additional authors not shown)
Abstract:
Foundation models (FMs) currently dominate news headlines. They employ advanced deep learning architectures to extract structural information autonomously from vast datasets through self-supervision. The resulting rich representations of complex systems and dynamics can be applied to many downstream applications. Therefore, FMs can find uses in electric power grids, challenged by the energy transi…
▽ More
Foundation models (FMs) currently dominate news headlines. They employ advanced deep learning architectures to extract structural information autonomously from vast datasets through self-supervision. The resulting rich representations of complex systems and dynamics can be applied to many downstream applications. Therefore, FMs can find uses in electric power grids, challenged by the energy transition and climate change. In this paper, we call for the development of, and state why we believe in, the potential of FMs for electric grids. We highlight their strengths and weaknesses amidst the challenges of a changing grid. We argue that an FM learning from diverse grid data and topologies could unlock transformative capabilities, pioneering a new approach in leveraging AI to redefine how we manage complexity and uncertainty in the electric grid. Finally, we discuss a power grid FM concept, namely GridFM, based on graph neural networks and show how different downstream tasks benefit.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Open (Clinical) LLMs are Sensitive to Instruction Phrasings
Authors:
Alberto Mario Ceballos Arroyo,
Monica Munnangi,
Jiuding Sun,
Karen Y. C. Zhang,
Denis Jered McInerney,
Byron C. Wallace,
Silvio Amir
Abstract:
Instruction-tuned Large Language Models (LLMs) can perform a wide range of tasks given natural language instructions to do so, but they are sensitive to how such instructions are phrased. This issue is especially concerning in healthcare, as clinicians are unlikely to be experienced prompt engineers and the potential consequences of inaccurate outputs are heightened in this domain.
This raises a…
▽ More
Instruction-tuned Large Language Models (LLMs) can perform a wide range of tasks given natural language instructions to do so, but they are sensitive to how such instructions are phrased. This issue is especially concerning in healthcare, as clinicians are unlikely to be experienced prompt engineers and the potential consequences of inaccurate outputs are heightened in this domain.
This raises a practical question: How robust are instruction-tuned LLMs to natural variations in the instructions provided for clinical NLP tasks? We collect prompts from medical doctors across a range of tasks and quantify the sensitivity of seven LLMs -- some general, others specialized -- to natural (i.e., non-adversarial) instruction phrasings. We find that performance varies substantially across all models, and that -- perhaps surprisingly -- domain-specific models explicitly trained on clinical data are especially brittle, compared to their general domain counterparts. Further, arbitrary phrasing differences can affect fairness, e.g., valid but distinct instructions for mortality prediction yield a range both in overall performance, and in terms of differences between demographic groups.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Ambiguous Resonances in Multipulse Quantum Sensing with Nitrogen Vacancy Centers
Authors:
Lucas Tsunaki,
Anmol Singh,
Kseniia Volkova,
Sergei Trofimov,
Tommaso Pregnolato,
Tim Schröder,
Boris Naydenov
Abstract:
Dynamical decoupling multipulse sequences can be applied to solid state spins for sensing weak oscillating fields from nearby single nuclear spins. By periodically reversing the probing system's evolution, other noises are counteracted and filtered out over the total evolution. However, the technique is subject to intricate interactions resulting in additional resonant responses, which can be misi…
▽ More
Dynamical decoupling multipulse sequences can be applied to solid state spins for sensing weak oscillating fields from nearby single nuclear spins. By periodically reversing the probing system's evolution, other noises are counteracted and filtered out over the total evolution. However, the technique is subject to intricate interactions resulting in additional resonant responses, which can be misinterpreted with the actual signal intended to be measured. We experimentally characterized three of these effects present in single nitrogen vacancy centers in diamond, where we also developed a numerical simulations model without rotating wave approximations, showing robust correlation to the experimental data. Regarding centers with the $^{15}$N nitrogen isotope, we observed that a small misalignment in the bias magnetic field causes the precession of the nitrogen nuclear spin to be sensed by the electronic spin of the center. Another studied case of ambiguous resonances comes from the coupling with lattice $^{13}$C nuclei, where we reconstructed the interaction Hamiltonian based on echo modulation frequencies and used this Hamiltonian to simulate multipulse sequences. Finally, we also measured and simulated the effects from the free evolution of the quantum system during finite pulse durations. Due to the large data volume and the strong dependency of these ambiguous resonances with specific experimental parameters, we provide a simulations dataset with a user-friendly graphical interface, where users can compare simulations with their own experimental data for spectral disambiguation. Although focused with nitrogen vacancy centers and dynamical decoupling sequences, these results and the developed model can potentially be applied to other solid state spins and quantum sensing techniques.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Taming spin susceptibilities in frustrated quantum magnets: Mean-field form and approximate nature of the quantum-to-classical correspondence
Authors:
Benedikt Schneider,
Björn Sbierski
Abstract:
In frustrated quantum magnets the empirically found quantum-to-classical correspondence (QCC) matches the real-space static susceptibility pattern of a quantum spin-$1/2$ model with its classical counterpart computed at a certain elevated temperature. This puzzling relation was observed via bold line diagrammatic Monte Carlo simulations in dimensions two and three, where the matching was within er…
▽ More
In frustrated quantum magnets the empirically found quantum-to-classical correspondence (QCC) matches the real-space static susceptibility pattern of a quantum spin-$1/2$ model with its classical counterpart computed at a certain elevated temperature. This puzzling relation was observed via bold line diagrammatic Monte Carlo simulations in dimensions two and three, where the matching was within error bars and seemed valid down to the lowest accessible temperatures $T$ about an order of magnitude smaller than the exchange coupling $J$. Here we employ resummed spin diagrammatic perturbation theory to show analytically that the QCC breaks at fourth order in $J/T$ and provide the approximate map** between classical and quantum temperatures. Our treatment further reveals that QCC is an indication of the surprising accuracy with which static correlators can be approximated by a simple renormalized mean-field form. We illustrate this for all models discussed in the context of QCC so far, including a recent example of the $S=1$ material $\mathrm{K}_2\mathrm{Ni}_2(\mathrm{SO}_4)_3$. The success of the mean-field form is traced back to partial diagrammatic cancellations.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
A grid of self-consistent MSG (MARCS-StaticWeather-GGchem) cool stellar, sub-stellar, and exoplanetary model atmospheres
Authors:
Uffe G. Jørgensen,
Flavia Amadio,
Beatriz Campos Estrada,
Kristian Holten Møller,
Aaron D. Schneider,
Thorsten Balduin,
Azzurra D'Alessandro,
Eftychia Symeonidou,
Christiane Helling,
Åke Nordlund,
Peter Woitke
Abstract:
Computation of a grid of self consistent 1D model atmospheres of cool stars, sub-stellar objects and exoplanets in the effective temperature range 300K to 3000K, including cloud formation, chemical non-equilibrium effects, and stellar irradiation.
The models are called MSG, because they are based on an iterative coupling between three well tested codes, the MARCS stellar atmosphere code, the Sta…
▽ More
Computation of a grid of self consistent 1D model atmospheres of cool stars, sub-stellar objects and exoplanets in the effective temperature range 300K to 3000K, including cloud formation, chemical non-equilibrium effects, and stellar irradiation.
The models are called MSG, because they are based on an iterative coupling between three well tested codes, the MARCS stellar atmosphere code, the StaticWeather cloud formation code and the GGchem chemical equilibrium code. It includes up-to-date molecular and atomic opacities, cloud formation and advanced chemical equilibrium calculations, and involves new numerical methods at low temperatures to allow robust convergence.
The coupling between the MARCS radiative transfer and GGchem chemical equilibrium computations has made it possibly effectively to reach convergence based on electron pressure for the warmer models and gas pressure for the cooler models, enabling self-consistent modelling of stellar, sub-stellar and exoplanetary objects in a very wide range of effective temperatures. Here we describe the basic details of the models, with illustrative examples of cloudy and irradiated models as well as models based on non-equilibrium chemistry.
The qualitative changes in the relative abundances of TiO, H2O, CH4, NH3, and other molecules in our models follow the observationally defined M, L, T (and Y) sequences, but reveal more complex and depth dependent abundance changes, and therefore a spectral classification depending on more parameters. The self consistent coupling to Static-Weather cloud computations, allows detailed comparison between nucleation and observed relative dimming of different spectral bands, with advanced applications for new identification methods of potential exoplanetary biology.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Deep Bag-of-Words Model: An Efficient and Interpretable Relevance Architecture for Chinese E-Commerce
Authors:
Zhe Lin,
Jiwei Tan,
Dan Ou,
Xi Chen,
Shaowei Yao,
Bo Zheng
Abstract:
Text relevance or text matching of query and product is an essential technique for the e-commerce search system to ensure that the displayed products can match the intent of the query. Many studies focus on improving the performance of the relevance model in search system. Recently, pre-trained language models like BERT have achieved promising performance on the text relevance task. While these mo…
▽ More
Text relevance or text matching of query and product is an essential technique for the e-commerce search system to ensure that the displayed products can match the intent of the query. Many studies focus on improving the performance of the relevance model in search system. Recently, pre-trained language models like BERT have achieved promising performance on the text relevance task. While these models perform well on the offline test dataset, there are still obstacles to deploy the pre-trained language model to the online system as their high latency. The two-tower model is extensively employed in industrial scenarios, owing to its ability to harmonize performance with computational efficiency. Regrettably, such models present an opaque ``black box'' nature, which prevents developers from making special optimizations. In this paper, we raise deep Bag-of-Words (DeepBoW) model, an efficient and interpretable relevance architecture for Chinese e-commerce. Our approach proposes to encode the query and the product into the sparse BoW representation, which is a set of word-weight pairs. The weight means the important or the relevant score between the corresponding word and the raw text. The relevance score is measured by the accumulation of the matched word between the sparse BoW representation of the query and the product. Compared to popular dense distributed representation that usually suffers from the drawback of black-box, the most advantage of the proposed representation model is highly explainable and interventionable, which is a superior advantage to the deployment and operation of online search engines. Moreover, the online efficiency of the proposed model is even better than the most efficient inner product form of dense representation ...
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Seasonal variation of Saturn's Lyman-$α$ brightness
Authors:
P. Stephenson,
T. T. Koskinen,
Z. Brown,
E. Quémerais,
P. Lavvas,
J. I. Moses,
B. Sandel,
R. Yelle
Abstract:
We examine Saturn's non-auroral (dayglow) emissions at Lyman-$α$ observed by the {Cassini/UVIS} instrument from 2004 until 2016, to constrain meridional and seasonal trends in the upper atmosphere. We separate viewing geometry effects from trends driven by atmospheric properties, by applying a multi-variate regression to the observed emissions. The Lyman-$α$ dayglow brightnesses depend on the inci…
▽ More
We examine Saturn's non-auroral (dayglow) emissions at Lyman-$α$ observed by the {Cassini/UVIS} instrument from 2004 until 2016, to constrain meridional and seasonal trends in the upper atmosphere. We separate viewing geometry effects from trends driven by atmospheric properties, by applying a multi-variate regression to the observed emissions. The Lyman-$α$ dayglow brightnesses depend on the incident solar flux, solar incidence angle, emission angle, and observed latitude. The emissions across latitudes and seasons show a strong dependence with solar incidence angle, typical of resonantly scattered solar flux and consistent with no significant internal source. We observe a bulge in Ly-$α$ brightness that shifts with the summer season from the southern to the northern hemisphere. We estimate atomic hydrogen optical depths above the methane homopause level for dayside disk observations (2004-2016) by comparing observed Lyman-$α$ emissions to a radiative transfer model. We model emissions from resonantly scattered solar flux and a smaller but significant contribution by scattered photons from the interplanetary hydrogen (IPH) background. During northern summer, inferred hydrogen optical depths steeply decrease with latitude towards the winter hemisphere from a northern hemisphere bulge, as predicted by a 2D seasonal photochemical model. The southern hemisphere mirrors this trend during its summer. However, inferred optical depths show substantially more temporal variation between 2004 and 2016 than predicted by the photochemical model.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
SynCOM: An Empirical Model for High-Resolution Simulations of Transient Solar Wind Flows
Authors:
Valmir P. Moraes Filho,
Vadim M. Uritsky,
Barbara J. Thompson,
Sarah E. Gibson,
Craig E. DeForest
Abstract:
The Synthetic Corona Outflow Model (SynCOM), an empirical model, simulates the solar corona's dynamics to match high-resolution observations, providing a useful resource for testing velocity measurement algorithms. SynCOM generates synthetic images depicting radial variability in polarized brightness and includes stochastic elements for plasma outflows and instrumental noise. It employs a predefin…
▽ More
The Synthetic Corona Outflow Model (SynCOM), an empirical model, simulates the solar corona's dynamics to match high-resolution observations, providing a useful resource for testing velocity measurement algorithms. SynCOM generates synthetic images depicting radial variability in polarized brightness and includes stochastic elements for plasma outflows and instrumental noise. It employs a predefined flow velocity probability distribution and an adjustable signal-to-noise ratio to evaluate different data analysis methods for coronal flows. By adjusting parameters to match specific coronal and instrumental conditions, SynCOM offers a platform to assess these methods for determining coronal velocity and acceleration. Validating these measurements would help to understand solar wind origins and support missions such as the Polarimeter to Unify the Corona and Heliosphere (PUNCH). In this study, we demonstrate how SynCOM can be employed to assess the precision and performance of two different flow tracking methods. By providing a ground-truth based on observational data, we highlight the importance of SynCOM in confirming observational standards for detecting coronal flows.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Graph Neural Network Causal Explanation via Neural Causal Models
Authors:
Arman Behnam,
Binghui Wang
Abstract:
Graph neural network (GNN) explainers identify the important subgraph that ensures the prediction for a given graph. Until now, almost all GNN explainers are based on association, which is prone to spurious correlations. We propose {\name}, a GNN causal explainer via causal inference. Our explainer is based on the observation that a graph often consists of a causal underlying subgraph. {\name} inc…
▽ More
Graph neural network (GNN) explainers identify the important subgraph that ensures the prediction for a given graph. Until now, almost all GNN explainers are based on association, which is prone to spurious correlations. We propose {\name}, a GNN causal explainer via causal inference. Our explainer is based on the observation that a graph often consists of a causal underlying subgraph. {\name} includes three main steps: 1) It builds causal structure and the corresponding structural causal model (SCM) for a graph, which enables the cause-effect calculation among nodes. 2) Directly calculating the cause-effect in real-world graphs is computationally challenging. It is then enlightened by the recent neural causal model (NCM), a special type of SCM that is trainable, and design customized NCMs for GNNs. By training these GNN NCMs, the cause-effect can be easily calculated. 3) It uncovers the subgraph that causally explains the GNN predictions via the optimized GNN-NCMs. Evaluation results on multiple synthetic and real-world graphs validate that {\name} significantly outperforms existing GNN explainers in exact groundtruth explanation identification
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Grain boundaries control lithiation of solid solution substrates in lithium metal batteries
Authors:
Leonardo Shoji Aota,
Chanwon Jung,
Siyuan Zhang,
Ömer K. Büyükuslu,
Poonam Yadav,
Mahander Pratap Singh,
Xinren Chen,
Eric Woods,
Christina Scheu,
Se-Ho Kim,
Dierk Raabe,
Baptiste Gault
Abstract:
The development of sustainable transportation and communication systems requires an increase in both energy density and capacity retention of Li-batteries. Using substrates forming a solid solution with body centered cubic Li enhances the cycle stability of anode-less batteries. However, it remains unclear how the substrate microstructure affects the lithiation behavior. Here, we deploy a correlat…
▽ More
The development of sustainable transportation and communication systems requires an increase in both energy density and capacity retention of Li-batteries. Using substrates forming a solid solution with body centered cubic Li enhances the cycle stability of anode-less batteries. However, it remains unclear how the substrate microstructure affects the lithiation behavior. Here, we deploy a correlative, near-atomic scale probing approach through combined ion- and electron-microscopy to examine the distribution of Li in Li-Ag diffusion couples as model system. We reveal that Li regions with over 93.8% at.% nucleate within Ag at random high angle grain boundaries, whereas grain interiors are not lithiated. We evidence the role of kinetics and mechanical constraint from the microstructure over equilibrium thermodynamics in dictating the lithiation process. The findings suggest that grain size and grain boundary character are critical to enhance the electrochemical performance of interlayers/electrodes, particularly for improving lithiation kinetics and hence reducing dendrite formation.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Kekulé valence bond order in the honeycomb lattice optical Su-Schrieffer-Heeger Model and its relevance to Graphen
Authors:
Sohan Malkaruge Costa,
Benjamin Cohen-Stead,
Steven Johnston
Abstract:
We perform sign-problem-free determinant quantum Monte Carlo simulations of the optical Su-Schrieffer-Heeger (SSH) model on a half-filled honeycomb lattice. In particular, we investigate the model's semi-metal (SM) to Kekul{é} Valence Bond Solid (KVBS) phase transition at zero and finite temperatures as a function of phonon energy and interaction strength. Using hybrid Monte Carlo sampling methods…
▽ More
We perform sign-problem-free determinant quantum Monte Carlo simulations of the optical Su-Schrieffer-Heeger (SSH) model on a half-filled honeycomb lattice. In particular, we investigate the model's semi-metal (SM) to Kekul{é} Valence Bond Solid (KVBS) phase transition at zero and finite temperatures as a function of phonon energy and interaction strength. Using hybrid Monte Carlo sampling methods we can simulate the model near the adiabatic regime, allowing us to access regions of parameter space relevant to graphene. Our simulations suggest that the SM-KVBS transition is weakly first-order at all temperatures, with graphene situated close to the phase boundary in the SM region of the phase diagram. Our results highlight the important role bond-stretching phonon modes play in the formation of KVBS order in strained graphene-derived systems.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees
Authors:
Alexia Jolicoeur-Martineau,
Aristide Baratin,
Kisoo Kwon,
Boris Knyazev,
Yan Zhang
Abstract:
Generating novel molecules is challenging, with most representations leading to generative models producing many invalid molecules. Spanning Tree-based Graph Generation (STGG) is a promising approach to ensure the generation of valid molecules, outperforming state-of-the-art SMILES and graph diffusion models for unconditional generation. In the real world, we want to be able to generate molecules…
▽ More
Generating novel molecules is challenging, with most representations leading to generative models producing many invalid molecules. Spanning Tree-based Graph Generation (STGG) is a promising approach to ensure the generation of valid molecules, outperforming state-of-the-art SMILES and graph diffusion models for unconditional generation. In the real world, we want to be able to generate molecules conditional on one or multiple desired properties rather than unconditionally. Thus, in this work, we extend STGG to multi-property-conditional generation. Our approach, STGG+, incorporates a modern Transformer architecture, random masking of properties during training (enabling conditioning on any subset of properties and classifier-free guidance), an auxiliary property-prediction loss (allowing the model to self-criticize molecules and select the best ones), and other improvements. We show that STGG+ achieves state-of-the-art performance on in-distribution and out-of-distribution conditional generation, and reward maximization.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems
Authors:
Ziyuan Luo,
Boxin Shi,
Haoliang Li,
Renjie Wan
Abstract:
Electromagnetic Inverse Scattering Problems (EISP) have gained wide applications in computational imaging. By solving EISP, the internal relative permittivity of the scatterer can be non-invasively determined based on the scattered electromagnetic fields. Despite previous efforts to address EISP, achieving better solutions to this problem has remained elusive, due to the challenges posed by invers…
▽ More
Electromagnetic Inverse Scattering Problems (EISP) have gained wide applications in computational imaging. By solving EISP, the internal relative permittivity of the scatterer can be non-invasively determined based on the scattered electromagnetic fields. Despite previous efforts to address EISP, achieving better solutions to this problem has remained elusive, due to the challenges posed by inversion and discretization. This paper tackles those challenges in EISP via an implicit approach. By representing the scatterer's relative permittivity as a continuous implicit representation, our method is able to address the low-resolution problems arising from discretization. Further, optimizing this implicit representation within a forward framework allows us to conveniently circumvent the challenges posed by inverse estimation. Our approach outperforms existing methods on standard benchmark datasets. Project page: https://luo-ziyuan.github.io/Imaging-Interiors
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Heterogeneous integration of amorphous silicon carbide on thin film lithium niobate
Authors:
Zizheng Li,
Naresh Sharma,
Bruno Lopez-Rodriguez,
Roald van der Kolk,
Thomas Scholte,
Hugo Voncken,
Jasper van der Boom,
Simon Gröblacher,
Iman Esmaeil Zadeh
Abstract:
In the past decade, lithium niobate (LiNbO3 or LN) photonics, thanks to its heat-free and fast electro-optical modulation, second-order non-linearities and low loss, has been extensively investigated. Despite numerous demonstrations of high-performance LN photonics, processing lithium niobate remains challenging and suffers from incompatibilities with standard complementary metal-oxide semiconduct…
▽ More
In the past decade, lithium niobate (LiNbO3 or LN) photonics, thanks to its heat-free and fast electro-optical modulation, second-order non-linearities and low loss, has been extensively investigated. Despite numerous demonstrations of high-performance LN photonics, processing lithium niobate remains challenging and suffers from incompatibilities with standard complementary metal-oxide semiconductor (CMOS) fabrication lines, limiting its scalability. Silicon carbide (SiC) is an emerging material platform with a high refractive index, a large non-linear Kerr coefficient, and a promising candidate for heterogeneous integration with LN photonics. Current approaches of SiC/LN integration require transfer-bonding techniques, which are time-consuming, expensive, and lack precision in layer thickness. Here we show that amorphous silicon carbide (a-SiC), deposited using inductively coupled plasma enhanced chemical vapor deposition (ICPCVD) at low temperatures (< 165 C), can be conveniently integrated with LiNbO3 and processed to form high-performance photonics. Most importantly, the fabrication only involves a standard, silicon-compatible, reactive ion etching step and leaves the LiNbO3 intact, hence its compatibility with standard foundry processes. As a proof-of-principle, we fabricated waveguides and ring resonators on the developed a-SiC/LN platform and achieved intrinsic quality factors higher than 106,000 and resonance electro-optic tunability of 3.4 pm/V with 3 mm tuning length. We showcase the possibility of dense integration by fabricating and testing ring resonators with 40um radius without a noticeable loss penalty. Our platform offers a CMOS-compatible and scalable approach for implementation of future fast electro-optic modulators and reconfigurable photonic circuits as well as nonlinear processes which can benefit from involving both second and third-order nonlinearities.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Exactly-solved model of light-scattering errors in quantum simulations with metastable trapped-ion qubits
Authors:
Phillip C. Lotshaw,
Brian C. Sawyer,
Creston D. Herold,
Gilles Buchs
Abstract:
We analytically solve a model for light scattering in Ising dynamics of metastable atomic qubits, generalizing the approach of Foss-Feig {\it et al.}~[Phys.~Rev.~A {\bf 87}, 042101 (2013)] to include leakage outside the qubit manifold. We analyze the influence of these fundamental errors in simulations of proposed experiments with metastable levels of $^{40}$Ca$^+$ ions. We find that ``effective m…
▽ More
We analytically solve a model for light scattering in Ising dynamics of metastable atomic qubits, generalizing the approach of Foss-Feig {\it et al.}~[Phys.~Rev.~A {\bf 87}, 042101 (2013)] to include leakage outside the qubit manifold. We analyze the influence of these fundamental errors in simulations of proposed experiments with metastable levels of $^{40}$Ca$^+$ ions. We find that ``effective magnetic fields" generated by leaked qubits have significant impacts on spin-spin correlation functions for Greenberger-Horne-Zeilinger state preparation or for quantum simulations with strong coupling, while spin squeezing uses a much weaker coupling and is largely insensitive to the simulated leakage errors, even with a few hundred ions. Our theory and results are expected to be useful in modeling a variety of metastable qubit experiments in the future.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Nationwide frequency-dependent seismic site amplification models for Iceland
Authors:
Atefe Darzi,
Benedikt Halldorsson,
Fabrice Cotton,
Sahar Rahpeyma
Abstract:
Seismic wave amplification due to localized site conditions is an important aspect of regional seismic hazard assessment. Without systematic studies of frequency-dependent site-effects during strong Icelandic earthquakes, various local site proxies of large-scale studies in other seismic regions have been used in Iceland. Recently, earthquake site-effects were rigorously quantified for 34 stations…
▽ More
Seismic wave amplification due to localized site conditions is an important aspect of regional seismic hazard assessment. Without systematic studies of frequency-dependent site-effects during strong Icelandic earthquakes, various local site proxies of large-scale studies in other seismic regions have been used in Iceland. Recently, earthquake site-effects were rigorously quantified for 34 stations in Southwest Iceland for the first time and correlated to distinct Icelandic geological units of hard rock, rock, lava rock, and sedimentary soil. These units are prevalent throughout Iceland and herein we present 1) nationwide maps of proxies (slope, Vs30, geological units) that may contribute to a better estimation of site effects and associated, 2) frequency-dependent site-amplification maps of Iceland. The frequency-dependent site factors for each geological unit are presented at 1-30 Hz and PGA. Finally, we generate site amplification maps based on recent large-scale models developed in other seismic regions (ESRM20) and various site proxies they are based on (geology- and slope-based inferred Vs30, geomorphological sedimentary thickness). We compare site-proxy maps and amplification maps from both Icelandic and large-scale, non-Icelandic, models. Neither spatial patterns nor amplification levels in either proxy or amplification maps from large-scale non-Icelandic studies resemble those observed from local quantitative strong-motion research as presented in this study. We attribute the discrepancy primarily to the young geology of Iceland and its formation history. Additionally, we compare model performance across frequencies by assessing the bias of model predictions against empirical site amplifications in the South Iceland Seismic Zone, accounting for site-to-site variability of residuals indicating the superior performance of the local amplification model.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Function spaces on formal manifolds
Authors:
Fulin Chen,
Binyong Sun,
Chuyun Wang
Abstract:
This is a paper in a series that studies smooth relative Lie algebra homologies and cohomologies based on the theory of formal manifolds and formal Lie groups. In a previous paper, we introduce the notion of formal manifolds and develop the foundational framework of formal manifolds. In this paper, we study various function spaces on formal manifolds, including generalizations of vector-valued gen…
▽ More
This is a paper in a series that studies smooth relative Lie algebra homologies and cohomologies based on the theory of formal manifolds and formal Lie groups. In a previous paper, we introduce the notion of formal manifolds and develop the foundational framework of formal manifolds. In this paper, we study various function spaces on formal manifolds, including generalizations of vector-valued generalized functions and vector-valued distributions on smooth manifolds to the setting of formal manifolds.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Defining Name Accessibility using Scope Graphs (Extended Edition)
Authors:
Aron Zwaan,
Casper Bach Poulsen
Abstract:
Many programming languages allow programmers to regulate accessibility; i.e., annotating a declaration with keywords such as export and private to indicate where it can be accessed. Despite the importance of name accessibility for, e.g., compilers, editor auto-completion and tooling, and automated refactorings, few existing type systems provide a formal account of name accessibility.
We present…
▽ More
Many programming languages allow programmers to regulate accessibility; i.e., annotating a declaration with keywords such as export and private to indicate where it can be accessed. Despite the importance of name accessibility for, e.g., compilers, editor auto-completion and tooling, and automated refactorings, few existing type systems provide a formal account of name accessibility.
We present a declarative, executable, and language-parametric model for name accessibility, which provides a formal specification of name accessibility in Java, C#, C++, Rust, and Eiffel. We achieve this by defining name accessibility as a predicate on resolution paths through scope graphs. Since scope graphs are a language-independent model of name resolution, our model provides a uniform approach to defining different accessibility policies for different languages.
Our model is implemented in Statix, a logic language for executable type system specification using scope graphs. We evaluate its correctness on a test suite that compares it with the C#, Java, and Rust compilers, and show we can synthesize access modifiers in programs with holes accurately.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Detailed Map** of the Galactic Disk Structure in the Solar Neighborhood through LAMOST K Dwarfs
Authors:
Xi-Can Tang,
Hao Tian,
**g Li,
Bing-qiu Chen,
Yi-Rong Chen,
Chao Liu,
Dan Qiu
Abstract:
The Galactic disk is one of the main components of the Milky Way, which contributes most of the luminosity. Its structure is essential for understanding the formation and evolution of the Milky Way. Using 174,443 K-type dwarf stars observed by both LAMOST and Gaia DR3, we study the disk density profile in the local volume within 1,200 pc. In the azimuthal dimension, we find strong asymmetric signa…
▽ More
The Galactic disk is one of the main components of the Milky Way, which contributes most of the luminosity. Its structure is essential for understanding the formation and evolution of the Milky Way. Using 174,443 K-type dwarf stars observed by both LAMOST and Gaia DR3, we study the disk density profile in the local volume within 1,200 pc. In the azimuthal dimension, we find strong asymmetric signal of the thin disk. The surface density and the scale height of the southern disk significantly change versus the azimuthal angle at the same galactocentric distance $R$. Meanwhile, in the vertical dimension, the scale height of the northern disk has quite different trend than that of the southern one. The scale height of the southern disk shows a decreasing trend with $φ\sim-2.5^\circ$, and change to an increasing one with $φ\sim5.0^°$. Meanwhile, the scale height of the northern disk has a consistently smaller increase. Finally, we divide the entire sample into three subsamples based on metallicity and all three subsamples show significant non-axisymmetric and north-south asymmetric signals in the Galactic disk. Furthermore, we find that the scale height of the metal-poor ([Fe/H] $<$ -0.4 dex) subsample in the northern disk is greater than that of the metal-rich ([Fe/H] $>$ -0.1 dex) subsample. However, in the southern disk, the scale height exhibits varying relationships across different metallicity slices.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Experimental verifiable multi-client blind quantum computing on a Qline architecture
Authors:
Beatrice Polacchi,
Dominik Leichtle,
Gonzalo Carvacho,
Giorgio Milani,
Nicolò Spagnolo,
Marc Kaplan,
Elham Kashefi,
Fabio Sciarrino
Abstract:
The exploitation of certification tools by end users represents a fundamental aspect of the development of quantum technologies as the hardware scales up beyond the regime of classical simulatability. Certifying quantum networks becomes even more crucial when the privacy of their users is exposed to malicious quantum nodes or servers as in the case of multi-client distributed blind quantum computi…
▽ More
The exploitation of certification tools by end users represents a fundamental aspect of the development of quantum technologies as the hardware scales up beyond the regime of classical simulatability. Certifying quantum networks becomes even more crucial when the privacy of their users is exposed to malicious quantum nodes or servers as in the case of multi-client distributed blind quantum computing, where several clients delegate a joint private computation to remote quantum servers, e.g. federated quantum machine learning. In such protocols, security must be provided not only by kee** data hidden but also by verifying that the server is correctly performing the requested computation while minimizing the hardware assumptions on the employed devices. Notably, standard verification techniques fail in scenarios where the clients receive quantum states from untrusted sources such as, for example, in a recently demonstrated linear quantum network performing multi-client blind quantum computation. However, recent theoretical results provide techniques to verify blind quantum computations even in the case of untrusted state preparation. Equipped with such theoretical tools, in this work, we provide the first experimental implementation of a two-client verifiable blind quantum computing protocol in a distributed architecture. The obtained results represent novel perspectives for the verification of multi-tenant distributed quantum computation in large-scale networks.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Dynamic-Mode Decomposition of Geostrophically Balanced and Unbalanced Motions from SWOT
Authors:
Takaya Uchida,
Yadidya Badarvada,
Karl E. Lapo,
Xiaobiao Xu,
Brian K. Arbic,
Dimitris Menemenlis,
Luna Hiron,
Eric P. Chassignet,
Jay F. Shriver
Abstract:
The decomposition of oceanic flow into its balanced and unbalanced motions carries theoretical and practical significance for the oceanographic community. These two motions have distinct dynamical characteristics and affect the transport of tracers differently from one another. The launch of Surface Water and Ocean Topography (SWOT) satellite provides a prime opportunity to diagnose the surface ba…
▽ More
The decomposition of oceanic flow into its balanced and unbalanced motions carries theoretical and practical significance for the oceanographic community. These two motions have distinct dynamical characteristics and affect the transport of tracers differently from one another. The launch of Surface Water and Ocean Topography (SWOT) satellite provides a prime opportunity to diagnose the surface balanced and unbalanced motions on a global scale at an unprecedented spatial resolution. Here, we apply dynamic-mode decomposition (DMD), a linear-algebraic data-driven method, to a tidally-forced numerical simulation and one-day-repeat SWOT observations of sea-surface height (SSH) in the Gulf Stream extension. DMD is able to separate out the spatial modes associated with sub-inertial periods from super-inertial periods. The sub-inertial modes of DMD can be used to extract geostrophically balanced motions from SSH fields, which have an imprint of internal tides.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
DAHRS: Divergence-Aware Hallucination-Remediated SRL Projection
Authors:
Sangpil Youm,
Brodie Mather,
Chathuri Jayaweera,
Juliana Prada,
Bonnie Dorr
Abstract:
Semantic role labeling (SRL) enriches many downstream applications, e.g., machine translation, question answering, summarization, and stance/belief detection. However, building multilingual SRL models is challenging due to the scarcity of semantically annotated corpora for multiple languages. Moreover, state-of-the-art SRL projection (XSRL) based on large language models (LLMs) yields output that…
▽ More
Semantic role labeling (SRL) enriches many downstream applications, e.g., machine translation, question answering, summarization, and stance/belief detection. However, building multilingual SRL models is challenging due to the scarcity of semantically annotated corpora for multiple languages. Moreover, state-of-the-art SRL projection (XSRL) based on large language models (LLMs) yields output that is riddled with spurious role labels. Remediation of such hallucinations is not straightforward due to the lack of explainability of LLMs. We show that hallucinated role labels are related to naturally occurring divergence types that interfere with initial alignments. We implement Divergence-Aware Hallucination-Remediated SRL projection (DAHRS), leveraging linguistically-informed alignment remediation followed by greedy First-Come First-Assign (FCFA) SRL projection. DAHRS improves the accuracy of SRL projection without additional transformer-based machinery, beating XSRL in both human and automatic comparisons, and advancing beyond headwords to accommodate phrase-level SRL projection (e.g., EN-FR, EN-ES). Using CoNLL-2009 as our ground truth, we achieve a higher word-level F1 over XSRL: 87.6% vs. 77.3% (EN-FR) and 89.0% vs. 82.7% (EN-ES). Human phrase-level assessments yield 89.1% (EN-FR) and 91.0% (EN-ES). We also define a divergence metric to adapt our approach to other language pairs (e.g., English-Tagalog).
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
High-dimensional maximally entangled photon pairs in parametric down-conversion
Authors:
Richard Bernecker,
Baghdasar Baghdasaryan,
Stephan Fritzsche
Abstract:
Photon pairs generated from spontaneous parametric down-conversion are a well-established method to realize entangled bipartite photonic systems. Laguerre-Gaussian modes, which carry orbital angular momentum (OAM), are commonly exploited to engineer high-dimensional entangled quantum states experimentally. For Hilbert spaces with dimension d>2, maximally entangled states (MES) help to improve the…
▽ More
Photon pairs generated from spontaneous parametric down-conversion are a well-established method to realize entangled bipartite photonic systems. Laguerre-Gaussian modes, which carry orbital angular momentum (OAM), are commonly exploited to engineer high-dimensional entangled quantum states experimentally. For Hilbert spaces with dimension d>2, maximally entangled states (MES) help to improve the capacity and security of quantum communication protocols, among several other promising features. However, the direct generation of MES in well-defined high-dimensional subspaces of the infinite OAM basis has remained a challenge. Here, we formalize how the spatial distribution of the pump beam and the nonlinear profile of the crystal can be utilized to generate MES without additional spatial filtering of OAM modes within a subspace. We illustrate our approach with maximally entangled qutrits (d=3) and ququints (d=5).
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Region Attention Transformer for Medical Image Restoration
Authors:
Zhiwen Yang,
Haowei Chen,
Ziniu Qian,
Yang Zhou,
Hui Zhang,
Dan Zhao,
Bingzheng Wei,
Yan Xu
Abstract:
Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (\text{e.g.} the entire image or fixed patches), resulting in interference from irrelevant regions and fragmen…
▽ More
Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (\text{e.g.} the entire image or fixed patches), resulting in interference from irrelevant regions and fragmentation of continuous image content. To overcome these challenges, we introduce a novel Region Attention Transformer (RAT) that utilizes a region-based multi-head self-attention mechanism (R-MSA). The R-MSA dynamically partitions the input image into non-overlap** semantic regions using the robust Segment Anything Model (SAM) and then performs self-attention within these regions. This region partitioning is more flexible and interpretable, ensuring that only pixels from similar semantic regions complement each other, thereby eliminating interference from irrelevant regions. Moreover, we introduce a focal region loss to guide our model to adaptively focus on recovering high-difficulty regions. Extensive experiments demonstrate the effectiveness of RAT in various medical image restoration tasks, including PET image synthesis, CT image denoising, and pathological image super-resolution. Code is available at \href{https://github.com/Yaziwel/Region-Attention-Transformer-for-Medical-Image-Restoration.git}{https://github.com/RAT}.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
The Sociolinguistic Foundations of Language Modeling
Authors:
Jack Grieve,
Sara Bartl,
Matteo Fuoli,
Jason Grafmiller,
Weihang Huang,
Alejandro Jawerbaum,
Akira Murakami,
Marcus Perlman,
Dana Roemling,
Bodo Winter
Abstract:
In this paper, we introduce a sociolinguistic perspective on language modeling. We claim that large language models are inherently models of varieties of language, and we consider how this insight can inform the development and deployment of large language models. We begin by presenting a technical definition of the concept of a variety of language as developed in sociolinguistics. We then discuss…
▽ More
In this paper, we introduce a sociolinguistic perspective on language modeling. We claim that large language models are inherently models of varieties of language, and we consider how this insight can inform the development and deployment of large language models. We begin by presenting a technical definition of the concept of a variety of language as developed in sociolinguistics. We then discuss how this perspective can help address five basic challenges in language modeling: social bias, domain adaptation, alignment, language change, and scale. Ultimately, we argue that it is crucial to carefully define and compile training corpora that accurately represent the specific varieties of language being modeled to maximize the performance and societal value of large language models.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Prompts First, Finally
Authors:
Brent N. Reeves,
James Prather,
Paul Denny,
Juho Leinonen,
Stephen MacNeil,
Brett A. Becker,
Andrew Luxton-Reilly
Abstract:
Generative AI (GenAI) and large language models in particular, are disrupting Computer Science Education. They are proving increasingly capable at more and more challenges. Some educators argue that they pose a serious threat to computing education, and that we should ban their use in the classroom. While there are serious GenAI issues that remain unsolved, it may be useful in the present moment t…
▽ More
Generative AI (GenAI) and large language models in particular, are disrupting Computer Science Education. They are proving increasingly capable at more and more challenges. Some educators argue that they pose a serious threat to computing education, and that we should ban their use in the classroom. While there are serious GenAI issues that remain unsolved, it may be useful in the present moment to step back and examine the overall trajectory of Computer Science writ large. Since the very beginning, our discipline has sought to increase the level of abstraction in each new representation. We have progressed from hardware dip switches, through special purpose languages and visual representations like flow charts, all the way now to ``natural language.'' With the advent of GenAI, students can finally change the abstraction level of a problem to the ``language'' they've been ``problem solving'' with all their lives. In this paper, we argue that our programming abstractions were always headed here -- to natural language. Now is the time to adopt a ``Prompts First'' approach to Computer Science Education.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
HUP-3D: A 3D multi-view synthetic dataset for assisted-egocentric hand-ultrasound pose estimation
Authors:
Manuel Birlo,
Razvan Caramalau,
Philip J. "Eddie" Edwards,
Brian Dromey,
Matthew J. Clarkson,
Danail Stoyanov
Abstract:
We present HUP-3D, a 3D multi-view multi-modal synthetic dataset for hand-ultrasound (US) probe pose estimation in the context of obstetric ultrasound. Egocentric markerless 3D joint pose estimation has potential applications in mixed reality based medical education. The ability to understand hand and probe movements programmatically opens the door to tailored guidance and mentoring applications.…
▽ More
We present HUP-3D, a 3D multi-view multi-modal synthetic dataset for hand-ultrasound (US) probe pose estimation in the context of obstetric ultrasound. Egocentric markerless 3D joint pose estimation has potential applications in mixed reality based medical education. The ability to understand hand and probe movements programmatically opens the door to tailored guidance and mentoring applications. Our dataset consists of over 31k sets of RGB, depth and segmentation mask frames, including pose related ground truth data, with a strong emphasis on image diversity and complexity. Adopting a camera viewpoint-based sphere concept allows us to capture a variety of views and generate multiple hand grasp poses using a pre-trained network. Additionally, our approach includes a software-based image rendering concept, enhancing diversity with various hand and arm textures, lighting conditions, and background images. Furthermore, we validated our proposed dataset with state-of-the-art learning models and we obtained the lowest hand-object keypoint errors. The dataset and other details are provided with the supplementary material. The source code of our grasp generation and rendering pipeline will be made publicly available.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Projection onto hyperbolicity cones and beyond: a dual Frank-Wolfe approach
Authors:
Takayuki Nagano,
Bruno F. Lourenço,
Akiko Takeda
Abstract:
We discuss the problem of projecting a point onto an arbitrary hyperbolicity cone from both theoretical and numerical perspectives. While hyperbolicity cones are furnished with a generalization of the notion of eigenvalues, obtaining closed form expressions for the projection operator as in the case of semidefinite matrices is an elusive endeavour. To address that we propose a Frank-Wolfe method t…
▽ More
We discuss the problem of projecting a point onto an arbitrary hyperbolicity cone from both theoretical and numerical perspectives. While hyperbolicity cones are furnished with a generalization of the notion of eigenvalues, obtaining closed form expressions for the projection operator as in the case of semidefinite matrices is an elusive endeavour. To address that we propose a Frank-Wolfe method to handle this task and, more generally, strongly convex optimization over closed convex cones. One of our innovations is that the Frank-Wolfe method is actually applied to the dual problem and, by doing so, subproblems can be solved in closed-form using minimum eigenvalue functions and conjugate vectors. To test the validity of our proposed approach, we present numerical experiments where we check the performance of alternative approaches including interior point methods and an earlier accelerated gradient method proposed by Renegar. We also show numerical examples where the hyperbolic polynomial has millions of monomials. Finally, we also discuss the problem of projecting onto p-cones which, although not hyperbolicity cones in general, are still amenable to our techniques.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Generating SROI^{-} Ontologies via Knowledge Graph Query Embedding Learning
Authors:
Yunjie He,
Daniel Hernandez,
Mojtaba Nayyeri,
Bo Xiong,
Yuqicheng Zhu,
Evgeny Kharlamov,
Steffen Staab
Abstract:
Query embedding approaches answer complex logical queries over incomplete knowledge graphs (KGs) by computing and operating on low-dimensional vector representations of entities, relations, and queries. However, current query embedding models heavily rely on excessively parameterized neural networks and cannot explain the knowledge learned from the graph. We propose a novel query embedding method,…
▽ More
Query embedding approaches answer complex logical queries over incomplete knowledge graphs (KGs) by computing and operating on low-dimensional vector representations of entities, relations, and queries. However, current query embedding models heavily rely on excessively parameterized neural networks and cannot explain the knowledge learned from the graph. We propose a novel query embedding method, AConE, which explains the knowledge learned from the graph in the form of SROI^{-} description logic axioms while being more parameter-efficient than most existing approaches. AConE associates queries to a SROI^{-} description logic concept. Every SROI^{-} concept is embedded as a cone in complex vector space, and each SROI^{-} relation is embedded as a transformation that rotates and scales cones. We show theoretically that AConE can learn SROI^{-} axioms, and defines an algebra whose operations correspond one to one to SROI^{-} description logic concept constructs. Our empirical study on multiple query datasets shows that AConE achieves superior results over previous baselines with fewer parameters. Notably on the WN18RR dataset, AConE achieves significant improvement over baseline models. We provide comprehensive analyses showing that the capability to represent axioms positively impacts the results of query answering.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
On the Design and Security of Collective Remote Attestation Protocols
Authors:
Sharar Ahmadi,
Jay Le-Papin,
Liqun Chen,
Brijesh Dongol,
Sasa Radomirovic,
Helen Treharne
Abstract:
Collective remote attestation (CRA) is a security service that aims to efficiently identify compromised (often low-powered) devices in a (heterogeneous) network. The last few years have seen an extensive growth in CRA protocol proposals, showing a variety of designs guided by different network topologies, hardware assumptions and other functional requirements. However, they differ in their trust a…
▽ More
Collective remote attestation (CRA) is a security service that aims to efficiently identify compromised (often low-powered) devices in a (heterogeneous) network. The last few years have seen an extensive growth in CRA protocol proposals, showing a variety of designs guided by different network topologies, hardware assumptions and other functional requirements. However, they differ in their trust assumptions, adversary models and role descriptions making it difficult to uniformly assess their security guarantees. In this paper we present Catt, a unifying framework for CRA protocols that enables them to be compared systematically, based on a comprehensive study of 40 CRA protocols and their adversary models. Catt characterises the roles that devices can take and based on these we develop a novel set of security properties for CRA protocols. We then classify the security aims of all the studied protocols. We illustrate the applicability of our security properties by encoding them in the tamarin prover and verifying the SIMPLE+ protocol against them.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
The MICADO first light imager for the ELT: off-axis performance of PSF reconstruction
Authors:
Matteo Simioni,
Daniel Jodlbauer,
Carmelo Arcidiacono,
Andrea Grazian,
Marco Gullieuszik,
Elisa Portaluri,
Benedetta Vulcani,
Roland Wagner,
Anita Zanella,
Johanna Hartke,
Tapio Helin,
Hanindyo Kuncarayakti,
Fernando Pedichini,
Roberto Piazzesi,
Piero Vaccari
Abstract:
The highest scientific return, for adaptive optics (AO) observations, is achieved with a reliable reconstruction of the PSF. This is especially true for MICADO@ELT. In this presentation, we will focus on extending the MICADO PSF reconstruction (PSF-R) method to the off-axis case. Specifically, a novel approach based on temporal-based tomography of AO telemetry data has been recently implemented. R…
▽ More
The highest scientific return, for adaptive optics (AO) observations, is achieved with a reliable reconstruction of the PSF. This is especially true for MICADO@ELT. In this presentation, we will focus on extending the MICADO PSF reconstruction (PSF-R) method to the off-axis case. Specifically, a novel approach based on temporal-based tomography of AO telemetry data has been recently implemented. Results from the PSF-R of both simulated and real data show that, at half isoplanatic angle distances, a precision of about 10-15% is achievable in both Strehl ratio and full-width at half maximum, paving the way to extend the MICADO PSF-R tool also to the multi-conjugated AO case.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
A Chatbot for Asylum-Seeking Migrants in Europe
Authors:
Bettina Fazzinga,
Elena Palmieri,
Margherita Vestoso,
Luca Bolognini,
Andrea Galassi,
Filippo Furfaro,
Paolo Torroni
Abstract:
We present ACME: A Chatbot for asylum-seeking Migrants in Europe. ACME relies on computational argumentation and aims to help migrants identify the highest level of protection they can apply for. This would contribute to a more sustainable migration by reducing the load on territorial commissions, Courts, and humanitarian organizations supporting asylum applicants. We describe the context, system…
▽ More
We present ACME: A Chatbot for asylum-seeking Migrants in Europe. ACME relies on computational argumentation and aims to help migrants identify the highest level of protection they can apply for. This would contribute to a more sustainable migration by reducing the load on territorial commissions, Courts, and humanitarian organizations supporting asylum applicants. We describe the context, system architectures, technologies, and the case study used to run the demonstration.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
The JWST Weather Report from the Nearest Brown Dwarfs I: multi-period JWST NIRSpec + MIRI monitoring of the benchmark binary brown dwarf WISE 1049AB
Authors:
Beth A. Biller,
Johanna M. Vos,
Yifan Zhou,
Allison M. McCarthy,
Xianyu Tan,
Ian J. M. Crossfield,
Niall Whiteford,
Genaro Suarez,
Jacqueline Faherty,
Elena Manjavacas,
Xueqing Chen,
Pengyu Liu,
Ben J. Sutlieff,
Mary Anne Limbach,
Paul Molliere,
Trent J. Dupuy,
Natalia Oliveros-Gomez,
Philip S. Muirhead,
Thomas Henning,
Gregory Mace,
Nicolas Crouzet,
Theodora Karalidi,
Caroline V. Morley,
Pascal Tremblin,
Tiffany Kataria
Abstract:
We report results from 8 hours of JWST/MIRI LRS spectroscopic monitoring directly followed by 7 hours of JWST/NIRSpec prism spectroscopic monitoring of the benchmark binary brown dwarf WISE 1049AB, the closest, brightest brown dwarfs known. We find water, methane, and CO absorption features in both components, including the 3.3 $μ$m methane absorption feature and a tentative detection of small gra…
▽ More
We report results from 8 hours of JWST/MIRI LRS spectroscopic monitoring directly followed by 7 hours of JWST/NIRSpec prism spectroscopic monitoring of the benchmark binary brown dwarf WISE 1049AB, the closest, brightest brown dwarfs known. We find water, methane, and CO absorption features in both components, including the 3.3 $μ$m methane absorption feature and a tentative detection of small grain ($<$ 1$μ$m) silicate absorption at $>$8.5 $μ$m in WISE 1049A. Both components vary significantly ($>$1$\%$), with WISE 1049B displaying larger variations than WISE 1049A. Using K-means clustering, we find three main transition points in wavelength for both components of the binary: 1) change in behavior at $\sim$2.3 $μ$m coincident with a CO absorption bandhead, 2) change in behavior at 4.2 $μ$m, close to the CO fundamental band at $λ>$ 4.4 $μ$m, and 3) change in behavior at 8.3-8.5 $μ$m, potentially corresponding to silicate absorption. We interpret the lightcurves observed with both NIRSpec and MIRI as likely stemming from 1) a deep pressure level driving the double-peaked variability seen in WISE 1049B at wavelengths $<$2.3 $μ$m and $>$8.5 $μ$m, 2) an intermediate pressure level sha** the lightcurve morphology between 2.3 and 4.2 $μ$m, and 3) a higher-altitude pressure level producing single-peaked and plateaued lightcurve behavior between 4.2 and 8.5 $μ$m.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Classical solutions to the soap film capillarity problem for plane boundaries
Authors:
Giulia Bevilacqua,
Salvatore Stuvard,
Bozhidar Velichkov
Abstract:
We study the soap film capillarity problem, in which soap films are modeled as sets of least perimeter among those having prescribed (small) volume and satisfying a topological spanning condition. When the given boundary is the closed tubular neighborhood in $\mathbb{R}^3$ of a smooth Jordan curve (or, more generally, the closed tubular neighborhood in $\mathbb{R}^d$ of a smooth embedding of…
▽ More
We study the soap film capillarity problem, in which soap films are modeled as sets of least perimeter among those having prescribed (small) volume and satisfying a topological spanning condition. When the given boundary is the closed tubular neighborhood in $\mathbb{R}^3$ of a smooth Jordan curve (or, more generally, the closed tubular neighborhood in $\mathbb{R}^d$ of a smooth embedding of $\mathbb{S}^{d-2}$ in a hyperplane), we prove existence and uniqueness of classical minimizers, for which the collapsing phenomenon does not occur. We show that the boundary of the unique minimizer is the union of two symmetric smooth normal graphs over a portion of the plane; the graphs have positive constant mean curvature bounded linearly in terms of the volume parameter, and meet the boundary of the tubular neighbourhood orthogonally. Moreover, we prove uniform bounds on the sectional curvatures in order to show that the boundaries of solutions corresponding to varying volumes are ordered monotonically and produce a foliation of space by constant mean curvature hypersurfaces.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Enhancing Depressive Post Detection in Bangla: A Comparative Study of TF-IDF, BERT and FastText Embeddings
Authors:
Saad Ahmed Sazan,
Mahdi H. Miraz,
A B M Muntasir Rahman
Abstract:
Due to massive adoption of social media, detection of users' depression through social media analytics bears significant importance, particularly for underrepresented languages, such as Bangla. This study introduces a well-grounded approach to identify depressive social media posts in Bangla, by employing advanced natural language processing techniques. The dataset used in this work, annotated by…
▽ More
Due to massive adoption of social media, detection of users' depression through social media analytics bears significant importance, particularly for underrepresented languages, such as Bangla. This study introduces a well-grounded approach to identify depressive social media posts in Bangla, by employing advanced natural language processing techniques. The dataset used in this work, annotated by domain experts, includes both depressive and non-depressive posts, ensuring high-quality data for model training and evaluation. To address the prevalent issue of class imbalance, we utilised random oversampling for the minority class, thereby enhancing the model's ability to accurately detect depressive posts. We explored various numerical representation techniques, including Term Frequency-Inverse Document Frequency (TF-IDF), Bidirectional Encoder Representations from Transformers (BERT) embedding and FastText embedding, by integrating them with a deep learning-based Convolutional Neural Network-Bidirectional Long Short-Term Memory (CNN-BiLSTM) model. The results obtained through extensive experimentation, indicate that the BERT approach performed better the others, achieving a F1-score of 84%. This indicates that BERT, in combination with the CNN-BiLSTM architecture, effectively recognises the nuances of Bangla texts relevant to depressive contents. Comparative analysis with the existing state-of-the-art methods demonstrates that our approach with BERT embedding performs better than others in terms of evaluation metrics and the reliability of dataset annotations. Our research significantly contribution to the development of reliable tools for detecting depressive posts in the Bangla language. By highlighting the efficacy of different embedding techniques and deep learning models, this study paves the way for improved mental health monitoring through social media platforms.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Magnetic properties and field-induced phenomena in the Jeff = 1/2 distorted kagome antiferromagnet
Authors:
A. Yadav,
A. Elghandour,
T. Arh,
D. T. Adroja,
M. D. Le,
G. B. G. Stenning,
M. Aouane,
S. Luther,
F. Hotz,
T. J. Hicken,
H. Luetkens,
A. Zorko,
R. Klingeler,
P. Khuntia
Abstract:
The intertwining between competing degrees of freedom, anisotropy, and frustration-induced quantum fluctuations offers an ideal ground to realize exotic quantum phenomena in the rare-earth-based kagome lattice. The magnetic susceptibility reveals the presence of two energy scales in agreement with the INS results. The higher energy state is dominated by CEF excitations, where the lowest Kramers gr…
▽ More
The intertwining between competing degrees of freedom, anisotropy, and frustration-induced quantum fluctuations offers an ideal ground to realize exotic quantum phenomena in the rare-earth-based kagome lattice. The magnetic susceptibility reveals the presence of two energy scales in agreement with the INS results. The higher energy state is dominated by CEF excitations, where the lowest Kramers ground-state doublet is well separated from the excited state suggesting that the compound realizes a low-energy state at low temperatures. The second energy scale is witnessed via thermodynamic results that reveal an anomaly at 0.3 K typical of a phase transition, which is attributed to the presence of complex magnetic ordering phenomena. The broad maximum in the specific heat well above 0.3 K indicates the presence of short-range spin correlations that is corroborated by muon spin relaxation rate results. The isothermal magnetization reveals a field-induced 1/3 magnetization plateau at low temperatures. muSR relaxation rate experiments, on the other hand, neither show the signature of a phase transition nor spin-freezing down to 34 mK. The ZF muSR relaxation is governed by the Orbach process and reveals the presence of a fluctuating state owing to the depopulation of crystal field levels reflected as a constant value of relaxation rate in the temperature range 0.4-10 K. NMR results indicate the presence of fluctuating Nd3+ moments down to 1.8 K consistent with muSR experiments. Our comprehensive results reveal that a field-induced quantum critical phenomenon is at play in this frustrated kagome magnet and enable us to construct a phase diagram exemplifying the proximity effect of competing magnetic states. This sets the stage to investigate the broad RE3BWO9 family of rare-earth kagome magnets promising to host exotic quantum states driven by spin-orbit coupling and geometrical frustration.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Localized States in Dipolar Bose-Einstein Condensates: To be or not to be of second order
Authors:
Alina Barbara Steinberg,
Fabian Maucher,
Svetlana Gurevich,
Uwe Thiele
Abstract:
We report the existence of localized states in dipolar Bose-Einstein condensates confined to a tubular geometry. We first perform a bifurcation analysis to track their emergence in a one-dimensional domain for numerical feasibility and find that localized states can become the ground state in suitable parameter regions. Their existence for parameters featuring a supercritical primary bifurcation s…
▽ More
We report the existence of localized states in dipolar Bose-Einstein condensates confined to a tubular geometry. We first perform a bifurcation analysis to track their emergence in a one-dimensional domain for numerical feasibility and find that localized states can become the ground state in suitable parameter regions. Their existence for parameters featuring a supercritical primary bifurcation shows that the latter is not sufficient to conclude that the phase transition is of second order, hence density modulations can jump rather than emerging gradually. Finally, we show that localized states also exist in a three-dimensional domain.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Tensor networks enable the calculation of turbulence probability distributions
Authors:
Nikita Gourianov,
Peyman Givi,
Dieter Jaksch,
Stephen B. Pope
Abstract:
Predicting the dynamics of turbulent fluid flows has long been a central goal of science and engineering. Yet, even with modern computing technology, accurate simulation of all but the simplest turbulent flow-fields remains impossible: the fields are too chaotic and multi-scaled to directly store them in memory and perform time-evolution. An alternative is to treat turbulence…
▽ More
Predicting the dynamics of turbulent fluid flows has long been a central goal of science and engineering. Yet, even with modern computing technology, accurate simulation of all but the simplest turbulent flow-fields remains impossible: the fields are too chaotic and multi-scaled to directly store them in memory and perform time-evolution. An alternative is to treat turbulence $\textit{probabilistically}$, viewing flow properties as random variables distributed according to joint probability density functions (PDFs). Turbulence PDFs are neither chaotic nor multi-scale, but are still challenging to simulate due to their high dimensionality. Here we show how to overcome the dimensionality problem by parameterising turbulence PDFs into an extremely compressed format known as a "tensor network" (TN). The TN paradigm enables simulations on single CPU cores that would otherwise be impractical even with supercomputers: for a $5+1$ dimensional PDF of a chemically reactive turbulent flow, we achieve reductions in memory and computational costs by factors of $\mathcal{O}(10^6)$ and $\mathcal{O}(10^3)$, respectively, compared to standard finite difference algorithms. A future path is opened towards something heretofore regarded as infeasible: directly simulating high-dimensional PDFs of both turbulent flows and other chaotic systems that are useful to describe probabilistically.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
TAPI: Towards Target-Specific and Adversarial Prompt Injection against Code LLMs
Authors:
Yuchen Yang,
Hongwei Yao,
Bingrun Yang,
Yiling He,
Yiming Li,
Tianwei Zhang,
Zhan Qin,
Kui Ren
Abstract:
Recently, code-oriented large language models (Code LLMs) have been widely and successfully used to simplify and facilitate code programming. With these tools, developers can easily generate desired complete functional codes based on incomplete code and natural language prompts. However, a few pioneering works revealed that these Code LLMs are also vulnerable, e.g., against backdoor and adversaria…
▽ More
Recently, code-oriented large language models (Code LLMs) have been widely and successfully used to simplify and facilitate code programming. With these tools, developers can easily generate desired complete functional codes based on incomplete code and natural language prompts. However, a few pioneering works revealed that these Code LLMs are also vulnerable, e.g., against backdoor and adversarial attacks. The former could induce LLMs to respond to triggers to insert malicious code snippets by poisoning the training data or model parameters, while the latter can craft malicious adversarial input codes to reduce the quality of generated codes. However, both attack methods have underlying limitations: backdoor attacks rely on controlling the model training process, while adversarial attacks struggle with fulfilling specific malicious purposes.
To inherit the advantages of both backdoor and adversarial attacks, this paper proposes a new attack paradigm, i.e., target-specific and adversarial prompt injection (TAPI), against Code LLMs. TAPI generates unreadable comments containing information about malicious instructions and hides them as triggers in the external source code. When users exploit Code LLMs to complete codes containing the trigger, the models will generate attacker-specified malicious code snippets at specific locations. We evaluate our TAPI attack on four representative LLMs under three representative malicious objectives and seven cases. The results show that our method is highly threatening (achieving an attack success rate of up to 89.3\%) and stealthy (saving an average of 53.1\% of tokens in the trigger design). In particular, we successfully attack some famous deployed code completion integrated applications, including CodeGeex and Github Copilot. This further confirms the realistic threat of our attack.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
The Volterra Integrable case. Novel analytical and numerical results
Authors:
M. Scalia,
O. Ragnisco,
B. Tirozzi,
F. Zullo
Abstract:
In the present paper we reconsider the integrable case of the Hamiltonian $N$-species Volterra system, as it has been introduced by Vito Volterra in 1937, and significantly enrich the results already published in the ArXiv in 2019. In fact, we present a new approach to the construction of conserved quantities and comment about the solutions of the equations of motion; we display mostly new analyti…
▽ More
In the present paper we reconsider the integrable case of the Hamiltonian $N$-species Volterra system, as it has been introduced by Vito Volterra in 1937, and significantly enrich the results already published in the ArXiv in 2019. In fact, we present a new approach to the construction of conserved quantities and comment about the solutions of the equations of motion; we display mostly new analytical and numerical results, starting from the classical predator-prey model till the general $N-$species model.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off
Authors:
Levente Halmosi,
Bálint Mohos,
Márk Jelasity
Abstract:
Machine learning models are vulnerable to tiny adversarial input perturbations optimized to cause a very large output error. To measure this vulnerability, we need reliable methods that can find such adversarial perturbations. For image classification models, evaluation methodologies have emerged that have stood the test of time. However, we argue that in the area of semantic segmentation, a good…
▽ More
Machine learning models are vulnerable to tiny adversarial input perturbations optimized to cause a very large output error. To measure this vulnerability, we need reliable methods that can find such adversarial perturbations. For image classification models, evaluation methodologies have emerged that have stood the test of time. However, we argue that in the area of semantic segmentation, a good approximation of the sensitivity to adversarial perturbations requires significantly more effort than what is currently considered satisfactory. To support this claim, we re-evaluate a number of well-known robust segmentation models in an extensive empirical study. We propose new attacks and combine them with the strongest attacks available in the literature. We also analyze the sensitivity of the models in fine detail. The results indicate that most of the state-of-the-art models have a dramatically larger sensitivity to adversarial perturbations than previously reported. We also demonstrate a size-bias: small objects are often more easily attacked, even if the large objects are robust, a phenomenon not revealed by current evaluation metrics. Our results also demonstrate that a diverse set of strong attacks is necessary, because different models are often vulnerable to different attacks.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Measurement of $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays at Belle II
Authors:
Belle II Collaboration,
I. Adachi,
L. Aggarwal,
H. Ahmed,
H. Aihara,
N. Akopov,
A. Aloisio,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
S. Bansal,
M. Barrett,
J. Baudot,
A. Baur,
A. Beaubien,
F. Becherer
, et al. (414 additional authors not shown)
Abstract:
We report measurements of time-dependent $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays based on a data sample of $(388\pm6)\times10^6$ $B\bar{B}$ events collected at the $Υ(4S)$ resonance with the Belle II detector. The Belle II experiment operates at the SuperKEKB asymmetric-energy $e^+e^-$ collider. We measure decay-time distributions to determine $CP$-violating parameters $S$ and $C$. We det…
▽ More
We report measurements of time-dependent $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays based on a data sample of $(388\pm6)\times10^6$ $B\bar{B}$ events collected at the $Υ(4S)$ resonance with the Belle II detector. The Belle II experiment operates at the SuperKEKB asymmetric-energy $e^+e^-$ collider. We measure decay-time distributions to determine $CP$-violating parameters $S$ and $C$. We determine these parameters for two ranges of $K^0_S π^0$ invariant mass: $m(K^0_S π^0)\in (0.8, 1.0)$ $GeV/c^2$, which is dominated by $B^0 \to K^{*0} (\to K^0_S π^0) γ$ decays, and a complementary region $m(K^0_S π^0)\in (0.6, 0.8)\cup(1.0, 1.8)$ $GeV/c^2$. Our results have improved precision as compared to previous measurements and are consistent with theory predictions.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.