-
An infrared census of R Coronae Borealis Stars II -- Spectroscopic classifications and implications for the rate of low-mass white dwarf mergers
Authors:
Viraj R. Karambelkar,
Mansi M. Kasliwal,
Patrick Tisserand,
Shreya Anand,
Michael C. B. Ashley,
Lars Bildsten,
Geoffrey C. Clayton,
Courtney C. Crawford,
Kishalay De,
Nicholas Earley,
Matthew J. Hankins,
Xander Hall,
Astrid Lamberts,
Ryan M. Lau,
Dan McKenna,
Anna Moore,
Eran O. Ofek,
Roger M. Smith,
Roberto Soria,
Jamie Soon,
Tony Travouillon
Abstract:
We present results from a systematic infrared (IR) census of R Coronae Borealis (RCB) stars in the Milky Way, using data from the Palomar Gattini IR (PGIR) survey. R Coronae Borealis stars are dusty, erratic variable stars presumably formed from the merger of a He-core and a CO-core white dwarf (WD). PGIR is a 30 cm $J$-band telescope with a 25 deg$^{2}$ camera that surveys 18000 deg$^{2}$ of the…
▽ More
We present results from a systematic infrared (IR) census of R Coronae Borealis (RCB) stars in the Milky Way, using data from the Palomar Gattini IR (PGIR) survey. R Coronae Borealis stars are dusty, erratic variable stars presumably formed from the merger of a He-core and a CO-core white dwarf (WD). PGIR is a 30 cm $J$-band telescope with a 25 deg$^{2}$ camera that surveys 18000 deg$^{2}$ of the northern sky ($δ>-28^{o}$) at a cadence of 2 days. Using PGIR J-band lightcurves for $\sim$60 million stars together with mid-IR colors from WISE, we selected a sample of 530 candidate RCB stars. We obtained near-IR spectra for these candidates and identified 53 RCB stars in our sample. Accounting for our selection criteria, we find that there are a total of $\approx350^{+150}_{-100}$ RCB stars in the Milky Way. Assuming typical RCB lifetimes, this corresponds to an RCB formation rate of 0.8 - 5 $\times$ 10$^{-3}$ yr$^{-1}$, consistent with observational and theoretical estimates of the He-CO WD merger rate. We searched for quasi-periodic pulsations in the PGIR lightcurves of RCB stars and present pulsation periods for 16 RCB stars. We also examined high-cadenced TESS lightcurves for RCB and the chemically similar, but dustless hydrogen-deficient carbon (dLHdC) stars. We find that dLHdC stars show variations on timescales shorter than RCB stars, suggesting that they may have lower masses than RCB stars. Finally, we identified 3 new spectroscopically confirmed and 12 candidate Galactic DY Per type stars - believed to be colder cousins of RCB stars - doubling the sample of Galactic DY Per type stars.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Latent Spaces Enable Transformer-Based Dose Prediction in Complex Radiotherapy Plans
Authors:
Edward Wang,
Ryan Au,
Pencilla Lang,
Sarah A. Mattonen
Abstract:
Evidence is accumulating in favour of using stereotactic ablative body radiotherapy (SABR) to treat multiple cancer lesions in the lung. Multi-lesion lung SABR plans are complex and require significant resources to create. In this work, we propose a novel two-stage latent transformer framework (LDFormer) for dose prediction of lung SABR plans with varying numbers of lesions. In the first stage, pa…
▽ More
Evidence is accumulating in favour of using stereotactic ablative body radiotherapy (SABR) to treat multiple cancer lesions in the lung. Multi-lesion lung SABR plans are complex and require significant resources to create. In this work, we propose a novel two-stage latent transformer framework (LDFormer) for dose prediction of lung SABR plans with varying numbers of lesions. In the first stage, patient anatomical information and the dose distribution are encoded into a latent space. In the second stage, a transformer learns to predict the dose latent from the anatomical latents. Causal attention is modified to adapt to different numbers of lesions. LDFormer outperforms a state-of-the-art generative adversarial network on dose conformality in and around lesions, and the performance gap widens when considering overlap** lesions. LDFormer generates predictions of 3-D dose distributions in under 30s on consumer hardware, and has the potential to assist physicians with clinical decision making, reduce resource costs, and accelerate treatment planning.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
CAR-MFL: Cross-Modal Augmentation by Retrieval for Multimodal Federated Learning with Missing Modalities
Authors:
Pranav Poudel,
Prashant Shrestha,
Sanskar Amgain,
Yash Raj Shrestha,
Prashnna Gyawali,
Binod Bhattarai
Abstract:
Multimodal AI has demonstrated superior performance over unimodal approaches by leveraging diverse data sources for more comprehensive analysis. However, applying this effectiveness in healthcare is challenging due to the limited availability of public datasets. Federated learning presents an exciting solution, allowing the use of extensive databases from hospitals and health centers without centr…
▽ More
Multimodal AI has demonstrated superior performance over unimodal approaches by leveraging diverse data sources for more comprehensive analysis. However, applying this effectiveness in healthcare is challenging due to the limited availability of public datasets. Federated learning presents an exciting solution, allowing the use of extensive databases from hospitals and health centers without centralizing sensitive data, thus maintaining privacy and security. Yet, research in multimodal federated learning, particularly in scenarios with missing modalities a common issue in healthcare datasets remains scarce, highlighting a critical area for future exploration. Toward this, we propose a novel method for multimodal federated learning with missing modalities. Our contribution lies in a novel cross-modal data augmentation by retrieval, leveraging the small publicly available dataset to fill the missing modalities in the clients. Our method learns the parameters in a federated manner, ensuring privacy protection and improving performance in multiple challenging multimodal benchmarks in the medical domain, surpassing several competitive baselines. Code Available: https://github.com/bhattarailab/CAR-MFL
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
From Real to Cloned Singer Identification
Authors:
Dorian Desblancs,
Gabriel Meseguer-Brocal,
Romain Hennequin,
Manuel Moussallam
Abstract:
Cloned voices of popular singers sound increasingly realistic and have gained popularity over the past few years. They however pose a threat to the industry due to personality rights concerns. As such, methods to identify the original singer in synthetic voices are needed. In this paper, we investigate how singer identification methods could be used for such a task. We present three embedding mode…
▽ More
Cloned voices of popular singers sound increasingly realistic and have gained popularity over the past few years. They however pose a threat to the industry due to personality rights concerns. As such, methods to identify the original singer in synthetic voices are needed. In this paper, we investigate how singer identification methods could be used for such a task. We present three embedding models that are trained using a singer-level contrastive learning scheme, where positive pairs consist of segments with vocals from the same singers. These segments can be mixtures for the first model, vocals for the second, and both for the third. We demonstrate that all three models are highly capable of identifying real singers. However, their performance deteriorates when classifying cloned versions of singers in our evaluation set. This is especially true for models that use mixtures as an input. These findings highlight the need to understand the biases that exist within singer identification systems, and how they can influence the identification of voice deepfakes in music.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
JADES -- The Rosetta Stone of JWST-discovered AGN: deciphering the intriguing nature of early AGN
Authors:
Ignas Juodžbalis,
Xihan Ji,
Roberto Maiolino,
Francesco D'Eugenio,
Jan Scholtz,
Guido Risaliti,
Andrew C. Fabian,
Giovanni Mazzolari,
Roberto Gilli,
Isabella Prandoni,
Santiago Arribas,
Andrew J. Bunker,
Stefano Carniani,
Stéphane Charlot,
Emma Curtis-Lake,
Anna de Graaff,
Kevin Hainline,
Eleonora Parlanti,
Michele Perna,
Pablo G. Pérez-González,
Brant Robertson,
Sandro Tacchella,
Hannah Übler,
Christina C. Williams,
Chris Willott
, et al. (1 additional authors not shown)
Abstract:
JWST has discovered a large population of Active Galactic Nuclei (AGN) at high redshift. Many of these newly discovered AGN have broad permitted lines (typically H$α$), but are extremely weak in the X-rays. Here we present the NIRSpec spectrum of the most extreme of these objects, GN-28074, an AGN at $z=2.26$ with prominent Balmer, Paschen and \HeI broad lines, and with the highest limit on the bo…
▽ More
JWST has discovered a large population of Active Galactic Nuclei (AGN) at high redshift. Many of these newly discovered AGN have broad permitted lines (typically H$α$), but are extremely weak in the X-rays. Here we present the NIRSpec spectrum of the most extreme of these objects, GN-28074, an AGN at $z=2.26$ with prominent Balmer, Paschen and \HeI broad lines, and with the highest limit on the bolometric to X-ray luminosity ratio among all spectroscopically confirmed AGN in GOODS. This source is also characterized by a mid-IR excess, most likely associated with the AGN torus' hot dust. The high bolometric luminosity and moderate redshift of this AGN allow us to explore its properties more in depth relative to other JWST-discovered AGN. The NIRSpec spectrum reveals prominent, slightly blueshifted absorption of H$α$, H$β$ and \HeI$λ$10830. The Balmer absorption lines require gas with densities of $n_{\rm H}> 10^8~{\rm cm}^{-3}$, inconsistent with an ISM origin, but fully consistent with clouds in the Broad Line Region (BLR). This finding suggests that at least part of the X-ray weakness is due to high (Compton thick) X-ray absorption by (dust-free) clouds in the BLR, or in its outer, slowly outflowing regions. GN-28074 is also extremely radio-weak. The radio weakness can also be explained in terms of absorption, as the inferred density of the clouds responsible for H$α$ absorption makes them optically thick to radio emission through free-free absorption. Alternatively, in this and other JWST-discovered AGN, the nuclear magnetic field may have not developed properly yet, resulting both in intrinsically weak radio emission and also lack of hot corona, hence intrinsic X-ray weakness. Finally, we show that recently proposed scenarios, invoking hyper-dense and ultra-metal-poor outflows or Raman scattering to explain the broad H$α$, are completely ruled out.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Strings as particle arrays
Authors:
Renann Lipinski Jusinskas
Abstract:
These notes discuss the emergence of the Polyakov action from the low-energy limit of an array of relativistic particles with harmonic interactions, which is suggestive of a ``microscopic'' description of string theory.
These notes discuss the emergence of the Polyakov action from the low-energy limit of an array of relativistic particles with harmonic interactions, which is suggestive of a ``microscopic'' description of string theory.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
A lattice framework for generalizing shellable complexes and matroids
Authors:
Rakhi Pratihar,
Tovohery H. Randrianarisoa,
Klara Stokes
Abstract:
We introduce the notion of power lattices that unifies and extends the equicardinal geometric lattices, Cartesian products of subspace lattices, and multiset subset lattices, among several others. The notions of shellability for simplicial complexes, q-complexes, and multicomplexes are then unified and extended to that of complexes in power lattices, which we name as P-complexes. A nontrivial clas…
▽ More
We introduce the notion of power lattices that unifies and extends the equicardinal geometric lattices, Cartesian products of subspace lattices, and multiset subset lattices, among several others. The notions of shellability for simplicial complexes, q-complexes, and multicomplexes are then unified and extended to that of complexes in power lattices, which we name as P-complexes. A nontrivial class of shellable P-complexes are obtained via P-complexes of the independent sets of a matroid in power lattice, which we introduce to generalize matroids in Boolean lattices, q-matroids in subspace lattices, and sum-matroids in Cartesian products of subspace lattices. We also prove that shellable P-complexes in a power lattice yield shellable order complexes, extending the celebrated result of shellability of order complexes of (equicardinal) geometric lattices by Björner and also, a recent result on shellability of order complexes of lexicographically shellable q-complexes. Finally, we provide a construction of matroids on the lattice of multiset subsets from weighted graphs. We also consider a variation of Stanley-Reisner rings associated with shellable multicomplexes than the one considered by Herzog and Popescu and proved that these rings are sequentially Cohen-Macaulay.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
GA-NIFS: the interplay between merger, star formation and chemical enrichment in MACS1149-JD1 at z=9.11 with JWST/NIRSpec
Authors:
Cosimo Marconcini,
Francesco D'Eugenio,
Roberto Maiolino,
Santiago Arribas,
Andrew J. Bunker,
Stefano Carniani,
Stephane Charlot,
Michele Perna,
Bruno Rodriguez Del Pino,
Hannah Ubler,
Chris J. Willott,
Torsten Boker,
Giovanni Cresci,
Mirko Curti,
Gareth C. Jones,
Isabella Lamperti,
Eleonora Parlanti,
Giacomo Venturi
Abstract:
We present JWST/NIRSpec integral-field spectroscopy observations of the z ~ 9.11 lensed galaxy MACS1149-JD1, as part of the GA-NIFS programme. The data was obtained with both the G395H grating (R~ 2700) and the prism (R~ 100). This target shows a main elongated UV-bright clump and a secondary component detected in continuum emission at a projected distance of 2 kpc. The R2700 data trace the ionise…
▽ More
We present JWST/NIRSpec integral-field spectroscopy observations of the z ~ 9.11 lensed galaxy MACS1149-JD1, as part of the GA-NIFS programme. The data was obtained with both the G395H grating (R~ 2700) and the prism (R~ 100). This target shows a main elongated UV-bright clump and a secondary component detected in continuum emission at a projected distance of 2 kpc. The R2700 data trace the ionised-gas morpho-kinematics in between the two components, showing an elongated emission mainly traced by [O III]5007. We spatially resolve [O II]3726,3729, [O III]4959,5007, and [O III]4363, which enable us to map the electron density (ne ~ 1.0 x 103 cm-3), temperature (Te ~ 1.6 x 104 K), and direct-method gas-phase metallicity (-1.2 to -0.7 dex solar). A spatially resolved full-spectrum modelling of the prism indicates a north-south gas metallicity and stellar age gradient between the two components. We found 3-sigma evidence of a spatially resolved anti-correlation of the gas-phase metallicity and the star formation rate density, which is likely driven by gas inflows, enhancing the star formation in JD1. We employ high-z sensitive diagnostic diagrams to rule out the presence of a strong AGN in the main component. These findings show the unambiguous presence of two distinct stellar populations, with the majority of the mass ascribed to an old star formation burst, as suggested by previous works. We disfavour the possibility of a rotating-disc nature for MACS1149-JD1; we favour a merger event that has led to a recent burst of star formation in two separate regions, as supported by high values of [O III]5007/Hbeta, ionised gas velocity dispersion, and gas-phase metallicity.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Source-Independent Fault Detection Method for Transmission Lines in IBR-Dominated Grids
Authors:
Julio Rodriguez,
Isaac Kofi Otchere,
Reza Jalilzadeh Hamidi
Abstract:
This paper proposes a source-independent method for the detection and classification of faults along Transmission Lines (TLs). It aims to reduce the protection issues arising from Inverter-Based Resources (IBRs). Inspired by Power Line Communication (PLC), the proposed method utilizes high-frequency carrier waves which are sent from either side of a TL over each phase. As faults disrupt the propag…
▽ More
This paper proposes a source-independent method for the detection and classification of faults along Transmission Lines (TLs). It aims to reduce the protection issues arising from Inverter-Based Resources (IBRs). Inspired by Power Line Communication (PLC), the proposed method utilizes high-frequency carrier waves which are sent from either side of a TL over each phase. As faults disrupt the propagation of carriers, the receiving carrier waves before and during faults exhibit differences. Based on this principle, the proposed method continuously compares the receiving carrier waves with a short history of them to detect and classify faults. The performance of the proposed method was evaluated using EMTP-RV and MATLAB, and compared to traditional phasor-based distance relays. The simulation results confirm the capability of the proposed method in detection and classification of different faults regardless of power sources types.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
BiasPruner: Debiased Continual Learning for Medical Image Classification
Authors:
Nourhan Bayasi,
Jamil Fayyad,
Alceu Bissoto,
Ghassan Hamarneh,
Rafeef Garbi
Abstract:
Continual Learning (CL) is crucial for enabling networks to dynamically adapt as they learn new tasks sequentially, accommodating new data and classes without catastrophic forgetting. Diverging from conventional perspectives on CL, our paper introduces a new perspective wherein forgetting could actually benefit the sequential learning paradigm. Specifically, we present BiasPruner, a CL framework t…
▽ More
Continual Learning (CL) is crucial for enabling networks to dynamically adapt as they learn new tasks sequentially, accommodating new data and classes without catastrophic forgetting. Diverging from conventional perspectives on CL, our paper introduces a new perspective wherein forgetting could actually benefit the sequential learning paradigm. Specifically, we present BiasPruner, a CL framework that intentionally forgets spurious correlations in the training data that could lead to shortcut learning. Utilizing a new bias score that measures the contribution of each unit in the network to learning spurious features, BiasPruner prunes those units with the highest bias scores to form a debiased subnetwork preserved for a given task. As BiasPruner learns a new task, it constructs a new debiased subnetwork, potentially incorporating units from previous subnetworks, which improves adaptation and performance on the new task. During inference, BiasPruner employs a simple task-agnostic approach to select the best debiased subnetwork for predictions. We conduct experiments on three medical datasets for skin lesion classification and chest X-Ray classification and demonstrate that BiasPruner consistently outperforms SOTA CL methods in terms of classification performance and fairness. Our code is available here.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
DFT+DMFT study of correlated electronic structure in the monolayer-trilayer phase of La$_3$Ni$_2$O$_7$
Authors:
Zhenfeng Ouyang,
Rong-Qiang He,
Zhong-Yi Lu
Abstract:
By preforming DFT+DMFT calculations, we systematically investigate the correlated electronic structure in the newly discovered monolayer-trilayer (ML-TL) phase of La$_3$Ni$_2$O$_7$ (1313-La327). Our calculated Fermi surfaces are in good agreement with the angle-resolved photoemission spectroscopy (ARPES) results. We find that 1313-La327 is a multiorbital correlated metal. An orbital-selective Mott…
▽ More
By preforming DFT+DMFT calculations, we systematically investigate the correlated electronic structure in the newly discovered monolayer-trilayer (ML-TL) phase of La$_3$Ni$_2$O$_7$ (1313-La327). Our calculated Fermi surfaces are in good agreement with the angle-resolved photoemission spectroscopy (ARPES) results. We find that 1313-La327 is a multiorbital correlated metal. An orbital-selective Mott behavior is found in ML. The ML Ni-3$d_{z^2}$ orbitals exhibit a Mott behavior, while the ML Ni-3$d_{x^2-y^2}$ orbitals are metallic due to self-do**. And the ML also shows features of heavy fermions, which indicates that there may be Kondo physics in 1313-La327. We also find a large static local spin susceptibility of ML Ni, suggesting that there is large spin fluctuation in 1313-La327. The TL Ni-$e_g$ orbitals possess similar electronic correlation to those in La$_4$Ni$_3$O$_{10}$ (La4310). The $e_g$ orbitals of the outer-layer Ni in TL (TL-outer Ni) show non-Fermi liquid behaviors. Besides, large weight of high-spin states are found in TL-outer Ni and ML Ni, implying Hundness. Under 16 GPa, a Lifshitz transition is revealed by our calculations and a La-related band crosses the Fermi level. Our work provides a theoretical reference for studying other potential mixed-stacked nickelate superconductors.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Modeling X-Ray Multi-Reflection in Super-Eddington Winds
Authors:
Zijian Zhang,
Lars Lund Thomsen,
Lixin Dai,
Christopher S. Reynolds,
Javier A. García,
Erin Kara,
Riley Connors,
Megan Masterson,
Yuhan Yao,
Thomas Dauser
Abstract:
It has been recently discovered that a few super-Eddington sources undergoing black hole super-Eddington accretion exhibit X-ray reflection signatures. In such new systems, one expects that the coronal X-ray emissions are mainly reflected by optically thick super-Eddington winds instead of thin disks. In this paper, we conduct a series of general relativistic ray-tracing and Monte Carlo radiative…
▽ More
It has been recently discovered that a few super-Eddington sources undergoing black hole super-Eddington accretion exhibit X-ray reflection signatures. In such new systems, one expects that the coronal X-ray emissions are mainly reflected by optically thick super-Eddington winds instead of thin disks. In this paper, we conduct a series of general relativistic ray-tracing and Monte Carlo radiative transfer simulations to model the X-ray reflection signatures, especially the characteristic Fe K$α$ line, produced from super-Eddington accretion flows. In particular, we allow the photons emitted by a lamppost corona to be reflected multiple times in a cone-like funnel surrounded by fast winds. We find that the Fe K$α$ line profile most sensitively depends on the wind kinematics, while its exact shape also depends on the funnel open angle and corona height. Furthermore, very interestingly, we find that the Fe K$α$ line can have a prominent double-peak profile in certain parameter spaces even with a face-on orientation. Moreover, we compare the Fe K$α$ line profiles produced from super-Eddington and thin disks and show that such lines can provide important insights into the understanding of black hole systems undergoing super-Eddington accretion.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Viscoelastic model hierarchy for fiber melt spinning of semi-crystalline polymers
Authors:
Manuel Ettmüller,
Walter Arne,
Nicole Marheineke,
Raimund Wegener
Abstract:
In the fiber melt spinning of semi-crystalline polymers, the degree of crystallization can be non-homogeneous over the cross-section of the fiber, affecting the properties of the end product. For simulation-based process design, the question arises as to which fiber quantities and hence model equations must be resolved in radial direction to capture all practically relevant effects and at the same…
▽ More
In the fiber melt spinning of semi-crystalline polymers, the degree of crystallization can be non-homogeneous over the cross-section of the fiber, affecting the properties of the end product. For simulation-based process design, the question arises as to which fiber quantities and hence model equations must be resolved in radial direction to capture all practically relevant effects and at the same time imply a model that can be computed with reasonable effort. In this paper, we present a hierarchy of viscoelastic two-phase fiber models ranging from a complex, fully resolved and highly expensive three-dimensional description to a cross-sectionally averaged, cheap-to-evaluate one-dimensional model. In particular, we propose a novel stress-averaged one-two-dimensional fiber model, which circumvents additional assumptions on the inlet profiles needed in the established stress-resolved fiber model by Doufas et al.\ (2001). Simulation results demonstrate the performance and application regime of the dimensionally reduced models. The novel stress-averaged variant provides fast and reliable results, especially in the regime of low flow-enhanced crystallization.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Centrality dependence of Lévy-stable two-pion Bose-Einstein correlations in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
H. Al-Ta'ani,
J. Alexander,
A. Angerami,
K. Aoki,
N. Apadula,
Y. Aramaki,
H. Asano,
E. C. Aschenauer,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
B. Bannier,
K. N. Barish,
B. Bassalleck,
S. Bathe
, et al. (377 additional authors not shown)
Abstract:
The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability…
▽ More
The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability $α$, and the Lévy-scale parameter $R$ as a function of transverse mass $m_T$ and centrality. The $λ(m_T)$ parameter is constant at larger values of $m_T$, but decreases as $m_T$ decreases. The Lévy scale parameter $R(m_T)$ decreases with $m_T$ and exhibits proportionality to the length scale of the nuclear overlap region. The Lévy exponent $α(m_T)$ is independent of $m_T$ within uncertainties in each investigated centrality bin, but shows a clear centrality dependence. At all centralities, the Lévy exponent $α$ is significantly different from that of Gaussian ($α=2$) or Cauchy ($α=1$) source distributions. Comparisons to the predictions of Monte-Carlo simulations of resonance-decay chains show that in all but the most peripheral centrality class (50%-60%), the obtained results are inconsistent with the measurements, unless a significant reduction of the in-medium mass of the $η'$ meson is included. In each centrality class, the best value of the in-medium $η'$ mass is compared to the mass of the $η$ meson, as well as to several theoretical predictions that consider restoration of $U_A(1)$ symmetry in hot hadronic matter.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Discovery of a Hypervelocity L Subdwarf at the Star/Brown Dwarf Mass Limit
Authors:
Adam J. Burgasser,
Roman Gerasimov,
Kyle Kremer,
Hunter Brooks,
Efrain Alvarado III,
Adam C. Schneider,
Aaron M. Meisner,
Christopher A. Theissen,
Emma Softich,
Preethi Karpoor,
Thomas P. Bickle,
Martin Kabatnik,
Austin Rothermich,
Dan Caselden,
J. Davy Kirkpatrick,
Jacqueline K. Faherty,
Sarah L. Casewell,
Marc J. Kuchner,
the Backyard Worlds,
:,
Planet 9 Collaboration
Abstract:
We report the discovery of a high velocity, very low-mass star or brown dwarf whose kinematics suggest it is unbound to the Milky Way. CWISE J124909.08+362116.0 was identified by citizen scientists in the Backyard Worlds: Planet 9 program as a high proper motion ($μ$ $=$ 0''9/yr) faint red source. Moderate resolution spectroscopy with Keck/NIRES reveals it to be a metal-poor early L subdwarf with…
▽ More
We report the discovery of a high velocity, very low-mass star or brown dwarf whose kinematics suggest it is unbound to the Milky Way. CWISE J124909.08+362116.0 was identified by citizen scientists in the Backyard Worlds: Planet 9 program as a high proper motion ($μ$ $=$ 0''9/yr) faint red source. Moderate resolution spectroscopy with Keck/NIRES reveals it to be a metal-poor early L subdwarf with a large radial velocity ($-$103$\pm$10 km/s), and its estimated distance of 125$\pm$8 pc yields a speed of 456$\pm$27 km/s in the Galactic rest frame, near the local escape velocity for the Milky Way. We explore several potential scenarios for the origin of this source, including ejection from the Galactic center $\gtrsim$3 Gyr in the past, survival as the mass donor companion to an exploded white dwarf. acceleration through a three-body interaction with a black hole binary in a globular cluster, and accretion from a Milky Way satellite system. CWISE J1249+3621 is the first hypervelocity very low mass star or brown dwarf to be found, and the nearest of all such systems. It may represent a broader population of very high velocity, low-mass objects that have undergone extreme accelerations.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Noncrossing partition posets
Authors:
Richard Ehrenborg,
Gábor Hetyei
Abstract:
We introduce the poset NC^d_n of all noncrossing partitions such that each block has cardinality 1 modulo d and each block of the dual partition also has cardinality 1 modulo d. We obtain the cardinality, the Möbius function, the rank numbers, the antipode, and the number of maximal chains. Generalizing work of Stanley, we give an edge labeling such that the labels of the maximal chains are exactl…
▽ More
We introduce the poset NC^d_n of all noncrossing partitions such that each block has cardinality 1 modulo d and each block of the dual partition also has cardinality 1 modulo d. We obtain the cardinality, the Möbius function, the rank numbers, the antipode, and the number of maximal chains. Generalizing work of Stanley, we give an edge labeling such that the labels of the maximal chains are exactly the d-parking functions. We also introduce two classes of labeled trees: the first class is in bijective correspondence with the noncrossing partitions in NC^d_n and the second class is in bijective correspondence with the maximal chains.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Non-maximal entanglement of photons from positron-electron annihilation demonstrated using a novel plastic PET scanner
Authors:
P. Moskal,
D. Kumar,
S. Sharma,
E. Y. Beyene,
N. Chug,
A. Coussat,
C. Curceanu,
E. Czerwinski,
M. Das,
K. Dulski,
M. Gorgol,
B. Jasinska,
K. Kacprzak,
T. Kaplanoglu,
L. Kaplon,
K. Klimaszewski,
T. Kozik,
E. Lisowski,
F. Lisowski,
W. Mryka,
S. Niedzwiecki,
S. Parzych,
E. P. del Rio,
L. Raczynski,
M. Radler
, et al. (7 additional authors not shown)
Abstract:
In the state-of-the-art Positron Emission Tomography (PET), information about the polarization of annihilation photons is not available. Current PET systems track molecules labeled with positron-emitting radioisotopes by detecting the propagation direction of two photons from positron-electron annihilation. However, annihilation photons carry more information than just the site where they originat…
▽ More
In the state-of-the-art Positron Emission Tomography (PET), information about the polarization of annihilation photons is not available. Current PET systems track molecules labeled with positron-emitting radioisotopes by detecting the propagation direction of two photons from positron-electron annihilation. However, annihilation photons carry more information than just the site where they originated. Here we present a novel J-PET scanner built from plastic scintillators, in which annihilation photons interact predominantly via the Compton effect, providing information about photon polarization in addition to information on photon direction of propagation. Theoretically, photons from the decay of positronium in a vacuum are maximally entangled in polarization. However, in matter, when the positron from positronium annihilates with the electron bound to the atom, the question arises whether the photons from such annihilation are maximally entangled. In this work, we determine the distribution of the relative angle between polarization orientations of two photons from positron-electron annihilation in a porous polymer. Contrary to prior results for positron annihilation in aluminum and copper, where the strength of observed correlations is as expected for maximally entangled photons, our results show a significant deviation. We demonstrate that in porous polymer, photon polarization correlation is weaker than for maximally entangled photons but stronger than for separable photons. The data indicate that more than 40% of annihilations in Amberlite resin lead to a non-maximally entangled state. Our result indicates the degree of correlation depends on the annihilation mechanism and the molecular arrangement. We anticipate that the introduced Compton interaction-based PET system opens a promising perspective for exploring polarization correlations in PET as a novel diagnostic indicator.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Effective descent morphisms of ordered families
Authors:
Maria Manuel Clementino,
Rui Prezado
Abstract:
We present a characterization of effective descent morphisms in the lax comma category $\mathsf{Ord}//X$ when $X$ is a locally complete ordered set with a bottom element.
We present a characterization of effective descent morphisms in the lax comma category $\mathsf{Ord}//X$ when $X$ is a locally complete ordered set with a bottom element.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Boosting Adversarial Transferability for Skeleton-based Action Recognition via Exploring the Model Posterior Space
Authors:
Yunfeng Diao,
Baiqi Wu,
Ruixuan Zhang,
Xun Yang,
Meng Wang,
He Wang
Abstract:
Skeletal motion plays a pivotal role in human activity recognition (HAR). Recently, attack methods have been proposed to identify the universal vulnerability of skeleton-based HAR(S-HAR). However, the research of adversarial transferability on S-HAR is largely missing. More importantly, existing attacks all struggle in transfer across unknown S-HAR models. We observed that the key reason is that t…
▽ More
Skeletal motion plays a pivotal role in human activity recognition (HAR). Recently, attack methods have been proposed to identify the universal vulnerability of skeleton-based HAR(S-HAR). However, the research of adversarial transferability on S-HAR is largely missing. More importantly, existing attacks all struggle in transfer across unknown S-HAR models. We observed that the key reason is that the loss landscape of the action recognizers is rugged and sharp. Given the established correlation in prior studies~\cite{qin2022boosting,wu2020towards} between loss landscape and adversarial transferability, we assume and empirically validate that smoothing the loss landscape could potentially improve adversarial transferability on S-HAR. This is achieved by proposing a new post-train Dual Bayesian strategy, which can effectively explore the model posterior space for a collection of surrogates without the need for re-training. Furthermore, to craft adversarial examples along the motion manifold, we incorporate the attack gradient with information of the motion dynamics in a Bayesian manner. Evaluated on benchmark datasets, e.g. HDM05 and NTU 60, the average transfer success rate can reach as high as 35.9\% and 45.5\% respectively. In comparison, current state-of-the-art skeletal attacks achieve only 3.6\% and 9.8\%. The high adversarial transferability remains consistent across various surrogate, victim, and even defense models. Through a comprehensive analysis of the results, we provide insights on what surrogates are more likely to exhibit transferability, to shed light on future research.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene
Authors:
Ruiyang Zhang,
Hu Zhang,
Hang Yu,
Zhedong Zheng
Abstract:
The unsupervised 3D object detection is to accurately detect objects in unstructured environments with no explicit supervisory signals. This task, given sparse LiDAR point clouds, often results in compromised performance for detecting distant or small objects due to the inherent sparsity and limited spatial resolution. In this paper, we are among the early attempts to integrate LiDAR data with 2D…
▽ More
The unsupervised 3D object detection is to accurately detect objects in unstructured environments with no explicit supervisory signals. This task, given sparse LiDAR point clouds, often results in compromised performance for detecting distant or small objects due to the inherent sparsity and limited spatial resolution. In this paper, we are among the early attempts to integrate LiDAR data with 2D images for unsupervised 3D detection and introduce a new method, dubbed LiDAR-2D Self-paced Learning (LiSe). We argue that RGB images serve as a valuable complement to LiDAR data, offering precise 2D localization cues, particularly when scarce LiDAR points are available for certain objects. Considering the unique characteristics of both modalities, our framework devises a self-paced learning pipeline that incorporates adaptive sampling and weak model aggregation strategies. The adaptive sampling strategy dynamically tunes the distribution of pseudo labels during training, countering the tendency of models to overfit easily detected samples, such as nearby and large-sized objects. By doing so, it ensures a balanced learning trajectory across varying object scales and distances. The weak model aggregation component consolidates the strengths of models trained under different pseudo label distributions, culminating in a robust and powerful final model. Experimental evaluations validate the efficacy of our proposed LiSe method, manifesting significant improvements of +7.1% AP$_{BEV}$ and +3.4% AP$_{3D}$ on nuScenes, and +8.3% AP$_{BEV}$ and +7.4% AP$_{3D}$ on Lyft compared to existing techniques.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Asgard/NOTT: First lab assembly and experimental results
Authors:
G. Garreau,
A. Bigioli,
R. Laugier,
B. La Torre,
M-A. Martinod,
K. Missiaen,
J. Morren,
G. Raskin,
M. Salman,
S. Gross,
M. Ireland,
A. P. Joó,
L. Labadie,
S. Madden,
A. Mazzoli,
G. Medgyesi,
A. Sanny,
A. Taras,
B. Vandenbussche,
D. Defrère
Abstract:
Asgard/NOTT is an ERC-funded project hosted at KU Leuven and is part of a new visitor instrumental suite, called Asgard, under preparation for the Very Large Telescope Interferometer (VLTI). Leveraging nulling capabilities and the long VLTI baselines, it is optimized for high-contrast imaging of the snow line region around young nearby main-sequence stars. This will enable the characterization of…
▽ More
Asgard/NOTT is an ERC-funded project hosted at KU Leuven and is part of a new visitor instrumental suite, called Asgard, under preparation for the Very Large Telescope Interferometer (VLTI). Leveraging nulling capabilities and the long VLTI baselines, it is optimized for high-contrast imaging of the snow line region around young nearby main-sequence stars. This will enable the characterization of the atmosphere of young giant exoplanets and warm/hot exozodiacal dust with spectroscopy in the L'-band (3.5-4.0$μ$m). In this work, we present the first lab assembly of the instrument done at KU Leuven and the technical solutions to tackle the challenge of performing nulling in the mid-infrared despite the thermal background. The opto-mechanical design of the warm optics and the injection system for the photonic chip are described. The alignment procedure used to assemble the system is also presented. Finally, the first experimental results, including fringes and null measurements, are given and confirm the adequacy of the bench to test and optimize the Asgard/NOTT instrument.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
BriDe Arbitrager: Enhancing Arbitrage in Ethereum 2.0 via Bribery-enabled Delayed Block Production
Authors:
Hulin Yang,
Mingzhe Li,
** Zhang,
Alia Asheralieva,
Qingsong Wei,
Siow Mong Rick Goh
Abstract:
The advent of Ethereum 2.0 has introduced significant changes, particularly the shift to Proof-of-Stake consensus. This change presents new opportunities and challenges for arbitrage. Amidst these changes, we introduce BriDe Arbitrager, a novel tool designed for Ethereum 2.0 that leverages Bribery-driven attacks to Delay block production and increase arbitrage gains. The main idea is to allow mali…
▽ More
The advent of Ethereum 2.0 has introduced significant changes, particularly the shift to Proof-of-Stake consensus. This change presents new opportunities and challenges for arbitrage. Amidst these changes, we introduce BriDe Arbitrager, a novel tool designed for Ethereum 2.0 that leverages Bribery-driven attacks to Delay block production and increase arbitrage gains. The main idea is to allow malicious proposers to delay block production by bribing validators/proposers, thereby gaining more time to identify arbitrage opportunities. Through analysing the bribery process, we design an adaptive bribery strategy. Additionally, we propose a Delayed Transaction Ordering Algorithm to leverage the delayed time to amplify arbitrage profits for malicious proposers. To ensure fairness and automate the bribery process, we design and implement a bribery smart contract and a bribery client. As a result, BriDe Arbitrager enables adversaries controlling a limited (< 1/4) fraction of the voting powers to delay block production via bribery and arbitrage more profit. Extensive experimental results based on Ethereum historical transactions demonstrate that BriDe Arbitrager yields an average of 8.66 ETH (16,442.23 USD) daily profits. Furthermore, our approach does not trigger any slashing mechanisms and remains effective even under Proposer Builder Separation and other potential mechanisms will be adopted by Ethereum.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Enhancing Privacy of Spatiotemporal Federated Learning against Gradient Inversion Attacks
Authors:
Lele Zheng,
Yang Cao,
Renhe Jiang,
Kenjiro Taura,
Yulong Shen,
Sheng Li,
Masatoshi Yoshikawa
Abstract:
Spatiotemporal federated learning has recently raised intensive studies due to its ability to train valuable models with only shared gradients in various location-based services. On the other hand, recent studies have shown that shared gradients may be subject to gradient inversion attacks (GIA) on images or texts. However, so far there has not been any systematic study of the gradient inversion a…
▽ More
Spatiotemporal federated learning has recently raised intensive studies due to its ability to train valuable models with only shared gradients in various location-based services. On the other hand, recent studies have shown that shared gradients may be subject to gradient inversion attacks (GIA) on images or texts. However, so far there has not been any systematic study of the gradient inversion attacks in spatiotemporal federated learning. In this paper, we explore the gradient attack problem in spatiotemporal federated learning from attack and defense perspectives. To understand privacy risks in spatiotemporal federated learning, we first propose Spatiotemporal Gradient Inversion Attack (ST-GIA), a gradient attack algorithm tailored to spatiotemporal data that successfully reconstructs the original location from gradients. Furthermore, we design an adaptive defense strategy to mitigate gradient inversion attacks in spatiotemporal federated learning. By dynamically adjusting the perturbation levels, we can offer tailored protection for varying rounds of training data, thereby achieving a better trade-off between privacy and utility than current state-of-the-art methods. Through intensive experimental analysis on three real-world datasets, we reveal that the proposed defense strategy can well preserve the utility of spatiotemporal federated learning with effective security protection.
△ Less
Submitted 11 July, 2024; v1 submitted 11 July, 2024;
originally announced July 2024.
-
Enhancing octree-based context models for point cloud geometry compression with attention-based child node number prediction
Authors:
Chang Sun,
Hui Yuan,
Xiaolong Mao,
Xin Lu,
Raouf Hamzaoui
Abstract:
In point cloud geometry compression, most octreebased context models use the cross-entropy between the onehot encoding of node occupancy and the probability distribution predicted by the context model as the loss. This approach converts the problem of predicting the number (a regression problem) and the position (a classification problem) of occupied child nodes into a 255-dimensional classificati…
▽ More
In point cloud geometry compression, most octreebased context models use the cross-entropy between the onehot encoding of node occupancy and the probability distribution predicted by the context model as the loss. This approach converts the problem of predicting the number (a regression problem) and the position (a classification problem) of occupied child nodes into a 255-dimensional classification problem. As a result, it fails to accurately measure the difference between the one-hot encoding and the predicted probability distribution. We first analyze why the cross-entropy loss function fails to accurately measure the difference between the one-hot encoding and the predicted probability distribution. Then, we propose an attention-based child node number prediction (ACNP) module to enhance the context models. The proposed module can predict the number of occupied child nodes and map it into an 8- dimensional vector to assist the context model in predicting the probability distribution of the occupancy of the current node for efficient entropy coding. Experimental results demonstrate that the proposed module enhances the coding efficiency of octree-based context models.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Gaia DR3 asteroid reflectance spectra: L-type families, memberships, and ages
Authors:
Roberto Balossi,
Paolo Tanga,
Alexey Sergeyev,
Alberto Cellino,
Federica Spoto
Abstract:
The Gaia Data Release 3 (DR3) contains reflectance spectra at visible wavelengths for 60,518 asteroids over the range between 374-1034 nm, representing a large sample that is well suited to studies of asteroid families. We want to assess the potential of Gaia spectra in identifying asteroid family members. Here, we focus on two L-type families, namely Tirela/Klumpkea and Watsonia. These families a…
▽ More
The Gaia Data Release 3 (DR3) contains reflectance spectra at visible wavelengths for 60,518 asteroids over the range between 374-1034 nm, representing a large sample that is well suited to studies of asteroid families. We want to assess the potential of Gaia spectra in identifying asteroid family members. Here, we focus on two L-type families, namely Tirela/Klumpkea and Watsonia. These families are known for their connection to Barbarian asteroids, which are potentially abundant in calcium-aluminum rich inclusions (CAIs). Our method is based (1) on a color taxonomy specifically built on Gaia data and (2) the similarity of spectra of candidate members with the template spectrum of a specific family. We identified objects in the halo of Tirela/Klumpkea, along with possible interlopers. We also found an independent group of eight asteroids erroneously linked to the family by the hierarchical clustering method (HCM). Consequently, the knowledge of the size distribution of the family has been significantly improved, with a more consistent shape at the larger end. The Watsonia family is a more intricate case, mainly due to its smaller size and the less marked difference between the spectral types of the background and of the family members. However, the spectral selection helps identify objects that were not seen by HCM, including a cluster separated from the family core by a resonance. For both families, the V-shape is better defined, leading to a revised age estimation based on the memberships established mainly from spectral properties. Our work demonstrates the advantage of combining the classical HCM approach to spectral properties obtained by Gaia for the study of asteroid families. Future data releases are expected to further expand the capabilities in this domain
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Enhancing context models for point cloud geometry compression with context feature residuals and multi-loss
Authors:
Chang Sun,
Hui Yuan,
Shuai Li,
Xin Lu,
Raouf Hamzaoui
Abstract:
In point cloud geometry compression, context models usually use the one-hot encoding of node occupancy as the label, and the cross-entropy between the one-hot encoding and the probability distribution predicted by the context model as the loss function. However, this approach has two main weaknesses. First, the differences between contexts of different nodes are not significant, making it difficul…
▽ More
In point cloud geometry compression, context models usually use the one-hot encoding of node occupancy as the label, and the cross-entropy between the one-hot encoding and the probability distribution predicted by the context model as the loss function. However, this approach has two main weaknesses. First, the differences between contexts of different nodes are not significant, making it difficult for the context model to accurately predict the probability distribution of node occupancy. Second, as the one-hot encoding is not the actual probability distribution of node occupancy, the cross-entropy loss function is inaccurate. To address these problems, we propose a general structure that can enhance existing context models. We introduce the context feature residuals into the context model to amplify the differences between contexts. We also add a multi-layer perception branch, that uses the mean squared error between its output and node occupancy as a loss function to provide accurate gradients in backpropagation. We validate our method by showing that it can improve the performance of an octree-based model (OctAttention) and a voxel-based model (VoxelDNN) on the object point cloud datasets MPEG 8i and MVUB, as well as the LiDAR point cloud dataset SemanticKITTI.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Precise Bolometric Luminosities and Effective Temperatures of 23 late-T and Y dwarfs Obtained with JWST
Authors:
Samuel A. Beiler,
Michael C. Cushing,
J. Davy Kirkpatrick,
Adam C. Schneider,
Sagnick Mukherjee,
Mark S. Marley,
Federico Marocco,
Richard L. Smart
Abstract:
We present infrared spectral energy distributions of 23 late-type T and Y dwarfs obtained with the James Webb Space Telescope. The spectral energy distributions consist of NIRSpec PRISM and MIRI LRS spectra covering the $\sim$1--12 $μ$m wavelength range at $λ/ Δλ\approx 100$ and broadband photometry at 15, 18, and 21 $μ$m. The spectra exhibit absorption features common to these objects including H…
▽ More
We present infrared spectral energy distributions of 23 late-type T and Y dwarfs obtained with the James Webb Space Telescope. The spectral energy distributions consist of NIRSpec PRISM and MIRI LRS spectra covering the $\sim$1--12 $μ$m wavelength range at $λ/ Δλ\approx 100$ and broadband photometry at 15, 18, and 21 $μ$m. The spectra exhibit absorption features common to these objects including H$_2$O, CH$_4$, CO, CO$_2$, and NH$_3$. Interestingly, while the spectral morphology changes relatively smoothly with spectral type at $λ< 3$ $μ$m and $λ> 8$ $μ$m, it shows no clear trend in the 5 $μ$m region where a large fraction of the flux emerges. The broad wavelength coverage of the data enables us to compute the first accurate measurements of the bolometric fluxes of cool brown dwarfs. Combining these bolometric fluxes with parallaxes from Spitzer and HST, we also obtain the first accurate bolometric luminosities of these cool dwarfs. We then used the Sonora Bobcat solar metallicity evolutionary models to estimate the radii of the dwarfs which results in effective temperature estimates ranging from $\sim$1000 to 350 K with a median uncertainty of $\pm$20 K which is nearly an order of magnitude improvement over previous work. We also discuss how various portions of the spectra either do or do not exhibit a clear sequence when ordered by their effective temperatures.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
PEP: a tackle value measuring the prevention of expected points
Authors:
Robert Bajons,
Jan-Ole Koslik,
Rouven Michels,
Marius Ötting
Abstract:
Traditional assessments of tackling in American Football often only consider the number of tackles made, without adequately accounting for their context and importance for the game. Aiming for improvement, we develop a metric that quantifies the value of a tackle in terms of the prevented expected points (PEP). Specifically, we compare the real end-of-play yard line of tackles with the predicted y…
▽ More
Traditional assessments of tackling in American Football often only consider the number of tackles made, without adequately accounting for their context and importance for the game. Aiming for improvement, we develop a metric that quantifies the value of a tackle in terms of the prevented expected points (PEP). Specifically, we compare the real end-of-play yard line of tackles with the predicted yard line given the hypothetical situation that the tackle had been missed. For this, we use high-resolution tracking data, that capture the position and velocity of players, and a random forest to account for uncertainty and multi-modality in yard-line prediction. Moreover, we acknowledge the difference in the importance of tackles by assigning an expected points value to each individual tree prediction of the random forest. Finally, to relate the value of tackles to a player's ability to tackle, we fit a suitable mixed-effect model to the PEP values. Our approach contributes to a deeper understanding of defensive performances in American football and offers valuable insights for coaches and analysts.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Imitation Learning for Robotic Assisted Ultrasound Examination of Deep Venous Thrombosis using Kernelized Movement Primitives
Authors:
Diego Dall'Alba,
Lorenzo Busellato,
Thiusius Rajeeth Savarimuthu,
Zhuoqi Cheng,
Iñigo Iturrate
Abstract:
Deep Vein Thrombosis (DVT) is a common yet potentially fatal condition, often leading to critical complications like pulmonary embolism. DVT is commonly diagnosed using Ultrasound (US) imaging, which can be inconsistent due to its high dependence on the operator's skill. Robotic US Systems (RUSs) aim to improve diagnostic test consistency but face challenges with the complex scanning pattern neede…
▽ More
Deep Vein Thrombosis (DVT) is a common yet potentially fatal condition, often leading to critical complications like pulmonary embolism. DVT is commonly diagnosed using Ultrasound (US) imaging, which can be inconsistent due to its high dependence on the operator's skill. Robotic US Systems (RUSs) aim to improve diagnostic test consistency but face challenges with the complex scanning pattern needed for DVT assessment, where precise control over US probe pressure is crucial for indirectly detecting occlusions. This work introduces an imitation learning method, based on Kernelized Movement Primitives (KMP), to standardize DVT US exams by training an autonomous robotic controller using sonographer demonstrations. A new recording device design enhances demonstration ergonomics, integrating with US probes and enabling seamless force and position data recording. KMPs are used to capture scanning skills, linking scan trajectory and force, enabling generalization beyond the demonstrations. Our approach, evaluated on synthetic models and volunteers, shows that the KMP-based RUS can replicate an expert's force control and image quality in DVT US examination. It outperforms previous methods using manually defined force profiles, improving exam standardization and reducing reliance on specialized sonographers.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
JWST/NIRSpec insights into the circumnuclear region of Arp 220: A detailed kinematic study
Authors:
L. Ulivi,
M. Perna,
I. Lamperti,
S. Arribas,
G. Cresci,
B. Rodríguez Del Pino,
T. Boeker,
A. J. Bunker,
M. Ceci,
S. Charlot,
F. D Eugenio,
K. Fahrion,
R. Maiolino,
A. Marconi,
M. Pereira-Santaella
Abstract:
The study of starburst and AGN feedback is crucial for understanding the regulation of star formation and the evolution of galaxies across cosmic time. Arp 220, the closest ultraluminous infrared galaxy (ULIRG), is in an advanced phase of a major merger with two distinct nuclei, and shows evidence of multi-phase (molecular, ionised, neutral) and multi-scale (from < 0.1 to > 5 kpc) outflows. Theref…
▽ More
The study of starburst and AGN feedback is crucial for understanding the regulation of star formation and the evolution of galaxies across cosmic time. Arp 220, the closest ultraluminous infrared galaxy (ULIRG), is in an advanced phase of a major merger with two distinct nuclei, and shows evidence of multi-phase (molecular, ionised, neutral) and multi-scale (from < 0.1 to > 5 kpc) outflows. Therefore, it represents an ideal system for investigating outflow mechanisms and feedback phenomena in detail. Using new JWST NIRSpec IFU observations, we investigate the spatially resolved gaseous (in both ionized and hot molecular phases) and stellar kinematics in the innermost 1 kpc. We decouple the different gas kinematic components through multi-Gaussian fitting, identifying distinct multi-phase outflows associated with the two nuclei, with velocities up to $\sim$ 1000 km/s. We compute the mass ($\sim 10^7$ M$_\odot$), mass outflow rate ($\sim 20$ M$_\odot$/yr) and energetics ($\dot E_{out}\sim 10^{42}$ erg/s) for each outflow, finding that the ionized and hot molecular outflowing gas contribute around 2-30% to the total mass and the energy of the outflows, as inferred from the combination of multi-wavelength information. We discuss the possible origin of the outflows, finding no compelling evidence to prefer a starburst or AGN driven scenario. Regardless of their nature, outflows in Arp 220 propagate in multiple directions from parsec to kiloparsec scales, potentially impacting a significant portion of the host galaxy. This contrasts with isolated systems where outflows typically follow a more collimated path, and do not affect the interstellar medium throughout the entire galaxy. This study highlights the importance of investigating merging systems with multi-wavelength facilities, including JWST/NIRSpec IFS, to obtain a comprehensive understanding of feedback mechanisms in galaxy evolution.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Lynx: An Open Source Hallucination Evaluation Model
Authors:
Selvan Sunitha Ravi,
Bartosz Mielczarek,
Anand Kannappan,
Douwe Kiela,
Rebecca Qian
Abstract:
Retrieval Augmented Generation (RAG) techniques aim to mitigate hallucinations in Large Language Models (LLMs). However, LLMs can still produce information that is unsupported or contradictory to the retrieved contexts. We introduce LYNX, a SOTA hallucination detection LLM that is capable of advanced reasoning on challenging real-world hallucination scenarios. To evaluate LYNX, we present HaluBenc…
▽ More
Retrieval Augmented Generation (RAG) techniques aim to mitigate hallucinations in Large Language Models (LLMs). However, LLMs can still produce information that is unsupported or contradictory to the retrieved contexts. We introduce LYNX, a SOTA hallucination detection LLM that is capable of advanced reasoning on challenging real-world hallucination scenarios. To evaluate LYNX, we present HaluBench, a comprehensive hallucination evaluation benchmark, consisting of 15k samples sourced from various real-world domains. Our experiment results show that LYNX outperforms GPT-4o, Claude-3-Sonnet, and closed and open-source LLM-as-a-judge models on HaluBench. We release LYNX, HaluBench and our evaluation code for public access.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Magic silicon dioxide for widely tunable integrated photonics
Authors:
Bruno Lopez-Rodriguez,
Naresh Sharma,
Zizheng Li,
Roald van der Kolk,
Jasper van der Boom,
Thomas Scholte,
** Chang,
Simon Groblacher,
Iman Esmaeil Zadeh
Abstract:
Integrated photonic circuits have transformed data communication, biosensing, and light detection and ranging, and hold wide-ranging potential for optical computing, optical imaging and signal processing. These applications often require tunable and reconfigurable photonic components, most commonly accomplished through the thermo-optic effect. However, the resulting tuning window is limited for st…
▽ More
Integrated photonic circuits have transformed data communication, biosensing, and light detection and ranging, and hold wide-ranging potential for optical computing, optical imaging and signal processing. These applications often require tunable and reconfigurable photonic components, most commonly accomplished through the thermo-optic effect. However, the resulting tuning window is limited for standard optical materials such as silicon dioxide and silicon nitride. Most importantly, bidirectional thermal tuning on a single platform has not been realized. For the first time, we show that by tuning and optimizing the deposition conditions in inductively-coupled plasma chemical vapor deposition (ICPCVD) of silicon dioxide, this material can be used to deterministically tune the thermo-optic properties of optical devices without introducing significant losses. We demonstrate that we can deterministically integrate positive and negative wavelength shifts on a single chip, validated on amorphous silicon carbide (a-SiC), silicon nitride (SiN) and silicon-on-insulator (SOI) platforms. We observe up to a 10-fold improvement of the thermo-optic tunability and, in addition, demonstrate athermal ring resonators with shifts as low as 1.5 pm/°C. This enables the fabrication of a novel tunable coupled ring optical waveguide (CROW) requiring only a single heater. In addition, the low-temperature deposition of our silicon dioxide cladding can be combined with lift-off to isolate the optical devices resulting in a decrease in thermal crosstalk by at least two orders of magnitude. Our method paves the way for novel photonic architectures incorporating bidirectional thermo-optic tunability.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Investigating Public Fine-Tuning Datasets: A Complex Review of Current Practices from a Construction Perspective
Authors:
Runyuan Ma,
Wei Li,
Fukai Shang
Abstract:
With the rapid development of the large model domain, research related to fine-tuning has concurrently seen significant advancement, given that fine-tuning is a constituent part of the training process for large-scale models. Data engineering plays a fundamental role in the training process of models, which includes data infrastructure, data processing, etc. Data during fine-tuning likewise forms…
▽ More
With the rapid development of the large model domain, research related to fine-tuning has concurrently seen significant advancement, given that fine-tuning is a constituent part of the training process for large-scale models. Data engineering plays a fundamental role in the training process of models, which includes data infrastructure, data processing, etc. Data during fine-tuning likewise forms the base for large models. In order to embrace the power and explore new possibilities of fine-tuning datasets, this paper reviews current public fine-tuning datasets from the perspective of data construction. An overview of public fine-tuning datasets from two sides: evolution and taxonomy, is provided in this review, aiming to chart the development trajectory. Construction techniques and methods for public fine-tuning datasets of Large Language Models (LLMs), including data generation and data augmentation among others, are detailed. This elaboration follows the aforementioned taxonomy, specifically across demonstration, comparison, and generalist categories. Additionally, a category tree of data generation techniques has been abstracted in our review to assist researchers in gaining a deeper understanding of fine-tuning datasets from the construction dimension. Our review also summarizes the construction features in different data preparation phases of current practices in this field, aiming to provide a comprehensive overview and inform future research. Fine-tuning dataset practices, encompassing various data modalities, are also discussed from a construction perspective in our review. Towards the end of the article, we offer insights and considerations regarding the future construction and developments of fine-tuning datasets.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Exotic edge states of C3 high-fold fermions in honeycomb lattices
Authors:
L. Madail,
R. G. Dias,
J. Fernández-Rossier
Abstract:
A generalization of the graphene honeycomb model to the case where each site in the honeycomb lattice contains a $n-$fold degenerate set of eigenstates of the $C_3$ symmetry has been recently proposed to describe several systems, including triangulene crystals and photonic lattices. These generalized honeycomb models are defined by $(n_a,n_b)$, the number $C_3$ eigenstates in the $a$ and $b$ sites…
▽ More
A generalization of the graphene honeycomb model to the case where each site in the honeycomb lattice contains a $n-$fold degenerate set of eigenstates of the $C_3$ symmetry has been recently proposed to describe several systems, including triangulene crystals and photonic lattices. These generalized honeycomb models are defined by $(n_a,n_b)$, the number $C_3$ eigenstates in the $a$ and $b$ sites of the unit cell, resulting in $n_a+n_b$ bands. Thus, the $(1,1)$ case gives the coventional honeycomb model that describes the two low-energy bands in graphene. Generalizations, such as $(2,1)$, $(2,2)$ and $(3,3)$ display several non-trivial features, such as coexisting graphene-like Dirac cones with flat-bands, both at zero and finite-energy, as well as robust degeneracy points where a flat-band and a parabolic band meet at the $Γ$-point. Here, we explore the edge states of this class of crystals, using as reference triangulene crystals, and we find several types of edge states absent in the conventional $(1,1)$ honeycomb case, associated to the non-trivial features of the two-dimensional (2D) bands of the high-fold case. First, we find dispersive edge states associated to the finite-energy flat-bands, that occur both at the armchair and zigzag termination. Second, in the case of non-centrosymmetric triangulene crystals that lead to a $S=1$ Dirac band, we have a bonding-antibonding pair of dispersive edge states, localized in the same edge so that their energy splitting is reduced as their localization increases, opposite to the conventional behavior of pairs of states localized in opposite edges. Third, for the $(3,3)$ case, that hosts a gap separating a pair of flat conduction and valence bands, we find non-dispersive edge states with $E=0$ in all edge terminations.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Global Spatial-Temporal Information-based Residual ConvLSTM for Video Space-Time Super-Resolution
Authors:
Congrui Fu,
Hui Yuan,
Shiqi Jiang,
Guanghui Zhang,
Liquan Shen,
Raouf Hamzaoui
Abstract:
By converting low-frame-rate, low-resolution videos into high-frame-rate, high-resolution ones, space-time video super-resolution techniques can enhance visual experiences and facilitate more efficient information dissemination. We propose a convolutional neural network (CNN) for space-time video super-resolution, namely GIRNet. To generate highly accurate features and thus improve performance, th…
▽ More
By converting low-frame-rate, low-resolution videos into high-frame-rate, high-resolution ones, space-time video super-resolution techniques can enhance visual experiences and facilitate more efficient information dissemination. We propose a convolutional neural network (CNN) for space-time video super-resolution, namely GIRNet. To generate highly accurate features and thus improve performance, the proposed network integrates a feature-level temporal interpolation module with deformable convolutions and a global spatial-temporal information-based residual convolutional long short-term memory (convLSTM) module. In the feature-level temporal interpolation module, we leverage deformable convolution, which adapts to deformations and scale variations of objects across different scene locations. This presents a more efficient solution than conventional convolution for extracting features from moving objects. Our network effectively uses forward and backward feature information to determine inter-frame offsets, leading to the direct generation of interpolated frame features. In the global spatial-temporal information-based residual convLSTM module, the first convLSTM is used to derive global spatial-temporal information from the input features, and the second convLSTM uses the previously computed global spatial-temporal information feature as its initial cell state. This second convLSTM adopts residual connections to preserve spatial information, thereby enhancing the output features. Experiments on the Vimeo90K dataset show that the proposed method outperforms state-of-the-art techniques in peak signal-to-noise-ratio (by 1.45 dB, 1.14 dB, and 0.02 dB over STARnet, TMNet, and 3DAttGAN, respectively), structural similarity index(by 0.027, 0.023, and 0.006 over STARnet, TMNet, and 3DAttGAN, respectively), and visually.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Propagation and non-reciprocity in time-modulated diffusion through the lens of high-order homogenization
Authors:
Marie Touboul,
Bruno Lombard,
Raphaël Assier,
Sébastien Guenneau,
Richard Craster
Abstract:
The homogenization procedure developed here is conducted on a laminate with periodic space-time modulation on the fine scale: at leading order, this modulation creates convection in the low-wavelength regime if both parameters are modulated. However, if only one parameter is modulated, which is more realistic, this convective term disappears and one recovers a standard diffusion equation with effe…
▽ More
The homogenization procedure developed here is conducted on a laminate with periodic space-time modulation on the fine scale: at leading order, this modulation creates convection in the low-wavelength regime if both parameters are modulated. However, if only one parameter is modulated, which is more realistic, this convective term disappears and one recovers a standard diffusion equation with effective homogeneous parameters; this does not describe the non-reciprocity and the propagation of the field observed from exact dispersion diagrams. This inconsistency is corrected here by considering second-order homogenization which results in a non-reciprocal propagation term that is proved to be non-zero for any laminate and verified via numerical simulation. The same methodology is also applied to the case when the density is modulated in the heat equation, leading therefore to a corrective advective term which cancels out non-reciprocity at the leading order but not at the second order.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Inductive Predicate Synthesis Modulo Programs (Extended)
Authors:
Scott Wesley,
Maria Christakis,
Jorge A. Navas,
Richard Trefler,
Valentin Wüstholz,
Arie Gurfinkel
Abstract:
A growing trend in program analysis is to encode verification conditions within the language of the input program. This simplifies the design of analysis tools by utilizing off-the-shelf verifiers, but makes communication with the underlying solver more challenging. Essentially, the analyzer operates at the level of input programs, whereas the solver operates at the level of problem encodings. To…
▽ More
A growing trend in program analysis is to encode verification conditions within the language of the input program. This simplifies the design of analysis tools by utilizing off-the-shelf verifiers, but makes communication with the underlying solver more challenging. Essentially, the analyzer operates at the level of input programs, whereas the solver operates at the level of problem encodings. To bridge this gap, the verifier must pass along proof-rules from the analyzer to the solver. For example, an analyzer for concurrent programs built on an inductive program verifier might need to declare Owicki-Gries style proof-rules for the underlying solver. Each such proof-rule further specifies how a program should be verified, meaning that the problem of passing proof-rules is a form of invariant synthesis.
Similarly, many program analysis tasks reduce to the synthesis of pure, loop-free Boolean functions (i.e., predicates), relative to a program. From this observation, we propose Inductive Predicate Synthesis Modulo Programs (IPS-MP) which extends high-level languages with minimal synthesis features to guide analysis. In IPS-MP, unknown predicates appear under assume and assert statements, acting as specifications modulo the program semantics. Existing synthesis solvers are inefficient at IPS-MP as they target more general problems. In this paper, we show that IPS-MP admits an efficient solution in the Boolean case, despite being generally undecidable. Moreover, we show that IPS-MP reduces to the satisfiability of constrained Horn clauses, which is less general than existing synthesis problems, yet expressive enough to encode verification tasks. We provide reductions from challenging verification tasks -- such as parameterized model checking -- to IPS-MP. We realize these reductions with an efficient IPS-MP-solver based on SeaHorn, and describe a application to smart-contract verification.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
MITL Model Checking via Generalized Timed Automata and a New Liveness Algorithm
Authors:
S. Akshay,
Paul Gastin,
R. Govind,
B. Srivathsan
Abstract:
The translation of Metric Interval Temporal Logic (MITL) to timed automata is a topic that has been extensively studied. A key challenge here is the conversion of future modalities into equivalent automata. Typical conversions equip the automata with a guess-and-check mechanism to ascertain the truth of future modalities. Guess-and-check can be naturally implemented via alternation. However, since…
▽ More
The translation of Metric Interval Temporal Logic (MITL) to timed automata is a topic that has been extensively studied. A key challenge here is the conversion of future modalities into equivalent automata. Typical conversions equip the automata with a guess-and-check mechanism to ascertain the truth of future modalities. Guess-and-check can be naturally implemented via alternation. However, since timed automata tools do not handle alternation, existing methods perform an additional step of converting the alternating timed automata into timed automata. This de-alternation step proceeds by an intricate finite abstraction of the space of configurations of the alternating automaton.
Recently, a model of generalized timed automata (GTA) has been proposed. The model comes with several powerful additional features, and yet, the best known zone-based reachability algorithms for timed automata have been extended to the GTA model, with the same complexity for all the zone operations. We provide a new concise translation from MITL to GTA. In particular, for the timed until modality, our translation offers an exponential improvement w.r.t. the state-of-the-art.
Thanks to this conversion, MITL model checking reduces to checking liveness for GTAs. However, no liveness algorithm is known for GTAs. Due to the presence of future clocks, there is no finite time-abstract bisimulation (region equivalence) for GTAs, whereas liveness algorithms for timed automata crucially rely on the presence of the finite region equivalence. As our second contribution, we provide a new zone-based algorithm for checking Buchi non-emptiness in GTAs, which circumvents this fundamental challenge.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
How Deep is your Guess? A Fresh Perspective on Deep Learning for Medical Time-Series Imputation
Authors:
Linglong Qian,
Tao Wang,
Jun Wang,
Hugh Logan Ellis,
Robin Mitra,
Richard Dobson,
Zina Ibrahim
Abstract:
We introduce a novel classification framework for time-series imputation using deep learning, with a particular focus on clinical data. By identifying conceptual gaps in the literature and existing reviews, we devise a taxonomy grounded on the inductive bias of neural imputation frameworks, resulting in a classification of existing deep imputation strategies based on their suitability for specific…
▽ More
We introduce a novel classification framework for time-series imputation using deep learning, with a particular focus on clinical data. By identifying conceptual gaps in the literature and existing reviews, we devise a taxonomy grounded on the inductive bias of neural imputation frameworks, resulting in a classification of existing deep imputation strategies based on their suitability for specific imputation scenarios and data-specific properties. Our review further examines the existing methodologies employed to benchmark deep imputation models, evaluating their effectiveness in capturing the missingness scenarios found in clinical data and emphasising the importance of reconciling mathematical abstraction with clinical insights. Our classification aims to serve as a guide for researchers to facilitate the selection of appropriate deep learning imputation techniques tailored to their specific clinical data. Our novel perspective also highlights the significance of bridging the gap between computational methodologies and medical insights to achieve clinically sound imputation models.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation
Authors:
Riccardo Cantini,
Giada Cosenza,
Alessio Orsino,
Domenico Talia
Abstract:
Large Language Models (LLMs) have revolutionized artificial intelligence, demonstrating remarkable computational power and linguistic capabilities. However, these models are inherently prone to various biases stemming from their training data. These include selection, linguistic, and confirmation biases, along with common stereotypes related to gender, ethnicity, sexual orientation, religion, soci…
▽ More
Large Language Models (LLMs) have revolutionized artificial intelligence, demonstrating remarkable computational power and linguistic capabilities. However, these models are inherently prone to various biases stemming from their training data. These include selection, linguistic, and confirmation biases, along with common stereotypes related to gender, ethnicity, sexual orientation, religion, socioeconomic status, disability, and age. This study explores the presence of these biases within the responses given by the most recent LLMs, analyzing the impact on their fairness and reliability. We also investigate how known prompt engineering techniques can be exploited to effectively reveal hidden biases of LLMs, testing their adversarial robustness against jailbreak prompts specially crafted for bias elicitation. Extensive experiments are conducted using the most widespread LLMs at different scales, confirming that LLMs can still be manipulated to produce biased or inappropriate responses, despite their advanced capabilities and sophisticated alignment processes. Our findings underscore the importance of enhancing mitigation techniques to address these safety issues, toward a more sustainable and inclusive artificial intelligence.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Pushing high angular resolution and high contrast observations on the VLTI from Y to L band with the Asgard instrumental suite: integration status and plans
Authors:
Marc-Antoine Martinod,
Denis Defrère,
Michael J. Ireland,
Stefan Kraus,
Frantz Martinache,
Peter G. Tuthill,
Fatmé Allouche,
Emilie Bouzerand,
Julia Bryant,
Josh Carter,
Sorabh Chhabra,
Benjamin Courtney-Barrer,
Fred Crous,
Nick Cvetojevic,
Colin Dandumont,
Steve Ertel,
Tyler Gardner,
Germain Garreau,
Adrian M. Glauser,
Xavier Haubois,
Lucas Labadie,
Stéphane Lagarde,
Daniel Lancaster,
Romain Laugier,
Alexandra Mazzoli
, et al. (13 additional authors not shown)
Abstract:
ESO's Very Large Telescope Interferometer has a history of record-breaking discoveries in astrophysics and significant advances in instrumentation. The next leap forward is its new visitor instrument, called Asgard. It comprises four natively collaborating instruments: HEIMDALLR, an instrument performing both fringe tracking and stellar interferometry simultaneously with the same optics, operating…
▽ More
ESO's Very Large Telescope Interferometer has a history of record-breaking discoveries in astrophysics and significant advances in instrumentation. The next leap forward is its new visitor instrument, called Asgard. It comprises four natively collaborating instruments: HEIMDALLR, an instrument performing both fringe tracking and stellar interferometry simultaneously with the same optics, operating in the K band; Baldr, a Strehl optimizer in the H band; BIFROST, a spectroscopic combiner to study the formation processes and properties of stellar and planetary systems in the Y-J-H bands; and NOTT, a nulling interferometer dedicated to imaging nearby young planetary systems in the L band. The suite is in its integration phase in Europe and should be shipped to Paranal in 2025. In this article, we present details of the alignment and calibration unit, the observing modes, the integration plan, the software architecture, and the roadmap to completion of the project.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Skin Effect of Nonlinear Optical Responses in Antiferromagnets
Authors:
Hang Zhou,
Rui-Chun Xiao,
Shu-Hui Zhang,
Wei Gan,
Hui Han,
Hong-Miao Zhao,
Wenjian Lu,
Chang** Zhang,
Yu** Sun,
Hui Li,
Ding-Fu Shao
Abstract:
Nonlinear optics plays important roles in the research of fundamental physics and the applications of highperformance optoelectronic devices. The bulk nonlinear optical responses arise from the uniform light absorption in noncentrosymmetric crystals, and hence are usually considered to be the collective phenomena of all atoms. Here we show, in contrast to this common expectation, the nonlinear opt…
▽ More
Nonlinear optics plays important roles in the research of fundamental physics and the applications of highperformance optoelectronic devices. The bulk nonlinear optical responses arise from the uniform light absorption in noncentrosymmetric crystals, and hence are usually considered to be the collective phenomena of all atoms. Here we show, in contrast to this common expectation, the nonlinear optical responses in antiferromagnets can be selectively accumulated near the surfaces, representing a skin effect. This is because the inversion symmetry, despite being broken globally, is barely violated locally deeply inside these antiferromagnets. Using A-type layered antiferromagnets as the representatives, we predict that the spatial-dependent nonlinear optical responses, such as bulk photovoltaic effect (BPVE) and second harmonic generation (SHG), are notable in the top- and bottom-most layers and decay rapidly when moving away from the surfaces. Such a phenomenon exists in a broad range of antiferromagnets composed of centrosymmetric sublattices, offering promising device applications using these antiferromagnets. Our work uncovers a previously overlooked property of nonlinear optical responses and opens new opportunities for high-performance antiferromagnetic optospintronics.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
CLEO: Continual Learning of Evolving Ontologies
Authors:
Shishir Muralidhara,
Saqib Bukhari,
Georg Schneider,
Didier Stricker,
René Schuster
Abstract:
Continual learning (CL) addresses the problem of catastrophic forgetting in neural networks, which occurs when a trained model tends to overwrite previously learned information, when presented with a new task. CL aims to instill the lifelong learning characteristic of humans in intelligent systems, making them capable of learning continuously while retaining what was already learned. Current CL pr…
▽ More
Continual learning (CL) addresses the problem of catastrophic forgetting in neural networks, which occurs when a trained model tends to overwrite previously learned information, when presented with a new task. CL aims to instill the lifelong learning characteristic of humans in intelligent systems, making them capable of learning continuously while retaining what was already learned. Current CL problems involve either learning new domains (domain-incremental) or new and previously unseen classes (class-incremental). However, general learning processes are not just limited to learning information, but also refinement of existing information. In this paper, we define CLEO - Continual Learning of Evolving Ontologies, as a new incremental learning setting under CL to tackle evolving classes. CLEO is motivated by the need for intelligent systems to adapt to real-world ontologies that change over time, such as those in autonomous driving. We use Cityscapes, PASCAL VOC, and Mapillary Vistas to define the task settings and demonstrate the applicability of CLEO. We highlight the shortcomings of existing CIL methods in adapting to CLEO and propose a baseline solution, called Modelling Ontologies (MoOn). CLEO is a promising new approach to CL that addresses the challenge of evolving ontologies in real-world applications. MoOn surpasses previous CL approaches in the context of CLEO.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Specialist vision-language models for clinical ophthalmology
Authors:
Robbie Holland,
Thomas R. P. Taylor,
Christopher Holmes,
Sophie Riedl,
Julia Mai,
Maria Patsiamanidi,
Dimitra Mitsopoulou,
Paul Hager,
Philip Müller,
Hendrik P. N. Scholl,
Hrvoje Bogunović,
Ursula Schmidt-Erfurth,
Daniel Rueckert,
Sobha Sivaprasad,
Andrew J. Lotery,
Martin J. Menten
Abstract:
Clinicians spend a significant amount of time reviewing medical images and transcribing their findings regarding patient diagnosis, referral and treatment in text form. Vision-language models (VLMs), which automatically interpret images and summarize their findings as text, have enormous potential to alleviate clinical workloads and increase patient access to high-quality medical care. While found…
▽ More
Clinicians spend a significant amount of time reviewing medical images and transcribing their findings regarding patient diagnosis, referral and treatment in text form. Vision-language models (VLMs), which automatically interpret images and summarize their findings as text, have enormous potential to alleviate clinical workloads and increase patient access to high-quality medical care. While foundational models have stirred considerable interest in the medical community, it is unclear whether their general capabilities translate to real-world clinical utility. In this work, we show that foundation VLMs markedly underperform compared to practicing ophthalmologists on specialist tasks crucial to the care of patients with age-related macular degeneration (AMD). To address this, we initially identified the essential capabilities required for image-based clinical decision-making, and then developed a curriculum to selectively train VLMs in these skills. The resulting model, RetinaVLM, can be instructed to write reports that significantly outperform those written by leading foundation medical VLMs in disease staging (F1 score of 0.63 vs. 0.11) and patient referral (0.67 vs. 0.39), and approaches the diagnostic performance of junior ophthalmologists (who achieve 0.77 and 0.78 on the respective tasks). Furthermore, in a reader study involving two senior ophthalmologists with up to 32 years of experience, RetinaVLM's reports were found to be similarly correct (78.6% vs. 82.1%) and complete (both 78.6%) as reports written by junior ophthalmologists with up to 10 years of experience. These results demonstrate that our curriculum-based approach provides a blueprint for specializing generalist foundation medical VLMs to handle real-world clinical tasks.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Normed equivariant ring spectra and higher Tambara functors
Authors:
Bastiaan Cnossen,
Rune Haugseng,
Tobias Lenz,
Sil Linskens
Abstract:
In this paper we extend equivariant infinite loop space theory to take into account multiplicative norms: For every finite group $G$, we construct a multiplicative refinement of the comparison between the $\infty$-categories of connective genuine $G$-spectra and space-valued Mackey functors, first proven by Guillou-May, and use this to give a description of connective normed equivariant ring spect…
▽ More
In this paper we extend equivariant infinite loop space theory to take into account multiplicative norms: For every finite group $G$, we construct a multiplicative refinement of the comparison between the $\infty$-categories of connective genuine $G$-spectra and space-valued Mackey functors, first proven by Guillou-May, and use this to give a description of connective normed equivariant ring spectra as space-valued Tambara functors.
In more detail, we first introduce and study a general notion of homotopy-coherent normed (semi)rings, and identify these with product-preserving functors out of a corresponding $\infty$-category of bispans. In the equivariant setting, this identifies space-valued Tambara functors with normed algebras with respect to a certain normed monoidal structure on grouplike $G$-commutative monoids in spaces. We then show that the latter is canonically equivalent to the normed monoidal structure on connective $G$-spectra given by the Hill-Hopkins-Ravenel norms. Combining our comparison with results of Elmanto-Haugseng and Barwick-Glasman-Mathew-Nikolaus, we produce normed ring structures on equivariant algebraic K-theory spectra.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
L-band nulling interferometry at the VLTI with Asgard/NOTT: status and plans
Authors:
Denis Defrère,
Romain Laugier,
Marc-Antoine Martinod,
Germain Garreau,
Kwinten Missiaen,
Muhammad Salman,
Gert Raskin,
Colin Dandumont,
Steve Ertel,
Michael J. Ireland,
Stefan Kraus,
Lucas Labadie,
Alexandra Mazzoli,
Gyorgy Medgyesi,
Ahmed Sanny,
Olivier Absil,
Peter Ábráham,
Jean-Philippe Berger,
Myriam Bonduelle,
Azzurra Bigioli,
Emilie Bouzerand,
Josh Carter,
Nick Cvetojevic,
Benjamin Courtney-Barrer,
Adrian M. Glauser
, et al. (21 additional authors not shown)
Abstract:
NOTT (formerly Hi-5) is the L'-band (3.5-4.0~microns) nulling interferometer of Asgard, an instrument suite in preparation for the VLTI visitor focus. The primary scientific objectives of NOTT include characterizing (i) young planetary systems near the snow line, a critical region for giant planet formation, and (ii) nearby main-sequence stars close to the habitable zone, with a focus on detecting…
▽ More
NOTT (formerly Hi-5) is the L'-band (3.5-4.0~microns) nulling interferometer of Asgard, an instrument suite in preparation for the VLTI visitor focus. The primary scientific objectives of NOTT include characterizing (i) young planetary systems near the snow line, a critical region for giant planet formation, and (ii) nearby main-sequence stars close to the habitable zone, with a focus on detecting exozodiacal dust that could obscure Earth-like planets. In 2023-2024, the final warm optics have been procured and assembled in a new laboratory at KU Leuven. First fringes and null measurements were obtained using a Gallium Lanthanum Sulfide (GLS) photonic chip that was also tested at cryogenic temperatures. In this paper, we present an overall update of the NOTT project with a particular focus on the cold mechanical design, the first results in the laboratory with the final NOTT warm optics, and the ongoing Asgard integration activities. We also report on other ongoing activities such as the characterization of the photonic chip (GLS, LiNbO3, SiO), the development of the exoplanet science case, the design of the dispersion control module, and the progress with the self-calibration data reduction software.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Approximate Degree Composition for Recursive Functions
Authors:
Sourav Chakraborty,
Chandrima Kayal,
Rajat Mittal,
Manaswi Paraashar,
Nitin Saurabh
Abstract:
Determining the approximate degree composition for Boolean functions remains a significant unsolved problem in Boolean function complexity. In recent decades, researchers have concentrated on proving that approximate degree composes for special types of inner and outer functions. An important and extensively studied class of functions are the recursive functions, i.e.~functions obtained by composi…
▽ More
Determining the approximate degree composition for Boolean functions remains a significant unsolved problem in Boolean function complexity. In recent decades, researchers have concentrated on proving that approximate degree composes for special types of inner and outer functions. An important and extensively studied class of functions are the recursive functions, i.e.~functions obtained by composing a base function with itself a number of times. Let $h^d$ denote the standard $d$-fold composition of the base function $h$.
The main result of this work is to show that the approximate degree composes if either of the following conditions holds:
\begin{itemize}
\item The outer function $f:\{0,1\}^n\to \{0,1\}$ is a recursive function of the form $h^d$, with $h$ being any base function and $d= Ω(\log\log n)$.
\item The inner function is a recursive function of the form $h^d$, with $h$ being any constant arity base function (other than AND and OR) and $d= Ω(\log\log n)$, where $n$ is the arity of the outer function.
\end{itemize}
In terms of proof techniques, we first observe that the lower bound for composition can be obtained by introducing majority in between the inner and the outer functions. We then show that majority can be \emph{efficiently eliminated} if the inner or outer function is a recursive function.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Error estimates of physics-informed neural networks for approximating Boltzmann equation
Authors:
Elie Abdo,
Lihui Chai,
Ruimeng Hu,
Xu Yang
Abstract:
Motivated by the recent successful application of physics-informed neural networks (PINNs) to solve Boltzmann-type equations [S. **, Z. Ma, and K. Wu, J. Sci. Comput., 94 (2023), pp. 57], we provide a rigorous error analysis for PINNs in approximating the solution of the Boltzmann equation near a global Maxwellian. The challenge arises from the nonlocal quadratic interaction term defined in the u…
▽ More
Motivated by the recent successful application of physics-informed neural networks (PINNs) to solve Boltzmann-type equations [S. **, Z. Ma, and K. Wu, J. Sci. Comput., 94 (2023), pp. 57], we provide a rigorous error analysis for PINNs in approximating the solution of the Boltzmann equation near a global Maxwellian. The challenge arises from the nonlocal quadratic interaction term defined in the unbounded domain of velocity space. Analyzing this term on an unbounded domain requires the inclusion of a truncation function, which demands delicate analysis techniques. As a generalization of this analysis, we also provide proof of the asymptotic preserving property when using micro-macro decomposition-based neural networks.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Light transformation: A Celestial and Carrollian perspective
Authors:
Sourish Banerjee,
Rudranil Basu,
Sayali Atul Bhatkar
Abstract:
In this paper, we first study the consequence of spacetime translations and Lorentz transformations on Celestial CFT OPEs. Working with the light transforms of the operators belonging to the modified Mellin basis, we found that the leading order singularity in the OPE of such operators could be fixed purely using Poincaré symmetries owing to the non-trivial action of the translations on these oper…
▽ More
In this paper, we first study the consequence of spacetime translations and Lorentz transformations on Celestial CFT OPEs. Working with the light transforms of the operators belonging to the modified Mellin basis, we found that the leading order singularity in the OPE of such operators could be fixed purely using Poincaré symmetries owing to the non-trivial action of the translations on these operators. The OPE coefficient is then fixed using the soft limit of the correlation functions. We check that this singular structure obtained from symmetries is consistent with the OPE limit of three-point functions. This approach could potentially be useful for studying Celestial CFT without adverting to bulk physics. As another goal, we explore the significance of light transformation in Carrollian CFTs. In the special cases we considered, we show that light transformation equips us with a map between two branches of Carroll CFT in $d=3$ dimension at the level of correlation functions in near coincident limit.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework
Authors:
Shengqi Xu,
Run Sun,
Yi Chang,
Shuning Cao,
Xueyao Xiao,
Luxin Yan
Abstract:
Long-range imaging inevitably suffers from atmospheric turbulence with severe geometric distortions due to random refraction of light. The further the distance, the more severe the disturbance. Despite existing research has achieved great progress in tackling short-range turbulence, there is less attention paid to long-range turbulence with significant distortions. To address this dilemma and adva…
▽ More
Long-range imaging inevitably suffers from atmospheric turbulence with severe geometric distortions due to random refraction of light. The further the distance, the more severe the disturbance. Despite existing research has achieved great progress in tackling short-range turbulence, there is less attention paid to long-range turbulence with significant distortions. To address this dilemma and advance the field, we construct a large-scale real long-range atmospheric turbulence dataset (RLR-AT), including 1500 turbulence sequences spanning distances from 1 Km to 13 Km. The advantages of RLR-AT compared to existing ones: turbulence with longer-distances and higher-diversity, scenes with greater-variety and larger-scale. Moreover, most existing work adopts either registration-based or decomposition-based methods to address distortions through one-step mitigation. However, they fail to effectively handle long-range turbulence due to its significant pixel displacements. In this work, we propose a coarse-to-fine framework to handle severe distortions, which cooperates dynamic turbulence and static background priors (CDSP). On the one hand, we discover the pixel motion statistical prior of turbulence, and propose a frequency-aware reference frame for better large-scale distortion registration, greatly reducing the burden of refinement. On the other hand, we take advantage of the static prior of background, and propose a subspace-based low-rank tensor refinement model to eliminate the misalignments inevitably left by registration while well preserving details. The dynamic and static priors complement to each other, facilitating us to progressively mitigate long-range turbulence with severe distortions. Extensive experiments demonstrate that the proposed method outperforms SOTA methods on different datasets.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.