Search | arXiv e-print repository

Surgical Text-to-Image Generation

Authors: Chinedu Innocent Nwoye, Rupak Bose, Kareem Elgohary, Lorenzo Arboit, Giorgio Carlino, Joël L. Lavanchy, Pietro Mascagni, Nicolas Padoy

Abstract: Acquiring surgical data for research and development is significantly hindered by high annotation costs and practical and ethical constraints. Utilizing synthetically generated images could offer a valuable alternative. In this work, we conduct an in-depth analysis on adapting text-to-image generative models for the surgical domain, leveraging the CholecT50 dataset, which provides surgical images… ▽ More Acquiring surgical data for research and development is significantly hindered by high annotation costs and practical and ethical constraints. Utilizing synthetically generated images could offer a valuable alternative. In this work, we conduct an in-depth analysis on adapting text-to-image generative models for the surgical domain, leveraging the CholecT50 dataset, which provides surgical images annotated with surgical action triplets (instrument, verb, target). We investigate various language models and find T5 to offer more distinct features for differentiating surgical actions based on triplet-based textual inputs. Our analysis demonstrates strong alignment between long and triplet-based captions, supporting the use of triplet-based labels. We address the challenges in training text-to-image models on triplet-based captions without additional input signals by uncovering that triplet text embeddings are instrument-centric in the latent space and then, by designing an instrument-based class balancing technique to counteract the imbalance and skewness in the surgical data, improving training convergence. Extending Imagen, a diffusion-based generative model, we develop Surgical Imagen to generate photorealistic and activity-aligned surgical images from triplet-based textual prompts. We evaluate our model using diverse metrics, including human expert surveys and automated methods like FID and CLIP scores. We assess the model performance on key aspects: quality, alignment, reasoning, knowledge, and robustness, demonstrating the effectiveness of our approach in providing a realistic alternative to real data collection. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 11 pages, 11 figures, 3 tables, project page at https://camma-public.github.io/surgicalimagen/

arXiv:2407.09229 [pdf, ps, other]

On Hölder continuity and $p^\mathrm{th}$-variation function of Weierstrass-type functions

Authors: Matyas Barczy, Peter Kern

Abstract: We study Hölder continuity, $p^\mathrm{th}$-variation function and Riesz variation of Weierstrass-type functions along a sequence of $b$-adic partitions, where $b>1$ is an integer. By a Weierstrass-type function, we mean that in the definition of the well-known Weierstrass function, the power function is replaced by a submultiplicative function, and the Lipschitz continuous cosine and sine functio… ▽ More We study Hölder continuity, $p^\mathrm{th}$-variation function and Riesz variation of Weierstrass-type functions along a sequence of $b$-adic partitions, where $b>1$ is an integer. By a Weierstrass-type function, we mean that in the definition of the well-known Weierstrass function, the power function is replaced by a submultiplicative function, and the Lipschitz continuous cosine and sine functions are replaced by a general Hölder continuous function. Our results extend some of the recent results of Schied and Zhang (2020, 2024). △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 27 pages

MSC Class: 26A16; 26A45; 60F99

arXiv:2407.09223 [pdf, other]

Codification of Good Seamanship in Complex and Congested Waterways

Authors: Yaqub Aris Prabowo, Peter Nicholas Hansen, Dimitrios Papageorgiou, Roberto Galeazzi

Abstract: This paper presents a novel method to quantify seafarers' good seamanship during navigation scenarios with multi-vessel encounters -- in open and confined waters --, and to compute COLREG's-compliant trajectories for avoiding collision and grounding. The quantification of good seamanship requires knowledge about the state of the vessels (position, heading, and speed) and the surrounding sailing en… ▽ More This paper presents a novel method to quantify seafarers' good seamanship during navigation scenarios with multi-vessel encounters -- in open and confined waters --, and to compute COLREG's-compliant trajectories for avoiding collision and grounding. The quantification of good seamanship requires knowledge about the state of the vessels (position, heading, and speed) and the surrounding sailing environment. Such information is accessible through the AIS system and the electronic nautical chart. The proposed method evaluates mutual collision risk by examining domain violations of each vessel, and comparing them to the seaman's actions. This results in a comprehensive metric of good seamanship. As risk free actions are not always possible in the resolution of a potential collision and grounding, the method adopts a branch-and-bound scheme to identify achievable maneuvers that minimize the risk. Further, the dynamic nature of vessel speed in congested scenarios is considered, recognizing potential changes in both own and target vessels' forward speeds. The proposed method is experimentally evaluated using historical AIS data and sea charts of Danish waters. This research contributes to the field by providing a more realistic perspective on seamanship in complex maritime environments. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: The 15th IFAC Conference on Control Applications in Marine Systems, Robotics, and Vehicles (CAMS 2024)

arXiv:2407.09220 [pdf, other]

An improved formulation for the wake-added turbulence for intra-farm and farm-to-farm wake modeling

Authors: Navid Zehtabiyan-Rezaie, Josephine Perto Justsen, Mahdi Abkar

Abstract: In this study, we present an improved formulation for the wake-added turbulence to enhance the accuracy of intra-farm and farm-to-farm wake modeling through analytical frameworks. Our goal is to address the tendency of a commonly used formulation to overestimate turbulence intensity within wind farms and to overcome its limitations in predicting the streamwise evolution of turbulence intensity bey… ▽ More In this study, we present an improved formulation for the wake-added turbulence to enhance the accuracy of intra-farm and farm-to-farm wake modeling through analytical frameworks. Our goal is to address the tendency of a commonly used formulation to overestimate turbulence intensity within wind farms and to overcome its limitations in predicting the streamwise evolution of turbulence intensity beyond them. To this end, we utilize high-fidelity data and adopt an optimization technique to derive an optimized functional form of the wake-added turbulence. We then integrate the achieved formulation with a widely used Gaussian wake model to study various intra-farm and farm-to-farm scenarios. The outcomes reveal that the new methodology effectively addresses the overestimation of power in both standalone wind farms and those impacted by upstream counterparts. Our new approach meets the need for accurate and lightweight models, ensuring the effective coexistence of wind farms within clusters as the wind-energy capacity rapidly expands. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09215 [pdf, other]

HUP-3D: A 3D multi-view synthetic dataset for assisted-egocentric hand-ultrasound pose estimation

Authors: Manuel Birlo, Razvan Caramalau, Philip J. "Eddie" Edwards, Brian Dromey, Matthew J. Clarkson, Danail Stoyanov

Abstract: We present HUP-3D, a 3D multi-view multi-modal synthetic dataset for hand-ultrasound (US) probe pose estimation in the context of obstetric ultrasound. Egocentric markerless 3D joint pose estimation has potential applications in mixed reality based medical education. The ability to understand hand and probe movements programmatically opens the door to tailored guidance and mentoring applications.… ▽ More We present HUP-3D, a 3D multi-view multi-modal synthetic dataset for hand-ultrasound (US) probe pose estimation in the context of obstetric ultrasound. Egocentric markerless 3D joint pose estimation has potential applications in mixed reality based medical education. The ability to understand hand and probe movements programmatically opens the door to tailored guidance and mentoring applications. Our dataset consists of over 31k sets of RGB, depth and segmentation mask frames, including pose related ground truth data, with a strong emphasis on image diversity and complexity. Adopting a camera viewpoint-based sphere concept allows us to capture a variety of views and generate multiple hand grasp poses using a pre-trained network. Additionally, our approach includes a software-based image rendering concept, enhancing diversity with various hand and arm textures, lighting conditions, and background images. Furthermore, we validated our proposed dataset with state-of-the-art learning models and we obtained the lowest hand-object keypoint errors. The dataset and other details are provided with the supplementary material. The source code of our grasp generation and rendering pipeline will be made publicly available. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: https://conferences.miccai.org/2024/en/

arXiv:2407.09208 [pdf, other]

Fabrication of low-loss lithium niobate on insulator waveguides on the wafer scale

Authors: Mohammadreza Younesi, Thomas Kasebier, Ilia Elmanov, Yang-Teng Li, Pawan Kumar, Reinhard Geiss, Thomas Siefke, Falk Eilenberger, Frank Setzpfandt, Uwe Zeitner, Thomas Pertsch

Abstract: We report on the wafer scale fabrication of single mode low-loss lithium niobate on insulator waveguides utilizing a chemically amplified resist and an optimized dry etching method. The fabricated single mode waveguides are free of residuals and re-deposition, with measured losses for straight waveguides around 2 dB/m (0.02 dB/cm). We present on a method offering advantages for large-scale product… ▽ More We report on the wafer scale fabrication of single mode low-loss lithium niobate on insulator waveguides utilizing a chemically amplified resist and an optimized dry etching method. The fabricated single mode waveguides are free of residuals and re-deposition, with measured losses for straight waveguides around 2 dB/m (0.02 dB/cm). We present on a method offering advantages for large-scale production due to its cost-effectiveness, faster writing time, and simplified processes. This work holds promise for advancing integrated photonics and optical communication technologies. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09207 [pdf, other]

Quantification of the low-$p_{\rm T}$ pion excess in heavy-ion collisions at the LHC and top RHIC energy

Authors: Pengzhong Lu, Rafet Kavak, Andrea Dubla, Silvia Masciocchi, Ilya Selyuzhenkov

Abstract: A Bayesian inference analysis is performed to quantify the pion excess in the low transverse momentum ($p_{\rm T}$) regime in heavy-ion collisions. This quantification is conducted across centrality classes and collision systems at the LHC and top RHIC energy. The analysis is based on the relativistic fluid dynamics description of $p_{\rm T}$ spectra of identified charged hadrons. A $p_{\rm T}$ ra… ▽ More A Bayesian inference analysis is performed to quantify the pion excess in the low transverse momentum ($p_{\rm T}$) regime in heavy-ion collisions. This quantification is conducted across centrality classes and collision systems at the LHC and top RHIC energy. The analysis is based on the relativistic fluid dynamics description of $p_{\rm T}$ spectra of identified charged hadrons. A $p_{\rm T}$ range scan investigation to determine the optimal range for the pion $p_{\rm T}$ spectra with respect to a fluid dynamic description is performed, finding 0.5~$< p_{\rm T} < $~2.0~GeV$/c$. A significant low-$p_{\rm T}$ pion excess is computed across all centrality classes and collision systems investigated, indicating that a clear low-$p_{\rm T}$ component arises from different physics mechanisms, rather than thermal production. Further comparison with measurements by the PHOBOS Collaboration confirms the persistence of the pion excess into very low-$p_{\rm T}$, with no enhancement observed in kaons and protons within the current experimental precision. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 9 pages, 6 figures

arXiv:2407.09201 [pdf, other]

The MICADO first light imager for the ELT: off-axis performance of PSF reconstruction

Authors: Matteo Simioni, Daniel Jodlbauer, Carmelo Arcidiacono, Andrea Grazian, Marco Gullieuszik, Elisa Portaluri, Benedetta Vulcani, Roland Wagner, Anita Zanella, Johanna Hartke, Tapio Helin, Hanindyo Kuncarayakti, Fernando Pedichini, Roberto Piazzesi, Piero Vaccari

Abstract: The highest scientific return, for adaptive optics (AO) observations, is achieved with a reliable reconstruction of the PSF. This is especially true for MICADO@ELT. In this presentation, we will focus on extending the MICADO PSF reconstruction (PSF-R) method to the off-axis case. Specifically, a novel approach based on temporal-based tomography of AO telemetry data has been recently implemented. R… ▽ More The highest scientific return, for adaptive optics (AO) observations, is achieved with a reliable reconstruction of the PSF. This is especially true for MICADO@ELT. In this presentation, we will focus on extending the MICADO PSF reconstruction (PSF-R) method to the off-axis case. Specifically, a novel approach based on temporal-based tomography of AO telemetry data has been recently implemented. Results from the PSF-R of both simulated and real data show that, at half isoplanatic angle distances, a precision of about 10-15% is achievable in both Strehl ratio and full-width at half maximum, paving the way to extend the MICADO PSF-R tool also to the multi-conjugated AO case. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 8 pages, 4 figures. Proceeding of SPIE Astronomical Telescopes + Instrumentation 2024, Adaptive Optics Systems IX

arXiv:2407.09197 [pdf, other]

A Chatbot for Asylum-Seeking Migrants in Europe

Authors: Bettina Fazzinga, Elena Palmieri, Margherita Vestoso, Luca Bolognini, Andrea Galassi, Filippo Furfaro, Paolo Torroni

Abstract: We present ACME: A Chatbot for asylum-seeking Migrants in Europe. ACME relies on computational argumentation and aims to help migrants identify the highest level of protection they can apply for. This would contribute to a more sustainable migration by reducing the load on territorial commissions, Courts, and humanitarian organizations supporting asylum applicants. We describe the context, system… ▽ More We present ACME: A Chatbot for asylum-seeking Migrants in Europe. ACME relies on computational argumentation and aims to help migrants identify the highest level of protection they can apply for. This would contribute to a more sustainable migration by reducing the load on territorial commissions, Courts, and humanitarian organizations supporting asylum applicants. We describe the context, system architectures, technologies, and the case study used to run the demonstration. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09194 [pdf, other]

doi 10.1093/mnras/stae1602

The JWST Weather Report from the Nearest Brown Dwarfs I: multi-period JWST NIRSpec + MIRI monitoring of the benchmark binary brown dwarf WISE 1049AB

Authors: Beth A. Biller, Johanna M. Vos, Yifan Zhou, Allison M. McCarthy, Xianyu Tan, Ian J. M. Crossfield, Niall Whiteford, Genaro Suarez, Jacqueline Faherty, Elena Manjavacas, Xueqing Chen, Pengyu Liu, Ben J. Sutlieff, Mary Anne Limbach, Paul Molliere, Trent J. Dupuy, Natalia Oliveros-Gomez, Philip S. Muirhead, Thomas Henning, Gregory Mace, Nicolas Crouzet, Theodora Karalidi, Caroline V. Morley, Pascal Tremblin, Tiffany Kataria

Abstract: We report results from 8 hours of JWST/MIRI LRS spectroscopic monitoring directly followed by 7 hours of JWST/NIRSpec prism spectroscopic monitoring of the benchmark binary brown dwarf WISE 1049AB, the closest, brightest brown dwarfs known. We find water, methane, and CO absorption features in both components, including the 3.3 $μ$m methane absorption feature and a tentative detection of small gra… ▽ More We report results from 8 hours of JWST/MIRI LRS spectroscopic monitoring directly followed by 7 hours of JWST/NIRSpec prism spectroscopic monitoring of the benchmark binary brown dwarf WISE 1049AB, the closest, brightest brown dwarfs known. We find water, methane, and CO absorption features in both components, including the 3.3 $μ$m methane absorption feature and a tentative detection of small grain ($<$ 1$μ$m) silicate absorption at $>$8.5 $μ$m in WISE 1049A. Both components vary significantly ($>$1$\%$), with WISE 1049B displaying larger variations than WISE 1049A. Using K-means clustering, we find three main transition points in wavelength for both components of the binary: 1) change in behavior at $\sim$2.3 $μ$m coincident with a CO absorption bandhead, 2) change in behavior at 4.2 $μ$m, close to the CO fundamental band at $λ>$ 4.4 $μ$m, and 3) change in behavior at 8.3-8.5 $μ$m, potentially corresponding to silicate absorption. We interpret the lightcurves observed with both NIRSpec and MIRI as likely stemming from 1) a deep pressure level driving the double-peaked variability seen in WISE 1049B at wavelengths $<$2.3 $μ$m and $>$8.5 $μ$m, 2) an intermediate pressure level sha** the lightcurve morphology between 2.3 and 4.2 $μ$m, and 3) a higher-altitude pressure level producing single-peaked and plateaued lightcurve behavior between 4.2 and 8.5 $μ$m. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 28 pages, 27 figures, accepted to MNRAS

arXiv:2407.09188 [pdf, other]

First principles calculations of dynamical Born effective charges, quadrupoles and higher order terms from the charge response in large semiconducting and metallic systems

Authors: Francesco Macheda, Paolo Barone, Francesco Mauri

Abstract: Within the context of first principles techniques we present a theoretical and computational framework to quickly determine, at finite momentum, the self-consistent (longitudinal) charge response to an external perturbation, that enters the determination of the scattering cross section of inelastic scattering processes such as EELS. We also determine the (tranverse) charge response computed in sho… ▽ More Within the context of first principles techniques we present a theoretical and computational framework to quickly determine, at finite momentum, the self-consistent (longitudinal) charge response to an external perturbation, that enters the determination of the scattering cross section of inelastic scattering processes such as EELS. We also determine the (tranverse) charge response computed in short-circuit condition. The all-order quasimomentum expansion of the tranverse charge response to an atomic displacement are the Born effective charges, quadrupoles, octupoles etc. We demonstrate that the transverse charge response can be related to the longitudinal one via a well-defined long-range dielectric function. Our advancements lead to an efficient use of perturbation theory. Due to its more favorable scaling, our method provides an interesting computational alternative to the use of the 2n+1 theorem, especially for semiconductors and metals with large unit cells. For semiconductors, we compute the piezoelectric properties of a large cell solid-solution of semiconducting hafniun oxide containing 96 atoms. We here show that the clamped ion piezoelectric response can be decomposed into real-space localized contributions that mostly depend on the chemical environment, paving the way for the use of machine-learning techniques in the material search for optimized piezoelectrics. We further apply our methodology to determine the density response of metals. Here, the leading terms of the charge expansion are related to the Fermi energy shift of the potential and by Born effective charges which do not sum to zero over the atoms. We apply our developments to the TEM-EELS spectroscopy of lithium intercalated graphites, where we find that the use of the atomic form-factor in the long-wavelength limit does not take into account for the anisotropy of the atomic chemical bonding. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09185 [pdf, other]

Silicon drift detectors for the Spectroscopy Focusing Array of eXTP

Authors: A. Altmann, T. F. Bechteler, R. Strecker, P. Lechner, R. Andritschke, G. Hauser, C. Fiorini, K. Nandra

Abstract: We present a silicon drift detector (SDD) system for the spectroscopy focusing array (SFA) of the enhanced X-ray timing and polarimetry (eXTP) mission. The SFA focuses on fast timing (time resolution below 10 μs) and good spectroscopy capabilities (energy resolution better than 180 eV @ 6 keV). The sensor, consisting of 19 hexagonally shaped pixels with a total sensitive area of ${5.05}\, cm^{2}$,… ▽ More We present a silicon drift detector (SDD) system for the spectroscopy focusing array (SFA) of the enhanced X-ray timing and polarimetry (eXTP) mission. The SFA focuses on fast timing (time resolution below 10 μs) and good spectroscopy capabilities (energy resolution better than 180 eV @ 6 keV). The sensor, consisting of 19 hexagonally shaped pixels with a total sensitive area of ${5.05}\, cm^{2}$, is connected to three high time resolution spectroscopy (HTRS) ASICs, allowing a fast readout of the detector signals. The detector works in a Charge- Sensitive Amplifier configuration. We assembled a prototype detector module and present here its mechanical design, describe the used sensor, and report about its performance. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 9 pages, 5 figures; Conference: SPIE Space Telescopes and Instrumentation 2024: Ultraviolet to Gamma Ray

arXiv:2407.09179 [pdf]

Magnetic properties and field-induced phenomena in the Jeff = 1/2 distorted kagome antiferromagnet

Authors: A. Yadav, A. Elghandour, T. Arh, D. T. Adroja, M. D. Le, G. B. G. Stenning, M. Aouane, S. Luther, F. Hotz, T. J. Hicken, H. Luetkens, A. Zorko, R. Klingeler, P. Khuntia

Abstract: The intertwining between competing degrees of freedom, anisotropy, and frustration-induced quantum fluctuations offers an ideal ground to realize exotic quantum phenomena in the rare-earth-based kagome lattice. The magnetic susceptibility reveals the presence of two energy scales in agreement with the INS results. The higher energy state is dominated by CEF excitations, where the lowest Kramers gr… ▽ More The intertwining between competing degrees of freedom, anisotropy, and frustration-induced quantum fluctuations offers an ideal ground to realize exotic quantum phenomena in the rare-earth-based kagome lattice. The magnetic susceptibility reveals the presence of two energy scales in agreement with the INS results. The higher energy state is dominated by CEF excitations, where the lowest Kramers ground-state doublet is well separated from the excited state suggesting that the compound realizes a low-energy state at low temperatures. The second energy scale is witnessed via thermodynamic results that reveal an anomaly at 0.3 K typical of a phase transition, which is attributed to the presence of complex magnetic ordering phenomena. The broad maximum in the specific heat well above 0.3 K indicates the presence of short-range spin correlations that is corroborated by muon spin relaxation rate results. The isothermal magnetization reveals a field-induced 1/3 magnetization plateau at low temperatures. muSR relaxation rate experiments, on the other hand, neither show the signature of a phase transition nor spin-freezing down to 34 mK. The ZF muSR relaxation is governed by the Orbach process and reveals the presence of a fluctuating state owing to the depopulation of crystal field levels reflected as a constant value of relaxation rate in the temperature range 0.4-10 K. NMR results indicate the presence of fluctuating Nd3+ moments down to 1.8 K consistent with muSR experiments. Our comprehensive results reveal that a field-induced quantum critical phenomenon is at play in this frustrated kagome magnet and enable us to construct a phase diagram exemplifying the proximity effect of competing magnetic states. This sets the stage to investigate the broad RE3BWO9 family of rare-earth kagome magnets promising to host exotic quantum states driven by spin-orbit coupling and geometrical frustration. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09169 [pdf, other]

Tensor networks enable the calculation of turbulence probability distributions

Authors: Nikita Gourianov, Peyman Givi, Dieter Jaksch, Stephen B. Pope

Abstract: Predicting the dynamics of turbulent fluid flows has long been a central goal of science and engineering. Yet, even with modern computing technology, accurate simulation of all but the simplest turbulent flow-fields remains impossible: the fields are too chaotic and multi-scaled to directly store them in memory and perform time-evolution. An alternative is to treat turbulence… ▽ More Predicting the dynamics of turbulent fluid flows has long been a central goal of science and engineering. Yet, even with modern computing technology, accurate simulation of all but the simplest turbulent flow-fields remains impossible: the fields are too chaotic and multi-scaled to directly store them in memory and perform time-evolution. An alternative is to treat turbulence $\textit{probabilistically}$, viewing flow properties as random variables distributed according to joint probability density functions (PDFs). Turbulence PDFs are neither chaotic nor multi-scale, but are still challenging to simulate due to their high dimensionality. Here we show how to overcome the dimensionality problem by parameterising turbulence PDFs into an extremely compressed format known as a "tensor network" (TN). The TN paradigm enables simulations on single CPU cores that would otherwise be impractical even with supercomputers: for a $5+1$ dimensional PDF of a chemically reactive turbulent flow, we achieve reductions in memory and computational costs by factors of $\mathcal{O}(10^6)$ and $\mathcal{O}(10^3)$, respectively, compared to standard finite difference algorithms. A future path is opened towards something heretofore regarded as infeasible: directly simulating high-dimensional PDFs of both turbulent flows and other chaotic systems that are useful to describe probabilistically. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: Initial submission

arXiv:2407.09152 [pdf]

The Two Sides of the Coin: Hallucination Generation and Detection with LLMs as Evaluators for LLMs

Authors: Anh Thu Maria Bui, Saskia Felizitas Brech, Natalie Hußfeldt, Tobias Jennert, Melanie Ullrich, Timo Breuer, Narjes Nikzad Khasmakhi, Philipp Schaer

Abstract: Hallucination detection in Large Language Models (LLMs) is crucial for ensuring their reliability. This work presents our participation in the CLEF ELOQUENT HalluciGen shared task, where the goal is to develop evaluators for both generating and detecting hallucinated content. We explored the capabilities of four LLMs: Llama 3, Gemma, GPT-3.5 Turbo, and GPT-4, for this purpose. We also employed ens… ▽ More Hallucination detection in Large Language Models (LLMs) is crucial for ensuring their reliability. This work presents our participation in the CLEF ELOQUENT HalluciGen shared task, where the goal is to develop evaluators for both generating and detecting hallucinated content. We explored the capabilities of four LLMs: Llama 3, Gemma, GPT-3.5 Turbo, and GPT-4, for this purpose. We also employed ensemble majority voting to incorporate all four models for the detection task. The results provide valuable insights into the strengths and weaknesses of these LLMs in handling hallucination generation and detection tasks. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: Paper accepted at ELOQUENT@CLEF'24

arXiv:2407.09147 [pdf, other]

AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments

Authors: Tomislav Duricic, Peter Müllner, Nicole Weidinger, Neven ElSayed, Dominik Kowald, Eduardo Veas

Abstract: Many industrial sectors rely on well-trained employees that are able to operate complex machinery. In this work, we demonstrate an AI-powered immersive assistance system that supports users in performing complex tasks in industrial environments. Specifically, our system leverages a VR environment that resembles a juice mixer setup. This digital twin of a physical setup simulates complex industrial… ▽ More Many industrial sectors rely on well-trained employees that are able to operate complex machinery. In this work, we demonstrate an AI-powered immersive assistance system that supports users in performing complex tasks in industrial environments. Specifically, our system leverages a VR environment that resembles a juice mixer setup. This digital twin of a physical setup simulates complex industrial machinery used to mix preparations or liquids (e.g., similar to the pharmaceutical industry) and includes various containers, sensors, pumps, and flow controllers. This setup demonstrates our system's capabilities in a controlled environment while acting as a proof-of-concept for broader industrial applications. The core components of our multimodal AI assistant are a large language model and a speech-to-text model that process a video and audio recording of an expert performing the task in a VR environment. The video and speech input extracted from the expert's video enables it to provide step-by-step guidance to support users in executing complex tasks. This demonstration showcases the potential of our AI-powered assistant to reduce cognitive load, increase productivity, and enhance safety in industrial environments. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 3 pages, 2 figures, Demo Paper accepted at the 50th European Conference on Artificial Intelligence

arXiv:2407.09140 [pdf, other]

FlyEye Ground-Based Telescope: Unveiling New Frontiers in Astronomical Science

Authors: Carmelo Arcidiacono, Matteo Simioni, Roberto Ragazzoni, Piero Gregori, Paolo Lorenzi, Francesco Cerutti, Roberto Ziano, Matteo Bisiani, Roberta Pellegrini, Andrea Guazzora, Silvano Pieri, Marco Dima, Silvio Di Rosa, Simone Zaggia, Jacopo Farinato, Demetrio Magrin, Andrea Grazian, Marco Gullieuszik

Abstract: The FlyEye design makes its debut in the ESA's NEOSTEL developed by OHB-Italia. This pioneering FlyEye telescope integrates a monolithic 1-meter class primary mirror feeding 16 CCD cameras for discovering Near-Earth Object (NEO) and any class of transient phenomena. OHB-Italia is the prime contractor, receiving extended support from the Italian National Institute for Astrophysics (INAF) in the ESA… ▽ More The FlyEye design makes its debut in the ESA's NEOSTEL developed by OHB-Italia. This pioneering FlyEye telescope integrates a monolithic 1-meter class primary mirror feeding 16 CCD cameras for discovering Near-Earth Object (NEO) and any class of transient phenomena. OHB-Italia is the prime contractor, receiving extended support from the Italian National Institute for Astrophysics (INAF) in the ESA's NEOSTED program's integration and testing. The FlyEye distinctive design splits the Field of View into 16 channels, creating a unique multi-telescope system with a panoramic 44 square degree Field of View and a seeing-size pixel-scale, enabling NEOs detection down to apparent magnitudes 21.5 insisting on a 1m diameter spherical mirror. The scientific products of a similar FlyEye telescope can complement facilities such as Vera Rubin (former LSST) and ZTF. The FlyEye has the ability to survey two-thirds of the visible sky about three times per night can revolutionize time-domain astronomy, enabling comprehensive studies of transient phenomena, placing FlyEye in a new era of exploration of the dynamic universe. Efforts to develop automated calibration and testing procedures are keys to realizing this transformative potential. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 9 pages, 1 figure, SPIE Astronomical Telescopes + Instrumentation, Ground-based and Airborne Instrumentation for Astronomy X, 16-21 June 2024

arXiv:2407.09139 [pdf, other]

Measurement of $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer , et al. (414 additional authors not shown)

Abstract: We report measurements of time-dependent $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays based on a data sample of $(388\pm6)\times10^6$ $B\bar{B}$ events collected at the $Υ(4S)$ resonance with the Belle II detector. The Belle II experiment operates at the SuperKEKB asymmetric-energy $e^+e^-$ collider. We measure decay-time distributions to determine $CP$-violating parameters $S$ and $C$. We det… ▽ More We report measurements of time-dependent $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays based on a data sample of $(388\pm6)\times10^6$ $B\bar{B}$ events collected at the $Υ(4S)$ resonance with the Belle II detector. The Belle II experiment operates at the SuperKEKB asymmetric-energy $e^+e^-$ collider. We measure decay-time distributions to determine $CP$-violating parameters $S$ and $C$. We determine these parameters for two ranges of $K^0_S π^0$ invariant mass: $m(K^0_S π^0)\in (0.8, 1.0)$ $GeV/c^2$, which is dominated by $B^0 \to K^{*0} (\to K^0_S π^0) γ$ decays, and a complementary region $m(K^0_S π^0)\in (0.6, 0.8)\cup(1.0, 1.8)$ $GeV/c^2$. Our results have improved precision as compared to previous measurements and are consistent with theory predictions. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 10 pages, 4 figures

Report number: Belle II Preprint 2024-009, KEK Preprint 2024-1

arXiv:2407.09135 [pdf, other]

A numerical simulation study of an astrometry case for MORFEO at the ELT

Authors: Carmelo Arcidiacono, Elisa Portaluri, Marco Gullieuszik, Michele Cantiello, Francesca Annibali, Paolo Ciliegi, Matteo Simioni, Daniela Fantinel, Guido Agapito, Demetrio Magrin

Abstract: We report results from numerical simulations assessing astrometry measurements with the Multiconjugate Adaptive Optics Relay for ELT Observations (MORFEO) instrument on the Extremely Large Telescope (ELT). Using the Advanced Exposure Time Calculator (AETC), we evaluate MORFEO astrometric accuracy in moderately crowded fields. Our simulations account for spatially variable Point Spread Function (PS… ▽ More We report results from numerical simulations assessing astrometry measurements with the Multiconjugate Adaptive Optics Relay for ELT Observations (MORFEO) instrument on the Extremely Large Telescope (ELT). Using the Advanced Exposure Time Calculator (AETC), we evaluate MORFEO astrometric accuracy in moderately crowded fields. Our simulations account for spatially variable Point Spread Function (PSF), geometric distortion, and rotation-dependent variations. We computed focal plane coordinates using observed stellar distribution and computed population synthesis with the SPISEA tool, generating stellar magnitude distributions for MICADO filters at selected metallicities and stellar ages. Our analysis shows that MORFEO can achieve high-precision astrometry in the galaxy neighborhood (within $μ< 24$ mag) by minimizing PSF enlargement and optimizing calibration strategies. These results inform future observational campaigns and contribute to the development of astrometric science cases for the ELT. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 7 pages, 6 figures, Proceeding for SPIE Astronomical Telescopes + Instrumentation, Adaptive Optics Systems IX, 16-21 June 2024

arXiv:2407.09132 [pdf, ps, other]

The MICADO first light imager for the ELT: the PSF Reconstruction Software

Authors: Andrea Grazian, Elisa Portaluri, Matteo Simioni, Carmelo Arcidiacono, Marco Gullieuszik, Johanna Hartke, Daniel Jodlbauer, Fernando Pedichini, Roberto Piazzesi, Piero Vaccari, Benedetta Vulcani, Roland Wagner, Anita Zanella

Abstract: MICADO is the first-light camera of the ESO ELT, allowing NIR imaging and long-slit spectroscopy assisted by adaptive optics. MICADO is now entering its construction phase, and the software for data reduction is reaching an adequate maturity level. The PSF Reconstruction (PSF-R) of MICADO is a software tool for the blind derivation of the PSF, only using adaptive optics telemetry data. An update o… ▽ More MICADO is the first-light camera of the ESO ELT, allowing NIR imaging and long-slit spectroscopy assisted by adaptive optics. MICADO is now entering its construction phase, and the software for data reduction is reaching an adequate maturity level. The PSF Reconstruction (PSF-R) of MICADO is a software tool for the blind derivation of the PSF, only using adaptive optics telemetry data. An update of the status of the PSF-R service is provided here. The PSF-R prototype has been tested on ERIS@VLT data in order to check the reconstruction of on- and off-axis PSFs. The on-axis PSF-R is accurate at a few percent level on Strehl, FWHM, Encircled Energy, and half light radius, while for the off-axis case the match is within 10-15 percent at a distance of half isoplanatic angle. The first version of the workflow for the PSF-R pipeline has been developed and verified using the latest release of the ESO data processing system. A set of simulations has been implemented on the morphological analysis of distant galaxies, showing that the accuracy of the PSF-R matches the goals needed to study their morphology. In summary, the PSF-R team is on the right track towards the ELT first light. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 5 pages, 3 figures, Proceedings for the SPIE Astronomical Telescopes and Instrumentation 2024, Adaptive Optics Systems IX, Paper No.13097-234

arXiv:2407.09126 [pdf, ps, other]

On the Problem of Defining Charge Operators for the Dirac Quantum Field

Authors: Pablo Costa Rico, Roderich Tumulka

Abstract: It is well known how to define the operator $Q$ for the total charge (i.e., positron number minus electron number) on the standard Hilbert space of the second-quantized Dirac equation. Here we ask about operators $Q_A$ representing the charge content of a region $A\subseteq \mathbb{R}^3$ in 3d physical space. There is a natural formula for $Q_A$ but, as we explain, there are difficulties about tur… ▽ More It is well known how to define the operator $Q$ for the total charge (i.e., positron number minus electron number) on the standard Hilbert space of the second-quantized Dirac equation. Here we ask about operators $Q_A$ representing the charge content of a region $A\subseteq \mathbb{R}^3$ in 3d physical space. There is a natural formula for $Q_A$ but, as we explain, there are difficulties about turning it into a mathematically precise definition. First, $Q_A$ can be written as a series but its convergence seems hopeless. Second, we show for some choices of $A$ that if $Q_A$ could be defined then its domain could not contain either the vacuum vector or any vector obtained from the vacuum by applying a polynomial in creation and annihilation operators. Both observations speak against the existence of $Q_A$ for generic $A$. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 19 pages LaTeX

arXiv:2407.09121 [pdf, other]

Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

Authors: Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu

Abstract: This study addresses a critical gap in safety tuning practices for Large Language Models (LLMs) by identifying and tackling a refusal position bias within safety tuning data, which compromises the models' ability to appropriately refuse generating unsafe content. We introduce a novel approach, Decoupled Refusal Training (DeRTa), designed to empower LLMs to refuse compliance to harmful prompts at a… ▽ More This study addresses a critical gap in safety tuning practices for Large Language Models (LLMs) by identifying and tackling a refusal position bias within safety tuning data, which compromises the models' ability to appropriately refuse generating unsafe content. We introduce a novel approach, Decoupled Refusal Training (DeRTa), designed to empower LLMs to refuse compliance to harmful prompts at any response position, significantly enhancing their safety capabilities. DeRTa incorporates two novel components: (1) Maximum Likelihood Estimation (MLE) with Harmful Response Prefix, which trains models to recognize and avoid unsafe content by appending a segment of harmful response to the beginning of a safe response, and (2) Reinforced Transition Optimization (RTO), which equips models with the ability to transition from potential harm to safety refusal consistently throughout the harmful response sequence. Our empirical evaluation, conducted using LLaMA3 and Mistral model families across six attack scenarios, demonstrates that our method not only improves model safety without compromising performance but also surpasses well-known models such as GPT-4 in defending against attacks. Importantly, our approach successfully defends recent advanced attack methods (e.g., CodeAttack) that have jailbroken GPT-4 and LLaMA3-70B-Instruct. Our code and data can be found at https://github.com/RobustNLP/DeRTa. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09119 [pdf, other]

Enhanced quantum state transfer via feedforward cancellation of optical phase noise

Authors: Benjamin P. Maddox, Jonathan M. Mortlock, Tom R. Hepworth, Adarsh P. Raghuram, Philip D. Gregory, Alexander Guttridge, Simon L. Cornish

Abstract: Many experimental platforms for quantum science depend on state control via laser fields. Frequently, however, the control fidelity is limited by optical phase noise. This is exacerbated in stabilized laser systems where high-frequency phase noise is an unavoidable consequence of feedback. Here we implement an optical feedforward technique to suppress laser phase noise in the STIRAP state transfer… ▽ More Many experimental platforms for quantum science depend on state control via laser fields. Frequently, however, the control fidelity is limited by optical phase noise. This is exacerbated in stabilized laser systems where high-frequency phase noise is an unavoidable consequence of feedback. Here we implement an optical feedforward technique to suppress laser phase noise in the STIRAP state transfer of ultracold RbCs molecules, across 114 THz, from a weakly bound Feshbach state to the rovibrational ground state. By performing over 100 state transfers on single molecules, we measure a significantly enhanced transfer efficiency of 98.7(1)% limited only by available laser intensity. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09116 [pdf, ps, other]

On a High-Frequency Analysis of Some Relevant Integral Equations in Electromagnetics

Authors: V. Giunzioni, A. Merlini, F. P. Andriulli

Abstract: In this contribution we analyze the spectral properties of some commonly used boundary integral operators in computational electromagnetics and of their discrete counterparts, highlighting peculiar features of their spectra. In particular, a comparison with the eigenvalues of the continuous operators will be presented that highlights deviations in the high frequency regime and impacts, in a peculi… ▽ More In this contribution we analyze the spectral properties of some commonly used boundary integral operators in computational electromagnetics and of their discrete counterparts, highlighting peculiar features of their spectra. In particular, a comparison with the eigenvalues of the continuous operators will be presented that highlights deviations in the high frequency regime and impacts, in a peculiar way, the accuracy of the numerical solutions of each formulation. A study and a proactive analysis of numerical results from standard boundary element solvers and the predictions from the theoretical analysis will corroborate the analytical framework employed and the validity of our observations. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09113 [pdf, other]

Relative class numbers and Euler-Kronecker constants of maximal real cyclotomic subfields

Authors: Neelam Kandhil, Alessandro Languasco, Pieter Moree, Sumaia Saad Eddin, Alisa Sedunova

Abstract: The Euler--Kronecker constant of a number field $K$ is the ratio of the constant and the residue of the Laurent series of the Dedekind zeta function $ζ_K(s)$ at $s=1$. We study the distribution of the Euler--Kronecker constant $γ_q^+$ of the maximal real subfield of $\mathbb Q(ζ_q)$ as $q$ ranges over the primes. Further, we consider the distribution of $γ_q^+-γ_q$, with $γ_q$ the Euler--Kronecker… ▽ More The Euler--Kronecker constant of a number field $K$ is the ratio of the constant and the residue of the Laurent series of the Dedekind zeta function $ζ_K(s)$ at $s=1$. We study the distribution of the Euler--Kronecker constant $γ_q^+$ of the maximal real subfield of $\mathbb Q(ζ_q)$ as $q$ ranges over the primes. Further, we consider the distribution of $γ_q^+-γ_q$, with $γ_q$ the Euler--Kronecker constant of $\mathbb Q(ζ_q)$ and show how it is connected with Kummer's conjecture, which predicts the asymptotic growth of the relative class number of $\mathbb Q(ζ_q)$. We improve, for example, the known results on the bounds on average for the Kummer ratio and we prove analogous sharp bounds for $γ_q^+-γ_q$. The methods employed are partly inspired by those used by Granville (1990) and Croot and Granville (2002) to investigate Kummer's conjecture. We supplement our theoretical findings with numerical illustrations to reinforce our conclusions. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 38 pages, 2 tables, 6 figures. arXiv admin note: text overlap with arXiv:2402.13829

Report number: MPIM-Bonn-2024 MSC Class: 11N37; 11R18; 11R29; 11R47; 11Y60

arXiv:2407.09112 [pdf, ps, other]

Solving recurrence relations for multiloop integrals in the limit of large values of the dimensional regularization parameter

Authors: P. A. Baikov

Abstract: A method for calculating the $1/d$ expansion coefficients for solutions of the integration by parts relations for Feynman integrals is presented. The idea is to use linear substitutions to transform these relations to an explicitly recursive form. The possibility of such a transformation is demonstrated for several families of massless (with one massive line) vacuum integrals up to the 7-loop leve… ▽ More A method for calculating the $1/d$ expansion coefficients for solutions of the integration by parts relations for Feynman integrals is presented. The idea is to use linear substitutions to transform these relations to an explicitly recursive form. The possibility of such a transformation is demonstrated for several families of massless (with one massive line) vacuum integrals up to the 7-loop level. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 12 pages, 1 figure

arXiv:2407.09103 [pdf, other]

DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents

Authors: Thomas Constum, Pierrick Tranouez, Thierry Paquet

Abstract: Information extraction from handwritten documents involves traditionally three distinct steps: Document Layout Analysis, Handwritten Text Recognition, and Named Entity Recognition. Recent approaches have attempted to integrate these steps into a single process using fully end-to-end architectures. Despite this, these integrated approaches have not yet matched the performance of language models, wh… ▽ More Information extraction from handwritten documents involves traditionally three distinct steps: Document Layout Analysis, Handwritten Text Recognition, and Named Entity Recognition. Recent approaches have attempted to integrate these steps into a single process using fully end-to-end architectures. Despite this, these integrated approaches have not yet matched the performance of language models, when applied to information extraction in plain text. In this paper, we introduce DANIEL (Document Attention Network for Information Extraction and Labelling), a fully end-to-end architecture integrating a language model and designed for comprehensive handwritten document understanding. DANIEL performs layout recognition, handwriting recognition, and named entity recognition on full-page documents. Moreover, it can simultaneously learn across multiple languages, layouts, and tasks. For named entity recognition, the ontology to be applied can be specified via the input prompt. The architecture employs a convolutional encoder capable of processing images of any size without resizing, paired with an autoregressive decoder based on a transformer-based language model. DANIEL achieves competitive results on four datasets, including a new state-of-the-art performance on RIMES 2009 and M-POPP for Handwriting Text Recognition, and IAM NER for Named Entity Recognition. Furthermore, DANIEL is much faster than existing approaches. We provide the source code and the weights of the trained models at \url{https://github.com/Shulk97/daniel}. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09102 [pdf, ps, other]

Quantitative diffusion approximation for the Neutral $r$-Alleles Wright-Fisher Model with Mutations

Authors: Peng Chen, Jie Xiong, Lihu Xu, Jiayu Zheng

Abstract: We apply a Lindeberg principle under the Markov process setting to approximate the Wright-Fisher model with neutral $r$-alleles using a diffusion process, deriving an error rate based on a function class distance involving fourth-order bounded differentiable functions. This error rate consists of a linear combination of the maximum mutation rate and the reciprocal of the population size. Our resul… ▽ More We apply a Lindeberg principle under the Markov process setting to approximate the Wright-Fisher model with neutral $r$-alleles using a diffusion process, deriving an error rate based on a function class distance involving fourth-order bounded differentiable functions. This error rate consists of a linear combination of the maximum mutation rate and the reciprocal of the population size. Our result improves the error bound in the seminal work [PNAS,1977], where only the special case $r=2$ was studied. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09100 [pdf, other]

Retrospective for the Dynamic Sensorium Competition for predicting large-scale mouse primary visual cortex activity from videos

Authors: Polina Turishcheva, Paul G. Fahey, Michaela Vystrčilová, Laura Hansel, Rachel Froebe, Kayla Ponder, Yongrong Qiu, Konstantin F. Willeke, Mohammad Bashiri, Ruslan Baikulov, Yu Zhu, Lei Ma, Shan Yu, Tiejun Huang, Bryan M. Li, Wolf De Wulf, Nina Kudryashova, Matthias H. Hennig, Nathalie L. Rochefort, Arno Onken, Eric Wang, Zhiwei Ding, Andreas S. Tolias, Fabian H. Sinz, Alexander S Ecker

Abstract: Understanding how biological visual systems process information is challenging because of the nonlinear relationship between visual input and neuronal responses. Artificial neural networks allow computational neuroscientists to create predictive models that connect biological and machine vision. Machine learning has benefited tremendously from benchmarks that compare different model on the same ta… ▽ More Understanding how biological visual systems process information is challenging because of the nonlinear relationship between visual input and neuronal responses. Artificial neural networks allow computational neuroscientists to create predictive models that connect biological and machine vision. Machine learning has benefited tremendously from benchmarks that compare different model on the same task under standardized conditions. However, there was no standardized benchmark to identify state-of-the-art dynamic models of the mouse visual system. To address this gap, we established the Sensorium 2023 Benchmark Competition with dynamic input, featuring a new large-scale dataset from the primary visual cortex of ten mice. This dataset includes responses from 78,853 neurons to 2 hours of dynamic stimuli per neuron, together with the behavioral measurements such as running speed, pupil dilation, and eye movements. The competition ranked models in two tracks based on predictive performance for neuronal responses on a held-out test set: one focusing on predicting in-domain natural stimuli and another on out-of-distribution (OOD) stimuli to assess model generalization. As part of the NeurIPS 2023 competition track, we received more than 160 model submissions from 22 teams. Several new architectures for predictive models were proposed, and the winning teams improved the previous state-of-the-art model by 50%. Access to the dataset as well as the benchmarking infrastructure will remain online at www.sensorium-competition.net. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09099 [pdf, other]

Music Proofreading with RefinPaint: Where and How to Modify Compositions given Context

Authors: Pedro Ramoneda, Martin Rocamora, Taketo Akama

Abstract: Autoregressive generative transformers are key in music generation, producing coherent compositions but facing challenges in human-machine collaboration. We propose RefinPaint, an iterative technique that improves the sampling process. It does this by identifying the weaker music elements using a feedback model, which then informs the choices for resampling by an inpainting model. This dual-focus… ▽ More Autoregressive generative transformers are key in music generation, producing coherent compositions but facing challenges in human-machine collaboration. We propose RefinPaint, an iterative technique that improves the sampling process. It does this by identifying the weaker music elements using a feedback model, which then informs the choices for resampling by an inpainting model. This dual-focus methodology not only facilitates the machine's ability to improve its automatic inpainting generation through repeated cycles but also offers a valuable tool for humans seeking to refine their compositions with automatic proofreading. Experimental results suggest RefinPaint's effectiveness in inpainting and proofreading tasks, demonstrating its value for refining music created by both machines and humans. This approach not only facilitates creativity but also aids amateur composers in improving their work. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09093 [pdf, ps, other]

On Exact Bit-level Reversible Transformers Without Changing Architectures

Authors: Guoqiang Zhang, J. P. Lewis, W. B. Kleijn

Abstract: In the literature, various reversible deep neural networks (DNN) models have been proposed to reduce memory consumption or improve data-throughput in the training process. However, almost all existing reversible DNNs either are constrained to have special structures or are constructed by modifying the original DNN architectures considerably to enable reversibility. In this work, we propose exact b… ▽ More In the literature, various reversible deep neural networks (DNN) models have been proposed to reduce memory consumption or improve data-throughput in the training process. However, almost all existing reversible DNNs either are constrained to have special structures or are constructed by modifying the original DNN architectures considerably to enable reversibility. In this work, we propose exact bit-level reversible transformers without changing the architectures in the inference procedure. The basic idea is to first treat each transformer block as the Euler integration approximation for solving an ordinary differential equation (ODE) and then incorporate the technique of bidirectional integration approximation (BDIA) (see [26]) for BDIA-based diffusion inversion) into the neural architecture together with activation quantization to make it exactly bit-level reversible, referred to as BDIA-transformer. In the training process, we let a hyper-parameter $γ$ in BDIA-transformer randomly take one of the two values $\{0.5, -0.5\}$ per transformer block for averaging two consecutive integration approximations, which regularizes the models for improving the validation accuracy. Light-weight side information per transformer block is required to be stored in the forward process to account for binary quantization loss to enable exact bit-level reversibility. In the inference procedure, the expectation $\mathbb{E}(γ)=0$ is taken to make the resulting architectures of BDIA-transformer be identical to transformers up to activation quantization. Empirical study indicates that BDIA-transformers outperform their original counterparts notably due to the regularization effect of the $γ$ parameter. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09082 [pdf, other]

The MeerKAT Fornax Survey. III. Ram-pressure strip** of the tidally interacting galaxy NGC 1427A in the Fornax cluster

Authors: P. Serra, T. A. Oosterloo, P. Kamphuis, G. I. G. Jozsa, W. J. G. de Blok, G. L. Bryan, J. H. van Gorkom, E. Iodice, D. Kleiner, A. Loni, S. I. Loubser, F. M. Maccagni, D. Molnar, R. Peletier, D. J. Pisano, M. Ramatsoku, M. W. L. Smith, M. A. W. Verheijen, N. Zabel

Abstract: We present MeerKAT Fornax Survey HI observations of NGC 1427A, a blue irregular galaxy with a stellar mass of 2e+9 Msun located near the centre of the Fornax galaxy cluster. Thanks to the excellent resolution (1 to 6 kpc spatially, 1.4 km/s in velocity) and HI column density sensitivity (4e+19/cm^2 to 1e+18/cm^2 depending on resolution), our data deliver new insights on the long-debated interactio… ▽ More We present MeerKAT Fornax Survey HI observations of NGC 1427A, a blue irregular galaxy with a stellar mass of 2e+9 Msun located near the centre of the Fornax galaxy cluster. Thanks to the excellent resolution (1 to 6 kpc spatially, 1.4 km/s in velocity) and HI column density sensitivity (4e+19/cm^2 to 1e+18/cm^2 depending on resolution), our data deliver new insights on the long-debated interaction of this galaxy with the cluster environment. We confirm the presence of a broad, one-sided, starless HI tail stretching from the outer regions of the stellar body and pointing away from the cluster centre. We find the tail to have 50% more HI (4e+8 Msun) and to be 3 times longer (70 kpc) than in previous observations. In fact, we detect scattered HI clouds out to 300 kpc from the galaxy in the direction of the tail -- possibly the most ancient remnant of the passage of NGC 1427A through the intracluster medium of Fornax. Both the velocity gradient along the HI tail and the peculiar kinematics of HI in the outer region of the stellar body are consistent with the effect of ram pressure given the line-of-sight motion of the galaxy within the cluster. However, several properties cannot be explained solely by ram pressure and suggest an ongoing tidal interaction. This includes: the close match between dense HI and stars within the disturbed stellar body; the abundant kinematically-anomalous HI; and the inversion of the HI velocity gradient near the base of the HI tail. We rule out an interaction with the cluster tidal field, and conclude that NGC 1427A is the result of a high-speed galaxy encounter or of a merger started at least 300 Myr ago, where ram pressure shapes the distribution and kinematics of the HI in the perturbed outer stellar body and in the tidal tails. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: Astronomy & Astrophysics, accepted. Data available at the MeerKAT Fornax Survey website, https://sites.google.com/inaf.it/meerkatfornaxsurvey

arXiv:2407.09064 [pdf, other]

Multi-Modal Dataset Creation for Federated~Learning with DICOM Structured Reports

Authors: Malte Tölle, Lukas Burger, Halvar Kelm, Florian André, Peter Bannas, Gerhard Diller, Norbert Frey, Philipp Garthe, Stefan Groß, Anja Hennemuth, Lars Kaderali, Nina Krüger, Andreas Leha, Simon Martin, Alexander Meyer, Eike Nagel, Stefan Orwat, Clemens Scherer, Moritz Seiffert, Jan Moritz Seliger, Stefan Simm, Tim Friede, Tim Seidler, Sandy Engelhardt

Abstract: Purpose: Federated training is often hindered by heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in the emerging multi-modal learning paradigms, where dataset harmonization including a uniform data representation and filtering options are of paramount importance.… ▽ More Purpose: Federated training is often hindered by heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in the emerging multi-modal learning paradigms, where dataset harmonization including a uniform data representation and filtering options are of paramount importance. Methods: DICOM structured reports enable the standardized linkage of arbitrary information beyond the imaging domain and can be used within Python deep learning pipelines with highdicom. Building on this, we developed an open platform for data integration and interactive filtering capabilities that simplifies the process of assembling multi-modal datasets. Results: In this study, we extend our prior work by showing its applicability to more and divergent data types, as well as streamlining datasets for federated training within an established consortium of eight university hospitals in Germany. We prove its concurrent filtering ability by creating harmonized multi-modal datasets across all locations for predicting the outcome after minimally invasive heart valve replacement. The data includes DICOM data (i.e. computed tomography images, electrocardiography scans) as well as annotations (i.e. calcification segmentations, pointsets and pacemaker dependency), and metadata (i.e. prosthesis and diagnoses). Conclusion: Structured reports bridge the traditional gap between imaging systems and information systems. Utilizing the inherent DICOM reference system arbitrary data types can be queried concurrently to create meaningful cohorts for clinical studies. The graphical interface as well as example structured report templates will be made publicly available. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09060 [pdf, other]

Reference CC3 Excitation Energies for Organic Chromophores: Benchmarking TD-DFT, BSE/$GW$ and Wave Function Methods

Authors: Iryna Knysh, Filippo Lipparini, Aymeric Blondel, Ivan Duchemin, Xavier Blase, Pierre-François Loos, Denis Jacquemin

Abstract: To expand the QUEST database of highly-accurate vertical transition energies, we consider a series of large organic chromogens ubiquitous in dye chemistry, such as anthraquinone, azobenzene, BODIPY, and naphthalimide. We compute, at the CC3 level of theory, the singlet and triplet vertical transition energies associated with the low-lying excited states. This leads to a collection of more than 120… ▽ More To expand the QUEST database of highly-accurate vertical transition energies, we consider a series of large organic chromogens ubiquitous in dye chemistry, such as anthraquinone, azobenzene, BODIPY, and naphthalimide. We compute, at the CC3 level of theory, the singlet and triplet vertical transition energies associated with the low-lying excited states. This leads to a collection of more than 120 new highly-accurate excitation energies. Subsequently, we employ these reference values to benchmark a series of lower-order wave function approaches, including the popular ADC(2) and CC2 schemes, as well as time-dependent density-functional theory (TD-DFT), both with and without applying the Tamm-Dancoff approximation (TDA). At the TD-DFT level, we evaluate a large panel of global, range-separated, local, and double hybrid functionals. Additionally, we assess the performance of the Bethe-Salpeter equation (BSE) formalism relying on both $G_0W_0$ and ev$GW$ quasiparticle energies evaluated from various starting points. It turns out that CC2 and ADC(2.5) are the most accurate models amongst those with respective $\mathcal{O}(N^5)$ and $\mathcal{O}(N^6)$ scalings with system size. In contrast, CCSD does not outperform CC2. The best performing exchange-correlation functionals include BMK, M06-2X, M06-SX, CAM-B3LYP, $ω$B97X-D, and LH20t, with average deviations of approximately 0.20 eV or slightly below. Errors on vertical excitation energies can be further reduced by considering double hybrids. Both SOS-$ω$B88PP86 and SOS-$ω$PBEPP86 exhibit particularly attractive performances with overall quality on par with CC2, whereas PBE0-DH and PBE-QIDH are only slightly less efficient. BSE/ev$GW$ calculations based on Kohn-Sham starting points have been found to be particularly effective for singlet transitions, but much less for their triplet counterparts. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 26 pages, 10 figures (Supporting Information available)

arXiv:2407.09057 [pdf, other]

PersonificationNet: Making customized subject act like a person

Authors: Tianchu Guo, Pengyu Li, Biao Wang, Xiansheng Hua

Abstract: Recently customized generation has significant potential, which uses as few as 3-5 user-provided images to train a model to synthesize new images of a specified subject. Though subsequent applications enhance the flexibility and diversity of customized generation, fine-grained control over the given subject acting like the person's pose is still lack of study. In this paper, we propose a Personifi… ▽ More Recently customized generation has significant potential, which uses as few as 3-5 user-provided images to train a model to synthesize new images of a specified subject. Though subsequent applications enhance the flexibility and diversity of customized generation, fine-grained control over the given subject acting like the person's pose is still lack of study. In this paper, we propose a PersonificationNet, which can control the specified subject such as a cartoon character or plush toy to act the same pose as a given referenced person's image. It contains a customized branch, a pose condition branch and a structure alignment module. Specifically, first, the customized branch mimics specified subject appearance. Second, the pose condition branch transfers the body structure information from the human to variant instances. Last, the structure alignment module bridges the structure gap between human and specified subject in the inference stage. Experimental results show our proposed PersonificationNet outperforms the state-of-the-art methods. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09054 [pdf]

doi 10.1007/s12540-022-01226-4

Numerical Analysis on the Spatiotemporal Characteristics of the Portevin-Le Chatelier Effect in Ti-12Mo Alloy

Authors: Shiyuan Luo, Yongxin Jiang, Sandrine Thuillier, Philippe Castany, Liangcai Zeng

Abstract: A simplified 3D FE model based on McCormick's model is developed to numerically predict the spatiotemporal behaviors of the PLC effect in Ti-12Mo alloy tensile tests at 350 degrees C with strain rates from the order of $10^{-4}$ s$^{-1}$ to $10^{-2}$ s$^{-1}$. The material parameter identification procedure is firstly presented in details, and the simulated results are highly consistent with exper… ▽ More A simplified 3D FE model based on McCormick's model is developed to numerically predict the spatiotemporal behaviors of the PLC effect in Ti-12Mo alloy tensile tests at 350 degrees C with strain rates from the order of $10^{-4}$ s$^{-1}$ to $10^{-2}$ s$^{-1}$. The material parameter identification procedure is firstly presented in details, and the simulated results are highly consistent with experimental ones, especially in terms of stress drop magnitudes and PLC band widths. The distribution of simulated stress drop magnitudes at a constant tensile velocity (0.01 mm/s) follows a normal distribution and its peak value is in the range of 26-28 MPa. Furthermore, the simulated band width slightly fluctuates with the increase of true strain and its average value is about 1.5 mm. Besides, the staircase behavior of strain-time curves and the hop** propagation of the PLC band are observed in Ti-12Mo alloy tensile process, which are related to the strain localization and stress drop magnitudes. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Journal ref: Metals and Materials International, 2023, 29 (2), pp.269-279

arXiv:2407.09052 [pdf, other]

From MIDI to Rich Tablatures: an Automatic Generative System incorporating Lead Guitarists' Fingering and Stylistic choices

Authors: Pierluigi Bontempi, Daniele Manerba, Alexandre D'Hooge, Sergio Canazza

Abstract: Although the automatic identification of the optimal fingering for the performance of melodies on fretted string instruments has already been addressed (at least partially) in the literature, the specific case regarding lead electric guitar requires a dedicated approach. We propose a system that can generate, from simple MIDI melodies, tablatures enriched by fingerings, articulations, and expressi… ▽ More Although the automatic identification of the optimal fingering for the performance of melodies on fretted string instruments has already been addressed (at least partially) in the literature, the specific case regarding lead electric guitar requires a dedicated approach. We propose a system that can generate, from simple MIDI melodies, tablatures enriched by fingerings, articulations, and expressive techniques. The basic fingering is derived by solving a constrained and multi-attribute optimization problem, which derives the best position of the fretting hand, not just the finger used at each moment.Then, by analyzing statistical data from the mySongBook corpus, the most common clich{é}s and biomechanical feasibility, articulations, and expressive techniques are introduced. Finally, the obtained output is converted into MusicXML format, which allows for easy visualization and use. The quality of the tablatures derived and the high configurability of the proposed approach can have several impacts, in particular in the fields of instrumental teaching, assisted composition and arranging, and computational expressive music performance models. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Journal ref: Sound and Music Computing Conference, Jul 2024, Porto, Portugal

arXiv:2407.09051 [pdf, other]

DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects

Authors: Peng Wang, Yongcai Wang, Deying Li

Abstract: Multi-object tracking (MOT) on static platforms, such as by surveillance cameras, has achieved significant progress, with various paradigms providing attractive performances. However, the effectiveness of traditional MOT methods is significantly reduced when it comes to dynamic platforms like drones. This decrease is attributed to the distinctive challenges in the MOT-on-drone scenario: (1) object… ▽ More Multi-object tracking (MOT) on static platforms, such as by surveillance cameras, has achieved significant progress, with various paradigms providing attractive performances. However, the effectiveness of traditional MOT methods is significantly reduced when it comes to dynamic platforms like drones. This decrease is attributed to the distinctive challenges in the MOT-on-drone scenario: (1) objects are generally small in the image plane, blurred, and frequently occluded, making them challenging to detect and recognize; (2) drones move and see objects from different angles, causing the unreliability of the predicted positions and feature embeddings of the objects. This paper proposes DroneMOT, which firstly proposes a Dual-domain Integrated Attention (DIA) module that considers the fast movements of drones to enhance the drone-based object detection and feature embedding for small-sized, blurred, and occluded objects. Then, an innovative Motion-Driven Association (MDA) scheme is introduced, considering the concurrent movements of both the drone and the objects. Within MDA, an Adaptive Feature Synchronization (AFS) technique is presented to update the object features seen from different angles. Additionally, a Dual Motion-based Prediction (DMP) method is employed to forecast the object positions. Finally, both the refined feature embeddings and the predicted positions are integrated to enhance the object association. Comprehensive evaluations on VisDrone2019-MOT and UAVDT datasets show that DroneMOT provides substantial performance improvements over the state-of-the-art in the domain of MOT on drones. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 8 pages, 6 figures, ICRA 2024

arXiv:2407.09041 [pdf, other]

Optimization of Long-Haul C+L+S Systems by means of a Closed Form EGN Model

Authors: Y. Jiang, J. Sarkis, A. Nespola, F. Forghieri, S. Piciaccia, A. Tanzi, M. Ranjbar Zefreh, P. Poggiolini

Abstract: We investigate C+L+S long-haul systems using a closed-form GN/EGN non-linearity model. We perform accurate launch power and Raman pump optimization. We show a potential 4x throughput increase over legacy C-band systems in 1000 km links, using moderate S-only Raman amplification. We simultaneously achieve extra-flat GSNR, within +/-0.5 dB across the whole C+L+S spectrum. We investigate C+L+S long-haul systems using a closed-form GN/EGN non-linearity model. We perform accurate launch power and Raman pump optimization. We show a potential 4x throughput increase over legacy C-band systems in 1000 km links, using moderate S-only Raman amplification. We simultaneously achieve extra-flat GSNR, within +/-0.5 dB across the whole C+L+S spectrum. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: The paper is identical to a manuscript submitted to PTL in June 2024, except this arXiv version has been updated in the references. Ref. [8] and [10] are about CFM6 and its experimental validation

arXiv:2407.09039 [pdf, other]

Overcoming Catastrophic Forgetting in Tabular Data Classification: A Pseudorehearsal-based approach

Authors: Pablo García-Santaclara, Bruno Fernández-Castro, Rebeca P. Díaz-Redondo

Abstract: Continual learning (CL) poses the important challenge of adapting to evolving data distributions without forgetting previously acquired knowledge while consolidating new knowledge. In this paper, we introduce a new methodology, coined as Tabular-data Rehearsal-based Incremental Lifelong Learning framework (TRIL3), designed to address the phenomenon of catastrophic forgetting in tabular data classi… ▽ More Continual learning (CL) poses the important challenge of adapting to evolving data distributions without forgetting previously acquired knowledge while consolidating new knowledge. In this paper, we introduce a new methodology, coined as Tabular-data Rehearsal-based Incremental Lifelong Learning framework (TRIL3), designed to address the phenomenon of catastrophic forgetting in tabular data classification problems. TRIL3 uses the prototype-based incremental generative model XuILVQ to generate synthetic data to preserve old knowledge and the DNDF algorithm, which was modified to run in an incremental way, to learn classification tasks for tabular data, without storing old samples. After different tests to obtain the adequate percentage of synthetic data and to compare TRIL3 with other CL available proposals, we can conclude that the performance of TRIL3 outstands other options in the literature using only 50% of synthetic data. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 11 pages, 4 tables, 3 figures

arXiv:2407.09032 [pdf, other]

DRM Revisited: A Complete Error Analysis

Authors: Yuling Jiao, Ruoxuan Li, Peiying Wu, Jerry Zhijian Yang, **wen Zhang

Abstract: In this work, we address a foundational question in the theoretical analysis of the Deep Ritz Method (DRM) under the over-parameteriztion regime: Given a target precision level, how can one determine the appropriate number of training samples, the key architectural parameters of the neural networks, the step size for the projected gradient descent optimization procedure, and the requisite number o… ▽ More In this work, we address a foundational question in the theoretical analysis of the Deep Ritz Method (DRM) under the over-parameteriztion regime: Given a target precision level, how can one determine the appropriate number of training samples, the key architectural parameters of the neural networks, the step size for the projected gradient descent optimization procedure, and the requisite number of iterations, such that the output of the gradient descent process closely approximates the true solution of the underlying partial differential equation to the specified precision? △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09023 [pdf, other]

Challenges of Anomaly Detection in the Object-Centric Setting: Dimensions and the Role of Domain Knowledge

Authors: Alessandro Berti, Urszula Jessen, Wil M. P. van der Aalst, Dirk Fahland

Abstract: Object-centric event logs, allowing events related to different objects of different object types, represent naturally the execution of business processes, such as ERP (O2C and P2P) and CRM. However, modeling such complex information requires novel process mining techniques and might result in complex sets of constraints. Object-centric anomaly detection exploits both the lifecycle and the interac… ▽ More Object-centric event logs, allowing events related to different objects of different object types, represent naturally the execution of business processes, such as ERP (O2C and P2P) and CRM. However, modeling such complex information requires novel process mining techniques and might result in complex sets of constraints. Object-centric anomaly detection exploits both the lifecycle and the interactions between the different objects. Therefore, anomalous patterns are proposed to the user without requiring the definition of object-centric process models. This paper proposes different methodologies for object-centric anomaly detection and discusses the role of domain knowledge for these methodologies. We discuss the advantages and limitations of Large Language Models (LLMs) in the provision of such domain knowledge. Following our experience in a real-life P2P process, we also discuss the role of algorithms (dimensionality reduction+anomaly detection), suggest some pre-processing steps, and discuss the role of feature propagation. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.08996 [pdf, ps, other]

doi 10.1103/PhysRevLett.133.022501

Emergence of High-Order Deformation in Rotating Transfermium Nuclei: A Microscopic Understanding

Authors: F. F. Xu, Y. K. Wang, Y. P. Wang, P. Ring, P. W. Zhao

Abstract: The rotational properties of the transfermium nuclei are investigated in the full deformation space by implementing a shell-model-like approach in the cranking covariant density functional theory on a three-dimensional lattice, where the pairing correlations, deformations, and moments of inertia are treated in a microscopic and self-consistent way. The kinematic and dynamic moments of inertia of t… ▽ More The rotational properties of the transfermium nuclei are investigated in the full deformation space by implementing a shell-model-like approach in the cranking covariant density functional theory on a three-dimensional lattice, where the pairing correlations, deformations, and moments of inertia are treated in a microscopic and self-consistent way. The kinematic and dynamic moments of inertia of the rotational bands observed in the transfermium nuclei $^{252}$No, $^{254}$No, $^{254}$Rf, and $^{256}$Rf are well reproduced without any adjustable parameters using a well-determined universal density functional. It is found for the first time that the emergence of the octupole deformation should be responsible for the significantly different rotational behavior observed in $^{252}$No and $^{254}$No. The present results provide a microscopic solution to the long-standing puzzle on the rotational behavior in No isotopes, and highlight the risk of investigating only the hexacontetrapole ($β_{60}$) deformation effects in rotating transfermium nuclei without considering the octupole deformation. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.08994 [pdf, other]

Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation

Authors: Zihao Li, Pan Gao, Kang You, Chuan Yan, Manoranjan Paul

Abstract: Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a Global Atten… ▽ More Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a Global Attention-guided Dual-domain Feature Learning network (GAD) to address the above-mentioned issues. We first devise the Contextual Position-enhanced Transformer (CPT) module, which is armed with an improved global attention mechanism, to produce a global-aware input embedding that serves as the guidance to subsequent aggregations. Then, the Dual-domain K-nearest neighbor Feature Fusion (DKFF) is cascaded to conduct effective feature aggregation through novel dual-domain feature learning which appreciates both local geometric relations and long-distance semantic connections. Extensive experiments on multiple point cloud analysis tasks (e.g., classification, part segmentation, and scene semantic segmentation) demonstrate the superior performance of the proposed method and the efficacy of the devised modules. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.08992 [pdf, ps, other]

Emotion Talk: Emotional Support via Audio Messages for Psychological Assistance

Authors: Fabrycio Leite Nakano Almada, Kauan Divino Pouso Mariano, Maykon Adriell Dutra, Victor Emanuel da Silva Monteiro

Abstract: This paper presents "Emotion Talk," a system designed to provide continuous emotional support through audio messages for psychological assistance. The primary objective is to offer consistent support to patients outside traditional therapy sessions by analyzing audio messages to detect emotions and generate appropriate responses. The solution focuses on Portuguese-speaking users, ensuring that the… ▽ More This paper presents "Emotion Talk," a system designed to provide continuous emotional support through audio messages for psychological assistance. The primary objective is to offer consistent support to patients outside traditional therapy sessions by analyzing audio messages to detect emotions and generate appropriate responses. The solution focuses on Portuguese-speaking users, ensuring that the system is linguistically and culturally relevant. This system aims to complement and enhance the psychological follow-up process conducted by therapists, providing immediate and accessible assistance, especially in emergency situations where rapid response is crucial. Experimental results demonstrate the effectiveness of the proposed system, highlighting its potential in applications of psychological support. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.08990 [pdf, other]

Dynamic neural network with memristive CIM and CAM for 2D and 3D vision

Authors: Yue Zhang, Woyu Zhang, Shaocong Wang, Ning Lin, Yifei Yu, Yangu He, Bo Wang, Hao Jiang, Peng Lin, Xiaoxin Xu, Xiaojuan Qi, Zhongrui Wang, Xumeng Zhang, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu

Abstract: The brain is dynamic, associative and efficient. It reconfigures by associating the inputs with past experiences, with fused memory and processing. In contrast, AI models are static, unable to associate inputs with past experiences, and run on digital computers with physically separated memory and processing. We propose a hardware-software co-design, a semantic memory-based dynamic neural network… ▽ More The brain is dynamic, associative and efficient. It reconfigures by associating the inputs with past experiences, with fused memory and processing. In contrast, AI models are static, unable to associate inputs with past experiences, and run on digital computers with physically separated memory and processing. We propose a hardware-software co-design, a semantic memory-based dynamic neural network (DNN) using memristor. The network associates incoming data with the past experience stored as semantic vectors. The network and the semantic memory are physically implemented on noise-robust ternary memristor-based Computing-In-Memory (CIM) and Content-Addressable Memory (CAM) circuits, respectively. We validate our co-designs, using a 40nm memristor macro, on ResNet and PointNet++ for classifying images and 3D points from the MNIST and ModelNet datasets, which not only achieves accuracy on par with software but also a 48.1% and 15.9% reduction in computational budget. Moreover, it delivers a 77.6% and 93.3% reduction in energy consumption. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: In press

arXiv:2407.08984 [pdf, ps, other]

Measurement of branching fractions, CP asymmetry, and isospin asymmetry for $\boldsymbol{B\rightarrowργ}$ decays using Belle and Belle II data

Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer , et al. (385 additional authors not shown)

Abstract: We present measurements of $B^{+}\rightarrowρ^{+}γ$ and $B^{0}\rightarrowρ^{0}γ$ decays using a combined data sample of $772 \times 10^6$ $B\overline{B}$ pairs collected by the Belle experiment and $387\times 10^6$ $B\overline{B}$ pairs collected by the Belle II experiment in $e^{+}e^{-}$ collisions at the $Υ(4S)$ resonance. After an optimized selection, a simultaneous fit to the Belle and Belle I… ▽ More We present measurements of $B^{+}\rightarrowρ^{+}γ$ and $B^{0}\rightarrowρ^{0}γ$ decays using a combined data sample of $772 \times 10^6$ $B\overline{B}$ pairs collected by the Belle experiment and $387\times 10^6$ $B\overline{B}$ pairs collected by the Belle II experiment in $e^{+}e^{-}$ collisions at the $Υ(4S)$ resonance. After an optimized selection, a simultaneous fit to the Belle and Belle II data sets yields $114\pm 12$ $B^{+}\rightarrowρ^{+}γ$ and $99\pm 12$ $B^{0}\rightarrowρ^{0}γ$ decays. The measured branching fractions are $(13.1^{+2.0 +1.3}_{-1.9 -1.2})\times 10^{-7}$ and $(7.5\pm 1.3^{+1.0}_{-0.8})\times 10^{-7}$ for $B^{+}\rightarrowρ^{+}γ$ and $B^{0}\rightarrowρ^{0}γ$ decays, respectively, where the first uncertainty is statistical and the second is systematic. We also measure the isospin asymmetry $A_{\rm I}(B\rightarrowργ)=(10.9^{+11.2 +7.8}_{-11.7 -7.3})\%$ and the direct CP asymmetry $A_{CP}(B^{+}\rightarrowρ^{+}γ)=(-8.2\pm 15.2^{+1.6}_{-1.2})\%$. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 12 pages, 4 figures

Report number: Belle II Preprint 2023-019; KEK Preprint 2023-37

arXiv:2407.08969 [pdf, ps, other]

Detect Llama -- Finding Vulnerabilities in Smart Contracts using Large Language Models

Authors: Peter Ince, Xiapu Luo, Jiangshan Yu, Joseph K. Liu, Xiaoning Du

Abstract: In this paper, we test the hypothesis that although OpenAI's GPT-4 performs well generally, we can fine-tune open-source models to outperform GPT-4 in smart contract vulnerability detection. We fine-tune two models from Meta's Code Llama and a dataset of 17k prompts, Detect Llama - Foundation and Detect Llama - Instruct, and we also fine-tune OpenAI's GPT-3.5 Turbo model (GPT-3.5FT). We then evalu… ▽ More In this paper, we test the hypothesis that although OpenAI's GPT-4 performs well generally, we can fine-tune open-source models to outperform GPT-4 in smart contract vulnerability detection. We fine-tune two models from Meta's Code Llama and a dataset of 17k prompts, Detect Llama - Foundation and Detect Llama - Instruct, and we also fine-tune OpenAI's GPT-3.5 Turbo model (GPT-3.5FT). We then evaluate these models, plus a random baseline, on a testset we develop against GPT-4, and GPT-4 Turbo's, detection of eight vulnerabilities from the dataset and the two top identified vulnerabilities - and their weighted F1 scores. We find that for binary classification (i.e., is this smart contract vulnerable?), our two best-performing models, GPT-3.5FT and Detect Llama - Foundation, achieve F1 scores of $0.776$ and $0.68$, outperforming both GPT-4 and GPT-4 Turbo, $0.66$ and $0.675$. For the evaluation against individual vulnerability identification, our top two models, GPT-3.5FT and Detect Llama - Foundation, both significantly outperformed GPT-4 and GPT-4 Turbo in both weighted F1 for all vulnerabilities ($0.61$ and $0.56$ respectively against GPT-4's $0.218$ and GPT-4 Turbo's $0.243$) and weighted F1 for the top two identified vulnerabilities ($0.719$ for GPT-3.5FT, $0.674$ for Detect Llama - Foundation against GPT-4's $0.363$ and GPT-4 Turbo's $0.429$). △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08962 [pdf, ps, other]

Information vs Thermodynamic Entropy

Authors: Phil Attard

Abstract: The Shannon information is shown to be different to the thermodynamic entropy, and indifferent to the Second Law of Thermodynamics. The Shannon information is shown to be different to the thermodynamic entropy, and indifferent to the Second Law of Thermodynamics. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 4 pages, 10 equations

arXiv:2407.08960 [pdf, other]

Hot Spot Offset Variability from Magnetohydrodynamical Thermoresistive Instability in Hot Jupiters

Authors: Raphaël Hardy, Paul Charbonneau, Andrew Cumming

Abstract: Hot Jupiter atmospheres are possibly subject to a thermoresistive instability. Such an instability may develop as the ohmic heating increases the electrical conductivity in a positive feedback loop, which ultimately leads to a runaway of the atmospheric temperature. We extend our previous axisymmetric one-dimensional radial model, by representing the temperature and magnetic diffusivity as a first… ▽ More Hot Jupiter atmospheres are possibly subject to a thermoresistive instability. Such an instability may develop as the ohmic heating increases the electrical conductivity in a positive feedback loop, which ultimately leads to a runaway of the atmospheric temperature. We extend our previous axisymmetric one-dimensional radial model, by representing the temperature and magnetic diffusivity as a first order Fourier expansion in longitude. This allows us to predict the hot spot offset during the unfolding of the thermoresistive instability and following Alfvénic oscillations. We show a representative simulation undergoing the thermoresistive instability, in which the peak flux offset varies between approximately $\pm 60^{\circ}$ on timescales of a few days with potentially observable brightness variations. Therefore, this thermoresistive instability could be an observable feature of hot Jupiters, given the right timing of observation and transit and the right planetary parameters. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 12 pages, 8 figures, 1 table

Showing 51–100 of 418,612 results for author: P.