Search | arXiv e-print repository

arXiv:2407.00285 [pdf, other]

Imaging of single barium atoms in a second matrix site in solid xenon for barium tagging in a $^{136}$Xe double beta decay experiment

Authors: M. Yvaine, D. Fairbank, J. Soderstrom, C. Taylor, J. Stanley, T. Walton, C. Chambers, A. Iverson, W. Fairbank, S. Al Kharusi, A. Amy, E. Angelico, A. Anker, I. J. Arnquist, A. Atencio, J. Bane, V. Belov, E. P. Bernard, T. Bhatta, A. Bolotnikov, J. Breslin, P. A. Breur, J. P. Brodsky, E. Brown, T. Brunner , et al. (112 additional authors not shown)

Abstract: Neutrinoless double beta decay is one of the most sensitive probes for new physics beyond the Standard Model of particle physics. One of the isotopes under investigation is $^{136}$Xe, which would double beta decay into $^{136}$Ba. Detecting the single $^{136}$Ba daughter provides a sort of ultimate tool in the discrimination against backgrounds. Previous work demonstrated the ability to perform s… ▽ More Neutrinoless double beta decay is one of the most sensitive probes for new physics beyond the Standard Model of particle physics. One of the isotopes under investigation is $^{136}$Xe, which would double beta decay into $^{136}$Ba. Detecting the single $^{136}$Ba daughter provides a sort of ultimate tool in the discrimination against backgrounds. Previous work demonstrated the ability to perform single atom imaging of Ba atoms in a single-vacancy site of a solid xenon matrix. In this paper, the effort to identify signal from individual barium atoms is extended to Ba atoms in a hexa-vacancy site in the matrix and is achieved despite increased photobleaching in this site. Abrupt fluorescence turn-off of a single Ba atom is also observed. Significant recovery of fluorescence signal lost through photobleaching is demonstrated upon annealing of Ba deposits in the Xe ice. Following annealing, it is observed that Ba atoms in the hexa-vacancy site exhibit antibleaching while Ba atoms in the tetra-vacancy site exhibit bleaching. This may be evidence for a matrix site transfer upon laser excitation. Our findings offer a path of continued research toward tagging of Ba daughters in all significant sites in solid xenon. △ Less

Submitted 28 June, 2024; originally announced July 2024.

Comments: 9 pages, 8 figures

arXiv:2406.18871 [pdf, other]

DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

Authors: Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu, He Huang, Boris Ginsburg, Yu-Chiang Frank Wang, Hung-yi Lee

Abstract: Recent speech language models (SLMs) typically incorporate pre-trained speech models to extend the capabilities from large language models (LLMs). In this paper, we propose a Descriptive Speech-Text Alignment approach that leverages speech captioning to bridge the gap between speech and text modalities, enabling SLMs to interpret and generate comprehensive natural language descriptions, thereby fa… ▽ More Recent speech language models (SLMs) typically incorporate pre-trained speech models to extend the capabilities from large language models (LLMs). In this paper, we propose a Descriptive Speech-Text Alignment approach that leverages speech captioning to bridge the gap between speech and text modalities, enabling SLMs to interpret and generate comprehensive natural language descriptions, thereby facilitating the capability to understand both linguistic and non-linguistic features in speech. Enhanced with the proposed approach, our model demonstrates superior performance on the Dynamic-SUPERB benchmark, particularly in generalizing to unseen tasks. Moreover, we discover that the aligned model exhibits a zero-shot instruction-following capability without explicit speech instruction tuning. These findings highlight the potential to reshape instruction-following SLMs by incorporating rich, descriptive speech captions. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: Accepted to Interspeech 2024

arXiv:2406.18847 [pdf, other]

doi 10.18653/v1/2023.emnlp-main.154

Learning Retrieval Augmentation for Personalized Dialogue Generation

Authors: Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang

Abstract: Personalized dialogue generation, focusing on generating highly tailored responses by leveraging persona profiles and dialogue context, has gained significant attention in conversational AI applications. However, persona profiles, a prevalent setting in current personalized dialogue datasets, typically composed of merely four to five sentences, may not offer comprehensive descriptions of the perso… ▽ More Personalized dialogue generation, focusing on generating highly tailored responses by leveraging persona profiles and dialogue context, has gained significant attention in conversational AI applications. However, persona profiles, a prevalent setting in current personalized dialogue datasets, typically composed of merely four to five sentences, may not offer comprehensive descriptions of the persona about the agent, posing a challenge to generate truly personalized dialogues. To handle this problem, we propose $\textbf{L}$earning Retrieval $\textbf{A}$ugmentation for $\textbf{P}$ersonalized $\textbf{D}$ial$\textbf{O}$gue $\textbf{G}$eneration ($\textbf{LAPDOG}$), which studies the potential of leveraging external knowledge for persona dialogue generation. Specifically, the proposed LAPDOG model consists of a story retriever and a dialogue generator. The story retriever uses a given persona profile as queries to retrieve relevant information from the story document, which serves as a supplementary context to augment the persona profile. The dialogue generator utilizes both the dialogue history and the augmented persona profile to generate personalized responses. For optimization, we adopt a joint training framework that collaboratively learns the story retriever and dialogue generator, where the story retriever is optimized towards desired ultimate metrics (e.g., BLEU) to retrieve content for the dialogue generator to generate personalized responses. Experiments conducted on the CONVAI2 dataset with ROCStory as a supplementary data source show that the proposed LAPDOG method substantially outperforms the baselines, indicating the effectiveness of the proposed method. The LAPDOG model code is publicly available for further exploration. https://github.com/hqsiswiliam/LAPDOG △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: Accepted to EMNLP-2023

arXiv:2406.12380 [pdf, other]

Search for fractionally charged particles with CUORE

Authors: CUORE Collaboration, D. Q. Adams, C. Alduino, K. Alfonso, F. T. Avignone III, O. Azzolini, G. Bari, F. Bellini, G. Benato, M. Beretta, M. Biassoni, A. Branca, C. Brofferio, C. Bucci, J. Camilleri, A. Caminata, A. Campani, J. Cao, S. Capelli, C. Capelli, L. Cappelli, L. Cardani, P. Carniti, N. Casali, E. Celi , et al. (95 additional authors not shown)

Abstract: The Cryogenic Underground Observatory for Rare Events (CUORE) is a detector array comprised by 988 5$\;$cm$\times$5$\;$cm$\times$5$\;$cm TeO$_2$ crystals held below 20 mK, primarily searching for neutrinoless double-beta decay in $^{130}$Te. Unprecedented in size amongst cryogenic calorimetric experiments, CUORE provides a promising setting for the study of exotic through-going particles. Using th… ▽ More The Cryogenic Underground Observatory for Rare Events (CUORE) is a detector array comprised by 988 5$\;$cm$\times$5$\;$cm$\times$5$\;$cm TeO$_2$ crystals held below 20 mK, primarily searching for neutrinoless double-beta decay in $^{130}$Te. Unprecedented in size amongst cryogenic calorimetric experiments, CUORE provides a promising setting for the study of exotic through-going particles. Using the first tonne-year of CUORE's exposure, we perform a search for hypothesized fractionally charged particles (FCPs), which are well-motivated by various Standard Model extensions and would have suppressed interactions with matter. No excess of FCP candidate tracks is observed over background, setting leading limits on the underground FCP flux with charges between $e/24-e/5$ at 90\% confidence level. Using the low background environment and segmented geometry of CUORE, we establish the sensitivity of tonne-scale sub-Kelvin detectors to diverse signatures of new physics. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: 7 pages, 5 figures

arXiv:2406.11941 [pdf, other]

Crossfusor: A Cross-Attention Transformer Enhanced Conditional Diffusion Model for Car-Following Trajectory Prediction

Authors: Junwei You, Haotian Shi, Keshu Wu, Keke Long, Sicheng Fu, Sikai Chen, Bin Ran

Abstract: Vehicle trajectory prediction is crucial for advancing autonomous driving and advanced driver assistance systems (ADAS), enhancing road safety and traffic efficiency. While traditional methods have laid foundational work, modern deep learning techniques, particularly transformer-based models and generative approaches, have significantly improved prediction accuracy by capturing complex and non-lin… ▽ More Vehicle trajectory prediction is crucial for advancing autonomous driving and advanced driver assistance systems (ADAS), enhancing road safety and traffic efficiency. While traditional methods have laid foundational work, modern deep learning techniques, particularly transformer-based models and generative approaches, have significantly improved prediction accuracy by capturing complex and non-linear patterns in vehicle motion and traffic interactions. However, these models often overlook the detailed car-following behaviors and inter-vehicle interactions essential for real-world driving scenarios. This study introduces a Cross-Attention Transformer Enhanced Conditional Diffusion Model (Crossfusor) specifically designed for car-following trajectory prediction. Crossfusor integrates detailed inter-vehicular interactions and car-following dynamics into a robust diffusion framework, improving both the accuracy and realism of predicted trajectories. The model leverages a novel temporal feature encoding framework combining GRU, location-based attention mechanisms, and Fourier embedding to capture historical vehicle dynamics. It employs noise scaled by these encoded historical features in the forward diffusion process, and uses a cross-attention transformer to model intricate inter-vehicle dependencies in the reverse denoising process. Experimental results on the NGSIM dataset demonstrate that Crossfusor outperforms state-of-the-art models, particularly in long-term predictions, showcasing its potential for enhancing the predictive capabilities of autonomous driving systems. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.11255 [pdf, other]

Liberal Entity Matching as a Compound AI Toolchain

Authors: Silvery D. Fu, David Wang, Wen Zhang, Kathleen Ge

Abstract: Entity matching (EM), the task of identifying whether two descriptions refer to the same entity, is essential in data management. Traditional methods have evolved from rule-based to AI-driven approaches, yet current techniques using large language models (LLMs) often fall short due to their reliance on static knowledge and rigid, predefined prompts. In this paper, we introduce Libem, a compound AI… ▽ More Entity matching (EM), the task of identifying whether two descriptions refer to the same entity, is essential in data management. Traditional methods have evolved from rule-based to AI-driven approaches, yet current techniques using large language models (LLMs) often fall short due to their reliance on static knowledge and rigid, predefined prompts. In this paper, we introduce Libem, a compound AI system designed to address these limitations by incorporating a flexible, tool-oriented approach. Libem supports entity matching through dynamic tool use, self-refinement, and optimization, allowing it to adapt and refine its process based on the dataset and performance metrics. Unlike traditional solo-AI EM systems, which often suffer from a lack of modularity that hinders iterative design improvements and system optimization, Libem offers a composable and reusable toolchain. This approach aims to contribute to ongoing discussions and developments in AI-driven data management. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 2 pages, compound ai systems 2024

arXiv:2406.11227 [pdf, ps, other]

Compound Schema Registry

Authors: Silvery D. Fu, Xuewei Chen

Abstract: Schema evolution is critical in managing database systems to ensure compatibility across different data versions. A schema registry typically addresses the challenges of schema evolution in real-time data streaming by managing, validating, and ensuring schema compatibility. However, current schema registries struggle with complex syntactic alterations like field renaming or type changes, which oft… ▽ More Schema evolution is critical in managing database systems to ensure compatibility across different data versions. A schema registry typically addresses the challenges of schema evolution in real-time data streaming by managing, validating, and ensuring schema compatibility. However, current schema registries struggle with complex syntactic alterations like field renaming or type changes, which often require significant manual intervention and can disrupt service. To enhance the flexibility of schema evolution, we propose the use of generalized schema evolution (GSE) facilitated by a compound AI system. This system employs Large Language Models (LLMs) to interpret the semantics of schema changes, supporting a broader range of syntactic modifications without interrupting data streams. Our approach includes develo** a task-specific language, Schema Transformation Language (STL), to generate schema map**s as an intermediate representation (IR), simplifying the integration of schema changes across different data processing platforms. Initial results indicate that this approach can improve schema map** accuracy and efficiency, demonstrating the potential of GSE in practical applications. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 2 pages, compound ai system workshop 2024

arXiv:2406.07209 [pdf, other]

MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Authors: X. Wang, Siming Fu, Qihan Huang, Wanggui He, Hao Jiang

Abstract: Recent advancements in text-to-image generation models have dramatically enhanced the generation of photorealistic images from textual prompts, leading to an increased interest in personalized text-to-image applications, particularly in multi-subject scenarios. However, these advances are hindered by two main challenges: firstly, the need to accurately maintain the details of each referenced subje… ▽ More Recent advancements in text-to-image generation models have dramatically enhanced the generation of photorealistic images from textual prompts, leading to an increased interest in personalized text-to-image applications, particularly in multi-subject scenarios. However, these advances are hindered by two main challenges: firstly, the need to accurately maintain the details of each referenced subject in accordance with the textual descriptions; and secondly, the difficulty in achieving a cohesive representation of multiple subjects in a single image without introducing inconsistencies. To address these concerns, our research introduces the MS-Diffusion framework for layout-guided zero-shot image personalization with multi-subjects. This innovative approach integrates grounding tokens with the feature resampler to maintain detail fidelity among subjects. With the layout guidance, MS-Diffusion further improves the cross-attention to adapt to the multi-subject inputs, ensuring that each subject condition acts on specific areas. The proposed multi-subject cross-attention orchestrates harmonious inter-subject compositions while preserving the control of texts. Comprehensive quantitative and qualitative experiments affirm that this method surpasses existing models in both image and text fidelity, promoting the development of personalized text-to-image generation. △ Less

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2406.00110 [pdf, other]

DECam Multi-Messenger Astrophysics Pipeline. I. from Raw Data to Single-Exposure Candidates

Authors: Shenming Fu, Thomas Matheson, Aaron Meisner, Yuanyuan Zhang, Sebastián Vicencio, Destry Saul

Abstract: We introduce a pipeline that performs rapid image subtraction and source selection to detect transients, with a focus on identifying gravitational wave optical counterparts using the Dark Energy Camera (DECam). In this work, we present the pipeline steps from processing raw data to identification of astrophysical transients on individual exposures. We process DECam data and build difference images… ▽ More We introduce a pipeline that performs rapid image subtraction and source selection to detect transients, with a focus on identifying gravitational wave optical counterparts using the Dark Energy Camera (DECam). In this work, we present the pipeline steps from processing raw data to identification of astrophysical transients on individual exposures. We process DECam data and build difference images using the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) Science Pipelines software, and we use flags and principal component analysis to select transients on a per-exposure basis, without associating the results from different exposures. Those candidates will be sent to brokers for further classification and alert distribution. We validate our pipeline using archival exposures that cover various types of objects, and the tested targets include a kilonova (GW170817), supernovae, stellar flares, variable stars (in a resolved galaxy or the Milky Way Bulge), and serendipitous objects. Overall, the data processing produces clean light curves that are comparable with published results, demonstrating the photometric quality of our pipeline. Real transients can be well selected by our pipeline when sufficiently bright (S/N $\gtrsim15$). This pipeline is intended to serve as a tool for the broader research community. Although this pipeline is designed for DECam, our method can be easily applied to other instruments and future LSST observations. △ Less

Submitted 31 May, 2024; originally announced June 2024.

Comments: 35 pages, 25 figures, 5 tables. Submitted to AJ. Comments are welcome and appreciated

arXiv:2405.19419 [pdf, other]

Supernova Electron-Neutrino Interactions with Xenon in the nEXO Detector

Authors: nEXO Collaboration, S. Hedges, S. Al Kharusi, E. Angelico, J. P. Brodsky, G. Richardson, S. Wilde, A. Amy, A. Anker, I. J. Arnquist, P. Arsenault, A. Atencio, I. Badhrees, J. Bane, V. Belov, E. P. Bernard, T. Bhatta, A. Bolotnikov, J. Breslin, P. A. Breur, E. Brown, T. Brunner, E. Caden, G. F. Cao, L. Q. Cao , et al. (121 additional authors not shown)

Abstract: Electron-neutrino charged-current interactions with xenon nuclei were modeled in the nEXO neutrinoless double-beta decay detector (~5-tonne, 90% ${}^{136}$Xe, 10% ${}^{134}$Xe) to evaluate its sensitivity to supernova neutrinos. Predictions for event rates and detectable signatures were modeled using the MARLEY event generator. We find good agreement between MARLEY's predictions and existing theor… ▽ More Electron-neutrino charged-current interactions with xenon nuclei were modeled in the nEXO neutrinoless double-beta decay detector (~5-tonne, 90% ${}^{136}$Xe, 10% ${}^{134}$Xe) to evaluate its sensitivity to supernova neutrinos. Predictions for event rates and detectable signatures were modeled using the MARLEY event generator. We find good agreement between MARLEY's predictions and existing theoretical calculations of the inclusive cross sections at supernova neutrino energies. The interactions modeled by MARLEY were simulated within the nEXO simulation framework and were run through an example reconstruction algorithm to determine the detector's efficiency for reconstructing these events. The simulated data, incorporating the detector response, were used to study the ability of nEXO to reconstruct the incident electron-neutrino spectrum and these results were extended to a larger xenon detector of the same isotope enrichment. We estimate that nEXO will be able to observe electron-neutrino interactions with xenon from supernovae as far as 5 to 8 kpc from earth, while the ability to reconstruct incident electron-neutrino spectrum parameters from observed interactions in nEXO is limited to closer supernovae. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: 17 pages, 16 figures

Report number: LLNL-JRNL-864783-DRAFT

arXiv:2405.17937 [pdf, other]

Data-driven background model for the CUORE experiment

Authors: CUORE Collaboration, D. Q. Adams, C. Alduino, K. Alfonso, F. T. Avignone III, O. Azzolini, G. Bari, F. Bellini, G. Benato, M. Beretta, M. Biassoni, A. Branca, C. Brofferio, C. Bucci, J. Camilleri, A. Caminata, A. Campani, J. Cao, S. Capelli, C. Capelli, L. Cappelli, L. Cardani, P. Carniti, N. Casali, E. Celi , et al. (93 additional authors not shown)

Abstract: We present the model we developed to reconstruct the CUORE radioactive background based on the analysis of an experimental exposure of 1038.4 kg yr. The data reconstruction relies on a simultaneous Bayesian fit applied to energy spectra over a broad energy range. The high granularity of the CUORE detector, together with the large exposure and extended stable operations, allow for an in-depth explo… ▽ More We present the model we developed to reconstruct the CUORE radioactive background based on the analysis of an experimental exposure of 1038.4 kg yr. The data reconstruction relies on a simultaneous Bayesian fit applied to energy spectra over a broad energy range. The high granularity of the CUORE detector, together with the large exposure and extended stable operations, allow for an in-depth exploration of both spatial and time dependence of backgrounds. We achieve high sensitivity to both bulk and surface activities of the materials of the setup, detecting levels as low as 10 nBq kg$^{-1}$ and 0.1 nBq cm$^{-2}$, respectively. We compare the contamination levels we extract from the background model with prior radio-assay data, which informs future background risk mitigation strategies. The results of this background model play a crucial role in constructing the background budget for the CUPID experiment as it will exploit the same CUORE infrastructure. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.12699 [pdf, other]

GeckoGraph: A Visual Language for Polymorphic Types

Authors: Shuai Fu, Tim Dwyer, Peter J. Stuckey

Abstract: Polymorphic types are an important feature in most strongly typed programming languages. They allow functions to be written in a way that can be used with different data types, while still enforcing the relationship and constraints between the values. However, programmers often find polymorphic types difficult to use and understand and tend to reason using concrete types. We propose GeckoGraph, a… ▽ More Polymorphic types are an important feature in most strongly typed programming languages. They allow functions to be written in a way that can be used with different data types, while still enforcing the relationship and constraints between the values. However, programmers often find polymorphic types difficult to use and understand and tend to reason using concrete types. We propose GeckoGraph, a graphical notation for types. GeckoGraph aims to accompany traditional text-based type notation and to make reading, understanding, and comparing types easier. We conducted a large-scale human study using GeckoGraph compared to text-based type notation. To our knowledge, this is the largest controlled user study on functional programming ever conducted. The results of the study show that GeckoGraph helps improve programmers' ability to succeed in the programming tasks we designed, especially for novice programmers. △ Less

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.12697 [pdf, other]

Goanna: Resolving Haskell Type Errors With Minimal Correction Subsets

Authors: Shuai Fu, Tim Dwyer, Peter J. Stuckey, John Grundy

Abstract: Statically typed languages offer significant advantages, such as bug prevention, enhanced code quality, and reduced maintenance costs. However, these benefits often come at the expense of a steep learning curve and a slower development pace. Haskell, known for its expressive and strict type system, poses challenges for inexperienced programmers in learning and using its type system, especially in… ▽ More Statically typed languages offer significant advantages, such as bug prevention, enhanced code quality, and reduced maintenance costs. However, these benefits often come at the expense of a steep learning curve and a slower development pace. Haskell, known for its expressive and strict type system, poses challenges for inexperienced programmers in learning and using its type system, especially in debugging type errors. We introduce Goanna, a novel tool that serves as a type checker and an interactive type error debugging tool for Haskell. When encountering type errors, Goanna identifies a comprehensive list of potential causes and resolutions based on the minimum correction subsets (MCS) enumeration. We evaluated Goanna's effectiveness using 86 diverse Haskell programs from online discourse, demonstrating its ability to accurately identify and resolve type errors. Additionally, we present a collection of techniques and heuristics to enhance Goanna's suggestion-based error diagnosis and show their effectiveness from our evaluation. △ Less

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.11810 [pdf]

Formation of Iron-Helium Compounds under High Pressure

Authors: Haruki Takezawa, Han Hsu, Kei Hirose, Fumiya Sakai, Suyu Fu, Hitoshi Gomi

Abstract: While helium is a representative noble gas element characterized by its chemical inertness at ambient conditions, recent experiments and calculations argued that a minor amount of helium is incorporated into molten iron from silicate under high pressures. Here we examined the reaction between iron and helium at 8-42 GPa and ~1000-2820 K and found remarkable volume expansion of the Fe lattice, whic… ▽ More While helium is a representative noble gas element characterized by its chemical inertness at ambient conditions, recent experiments and calculations argued that a minor amount of helium is incorporated into molten iron from silicate under high pressures. Here we examined the reaction between iron and helium at 8-42 GPa and ~1000-2820 K and found remarkable volume expansion of the Fe lattice, which is attributed to the formations of fcc and distorted hcp iron-helium compounds with x in FeHex up to 0.13 and 0.42, respectively. Upon releasing pressure under room temperature, these fcc and distorted hcp FeHex were still observed while the former lost some helium. In addition, our first-principles calculations indicate that fcc FeHe0.25, with He atoms in the tetrahedral interstitial sites, is dynamically stable in different magnetic states throughout 0-50 GPa. These results support that the Earth's core can be a large reservoir of primordial 3He. △ Less

Submitted 20 May, 2024; originally announced May 2024.

Comments: 30 pages, 14 figures, 4 tables

arXiv:2405.08977 [pdf, other]

Constraints on the variation of the fine-structure constant at 3<z<10 with JWST emission-line galaxies

Authors: Linhua Jiang, Shuqi Fu, Feige Wang, Sarah E. I. Bosman, Zheng Cai, Hyunsung D. Jun, Zhiwei Pan, Fengwu Sun, **yi Yang, Huanian Zhang

Abstract: We present constraints on the spacetime variation of the fine-structure constant $α$ at redshifts $3<z<10$ using JWST emission-line galaxies. The galaxy sample consists of 572 high-quality spectra with strong and narrow [O III] $λλ$4959,5007 doublet emission lines from 522 galaxies, including 267 spectra at $z>5$. The [O III] doublet lines are arguably the best emission lines to probe the variatio… ▽ More We present constraints on the spacetime variation of the fine-structure constant $α$ at redshifts $3<z<10$ using JWST emission-line galaxies. The galaxy sample consists of 572 high-quality spectra with strong and narrow [O III] $λλ$4959,5007 doublet emission lines from 522 galaxies, including 267 spectra at $z>5$. The [O III] doublet lines are arguably the best emission lines to probe the variation in $α$. We divide our sample into 5 subsamples based on redshift and calculate the relative variation $Δα/α$ for the individual subsamples. The calculated $Δα/α$ values are consistent with zero within $1σ$ at all redshifts, suggesting no time variation in $α$ above a level of $(1-2) \times10^{-4}$ ($1σ$) in the past 13.2 billion years. When the whole sample is combined, the constraint is improved to be $Δα/α= (0.4\pm0.7) \times10^{-4}$. We further test the spatial variation in $α$ using four subsamples of galaxies in four different directions on the sky. The measured $Δα/α$ values are consistent with zero at a $1σ$ level of $\sim10^{-4}$. While the constraints in this work are not as stringent as those from lower-redshift quasar absorption lines in previous studies, this work uses an independent tracer and provides the first constraints on $Δα/α$ at the highest redshifts. Our analyses also indicate that the relative wavelength calibration of the JWST spectra is robust. With the growing number of emission-line galaxies from JWST, we expect to achieve stronger constraints in the future. △ Less

Submitted 14 May, 2024; originally announced May 2024.

Comments: 9 pages, 6 figures, submitted to ApJ

arXiv:2405.06573 [pdf, other]

An Investigation of Incorporating Mamba for Speech Enhancement

Authors: Rong Chao, Wen-Huang Cheng, Moreno La Quatra, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Szu-Wei Fu, Yu Tsao

Abstract: This work aims to study a scalable state-space model (SSM), Mamba, for the speech enhancement (SE) task. We exploit a Mamba-based regression model to characterize speech signals and build an SE system upon Mamba, termed SEMamba. We explore the properties of Mamba by integrating it as the core model in both basic and advanced SE systems, along with utilizing signal-level distances as well as metric… ▽ More This work aims to study a scalable state-space model (SSM), Mamba, for the speech enhancement (SE) task. We exploit a Mamba-based regression model to characterize speech signals and build an SE system upon Mamba, termed SEMamba. We explore the properties of Mamba by integrating it as the core model in both basic and advanced SE systems, along with utilizing signal-level distances as well as metric-oriented loss functions. SEMamba demonstrates promising results and attains a PESQ score of 3.55 on the VoiceBank-DEMAND dataset. When combined with the perceptual contrast stretching technique, the proposed SEMamba yields a new state-of-the-art PESQ score of 3.69. △ Less

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2405.02559 [pdf]

A Literature Review and Framework for Human Evaluation of Generative Large Language Models in Healthcare

Authors: Thomas Yu Chow Tam, Sonish Sivarajkumar, Sumit Kapoor, Alisa V Stolyar, Katelyn Polanska, Karleigh R McCarthy, Hunter Osterhoudt, Xizhi Wu, Shyam Visweswaran, Sunyang Fu, Piyush Mathur, Giovanni E. Cacciamani, Cong Sun, Yifan Peng, Yanshan Wang

Abstract: As generative artificial intelligence (AI), particularly Large Language Models (LLMs), continues to permeate healthcare, it remains crucial to supplement traditional automated evaluations with human expert evaluation. Understanding and evaluating the generated texts is vital for ensuring safety, reliability, and effectiveness. However, the cumbersome, time-consuming, and non-standardized nature of… ▽ More As generative artificial intelligence (AI), particularly Large Language Models (LLMs), continues to permeate healthcare, it remains crucial to supplement traditional automated evaluations with human expert evaluation. Understanding and evaluating the generated texts is vital for ensuring safety, reliability, and effectiveness. However, the cumbersome, time-consuming, and non-standardized nature of human evaluation presents significant obstacles to the widespread adoption of LLMs in practice. This study reviews existing literature on human evaluation methodologies for LLMs within healthcare. We highlight a notable need for a standardized and consistent human evaluation approach. Our extensive literature search, adhering to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, spans publications from January 2018 to February 2024. This review provides a comprehensive overview of the human evaluation approaches used in diverse healthcare applications.This analysis examines the human evaluation of LLMs across various medical specialties, addressing factors such as evaluation dimensions, sample types, and sizes, the selection and recruitment of evaluators, frameworks and metrics, the evaluation process, and statistical analysis of the results. Drawing from diverse evaluation strategies highlighted in these studies, we propose a comprehensive and practical framework for human evaluation of generative LLMs, named QUEST: Quality of Information, Understanding and Reasoning, Expression Style and Persona, Safety and Harm, and Trust and Confidence. This framework aims to improve the reliability, generalizability, and applicability of human evaluation of generative LLMs in different healthcare applications by defining clear evaluation dimensions and offering detailed guidelines. △ Less

Submitted 4 May, 2024; originally announced May 2024.

arXiv:2404.18873 [pdf, other]

OpenStreetView-5M: The Many Roads to Global Visual Geolocation

Authors: Guillaume Astruc, Nicolas Dufour, Ioannis Siglidis, Constantin Aronssohn, Nacim Bouia, Stephanie Fu, Romain Loiseau, Van Nguyen Nguyen, Charles Raude, Elliot Vincent, Lintao XU, Hongyu Zhou, Loic Landrieu

Abstract: Determining the location of an image anywhere on Earth is a complex visual task, which makes it particularly relevant for evaluating computer vision algorithms. Yet, the absence of standard, large-scale, open-access datasets with reliably localizable images has limited its potential. To address this issue, we introduce OpenStreetView-5M, a large-scale, open-access dataset comprising over 5.1 milli… ▽ More Determining the location of an image anywhere on Earth is a complex visual task, which makes it particularly relevant for evaluating computer vision algorithms. Yet, the absence of standard, large-scale, open-access datasets with reliably localizable images has limited its potential. To address this issue, we introduce OpenStreetView-5M, a large-scale, open-access dataset comprising over 5.1 million geo-referenced street view images, covering 225 countries and territories. In contrast to existing benchmarks, we enforce a strict train/test separation, allowing us to evaluate the relevance of learned geographical features beyond mere memorization. To demonstrate the utility of our dataset, we conduct an extensive benchmark of various state-of-the-art image encoders, spatial representations, and training strategies. All associated codes and models can be found at https://github.com/gastruc/osv5m. △ Less

Submitted 29 April, 2024; originally announced April 2024.

Comments: CVPR 2024

arXiv:2404.16425 [pdf, other]

Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,… ▽ More Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a, whose bright peak was also detected by the Swift Burst Alert Telescope and Konus-Wind through off-line analyses. At a redshift of $z=4.859$, EP240315a showed a much longer and more complicated light curve in the soft X-ray band than in gamma-rays. Benefiting from a large field-of-view ($\sim$3600 deg$^2$) and a high sensitivity, EP-WXT captured the earlier engine activation and extended late engine activity through a continuous detection. With a peak X-ray flux at the faint end of previously known high-$z$ GRBs, the detection of EP240315a demonstrates the great potential for EP to study the early universe via GRBs. △ Less

Submitted 25 April, 2024; originally announced April 2024.

Comments: 41 pages, 8 figures, 7 tables

arXiv:2404.12955 [pdf]

Anisotropic electron-phonon interactions in 2D lead-halide perovskites

Authors: Jaco J. Geuchies, Johan Klarbring, Lucia Di Virgillio, Shuai Fu, Sheng Qu, Guangyu Liu, Hai Wang, Jarvist M. Frost, Aron Walsh, Mischa Bonn, Heejae Kim

Abstract: Two-dimensional hybrid organic-inorganic metal halide perovskites offer enhanced stability for perovskite-based applications. Their crystal structure's soft and ionic nature gives rise to strong interactions between charge carriers and ionic rearrangements. Here, we investigate the interaction of photo-generated electrons and ionic polarizations in single-crystal 2D perovskite butylammonium lead i… ▽ More Two-dimensional hybrid organic-inorganic metal halide perovskites offer enhanced stability for perovskite-based applications. Their crystal structure's soft and ionic nature gives rise to strong interactions between charge carriers and ionic rearrangements. Here, we investigate the interaction of photo-generated electrons and ionic polarizations in single-crystal 2D perovskite butylammonium lead iodide, varying the inorganic lammelae thickness in the 2D single crystals. We determined the directionality of the transition dipole moments of the relevant phonon modes (in the 0.3-3 THz range) by angle-and-polarization dependent THz transmission measurements. We find a clear anisotropy of the in-plane photoconductivity, with a 10% reduction along the axis parallel with the transition dipole moment of the most strongly coupled phonon. Detailed calculations, based on Feynman polaron theory, indicate that the anisotropy originates from directional electron-phonon interactions. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: 50 pages, 5 figures main text, 13 figures in supplementary information

arXiv:2404.04453 [pdf, other]

With or without $ν$? Hunting for the seed of the matter-antimatter asymmetry

Authors: CUORE Collaboration, D. Q. Adams, C. Alduino, K. Alfonso, F. T. Avignone III, O. Azzolini, G. Bari, F. Bellini, G. Benato, M. Beretta, M. Biassoni, A. Branca, C. Brofferio, C. Bucci, J. Camilleri, A. Caminata, A. Campani, J. Cao, S. Capelli, C. Capelli, L. Cappelli, L. Cardani, P. Carniti, N. Casali, E. Celi , et al. (93 additional authors not shown)

Abstract: The matter-antimatter asymmetry underlines the incompleteness of the current understanding of particle physics. Neutrinoless double-beta ($0νββ$) decay may help explain this asymmetry, while unveiling the Majorana nature of the neutrino. The CUORE experiment searches for $0νββ$ decay of $^{130}$Te using a tonne-scale cryogenic calorimeter operated at milli-kelvin temperatures. We report no evidenc… ▽ More The matter-antimatter asymmetry underlines the incompleteness of the current understanding of particle physics. Neutrinoless double-beta ($0νββ$) decay may help explain this asymmetry, while unveiling the Majorana nature of the neutrino. The CUORE experiment searches for $0νββ$ decay of $^{130}$Te using a tonne-scale cryogenic calorimeter operated at milli-kelvin temperatures. We report no evidence for $0νββ$ decay and place a lower limit on the half-life of T$_{1/2}$ $>$ 3.8 $\times$ 10$^{25}$ years (90% C.I.) with over 2 tonne$\cdot$year TeO$_2$ exposure. The tools and techniques developed for this result and the 5 year stable operation of nearly 1000 detectors demonstrate the infrastructure for a next-generation experiment capable of searching for $0νββ$ decay across multiple isotopes. △ Less

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2404.02463 [pdf, other]

Spin-NeuroMem: A Low-Power Neuromorphic Associative Memory Design Based on Spintronic Devices

Authors: Siqing Fu, Tiejun Li, Chunyuan Zhang, Sheng Ma, Jianmin Zhang, Lizhou Wu

Abstract: Biologically-inspired computing models have made significant progress in recent years, but the conventional von Neumann architecture is inefficient for the large-scale matrix operations and massive parallelism required by these models. This paper presents Spin-NeuroMem, a low-power circuit design of Hopfield network for the function of associative memory. Spin-NeuroMem is equipped with energy-effi… ▽ More Biologically-inspired computing models have made significant progress in recent years, but the conventional von Neumann architecture is inefficient for the large-scale matrix operations and massive parallelism required by these models. This paper presents Spin-NeuroMem, a low-power circuit design of Hopfield network for the function of associative memory. Spin-NeuroMem is equipped with energy-efficient spintronic synapses which utilize magnetic tunnel junctions (MTJs) to store weight matrices of multiple associative memories. The proposed synapse design achieves as low as 17.4% power consumption compared to the state-of-the-art synapse designs. Spin-NeuroMem also encompasses a novel voltage converter with 60% less transistor usage for effective Hopfield network computation. In addition, we propose an associative memory simulator for the first time, which achieves a 5.05Mx speedup with a comparable associative memory effect. By harnessing the potential of spintronic devices, this work sheds light on the development of energy-efficient and scalable neuromorphic computing systems. The source code will be publicly available after the manuscript is reviewed. △ Less

Submitted 3 April, 2024; originally announced April 2024.

arXiv:2404.02433 [pdf, other]

A fast cosine transformation accelerated method for predicting effective thermal conductivity

Authors: Changqing Ye, Shubin Fu, Eric T. Chung

Abstract: Predicting effective thermal conductivity by solving a Partial Differential Equation (PDE) defined on a high-resolution Representative Volume Element (RVE) is a computationally intensive task. In this paper, we tackle the task by proposing an efficient and implementation-friendly computational method that can fully leverage the computing power offered by hardware accelerators, namely, graphical pr… ▽ More Predicting effective thermal conductivity by solving a Partial Differential Equation (PDE) defined on a high-resolution Representative Volume Element (RVE) is a computationally intensive task. In this paper, we tackle the task by proposing an efficient and implementation-friendly computational method that can fully leverage the computing power offered by hardware accelerators, namely, graphical processing units (GPUs). We first employ the Two-Point Flux-Approximation scheme to discretize the PDE and then utilize the preconditioned conjugate gradient method to solve the resulting algebraic linear system. The construction of the preconditioner originates from FFT-based homogenization methods, and an engineered linear programming technique is utilized to determine the homogeneous reference parameters. The fundamental observation presented in this paper is that the preconditioner system can be effectively solved using multiple Fast Cosine Transformations (FCT) and parallel tridiagonal matrix solvers. Regarding the fact that default multiple FCTs are unavailable on the CUDA platform, we detail how to derive FCTs from FFTs with nearly optimal memory usage. Numerical experiments including the stability comparison with standard preconditioners are conducted for 3D RVEs. Our performance reports indicate that the proposed method can achieve a $5$-fold acceleration on the GPU platform over the pure CPU platform and solve the problems with $512^3$ degrees of freedom and reasonable contrast ratios in less than $30$ seconds. △ Less

Submitted 2 April, 2024; originally announced April 2024.

arXiv:2403.19356 [pdf, other]

A robust two-level overlap** preconditioner for Darcy flow in high-contrast media

Authors: Changqing Ye, Shubin Fu, Eric T. Chung, Jizu Huang

Abstract: In this article, a two-level overlap** domain decomposition preconditioner is developed for solving linear algebraic systems obtained from simulating Darcy flow in high-contrast media. Our preconditioner starts at a mixed finite element method for discretizing the partial differential equation by Darcy's law with the no-flux boundary condition and is then followed by a velocity elimination techn… ▽ More In this article, a two-level overlap** domain decomposition preconditioner is developed for solving linear algebraic systems obtained from simulating Darcy flow in high-contrast media. Our preconditioner starts at a mixed finite element method for discretizing the partial differential equation by Darcy's law with the no-flux boundary condition and is then followed by a velocity elimination technique to yield a linear algebraic system with only unknowns of pressure. Then, our main objective is to design a robust and efficient domain decomposition preconditioner for this system, which is accomplished by engineering a multiscale coarse space that is capable of characterizing high-contrast features of the permeability field. A generalized eigenvalue problem is solved in each non-overlap** coarse element in a communication-free manner to form the global solver, which is accompanied by local solvers originated from additive Schwarz methods but with a non-Galerkin discretization to derive the two-level preconditioner. We provide a rigorous analysis that indicates that the condition number of the preconditioned system could be bounded above with several assumptions. Extensive numerical experiments with various types of three-dimensional high-contrast models are exhibited. In particular, we study the robustness against the contrast of the media as well as the influences of numbers of eigenfunctions, oversampling sizes, and subdomain partitions on the efficiency of the proposed preconditioner. Besides, strong and weak scalability performances are also examined. △ Less

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.19342 [pdf, other]

An efficient multiscale multigrid preconditioner for Darcy flow in high-contrast media

Authors: Changqing Ye, Shubin Fu, Eric T. Chung, Jizu Huang

Abstract: In this paper, we develop a multigrid preconditioner to solve Darcy flow in highly heterogeneous porous media. The key component of the preconditioner is to construct a sequence of nested subspaces $W_{\mathcal{L}}\subset W_{\mathcal{L}-1}\subset\cdots\subset W_1=W_h$. An appropriate spectral problem is defined in the space of $W_{i-1}$, then the eigenfunctions of the spectral problems are utilize… ▽ More In this paper, we develop a multigrid preconditioner to solve Darcy flow in highly heterogeneous porous media. The key component of the preconditioner is to construct a sequence of nested subspaces $W_{\mathcal{L}}\subset W_{\mathcal{L}-1}\subset\cdots\subset W_1=W_h$. An appropriate spectral problem is defined in the space of $W_{i-1}$, then the eigenfunctions of the spectral problems are utilized to form $W_i$. The preconditioner is applied to solve a positive semidefinite linear system which results from discretizing the Darcy flow equation with the lowest order Raviart-Thomas spaces and adopting a trapezoidal quadrature rule. Theoretical analysis and numerical investigations of this preconditioner will be presented. In particular, we will consider several typical highly heterogeneous permeability fields whose resolutions are up to $1024^3$ and examine the computational performance of the preconditioner in several aspects, such as strong scalability, weak scalability, and robustness against the contrast of the media. We also demonstrate an application of this preconditioner for solving a two-phase flow benchmark problem. △ Less

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.15183 [pdf, other]

CRPlace: Camera-Radar Fusion with BEV Representation for Place Recognition

Authors: Shaowei Fu, Yifan Duan, Yao Li, Chengzhen Meng, Yingjie Wang, Jianmin Ji, Yanyong Zhang

Abstract: The integration of complementary characteristics from camera and radar data has emerged as an effective approach in 3D object detection. However, such fusion-based methods remain unexplored for place recognition, an equally important task for autonomous systems. Given that place recognition relies on the similarity between a query scene and the corresponding candidate scene, the stationary backgro… ▽ More The integration of complementary characteristics from camera and radar data has emerged as an effective approach in 3D object detection. However, such fusion-based methods remain unexplored for place recognition, an equally important task for autonomous systems. Given that place recognition relies on the similarity between a query scene and the corresponding candidate scene, the stationary background of a scene is expected to play a crucial role in the task. As such, current well-designed camera-radar fusion methods for 3D object detection can hardly take effect in place recognition because they mainly focus on dynamic foreground objects. In this paper, a background-attentive camera-radar fusion-based method, named CRPlace, is proposed to generate background-attentive global descriptors from multi-view images and radar point clouds for accurate place recognition. To extract stationary background features effectively, we design an adaptive module that generates the background-attentive mask by utilizing the camera BEV feature and radar dynamic points. With the guidance of a background mask, we devise a bidirectional cross-attention-based spatial fusion strategy to facilitate comprehensive spatial interaction between the background information of the camera BEV feature and the radar BEV feature. As the first camera-radar fusion-based place recognition network, CRPlace has been evaluated thoroughly on the nuScenes dataset. The results show that our algorithm outperforms a variety of baseline methods across a comprehensive set of metrics (recall@1 reaches 91.2%). △ Less

Submitted 22 March, 2024; originally announced March 2024.

arXiv:2403.10516 [pdf, other]

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

Authors: Stephanie Fu, Mark Hamilton, Laura Brandt, Axel Feldman, Zhoutong Zhang, William T. Freeman

Abstract: Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime. However, these features often lack the spatial resolution to directly perform dense prediction tasks like segmentation and depth prediction because models aggressively pool information over large areas. In this work, we in… ▽ More Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime. However, these features often lack the spatial resolution to directly perform dense prediction tasks like segmentation and depth prediction because models aggressively pool information over large areas. In this work, we introduce FeatUp, a task- and model-agnostic framework to restore lost spatial information in deep features. We introduce two variants of FeatUp: one that guides features with high-resolution signal in a single forward pass, and one that fits an implicit model to a single image to reconstruct features at any resolution. Both approaches use a multi-view consistency loss with deep analogies to NeRFs. Our features retain their original semantics and can be swapped into existing applications to yield resolution and performance gains even without re-training. We show that FeatUp significantly outperforms other feature upsampling and image super-resolution approaches in class activation map generation, transfer learning for segmentation and depth prediction, and end-to-end training for semantic segmentation. △ Less

Submitted 1 April, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

Comments: Accepted to the International Conference on Learning Representations (ICLR) 2024

arXiv:2403.05383 [pdf]

Thermal cycling induced evolution and colossal exchange bias in MnPS3/Fe3GeTe2 van der Waals heterostructures

Authors: Aravind Puthirath Balan, Aditya Kumar, Patrick Reiser, Joseph Vas, Thibaud Denneulin, Khoa Dang Lee, Tom G Saunderson, Märta Tschudin, Clement Pellet-Mary, Debarghya Dutta, Carolin Schrader, Tanja Scholz, Jaco Geuchies, Shuai Fu, Hai Wang, Alberta Bonanni, Bettina V. Lotsch, Ulrich Nowak, Gerhard Jakob, Jacob Gayles, Andras Kovacs, Rafal E. Dunin-Borkowski, Patrick Maletinsky, Mathias Kläui

Abstract: The exchange bias phenomenon, inherent in exchange-coupled ferromagnetic and antiferromagnetic systems, has intrigued researchers for decades. Van der Waals materials, with their layered structure, provide an optimal platform for probing such physical phenomena. However, achieving a facile and effective means to manipulate exchange bias in pristine van der Waals heterostructures remains challengin… ▽ More The exchange bias phenomenon, inherent in exchange-coupled ferromagnetic and antiferromagnetic systems, has intrigued researchers for decades. Van der Waals materials, with their layered structure, provide an optimal platform for probing such physical phenomena. However, achieving a facile and effective means to manipulate exchange bias in pristine van der Waals heterostructures remains challenging. In this study, we investigate the origin of exchange bias in MnPS3/Fe3GeTe2 van der Waals heterostructures. Our work demonstrates a method to modulate unidirectional exchange anisotropy, achieving an unprecedented nearly 1000% variation through simple thermal cycling. Despite the compensated interfacial spin configuration of MnPS3, magneto-transport measurements reveal a huge 170 mT exchange bias at 5 K, the largest observed in pristine van der Waals antiferromagnet-ferromagnet interfaces. This substantial magnitude of the exchange bias is linked to an anomalous weak ferromagnetic ordering in MnPS3 below 40 K. On the other hand, the tunability of exchange bias during thermal cycling is ascribed to the modified arrangement of interfacial atoms and changes in the vdW gap during field cooling. Our findings highlight a robust and easily adjustable exchange bias in van der Waals antiferromagnetic/ferromagnetic heterostructures, presenting a straightforward approach to enhance other interface related spintronic phenomena for practical applications. △ Less

Submitted 8 March, 2024; originally announced March 2024.

arXiv:2403.05264 [pdf, other]

doi 10.1145/3613904.3642912

To Reach the Unreachable: Exploring the Potential of VR Hand Redirection for Upper Limb Rehabilitation

Authors: Peixuan Xiong, Yukai Zhang, Nandi Zhang, Shihan Fu, Xin Li, Yadan Zheng, **ni Zhou, Xiquan Hu, Mingming Fan

Abstract: Rehabilitation therapies are widely employed to assist people with motor impairments in regaining control over their affected body parts. Nevertheless, factors such as fatigue and low self-efficacy can hinder patient compliance during extensive rehabilitation processes. Utilizing hand redirection in virtual reality (VR) enables patients to accomplish seemingly more challenging tasks, thereby bolst… ▽ More Rehabilitation therapies are widely employed to assist people with motor impairments in regaining control over their affected body parts. Nevertheless, factors such as fatigue and low self-efficacy can hinder patient compliance during extensive rehabilitation processes. Utilizing hand redirection in virtual reality (VR) enables patients to accomplish seemingly more challenging tasks, thereby bolstering their motivation and confidence. While previous research has investigated user experience and hand redirection among able-bodied people, its effects on motor-impaired people remain unexplored. In this paper, we present a VR rehabilitation application that harnesses hand redirection. Through a user study and semi-structured interviews, we examine the impact of hand redirection on the rehabilitation experiences of people with motor impairments and its potential to enhance their motivation for upper limb rehabilitation. Our findings suggest that patients are not sensitive to hand movement inconsistency, and the majority express interest in incorporating hand redirection into future long-term VR rehabilitation programs. △ Less

Submitted 8 March, 2024; originally announced March 2024.

Comments: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11--16, 2024, Honolulu, HI, USA

arXiv:2403.01966 [pdf, other]

Enhancing Information Maximization with Distance-Aware Contrastive Learning for Source-Free Cross-Domain Few-Shot Learning

Authors: Huali Xu, Li Liu, Shuaifeng Zhi, Shao**g Fu, Zhuo Su, Ming-Ming Cheng, Yongxiang Liu

Abstract: Existing Cross-Domain Few-Shot Learning (CDFSL) methods require access to source domain data to train a model in the pre-training phase. However, due to increasing concerns about data privacy and the desire to reduce data transmission and training costs, it is necessary to develop a CDFSL solution without accessing source data. For this reason, this paper explores a Source-Free CDFSL (SF-CDFSL) pr… ▽ More Existing Cross-Domain Few-Shot Learning (CDFSL) methods require access to source domain data to train a model in the pre-training phase. However, due to increasing concerns about data privacy and the desire to reduce data transmission and training costs, it is necessary to develop a CDFSL solution without accessing source data. For this reason, this paper explores a Source-Free CDFSL (SF-CDFSL) problem, in which CDFSL is addressed through the use of existing pretrained models instead of training a model with source data, avoiding accessing source data. This paper proposes an Enhanced Information Maximization with Distance-Aware Contrastive Learning (IM-DCL) method to address these challenges. Firstly, we introduce the transductive mechanism for learning the query set. Secondly, information maximization (IM) is explored to map target samples into both individual certainty and global diversity predictions, hel** the source model better fit the target data distribution. However, IM fails to learn the decision boundary of the target task. This motivates us to introduce a novel approach called Distance-Aware Contrastive Learning (DCL), in which we consider the entire feature set as both positive and negative sets, akin to Schrodinger's concept of a dual state. Instead of a rigid separation between positive and negative sets, we employ a weighted distance calculation among features to establish a soft classification of the positive and negative sets for the entire feature set. Furthermore, we address issues related to IM by incorporating contrastive constraints between object features and their corresponding positive and negative sets. Evaluations of the 4 datasets in the BSCD-FSL benchmark indicate that the proposed IM-DCL, without accessing the source domain, demonstrates superiority over existing methods, especially in the distant domain task. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: Accepted by TIP, 16 pages, 11 figures, 8 tables

arXiv:2403.00378 [pdf, ps, other]

A group action on cyclic compositions and $γ$-positivity

Authors: Shishuo Fu, Jie Yang

Abstract: Let $w_{n,k,m}$ be the number of Dyck paths of semilength $n$ with $k$ occurrences of $UD$ and $m$ occurrences of $UUD$. We establish in two ways a new interpretation of the numbers $w_{n,k,m}$ in terms of plane trees and internal nodes. The first way builds on a new characterization of plane trees that involves cyclic compositions. The second proof utilizes a known interpretation of $w_{n,k,m}$ i… ▽ More Let $w_{n,k,m}$ be the number of Dyck paths of semilength $n$ with $k$ occurrences of $UD$ and $m$ occurrences of $UUD$. We establish in two ways a new interpretation of the numbers $w_{n,k,m}$ in terms of plane trees and internal nodes. The first way builds on a new characterization of plane trees that involves cyclic compositions. The second proof utilizes a known interpretation of $w_{n,k,m}$ in terms of plane trees and leaves, and a recent involution on plane trees constructed by Li, Lin, and Zhao. Moreover, a group action on the set of cyclic compositions (or equivalently, $2$-dominant compositions) is introduced, which amounts to give a combinatorial proof of the $γ$-positivity of the Narayana polynomial, as well as the $γ$-positivity of the polynomial $W_{2k+1,k}(t):=\sum_{1\le m\le k}w_{2k+1,k,m}t^m$ previously obtained by Bóna et al, with apparently new combinatorial interpretations of their $γ$-coefficients. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Comments: 19 pages, 3 figures

MSC Class: 05A05; 05A10; 05A15; 05C05

arXiv:2402.16321 [pdf, other]

Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech

Authors: Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao, Yu-Chiang Frank Wang

Abstract: Speech quality estimation has recently undergone a paradigm shift from human-hearing expert designs to machine-learning models. However, current models rely mainly on supervised learning, which is time-consuming and expensive for label collection. To solve this problem, we propose VQScore, a self-supervised metric for evaluating speech based on the quantization error of a vector-quantized-variatio… ▽ More Speech quality estimation has recently undergone a paradigm shift from human-hearing expert designs to machine-learning models. However, current models rely mainly on supervised learning, which is time-consuming and expensive for label collection. To solve this problem, we propose VQScore, a self-supervised metric for evaluating speech based on the quantization error of a vector-quantized-variational autoencoder (VQ-VAE). The training of VQ-VAE relies on clean speech; hence, large quantization errors can be expected when the speech is distorted. To further improve correlation with real quality scores, domain knowledge of speech processing is incorporated into the model design. We found that the vector quantization mechanism could also be used for self-supervised speech enhancement (SE) model training. To improve the robustness of the encoder for SE, a novel self-distillation mechanism combined with adversarial training is introduced. In summary, the proposed speech quality estimation method and enhancement models require only clean speech for training without any label requirements. Experimental results show that the proposed VQScore and enhancement model are competitive with supervised baselines. The code will be released after publication. △ Less

Submitted 26 February, 2024; originally announced February 2024.

Comments: Published as a conference paper at ICLR 2024

arXiv:2402.15719 [pdf, other]

"It Is Hard to Remove from My Eye": Design Makeup Residue Visualization System for Chinese Traditional Opera (Xiqu) Performers

Authors: Zeyu Xiong, Shihan Fu, Yanying Zhu, Chenqing Zhu, Xiaojuan Ma, Mingming Fan

Abstract: Chinese traditional opera (Xiqu) performers often experience skin problems due to the long-term use of heavy-metal-laden face paints. To explore the current skincare challenges encountered by Xiqu performers, we conducted an online survey (N=136) and semi-structured interviews (N=15) as a formative study. We found that incomplete makeup removal is the leading cause of human-induced skin problems,… ▽ More Chinese traditional opera (Xiqu) performers often experience skin problems due to the long-term use of heavy-metal-laden face paints. To explore the current skincare challenges encountered by Xiqu performers, we conducted an online survey (N=136) and semi-structured interviews (N=15) as a formative study. We found that incomplete makeup removal is the leading cause of human-induced skin problems, especially the difficulty in removing eye makeup. Therefore, we proposed EyeVis, a prototype that can visualize the residual eye makeup and record the time make-up was worn by Xiqu performers. We conducted a 7-day deployment study (N=12) to evaluate EyeVis. Results indicate that EyeVis helps to increase Xiqu performers' awareness about removing makeup, as well as boosting their confidence and security in skincare. Overall, this work also provides implications for studying the work of people who wear makeup on a daily basis, and helps to promote and preserve the intangible cultural heritage of practitioners. △ Less

Submitted 24 February, 2024; originally announced February 2024.

Comments: Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA

arXiv:2402.11778 [pdf, other]

Towards Theoretical Understandings of Self-Consuming Generative Models

Authors: Shi Fu, Sen Zhang, Yingjie Wang, Xinmei Tian, Dacheng Tao

Abstract: This paper tackles the emerging challenge of training generative models within a self-consuming loop, wherein successive generations of models are recursively trained on mixtures of real and synthetic data from previous generations. We construct a theoretical framework to rigorously evaluate how this training procedure impacts the data distributions learned by future models, including parametric a… ▽ More This paper tackles the emerging challenge of training generative models within a self-consuming loop, wherein successive generations of models are recursively trained on mixtures of real and synthetic data from previous generations. We construct a theoretical framework to rigorously evaluate how this training procedure impacts the data distributions learned by future models, including parametric and non-parametric models. Specifically, we derive bounds on the total variation (TV) distance between the synthetic data distributions produced by future models and the original real data distribution under various mixed training scenarios for diffusion models with a one-hidden-layer neural network score function. Our analysis demonstrates that this distance can be effectively controlled under the condition that mixed training dataset sizes or proportions of real data are large enough. Interestingly, we further unveil a phase transition induced by expanding synthetic data amounts, proving theoretically that while the TV distance exhibits an initial ascent, it declines beyond a threshold point. Finally, we present results for kernel density estimation, delivering nuanced insights such as the impact of mixed data training on error propagation. △ Less

Submitted 24 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

Comments: Accepted at ICML 2024

arXiv:2402.10337 [pdf, other]

LoVoCCS -- II. Weak Lensing Mass Distributions, Red-Sequence Galaxy Distributions, and Their Alignment with the Brightest Cluster Galaxy in 58 Nearby X-ray-Luminous Galaxy Clusters

Authors: Shenming Fu, Ian Dell'Antonio, Zacharias Escalante, Jessica Nelson, Anthony Englert, Søren Helhoski, Rahul Shinde, Julia Brockland, Philip LaDuca, Christelyn Larkin, Lucca Paris, Shane Weiner, William K. Black, Ranga-Ram Chary, Douglas Clowe, M. C. Cooper, Megan Donahue, August Evrard, Mark Lacy, Tod Lauer, Binyang Liu, Jacqueline McCleary, Massimo Meneghetti, Hironao Miyatake, Mireia Montes , et al. (9 additional authors not shown)

Abstract: The Local Volume Complete Cluster Survey (LoVoCCS) is an on-going program to observe nearly a hundred low-redshift X-ray-luminous galaxy clusters (redshifts $0.03<z<0.12$ and X-ray luminosities in the 0.1-2.4 keV band $L_{X500c}>10^{44}$ erg/s) with the Dark Energy Camera (DECam), capturing data in $u,g,r,i,z$ bands with a $5σ$ point source depth of approximately 25-26th AB magnitudes. Here, we ma… ▽ More The Local Volume Complete Cluster Survey (LoVoCCS) is an on-going program to observe nearly a hundred low-redshift X-ray-luminous galaxy clusters (redshifts $0.03<z<0.12$ and X-ray luminosities in the 0.1-2.4 keV band $L_{X500c}>10^{44}$ erg/s) with the Dark Energy Camera (DECam), capturing data in $u,g,r,i,z$ bands with a $5σ$ point source depth of approximately 25-26th AB magnitudes. Here, we map the aperture masses in 58 galaxy cluster fields using weak gravitational lensing. These clusters span a variety of dynamical states, from nearly relaxed to merging systems, and approximately half of them have not been subject to detailed weak lensing analysis before. In each cluster field, we analyze the alignment between the 2D mass distribution described by the aperture mass map, the 2D red-sequence (RS) galaxy distribution, and the brightest cluster galaxy (BCG). We find that the orientations of the BCG and the RS distribution are strongly aligned throughout the interiors of the clusters: the median misalignment angle is 19 deg within 2 Mpc. We also observe the alignment between the orientations of the RS distribution and the overall cluster mass distribution (by a median difference of 32 deg within 1 Mpc), although this is constrained by galaxy shape noise and the limitations of our cluster sample size. These types of alignment suggest long-term dynamical evolution within the clusters over cosmic timescales. △ Less

Submitted 15 February, 2024; originally announced February 2024.

Comments: 40 pages, 16 figures, 5 tables. Submitted to ApJ. Comments are welcome and appreciated

arXiv:2402.02130 [pdf, other]

GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning

Authors: Yanbin Wei, Shuai Fu, Weisen Jiang, Zejian Zhang, Zhixiong Zeng, Qi Wu, James T. Kwok, Yu Zhang

Abstract: Large Language Models (LLMs) are increasingly used for various tasks with graph structures. Though LLMs can process graph information in a textual format, they overlook the rich vision modality, which is an intuitive way for humans to comprehend structural information and conduct general graph reasoning. The potential benefits and capabilities of representing graph structures as visual images (i.e… ▽ More Large Language Models (LLMs) are increasingly used for various tasks with graph structures. Though LLMs can process graph information in a textual format, they overlook the rich vision modality, which is an intuitive way for humans to comprehend structural information and conduct general graph reasoning. The potential benefits and capabilities of representing graph structures as visual images (i.e., $\textit{visual graph}$) are still unexplored. To fill the gap, we innovatively propose an end-to-end framework, called $\textbf{G}$raph to v$\textbf{I}$sual and $\textbf{T}$extual Integr$\textbf{A}$tion (GITA), which firstly incorporates visual graphs into general graph reasoning. Besides, we establish $\textbf{G}$raph-based $\textbf{V}$ision-$\textbf{L}$anguage $\textbf{Q}$uestion $\textbf{A}$nswering (GVLQA) dataset from existing graph data, which is the first vision-language dataset for general graph reasoning purposes. Extensive experiments on the GVLQA dataset and five real-world datasets show that GITA outperforms mainstream LLMs in terms of general graph reasoning capabilities. Moreover, We highlight the effectiveness of the layout augmentation on visual graphs and pretraining on the GVLQA dataset. △ Less

Submitted 24 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

arXiv:2402.01080 [pdf, other]

Photonic Spin-Orbit Coupling Induced by Deep-Subwavelength Structured Light

Authors: Xin Zhang, Guohua Liu, Yanwen Hu, Haolin Lin, Zepei Zeng, Xiliang Zhang, Zhen Li, Zhenqiang Chen, Shenhe Fu

Abstract: We demonstrate both theoretically and experimentally beam-dependent photonic spin-orbit coupling in a two-wave mixing process described by an equivalent of the Pauli equation in quantum mechanics. The considered structured light in the system is comprising a superposition of two orthogonal spin-orbit-coupled states defined as spin up and spin down equivalents. The spin-orbit coupling is manifested… ▽ More We demonstrate both theoretically and experimentally beam-dependent photonic spin-orbit coupling in a two-wave mixing process described by an equivalent of the Pauli equation in quantum mechanics. The considered structured light in the system is comprising a superposition of two orthogonal spin-orbit-coupled states defined as spin up and spin down equivalents. The spin-orbit coupling is manifested by prominent pseudo spin precession as well as spin-transport-induced orbital angular momentum generation in a photonic crystal film of wavelength thickness. The coupling effect is significantly enhanced by using a deep-subwavelength carrier envelope, different from previous studies which depend on materials. The beam-dependent coupling effect can find intriguing applications; for instance, it is used in precisely measuring variation of light with spatial resolution up to 15 nm. △ Less

Submitted 1 February, 2024; originally announced February 2024.

Comments: 9 pages, 8 figures, and 68 conferences

arXiv:2401.12870 [pdf, other]

Unlocking the Potential: Multi-task Deep Learning for Spaceborne Quantitative Monitoring of Fugitive Methane Plumes

Authors: Guoxin Si, Shiliang Fu, Wei Yao

Abstract: With the intensification of global warming, the monitoring of methane emission and detection of gas plumes from landfills have increasingly received attention. We decompose methane emission monitoring into three sub-tasks: methane concentration inversion, plume segmentation, and emission rate estimation. Conventional algorithms have limitations: methane concentration inversion usually uses the mat… ▽ More With the intensification of global warming, the monitoring of methane emission and detection of gas plumes from landfills have increasingly received attention. We decompose methane emission monitoring into three sub-tasks: methane concentration inversion, plume segmentation, and emission rate estimation. Conventional algorithms have limitations: methane concentration inversion usually uses the matched filter, which is sensitive to global spectrum distribution and contains a large amount of noises. There is limited research on plume segmentation, with many studies resorting to manual segmentation that is likely to be subjective. The estimation of methane emission rate often utilizes IME algorithm, which relies on obtaining meteorological measurement data. Using the WENT landfill site in Hong Kong and PRISMA hyperspectral satellite imagery, we propose a new deep learning-based framework for quantitative monitoring of methane emissions from remote sensing images based on physical simulation. We generate simulated methane plumes using large eddy simulation (LES) and different concentration maps of fugitive emission using the radiative transfer equation (RTE), while combining augmentation techniques to create a simulated PRISMA dataset. We train a U-Net network for methane concentration inversion, a Mask R-CNN network for methane plume segmentation, and a ResNet-50 network for methane emission rate estimation. All three deep networks achieve higher validation accuracy compared to conventional algorithms. We further respectively combine the first two sub-tasks and the last two sub-tasks to design the multi-task learning models - MTL-01 and MTL-02, both of which achieve higher accuracy than single-task models. Our research serves as a demonstration of applying multi-task deep learning to quantitative methane monitoring and can be extended to a broad range of methane monitoring tasks. △ Less

Submitted 23 January, 2024; originally announced January 2024.

arXiv:2401.12468 [pdf, ps, other]

Minimum observability of probabilistic Boolean networks

Authors: Jiayi Xu, Shihua Fu, Liyuan Xia, Jianjun Wang

Abstract: This paper studies the minimum observability of probabilistic Boolean networks (PBNs), the main objective of which is to add the fewest measurements to make an unobservable PBN become observable. First of all, the algebraic form of a PBN is established with the help of semi-tensor product (STP) of matrices. By combining the algebraic forms of two identical PBNs into a parallel system, a method to… ▽ More This paper studies the minimum observability of probabilistic Boolean networks (PBNs), the main objective of which is to add the fewest measurements to make an unobservable PBN become observable. First of all, the algebraic form of a PBN is established with the help of semi-tensor product (STP) of matrices. By combining the algebraic forms of two identical PBNs into a parallel system, a method to search the states that need to be H-distinguishable is proposed based on the robust set reachability technique. Secondly, a necessary and sufficient condition is given to find the minimum measurements such that a given set can be H-distinguishable. Moreover, by comparing the numbers of measurements for all the feasible H-distinguishable state sets, the least measurements that make the system observable are gained. Finally, an example is given to verify the validity of the obtained results. △ Less

Submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.06258 [pdf, other]

LUCE: A milli-Kelvin calorimeter experiment to study the electron capture of 176Lu

Authors: Shihong Fu, Giovanni Benato, Carlo Bucci, Paolo Gorla, Pedro V. Guillaumon, Jiang Li, Serge Nagorny, Francesco Nozzoli, Lorenzo Pagnanini, Andrei Puiu, Matthew Stukel

Abstract: The LUCE (LUtetium sCintillation Experiment) project will search for the 176Lu electron capture based on a milli-Kelvin calorimetric approach. This decay is of special interest in the field of nuclear structure, with implications for the s-process and for a better comprehension of the nuclear matrix elements of neutrinoless double beta decay (0ν\b{eta}\b{eta}) and two-neutrino double beta decay (2… ▽ More The LUCE (LUtetium sCintillation Experiment) project will search for the 176Lu electron capture based on a milli-Kelvin calorimetric approach. This decay is of special interest in the field of nuclear structure, with implications for the s-process and for a better comprehension of the nuclear matrix elements of neutrinoless double beta decay (0ν\b{eta}\b{eta}) and two-neutrino double beta decay (2ν\b{eta}\b{eta}). Possible impacts also include the development of a new class of coherent elastic neutrino-nucleus scattering (CEνNS) and spin-dependent (independent) dark matter detectors. We report on the current status and design of a novel detector cryogenic-module for the measurement of the electron capture and detail a future measurement plan. △ Less

Submitted 8 November, 2023; originally announced January 2024.

Comments: proceedings

arXiv:2401.05920 [pdf, other]

The Magellan M2FS spectroscopic survey of high-redshift galaxies: the brightest Lyman-break galaxies at $z \sim 6$

Authors: Shuqi Fu, Linhua Jiang, Yuanhang Ning, Weiyang Liu, Zhiwei Pan

Abstract: We present a study of a sample of 45 spectroscopically confirmed, UV luminous galaxies at $z\sim 6$. They were selected as bright Lyman-break galaxies (LBGs) using deep multi-band optical images in more than 2 deg$^2$ of the sky, and subsequently identified via their strong Ly$α$ emission. The majority of these LBGs span an absolute UV magnitude range from $-22.0$ to $-20.5$ mag with Ly$α$ equival… ▽ More We present a study of a sample of 45 spectroscopically confirmed, UV luminous galaxies at $z\sim 6$. They were selected as bright Lyman-break galaxies (LBGs) using deep multi-band optical images in more than 2 deg$^2$ of the sky, and subsequently identified via their strong Ly$α$ emission. The majority of these LBGs span an absolute UV magnitude range from $-22.0$ to $-20.5$ mag with Ly$α$ equivalent width (EW) between $\sim$10 and $\sim$200 Å, representing the most luminous galaxies at $z\sim 6$ in terms of both UV continuum emission and Ly$α$ line emission. We model the SEDs of 10 LBGs that have deep infrared observations from HST, JWST, and/or Spitzer, and find that they have a wide range of stellar masses and ages. They also have high star-formation rates ranging from a few tens to a few hundreds of Solar mass per year. Five of the LBGs have JWST or HST images and four of them show compact morphology in these images, including one that is roughly consistent with a point source, suggesting that UV luminous galaxies at this redshift are generally compact. The fraction of our photometrically selected LBGs with strong Ly$α$ emission ($\mathrm{EW}>25$ Å) is about $0.2$, which is consistent with previous results and supports a moderate evolution of the IGM opacity at the end of cosmic reionization. Using deep X-ray images, we do not find evidence of strong AGN activity in these galaxies, but our constraint is loose and we are not able to rule out the possibility of any weak AGN activity. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 19 pages, 11 figures, Accepted for publication in ApJ

arXiv:2401.03579 [pdf, other]

doi 10.1093/mnras/stad3989

GRB 201015A: from seconds to months of optical monitoring and supernova discovery

Authors: S. Belkin, A. S. Pozanenko, P. Y. Minaev, N. S. Pankov, A. A. Volnova, A. Rossi, G. Stratta, S. Benetti, E. Palazzi, A. S. Moskvitin, O. Burhonov, V. V. Rumyantsev, E. V. Klunko, R. Ya. Inasaridze, I. V. Reva, V. Kim, M. Jelinek, D. A. Kann, A. E. Volvach, L. N. Volvach, D. Xu, Z. Zhu, S. Fu, A. A. Mkrtchyan

Abstract: We present full photometric coverage and spectroscopic data for soft GRB 201015A with a redshift z = 0.426. Our data spans a time range of 85 days following the detection of GRB. These observations revealed an underlying supernova SN 201015A with a maximum at $8.54 \pm $1.48 days (rest frame) and an optical peak absolute magnitude $-19.45_{-0.47}^{+0.85}$ mag. The supernova stands out clearly, sin… ▽ More We present full photometric coverage and spectroscopic data for soft GRB 201015A with a redshift z = 0.426. Our data spans a time range of 85 days following the detection of GRB. These observations revealed an underlying supernova SN 201015A with a maximum at $8.54 \pm $1.48 days (rest frame) and an optical peak absolute magnitude $-19.45_{-0.47}^{+0.85}$ mag. The supernova stands out clearly, since the contribution of the afterglow at this time is not dominant, which made it possible to determine SN's parameters. A comparison of these parameters reveals that the SN 201015A is the earliest (the minimum $T_{max}$) known supernova associated with gamma-ray bursts. Spectroscopic observations during the supernova decay stage showed broad lines, indicating a large photospheric velocity, and identified this supernova as a type Ic-BL. Thus, the SN 201015A associated with the GRB 201015A becomes the 27th SN/GRB confirmed by both photometric and spectroscopic observations. Using the results of spectral analysis based on the available data of Fermi-GBM experiment, the parameters $E_\text{p,i} = 20.0 \pm 8.5$ keV and $E_\text{iso} = (1.1 \pm 0.2) \times 10^{50}$ erg were obtained. According to the position of the burst on the $E_\text{p,i}$-$E_\text{iso}$ correlation, GRB 201015A was classified as a Type II (long) gamma-ray burst, which was also confirmed by the $T_\text{90,i}$-$EH$ diagram. △ Less

Submitted 7 January, 2024; originally announced January 2024.

Comments: 15 pages, 10 figures

MSC Class: 85-11 ACM Class: J.2

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 527, Issue 4, February 2024, Pages 11507-11520,

arXiv:2401.02063 [pdf, other]

Windows on the Universe: Establishing the Infrastructure for a Collaborative Multi-messenger Ecosystem

Authors: The 2023 Windows on the Universe Workshop White Paper Working Group, T. Ahumada, J. E. Andrews, S. Antier, E. Blaufuss, P. R. Brady, A. M. Brazier, E. Burns, S. B. Cenko, P. Chandra, D. Chatterjee, A. Corsi, M. W. Coughlin, D. A. Coulter, S. Fu, A. Goldstein, L. P. Guy, E. J. Hooper, S. B. Howell, T. B. Humensky, J. A. Kennea, S. M. Jarrett, R. M. Lau, T. R. Lewis, L. Lu , et al. (21 additional authors not shown)

Abstract: In this White Paper, we present recommendations for the scientific community and funding agencies to foster the infrastructure for a collaborative multi-messenger and time-domain astronomy (MMA/TDA) ecosystem. MMA/TDA is poised for breakthrough discoveries in the coming decade. In much the same way that expanding beyond the optical bandpass revealed entirely new and unexpected discoveries, cosmic… ▽ More In this White Paper, we present recommendations for the scientific community and funding agencies to foster the infrastructure for a collaborative multi-messenger and time-domain astronomy (MMA/TDA) ecosystem. MMA/TDA is poised for breakthrough discoveries in the coming decade. In much the same way that expanding beyond the optical bandpass revealed entirely new and unexpected discoveries, cosmic messengers beyond light (i.e., gravitational waves, neutrinos, and cosmic rays) open entirely new windows to answer some of the most fundamental questions in (astro)physics: heavy element synthesis, equation of state of dense matter, particle acceleration, etc. This field was prioritized as a frontier scientific pursuit in the 2020 Decadal Survey on Astronomy and Astrophysics via its "New Windows on the Dynamic Universe" theme. MMA/TDA science presents technical challenges distinct from those experienced in other disciplines. Successful observations require coordination across myriad boundaries -- different cosmic messengers, ground vs. space, international borders, etc. -- all for sources that may not be well localized, and whose brightness may be changing rapidly with time. Add that all of this work is undertaken by real human beings, with distinct backgrounds, experiences, cultures, and expectations, that often conflict. To address these challenges and help MMA/TDA realize its full scientific potential in the coming decade (and beyond), the second in a series of community workshops sponsored by the U.S. National Science Foundation (NSF) and NASA titled "Windows on the Universe: Establishing the Infrastructure for a Collaborative Multi-Messenger Ecosystem" was held on October 16-18, 2023 in Tucson, AZ. Here we present the primary recommendations from this workshop focused on three key topics -- hardware, software, and people and policy. [abridged] △ Less

Submitted 3 April, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

Comments: Workshop white paper

arXiv:2401.01862 [pdf, other]

A Vision Check-up for Language Models

Authors: Pratyusha Sharma, Tamar Rott Shaham, Manel Baradad, Stephanie Fu, Adrian Rodriguez-Munoz, Shivam Duggal, Phillip Isola, Antonio Torralba

Abstract: What does learning to model relationships between strings teach large language models (LLMs) about the visual world? We systematically evaluate LLMs' abilities to generate and recognize an assortment of visual concepts of increasing complexity and then demonstrate how a preliminary visual representation learning system can be trained using models of text. As language models lack the ability to con… ▽ More What does learning to model relationships between strings teach large language models (LLMs) about the visual world? We systematically evaluate LLMs' abilities to generate and recognize an assortment of visual concepts of increasing complexity and then demonstrate how a preliminary visual representation learning system can be trained using models of text. As language models lack the ability to consume or output visual information as pixels, we use code to represent images in our study. Although LLM-generated images do not look like natural images, results on image generation and the ability of models to correct these generated images indicate that precise modeling of strings can teach language models about numerous aspects of the visual world. Furthermore, experiments on self-supervised visual representation learning, utilizing images generated with text models, highlight the potential to train vision models capable of making semantic assessments of natural images using just LLMs. △ Less

Submitted 3 January, 2024; originally announced January 2024.

arXiv:2401.01165 [pdf, other]

Reinforcement Learning for SAR View Angle Inversion with Differentiable SAR Renderer

Authors: Yanni Wang, Hecheng Jia, Shilei Fu, Hui** Lin, Feng Xu

Abstract: The electromagnetic inverse problem has long been a research hotspot. This study aims to reverse radar view angles in synthetic aperture radar (SAR) images given a target model. Nonetheless, the scarcity of SAR data, combined with the intricate background interference and imaging mechanisms, limit the applications of existing learning-based approaches. To address these challenges, we propose an in… ▽ More The electromagnetic inverse problem has long been a research hotspot. This study aims to reverse radar view angles in synthetic aperture radar (SAR) images given a target model. Nonetheless, the scarcity of SAR data, combined with the intricate background interference and imaging mechanisms, limit the applications of existing learning-based approaches. To address these challenges, we propose an interactive deep reinforcement learning (DRL) framework, where an electromagnetic simulator named differentiable SAR render (DSR) is embedded to facilitate the interaction between the agent and the environment, simulating a human-like process of angle prediction. Specifically, DSR generates SAR images at arbitrary view angles in real-time. And the differences in sequential and semantic aspects between the view angle-corresponding images are leveraged to construct the state space in DRL, which effectively suppress the complex background interference, enhance the sensitivity to temporal variations, and improve the capability to capture fine-grained information. Additionally, in order to maintain the stability and convergence of our method, a series of reward mechanisms, such as memory difference, smoothing and boundary penalty, are utilized to form the final reward function. Extensive experiments performed on both simulated and real datasets demonstrate the effectiveness and robustness of our proposed method. When utilized in the cross-domain area, the proposed method greatly mitigates inconsistency between simulated and real domains, outperforming reference methods significantly. △ Less

Submitted 2 January, 2024; originally announced January 2024.

arXiv:2401.00371 [pdf]

Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval

Authors: Liang Wang, Dawei Dai, Shiyu Fu, Guoyin Wang

Abstract: In specific scenarios, face sketch can be used to identify a person. However, drawing a face sketch often requires exceptional skill and is time-consuming, limiting its widespread applications in actual scenarios. The new framework of sketch less face image retrieval (SLFIR)[1] attempts to overcome the barriers by providing a means for humans and machines to interact during the drawing process. Co… ▽ More In specific scenarios, face sketch can be used to identify a person. However, drawing a face sketch often requires exceptional skill and is time-consuming, limiting its widespread applications in actual scenarios. The new framework of sketch less face image retrieval (SLFIR)[1] attempts to overcome the barriers by providing a means for humans and machines to interact during the drawing process. Considering SLFIR problem, there is a large gap between a partial sketch with few strokes and any whole face photo, resulting in poor performance at the early stages. In this study, we propose a multigranularity (MG) representation learning (MGRL) method to address the SLFIR problem, in which we learn the representation of different granularity regions for a partial sketch, and then, by combining all MG regions of the sketches and images, the final distance was determined. In the experiments, our method outperformed state-of-the-art baselines in terms of early retrieval on two accessible datasets. Codes are available at https://github.com/ddw2AIGROUP2CQUPT/MGRL. △ Less

Submitted 30 December, 2023; originally announced January 2024.

Comments: 5 pages,5 figures

arXiv:2312.17453 [pdf, other]

doi 10.1109/TVLSI.2023.3298327

RHS-TRNG: A Resilient High-Speed True Random Number Generator Based on STT-MTJ Device

Authors: Siqing Fu, Tiejun Li, Chunyuan Zhang, Hanqing Li, Sheng Ma, Jianmin Zhang, Ruiyi Zhang, Lizhou Wu

Abstract: High-quality random numbers are very critical to many fields such as cryptography, finance, and scientific simulation, which calls for the design of reliable true random number generators (TRNGs). Limited by entropy source, throughput, reliability, and system integration, existing TRNG designs are difficult to be deployed in real computing systems to greatly accelerate target applications. This st… ▽ More High-quality random numbers are very critical to many fields such as cryptography, finance, and scientific simulation, which calls for the design of reliable true random number generators (TRNGs). Limited by entropy source, throughput, reliability, and system integration, existing TRNG designs are difficult to be deployed in real computing systems to greatly accelerate target applications. This study proposes a TRNG circuit named RHS-TRNG based on spin-transfer torque magnetic tunnel junction (STT-MTJ). RHS-TRNG generates resilient and high-speed random bit sequences exploiting the stochastic switching characteristics of STT-MTJ. By circuit/system co-design, we integrate RHS-TRNG into a RISC-V processor as an acceleration component, which is driven by customized random number generation instructions. Our experimental results show that a single cell of RHS-TRNG has a random bit generation speed of up to 303 Mb/s, which is the highest among existing MTJ-based TRNGs. Higher throughput can be achieved by exploiting cell-level parallelism. RHS-TRNG also shows strong resilience against PVT variations thanks to our designs using bidirectional switching currents and dual generator units. In addition, our system evaluation results using gem5 simulator suggest that the system equipped with RHS-TRNG can achieve 3.4-12x higher performance in speeding up option pricing programs than software implementations of random number generation. △ Less

Submitted 28 December, 2023; originally announced December 2023.

Comments: Published in: IEEE Transactions on Very Large Scale Integration (VLSI) Systems ( Volume: 31, Issue: 10, October 2023)

Journal ref: IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 31, no. 10, pp. 1578-1591, Oct. 2023

arXiv:2312.16141 [pdf, other]

VirtualPainting: Addressing Sparsity with Virtual Points and Distance-Aware Data Augmentation for 3D Object Detection

Authors: Sudip Dhakal, Dominic Carrillo, Deyuan Qu, Michael Nutt, Qing Yang, Song Fu

Abstract: In recent times, there has been a notable surge in multimodal approaches that decorates raw LiDAR point clouds with camera-derived features to improve object detection performance. However, we found that these methods still grapple with the inherent sparsity of LiDAR point cloud data, primarily because fewer points are enriched with camera-derived features for sparsely distributed objects. We pres… ▽ More In recent times, there has been a notable surge in multimodal approaches that decorates raw LiDAR point clouds with camera-derived features to improve object detection performance. However, we found that these methods still grapple with the inherent sparsity of LiDAR point cloud data, primarily because fewer points are enriched with camera-derived features for sparsely distributed objects. We present an innovative approach that involves the generation of virtual LiDAR points using camera images and enhancing these virtual points with semantic labels obtained from image-based segmentation networks to tackle this issue and facilitate the detection of sparsely distributed objects, particularly those that are occluded or distant. Furthermore, we integrate a distance aware data augmentation (DADA) technique to enhance the models capability to recognize these sparsely distributed objects by generating specialized training samples. Our approach offers a versatile solution that can be seamlessly integrated into various 3D frameworks and 2D semantic segmentation methods, resulting in significantly improved overall detection accuracy. Evaluation on the KITTI and nuScenes datasets demonstrates substantial enhancements in both 3D and birds eye view (BEV) detection benchmarks △ Less

Submitted 26 December, 2023; originally announced December 2023.

arXiv:2312.15104 [pdf, other]

A demonstrator for a real-time AI-FPGA-based triggering system for sPHENIX at RHIC

Authors: J. Kvapil, G. Borca-Tasciuc, H. Bossi, K. Chen, Y. Chen, Y. Corrales Morales, H. Da Costa, C. Da Silva, C. Dean, J. Durham, S. Fu, C. Hao, P. Harris, O. Hen, H. Jheng, Y. Lee, P. Li, X. Li, Y. Lin, M. X. Liu, A. Olvera, M. L. Purschke, M. Rigatti, G. Roland, J. Schambach , et al. (6 additional authors not shown)

Abstract: The RHIC interaction rate at sPHENIX will reach around 3 MHz in pp collisions and requires the detector readout to reject events by a factor of over 200 to fit the DAQ bandwidth of 15 kHz. Some critical measurements, such as heavy flavor production in pp collisions, often require the analysis of particles produced at low momentum. This prohibits adopting the traditional approach, where data rates… ▽ More The RHIC interaction rate at sPHENIX will reach around 3 MHz in pp collisions and requires the detector readout to reject events by a factor of over 200 to fit the DAQ bandwidth of 15 kHz. Some critical measurements, such as heavy flavor production in pp collisions, often require the analysis of particles produced at low momentum. This prohibits adopting the traditional approach, where data rates are reduced through triggering on rare high momentum probes. We explore a new approach based on real-time AI technology, adopt an FPGA-based implementation using a custom designed FELIX-712 board with the Xilinx Kintex Ultrascale FPGA, and deploy the system in the detector readout electronics loop for real-time trigger decision. △ Less

Submitted 27 December, 2023; v1 submitted 22 December, 2023; originally announced December 2023.

Comments: 7 pages, 5 figures, proceedings for TWEPP 2023 conference, v2: corrected Table 1 numbers

Report number: LA-UR-23-32546

arXiv:2312.05981 [pdf]

Stellar Metallicities and Gradients in the Isolated, Quenched Low-Mass Galaxy Tucana

Authors: Sal Wanying Fu, Daniel R. Weisz, Else Starkenburg, Nicolas Martin, Francisco J. Mercado, Alessandro Savino, Michael Boylan-Kolchin, Patrick Côté, Andrew E. Dolphin, Nicolas Longeard, Mario L. Mateo, Jenna Samuel, Nathan R. Sandford

Abstract: We measure the metallicities of 374 red giant branch (RGB) stars in the isolated, quenched dwarf galaxy Tucana using Hubble Space Telescope (HST) narrow-band (F395N) Calcium H & K (CaHK) imaging. Our sample is a factor of $\sim7$ larger than what is published. Our main findings are: (i) A global metallicity distribution function (MDF) with $\langle \mbox{[Fe/H]} \rangle = -1.55 \pm 0.04$ and… ▽ More We measure the metallicities of 374 red giant branch (RGB) stars in the isolated, quenched dwarf galaxy Tucana using Hubble Space Telescope (HST) narrow-band (F395N) Calcium H & K (CaHK) imaging. Our sample is a factor of $\sim7$ larger than what is published. Our main findings are: (i) A global metallicity distribution function (MDF) with $\langle \mbox{[Fe/H]} \rangle = -1.55 \pm 0.04$ and $σ_{\mbox{[Fe/H]}}=0.54\pm0.03$; (ii) A metallicity gradient of $-0.54 \pm 0.07$ dex $R_e^{-1}$ ($-2.1 \pm 0.3$ dex kpc$^{-1}$) over the extent of our imaging ($\sim 2.5 R_e$), which is steeper than literature measurements. Our finding is consistent with predicted gradients from the publicly-available FIRE-2 simulations, in which bursty star formation creates stellar population gradients and dark matter cores; (iii) Tucana's bifurcated RGB has distinct metallicities: a blue RGB with $\langle \mbox{[Fe/H]} \rangle = -1.78 \pm 0.06$ and $σ_{\mbox{[Fe/H]}}=0.44^{+0.07}_{-0.06}$, and a red RGB with $\langle \mbox{[Fe/H]} \rangle = -1.08 \pm 0.07$ and $σ_{\mbox{[Fe/H]}}=0.42 \pm 0.06$; (iv) At fixed stellar mass, Tucana is more MR than MW satellites by $\sim 0.4$ dex, but its blue RGB is chemically comparable to the satellites. Tucana's MDF appears consistent with star-forming isolated dwarfs, though MDFs of the latter are not as well-populated; (v) $\sim2$% of Tucana's stars have $\mbox{[Fe/H]} < -3$ and 20% $\mbox{[Fe/H]} > -1$. We provide a catalog for community spectroscopic follow-up. △ Less

Submitted 17 April, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

Comments: Replaced with ApJ published version; 23 pages, 18 figures

Showing 1–50 of 456 results for author: Fu, S