Search | arXiv e-print repository

Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models

Authors: Yuchen Wu, Minshuo Chen, Zihao Li, Mengdi Wang, Yuting Wei

Abstract: Diffusion models benefit from instillation of task-specific information into the score function to steer the sample generation towards desired properties. Such information is coined as guidance. For example, in text-to-image synthesis, text input is encoded as guidance to generate semantically aligned images. Proper guidance inputs are closely tied to the performance of diffusion models. A common… ▽ More Diffusion models benefit from instillation of task-specific information into the score function to steer the sample generation towards desired properties. Such information is coined as guidance. For example, in text-to-image synthesis, text input is encoded as guidance to generate semantically aligned images. Proper guidance inputs are closely tied to the performance of diffusion models. A common observation is that strong guidance promotes a tight alignment to the task-specific information, while reducing the diversity of the generated samples. In this paper, we provide the first theoretical study towards understanding the influence of guidance on diffusion models in the context of Gaussian mixture models. Under mild conditions, we prove that incorporating diffusion guidance not only boosts classification confidence but also diminishes distribution diversity, leading to a reduction in the differential entropy of the output distribution. Our analysis covers the widely adopted sampling schemes including DDPM and DDIM, and leverages comparison inequalities for differential equations as well as the Fokker-Planck equation that characterizes the evolution of probability density function, which may be of independent theoretical interest. △ Less

Submitted 3 March, 2024; originally announced March 2024.

Comments: 41 pages, 12 figures

arXiv:2403.00985 [pdf, other]

Episodic energy release during the main- and post-impulsive phase of a solar flare

Authors: Yuqian Wei, Bin Chen, Sijie Yu, Haimin Wang, Yixian Zhang, Lindsay Glesener

Abstract: When and where the magnetic field energy is released and converted in eruptive solar flares remains an outstanding topic in solar physics. To shed light on this question, here we report multi-wavelength observations of a C9.4-class eruptive limb flare that occurred on 2017 August 20. The flare, accompanied by a magnetic flux rope eruption and a white light coronal mass ejection, features three pos… ▽ More When and where the magnetic field energy is released and converted in eruptive solar flares remains an outstanding topic in solar physics. To shed light on this question, here we report multi-wavelength observations of a C9.4-class eruptive limb flare that occurred on 2017 August 20. The flare, accompanied by a magnetic flux rope eruption and a white light coronal mass ejection, features three post-impulsive X-ray and microwave bursts immediately following its main impulsive phase. For each burst, both microwave and X-ray imaging suggest that the non-thermal electrons are located in the above-the-loop-top region. Interestingly, contrary to many other flares, the peak flux of the three post-impulsive microwave and X-ray bursts shows an increase for later bursts. Spectral analysis reveals that the sources have a hardening spectral index, suggesting a more efficient electron acceleration into the later post-impulsive bursts. We observe a positive correlation between the acceleration of the magnetic flux rope and the non-thermal energy release during the post-impulsive bursts in the same event. Intriguingly, different from some other eruptive events, this correlation does not hold for the main impulse phase of this event, which we interpret as energy release due to the tether-cutting reconnection before the primary flux rope acceleration occurs. In addition, using footpoint brightenings at conjugate flare ribbons, a weakening reconnection guide field is inferred, which may also contribute to the hardening of the non-thermal electrons during the post-impulsive phase. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Comments: 21 pages, 14 figures

arXiv:2403.00728 [pdf]

Emergence of interfacial magnetism in strongly-correlated nickelate-titanate superlattices

Authors: Teguh Citra Asmara, Robert J. Green, Andreas Suter, Yuan Wei, Wenliang Zhang, Grant Harris, Yi Tseng, Tianlun Yu, Davide Betto, Mirian Garcia-Fernandez, Stefano Agrestini, Yannick Maximilian Klein, Neeraj Kumar, Carlos William Galdino, Zaher Salman, Thomas Prokscha, Marisa Medarde, Elisabeth Müller, Yona Soh, Nicholas B. Brookes, Ke-** Zhou, Milan Radovic, Thorsten Schmitt

Abstract: Strongly-correlated transition-metal oxides are widely known for their various exotic phenomena. This is exemplified by rare-earth nickelates such as LaNiO$_{3}$, which possess intimate interconnections between their electronic, spin, and lattice degrees of freedom. Their properties can be further enhanced by pairing them in hybrid heterostructures, which can lead to hidden phases and emergent phe… ▽ More Strongly-correlated transition-metal oxides are widely known for their various exotic phenomena. This is exemplified by rare-earth nickelates such as LaNiO$_{3}$, which possess intimate interconnections between their electronic, spin, and lattice degrees of freedom. Their properties can be further enhanced by pairing them in hybrid heterostructures, which can lead to hidden phases and emergent phenomena. An important example is the LaNiO$_{3}$/LaTiO$_{3}$ superlattice, where an interlayer electron transfer has been observed from LaTiO$_{3}$ into LaNiO$_{3}$ and is predicted to result in a high-spin state. However, macroscopic emergence of magnetic order has so far not been observed. Here, by using muon spin rotation, x-ray absorption, and resonant inelastic x-ray scattering, we present direct evidence of an emergent antiferromagnetic order with high magnon energy and exchange interactions at the LaNiO$_{3}$/LaTiO$_{3}$ interface. As the magnetism is purely interfacial, a single LaNiO$_{3}$/LaTiO$_{3}$ interface can essentially behave as an atomically thin quasi-two-dimensional antiferromagnet, potentially allowing its technological utilisation in advanced spintronic devices. Furthermore, its strong quasi-two-dimensional magnetic correlations and orbitally-polarized planar ligand holes make its electronic and magnetic configurations resemble the precursor states of superconducting cuprates and nickelates, but with an S $\rightarrow$ 1 spin state instead. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Comments: 41 pages, 13 figures

arXiv:2403.00628 [pdf, other]

Region-Adaptive Transform with Segmentation Prior for Image Compression

Authors: Yuxi Liu, Wenhan Yang, Huihui Bai, Yunchao Wei, Yao Zhao

Abstract: Learned Image Compression (LIC) has shown remarkable progress in recent years. Existing works commonly employ CNN-based or self-attention-based modules as transform methods for compression. However, there is no prior research on neural transform that focuses on specific regions. In response, we introduce the class-agnostic segmentation masks (i.e. semantic masks without category labels) for extrac… ▽ More Learned Image Compression (LIC) has shown remarkable progress in recent years. Existing works commonly employ CNN-based or self-attention-based modules as transform methods for compression. However, there is no prior research on neural transform that focuses on specific regions. In response, we introduce the class-agnostic segmentation masks (i.e. semantic masks without category labels) for extracting region-adaptive contextual information. Our proposed module, Region-Adaptive Transform, applies adaptive convolutions on different regions guided by the masks. Additionally, we introduce a plug-and-play module named Scale Affine Layer to incorporate rich contexts from various regions. While there have been prior image compression efforts that involve segmentation masks as additional intermediate inputs, our approach differs significantly from them. Our advantages lie in that, to avoid extra bitrate overhead, we treat these masks as privilege information, which is accessible during the model training stage but not required during the inference phase. To the best of our knowledge, we are the first to employ class-agnostic masks as privilege information and achieve superior performance in pixel-fidelity metrics, such as Peak Signal to Noise Ratio (PSNR). The experimental results demonstrate our improvement compared to previously well-performing methods, with about 8.2% bitrate saving compared to VTM-17.0. The source code is available at https://github.com/GityuxiLiu/SegPIC-for-Image-Compression. △ Less

Submitted 9 July, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

Comments: Accepted to ECCV 2024

arXiv:2402.19173 [pdf, other]

StarCoder 2 and The Stack v2: The Next Generation

Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data… ▽ More The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data sources, such as GitHub pull requests, Kaggle notebooks, and code documentation. This results in a training set that is 4x larger than the first StarCoder dataset. We train StarCoder2 models with 3B, 7B, and 15B parameters on 3.3 to 4.3 trillion tokens and thoroughly evaluate them on a comprehensive set of Code LLM benchmarks. We find that our small model, StarCoder2-3B, outperforms other Code LLMs of similar size on most benchmarks, and also outperforms StarCoderBase-15B. Our large model, StarCoder2- 15B, significantly outperforms other models of comparable size. In addition, it matches or outperforms CodeLlama-34B, a model more than twice its size. Although DeepSeekCoder- 33B is the best-performing model at code completion for high-resource languages, we find that StarCoder2-15B outperforms it on math and code reasoning benchmarks, as well as several low-resource languages. We make the model weights available under an OpenRAIL license and ensure full transparency regarding the training data by releasing the SoftWare Heritage persistent IDentifiers (SWHIDs) of the source code data. △ Less

Submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.18463 [pdf, other]

Understanding the Impact of AI Generated Content on Social Media: The Pixiv Case

Authors: Yiluo Wei, Gareth Tyson

Abstract: In the last two years, Artificial Intelligence Generated Content (AIGC) has received significant attention, leading to an anecdotal rise in the amount of AIGC being shared via social media platforms. The impact of AIGC and its implications are of key importance to social platforms, e.g., regarding the implementation of policies, community formation, and algorithmic design. Yet, to date, we know li… ▽ More In the last two years, Artificial Intelligence Generated Content (AIGC) has received significant attention, leading to an anecdotal rise in the amount of AIGC being shared via social media platforms. The impact of AIGC and its implications are of key importance to social platforms, e.g., regarding the implementation of policies, community formation, and algorithmic design. Yet, to date, we know little about how the arrival of AIGC has impacted the social media ecosystem. To fill this gap, we present a comprehensive study of Pixiv, an online community for artists who wish to share and receive feedback on their illustrations. Pixiv hosts over 100 million artistic submissions and receives more than 1 billion page views per month (as of 2023). Importantly, it allows both human and AI generated content to be uploaded. Exploiting this, we perform the first analysis of the impact that AIGC has had on the social media ecosystem, through the lens of Pixiv. Based on a dataset of 15.2 million posts (including 2.4 million AI-generated images), we measure the impact of AIGC on the Pixiv community, as well as the differences between AIGC and human-generated content in terms of content creation and consumption patterns. Our results offer key insight to how AIGC is changing the dynamics of social media platforms like Pixiv. △ Less

Submitted 28 February, 2024; originally announced February 2024.

arXiv:2402.16213 [pdf, ps, other]

Sparse gradient bounds for divergence form elliptic equations

Authors: Olli Saari, Hua-Yang Wang, Yuanhong Wei

Abstract: We provide sparse estimates for gradients of solutions to divergence form elliptic partial differential equations in terms of the source data. We give a general result of Meyers (or Gehring) type, a result for linear equations with VMO coefficients and a result for linear equations with Dini continuous coefficients. In addition, we provide an abstract theorem conditional on PDE estimates available… ▽ More We provide sparse estimates for gradients of solutions to divergence form elliptic partial differential equations in terms of the source data. We give a general result of Meyers (or Gehring) type, a result for linear equations with VMO coefficients and a result for linear equations with Dini continuous coefficients. In addition, we provide an abstract theorem conditional on PDE estimates available. The linear results have the full range of weighted estimates with Muckenhoupt weights as a consequence. △ Less

Submitted 23 May, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

Comments: v2: writing improved all over and more details added

arXiv:2402.16033 [pdf, other]

Diving Deep into Regions: Exploiting Regional Information Transformer for Single Image Deraining

Authors: Baiang Li, Zhao Zhang, Huan Zheng, Xiaogang Xu, Yanyan Wei, **gyi Zhang, Jicong Fan, Meng Wang

Abstract: Transformer-based Single Image Deraining (SID) methods have achieved remarkable success, primarily attributed to their robust capability in capturing long-range interactions. However, we've noticed that current methods handle rain-affected and unaffected regions concurrently, overlooking the disparities between these areas, resulting in confusion between rain streaks and background parts, and inab… ▽ More Transformer-based Single Image Deraining (SID) methods have achieved remarkable success, primarily attributed to their robust capability in capturing long-range interactions. However, we've noticed that current methods handle rain-affected and unaffected regions concurrently, overlooking the disparities between these areas, resulting in confusion between rain streaks and background parts, and inabilities to obtain effective interactions, ultimately resulting in suboptimal deraining outcomes. To address the above issue, we introduce the Region Transformer (Regformer), a novel SID method that underlines the importance of independently processing rain-affected and unaffected regions while considering their combined impact for high-quality image reconstruction. The crux of our method is the innovative Region Transformer Block (RTB), which integrates a Region Masked Attention (RMA) mechanism and a Mixed Gate Forward Block (MGFB). Our RTB is used for attention selection of rain-affected and unaffected regions and local modeling of mixed scales. The RMA generates attention maps tailored to these two regions and their interactions, enabling our model to capture comprehensive features essential for rain removal. To better recover high-frequency textures and capture more local details, we develop the MGFB as a compensation module to complete local mixed scale modeling. Extensive experiments demonstrate that our model reaches state-of-the-art performance, significantly improving the image deraining quality. Our code and trained models are publicly available. △ Less

Submitted 25 February, 2024; originally announced February 2024.

arXiv:2402.15296 [pdf, other]

Polaritonic Chemistry Enabled by Non-Local Metasurfaces

Authors: Francesco Verdelli, Yu-Chen Wei, Kripa Joseph, Mohamed S. Abdelkhalik, Goudarzi Masoumeh, Sven H. C. Askes, Andrea Baldi, E. W. Meijer, Jaime Gómez Rivas

Abstract: Vibrational strong coupling can modify chemical reaction pathways in unconventional ways. Thus far, Fabry-Perot cavities formed by pairs of facing mirrors have been mostly utilized to achieve vibrational strong coupling. In this study, we demonstrate the application of plasmonic microparticle arrays defining non-local metasurfaces that can sustain surface lattice resonances as a novel tool to enab… ▽ More Vibrational strong coupling can modify chemical reaction pathways in unconventional ways. Thus far, Fabry-Perot cavities formed by pairs of facing mirrors have been mostly utilized to achieve vibrational strong coupling. In this study, we demonstrate the application of plasmonic microparticle arrays defining non-local metasurfaces that can sustain surface lattice resonances as a novel tool to enable chemical reactions under vibrational strong coupling. We show that the solvolysis kinetics of \textit{para}-nitrophenyl acetate can be accelerated by a factor of 2.7 by strong coupling to the carbonyl bond of the solvent and the solute with a surface lattice resonance. Our work introduces a new platform to investigate and control polaritonic chemical reactions. In contrast to Fabry-Perot cavities, metasurfaces define open optical cavities with single surfaces, which removes alignment hurdles, facilitating polaritonic chemistry across large areas. △ Less

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.15051 [pdf]

Prediction of Fishbone Linear Instability in Tokamaks with Machine Learning Methods

Authors: Z. Y. Liu, H. R. Qiu, G. Y. Fu, Y. Xiao, Y. C. Chen, Z. J. Wang, Y. X. Wei

Abstract: A machine learning based surrogate model for fishbone linear instability in tokamaks is constructed. Hybrid simulations with the kinetic-magnetohydrodynamic (MHD) code M3D-K is used to generate the database of fishbone linear instability, through scanning the four key parameters which are thought to determine the fishbone physics. The four key parameters include (1) central total beta of both ther… ▽ More A machine learning based surrogate model for fishbone linear instability in tokamaks is constructed. Hybrid simulations with the kinetic-magnetohydrodynamic (MHD) code M3D-K is used to generate the database of fishbone linear instability, through scanning the four key parameters which are thought to determine the fishbone physics. The four key parameters include (1) central total beta of both thermal plasma and fast ions, (2) the fast ion pressure fraction, (3) central value of safety factor $q$ and (4) the radius of $q=1$ surface. Four machine learning methods including linear regression, support vector machines (SVM) with linear kernel, SVM with nonlinear kernel and multi-layer perceptron are used to predict the fishbone instability, growth rate and real frequency, mode structure respectively. Among the four methods, SVM with nonlinear kernel performs very well to predict the linear instability with accuracy $\approx$95%, growth rate and real frequency with $R^2\approx$98%, mode structure with $R^2\approx$98%. △ Less

Submitted 22 February, 2024; originally announced February 2024.

Comments: 28 pages,19 figures

arXiv:2402.14975 [pdf, other]

doi 10.1103/PhysRevLett.132.242301

Modification of $χ_{c1}$(3872) and $ψ$(2$S$) production in $p$Pb collisions at $\sqrt{s_{NN}} = 8.16$ TeV

Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1082 additional authors not shown)

Abstract: The LHCb collaboration measures production of the exotic hadron $χ_{c1}$(3872) in proton-nucleus collisions for the first time. Comparison with the charmonium state $ψ$(2$S$) suggests that the exotic $χ_{c1}$(3872) experiences different dynamics in the nuclear medium than conventional hadrons, and comparison with data from proton-proton collisions indicates that the presence of the nucleus may mod… ▽ More The LHCb collaboration measures production of the exotic hadron $χ_{c1}$(3872) in proton-nucleus collisions for the first time. Comparison with the charmonium state $ψ$(2$S$) suggests that the exotic $χ_{c1}$(3872) experiences different dynamics in the nuclear medium than conventional hadrons, and comparison with data from proton-proton collisions indicates that the presence of the nucleus may modify $χ_{c1}$(3872) production rates. This is the first measurement of the nuclear modification factor of an exotic hadron. △ Less

Submitted 19 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-026.html (LHCb public pages)

Report number: LHCb-PAPER-2023-026, CERN-EP-2024-033

Journal ref: Phys. Rev. Lett. 132 (2024) 242301

arXiv:2402.14748 [pdf, other]

TMD factorisation for diffractive jets in photon-nucleus interactions

Authors: S. Hauksson, E. Iancu, A. H. Mueller, D. N. Triantafyllopoulos, S. Y. Wei

Abstract: Using the colour dipole picture and the colour glass condensate effective theory, we study the diffractive production of two or three jets via coherent photon-nucleus interactions at high energy. We consider the hard regime where the photon virtuality and/or the transverse momenta of the produced jets are much larger than the saturation momentum $Q_s$ of the nuclear target. We show that, despite t… ▽ More Using the colour dipole picture and the colour glass condensate effective theory, we study the diffractive production of two or three jets via coherent photon-nucleus interactions at high energy. We consider the hard regime where the photon virtuality and/or the transverse momenta of the produced jets are much larger than the saturation momentum $Q_s$ of the nuclear target. We show that, despite this hardness, the leading-twist contributions are controlled by relatively large parton configurations, with transverse sizes $R\sim 1/Q_s$, which undergo strong scattering and probe gluon saturation. For exclusive dijets, this implies that both final jets have semi-hard transverse momenta ($P_\perp\sim Q_s$) and that one of them is aligned with the photon. The dominant contributions to the diffractive production of hard dijets ($P_\perp\gg Q_s$) rather come from three-jet final states, which are very asymmetric and will be referred to as 2+1 jets: two of the jets are hard, while the third one is semi-hard. We demonstrate that the leading-twist contributions to both exclusive dijets and the diffractive production of 2+1 jets admit transverse-momentum dependent (TMD) factorisation, in terms of quark and gluon diffractive TMD distribution functions, for which we obtain explicit expressions from first principles. We show that the contribution of 2+1 jets to diffractive SIDIS (semi-inclusive deep inelastic scattering) takes the form of one step in the DGLAP evolution of the quark diffractive PDF. △ Less

Submitted 22 February, 2024; originally announced February 2024.

Comments: 89 pages, 24 figures, 4 appendices

arXiv:2402.14655 [pdf]

Evaluating Cognitive and Neuropsychological Assessments -- A Comprehensive Review

Authors: Chuang Li, Rubing Lin, Yantong Liu, Yichen Wei

Abstract: Cognitive impairments in older adults represent a significant public health concern, necessitating accurate diagnostic and monitoring strategies. In this study, the principal cognitive and neuropsychological evaluations employed for the diagnosis and longitudinal observation of cognitive deficits in the elderly are investigated. An analytical review of instruments including the Mini-Mental State E… ▽ More Cognitive impairments in older adults represent a significant public health concern, necessitating accurate diagnostic and monitoring strategies. In this study, the principal cognitive and neuropsychological evaluations employed for the diagnosis and longitudinal observation of cognitive deficits in the elderly are investigated. An analytical review of instruments including the Mini-Mental State Examination (MMSE), Digit Symbol Substitution Test (DSST), Montreal Cognitive Assessment (MoCA), and Trail Making Test (TMT) is conducted. This examination encompasses an assessment of each instrument's methodology, efficacy, advantages, and limitations. The objective is to enhance comprehension of these assessments for the early identification and effective management of conditions such as dementia and mild cognitive impairment, thereby contributing to the advancement of cognitive health within the geriatric population. △ Less

Submitted 22 February, 2024; originally announced February 2024.

arXiv:2402.14328 [pdf, other]

Understanding and Patching Compositional Reasoning in LLMs

Authors: Zhaoyi Li, Gangwei Jiang, Hong Xie, Linqi Song, Defu Lian, Ying Wei

Abstract: LLMs have marked a revolutonary shift, yet they falter when faced with compositional reasoning tasks. Our research embarks on a quest to uncover the root causes of compositional reasoning failures of LLMs, uncovering that most of them stem from the improperly generated or leveraged implicit reasoning results. Inspired by our empirical findings, we resort to Logit Lens and an intervention experimen… ▽ More LLMs have marked a revolutonary shift, yet they falter when faced with compositional reasoning tasks. Our research embarks on a quest to uncover the root causes of compositional reasoning failures of LLMs, uncovering that most of them stem from the improperly generated or leveraged implicit reasoning results. Inspired by our empirical findings, we resort to Logit Lens and an intervention experiment to dissect the inner hidden states of LLMs. This deep dive reveals that implicit reasoning results indeed surface within middle layers and play a causative role in sha** the final explicit reasoning results. Our exploration further locates multi-head self-attention (MHSA) modules within these layers, which emerge as the linchpins in accurate generation and leveraing of implicit reasoning results. Grounded on the above findings, we develop CREME, a lightweight method to patch errors in compositional reasoning via editing the located MHSA modules. Our empirical evidence stands testament to CREME's effectiveness, paving the way for autonomously and continuously enhancing compositional reasoning capabilities in language models. △ Less

Submitted 6 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: Accepted by ACL'2024 Findings

arXiv:2402.13717 [pdf, other]

Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent

Authors: Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Hao Peng, Liehuang Zhu

Abstract: Large Language Models (LLMs) have revolutionized open-domain dialogue agents but encounter challenges in multi-character role-playing (MCRP) scenarios. To address the issue, we present Neeko, an innovative framework designed for efficient multiple characters imitation. Unlike existing methods, Neeko employs a dynamic low-rank adapter (LoRA) strategy, enabling it to adapt seamlessly to diverse char… ▽ More Large Language Models (LLMs) have revolutionized open-domain dialogue agents but encounter challenges in multi-character role-playing (MCRP) scenarios. To address the issue, we present Neeko, an innovative framework designed for efficient multiple characters imitation. Unlike existing methods, Neeko employs a dynamic low-rank adapter (LoRA) strategy, enabling it to adapt seamlessly to diverse characters. Our framework breaks down the role-playing process into agent pre-training, multiple characters playing, and character incremental learning, effectively handling both seen and unseen roles. This dynamic approach, coupled with distinct LoRA blocks for each character, enhances Neeko's adaptability to unique attributes, personalities, and speaking patterns. As a result, Neeko demonstrates superior performance in MCRP over most existing methods, offering more engaging and versatile user interaction experiences. Code and data are available at https://github.com/weiyifan1023/Neeko. △ Less

Submitted 1 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

arXiv:2402.13491 [pdf, other]

Algebraic Riccati Tensor Equations with Applications in Multilinear Control Systems

Authors: Yuchao Wang, Yimin Wei, Guofeng Zhang, Shih Yu Chang

Abstract: In a recent interesting paper [8], Chen et al. initialized the control-theoretic study of a class of discrete-time multilinear time-invariant (MLTI) control systems, where system states, inputs and outputs are all tensors endowed with the Einstein product. Criteria for fundamental system-theoretic notions such as stability, reachability and observability are established by means of tensor decompos… ▽ More In a recent interesting paper [8], Chen et al. initialized the control-theoretic study of a class of discrete-time multilinear time-invariant (MLTI) control systems, where system states, inputs and outputs are all tensors endowed with the Einstein product. Criteria for fundamental system-theoretic notions such as stability, reachability and observability are established by means of tensor decomposition. The purpose of this paper is to continue this novel research direction. Specifically, we focus on continuous-time MLTI control systems. We define Hamiltonian tensors and symplectic tensors and establish the Schur-Hamiltonian tensor decomposition and symplectic tensor singular value decomposition (SVD). Based on these we propose the algebraic Riccati tensor equation (ARTE) and show that it has a unique positive semidefinite solution if the system is stablizable and detectable. A tensor-based Newton method is proposed to find numerical solutions of the ARTE. The tensor version of the bounded real lemma is also established. A first-order robustness analysis of the ARTE is conducted. Finally, a numerical example is used to demonstrate the proposed theory and algorithms. △ Less

Submitted 20 February, 2024; originally announced February 2024.

Comments: 25 pages, 6 figures

MSC Class: 15A69; 93B35; 93C05; 93D15

arXiv:2402.11949 [pdf, ps, other]

Multiwavelength Polarization Observations of Mrk 501

Authors: Xin-Ke Hu, Yu-Wei Yu, ** Zhang, Xiang-Gao Wang, Kishore C. Patra, Thomas G. Brink, Wei-Kang Zheng, Qi Wang, De-Feng Kong, Liang-Jun Chen, Ji-Wang Zhou, Jia-Xin Cao, Ming-Xuan Lu, Zi-Min Zhou, Yi-Ning Wei, Xin-Bo Huang, Xing-Lin Li, Hao Lou, Ji-Rong Mao, En-Wei Liang, Alexei V. Filippenko

Abstract: Mrk 501 is a prototypical high-synchrotron-peaked blazar (HBL) and serves as one of the primary targets for the {\it Imaging X-ray Polarimetry Explorer} ({\it IXPE}). In this study, we report X-ray polarization measurements of Mrk 501 based on six {\it IXPE} observations. The detection of X-ray polarization at a confidence level exceeding 99\% is achieved in four out of the six observations conduc… ▽ More Mrk 501 is a prototypical high-synchrotron-peaked blazar (HBL) and serves as one of the primary targets for the {\it Imaging X-ray Polarimetry Explorer} ({\it IXPE}). In this study, we report X-ray polarization measurements of Mrk 501 based on six {\it IXPE} observations. The detection of X-ray polarization at a confidence level exceeding 99\% is achieved in four out of the six observations conducted across the entire energy range (2--8 keV) of {\it IXPE}. The maximum polarization degree ($Π_{\rm X}$) is measured to be $15.8\%\pm2.8\%$, accompanied by a polarization angle ($ψ_{\rm X}$) of $98.0°\pm5.1°$ at a confidence level of $5.6 σ$. During the remaining two observations, only an upper limit of $Π_{\rm X}<$12\% could be derived at the 99\% confidence level. No temporal variability in polarization is observed throughout all six {\it IXPE} observations for Mrk 501. A discernible trend of energy-dependent variation in the polarization degree is detected in optical spectropolarimetry; however, no analogous indication is observed in $Π_{\rm X}$. The chromatic behavior of $Π$ and the consistent values of $ψ$ across different frequencies from X-rays to radio waves, along with the agreement between $ψ$ and jet position angle, strongly support the interpretation of the energy-stratified model with shock-accelerated particles in the jet of Mrk 501. Additionally, the possibility of the presence of a global helical magnetic field in the jet of Mrk 501 is discussed. △ Less

Submitted 3 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: 20 pages, 8 figures, 4 tables, accepted for publication in ApJL

arXiv:2402.10932 [pdf]

Roadmap on Data-Centric Materials Science

Authors: Stefan Bauer, Peter Benner, Tristan Bereau, Volker Blum, Mario Boley, Christian Carbogno, C. Richard A. Catlow, Gerhard Dehm, Sebastian Eibl, Ralph Ernstorfer, Ádám Fekete, Lucas Foppa, Peter Fratzl, Christoph Freysoldt, Baptiste Gault, Luca M. Ghiringhelli, Sajal K. Giri, Anton Gladyshev, Pawan Goyal, Jason Hattrick-Simpers, Lara Kabalan, Petr Karpov, Mohammad S. Khorrami, Christoph Koch, Sebastian Kokott , et al. (36 additional authors not shown)

Abstract: Science is and always has been based on data, but the terms "data-centric" and the "4th paradigm of" materials research indicate a radical change in how information is retrieved, handled and research is performed. It signifies a transformative shift towards managing vast data collections, digital repositories, and innovative data analytics methods. The integration of Artificial Intelligence (AI) a… ▽ More Science is and always has been based on data, but the terms "data-centric" and the "4th paradigm of" materials research indicate a radical change in how information is retrieved, handled and research is performed. It signifies a transformative shift towards managing vast data collections, digital repositories, and innovative data analytics methods. The integration of Artificial Intelligence (AI) and its subset Machine Learning (ML), has become pivotal in addressing all these challenges. This Roadmap on Data-Centric Materials Science explores fundamental concepts and methodologies, illustrating diverse applications in electronic-structure theory, soft matter theory, microstructure research, and experimental techniques like photoemission, atom probe tomography, and electron microscopy. While the roadmap delves into specific areas within the broad interdisciplinary field of materials science, the provided examples elucidate key concepts applicable to a wider range of topics. The discussed instances offer insights into addressing the multifaceted challenges encountered in contemporary materials research. △ Less

Submitted 1 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Review, outlook, roadmap, perspective

arXiv:2402.08931 [pdf, other]

Depth-aware Volume Attention for Texture-less Stereo Matching

Authors: Tong Zhao, Mingyu Ding, Wei Zhan, Masayoshi Tomizuka, Yintao Wei

Abstract: Stereo matching plays a crucial role in 3D perception and scenario understanding. Despite the proliferation of promising methods, addressing texture-less and texture-repetitive conditions remains challenging due to the insufficient availability of rich geometric and semantic information. In this paper, we propose a lightweight volume refinement scheme to tackle the texture deterioration in practic… ▽ More Stereo matching plays a crucial role in 3D perception and scenario understanding. Despite the proliferation of promising methods, addressing texture-less and texture-repetitive conditions remains challenging due to the insufficient availability of rich geometric and semantic information. In this paper, we propose a lightweight volume refinement scheme to tackle the texture deterioration in practical outdoor scenarios. Specifically, we introduce a depth volume supervised by the ground-truth depth map, capturing the relative hierarchy of image texture. Subsequently, the disparity discrepancy volume undergoes hierarchical filtering through the incorporation of depth-aware hierarchy attention and target-aware disparity attention modules. Local fine structure and context are emphasized to mitigate ambiguity and redundancy during volume aggregation. Furthermore, we propose a more rigorous evaluation metric that considers depth-wise relative error, providing comprehensive evaluations for universal stereo matching and depth estimation models. We extensively validate the superiority of our proposed methods on public datasets. Results demonstrate that our model achieves state-of-the-art performance, particularly excelling in scenarios with texture-less images. The code is available at https://github.com/ztsrxh/DVANet. △ Less

Submitted 26 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

Comments: 10 pages, 6 figures

arXiv:2402.07904 [pdf]

An entertaining resonance experiment with just two spring scales

Authors: Yajun Wei

Abstract: Resonance is a topic included in most introductory physics courses. Any mechanical system experiences resonance if it is driven by a periodic force with a frequency that matches its natural frequency. There are plenty of simple demonstrations of the resonance phenomena of mechanical systems which can be set up using readily available items . This work presents a very simple approach to demonstrate… ▽ More Resonance is a topic included in most introductory physics courses. Any mechanical system experiences resonance if it is driven by a periodic force with a frequency that matches its natural frequency. There are plenty of simple demonstrations of the resonance phenomena of mechanical systems which can be set up using readily available items . This work presents a very simple approach to demonstrate the phenomena using just two spring scales. The experiment presented here performs a "frequency sweep" and is also very entertaining to watch. △ Less

Submitted 11 December, 2023; originally announced February 2024.

Comments: 5 pages, 3 figures, Accepted by The Physics Teacher

arXiv:2402.07802 [pdf, ps, other]

Towards a mathematical theory for consistency training in diffusion models

Authors: Gen Li, Zhihan Huang, Yuting Wei

Abstract: Consistency models, which were proposed to mitigate the high computational overhead during the sampling phase of diffusion models, facilitate single-step sampling while attaining state-of-the-art empirical performance. When integrated into the training phase, consistency models attempt to train a sequence of consistency functions capable of map** any point at any time step of the diffusion proce… ▽ More Consistency models, which were proposed to mitigate the high computational overhead during the sampling phase of diffusion models, facilitate single-step sampling while attaining state-of-the-art empirical performance. When integrated into the training phase, consistency models attempt to train a sequence of consistency functions capable of map** any point at any time step of the diffusion process to its starting point. Despite the empirical success, a comprehensive theoretical understanding of consistency training remains elusive. This paper takes a first step towards establishing theoretical underpinnings for consistency models. We demonstrate that, in order to generate samples within $\varepsilon$ proximity to the target in distribution (measured by some Wasserstein metric), it suffices for the number of steps in consistency learning to exceed the order of $d^{5/2}/\varepsilon$, with $d$ the data dimension. Our theory offers rigorous insights into the validity and efficacy of consistency models, illuminating their utility in downstream inference tasks. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: The first two authors contributed equally

arXiv:2402.06265 [pdf, other]

Rational Design of Molybdenum Transition-Metal subnanoclusters catalysts with Particle Swarm Optimization

Authors: Yao Wei, Alejandro Santana-Bonilla, Lev Kantorovich

Abstract: The development of novel sub-nanometer clusters (SNCs) catalysts with superior catalytic performance depends on the precise control of clusters' atomistic sizes, shapes, and accurate deposition onto surfaces. Recent advancements in manufacturing and characterization techniques have paved the way for the production and atomic resolution characterization of transition-metal SNCs catalysts, positioni… ▽ More The development of novel sub-nanometer clusters (SNCs) catalysts with superior catalytic performance depends on the precise control of clusters' atomistic sizes, shapes, and accurate deposition onto surfaces. Recent advancements in manufacturing and characterization techniques have paved the way for the production and atomic resolution characterization of transition-metal SNCs catalysts, positioning them as a promising new class of materials for this application. Nevertheless, the intrinsic complexity of the adsorption process complicates the ability to achieve an atomistic understanding of the most relevant structure-reactivity relationships hampering the rational design of novel catalytic materials. In most cases, existing computational approaches rely on just a few structures to conclude clusters' reactivity thereby neglecting the complexity of the existing energy landscapes thus leading to insufficient sampling and, most likely, unreliable predictions. Moreover, modelling of the actual experimental procedure that is responsible for the deposition of SNCs on surfaces is often not done even though in some cases this procedure may enhance the significance of certain (e.g., metastable) adsorption geometries. This study proposes a novel approach that utilizes particle swarm optimization (PSO) method, in conjunction with ab-initio calculations, to predict the most relevant SNCs structures on a surface in beam experiments, and consequently their reactivity. To illustrate the main steps of our approach, we consider the deposition of Molybdenum SNC of 6 Mo atoms on a free-standing graphene surface, as well as their catalytic properties concerning the CO molecule dissociation reaction. This study demonstrates the feasibility of the PSO technique for studying catalyst transition-metal SNCs and establishes a reliable procedure for performing theoretical rational design predictions. △ Less

Submitted 9 February, 2024; originally announced February 2024.

Comments: 19 pages, 11 figures, due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract appearing here is slightly shorter than that in the PDF file

arXiv:2402.06244 [pdf, other]

Quantifying and Enhancing Multi-modal Robustness with Modality Preference

Authors: Zequn Yang, Yake Wei, Ce Liang, Di Hu

Abstract: Multi-modal models have shown a promising capability to effectively integrate information from various sources, yet meanwhile, they are found vulnerable to pervasive perturbations, such as uni-modal attacks and missing conditions. To counter these perturbations, robust multi-modal representations are highly expected, which are positioned well away from the discriminative multi-modal decision bound… ▽ More Multi-modal models have shown a promising capability to effectively integrate information from various sources, yet meanwhile, they are found vulnerable to pervasive perturbations, such as uni-modal attacks and missing conditions. To counter these perturbations, robust multi-modal representations are highly expected, which are positioned well away from the discriminative multi-modal decision boundary. In this paper, different from conventional empirical studies, we focus on a commonly used joint multi-modal framework and theoretically discover that larger uni-modal representation margins and more reliable integration for modalities are essential components for achieving higher robustness. This discovery can further explain the limitation of multi-modal robustness and the phenomenon that multi-modal models are often vulnerable to attacks on the specific modality. Moreover, our analysis reveals how the widespread issue, that the model has different preferences for modalities, limits the multi-modal robustness by influencing the essential components and could lead to attacks on the specific modality highly effective. Inspired by our theoretical finding, we introduce a training procedure called Certifiable Robust Multi-modal Training (CRMT), which can alleviate this influence from modality preference and explicitly regulate essential components to significantly improve robustness in a certifiable manner. Our method demonstrates substantial improvements in performance and robustness compared with existing methods. Furthermore, our training procedure can be easily extended to enhance other robust training strategies, highlighting its credibility and flexibility. △ Less

Submitted 18 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

Comments: Accepted to ICLR 2024

arXiv:2402.05910 [pdf, other]

Background independent tensor networks

Authors: Chris Akers, Annie Y. Wei

Abstract: Conventional holographic tensor networks can be described as toy holographic maps constructed from many small linear maps acting in a spatially local way, all connected together with ``background entanglement'', i.e. links of a fixed state, often the maximally entangled state. However, these constructions fall short of modeling real holographic maps. One reason is that their ``areas'' are trivial,… ▽ More Conventional holographic tensor networks can be described as toy holographic maps constructed from many small linear maps acting in a spatially local way, all connected together with ``background entanglement'', i.e. links of a fixed state, often the maximally entangled state. However, these constructions fall short of modeling real holographic maps. One reason is that their ``areas'' are trivial, taking the same value for all states, unlike in gravity where the geometry is dynamical. Recently, new constructions have ameliorated this issue by adding degrees of freedom that ``live on the links''. This makes areas non-trivial, equal to the background entanglement piece plus a new positive piece that depends on the state of the link degrees of freedom. Nevertheless, this still has the downside that there is background entanglement, and hence it only models relatively limited code subspaces in which every area has a definite minimum value given by the background entanglement. In this note, we simply point out that a version of these constructions goes one step further: they can be background independent, with no background entanglement in the holographic map. This is advantageous because it allows tensor networks to model holographic maps for larger code subspaces. In addition to pointing this out, we address some subtleties involved in making it work and point out a nice connection it offers to recent discussions of random CFT data. △ Less

Submitted 10 April, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: 20 pages, 2 figures, v2 added citation and fixed typos

arXiv:2402.05746 [pdf, other]

Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

Authors: Yuxi Wei, Zi Wang, Yifan Lu, Chenxin Xu, Changxing Liu, Hao Zhao, Siheng Chen, Yanfeng Wang

Abstract: Scene simulation in autonomous driving has gained significant attention because of its huge potential for generating customized data. However, existing editable scene simulation approaches face limitations in terms of user interaction efficiency, multi-camera photo-realistic rendering and external digital assets integration. To address these challenges, this paper introduces ChatSim, the first sys… ▽ More Scene simulation in autonomous driving has gained significant attention because of its huge potential for generating customized data. However, existing editable scene simulation approaches face limitations in terms of user interaction efficiency, multi-camera photo-realistic rendering and external digital assets integration. To address these challenges, this paper introduces ChatSim, the first system that enables editable photo-realistic 3D driving scene simulations via natural language commands with external digital assets. To enable editing with high command flexibility,~ChatSim leverages a large language model (LLM) agent collaboration framework. To generate photo-realistic outcomes, ChatSim employs a novel multi-camera neural radiance field method. Furthermore, to unleash the potential of extensive high-quality digital assets, ChatSim employs a novel multi-camera lighting estimation method to achieve scene-consistent assets' rendering. Our experiments on Waymo Open Dataset demonstrate that ChatSim can handle complex language commands and generate corresponding photo-realistic scene videos. △ Less

Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: CVPR 2024(Highlight)

arXiv:2402.05709 [pdf, other]

Exploring the Nostr Ecosystem: A Study of Decentralization and Resilience

Authors: Yiluo Wei, Gareth Tyson

Abstract: Nostr is an open decentralized social network launched in 2022. From a user's perspective, it is similar to a micro-blogging service like Twitter. However, the underlying infrastructure is very different, and Nostr boasts a range of unique features that set it apart. Nostr introduces the concept of relays, which act as open storage servers that receive, store, and distribute user posts. Each user… ▽ More Nostr is an open decentralized social network launched in 2022. From a user's perspective, it is similar to a micro-blogging service like Twitter. However, the underlying infrastructure is very different, and Nostr boasts a range of unique features that set it apart. Nostr introduces the concept of relays, which act as open storage servers that receive, store, and distribute user posts. Each user is uniquely identified by a public key, ensuring authenticity of posts through digital signatures. Consequently, users are able to securely send and receive posts through various relays, which frees them from single-server reliance and enhances post availability (e.g., making it more censorship resistant). The Nostr ecosystem has garnered significant attention, boasting 4 million users and 60 million posts in just 2 years. To understand its characteristics and challenges, we conduct the first large-scale measurement of the Nostr ecosystem, spanning from July 1, 2023, to December 31, 2023. Our study focuses on two key aspects: Nostr relays and post replication strategies. We find that Nostr achieves superior decentralization compared to traditional Fediverse applications. However, relay availability remains a challenge, where financial sustainability (particularly for free-to-use relays) emerges as a contributing factor. We also find that the replication of posts across relays enhances post availability but introduces significant overhead. To address this, we propose two design innovations. One to control the number of post replications, and another to reduce the overhead during post retrieval. Via data-driven evaluations, we demonstrate their effectiveness without negatively impacting the system. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: Under Review

arXiv:2402.05528 [pdf, other]

doi 10.1007/JHEP05(2024)065

Measurement of the Branching Fraction of $B^{0} \rightarrow J/ψπ^{0}$ Decays

Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, J. A. Adams, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1067 additional authors not shown)

Abstract: The ratio of branching fractions between $B^{0} \rightarrow J/ψπ^{0}$ and $B^{+} \rightarrow J/ψK^{*+}$ decays is measured with proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. The measured value is… ▽ More The ratio of branching fractions between $B^{0} \rightarrow J/ψπ^{0}$ and $B^{+} \rightarrow J/ψK^{*+}$ decays is measured with proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. The measured value is $\frac{\mathcal{B}_{B^{0} \rightarrow J/ψπ^{0}}}{\mathcal{B}_{B^{+} \rightarrow J/ψK^{*+}}} = (1.153 \pm 0.053 \pm 0.048 ) \times 10^{-2}$, where the first uncertainty is statistical and the second is systematic. The branching fraction for $B^{0} \rightarrow J/ψπ^{0}$ decays is determined using the branching fraction of the normalisation channel, resulting in $\mathcal{B}_{B^{0} \rightarrow J/ψπ^{0}} = (1.670 \pm 0.077 \pm 0.069 \pm 0.095) \times 10^{-5}$, where the last uncertainty corresponds to that of the external input. This result is consistent with the current world average value and competitive with the most precise single measurement to date. △ Less

Submitted 23 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-041.html (LHCb public pages)

Report number: LHCb-PAPER-2023-041, CERN-EP-2024-009

Journal ref: J. High Energ. Phys. 2024, 65 (2024)

arXiv:2402.05523 [pdf, other]

doi 10.1007/JHEP04(2024)151

Observation of the $B_c^+ \to J/ψπ^+ π^0$ decay

Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, J. A. Adams, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1064 additional authors not shown)

Abstract: The first observation of the $B_c^+ \to J/ψπ^+ π^0$ decay is reported with high significance using proton-proton collision data, corresponding to an integrated luminosity of 9fb$^{-1}$, collected with the LHCb detector at centre-of-mass energies of 7, 8, and 13 TeV. The ratio of its branching fraction relative to the $B_c^+ \to J/ψπ^+$ channel is measured to be… ▽ More The first observation of the $B_c^+ \to J/ψπ^+ π^0$ decay is reported with high significance using proton-proton collision data, corresponding to an integrated luminosity of 9fb$^{-1}$, collected with the LHCb detector at centre-of-mass energies of 7, 8, and 13 TeV. The ratio of its branching fraction relative to the $B_c^+ \to J/ψπ^+$ channel is measured to be $$ \frac{ {\cal{B}}( B_c^+ \to J/ψπ^+π^0 ) } { {\cal{B}}( B_c^+ \to J/ψπ^+ ) } = 2.80 \pm 0.15 \pm 0.11 \pm 0.16 \,, $$ where the first uncertainty is statistical, the second systematic and the third related to imprecise knowledge of the branching fractions for $B^+ \to J/ψK^{*+}$ and $B^+ \to J/ψK^+$ decays, which are used to determine the $π^0$ detection efficiency. The $π^+π^0$ mass spectrum is found to be consistent with the dominance of an intermediate $ρ^+$ contribution in accordance with a model based on QCD factorisation. △ Less

Submitted 15 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: 30 pages, 6 figures. All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-046.html (LHCb public pages)

Report number: LHCb-PAPER-2023-046, CERN-EP-2024-019

Journal ref: JHEP04 (2024) 151

arXiv:2402.05383 [pdf, other]

First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay

Authors: Daya Bay Collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

Abstract: Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546… ▽ More Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546$\pm$0.076 for $^8$He, and 6.73$\pm$0.73, 6.75$\pm$0.70, and 13.74$\pm$0.82 for $^9$Li at average muon energies of 63.9~GeV, 64.7~GeV, and 143.0~GeV, respectively. The measured production rate of $^8$He isotopes is more than an order of magnitude lower than any other measurement of cosmogenic isotope production. It replaces the results of previous attempts to determine the ratio of $^8$He to $^9$Li production that yielded a wide range of limits from 0 to 30\%. The results provide future liquid-scintillator-based experiments with improved ability to predict cosmogenic backgrounds. △ Less

Submitted 7 February, 2024; originally announced February 2024.

arXiv:2402.04422 [pdf]

Immunogenic cell death triggered by pathogen ligands via host germ line-encoded receptors

Authors: Chuang Li, Yichen Wei, Chao Qin, Shifan Chen, Xiaolong Shao

Abstract: The strategic induction of cell death serves as a crucial immune defense mechanism for the eradication of pathogenic infections within host cells. Investigating the molecular mechanisms underlying immunogenic cell pathways has significantly enhanced our understanding of the host's immunity. This review provides a comprehensive overview of the immunogenic cell death mechanisms triggered by pathogen… ▽ More The strategic induction of cell death serves as a crucial immune defense mechanism for the eradication of pathogenic infections within host cells. Investigating the molecular mechanisms underlying immunogenic cell pathways has significantly enhanced our understanding of the host's immunity. This review provides a comprehensive overview of the immunogenic cell death mechanisms triggered by pathogen infections, focusing on the critical role of pattern recognition receptors. In response to infections, host cells dictate a variety of cell death pathways, including apoptosis, pyroptosis, necrosis, and lysosomal cell death, which are essential for amplifying immune responses and controlling pathogen dissemination. Key components of these mechanisms are host cellular receptors that recognize pathogen-associated ligands. These receptors activate downstream signaling cascades, leading to the expression of immunoregulatory genes and the production of antimicrobial cytokines and chemokines. Particularly, the inflammasome, a multi-protein complex, plays a pivotal role in these responses by processing pro-inflammatory cytokines and inducing pyroptotic cell death. Pathogens, in turn, have evolved strategies to manipulate these cell death pathways, either by inhibiting them to facilitate their replication or by triggering them to evade host defenses. This dynamic interplay between host immune mechanisms and pathogen strategies highlights the intricate co-evolution of microbial virulence and host immunity. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: 30 pages, 3 figures

arXiv:2402.03596 [pdf, other]

PandaX-xT: a Multi-ten-tonne Liquid Xenon Observatory at the China **** Underground Laboratory

Authors: PandaX Collaboration, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Lisheng Geng, Karl Giboni, Linhui Gu, Xunan Guo, Xuyuan Guo, Zhichao Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Junting Huang, Zhou Huang, Ruquan Hou, Yu Hou , et al. (68 additional authors not shown)

Abstract: We propose a major upgrade to the existing PandaX-4T experiment in the China **** Underground Laboratory. The new experiment, PandaX-xT, will be a multi-ten-tonne liquid xenon, ultra-low background, and general-purpose observatory. The full-scaled PandaX-xT contains a 43-tonne liquid xenon active target. Such an experiment will significantly advance our fundamental understanding of particle phy… ▽ More We propose a major upgrade to the existing PandaX-4T experiment in the China **** Underground Laboratory. The new experiment, PandaX-xT, will be a multi-ten-tonne liquid xenon, ultra-low background, and general-purpose observatory. The full-scaled PandaX-xT contains a 43-tonne liquid xenon active target. Such an experiment will significantly advance our fundamental understanding of particle physics and astrophysics. The sensitivity of dark matter direct detection will be improved by nearly two orders of magnitude compared to the current best limits, approaching the so-called "neutrino floor" for a dark matter mass above 10 GeV/$c^2$, providing a decisive test to the Weakly Interacting Massive Particle paradigm. By searching for the neutrinoless double beta decay of $^{136}$Xe isotope in the detector, the effective Majorana neutrino mass can be measured to a [10 -- 41] meV/$c^2$ sensitivity, providing a key test to the Dirac/Majorana nature of neutrino s. Astrophysical neutrinos and other ultra-rare interactions can also be measured and searched for with an unprecedented background level, opening up new windows of discovery. Depending on the findings, PandaX-xT will seek the next stage upgrade utilizing isotopic separation on natural xenon. △ Less

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.02389 [pdf, other]

doi 10.18653/v1/2023.findings-emnlp.580

KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion

Authors: Yanbin Wei, Qiushi Huang, James T. Kwok, Yu Zhang

Abstract: Knowledge Graph Completion (KGC) is crucial for addressing knowledge graph incompleteness and supporting downstream applications. Many models have been proposed for KGC. They can be categorized into two main classes: triple-based and text-based approaches. Triple-based methods struggle with long-tail entities due to limited structural information and imbalanced entity distributions. Text-based met… ▽ More Knowledge Graph Completion (KGC) is crucial for addressing knowledge graph incompleteness and supporting downstream applications. Many models have been proposed for KGC. They can be categorized into two main classes: triple-based and text-based approaches. Triple-based methods struggle with long-tail entities due to limited structural information and imbalanced entity distributions. Text-based methods alleviate this issue but require costly training for language models and specific finetuning for knowledge graphs, which limits their efficiency. To alleviate these limitations, in this paper, we propose KICGPT, a framework that integrates a large language model (LLM) and a triple-based KGC retriever. It alleviates the long-tail problem without incurring additional training overhead. KICGPT uses an in-context learning strategy called Knowledge Prompt, which encodes structural knowledge into demonstrations to guide the LLM. Empirical results on benchmark datasets demonstrate the effectiveness of KICGPT with smaller training overhead and no finetuning. △ Less

Submitted 23 February, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

Comments: Accepted to EMNLP 2023 Findings

arXiv:2402.02330 [pdf, other]

Enhance Reasoning for Large Language Models in the Game Werewolf

Authors: Shuang Wu, Liwen Zhu, Tao Yang, Shiwei Xu, Qiang Fu, Yang Wei, Haobo Fu

Abstract: This paper presents an innovative framework that integrates Large Language Models (LLMs) with an external Thinker module to enhance the reasoning capabilities of LLM-based agents. Unlike augmenting LLMs with prompt engineering, Thinker directly harnesses knowledge from databases and employs various optimization techniques. The framework forms a reasoning hierarchy where LLMs handle intuitive Syste… ▽ More This paper presents an innovative framework that integrates Large Language Models (LLMs) with an external Thinker module to enhance the reasoning capabilities of LLM-based agents. Unlike augmenting LLMs with prompt engineering, Thinker directly harnesses knowledge from databases and employs various optimization techniques. The framework forms a reasoning hierarchy where LLMs handle intuitive System-1 tasks such as natural language processing, while the Thinker focuses on cognitive System-2 tasks that require complex logical analysis and domain-specific knowledge. Our framework is presented using a 9-player Werewolf game that demands dual-system reasoning. We introduce a communication protocol between LLMs and the Thinker, and train the Thinker using data from 18800 human sessions and reinforcement learning. Experiments demonstrate the framework's effectiveness in deductive reasoning, speech generation, and online game evaluation. Additionally, we fine-tune a 6B LLM to surpass GPT4 when integrated with the Thinker. This paper also contributes the largest dataset for social deduction games to date. △ Less

Submitted 29 March, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

arXiv:2402.02130 [pdf, other]

GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning

Authors: Yanbin Wei, Shuai Fu, Weisen Jiang, Zejian Zhang, Zhixiong Zeng, Qi Wu, James T. Kwok, Yu Zhang

Abstract: Large Language Models (LLMs) are increasingly used for various tasks with graph structures. Though LLMs can process graph information in a textual format, they overlook the rich vision modality, which is an intuitive way for humans to comprehend structural information and conduct general graph reasoning. The potential benefits and capabilities of representing graph structures as visual images (i.e… ▽ More Large Language Models (LLMs) are increasingly used for various tasks with graph structures. Though LLMs can process graph information in a textual format, they overlook the rich vision modality, which is an intuitive way for humans to comprehend structural information and conduct general graph reasoning. The potential benefits and capabilities of representing graph structures as visual images (i.e., $\textit{visual graph}$) are still unexplored. To fill the gap, we innovatively propose an end-to-end framework, called $\textbf{G}$raph to v$\textbf{I}$sual and $\textbf{T}$extual Integr$\textbf{A}$tion (GITA), which firstly incorporates visual graphs into general graph reasoning. Besides, we establish $\textbf{G}$raph-based $\textbf{V}$ision-$\textbf{L}$anguage $\textbf{Q}$uestion $\textbf{A}$nswering (GVLQA) dataset from existing graph data, which is the first vision-language dataset for general graph reasoning purposes. Extensive experiments on the GVLQA dataset and five real-world datasets show that GITA outperforms mainstream LLMs in terms of general graph reasoning capabilities. Moreover, We highlight the effectiveness of the layout augmentation on visual graphs and pretraining on the GVLQA dataset. △ Less

Submitted 24 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

arXiv:2402.01439 [pdf, other]

From Words to Molecules: A Survey of Large Language Models in Chemistry

Authors: Chang Liao, Yemin Yu, Yu Mei, Ying Wei

Abstract: In recent years, Large Language Models (LLMs) have achieved significant success in natural language processing (NLP) and various interdisciplinary areas. However, applying LLMs to chemistry is a complex task that requires specialized domain knowledge. This paper provides a thorough exploration of the nuanced methodologies employed in integrating LLMs into the field of chemistry, delving into the c… ▽ More In recent years, Large Language Models (LLMs) have achieved significant success in natural language processing (NLP) and various interdisciplinary areas. However, applying LLMs to chemistry is a complex task that requires specialized domain knowledge. This paper provides a thorough exploration of the nuanced methodologies employed in integrating LLMs into the field of chemistry, delving into the complexities and innovations at this interdisciplinary juncture. Specifically, our analysis begins with examining how molecular information is fed into LLMs through various representation and tokenization methods. We then categorize chemical LLMs into three distinct groups based on the domain and modality of their input data, and discuss approaches for integrating these inputs for LLMs. Furthermore, this paper delves into the pretraining objectives with adaptations to chemical LLMs. After that, we explore the diverse applications of LLMs in chemistry, including novel paradigms for their application in chemistry tasks. Finally, we identify promising research directions, including further integration with chemical knowledge, advancements in continual learning, and improvements in model interpretability, paving the way for groundbreaking developments in the field. △ Less

Submitted 2 February, 2024; originally announced February 2024.

Comments: Submitted to IJCAI 2024 survey track

arXiv:2402.01336 [pdf, other]

Measurements of the branching fraction ratio $\cal{B}(φ\to μ^+μ^-)/\cal{B}(φ\to e^+e^-)$ with charm meson decays

Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1080 additional authors not shown)

Abstract: Measurements of the branching fraction ratio ${\cal{B}(φ\to μ^+ μ^-)/\cal{B}(φ\to e^+e^-)}$ with ${D_{s}^{+} \to π^{+} φ}$ and ${D^{+} \to π^{+} φ}$ decays, denoted $R^{s}_{φπ}$ and $R^{d}_{φπ}$, are presented. The analysis is performed using a dataset corresponding to an integrated luminosity of 5.4$\,\rm{fb}^{-1}$ of $pp$ collision data collected with the LHCb experiment. The branching fractions… ▽ More Measurements of the branching fraction ratio ${\cal{B}(φ\to μ^+ μ^-)/\cal{B}(φ\to e^+e^-)}$ with ${D_{s}^{+} \to π^{+} φ}$ and ${D^{+} \to π^{+} φ}$ decays, denoted $R^{s}_{φπ}$ and $R^{d}_{φπ}$, are presented. The analysis is performed using a dataset corresponding to an integrated luminosity of 5.4$\,\rm{fb}^{-1}$ of $pp$ collision data collected with the LHCb experiment. The branching fractions are normalised with respect to the ${B^{+} \to K^{+} J/ψ(\to e^+e^-)}$ and ${B^{+} \to K^{+} J/ψ(\to μ^+μ^-)}$ decay modes. The combination of the results yields $$ R_{φπ} = 1.022 \pm 0.012 \,({\rm stat}) \, \pm 0.048 \,({\rm syst}). $$ The result is compatible with previous measurements of the $φ\to \ell^{+}\ell^{-}$ branching fractions and predictions based on the Standard Model. △ Less

Submitted 1 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-038.html (LHCb public pages)

Report number: LHCb-PAPER-2023-038, CERN-EP-2024-001

arXiv:2401.17934 [pdf, other]

doi 10.1007/JHEP05(2024)025

Study of $CP$ violation in $B^0_{(s)} \to D K^{*}(892)^0$ decays with $D \to K π( ππ)$, $ ππ( ππ)$, and $KK$ final states

Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1072 additional authors not shown)

Abstract: A measurement of $CP$-violating observables associated with the interference of $B^0\to D^0 K^{*}(892)^0$ and $B^0\to \bar{D}^0 K^*(892)^0$ decay amplitudes is performed in the $D^0 \to K^{\mp}π^{\pm}(π^+π^-),$ $D^0 \to π^+π^-(π^+π^-)$, and $D^0\to K^+K^-$ final states using data collected by the LHCb experiment corresponding to an integrated luminosity of $9$ $\text{fb}^{-1}$. $CP$-violating obse… ▽ More A measurement of $CP$-violating observables associated with the interference of $B^0\to D^0 K^{*}(892)^0$ and $B^0\to \bar{D}^0 K^*(892)^0$ decay amplitudes is performed in the $D^0 \to K^{\mp}π^{\pm}(π^+π^-),$ $D^0 \to π^+π^-(π^+π^-)$, and $D^0\to K^+K^-$ final states using data collected by the LHCb experiment corresponding to an integrated luminosity of $9$ $\text{fb}^{-1}$. $CP$-violating observables related to the interference of $B^0_s\to D^0 \bar{K}^*(892)^0$ and $B_s^0\to \bar{D}^0 \bar{K}^*(892)^0$ are also measured, but no evidence for interference is found. The $B^0$ observables are used to constrain the parameter space of the CKM angle $γ$ and the hadronic parameters $r_{B^0}^{DK^*}$ and $δ_{B^0}^{DK^*}$ with inputs from other measurements. In a combined analysis, these measurements allow for four solutions in the parameter space, only one of which is consistent with the world average. △ Less

Submitted 13 May, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-040.html (LHCb public pages)

Report number: LHCb-PAPER-2023-040, CERN-EP-2024-007

Journal ref: JHEP 05(2024) 025

arXiv:2401.17197 [pdf, other]

Data-efficient Fine-tuning for LLM-based Recommendation

Authors: Xinyu Lin, Wenjie Wang, Yongqi Li, Shuo Yang, Fuli Feng, Yinwei Wei, Tat-Seng Chua

Abstract: Leveraging Large Language Models (LLMs) for recommendation has recently garnered considerable attention, where fine-tuning plays a key role in LLMs' adaptation. However, the cost of fine-tuning LLMs on rapidly expanding recommendation data limits their practical application. To address this challenge, few-shot fine-tuning offers a promising approach to quickly adapt LLMs to new recommendation data… ▽ More Leveraging Large Language Models (LLMs) for recommendation has recently garnered considerable attention, where fine-tuning plays a key role in LLMs' adaptation. However, the cost of fine-tuning LLMs on rapidly expanding recommendation data limits their practical application. To address this challenge, few-shot fine-tuning offers a promising approach to quickly adapt LLMs to new recommendation data. We propose the task of data pruning for efficient LLM-based recommendation, aimed at identifying representative samples tailored for LLMs' few-shot fine-tuning. While coreset selection is closely related to the proposed task, existing coreset selection methods often rely on suboptimal heuristic metrics or entail costly optimization on large-scale recommendation data. To tackle these issues, we introduce two objectives for the data pruning task in the context of LLM-based recommendation: 1) high accuracy aims to identify the influential samples that can lead to high overall performance; and 2) high efficiency underlines the low costs of the data pruning process. To pursue the two objectives, we propose a novel data pruning method based on two scores, i.e., influence score and effort score, to efficiently identify the influential samples. Particularly, the influence score is introduced to accurately estimate the influence of sample removal on the overall performance. To achieve low costs of the data pruning process, we use a small-sized surrogate model to replace LLMs to obtain the influence score. Considering the potential gap between the surrogate model and LLMs, we further propose an effort score to prioritize some hard samples specifically for LLMs. Empirical results on three real-world datasets validate the effectiveness of our proposed method. In particular, the proposed method uses only 2% samples to surpass the full data fine-tuning, reducing time costs by 97%. △ Less

Submitted 4 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: Accepted by SIGIR 2024

arXiv:2401.16836 [pdf, other]

Coseparable Nonnegative Tensor Factorization With T-CUR Decomposition

Authors: Juefei Chen, Longxiu Huang, Yimin Wei

Abstract: Nonnegative Matrix Factorization (NMF) is an important unsupervised learning method to extract meaningful features from data. To address the NMF problem within a polynomial time framework, researchers have introduced a separability assumption, which has recently evolved into the concept of coseparability. This advancement offers a more efficient core representation for the original data. However,… ▽ More Nonnegative Matrix Factorization (NMF) is an important unsupervised learning method to extract meaningful features from data. To address the NMF problem within a polynomial time framework, researchers have introduced a separability assumption, which has recently evolved into the concept of coseparability. This advancement offers a more efficient core representation for the original data. However, in the real world, the data is more natural to be represented as a multi-dimensional array, such as images or videos. The NMF's application to high-dimensional data involves vectorization, which risks losing essential multi-dimensional correlations. To retain these inherent correlations in the data, we turn to tensors (multidimensional arrays) and leverage the tensor t-product. This approach extends the coseparable NMF to the tensor setting, creating what we term coseparable Nonnegative Tensor Factorization (NTF). In this work, we provide an alternating index selection method to select the coseparable core. Furthermore, we validate the t-CUR sampling theory and integrate it with the tensor Discrete Empirical Interpolation Method (t-DEIM) to introduce an alternative, randomized index selection process. These methods have been tested on both synthetic and facial analysis datasets. The results demonstrate the efficiency of coseparable NTF when compared to coseparable NMF. △ Less

Submitted 7 May, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

arXiv:2401.15849 [pdf, other]

Aperiodic-quasiperiodic-periodic properties and topological transitions in twisted nested Moiré patterns

Authors: Peng Peng, Yuchen Peng, Aoqian Shi, Xiaogen Yi, Yizhou Wei, Jianjun Liu

Abstract: The Moiré patterns generated by altering the structural parameters in a two or more layers of periodic materials, including single-layer structure, interlayer stacking, and twisting parameters, exhibit prosperous topological physical properties. However, the intricate characteristics of twisted nested Moiré patterns and their relationship with topological transitions remain unclear. In this Letter… ▽ More The Moiré patterns generated by altering the structural parameters in a two or more layers of periodic materials, including single-layer structure, interlayer stacking, and twisting parameters, exhibit prosperous topological physical properties. However, the intricate characteristics of twisted nested Moiré patterns and their relationship with topological transitions remain unclear. In this Letter, based on the proposed twisted nested photonic crystal (TNPC), we derive its spatial geometric functions (SGFs), aperiodic-quasiperiodic-periodic properties in twisted nested Moiré patterns, and the SSHφ Hamiltonian. We reveal the intrinsic correlation between twisted nested Moiré patterns and topological transitions, obtaining higher-order topological states (HOTSs) with C2z symmetry. This work will provide theoretical references for the design and application of twisted topological PC and their devices. △ Less

Submitted 17 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

arXiv:2401.15704 [pdf, other]

Phoneme-Based Proactive Anti-Eavesdrop** with Controlled Recording Privilege

Authors: Peng Huang, Yao Wei, Peng Cheng, Zhongjie Ba, Li Lu, Feng Lin, Yang Wang, Kui Ren

Abstract: The widespread smart devices raise people's concerns of being eavesdropped on. To enhance voice privacy, recent studies exploit the nonlinearity in microphone to jam audio recorders with inaudible ultrasound. However, existing solutions solely rely on energetic masking. Their simple-form noise leads to several problems, such as high energy requirements and being easily removed by speech enhancemen… ▽ More The widespread smart devices raise people's concerns of being eavesdropped on. To enhance voice privacy, recent studies exploit the nonlinearity in microphone to jam audio recorders with inaudible ultrasound. However, existing solutions solely rely on energetic masking. Their simple-form noise leads to several problems, such as high energy requirements and being easily removed by speech enhancement techniques. Besides, most of these solutions do not support authorized recording, which restricts their usage scenarios. In this paper, we design an efficient yet robust system that can jam microphones while preserving authorized recording. Specifically, we propose a novel phoneme-based noise with the idea of informational masking, which can distract both machines and humans and is resistant to denoising techniques. Besides, we optimize the noise transmission strategy for broader coverage and implement a hardware prototype of our system. Experimental results show that our system can reduce the recognition accuracy of recordings to below 50\% under all tested speech recognition systems, which is much better than existing solutions. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Comments: 14 pages, 28 figures; submitted to IEEE TDSC

arXiv:2401.13933 [pdf, ps, other]

Solutions to the First Order Difference Equations in the Multivariate Difference Field

Authors: Lixin Du, Yarong Wei

Abstract: The bivariate difference field provides an algebraic framework for a sequence satisfying a recurrence of order two. Based on this, we focus on sequences satisfying a recurrence of higher order, and consider the multivariate difference field, in which the summation problem could be transformed into solving the first order difference equations. We then show a criterion for deciding whether the diffe… ▽ More The bivariate difference field provides an algebraic framework for a sequence satisfying a recurrence of order two. Based on this, we focus on sequences satisfying a recurrence of higher order, and consider the multivariate difference field, in which the summation problem could be transformed into solving the first order difference equations. We then show a criterion for deciding whether the difference equation has a rational solution and present an algorithm for computing one rational solution of such a difference equation, if it exists. Moreover we get the rational solution set of such an equation. △ Less

Submitted 24 January, 2024; originally announced January 2024.

arXiv:2401.12034 [pdf, other]

doi 10.1016/j.nima.2024.169315

Unfolding environmental $γ$ flux spectrum with portable CZT detector

Authors: Taiyuan Liu, Mingxuan Xue, Hai** Peng, Kangkang Zhao, Deyong Duan, Yichao Wang, Changqing Feng, Yifeng Wei, Qing Lin, Zizong Xu, Xiaolian Wang

Abstract: Environmental $γ$-rays constitute a crucial source of background in various nuclear, particle and quantum physics experiments. To evaluate the flux rate and the spectrum of $γ$ background, we have developed a novel and straightforward approach to reconstruct the environmental $γ$ flux spectrum by applying a portable CZT $γ$ detector and iterative Bayesian unfolding, which possesses excellent trans… ▽ More Environmental $γ$-rays constitute a crucial source of background in various nuclear, particle and quantum physics experiments. To evaluate the flux rate and the spectrum of $γ$ background, we have developed a novel and straightforward approach to reconstruct the environmental $γ$ flux spectrum by applying a portable CZT $γ$ detector and iterative Bayesian unfolding, which possesses excellent transferability for broader applications. In this paper, the calibration and GEANT4 Monte-Carlo modeling of the CZT detector, the unfolding procedure as well as the uncertainty estimation are demonstrated in detail. The reconstructed spectrum reveals an environmental $γ$ flux intensity of $3.3\pm 0.9\times 10^{7}$~ (m$^2\cdot$sr$\cdot$hour)$^{-1}$ ranging from 73 to 3033~keV, along with characteristic peaks primarily arising from $^{232}$Th series, $^{238}$U series and $^{40}$K. We also give an instance of background rate evaluation with the unfolded spectrum for validation of the approach. △ Less

Submitted 5 April, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

Journal ref: Nuclear Inst. and Methods in Physics Research, A (2024)

arXiv:2401.11978 [pdf, other]

$Λ_{b}\rightarrow P \ell$ factorization in QCD

Authors: Lei-Yi Li, Cai-Dian Lü, ** Wang, Yan-Bing Wei

Abstract: We calculate the form factors for the baryon number violation processes of a heavy-flavor baryon decaying into a pseudoscalar meson and a lepton. In the framework of the Standard Model effective field theory, the leptoquark operators at the bottom quark scale, whose matrix elements define the form factors, are derived by integrating out the high energy physics. Under the QCD factorization approach… ▽ More We calculate the form factors for the baryon number violation processes of a heavy-flavor baryon decaying into a pseudoscalar meson and a lepton. In the framework of the Standard Model effective field theory, the leptoquark operators at the bottom quark scale, whose matrix elements define the form factors, are derived by integrating out the high energy physics. Under the QCD factorization approach, the form factors of the baryon number violation processes at leading power can be factorized into the convolution of the long-distance hadron wave functions as well as the short-distance hard and jet functions representing the hard scale and hard-collinear scale effects, separately. Based on measurements of the baryon number violation processes by LHCb, we further impose constraints on the new physics constants of leptoquark operators. △ Less

Submitted 18 May, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

Comments: 13 pages, 3 figures

arXiv:2401.11388 [pdf, ps, other]

Polynomial Solutions to the First Order Difference Equations in the Bivariate Difference Field

Authors: Yarong Wei

Abstract: The bivariate difference filed $(\mathbb{F}(α, β), σ)$ provides an algebraic framework for a sequence satisfying a recurrence of order two and it could transform the summation involving a sequence satisfying a recurrence of order two into the first order difference equations in the bivariate difference field. Based on it, we present an algorithm for finding all the polynomial solutions of such equ… ▽ More The bivariate difference filed $(\mathbb{F}(α, β), σ)$ provides an algebraic framework for a sequence satisfying a recurrence of order two and it could transform the summation involving a sequence satisfying a recurrence of order two into the first order difference equations in the bivariate difference field. Based on it, we present an algorithm for finding all the polynomial solutions of such equations in the bivariate difference field, and show an upper bound on the degree for polynomial solutions which is sufficient to compute polynomial solution by using the undetermined method. △ Less

Submitted 20 January, 2024; originally announced January 2024.

arXiv:2401.11387 [pdf, ps, other]

Rational Solutions to the First Order Difference Equations in the Bivariate Difference Field

Authors: Qing-Hu Hou, Yarong Wei

Abstract: Inspired by Karr's algorithm, we consider the summations involving a sequence satisfying a recurrence of order two. The structure of such summations provides an algebraic framework for solving the difference equations of form $aσ(g)+bg=f$ in the bivariate difference field $(\mathbb{F}(α, β), σ)$, where $a, b,f\in\mathbb{F}(α,β)\setminus\{0\}$ are known binary functions of $α$, $β$, and $α$, $β$ ar… ▽ More Inspired by Karr's algorithm, we consider the summations involving a sequence satisfying a recurrence of order two. The structure of such summations provides an algebraic framework for solving the difference equations of form $aσ(g)+bg=f$ in the bivariate difference field $(\mathbb{F}(α, β), σ)$, where $a, b,f\in\mathbb{F}(α,β)\setminus\{0\}$ are known binary functions of $α$, $β$, and $α$, $β$ are two algebraically independent transcendental elements, $σ$ is a transformation that satisfies $σ(α)=β$, $σ(β)=uα+vβ$, where $u,v\neq 0\in\mathbb{F}$. Based on it, we then describe algorithms for finding the universal denominator for those equations in the bivariate difference field under certain assumptions. This reduces the general problem of finding the rational solutions of such equations to the problem of finding the polynomial solutions of such equations. △ Less

Submitted 20 January, 2024; originally announced January 2024.

arXiv:2401.11342 [pdf, other]

doi 10.1007/JHEP04(2024)111

Prompt and nonprompt $ψ(2S)$ production in $p$Pb collisions at $\sqrt{s_{NN}}=8.16$ TeV

Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, H. Afsharnia, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1079 additional authors not shown)

Abstract: The production of $ψ(2S)$ mesons in proton-lead collisions at a centre-of-mass energy per nucleon pair of $\sqrt{s_{NN}}=8.16$ TeV is studied with the LHCb detector using data corresponding to an integrated luminosity of 34 nb$^{-1}$. The prompt and nonprompt $ψ(2S)$ production cross-sections and the ratio of the $ψ(2S)$ to $J/ψ$ cross-section are measured as a function of the meson transverse mom… ▽ More The production of $ψ(2S)$ mesons in proton-lead collisions at a centre-of-mass energy per nucleon pair of $\sqrt{s_{NN}}=8.16$ TeV is studied with the LHCb detector using data corresponding to an integrated luminosity of 34 nb$^{-1}$. The prompt and nonprompt $ψ(2S)$ production cross-sections and the ratio of the $ψ(2S)$ to $J/ψ$ cross-section are measured as a function of the meson transverse momentum and rapidity in the nucleon-nucleon centre-of-mass frame, together with forward-to-backward ratios and nuclear modification factors. The production of prompt $ψ(2S)$ is observed to be more suppressed compared to $pp$ collisions than the prompt $J/ψ$ production, while the nonprompt productions have similar suppression factors. △ Less

Submitted 22 April, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-024.html (LHCb public pages)

Report number: LHCb-PAPER-2023-024, CERN-EP-2023-293

Journal ref: JHEP 04 (2024) 111

arXiv:2401.10600 [pdf, other]

Changing-look NLS1 galaxies, their detection with SVOM, and the case of NGC 1566

Authors: D. W. Xu, S. Komossa, D. Grupe, J. Wang, L. P. Xin, X. H. Han, J. Y. Wei, J. Y. Bai, E. Bon, F. Cangemi, B. Cordier, M. Dennefeld, L. C. Gallo, W. Kollatschny, De-Feng Kong, M. W. Ochmann, Y. L. Qiu, N. Schartel

Abstract: We discuss applications of the study of the new and barely explored class of changing-look (CL) narrow-line Seyfert 1 (NLS1) galaxies and comment on their detection with the space mission SVOM (Space Variable Objects Monitor). We highlight the case of NGC 1566, which is outstanding in many respects, for instance as one of the nearest known CL AGN undergoing exceptional outbursts. Its NLS1 nature i… ▽ More We discuss applications of the study of the new and barely explored class of changing-look (CL) narrow-line Seyfert 1 (NLS1) galaxies and comment on their detection with the space mission SVOM (Space Variable Objects Monitor). We highlight the case of NGC 1566, which is outstanding in many respects, for instance as one of the nearest known CL AGN undergoing exceptional outbursts. Its NLS1 nature is discussed, and we take it as a nearby prototype for systems that could be discovered and studied in the near future, including with SVOM. Finally, we briefly examine the broader implications and applications of CL events in NLS1 galaxies and show that such systems, once discovered in larger numbers, will greatly advance our understanding of the physics of the environment of rapidly growing supermassive black holes. This White Paper is part of a sequence of publications which explore aspects of our understanding of (CL) NLS1 galaxy physics with future missions. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 14 pages, 4 figures. Accepted for publication in the Universe Special Issue "A Multimessenger View of Supermassive Black Holes and the Quasar Main Sequence"

arXiv:2401.10272 [pdf, other]

Multi-Source Collaborative Gradient Discrepancy Minimization for Federated Domain Generalization

Authors: Yikang Wei, Yahong Han

Abstract: Federated Domain Generalization aims to learn a domain-invariant model from multiple decentralized source domains for deployment on unseen target domain. Due to privacy concerns, the data from different source domains are kept isolated, which poses challenges in bridging the domain gap. To address this issue, we propose a Multi-source Collaborative Gradient Discrepancy Minimization (MCGDM) method… ▽ More Federated Domain Generalization aims to learn a domain-invariant model from multiple decentralized source domains for deployment on unseen target domain. Due to privacy concerns, the data from different source domains are kept isolated, which poses challenges in bridging the domain gap. To address this issue, we propose a Multi-source Collaborative Gradient Discrepancy Minimization (MCGDM) method for federated domain generalization. Specifically, we propose intra-domain gradient matching between the original images and augmented images to avoid overfitting the domain-specific information within isolated domains. Additionally, we propose inter-domain gradient matching with the collaboration of other domains, which can further reduce the domain shift across decentralized domains. Combining intra-domain and inter-domain gradient matching, our method enables the learned model to generalize well on unseen domains. Furthermore, our method can be extended to the federated domain adaptation task by fine-tuning the target model on the pseudo-labeled target domain. The extensive experiments on federated domain generalization and adaptation indicate that our method outperforms the state-of-the-art methods significantly. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Comments: Accepted by AAAI 2024

arXiv:2401.08472 [pdf, other]

Instilling Multi-round Thinking to Text-guided Image Generation

Authors: Lidong Zeng, Zhedong Zheng, Yinwei Wei, Tat-seng Chua

Abstract: This paper delves into the text-guided image editing task, focusing on modifying a reference image according to user-specified textual feedback to embody specific attributes. Despite recent advancements, a persistent challenge remains that the single-round generation often overlooks crucial details, particularly in the realm of fine-grained changes like shoes or sleeves. This issue compounds over… ▽ More This paper delves into the text-guided image editing task, focusing on modifying a reference image according to user-specified textual feedback to embody specific attributes. Despite recent advancements, a persistent challenge remains that the single-round generation often overlooks crucial details, particularly in the realm of fine-grained changes like shoes or sleeves. This issue compounds over multiple rounds of interaction, severely limiting customization quality. In an attempt to address this challenge, we introduce a new self-supervised regularization, \ie, multi-round regularization, which is compatible with existing methods. Specifically, the multi-round regularization encourages the model to maintain consistency across different modification orders. It builds upon the observation that the modification order generally should not affect the final result. Different from traditional one-round generation, the mechanism underpinning the proposed method is the error amplification of initially minor inaccuracies in capturing intricate details. Qualitative and quantitative experiments affirm that the proposed method achieves high-fidelity editing quality, especially the local modification, in both single-round and multiple-round generation, while also showcasing robust generalization to irregular text inputs. The effectiveness of our semantic alignment with textual feedback is further substantiated by the retrieval improvements on FahisonIQ and Fashion200k. △ Less

Submitted 9 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

Comments: 14 pages, 6 figures

Showing 151–200 of 1,555 results for author: Wei, Y