Search | arXiv e-print repository

arXiv:2407.02569 [pdf, other]

Structure-inspired Ansatz and Warm Start of Variational Quantum Algorithms for Quadratic Unconstrained Binary Optimization Problems

Authors: Yahui Chai, Karl Jansen, Stefan Kühn, Tim Schwägerl, Tobias Stollenwerk

Abstract: This paper introduces a structure-inspired ansatz for addressing quadratic unconstrained binary optimization problems with the Variational Quantum Eigensolver. We propose a novel warm start technique that is based on imaginary time evolution, and allows for determining a set of initial parameters prioritizing lower energy states in a resource-efficient way. Using classical simulations, we demonstr… ▽ More This paper introduces a structure-inspired ansatz for addressing quadratic unconstrained binary optimization problems with the Variational Quantum Eigensolver. We propose a novel warm start technique that is based on imaginary time evolution, and allows for determining a set of initial parameters prioritizing lower energy states in a resource-efficient way. Using classical simulations, we demonstrate that this warm start method significantly improves the success rate and reduces the number of iterations required for the convergence of Variational Quantum Eigensolver. The numerical results also indicate that the warm start approach effectively mitigates statistical errors arising from a finite number of measurements, and to a certain extent alleviates the effect of barren plateaus. △ Less

Submitted 2 July, 2024; originally announced July 2024.

arXiv:2407.02343 [pdf, other]

A detailed study of the very-high-energy Crab pulsar emission with the LST-1

Authors: CTA-LST Project, :, K. Abe, S. Abe, A. Abhishek, F. Acero, A. Aguasca-Cabot, I. Agudo, N. Alvarez Crespo, L. A. Antonelli, C. Aramo, A. Arbet-Engels, C. Arcaro, M. Artero, K. Asano, P. Aubert, A. Baktash, A. Bamba, A. Baquero Larriva, L. Baroncelli, U. Barres de Almeida, J. A. Barrio, I. Batkovic, J. Baxter, J. Becerra González , et al. (272 additional authors not shown)

Abstract: Context: There are currently three pulsars firmly detected by imaging atmospheric Cherenkov telescopes (IACTs), two of them reaching TeV energies, challenging models of very-high-energy (VHE) emission in pulsars. More precise observations are needed to better characterize pulsar emission at these energies. The LST-1 is the prototype of the Large-Sized Telescope, that will be part of the Cherenkov… ▽ More Context: There are currently three pulsars firmly detected by imaging atmospheric Cherenkov telescopes (IACTs), two of them reaching TeV energies, challenging models of very-high-energy (VHE) emission in pulsars. More precise observations are needed to better characterize pulsar emission at these energies. The LST-1 is the prototype of the Large-Sized Telescope, that will be part of the Cherenkov Telescope Array Observatory (CTAO). Its improved performance over previous IACTs makes it well suited for studying pulsars. Aims: To study the Crab pulsar emission with the LST-1, improving and complementing the results from other telescopes. These observations can also be used to characterize the potential of the LST-1 to study other pulsars and detect new ones. Methods: We analyzed a total of $\sim$103 hours of gamma-ray observations of the Crab pulsar conducted with the LST-1 in the period from September 2020 to January 2023. The observations were carried out at zenith angles less than 50 degrees. A new analysis of the Fermi-LAT data was also performed, including $\sim$14 years of observations. Results: The Crab pulsar phaseogram, long-term light-curve, and phase-resolved spectra are reconstructed with the LST-1 from 20 GeV to 450 GeV for P1 and up to 700 GeV for P2. The pulsed emission is detected with a significance of 15.2$σ$. The two characteristic emission peaks of the Crab pulsar are clearly detected (>10$σ$), as well as the so-called bridge emission (5.7$σ$). We find that both peaks are well described by power laws, with spectral indices of $\sim$3.44 and $\sim$3.03 respectively. The joint analysis of Fermi-LAT and LST-1 data shows a good agreement between both instruments in the overlap** energy range. The detailed results obtained in the first observations of the Crab pulsar with LST-1 show the potential that CTAO will have to study this type of sources. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: Accepted by A&A

arXiv:2407.00719 [pdf]

A Whole-Process Certifiably Robust Aggregation Method Against Backdoor Attacks in Federated Learning

Authors: Anqi Zhou, Yezheng Liu, Yidong Chai, Hongyi Zhu, Xinyue Ge, Yuanchun Jiang, Meng Wang

Abstract: Federated Learning (FL) has garnered widespread adoption across various domains such as finance, healthcare, and cybersecurity. Nonetheless, FL remains under significant threat from backdoor attacks, wherein malicious actors insert triggers into trained models, enabling them to perform certain tasks while still meeting FL's primary objectives. In response, robust aggregation methods have been prop… ▽ More Federated Learning (FL) has garnered widespread adoption across various domains such as finance, healthcare, and cybersecurity. Nonetheless, FL remains under significant threat from backdoor attacks, wherein malicious actors insert triggers into trained models, enabling them to perform certain tasks while still meeting FL's primary objectives. In response, robust aggregation methods have been proposed, which can be divided into three types: ex-ante, ex-durante, and ex-post methods. Given the complementary nature of these methods, combining all three types is promising yet unexplored. Such a combination is non-trivial because it requires leveraging their advantages while overcoming their disadvantages. Our study proposes a novel whole-process certifiably robust aggregation (WPCRA) method for FL, which enhances robustness against backdoor attacks across three phases: ex-ante, ex-durante, and ex-post. Moreover, since the current geometric median estimation method fails to consider differences among clients, we propose a novel weighted geometric median estimation algorithm (WGME). This algorithm estimates the geometric median of model updates from clients based on each client's weight, further improving the robustness of WPCRA against backdoor attacks. We also theoretically prove that WPCRA offers improved certified robustness guarantees with a larger certified radius. We evaluate the advantages of our methods based on the task of loan status prediction. Comparison with baselines shows that our methods significantly improve FL's robustness against backdoor attacks. This study contributes to the literature with a novel WPCRA method and a novel WGME algorithm. Our code is available at https://github.com/brick-brick/WPCRAM. △ Less

Submitted 30 June, 2024; originally announced July 2024.

Comments: 14 pages

arXiv:2406.16251 [pdf, other]

Probing critical spin fluctuations with a composite magnetoelectric method: A case study on a Kitaev spin liquid candidate Na$_3$Co$_2$SbO$_6$

Authors: Xinrun Mi, Xintong Li, Long Zhang, Aifeng Wang, Yuan Li, Yisheng Chai, Mingquan He

Abstract: In correlated quantum materials, divergent critical fluctuations near the quantum critical point are often closely associated with exotic quantum phases of matter, such as unconventional superconductivity and quantum spin liquids. Here we present a simple yet highly sensitive composite magnetoelectric (ME) method for detecting the critical spin fluctuations in quantum magnets. The ME signal is pro… ▽ More In correlated quantum materials, divergent critical fluctuations near the quantum critical point are often closely associated with exotic quantum phases of matter, such as unconventional superconductivity and quantum spin liquids. Here we present a simple yet highly sensitive composite magnetoelectric (ME) method for detecting the critical spin fluctuations in quantum magnets. The ME signal is proportional the magnetostriction coefficient, which directly probes the product of magnetization and spin-spin correlation. As a demonstration, the composite ME method is applied to a Kitaev quantum spin liquid candidate Na$_3$Co$_2$SbO$_6$, which shows signs of magnetic field-induced quantum criticality. Notably, the ME signal prominently diverges at the magnetic field-induced tricritical points, particularly at a tricritical point that lies in close proximity to a zero-temperature quantum critical point. A crucial aspect of these tricritical points is their tunability through the modification of the in-plane magnetic field's direction. The direction of magnetic field can thus serve as a handful yet important tuning parameter, alongside pressure and chemical do**, for searching quantum critical points in quantum magnets with pronounced magnetic anisotropy. △ Less

Submitted 23 June, 2024; originally announced June 2024.

Comments: 6 pages, 4 figures

arXiv:2406.14090 [pdf, other]

Personalized Music Recommendation with a Heterogeneity-aware Deep Bayesian Network

Authors: Erkang **g, Yezheng Liu, Yidong Chai, Shuo Yu, Longshun Liu, Yuanchun Jiang, Yang Wang

Abstract: Music recommender systems are crucial in music streaming platforms, providing users with music they would enjoy. Recent studies have shown that user emotions can affect users' music mood preferences. However, existing emotion-aware music recommender systems (EMRSs) explicitly or implicitly assume that users' actual emotional states expressed by an identical emotion word are homogeneous. They also… ▽ More Music recommender systems are crucial in music streaming platforms, providing users with music they would enjoy. Recent studies have shown that user emotions can affect users' music mood preferences. However, existing emotion-aware music recommender systems (EMRSs) explicitly or implicitly assume that users' actual emotional states expressed by an identical emotion word are homogeneous. They also assume that users' music mood preferences are homogeneous under an identical emotional state. In this article, we propose four types of heterogeneity that an EMRS should consider: emotion heterogeneity across users, emotion heterogeneity within a user, music mood preference heterogeneity across users, and music mood preference heterogeneity within a user. We further propose a Heterogeneity-aware Deep Bayesian Network (HDBN) to model these assumptions. The HDBN mimics a user's decision process to choose music with four components: personalized prior user emotion distribution modeling, posterior user emotion distribution modeling, user grou**, and Bayesian neural network-based music mood preference prediction. We constructed a large-scale dataset called EmoMusicLJ to validate our method. Extensive experiments demonstrate that our method significantly outperforms baseline approaches on widely used HR and NDCG recommendation metrics. Ablation experiments and case studies further validate the effectiveness of our HDBN. The source code is available at https://github.com/**grk/HDBN. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: 34 pages, 19 figures

arXiv:2406.11687 [pdf, other]

Tokenization Falling Short: The Curse of Tokenization

Authors: Yekun Chai, Yewei Fang, Qiwei Peng, Xuhong Li

Abstract: Language models typically tokenize raw text into sequences of subword identifiers from a predefined vocabulary, a process inherently sensitive to typographical errors, length variations, and largely oblivious to the internal structure of tokens-issues we term the curse of tokenization. In this study, we delve into these drawbacks and demonstrate that large language models (LLMs) remain susceptible… ▽ More Language models typically tokenize raw text into sequences of subword identifiers from a predefined vocabulary, a process inherently sensitive to typographical errors, length variations, and largely oblivious to the internal structure of tokens-issues we term the curse of tokenization. In this study, we delve into these drawbacks and demonstrate that large language models (LLMs) remain susceptible to these problems. This study systematically investigates these challenges and their impact on LLMs through three critical research questions: (1) complex problem solving, (2) token structure probing, and (3) resilience to typographical variation. Our findings reveal that scaling model parameters can mitigate the issue of tokenization; however, LLMs still suffer from biases induced by typos and other text format variations. Our experiments show that subword regularization such as BPE-dropout can mitigate this issue. We will release our code and data to facilitate further research. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.09475 [pdf, other]

Search for $X(1870)$ via the decay $J/ψ\to ωK^+ K^-η$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

Abstract: Using a sample of $(10087\pm 44)\times10^{6}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider, we search for the decay $X(1870)\to K^+ K^-η$ via the $J/ψ\to ωK^+ K^- η$ process for the first time. No significant $X(1870)$ signal is observed. The upper limit on the branching fraction of the decay $ J/ψ\to ωX(1870) \toωK^+ K^- η$ is determined to be $9.55\times 10^{-7}$ at the… ▽ More Using a sample of $(10087\pm 44)\times10^{6}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider, we search for the decay $X(1870)\to K^+ K^-η$ via the $J/ψ\to ωK^+ K^- η$ process for the first time. No significant $X(1870)$ signal is observed. The upper limit on the branching fraction of the decay $ J/ψ\to ωX(1870) \toωK^+ K^- η$ is determined to be $9.55\times 10^{-7}$ at the $90\%$ confidence level. In addition, the branching faction $B(J/ψ\toωK^+ K^- η)$ is measured to be $(3.33\pm0.02(\rm{stat.})\pm 0.12(\rm{syst.}))\times 10^{-4}$. △ Less

Submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.07140 [pdf, other]

Constraints on Lorentz invariance violation from the extraordinary Mrk 421 flare of 2014 using a novel analysis method

Authors: MAGIC Collaboration, S. Abe, J. Abhir, A. Abhishek, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete , et al. (192 additional authors not shown)

Abstract: The Lorentz Invariance Violation (LIV), a proposed consequence of certain quantum gravity (QG) scenarios, could instigate an energy-dependent group velocity for ultra-relativistic particles. This energy dependence, although suppressed by the massive QG energy scale $E_\mathrm{QG}$, expected to be on the level of the Planck energy $1.22 \times 10^{19}$ GeV, is potentially detectable in astrophysica… ▽ More The Lorentz Invariance Violation (LIV), a proposed consequence of certain quantum gravity (QG) scenarios, could instigate an energy-dependent group velocity for ultra-relativistic particles. This energy dependence, although suppressed by the massive QG energy scale $E_\mathrm{QG}$, expected to be on the level of the Planck energy $1.22 \times 10^{19}$ GeV, is potentially detectable in astrophysical observations. In this scenario, the cosmological distances traversed by photons act as an amplifier for this effect. By leveraging the observation of a remarkable flare from the blazar Mrk\,421, recorded at energies above 100 GeV by the MAGIC telescopes on the night of April 25 to 26, 2014, we look for time delays scaling linearly and quadratically with the photon energies. Using for the first time in LIV studies a binned-likelihood approach we set constraints on the QG energy scale. For the linear scenario, we set $95\%$ lower limits $E_\mathrm{QG}>2.7\times10^{17}$ GeV for the subluminal case and $E_\mathrm{QG}> 3.6 \times10^{17}$ GeV for the superluminal case. For the quadratic scenario, the $95\%$ lower limits for the subluminal and superluminal cases are $E_\mathrm{QG}>2.6 \times10^{10}$ GeV and $E_\mathrm{QG}>2.5\times10^{10}$ GeV, respectively. △ Less

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2404.11502 [pdf, other]

Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models

Authors: Yushuo Chen, Tianyi Tang, Erge Xiang, Linjiang Li, Wayne Xin Zhao, **g Wang, Yunpeng Chai, Ji-Rong Wen

Abstract: In real world, large language models (LLMs) can serve as the assistant to help users accomplish their jobs, and also support the development of advanced applications. For the wide application of LLMs, the inference efficiency is an essential concern, which has been widely studied in existing work, and numerous optimization algorithms and code libraries have been proposed to improve it. Nonetheless… ▽ More In real world, large language models (LLMs) can serve as the assistant to help users accomplish their jobs, and also support the development of advanced applications. For the wide application of LLMs, the inference efficiency is an essential concern, which has been widely studied in existing work, and numerous optimization algorithms and code libraries have been proposed to improve it. Nonetheless, users still find it challenging to compare the effectiveness of all the above methods and understand the underlying mechanisms. In this work, we perform a detailed coarse-to-fine analysis of the inference performance of various code libraries. To evaluate the overall effectiveness, we examine four usage scenarios within two practical applications. We further provide both theoretical and empirical fine-grained analyses of each module in the Transformer architecture. Our experiments yield comprehensive results that are invaluable for researchers to evaluate code libraries and improve inference strategies. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.10710 [pdf, other]

Dual Modalities of Text: Visual and Textual Generative Pre-training

Authors: Yekun Chai, Qingyi Liu, **gwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu

Abstract: Harnessing visual texts represents a burgeoning frontier in the evolution of language modeling. In this paper, we introduce a novel pre-training framework for a suite of pixel-based autoregressive language models, pre-training on a corpus of over 400 million documents rendered as RGB images. Our approach is characterized by a dual-modality training regimen, engaging both visual data through next p… ▽ More Harnessing visual texts represents a burgeoning frontier in the evolution of language modeling. In this paper, we introduce a novel pre-training framework for a suite of pixel-based autoregressive language models, pre-training on a corpus of over 400 million documents rendered as RGB images. Our approach is characterized by a dual-modality training regimen, engaging both visual data through next patch prediction with a regression head and textual data via next token prediction with a classification head. This study is particularly focused on investigating the synergistic interplay between visual and textual modalities of language. Our comprehensive evaluation across a diverse array of benchmarks reveals that the confluence of visual and textual data substantially augments the efficacy of pixel-based language models. Notably, our findings show that a unidirectional pixel-based model, devoid of textual data during training, can match the performance levels of advanced bidirectional pixel-based models on various language understanding benchmarks. This work highlights the considerable untapped potential of integrating visual and textual information for language modeling purposes. We will release our code, data, and checkpoints to inspire further research advancement. △ Less

Submitted 17 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

arXiv:2404.07840 [pdf, other]

On Training Data Influence of GPT Models

Authors: Qingyi Liu, Yekun Chai, Shuohuan Wang, Yu Sun, Qiwei Peng, Keze Wang, Hua Wu

Abstract: Amidst the rapid advancements in generative language models, the investigation of how training data shapes the performance of GPT models is still emerging. This paper presents GPTfluence, a novel approach that leverages a featurized simulation to assess the impact of training examples on the training dynamics of GPT models. Our approach not only traces the influence of individual training instance… ▽ More Amidst the rapid advancements in generative language models, the investigation of how training data shapes the performance of GPT models is still emerging. This paper presents GPTfluence, a novel approach that leverages a featurized simulation to assess the impact of training examples on the training dynamics of GPT models. Our approach not only traces the influence of individual training instances on performance trajectories, such as loss and other key metrics, on targeted test points but also enables a comprehensive comparison with existing methods across various training scenarios in GPT models, ranging from 14 million to 2.8 billion parameters, across a range of downstream tasks. Contrary to earlier methods that struggle with generalization to new data, GPTfluence introduces a parameterized simulation of training dynamics, demonstrating robust generalization capabilities to unseen training data. This adaptability is evident across both fine-tuning and instruction-tuning scenarios, spanning tasks in natural language understanding and generation. We will make our code and data publicly available. △ Less

Submitted 16 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

arXiv:2404.03659 [pdf, other]

Federated Unlearning for Human Activity Recognition

Authors: Kongyang Chen, Dong** zhang, Ya** Chai, Weibin Zhang, Shaowei Wang, Jiaxing Shen

Abstract: The rapid evolution of Internet of Things (IoT) technology has spurred the widespread adoption of Human Activity Recognition (HAR) in various daily life domains. Federated Learning (FL) is frequently utilized to build a global HAR model by aggregating user contributions without transmitting raw individual data. Despite substantial progress in user privacy protection with FL, challenges persist. Re… ▽ More The rapid evolution of Internet of Things (IoT) technology has spurred the widespread adoption of Human Activity Recognition (HAR) in various daily life domains. Federated Learning (FL) is frequently utilized to build a global HAR model by aggregating user contributions without transmitting raw individual data. Despite substantial progress in user privacy protection with FL, challenges persist. Regulations like the General Data Protection Regulation (GDPR) empower users to request data removal, raising a new query in FL: How can a HAR client request data removal without compromising other clients' privacy? In response, we propose a lightweight machine unlearning method for refining the FL HAR model by selectively removing a portion of a client's training data. Our method employs a third-party dataset unrelated to model training. Using KL divergence as a loss function for fine-tuning, we aim to align the predicted probability distribution on forgotten data with the third-party dataset. Additionally, we introduce a membership inference evaluation method to assess unlearning effectiveness. Experimental results across diverse datasets show our method achieves unlearning accuracy comparable to \textit{retraining} methods, resulting in speedups ranging from hundreds to thousands. △ Less

Submitted 17 January, 2024; originally announced April 2024.

arXiv:2404.00399 [pdf, other]

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Authors: Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Vu Minh Chien, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Junior, Alpay Ariyak , et al. (20 additional authors not shown)

Abstract: Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility. Initiatives such as BLOOM and StarCoder aim to democratize access to pretrained models for collaborative community development. However, such existing models face challenges: limited multilingual capabilities, continual pretraining causing catastrophic forgetting, where… ▽ More Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility. Initiatives such as BLOOM and StarCoder aim to democratize access to pretrained models for collaborative community development. However, such existing models face challenges: limited multilingual capabilities, continual pretraining causing catastrophic forgetting, whereas pretraining from scratch is computationally expensive, and compliance with AI safety and development laws. This paper presents Aurora-M, a 15B parameter multilingual open-source model trained on English, Finnish, Hindi, Japanese, Vietnamese, and code. Continually pretrained from StarCoderPlus on 435 billion additional tokens, Aurora-M surpasses 2 trillion tokens in total training token count. It is the first open-source multilingual model fine-tuned on human-reviewed safety instructions, thus aligning its development not only with conventional red-teaming considerations, but also with the specific concerns articulated in the Biden-Harris Executive Order on the Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence. Aurora-M is rigorously evaluated across various tasks and languages, demonstrating robustness against catastrophic forgetting and outperforming alternatives in multilingual settings, particularly in safety evaluations. To promote responsible open-source LLM development, Aurora-M and its variants are released at https://huggingface.co/collections/aurora-m/aurora-m-models-65fdfdff62471e09812f5407 . △ Less

Submitted 23 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

Comments: Preprint

arXiv:2403.04244 [pdf]

Resonant Quantum Magnetodielectric Effect in Multiferroic Metal-Organic Framework [CH3NH3]Co(HCOO)3

Authors: Na Su, Shuang Liu, Yingjie He, Yan Liu, Huixia Fu, Yi-Sheng Chai, Young Sun

Abstract: We report the observation of both resonant quantum tunneling of magnetization (RQTM) and resonant quantum magnetodielectric (RQMD) effect in the perovskite multiferroic metal-organic framework [CH3NH3]Co(HCOO)3. An intrinsic magnetic phase separation emerges at low temperatures due to hydrogen-bond-modified long range super-exchange interaction, leading to the coexistence of canted antiferromagnet… ▽ More We report the observation of both resonant quantum tunneling of magnetization (RQTM) and resonant quantum magnetodielectric (RQMD) effect in the perovskite multiferroic metal-organic framework [CH3NH3]Co(HCOO)3. An intrinsic magnetic phase separation emerges at low temperatures due to hydrogen-bond-modified long range super-exchange interaction, leading to the coexistence of canted antiferromagnetic order and single-ion magnet. Subsequently, a stair-shaped magnetic hysteresis loop along the [101] direction characterizing the RQTM appears below the magnetic blocking temperature. More interestingly, the magnetic field dependence of dielectric permittivity exhibits pronounced negative peaks at the critical fields corresponding to the RQTM, a phenomenon termed the RQMD effect which enables electrical detection of the RQTM. These intriguing properties make the multiferroic metal-organic framework a promising candidate for solid-state quantum computing. △ Less

Submitted 7 March, 2024; originally announced March 2024.

Comments: 13 pages, 4 figures

arXiv:2403.02159 [pdf, other]

doi 10.1093/mnras/stae649

The variability patterns of the TeV blazar PG 1553+113 from a decade of MAGIC and multi-band observations

Authors: MAGIC Collaboration, H. Abe, S. Abe, J. Abhir, V. A. Acciari, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, D. Baack, A. Babić, A. Baquero, U. Barres de Almeida, I. Batković, J. Baxter, J. Becerra González, E. Bernardini, J. Bernete, A. Berti, J. Besenrieder, C. Bigongiari , et al. (242 additional authors not shown)

Abstract: PG 1553+113 is one of the few blazars with a convincing quasi-periodic emission in the gamma-ray band. The source is also a very high-energy (VHE; >100 GeV) gamma-ray emitter. To better understand its properties and identify the underlying physical processes driving its variability, the MAGIC Collaboration initiated a multiyear, multiwavelength monitoring campaign in 2015 involving the OVRO 40-m a… ▽ More PG 1553+113 is one of the few blazars with a convincing quasi-periodic emission in the gamma-ray band. The source is also a very high-energy (VHE; >100 GeV) gamma-ray emitter. To better understand its properties and identify the underlying physical processes driving its variability, the MAGIC Collaboration initiated a multiyear, multiwavelength monitoring campaign in 2015 involving the OVRO 40-m and Medicina radio telescopes, REM, KVA, and the MAGIC telescopes, Swift and Fermi satellites, and the WEBT network. The analysis presented in this paper uses data until 2017 and focuses on the characterization of the variability. The gamma-ray data show a (hint of a) periodic signal compatible with literature, but the X-ray and VHE gamma-ray data do not show statistical evidence for a periodic signal. In other bands, the data are compatible with the gamma-ray period, but with a relatively high p-value. The complex connection between the low and high-energy emission and the non-monochromatic modulation and changes in flux suggests that a simple one-zone model is unable to explain all the variability. Instead, a model including a periodic component along with multiple emission zones is required. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: Accepted for publication in Monthly Notices of the Royal Astronomical Society. 19 pages, 9 figures. Corresponding authors: Elisa Prandini, Antonio Stamerra, Talvikki Hovatta

arXiv:2403.00920 [pdf, other]

Atacama Large Aperture Submillimeter Telescope (AtLAST) Science: Solar and stellar observations

Authors: Sven Wedemeyer, Miroslav Barta, Roman Brajsa, Yi Chai, Joaquim Costa, Dale Gary, Guillermo Gimenez de Castro, Stanislav Gunar, Gregory Fleishman, Antonio Hales, Hugh Hudson, Mats Kirkaune, Atul Mohan, Galina Motorina, Alberto Pellizzoni, Maryam Saberi, Caius L. Selhorst, Paulo J. A. Simoes, Masumi Shimojo, Ivica Skokic, Davor Sudar, Fabian Menezes, Stephen White, Mark Booth, Pamela Klaassen , et al. (13 additional authors not shown)

Abstract: Observations at (sub-)millimeter wavelengths offer a complementary perspective on our Sun and other stars, offering significant insights into both the thermal and magnetic composition of their chromospheres. Despite the fundamental progress in (sub-)millimeter observations of the Sun, some important aspects require diagnostic capabilities that are not offered by existing observatories. In particul… ▽ More Observations at (sub-)millimeter wavelengths offer a complementary perspective on our Sun and other stars, offering significant insights into both the thermal and magnetic composition of their chromospheres. Despite the fundamental progress in (sub-)millimeter observations of the Sun, some important aspects require diagnostic capabilities that are not offered by existing observatories. In particular, simultaneously observations of the radiation continuum across an extended frequency range would facilitate the map** of different layers and thus ultimately the 3D structure of the solar atmosphere. Map** large regions on the Sun or even the whole solar disk at a very high temporal cadence would be crucial for systematically detecting and following the temporal evolution of flares, while synoptic observations, i.e., daily maps, over periods of years would provide an unprecedented view of the solar activity cycle in this wavelength regime. As our Sun is a fundamental reference for studying the atmospheres of active main sequence stars, observing the Sun and other stars with the same instrument would unlock the enormous diagnostic potential for understanding stellar activity and its impact on exoplanets. The Atacama Large Aperture Submillimeter Telescope (AtLAST), a single-dish telescope with 50\,m aperture proposed to be built in the Atacama desert in Chile, would be able to provide these observational capabilities. Equipped with a large number of detector elements for probing the radiation continuum across a wide frequency range, AtLAST would address a wide range of scientific topics including the thermal structure and heating of the solar chromosphere, flares and prominences, and the solar activity cycle. In this white paper, the key science cases and their technical requirements for AtLAST are discussed. △ Less

Submitted 6 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

Comments: 14 pages, 4 figures, submitted to Open Research Europe as part of a collection on the Atacama Large Aperture Submillimeter Telescope (AtLAST)

arXiv:2402.19173 [pdf, other]

StarCoder 2 and The Stack v2: The Next Generation

Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data… ▽ More The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data sources, such as GitHub pull requests, Kaggle notebooks, and code documentation. This results in a training set that is 4x larger than the first StarCoder dataset. We train StarCoder2 models with 3B, 7B, and 15B parameters on 3.3 to 4.3 trillion tokens and thoroughly evaluate them on a comprehensive set of Code LLM benchmarks. We find that our small model, StarCoder2-3B, outperforms other Code LLMs of similar size on most benchmarks, and also outperforms StarCoderBase-15B. Our large model, StarCoder2- 15B, significantly outperforms other models of comparable size. In addition, it matches or outperforms CodeLlama-34B, a model more than twice its size. Although DeepSeekCoder- 33B is the best-performing model at code completion for high-resource languages, we find that StarCoder2-15B outperforms it on math and code reasoning benchmarks, as well as several low-resource languages. We make the model weights available under an OpenRAIL license and ensure full transparency regarding the training data by releasing the SoftWare Heritage persistent IDentifiers (SWHIDs) of the source code data. △ Less

Submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.16694 [pdf, other]

HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization

Authors: Qiwei Peng, Yekun Chai, Xuhong Li

Abstract: Large language models (LLMs) have made significant progress in generating codes from textual prompts. However, existing benchmarks have mainly concentrated on translating English prompts to multilingual codes or have been constrained to very limited natural languages (NLs). These benchmarks have overlooked the vast landscape of massively multilingual NL to multilingual code, leaving a critical gap… ▽ More Large language models (LLMs) have made significant progress in generating codes from textual prompts. However, existing benchmarks have mainly concentrated on translating English prompts to multilingual codes or have been constrained to very limited natural languages (NLs). These benchmarks have overlooked the vast landscape of massively multilingual NL to multilingual code, leaving a critical gap in the evaluation of multilingual LLMs. In response, we introduce HumanEval-XL, a massively multilingual code generation benchmark specifically crafted to address this deficiency. HumanEval-XL establishes connections between 23 NLs and 12 programming languages (PLs), and comprises of a collection of 22,080 prompts with an average of 8.33 test cases. By ensuring parallel data across multiple NLs and PLs, HumanEval-XL offers a comprehensive evaluation platform for multilingual LLMs, allowing the assessment of the understanding of different NLs. Our work serves as a pioneering step towards filling the void in evaluating NL generalization in the area of multilingual code generation. We make our evaluation code and data publicly available at \url{https://github.com/FloatAI/humaneval-xl}. △ Less

Submitted 24 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

Comments: LREC-COLING 2024

arXiv:2402.15583 [pdf, other]

Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving

Authors: Yichen Xie, Hongge Chen, Gregory P. Meyer, Yong Jae Lee, Eric M. Wolff, Masayoshi Tomizuka, Wei Zhan, Yuning Chai, Xin Huang

Abstract: Due to the lack of depth cues in images, multi-frame inputs are important for the success of vision-based perception, prediction, and planning in autonomous driving. Observations from different angles enable the recovery of 3D object states from 2D image inputs if we can identify the same instance in different input frames. However, the dynamic nature of autonomous driving scenes leads to signific… ▽ More Due to the lack of depth cues in images, multi-frame inputs are important for the success of vision-based perception, prediction, and planning in autonomous driving. Observations from different angles enable the recovery of 3D object states from 2D image inputs if we can identify the same instance in different input frames. However, the dynamic nature of autonomous driving scenes leads to significant changes in the appearance and shape of each instance captured by the camera at different time steps. To this end, we propose a novel contrastive learning algorithm, Cohere3D, to learn coherent instance representations in a long-term input sequence robust to the change in distance and perspective. The learned representation aids in instance-level correspondence across multiple input frames in downstream tasks. In the pretraining stage, the raw point clouds from LiDAR sensors are utilized to construct the long-term temporal correspondence for each instance, which serves as guidance for the extraction of instance-level representation from the vision-based bird's eye-view (BEV) feature map. Cohere3D encourages a consistent representation for the same instance at different frames but distinguishes between representations of different instances. We evaluate our algorithm by finetuning the pretrained model on various downstream perception, prediction, and planning tasks. Results show a notable improvement in both data efficiency and task performance. △ Less

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.14477 [pdf, other]

Pressure tunable magnetic skyrmion phase in Co8Zn8Mn4 single crystals

Authors: Zhun Li, Xinrun Mi, Xinming Wang, Jian Lyu, Na Su, Aifeng Wang, Yisheng Chai, Bao Yuan, Wanju Luo, Hui Cheng, Jianxiang Gao, Hongliang Wang, Lijie Hao, Mingquan He, Junying Shen, Young Sun, Xin Tong

Abstract: In a magnetic skyrmion phase, magnetic moments form vortex-like topological textures which are of both fundamental and industrial interests. In $β$-Mn-type Co-Zn-Mn alloys, chrial magnetic skyrmions emerge above room temperature, providing a unique system for studying the skrymion physics and exploring spintronics applications. However, the magnetic skyrmion phase is typically confined in a narrow… ▽ More In a magnetic skyrmion phase, magnetic moments form vortex-like topological textures which are of both fundamental and industrial interests. In $β$-Mn-type Co-Zn-Mn alloys, chrial magnetic skyrmions emerge above room temperature, providing a unique system for studying the skrymion physics and exploring spintronics applications. However, the magnetic skyrmion phase is typically confined in a narrow and limited temperature ($T$) and magnetic field ($H$) range. Here, we demonstrate that hydrostatic pressure can expand the skyrmion phase in the $T-H$ phase diagram of single-crystalline Co$_8$Zn$_8$Mn$_4$. At ambient pressure, signatures of skyrmions are seen within $T\sim302-308$ K and $H\sim50-100$ Oe. Applying a moderate pressure of 6 kbar extends this range to $T\sim300-310$ K and $H\sim50-150$ Oe. However, further escalation of pressure to 10 kbar results in a slight contraction of the skyrmion phase. These findings underscore the sensitivity of the skyrmion phase in Co$_8$Zn$_8$Mn$_4$ to external pressures, and hint at the potential of strain engineering, particularly in $β$-Mn-type Co-Zn-Mn thin films, as a promising avenue to customize the skyrmion phase. △ Less

Submitted 22 February, 2024; originally announced February 2024.

Comments: 7 pages, 4 figures

arXiv:2402.11164 [pdf]

TinyLIC-High efficiency lossy image compression method

Authors: Gaocheng Ma, Yinfeng Chai, Tianhao Jiang, Ming Lu, Tong Chen

Abstract: Image compression has been the subject of extensive research for several decades, resulting in the development of well-known standards such as JPEG, JPEG2000, and H.264/AVC. However, recent advancements in deep learning have led to the emergence of learned image compression methods that offer significant improvements in coding efficiency compared to traditional codecs. These learned compression te… ▽ More Image compression has been the subject of extensive research for several decades, resulting in the development of well-known standards such as JPEG, JPEG2000, and H.264/AVC. However, recent advancements in deep learning have led to the emergence of learned image compression methods that offer significant improvements in coding efficiency compared to traditional codecs. These learned compression techniques have shown noticeable gains and even outperformed traditional schemes △ Less

Submitted 16 February, 2024; originally announced February 2024.

arXiv:2402.10045 [pdf]

Short-Form Videos and Mental Health: A Knowledge-Guided Neural Topic Model

Authors: Jiaheng Xie, Ruicheng Liang, Yidong Chai, Yang Liu, Daniel Zeng

Abstract: While short-form videos head to reshape the entire social media landscape, experts are exceedingly worried about their depressive impacts on viewers, as evidenced by medical studies. To prevent widespread consequences, platforms are eager to predict these videos' impact on viewers' mental health. Subsequently, they can take intervention measures, such as revising recommendation algorithms and disp… ▽ More While short-form videos head to reshape the entire social media landscape, experts are exceedingly worried about their depressive impacts on viewers, as evidenced by medical studies. To prevent widespread consequences, platforms are eager to predict these videos' impact on viewers' mental health. Subsequently, they can take intervention measures, such as revising recommendation algorithms and displaying viewer discretion. Nevertheless, applicable predictive methods lack relevance to well-established medical knowledge, which outlines clinically proven external and environmental factors of depression. To account for such medical knowledge, we resort to an emergent methodological discipline, seeded Neural Topic Models (NTMs). However, existing seeded NTMs suffer from the limitations of single-origin topics, unknown topic sources, unclear seed supervision, and suboptimal convergence. To address those challenges, we develop a novel Knowledge-guided Multimodal NTM to predict a short-form video's depressive impact on viewers. Extensive empirical analyses using TikTok and Douyin datasets prove that our method outperforms state-of-the-art benchmarks. Our method also discovers medically relevant topics from videos that are linked to depressive impact. We contribute to IS with a novel video analytics method that is generalizable to other video classification problems. Practically, our method can help platforms understand videos' mental impacts, thus adjusting recommendations and video topic disclosure. △ Less

Submitted 21 March, 2024; v1 submitted 10 January, 2024; originally announced February 2024.

arXiv:2402.04755 [pdf, other]

doi 10.1093/mnras/stae697

Performance and first measurements of the MAGIC Stellar Intensity Interferometer

Authors: MAGIC Collaboration, S. Abe, J. Abhir, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, E. Bernardini, M. Bernardos, J. Bernete, A. Berti , et al. (195 additional authors not shown)

Abstract: In recent years, a new generation of optical intensity interferometers has emerged, leveraging the existing infrastructure of Imaging Atmospheric Cherenkov Telescopes (IACTs). The MAGIC telescopes host the MAGIC-SII system (Stellar Intensity Interferometer), implemented to investigate the feasibility and potential of this technique on IACTs. After the first successful measurements in 2019, the sys… ▽ More In recent years, a new generation of optical intensity interferometers has emerged, leveraging the existing infrastructure of Imaging Atmospheric Cherenkov Telescopes (IACTs). The MAGIC telescopes host the MAGIC-SII system (Stellar Intensity Interferometer), implemented to investigate the feasibility and potential of this technique on IACTs. After the first successful measurements in 2019, the system was upgraded and now features a real-time, dead-time-free, 4-channel, GPU-based correlator. These hardware modifications allow seamless transitions between MAGIC's standard very-high-energy gamma-ray observations and optical interferometry measurements within seconds. We establish the feasibility and potential of employing IACTs as competitive optical Intensity Interferometers with minimal hardware adjustments. The measurement of a total of 22 stellar diameters are reported, 9 corresponding to reference stars with previous comparable measurements, and 13 with no prior measurements. A prospective implementation involving telescopes from the forthcoming Cherenkov Telescope Array Observatory's northern hemisphere array, such as the first prototype of its Large-Sized Telescopes, LST-1, is technically viable. This integration would significantly enhance the sensitivity of the current system and broaden the UV-plane coverage. This advancement would enable the system to achieve competitive sensitivity with the current generation of long-baseline optical interferometers over blue wavelengths. △ Less

Submitted 7 February, 2024; originally announced February 2024.

Comments: 18 pages, 13 figures, submitted to MNRAS

arXiv:2401.16361 [pdf, other]

MIRI MRS Observations of Beta Pictoris I. The Inner Dust, the Planet, and the Gas

Authors: Kadin Worthen, Christine H. Chen, David R. Law, Cicero X. Lu, Kielan Hoch, Yiwei Chai, G. C. Sloan, B. A. Sargent, Jens Kammerer, Dean C. Hines, Isabel Rebollido, William O. Balmer, Marshall D. Perrin, Dan M. Watson, Laurent Pueyo, Julien H. Girard, Carey M. Lisse, Christopher C. Stark

Abstract: We present JWST MIRI Medium Resolution Spectrograph (MRS) observations of the $β$ Pictoris system. We detect an infrared excess from the central unresolved point source from 5 to 7.5 $μ$m which is indicative of dust within the inner $\sim$7 au of the system. We perform PSF subtraction on the MRS data cubes and detect a spatially resolved dust population emitting at 5 $μ$m. This spatially resolved… ▽ More We present JWST MIRI Medium Resolution Spectrograph (MRS) observations of the $β$ Pictoris system. We detect an infrared excess from the central unresolved point source from 5 to 7.5 $μ$m which is indicative of dust within the inner $\sim$7 au of the system. We perform PSF subtraction on the MRS data cubes and detect a spatially resolved dust population emitting at 5 $μ$m. This spatially resolved hot dust population is best explained if the dust grains are in the small grain limit (2$π$a$\ll$$λ$). The combination of unresolved and resolved dust at 5 $μ$m could suggest that dust grains are being produced in the inner few au of the system and are then radiatively driven outwards, where the particles could accrete onto the known planets in the system $β$ Pic b and c. We also report the detection of an emission line at 6.986 $μ$m that we attribute to be [Ar II]. We find that the [Ar II] emission is spatially resolved with JWST and appears to be aligned with the dust disk. Through PSF subtraction techniques, we detect $β$ Pic b at the 5$σ$ level in our MRS data cubes and present the first mid-IR spectrum of the planet from 5 to 7 $μ$m. The planet's spectrum is consistent with having absorption from water vapor between 5 and 6.5 $μ$m. We perform atmosphere model grid fitting on spectra and photometry of $β$ Pic b and find that the planet's atmosphere likely has a sub-stellar C/O ratio. △ Less

Submitted 29 January, 2024; originally announced January 2024.

Comments: Accepted for Publication in ApJ

arXiv:2401.14995 [pdf, other]

A ThermalKinetic Inductance Detectors Pixel Design for Cosmic Microwave Background Observations at 90/150 GHz bands

Authors: Ye Chai, Shibo Shu, Yong** Li, Jiamin Sun, Zhouhui Liu, Yu Xu, Daikang Yan, Zhengwei Li, Yang Liu, Yiwen Wang, Weijie Guo, Juexian Cao, Congzhan Liu

Abstract: The highly sensitive millimeter-wave telescope is an important tool for accurate measurement of Cosmic Microwave Background (CMB) radiation, and its core component is a detector array located in a cryogenic focal plane. The feasibility of utilizing thermal kinetic inductance detectors (TKIDs) for CMB observations has been demonstrated. We propose a pixel design of TKIDs for observing CMB through a… ▽ More The highly sensitive millimeter-wave telescope is an important tool for accurate measurement of Cosmic Microwave Background (CMB) radiation, and its core component is a detector array located in a cryogenic focal plane. The feasibility of utilizing thermal kinetic inductance detectors (TKIDs) for CMB observations has been demonstrated. We propose a pixel design of TKIDs for observing CMB through atmospheric windows for observations in the 90/150 GHz bands. Assuming lossless dielectric, the coupling efficiency of a single pixel is around 90%. This pixel design will be utilized for future large-scale TKIDs array designs for CMB observations. △ Less

Submitted 26 January, 2024; originally announced January 2024.

arXiv:2401.12988 [pdf]

Few-Shot Learning for Chronic Disease Management: Leveraging Large Language Models and Multi-Prompt Engineering with Medical Knowledge Injection

Authors: Haoxin Liu, Wenli Zhang, Jiaheng Xie, Buomsoo Kim, Zhu Zhang, Yidong Chai

Abstract: This study harnesses state-of-the-art AI technology for chronic disease management, specifically in detecting various mental disorders through user-generated textual content. Existing studies typically rely on fully supervised machine learning, which presents challenges such as the labor-intensive manual process of annotating extensive training data for each disease and the need to design speciali… ▽ More This study harnesses state-of-the-art AI technology for chronic disease management, specifically in detecting various mental disorders through user-generated textual content. Existing studies typically rely on fully supervised machine learning, which presents challenges such as the labor-intensive manual process of annotating extensive training data for each disease and the need to design specialized deep learning architectures for each problem. To address such challenges, we propose a novel framework that leverages advanced AI techniques, including large language models and multi-prompt engineering. Specifically, we address two key technical challenges in data-driven chronic disease management: (1) develo** personalized prompts to represent each user's uniqueness and (2) incorporating medical knowledge into prompts to provide context for chronic disease detection, instruct learning objectives, and operationalize prediction goals. We evaluate our method using four mental disorders, which are prevalent chronic diseases worldwide, as research cases. On the depression detection task, our method (F1 = 0.975~0.978) significantly outperforms traditional supervised learning paradigms, including feature engineering (F1 = 0.760) and architecture engineering (F1 = 0.756). Meanwhile, our approach demonstrates success in few-shot learning, i.e., requiring only a minimal number of training examples to detect chronic diseases based on user-generated textual content (i.e., only 2, 10, or 100 subjects). Moreover, our method can be generalized to other mental disorder detection tasks, including anorexia, pathological gambling, and self-harm (F1 = 0.919~0.978). △ Less

Submitted 16 January, 2024; originally announced January 2024.

MSC Class: K.5 ACM Class: I.2.7; H.4.m

arXiv:2401.08560 [pdf, other]

Insights into the broad-band emission of the TeV blazar Mrk 501 during the first X-ray polarization measurements

Authors: S. Abe, J. Abhir, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, M. Bernardos, J. Bernete, A. Berti, J. Besenrieder , et al. (239 additional authors not shown)

Abstract: We present the first multi-wavelength study of Mrk 501 including very-high-energy (VHE) gamma-ray observations simultaneous to X-ray polarization measurements from the Imaging X-ray Polarimetry Explorer (IXPE). We use radio-to-VHE data from a multi-wavelength campaign organized between 2022-03-01 and 2022-07-19. The observations were performed by MAGIC, Fermi-LAT, NuSTAR, Swift (XRT and UVOT), and… ▽ More We present the first multi-wavelength study of Mrk 501 including very-high-energy (VHE) gamma-ray observations simultaneous to X-ray polarization measurements from the Imaging X-ray Polarimetry Explorer (IXPE). We use radio-to-VHE data from a multi-wavelength campaign organized between 2022-03-01 and 2022-07-19. The observations were performed by MAGIC, Fermi-LAT, NuSTAR, Swift (XRT and UVOT), and several instruments covering the optical and radio bands. During the IXPE pointings, the VHE state is close to the average behavior with a 0.2-1 TeV flux of 20%-50% the emission of the Crab Nebula. Despite the average VHE activity, an extreme X-ray behavior is measured for the first two IXPE pointings in March 2022 with a synchrotron peak frequency >1 keV. For the third IXPE pointing in July 2022, the synchrotron peak shifts towards lower energies and the optical/X-ray polarization degrees drop. The X-ray polarization is systematically higher than at lower energies, suggesting an energy-stratification of the jet. While during the IXPE epochs the polarization angle in the X-ray, optical and radio bands align well, we find a clear discrepancy in the optical and radio polarization angles in the middle of the campaign. We model the broad-band spectra simultaneous to the IXPE pointings assuming a compact zone dominating in the X-rays and VHE, and an extended zone stretching further downstream the jet dominating the emission at lower energies. NuSTAR data allow us to precisely constrain the synchrotron peak and therefore the underlying electron distribution. The change between the different states observed in the three IXPE pointings can be explained by a change of magnetization and/or emission region size, which directly connects the shift of the synchrotron peak to lower energies with the drop in polarization degree. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: Accepted for publication in Astronomy & Astrophysics. 19 pages, 9 figures. Corresponding authors: Lea Heckmann, Axel Arbet Engels, David Paneque

arXiv:2401.07798 [pdf, other]

doi 10.1016/j.dark.2024.101425

Constraints on axion-like particles with the Perseus Galaxy Cluster with MAGIC

Authors: MAGIC Collaboration, H. Abe, S. Abe, J. Abhir, V. A. Acciari, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, D. Baack, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete, A. Berti , et al. (189 additional authors not shown)

Abstract: Axion-like particles (ALPs) are pseudo-Nambu-Goldstone bosons that emerge in various theories beyond the standard model. These particles can interact with high-energy photons in external magnetic fields, influencing the observed gamma-ray spectrum. This study analyzes 41.3 hrs of observational data from the Perseus Galaxy Cluster collected with the MAGIC telescopes. We focused on the spectra the r… ▽ More Axion-like particles (ALPs) are pseudo-Nambu-Goldstone bosons that emerge in various theories beyond the standard model. These particles can interact with high-energy photons in external magnetic fields, influencing the observed gamma-ray spectrum. This study analyzes 41.3 hrs of observational data from the Perseus Galaxy Cluster collected with the MAGIC telescopes. We focused on the spectra the radio galaxy in the center of the cluster: NGC 1275. By modeling the magnetic field surrounding this target, we searched for spectral indications of ALP presence. Despite finding no statistical evidence of ALP signatures, we were able to exclude ALP models in the sub-micro electronvolt range. Our analysis improved upon previous work by calculating the full likelihood and statistical coverage for all considered models across the parameter space. Consequently, we achieved the most stringent limits to date for ALP masses around 50 neV, with cross sections down to $g_{aγ} = 3 \times 10^{-12}$ GeV$^{-1}$. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: 25 pages, 10 figures, accepted for publication in Physics of the Dark Universe

arXiv:2401.05059 [pdf, ps, other]

DQ-integral and DL-integral generalized wheel graphs

Authors: Yirui Chai, Ligong Wang, Yuwei Zhou

Abstract: A graph G is said to be M-integral (resp. A-integral, D-integral, DL-integral or DQ-integral) if all eigenvalues of its matrix M (resp. adjacency matrix A(G), distance matrix D(G), distance Laplacian matrix DL(G) or distance signless Laplacian matrix DQ(G)) are integers. Lu et al. [Discrete Math, 346 (2023)] defined the generalized wheel graph GW(a, m, n) as the join of two regular graphs aKm and… ▽ More A graph G is said to be M-integral (resp. A-integral, D-integral, DL-integral or DQ-integral) if all eigenvalues of its matrix M (resp. adjacency matrix A(G), distance matrix D(G), distance Laplacian matrix DL(G) or distance signless Laplacian matrix DQ(G)) are integers. Lu et al. [Discrete Math, 346 (2023)] defined the generalized wheel graph GW(a, m, n) as the join of two regular graphs aKm and Cn, and obtained all D-integral generalized wheel graphs. Based on the above research, in this paper, we determine all DL-integral and DQ-integral generalized wheel graphs respectively. As byproducts, we give a sufficient and necessary condition for the join of two regular graphs G1 and G2 to be DL-integral, from which we can get infinitely many new classes of DL-integral graphs according to the large number of research results about the A-integral graphs. △ Less

Submitted 10 January, 2024; originally announced January 2024.

arXiv:2401.00410 [pdf]

doi 10.1007/s42864-022-00192-z

Electrical and thermal transport properties of kagome metals AV$_3$Sb$_5$ (A=K, Rb, Cs)

Authors: Xinrun Mi, Kunya Yang, Yuhan Gan, Long Zhang, Aifeng Wang, Yisheng Chai, Xiaoyuan Zhou, Mingquan He

Abstract: The interplay between lattice geometry, band topology and electronic correlations in the newly discovered kagome compounds AV$_3$Sb$_5$ (A=K, Rb, Cs) makes this family a novel playground to investigate emergent quantum phenomena, such as unconventional superconductivity, chiral charge density wave and electronic nematicity. These exotic quantum phases naturally leave nontrivial fingerprints in tra… ▽ More The interplay between lattice geometry, band topology and electronic correlations in the newly discovered kagome compounds AV$_3$Sb$_5$ (A=K, Rb, Cs) makes this family a novel playground to investigate emergent quantum phenomena, such as unconventional superconductivity, chiral charge density wave and electronic nematicity. These exotic quantum phases naturally leave nontrivial fingerprints in transport properties of AV$_3$Sb$_5$, both in electrical and thermal channels, which are prominent probes to uncover the underlying mechanisms. In this brief review, we highlight the unusual electrical and thermal transport properties observed in the unconventional charge ordered state of AV3Sb5, including giant anomalous Hall, anomalous Nernst, ambipolar Nernst and anomalous thermal Hall effects. Connections of these anomalous transport properties to time-reversal symmetry breaking, topological and multiband fermiology, as well as electronic nematicity, are also discussed. Finally, a perspective together with challenges of this rapid growing field are given. △ Less

Submitted 31 December, 2023; originally announced January 2024.

Comments: 34 pages,9 figures,an review article published in Tungsten 5,300(2023)

Journal ref: Tungsten 5,300(2023)

arXiv:2312.11276 [pdf, other]

Compositional Generalization for Multi-label Text Classification: A Data-Augmentation Approach

Authors: Yuyang Chai, Zhuang Li, Jiahui Liu, Lei Chen, Fei Li, Donghong Ji, Chong Teng

Abstract: Despite significant advancements in multi-label text classification, the ability of existing models to generalize to novel and seldom-encountered complex concepts, which are compositions of elementary ones, remains underexplored. This research addresses this gap. By creating unique data splits across three benchmarks, we assess the compositional generalization ability of existing multi-label text… ▽ More Despite significant advancements in multi-label text classification, the ability of existing models to generalize to novel and seldom-encountered complex concepts, which are compositions of elementary ones, remains underexplored. This research addresses this gap. By creating unique data splits across three benchmarks, we assess the compositional generalization ability of existing multi-label text classification models. Our results show that these models often fail to generalize to compositional concepts encountered infrequently during training, leading to inferior performance on tests with these new combinations. To address this, we introduce a data augmentation method that leverages two innovative text generation models designed to enhance the classification models' capacity for compositional generalization. Our experiments show that this data augmentation approach significantly improves the compositional generalization capabilities of classification models on our benchmarks, with both generation models surpassing other text generation baselines. △ Less

Submitted 20 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI'24

arXiv:2312.10732 [pdf, other]

First characterization of the emission behavior of Mrk421 from radio to VHE gamma rays with simultaneous X-ray polarization measurements

Authors: S. Abe, J. Abhir, V. A. Acciari, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete, A. Berti, J. Besenrieder, C. Bigongiari, A. Biland , et al. (229 additional authors not shown)

Abstract: We perform the first broadband study of Mrk421 from radio to TeV gamma rays with simultaneous measurements of the X-ray polarization from IXPE. The data were collected within an extensive multiwavelength campaign organized between May and June 2022 using MAGIC, Fermi-LAT, NuSTAR, XMM-Newton, Swift, and several optical and radio telescopes to complement IXPE. During the IXPE exposures, the measured… ▽ More We perform the first broadband study of Mrk421 from radio to TeV gamma rays with simultaneous measurements of the X-ray polarization from IXPE. The data were collected within an extensive multiwavelength campaign organized between May and June 2022 using MAGIC, Fermi-LAT, NuSTAR, XMM-Newton, Swift, and several optical and radio telescopes to complement IXPE. During the IXPE exposures, the measured 0.2-1 TeV flux is close to the quiescent state and ranges from 25% to 50% of the Crab Nebula without intra-night variability. Throughout the campaign, the VHE and X-ray emission are positively correlated at a $4σ$ significance level. The IXPE measurements unveil a X-ray polarization degree that is a factor of 2-5 higher than in the optical/radio bands; that implies an energy-stratified jet in which the VHE photons are emitted co-spatially with the X-rays, in the vicinity of a shock front. The June 2022 observations exhibit a rotation of the X-ray polarization angle. Despite no simultaneous VHE coverage being available during a large fraction of the swing, the Swift-XRT monitoring unveils an X-ray flux increase with a clear spectral hardening. It suggests that flares in high synchrotron peaked blazars can be accompanied by a polarization angle rotation, as observed in some flat spectrum radio quasars. Finally, during the polarization angle rotation, NuSTAR data reveal two contiguous spectral hysteresis loops in opposite directions (clockwise and counter-clockwise), implying important changes in the particle acceleration efficiency on $\sim$hour timescales. △ Less

Submitted 17 December, 2023; originally announced December 2023.

Comments: Accepted for publication in Astronomy & Astrophysics. 29 pages, 22 figures. Corresponding authors: Axel Arbet Engels, Felix Schmuckermaier, David Paneque

arXiv:2312.07235 [pdf, other]

doi 10.1109/QCE57702.2023.10268

Gaussian Boson Sampling for binary optimization

Authors: Jean Cazalis, Yahui Chai, Karl Jansen, Stefan Kühn, Tirth Shah

Abstract: In this study, we consider a Gaussian Boson Sampler for solving a Flight Gate Assignment problem. We employ a Variational Quantum Eigensolver approach using the Conditional Value-at-risk cost function. We provide proof of principle by carrying out numerical simulations on randomly generated instances. In this study, we consider a Gaussian Boson Sampler for solving a Flight Gate Assignment problem. We employ a Variational Quantum Eigensolver approach using the Conditional Value-at-risk cost function. We provide proof of principle by carrying out numerical simulations on randomly generated instances. △ Less

Submitted 12 December, 2023; originally announced December 2023.

Comments: 3 pages, 2 figures, presented at 2023 IEEE International Conference on Quantum Computing and Engineering (QCE)

Journal ref: in 2023 IEEE International Conference on Quantum Computing and Engineering (QCE), Bellevue, WA, USA, 2023 pp. 332-333

arXiv:2312.02272 [pdf, other]

Fermionic wave packet scattering: a quantum computing approach

Authors: Yahui Chai, Arianna Crippa, Karl Jansen, Stefan Kühn, Vincent R. Pascuzzi, Francesco Tacchino, Ivano Tavernelli

Abstract: We propose a method to prepare Gaussian wave packets with momentum on top of the interacting ground state of a fermionic Hamiltonian. Using Givens rotation, we show how to efficiently obtain expectation values of observables throughout the evolution of the wave packets on digital quantum computers. We demonstrate our technique by applying it to the staggered lattice formulation of the Thirring mod… ▽ More We propose a method to prepare Gaussian wave packets with momentum on top of the interacting ground state of a fermionic Hamiltonian. Using Givens rotation, we show how to efficiently obtain expectation values of observables throughout the evolution of the wave packets on digital quantum computers. We demonstrate our technique by applying it to the staggered lattice formulation of the Thirring model and studying the scattering of two wave packets. Monitoring the the particle density and the entropy produced during the scattering process, we characterize the phenomenon and provide a first step towards studying more complicated collision processes on digital quantum computers. In addition, we perform a small-scale demonstration on IBM's quantum hardware, showing that our method is suitable for current and near-term quantum devices. △ Less

Submitted 20 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

arXiv:2312.00784 [pdf, other]

ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Authors: Mu Cai, Haotian Liu, Dennis Park, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Yong Jae Lee

Abstract: While existing large vision-language multimodal models focus on whole image understanding, there is a prominent gap in achieving region-specific comprehension. Current approaches that use textual coordinates or spatial encodings often fail to provide a user-friendly interface for visual prompting. To address this challenge, we introduce a novel multimodal model capable of decoding arbitrary visual… ▽ More While existing large vision-language multimodal models focus on whole image understanding, there is a prominent gap in achieving region-specific comprehension. Current approaches that use textual coordinates or spatial encodings often fail to provide a user-friendly interface for visual prompting. To address this challenge, we introduce a novel multimodal model capable of decoding arbitrary visual prompts. This allows users to intuitively mark images and interact with the model using natural cues like a "red bounding box" or "pointed arrow". Our simple design directly overlays visual markers onto the RGB image, eliminating the need for complex region encodings, yet achieves state-of-the-art performance on region-understanding tasks like Visual7W, PointQA, and Visual Commonsense Reasoning benchmark. Furthermore, we present ViP-Bench, a comprehensive benchmark to assess the capability of models in understanding visual prompts across multiple dimensions, enabling future research in this domain. Code, data, and model are publicly available. △ Less

Submitted 26 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

Comments: Accepted to CVPR2024. Project page: https://vip-llava.github.io/

arXiv:2311.13951 [pdf, other]

MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria

Authors: Wentao Ge, Shunian Chen, Guiming Hardy Chen, Zhihong Chen, Junying Chen, Shuo Yan, Chenghao Zhu, Ziyue Lin, Wenya Xie, Xinyi Zhang, Yichen Chai, Xiaoyu Liu, Dingjie Song, Xidong Wang, Anningzhe Gao, Zhiyi Zhang, Jianquan Li, Xiang Wan, Benyou Wang

Abstract: Multimodal large language models (MLLMs) (e.g., GPT-4V, LLaVA, and Claude-3) have broadened the scope of AI applications. Yet, evaluating their performance presents a significant challenge owing to the inherently subjective nature of tasks that do not yield clear-cut solutions especially for those open-ended queries. Existing automatic evaluation methodologies are mainly limited in evaluating obje… ▽ More Multimodal large language models (MLLMs) (e.g., GPT-4V, LLaVA, and Claude-3) have broadened the scope of AI applications. Yet, evaluating their performance presents a significant challenge owing to the inherently subjective nature of tasks that do not yield clear-cut solutions especially for those open-ended queries. Existing automatic evaluation methodologies are mainly limited in evaluating objective queries without considering real-world user experiences, inadequately addressing the nuances of creative and associative multimodal tasks. In our paper, we propose a new evaluation paradigm for MLLMs, which is evaluating MLLMs with \textit{per-sample criteria} using potent MLLM as the judge. To validate the feasibility and effectiveness of this paradigm, we design a benchmark, dubbed \textit{MLLM-Bench}, with the evaluation samples across six critical levels following the revised Bloom's Taxonomy with the ethical consideration. We benchmark 21 popular MLLMs in a pairwise-comparison fashion, showing diverse performance across models. Moreover, the validity of our benchmark manifests itself in reaching 88.02\% agreement with human evaluation. We contend that the proposed paradigm explores the potential of MLLMs as effective evaluation tools with the help of per-sample criteria, and that MLLM-Bench will serve as a catalyst for encouraging the development of user-centric MLLMs tailored to real-world applications. Our benchmark data, online leaderboard and submission entry are at https://mllm-bench.llmzoo.com. △ Less

Submitted 27 April, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

Comments: 23 pages

arXiv:2310.06473 [pdf, other]

doi 10.1093/mnras/stad2958

MAGIC detection of GRB 201216C at $z=1.1$

Authors: H. Abe, S. Abe, V. A. Acciari, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, D. Baack, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete, A. Berti, J. Besenrieder, C. Bigongiari , et al. (195 additional authors not shown)

Abstract: Gamma-ray bursts (GRBs) are explosive transient events occurring at cosmological distances, releasing a large amount of energy as electromagnetic radiation over several energy bands. We report the detection of the long GRB~201216C by the MAGIC telescopes. The source is located at $z=1.1$ and thus it is the farthest one detected at very high energies. The emission above \SI{70}{\GeV} of GRB~201216C… ▽ More Gamma-ray bursts (GRBs) are explosive transient events occurring at cosmological distances, releasing a large amount of energy as electromagnetic radiation over several energy bands. We report the detection of the long GRB~201216C by the MAGIC telescopes. The source is located at $z=1.1$ and thus it is the farthest one detected at very high energies. The emission above \SI{70}{\GeV} of GRB~201216C is modelled together with multi-wavelength data within a synchrotron and synchrotron-self Compton (SSC) scenario. We find that SSC can explain the broadband data well from the optical to the very-high-energy band. For the late-time radio data, a different component is needed to account for the observed emission. Differently from previous GRBs detected in the very-high-energy range, the model for GRB~201216C strongly favors a wind-like medium. The model parameters have values similar to those found in past studies of the afterglows of GRBs detected up to GeV energies. △ Less

Submitted 10 October, 2023; originally announced October 2023.

Comments: 13 pages, 6 figures, 2 tables. Accepted for publication in Monthly Notices of the Royal Astronomical Society

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 527, Issue 3, January 2024, Pages 5856-5867

arXiv:2310.03922 [pdf, other]

doi 10.1051/0004-6361/202347845

Multi-year characterisation of the broad-band emission from the intermittent extreme BL Lac 1ES~2344+514

Authors: H. Abe, S. Abe, V. A. Acciari, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, D. Baack, A. Babić, A. Baquero, U. Barres de Almeida, I. Batković, J. Baxter, J. Becerra González, E. Bernardini, J. Bernete, A. Berti, J. Besenrieder, C. Bigongiari, A. Biland, O. Blanch , et al. (210 additional authors not shown)

Abstract: The BL Lac 1ES 2344+514 is known for temporary extreme properties (e.g., a shift of the synchrotron SED peak energy $ν_{synch,p}$ above 1keV). While those extreme states were so far observed only during high flux levels, additional multi-year observing campaigns are required to achieve a coherent picture. Here, we report the longest investigation of the source from radio to VHE performed so far, f… ▽ More The BL Lac 1ES 2344+514 is known for temporary extreme properties (e.g., a shift of the synchrotron SED peak energy $ν_{synch,p}$ above 1keV). While those extreme states were so far observed only during high flux levels, additional multi-year observing campaigns are required to achieve a coherent picture. Here, we report the longest investigation of the source from radio to VHE performed so far, focusing on a systematic characterisation of the intermittent extreme states. While our results confirm that 1ES 2344+514 typically exhibits $ν_{synch,p}>$1keV during elevated flux periods, we also find periods where the extreme state coincides with low flux activity. A strong spectral variability thus happens in the quiescent state, and is likely caused by an increase of the electron acceleration efficiency without a change in the electron injection luminosity. We also report a strong X-ray flare (among the brightest for 1ES 2344+514) without a significant shift of $ν_{synch,p}$. During this particular flare, the X-ray spectrum is among the softest of the campaign. It unveils complexity in the spectral evolution, where the common harder-when-brighter trend observed in BL Lacs is violated. During a low and hard X-ray state, we find an excess of the UV flux with respect to an extrapolation of the X-ray spectrum to lower energies. This UV excess implies that at least two regions contribute significantly to the infrared/optical/ultraviolet/X-ray emission. Using the simultaneous MAGIC, XMM-Newton, NuSTAR, and AstroSat observations, we argue that a region possibly associated with the 10 GHz radio core may explain such an excess. Finally, we investigate a VHE flare, showing an absence of simultaneous variability in the 0.3-2keV band. Using a time-dependent leptonic modelling, we show that this behaviour, in contradiction to single-zone scenarios, can instead be explained by a two-component model. △ Less

Submitted 5 October, 2023; originally announced October 2023.

Comments: Accepted for publication in Astronomy & Astrophysics

Journal ref: A&A 682, A114 (2024)

arXiv:2310.01954 [pdf, other]

doi 10.1051/0004-6361/202346927

Performance of the joint LST-1 and MAGIC observations evaluated with Crab Nebula data

Authors: H. Abe, K. Abe, S. Abe, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, N. Alvarez Crespo, T. Aniello, S. Ansoldi, L. A. Antonelli, C. Aramo, A. Arbet-Engels, C. Arcaro, M. Artero, K. Asano, P. Aubert, D. Baack, A. Babić, A. Baktash, A. Bamba, A. Baquero Larriva, L. Baroncelli, U. Barres de Almeida, J. A. Barrio, I. Batković , et al. (344 additional authors not shown)

Abstract: Aims. LST-1, the prototype of the Large-Sized Telescope for the upcoming Cherenkov Telescope Array Observatory, is concluding its commissioning in Observatorio del Roque de los Muchachos on the island of La Palma. The proximity of LST-1 (Large-Sized Telescope 1) to the two MAGIC (Major Atmospheric Gamma Imaging Cherenkov) telescopes permits observations of the same gamma-ray events with both syste… ▽ More Aims. LST-1, the prototype of the Large-Sized Telescope for the upcoming Cherenkov Telescope Array Observatory, is concluding its commissioning in Observatorio del Roque de los Muchachos on the island of La Palma. The proximity of LST-1 (Large-Sized Telescope 1) to the two MAGIC (Major Atmospheric Gamma Imaging Cherenkov) telescopes permits observations of the same gamma-ray events with both systems. Methods. We describe the joint LST-1+MAGIC analysis pipeline and use simultaneous Crab Nebula observations and Monte Carlo simulations to assess the performance of the three-telescope system. The addition of the LST-1 telescope allows the recovery of events in which one of the MAGIC images is too dim to survive analysis quality cuts. Results. Thanks to the resulting increase in the collection area and stronger background rejection, we find a significant improvement in sensitivity, allowing the detection of 30% weaker fluxes in the energy range between 200 GeV and 3 TeV. The spectrum of the Crab Nebula, reconstructed in the energy range ~60 GeV to ~10 TeV, is in agreement with previous measurements. △ Less

Submitted 3 October, 2023; originally announced October 2023.

Comments: Accepted for publication in Astronomy & Astrophysics

Journal ref: A&A 680, A66 (2023)

arXiv:2310.01045 [pdf, other]

Tool-Augmented Reward Modeling

Authors: Lei Li, Yekun Chai, Shuohuan Wang, Yu Sun, Hao Tian, Ningyu Zhang, Hua Wu

Abstract: Reward modeling (a.k.a., preference modeling) is instrumental for aligning large language models with human preferences, particularly within the context of reinforcement learning from human feedback (RLHF). While conventional reward models (RMs) have exhibited remarkable scalability, they oft struggle with fundamental functionality such as arithmetic computation, code execution, and factual lookup… ▽ More Reward modeling (a.k.a., preference modeling) is instrumental for aligning large language models with human preferences, particularly within the context of reinforcement learning from human feedback (RLHF). While conventional reward models (RMs) have exhibited remarkable scalability, they oft struggle with fundamental functionality such as arithmetic computation, code execution, and factual lookup. In this paper, we propose a tool-augmented preference modeling approach, named Themis, to address these limitations by empowering RMs with access to external environments, including calculators and search engines. This approach not only fosters synergy between tool utilization and reward grading but also enhances interpretive capacity and scoring reliability. Our study delves into the integration of external tools into RMs, enabling them to interact with diverse external sources and construct task-specific tool engagement and reasoning traces in an autoregressive manner. We validate our approach across a wide range of domains, incorporating seven distinct external tools. Our experimental results demonstrate a noteworthy overall improvement of 17.7% across eight tasks in preference ranking. Furthermore, our approach outperforms Gopher 280B by 7.3% on TruthfulQA task in zero-shot evaluation. In human evaluations, RLHF trained with Themis attains an average win rate of 32% when compared to baselines across four distinct tasks. Additionally, we provide a comprehensive collection of tool-related RM datasets, incorporating data from seven distinct tool APIs, totaling 15,000 instances. We have made the code, data, and model checkpoints publicly available to facilitate and inspire further research advancements\footnote{\url{https://github.com/ernie-research/Tool-Augmented-Reward-Model}}. △ Less

Submitted 11 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: ICLR 2024 Spotlight

arXiv:2309.16148 [pdf, other]

OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions

Authors: ** Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han

Abstract: One-shot talking head generation has no explicit head movement reference, thus it is difficult to generate talking heads with head motions. Some existing works only edit the mouth area and generate still talking heads, leading to unreal talking head performance. Other works construct one-to-one map** between audio signal and head motion sequences, introducing ambiguity correspondences into the m… ▽ More One-shot talking head generation has no explicit head movement reference, thus it is difficult to generate talking heads with head motions. Some existing works only edit the mouth area and generate still talking heads, leading to unreal talking head performance. Other works construct one-to-one map** between audio signal and head motion sequences, introducing ambiguity correspondences into the map** since people can behave differently in head motions when speaking the same content. This unreasonable map** form fails to model the diversity and produces either nearly static or even exaggerated head motions, which are unnatural and strange. Therefore, the one-shot talking head generation task is actually a one-to-many ill-posed problem and people present diverse head motions when speaking. Based on the above observation, we propose OSM-Net, a \textit{one-to-many} one-shot talking head generation network with natural head motions. OSM-Net constructs a motion space that contains rich and various clip-level head motion features. Each basis of the space represents a feature of meaningful head motion in a clip rather than just a frame, thus providing more coherent and natural motion changes in talking heads. The driving audio is mapped into the motion space, around which various motion features can be sampled within a reasonable range to achieve the one-to-many map**. Besides, the landmark constraint and time window feature input improve the accurate expression feature extraction and video generation. Extensive experiments show that OSM-Net generates more natural realistic head motions under reasonable one-to-many map** paradigm compared with other methods. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Comments: Paper Under Review

arXiv:2309.14614 [pdf]

Multiferroic Magnon Spin-Torque Based Reconfigurable Logic-In-Memory

Authors: Yahong Chai, Yuhan Liang, Cancheng Xiao, Yue Wang, Bo Li, Dingsong Jiang, Pratap Pal, Yongjian Tang, Hetian Chen, Yuejie Zhang, Witold Skowroński, Qinghua Zhang, Lin Gu, **g Ma, Pu Yu, Jianshi Tang, Yuan-Hua Lin, Di Yi, Daniel C. Ralph, Chang-Beom Eom, Huaqiang Wu, Tianxiang Nan

Abstract: Magnons, bosonic quasiparticles carrying angular momentum, can flow through insulators for information transmission with minimal power dissipation. However, it remains challenging to develop a magnon-based logic due to the lack of efficient electrical manipulation of magnon transport. Here we present a magnon logic-in-memory device in a spin-source/multiferroic/ferromagnet structure, where multife… ▽ More Magnons, bosonic quasiparticles carrying angular momentum, can flow through insulators for information transmission with minimal power dissipation. However, it remains challenging to develop a magnon-based logic due to the lack of efficient electrical manipulation of magnon transport. Here we present a magnon logic-in-memory device in a spin-source/multiferroic/ferromagnet structure, where multiferroic magnon modes can be electrically excited and controlled. In this device, magnon information is encoded to ferromagnetic bits by the magnon-mediated spin torque. We show that the ferroelectric polarization can electrically modulate the magnon spin-torque by controlling the non-collinear antiferromagnetic structure in multiferroic bismuth ferrite thin films with coupled antiferromagnetic and ferroelectric orders. By manipulating the two coupled non-volatile state variables (ferroelectric polarization and magnetization), we further demonstrate reconfigurable logic-in-memory operations in a single device. Our findings highlight the potential of multiferroics for controlling magnon information transport and offer a pathway towards room-temperature voltage-controlled, low-power, scalable magnonics for in-memory computing. △ Less

Submitted 25 September, 2023; originally announced September 2023.

arXiv:2309.12113 [pdf, other]

Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives

Authors: Feng Li, Yuqi Chai, Huan Yang, Pengfei Hu, Lingjie Duan

Abstract: How to incentivize strategic workers using limited budget is a very fundamental problem for crowdsensing systems; nevertheless, since the sensing abilities of the workers may not always be known as prior knowledge due to the diversities of their sensor devices and behaviors, it is difficult to properly select and pay the unknown workers. Although the uncertainties of the workers can be addressed b… ▽ More How to incentivize strategic workers using limited budget is a very fundamental problem for crowdsensing systems; nevertheless, since the sensing abilities of the workers may not always be known as prior knowledge due to the diversities of their sensor devices and behaviors, it is difficult to properly select and pay the unknown workers. Although the uncertainties of the workers can be addressed by the standard Combinatorial Multi-Armed Bandit (CMAB) framework in existing proposals through a trade-off between exploration and exploitation, we may not have sufficient budget to enable the trade-off among the individual workers, especially when the number of the workers is huge while the budget is limited. Moreover, the standard CMAB usually assumes the workers always stay in the system, whereas the workers may join in or depart from the system over time, such that what we have learnt for an individual worker cannot be applied after the worker leaves. To address the above challenging issues, in this paper, we first propose an off-line Context-Aware CMAB-based Incentive (CACI) mechanism. We innovate in leveraging the exploration-exploitation trade-off in an elaborately partitioned context space instead of the individual workers, to effectively incentivize the massive unknown workers with a very limited budget. We also extend the above basic idea to the on-line setting where unknown workers may join in or depart from the systems dynamically, and propose an on-line version of the CACI mechanism. We perform rigorous theoretical analysis to reveal the upper bounds on the regrets of our CACI mechanisms and to prove their truthfulness and individual rationality, respectively. Extensive experiments on both synthetic and real datasets are also conducted to verify the efficacy of our mechanisms. △ Less

Submitted 2 January, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

arXiv:2309.11903 [pdf]

Full mesh networking technology with peer to peer grid topology based on variable parameter full dimensional space

Authors: Wenqiang Song, Chuan He, Zhaoyang Xie, Yuanyuan Chai

Abstract: The continuous development of computer network technology has accelerated the pace of informatization, and at the same time, network security issues are becoming increasingly prominent. Networking technology with different network topologies is one of the important means to solve network security problems. The security of VPN is based on the division of geographical boundaries, but the granularity… ▽ More The continuous development of computer network technology has accelerated the pace of informatization, and at the same time, network security issues are becoming increasingly prominent. Networking technology with different network topologies is one of the important means to solve network security problems. The security of VPN is based on the division of geographical boundaries, but the granularity is relatively coarse, which is difficult to cope with the dynamic changes of the security situation. Zero trust network solves the VPN problem through peer to peer authorization and continuous verification, but most of the solutions use a central proxy device, resulting in the central node becoming the bottleneck of the network. This paper put forward the hard-Nat traversal formula based on the birthday paradox, which solves the long-standing problem of hard NAT traversal. A full mesh networking mechanism with variable parameter full-dimensional spatial peer-to-peer grid topology was proposed, which covers all types of networking schemes and achieve peer-2-peer resource interconnection on both methodological and engineering level. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: 9th International Conference on Networks & Communications (NWCOM 2023)

arXiv:2309.09686 [pdf, other]

Simulating the flight gate assignment problem on a trapped ion quantum computer

Authors: Yahui Chai, Evgeny Epifanovsky, Karl Jansen, Ananth Kaushik, Stefan Kühn

Abstract: We study the flight gate assignment problem on IonQ's Aria trapped ion quantum computer using the variational quantum eigensolver. Utilizing the conditional value at risk as an aggregation function, we demonstrate that current trapped ion quantum hardware is able to obtain good solutions for this combinatorial optimization problem with high probability. In particular, we run the full variational q… ▽ More We study the flight gate assignment problem on IonQ's Aria trapped ion quantum computer using the variational quantum eigensolver. Utilizing the conditional value at risk as an aggregation function, we demonstrate that current trapped ion quantum hardware is able to obtain good solutions for this combinatorial optimization problem with high probability. In particular, we run the full variational quantum eigensolver for small instances and we perform inference runs for larger systems, demonstrating that current and near-future quantum hardware is suitable for addressing combinatorial optimization problems. △ Less

Submitted 18 September, 2023; originally announced September 2023.

Comments: 7 pages, 4 figures

arXiv:2309.05810 [pdf, other]

SHIFT3D: Synthesizing Hard Inputs For Tricking 3D Detectors

Authors: Hongge Chen, Zhao Chen, Gregory P. Meyer, Dennis Park, Carl Vondrick, Ashish Shrivastava, Yuning Chai

Abstract: We present SHIFT3D, a differentiable pipeline for generating 3D shapes that are structurally plausible yet challenging to 3D object detectors. In safety-critical applications like autonomous driving, discovering such novel challenging objects can offer insight into unknown vulnerabilities of 3D detectors. By representing objects with a signed distanced function (SDF), we show that gradient error s… ▽ More We present SHIFT3D, a differentiable pipeline for generating 3D shapes that are structurally plausible yet challenging to 3D object detectors. In safety-critical applications like autonomous driving, discovering such novel challenging objects can offer insight into unknown vulnerabilities of 3D detectors. By representing objects with a signed distanced function (SDF), we show that gradient error signals allow us to smoothly deform the shape or pose of a 3D object in order to confuse a downstream 3D detector. Importantly, the objects generated by SHIFT3D physically differ from the baseline object yet retain a semantically recognizable shape. Our approach provides interpretable failure modes for modern 3D object detectors, and can aid in preemptive discovery of potential safety risks within 3D perception systems before these risks become critical failures. △ Less

Submitted 11 September, 2023; originally announced September 2023.

Comments: Accepted by ICCV 2023

arXiv:2308.16635 [pdf, other]

MFR-Net: Multi-faceted Responsive Listening Head Generation via Denoising Diffusion Model

Authors: ** Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han

Abstract: Face-to-face communication is a common scenario including roles of speakers and listeners. Most existing research methods focus on producing speaker videos, while the generation of listener heads remains largely overlooked. Responsive listening head generation is an important task that aims to model face-to-face communication scenarios by generating a listener head video given a speaker video and… ▽ More Face-to-face communication is a common scenario including roles of speakers and listeners. Most existing research methods focus on producing speaker videos, while the generation of listener heads remains largely overlooked. Responsive listening head generation is an important task that aims to model face-to-face communication scenarios by generating a listener head video given a speaker video and a listener head image. An ideal generated responsive listening video should respond to the speaker with attitude or viewpoint expressing while maintaining diversity in interaction patterns and accuracy in listener identity information. To achieve this goal, we propose the \textbf{M}ulti-\textbf{F}aceted \textbf{R}esponsive Listening Head Generation Network (MFR-Net). Specifically, MFR-Net employs the probabilistic denoising diffusion model to predict diverse head pose and expression features. In order to perform multi-faceted response to the speaker video, while maintaining accurate listener identity preservation, we design the Feature Aggregation Module to boost listener identity features and fuse them with other speaker-related features. Finally, a renderer finetuned with identity consistency loss produces the final listening head videos. Our extensive experiments demonstrate that MFR-Net not only achieves multi-faceted responses in diversity and speaker identity information but also in attitude and viewpoint expression. △ Less

Submitted 31 August, 2023; originally announced August 2023.

Comments: Accepted by ACM MM 2023

arXiv:2308.15012 [pdf, other]

SALI: A Scalable Adaptive Learned Index Framework based on Probability Models

Authors: Jiake Ge, Huanchen Zhang, Boyu Shi, Yuanhui Luo, Yunda Guo, Yunpeng Chai, Yuxing Chen, Anqun Pan

Abstract: The growth in data storage capacity and the increasing demands for high performance have created several challenges for concurrent indexing structures. One promising solution is learned indexes, which use a learning-based approach to fit the distribution of stored data and predictively locate target keys, significantly improving lookup performance. Despite their advantages, prevailing learned inde… ▽ More The growth in data storage capacity and the increasing demands for high performance have created several challenges for concurrent indexing structures. One promising solution is learned indexes, which use a learning-based approach to fit the distribution of stored data and predictively locate target keys, significantly improving lookup performance. Despite their advantages, prevailing learned indexes exhibit constraints and encounter issues of scalability on multi-core data storage. This paper introduces SALI, the Scalable Adaptive Learned Index framework, which incorporates two strategies aimed at achieving high scalability, improving efficiency, and enhancing the robustness of the learned index. Firstly, a set of node-evolving strategies is defined to enable the learned index to adapt to various workload skews and enhance its concurrency performance in such scenarios. Secondly, a lightweight strategy is proposed to maintain statistical information within the learned index, with the goal of further improving the scalability of the index. Furthermore, to validate their effectiveness, SALI applied the two strategies mentioned above to the learned index structure that utilizes fine-grained write locks, known as LIPP. The experimental results have demonstrated that SALI significantly enhances the insertion throughput with 64 threads by an average of 2.04x compared to the second-best learned index. Furthermore, SALI accomplishes a lookup throughput similar to that of LIPP+. △ Less

Submitted 4 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

Comments: Accepted by Conference SIGMOD 24, June 09-15, 2024, Santiago, Chile

arXiv:2308.12560 [pdf, other]

NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects

Authors: Dakshit Agrawal, Jiajie Xu, Siva Karthik Mustikovela, Ioannis Gkioulekas, Ashish Shrivastava, Yuning Chai

Abstract: We propose a novel-view augmentation (NOVA) strategy to train NeRFs for photo-realistic 3D composition of dynamic objects in a static scene. Compared to prior work, our framework significantly reduces blending artifacts when inserting multiple dynamic objects into a 3D scene at novel views and times; achieves comparable PSNR without the need for additional ground truth modalities like optical flow… ▽ More We propose a novel-view augmentation (NOVA) strategy to train NeRFs for photo-realistic 3D composition of dynamic objects in a static scene. Compared to prior work, our framework significantly reduces blending artifacts when inserting multiple dynamic objects into a 3D scene at novel views and times; achieves comparable PSNR without the need for additional ground truth modalities like optical flow; and overall provides ease, flexibility, and scalability in neural composition. Our codebase is on GitHub. △ Less

Submitted 24 August, 2023; originally announced August 2023.

Comments: Accepted for publication in ICCV Computer Vision for Metaverse Workshop 2023 (code is available at https://github.com/dakshitagrawal/NoVA)

arXiv:2308.12467 [pdf, other]

doi 10.1103/PhysRevB.108.L060407

First-principles demonstration of Roman surface topological multiferroicity

Authors: Ziwen Wang, Yisheng Chai, Shuai Dong

Abstract: The concept of topology has been widely applied to condensed matter, going beyond the band crossover in reciprocal spaces. A recent breakthrough suggested unconventional topological physics in a quadruple perovskite TbMn$_3$Cr$_4$O$_{12}$, whose magnetism-induced polarization manifests a unique Roman surface topology [Nat. Commun. \textbf{13}, 2373 (2022)]. However, the available experimental evid… ▽ More The concept of topology has been widely applied to condensed matter, going beyond the band crossover in reciprocal spaces. A recent breakthrough suggested unconventional topological physics in a quadruple perovskite TbMn$_3$Cr$_4$O$_{12}$, whose magnetism-induced polarization manifests a unique Roman surface topology [Nat. Commun. \textbf{13}, 2373 (2022)]. However, the available experimental evidence based on tiny polarizations of polycrystalline samples is far from sufficient. Here, this topological multiferroicity is demonstrated by using density functional theory calculations, which ideally confirms the Roman surface trajectory of magnetism-induced polarization. In addition, an alternative material in this category is proposed to systematically enhance the performance, by promoting its magnetism-induced polarization to an easily detectable level. △ Less

Submitted 23 August, 2023; originally announced August 2023.

Comments: 6 pages, 4 figures

Journal ref: Physical Review B 108, L060407 (2023)

Showing 1–50 of 243 results for author: Chai, Y