Search | arXiv e-print repository

A new subclass of gamma-ray burst originating from compact binary merger

Authors: Chen-Wei Wang, Wen-Jun Tan, Shao-Lin Xiong, Shu-Xu Yi, Rahim Moradi, Bing Li, Zhen Zhang, Yu Wang, Yan-Zhi Meng, Jia-Cong Liu, Yue Wang, Sheng-Lun Xie, Wang-Chen Xue, Zheng-Hang Yu, Peng Zhang, Wen-Long Zhang, Yan-Qiu Zhang, Chao Zheng

Abstract: Type I gamma-ray bursts (GRBs) are believed to originate from compact binary merger usually with duration less than 2 seconds for the main emission. However, recent observations of GRB 211211A and GRB 230307A indicate that some merger-origin GRBs could last much longer. Since they show strikingly similar properties (indicating a common mechanism) which are different from the classic "long"-short burst (e.g. GRB 060614), forming an interesting subclass of type I GRBs, we suggest to name them as type IL GRBs. By identifying the first peak of GRB 230307A as a quasi-thermal precursor, we find that the prompt emission of type IL GRB is composed of three episodes: (1) a precursor followed by a short quiescent (or weak emission) period, (2) a long-duration main emission, and (3) an extended emission. With this burst pattern, a good candidate, GRB 170228A, was found in the Fermi/GBM archive data, and subsequent temporal and spectral analyses indeed show that GRB 170228A falls in the same cluster with GRB 211211A and GRB 230307A in many diagnostic figures. Thus this burst pattern could be a good reference for rapidly identifying type IL GRB and conducting low-latency follow-up observation. We estimated the occurrence rate and discussed the physical origins and implications for the three emission episodes of type IL GRBs. Our analysis suggests the pre-merger precursor model, especially the super flare model, is more favored for type IL GRBs.

Submitted 2 July, 2024; originally announced July 2024.

arXiv:2406.13764 [pdf, other]

Can LLMs Reason in the Wild with Programs?

Authors: Yuan Yang, Siheng Xiong, Ali Payani, Ehsan Shareghi, Faramarz Fekri

Abstract: Large Language Models (LLMs) have shown superior capability to solve reasoning problems with programs. While being a promising direction, most of such frameworks are trained and evaluated in settings with a prior knowledge of task requirements. However, as LLMs become more capable, it is necessary to assess their reasoning abilities in more realistic scenarios where many real-world problems are open-ended with ambiguous scope, and often require multiple formalisms to solve. To investigate this, we introduce the task of reasoning in the wild, where an LLM is tasked to solve a reasoning problem of unknown type by identifying the subproblems and their corresponding formalisms, and writing a program to solve each subproblem, guided by a tactic. We create a large tactic-guided trajectory dataset containing detailed solutions to a diverse set of reasoning problems, ranging from well-defined single-form reasoning (e.g., math, logic), to ambiguous and hybrid ones (e.g., commonsense, combined math and logic). This allows us to test various aspects of LLMs reasoning at the fine-grained level such as the selection and execution of tactics, and the tendency to take undesired shortcuts. In experiments, we highlight that existing LLMs fail significantly on problems with ambiguous and mixed scope, revealing critical limitations and overfitting issues (e.g. accuracy on GSM8K drops by at least 50\%). We further show the potential of finetuning a local LLM on the tactic-guided trajectories in achieving better performance.
Project repo is available at github.com/gblackout/Reason-in-the-Wild

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.07824 [pdf, other]

Efficient Arbitrated Quantum Digital Signature with Multi-Receiver Verification

Authors: Siyu Xiong, Bangying Tang, Hui Han, **quan Huang, Mingqiang Bai, Fangzhao Li, Wanrong Yu, Zhiwen Mo, Bo Liu

Abstract: Quantum digital signature is used to authenticate the identity of the signer with information theoretical security, while providing non-forgery and non-repudiation services. In traditional multi-receiver quantum digital signature schemes without an arbitrator, the transferability of one-to-one signature is always required to achieve unforgeability, with complicated implementation and heavy key consumption. In this article, we propose an arbitrated quantum digital signature scheme, in which the signature can be verified by multiple receivers simultaneously, and meanwhile, the transferability of the signature is still kept. Our scheme can be easily applied to various quantum secure networks, due to the proposed efficient signature calculation procedure with low secure key consumption and low computation complexity, by employing one-time universal hashing algorithm and one-time pad encryption scheme. The evaluation results show that our scheme uses at least two orders of magnitude less key than existing signature schemes with transferability when signing files of the same length with the same number of receivers and security parameter settings.

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2406.04652 [pdf, other]

Quantum state preparation for a velocity field based on the spherical Clebsch wave function

Authors: Hao Su, Shiying Xiong, Yue Yang

Abstract: We propose a method for preparing the quantum state for a given velocity field, e.g., in fluid dynamics, via the spherical Clebsch wave function (SCWF). Using the pointwise normalization constraint for the SCWF, we develop a variational ansatz comprising parameterized controlled rotation gates. Employing the variational quantum algorithm, we iteratively optimize the circuit parameters to transform the target velocity field into the SCWF and its corresponding discrete quantum state, enabling subsequent quantum simulation of fluid dynamics. Validations for one- and two-dimensional flow fields confirm the accuracy and robustness of our method, emphasizing its effectiveness in handling multiscale and multidimensional velocity fields. Our method is able to capture critical flow features like sources, sinks, and saddle points. Furthermore, it enables the generation of SCWFs for various vector fields, which can then be applied in quantum simulations through SCWF evolution.

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.04375 [pdf, other]

Verifying components of Arm(R) Confidential Computing Architecture with ESBMC

Authors: Tong Wu, Shale Xiong, Edoardo Manino, Gareth Stockwell, Lucas C. Cordeiro

Abstract: Realm Management Monitor (RMM) is an essential firmware component within the recent Arm Confidential Computing Architecture (Arm CCA). Previous work applies formal techniques to verify the specification and prototype reference implementation of RMM. However, relying solely on a single verification tool may lead to the oversight of certain bugs or vulnerabilities. This paper discusses the application of ESBMC, a state-of-the-art Satisfiability Modulo Theories (SMT)-based software model checker, to further enhance RMM verification. We demonstrate ESBMC's ability to precisely parse the source code and identify specification failures within a reasonable time frame. Moreover, we propose potential improvements for ESBMC to enhance its efficiency for industry engineers. This work contributes to exploring the capabilities of formal verification techniques in real-world scenarios and suggests avenues for further improvements to better meet industrial verification needs.

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2405.20137 [pdf, other]

A unified framework of principal component analysis and factor analysis

Authors: Shifeng Xiong

Abstract: Principal component analysis and factor analysis are fundamental multivariate analysis methods. In this paper a unified framework to connect them is introduced. Under a general latent variable model, we present matrix optimization problems from the viewpoint of loss function minimization, and show that the two methods can be viewed as solutions to the optimization problems with specific loss functions. Specifically, principal component analysis can be derived from a broad class of loss functions including the L2 norm, while factor analysis corresponds to a modified L0 norm problem. Related problems are discussed, including algorithms, penalized maximum likelihood estimation under the latent variable model, and a principal component factor model. These results can lead to new tools of data analysis and research topics.

Submitted 30 May, 2024; originally announced May 2024.

Comments: 24 pages, 2 figures

MSC Class: 62H25

arXiv:2405.17240 [pdf, other]

Content-Style Decoupling for Unsupervised Makeup Transfer without Generating Pseudo Ground Truth

Authors: Zhaoyang Sun, Shengwu Xiong, Yaxiong Chen, Yi Rong

Abstract: The absence of real targets to guide the model training is one of the main problems with the makeup transfer task. Most existing methods tackle this problem by synthesizing pseudo ground truths (PGTs). However, the generated PGTs are often sub-optimal and their imprecision will eventually lead to performance degradation. To alleviate this issue, in this paper, we propose a novel Content-Style Decoupled Makeup Transfer (CSD-MT) method, which works in a purely unsupervised manner and thus eliminates the negative effects of generating PGTs. Specifically, based on the frequency characteristics analysis, we assume that the low-frequency (LF) component of a face image is more associated with its makeup style information, while the high-frequency (HF) component is more related to its content details. This assumption allows CSD-MT to decouple the content and makeup style information in each face image through the frequency decomposition. After that, CSD-MT realizes makeup transfer by maximizing the consistency of these two types of information between the transferred result and input images, respectively. Two newly designed loss functions are also introduced to further improve the transfer performance. Extensive quantitative and qualitative analyses show the effectiveness of our CSD-MT method. Our code is available at https://github.com/Snowfallingplum/CSD-MT.

Submitted 27 May, 2024; originally announced May 2024.

Comments: Accepted by CVPR2024

arXiv:2405.12977 [pdf, other]

Implication of Jet Physics from MeV Line Emission of GRB 221009A

Authors: Zhen Zhang, Haoxiang Lin, Zhuo Li, Shao-Lin Xiong, Yan-Qiu Zhang, Qinyuan Zhang, Shu-Xu Yi, Xilu Wang

Abstract: Ultra-relativistic jets are believed to play an important role in producing the prompt emission and afterglow of gamma-ray bursts (GRBs), but the nature of the jet is poorly known owing to the lack of decisive features observed in the prompt emission. A series of bright, narrow and regularly-evolving MeV emission lines detected in the brightest-of-all-time GRB 221009A provides an unprecedented opportunity to probe GRB jet physics. The time evolution of the central energy of the line with power-law index $-1$ is naturally explained by the high-latitude curvature effect. Under the assumption that the line emission is generated in the prompt emission by $e^\pm$ pair production, cooling and annihilation in the jet, we can strictly constrain jet physics with observed line emission properties. We find the radius of the emission region is $r\sim10^{16}$cm. The narrow line width of $10\%$ implies that pairs cool fast down to a non-relativistic state within a tenth of the dynamical time. This requires a magnetic-field energy density much larger than the prompt gamma-ray energy density in the jet, implying a magnetic-field-dominated jet. The temporal behavior of the line flux suggests some angle dependence of the line emission. We also discuss the difficulties of other scenarios to interpret the observed emission line.

Submitted 27 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

Comments: 8 pages, 3 figures, submitted to ApJL, comments welcome

arXiv:2405.09672 [pdf, other]

doi 10.1145/3658180

Eulerian-Lagrangian Fluid Simulation on Particle Flow Maps

Authors: Junwei Zhou, Duowen Chen, Molin Deng, Yitong Deng, Yuchen Sun, Sinan Wang, Shiying Xiong, Bo Zhu

Abstract: We propose a novel Particle Flow Map (PFM) method to enable accurate long-range advection for incompressible fluid simulation. The foundation of our method is the observation that a particle trajectory generated in a forward simulation naturally embodies a perfect flow map. Centered on this concept, we have developed an Eulerian-Lagrangian framework comprising four essential components: Lagrangian particles for a natural and precise representation of bidirectional flow maps; a dual-scale map representation to accommodate the mapping of various flow quantities; a particle-to-grid interpolation scheme for accurate quantity transfer from particles to grid nodes; and a hybrid impulse-based solver to enforce incompressibility on the grid. The efficacy of PFM has been demonstrated through various simulation scenarios, highlighting the evolution of complex vortical structures and the details of turbulent flows. Notably, compared to NFM, PFM reduces computing time by up to 49 times and memory consumption by up to 41%, while enhancing vorticity preservation as evidenced in various tests like leapfrog, vortex tube, and turbulent flow.

Submitted 15 May, 2024; originally announced May 2024.

arXiv:2404.16425 [pdf, other]

Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a, whose bright peak was also detected by the Swift Burst Alert Telescope and Konus-Wind through off-line analyses. At a redshift of $z=4.859$, EP240315a showed a much longer and more complicated light curve in the soft X-ray band than in gamma-rays. Benefiting from a large field-of-view ($\sim$3600 deg$^2$) and a high sensitivity, EP-WXT captured the earlier engine activation and extended late engine activity through a continuous detection. With a peak X-ray flux at the faint end of previously known high-$z$ GRBs, the detection of EP240315a demonstrates the great potential for EP to study the early universe via GRBs.

Submitted 25 April, 2024; originally announced April 2024.

Comments: 41 pages, 8 figures, 7 tables

arXiv:2404.15878 [pdf, other]

Simulating unsteady fluid flows on a superconducting quantum processor

Authors: Zhaoyuan Meng, Jiarun Zhong, Shibo Xu, Ke Wang, Jiachen Chen, Feitong **, Xuhao Zhu, Yu Gao, Yaozu Wu, Chuanyu Zhang, Ning Wang, Yiren Zou, Aosai Zhang, Zhengyi Cui, Fanhao Shen, Zehang Bao, Zitian Zhu, Ziqi Tan, Tingting Li, Pengfei Zhang, Shiying Xiong, Hekang Li, Qiujiang Guo, Zhen Wang, Chao Song , et al. (2 additional authors not shown)

Abstract: Recent advancements of intermediate-scale quantum processors have triggered tremendous interest in the exploration of practical quantum advantage. The simulation of fluid dynamics, a highly challenging problem in classical physics but vital for practical applications, emerges as a good candidate for showing quantum utility. Here, we report an experiment on the digital simulation of unsteady flows, which consists of quantum encoding, evolution, and detection of flow states, with a superconducting quantum processor. The quantum algorithm is based on the Hamiltonian simulation using the hydrodynamic formulation of the Schrödinger equation. With the median fidelities of 99.97% and 99.67% for parallel single- and two-qubit gates respectively, we simulate the dynamics of a two-dimensional (2D) compressible diverging flow and a 2D decaying vortex with ten qubits. The experimental results well capture the temporal evolution of averaged density and momentum profiles, and qualitatively reproduce spatial flow fields with moderate noises. This work demonstrates the potential of quantum computing in simulating more complex flows, such as turbulence, for practical applications.

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.11877 [pdf, other]

doi 10.3847/1538-4357/ad4093

Finding the Particularity of the Active Episode of SGR J1935+2154 during Which FRB 20200428 Occurred: Implication from Statistics of Fermi/GBM X-Ray Bursts

Authors: Sheng-Lun Xie, Yun-Wei Yu, Shao-Lin Xiong, Lin Lin, ** Wang, Yi Zhao, Yue Wang, Wen-Long Zhang

Abstract: By using the Fermi/Gamma-ray Burst Monitor data of the X-ray bursts (XRBs) of SGR J1935+2154, we investigate the temporal clustering of the bursts and the cumulative distribution of the waiting time and fluence/flux. It is found that the bursts occurring in the episode hosting FRB 20200428 have obviously shorter waiting times than those in the other episodes. The general statistical properties of the XRBs further indicate they could belong to a self-organized critical (SOC) system (e.g., starquakes), making them very similar to the earthquake phenomena. Then, according to a unified scaling law between the waiting time and energy of the earthquakes as well as their aftershocks, we implement an analogy analysis on the XRBs and find that the FRB episode owns more dependent burst events than the other episodes. It is indicated that the fast radio burst (FRB) emission could be produced by the interaction between different burst events, which could correspond to a collision between different seismic/Alfven waves or different explosion outflows. Such a situation could appear when the magnetar enters into a global intensive activity period.

Submitted 8 June, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.03229 [pdf, other]

Relation between the keV-MeV and TeV emission of GRB 221009A and its implications

Authors: Yan-Qiu Zhang, Hao-Xiang Lin, Shao-Lin Xiong, Zhuo Li, Ming-Yu Ge, Chen-Wei Wang, Shu-Xu Yi, Zhen Zhang, Shuang-Nan Zhang, Li-Ming Song, Chao Zheng, Wang-Chen Xue, Jia-Cong Liu, Wen-Jun Tan, Yue Wang, Wen-Long Zhang

Abstract: Gamma-ray bursts (GRBs) are believed to launch relativistic jets, which generate prompt emission by their internal processes and drive external shocks into surrounding medium, accounting for the long-lasting afterglow emission. However, how the jet powers the external shock is an open question. The unprecedented observations of the keV-MeV emission with GECAM and the TeV emission with LHAASO of so far the brightest burst, GRB 221009A, offer a great opportunity to study the prompt-to-afterglow transition and the early dynamical evolution of the external shock. In this letter, we find that the cumulative light curve of keV-MeV emission could well fit the rising stage of the TeV light curve of GRB 221009A, with a time delay of $4.45^{+0.26}_{-0.26}$\,s for TeV emission. Moreover, both the rapid increase in the initial stage and the excess from about \T+260\,s to 270\,s in the TeV light curve could be interpreted by inverse Compton (IC) scatterings of the inner-coming photons by the energetic electrons in external shock. Our results not only reveal a close relation between the keV-MeV and TeV emission, but also indicate a continuous, rather than impulsive, energy injection to the external shock. Assuming an energy injection rate proportional to the keV-MeV flux, we build a continuous energy injection model which well fits the TeV light curve of GRB 221009A, and provides an estimate of the Lorentz factor of the jet.

Submitted 4 April, 2024; originally announced April 2024.

arXiv:2404.03181 [pdf, other]

MonoCD: Monocular 3D Object Detection with Complementary Depths

Authors: Longfei Yan, Pei Yan, Shengzhou Xiong, Xuanyu Xiang, Yihua Tan

Abstract: Monocular 3D object detection has attracted widespread attention due to its potential to accurately obtain object 3D localization from a single image at a low cost. Depth estimation is an essential but challenging subtask of monocular 3D object detection due to the ill-posedness of 2D to 3D mapping. Many methods explore multiple local depth clues such as object heights and keypoints and then formulate the object depth estimation as an ensemble of multiple depth predictions to mitigate the insufficiency of single-depth information. However, the errors of existing multiple depths tend to have the same sign, which hinders them from neutralizing each other and limits the overall accuracy of combined depth. To alleviate this problem, we propose to increase the complementarity of depths with two novel designs. First, we add a new depth prediction branch named complementary depth that utilizes global and efficient depth clues from the entire image rather than the local clues to reduce the correlation of depth predictions. Second, we propose to fully exploit the geometric relations between multiple depth clues to achieve complementarity in form. Benefiting from these designs, our method achieves higher complementarity. Experiments on the KITTI benchmark demonstrate that our method achieves state-of-the-art performance without introducing extra data. In addition, complementary depth can also be a lightweight and plug-and-play module to boost multiple existing monocular 3D object detectors. Code is available at https://github.com/elvintanhust/MonoCD.

Submitted 3 April, 2024; originally announced April 2024.

Comments: Accepted to CVPR 2024

arXiv:2404.00667 [pdf, other]

Weakly-Supervised Cross-Domain Segmentation of Electron Microscopy with Sparse Point Annotation

Authors: Dafei Qiu, Shan Xiong, Jia** Yi, Jialin Peng

Abstract: Accurate segmentation of organelle instances from electron microscopy (EM) images plays an essential role in many neuroscience researches. However, practical scenarios usually suffer from high annotation costs, label scarcity, and large domain diversity. While unsupervised domain adaptation (UDA) that assumes no annotation effort on the target data is promising to alleviate these challenges, its performance on complicated segmentation tasks is still far from practical usage. To address these issues, we investigate a highly annotation-efficient weak supervision, which assumes only sparse center-points on a small subset of object instances in the target training images. To achieve accurate segmentation with partial point annotations, we introduce instance counting and center detection as auxiliary tasks and design a multitask learning framework to leverage correlations among the counting, detection, and segmentation, which are all tasks with partial or no supervision. Building upon the different domain-invariances of the three tasks, we enforce counting estimation with a novel soft consistency loss as a global prior for center detection, which further guides the per-pixel segmentation. To further compensate for annotation sparsity, we develop a cross-position cut-and-paste for label augmentation and an entropy-based pseudo-label selection.
The experimental results highlight that, by simply using extremely weak annotation, e.g., 15\% sparse points, for model training, the proposed model is capable of significantly outperforming UDA methods and produces comparable performance as the supervised counterpart. The high robustness of our model shown in the validations and the low requirement of expert knowledge for sparse point annotation further improve the potential application value of our model.

Submitted 31 March, 2024; originally announced April 2024.

arXiv:2403.13737 [pdf, ps, other]

EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation

Authors: Atnafu Lambebo Tonja, Israel Abebe Azime, Tadesse Destaw Belay, Mesay Gemeda Yigezu, Moges Ahmed Mehamed, Abinew Ali Ayele, Ebrahim Chekol Jibril, Michael Melese Woldeyohannis, Olga Kolesnikova, Philipp Slusallek, Dietrich Klakow, Shengwu Xiong, Seid Muhie Yimam

Abstract: Large language models (LLMs) have gained popularity recently due to their outstanding performance in various downstream Natural Language Processing (NLP) tasks. However, low-resource languages are still lagging behind current state-of-the-art (SOTA) developments in the field of NLP due to insufficient resources to train LLMs. Ethiopian languages exhibit remarkable linguistic diversity, encompassing a wide array of scripts, and are imbued with profound religious and cultural significance. This paper introduces EthioLLM -- multilingual large language models for five Ethiopian languages (Amharic, Ge'ez, Afan Oromo, Somali, and Tigrinya) and English, and Ethiobenchmark -- a new benchmark dataset for various downstream NLP tasks. We evaluate the performance of these models across five downstream NLP tasks. We open-source our multilingual language models, new benchmark datasets for various downstream tasks, and task-specific fine-tuned language models and discuss the performance of the models. Our dataset and models are available at the https://huggingface.co/EthioNLP repository.

Submitted 23 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

Comments: Accepted at LREC-Coling 2024

arXiv:2403.12851 [pdf, other]

doi 10.1007/s11433-023-2381-0

Observation of spectral lines in the exceptional GRB 221009A

Authors: Yan-Qiu Zhang, Shao-Lin Xiong, Ji-Rong Mao, Shuang-Nan Zhang, Wang-Chen Xue, Chao Zheng, Jia-Cong Liu, Zhen Zhang, Xi-Lu Wang, Ming-Yu Ge, Shu-Xu Yi, Li-Ming Song, Zheng-Hua An, Ce Cai, Xin-Qiao Li, Wen-Xi Peng, Wen-Jun Tan, Chen-Wei Wang, Xiang-Yang Wen, Yue Wang, Shuo Xiao, Fan Zhang, Peng Zhang, Shi-Jie Zheng

Abstract: As the brightest gamma-ray burst ever observed, GRB 221009A provided a precious opportunity to explore spectral line features. In this paper, we performed a comprehensive spectroscopy analysis of GRB 221009A jointly with GECAM-C and Fermi/GBM data to search for emission and absorption lines. For the first time we investigated the line feature throughout this GRB, including the brightest part where many instruments suffered problems, and identified prominent emission lines in multiple time intervals. The central energy of the Gaussian emission line evolves from about 37 MeV to 6 MeV, with a nearly constant ratio (about 10%) between the line width and central energy. In particular, we find that both the central energy and the energy flux of the emission line evolve with time as a power-law decay, with power-law indices of -1 and -2 respectively. We suggest that the observed emission lines most likely originate from the blue-shifted electron-positron pair annihilation 511 keV line. We find that a standard high-latitude emission scenario cannot fully interpret the observation, thus we propose that the emission line comes from some dense clumps with electron-positron pairs traveling together with the jet. In this scenario, we can use the emission line to directly, for the first time, measure the bulk Lorentz factor of the jet ($Γ$) and reveal its time evolution (i.e. $Γ\sim t^{-1}$) during the prompt emission. Interestingly, we find that the flux of the annihilation line in the co-moving frame stays constant.
These discoveries of the spectral line features shed new and important light on the physics of GRBs and relativistic jets.

Submitted 28 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

Comments: Accepted by SCIENCE CHINA Physics, Mechanics & Astronomy (SCPMA)

Journal ref: Observation of spectral lines in the exceptional GRB 221009A. Sci. China-Phys. Mech. Astron. 67, 289511 (2024)
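
As a rough illustration of the interpretation in the abstract above: if the observed line is the 511 keV pair-annihilation line shifted to higher energy, the ratio of observed to rest energy gives an effective blue-shift factor. The following is a minimal sketch, not the authors' analysis. The function names, the optional redshift argument `z`, and the reference time `t0` are assumptions introduced here, and converting this factor into a bulk Lorentz factor depends on the viewing geometry, which the abstract does not specify.

```python
# Hypothetical helper names; only the 511 keV rest energy, the ~37 MeV and
# ~6 MeV line energies, and the power-law index of -1 come from the abstract.
ELECTRON_REST_ENERGY_MEV = 0.511


def blueshift_factor(e_obs_mev, z=0.0):
    """Ratio of the redshift-corrected line energy to 511 keV.

    z is an assumed cosmological redshift (default 0, i.e. no correction).
    """
    return e_obs_mev * (1.0 + z) / ELECTRON_REST_ENERGY_MEV


def power_law(t, value_at_t0, t0, index):
    """Power-law evolution: value_at_t0 * (t / t0) ** index."""
    return value_at_t0 * (t / t0) ** index


if __name__ == "__main__":
    for e_obs in (37.0, 6.0):
        factor = blueshift_factor(e_obs)
        print(f"E_obs = {e_obs:4.1f} MeV -> blue-shift factor ~ {factor:.1f}")
    # With index -1, doubling the elapsed time halves the line energy.
    print(power_law(2.0, 37.0, 1.0, -1.0))
```

With no redshift correction, line energies of about 37 MeV and 6 MeV correspond to blue-shift factors of roughly 72 and 12 in this simple picture.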

arXiv:2402.17282 [pdf, other]

doi 10.1051/0004-6361/202449200

Distribution of number of peaks within a long gamma-ray burst

Authors: C. Guidorzi, M. Sartori, R. Maccary, A. Tsvetkova, L. Amati, L. Bazzanini, M. Bulla, A. E. Camisasca, L. Ferro, F. Frontera, C. K. Li, S. L. Xiong, S. N. Zhang

Abstract: The variety of long duration gamma-ray burst (LGRB) light curves (LCs) encodes a wealth of information on how LGRB engines release energy following the collapse of the progenitor star. Attempts to characterise GRB LCs focused on a number of properties, such as the minimum variability timescale, power density spectra (both ensemble average and individual), or different definitions of variability. In parallel, a characterisation as a stochastic process was pursued by studying the distributions of waiting times, peak flux, and fluence of individual peaks within GRB time profiles. Yet, the question remains as to whether the diversity of profiles can be described in terms of a common stochastic process. Here we address this issue by studying for the first time the distribution of the number of peaks in a GRB profile. We used four different GRB catalogues: CGRO/BATSE, Swift/BAT, BeppoSAX/GRBM, and Insight-HXMT. The statistically significant peaks were identified by means of the well-tested algorithm MEPSA and further selected by applying a set of thresholds on signal-to-noise ratio. We then extracted the corresponding distributions of number of peaks per GRB.
Among the different models considered (power-law, simple or stretched exponential), only a mixture of two exponentials models all the observed distributions, suggesting the existence of two distinct behaviours: (i) an average number of 2.1 ± 0.1 peaks per GRB ("peak poor"), accounting for about 80% of the observed population of GRBs; (ii) an average number of 8.3 ± 1.0 peaks per GRB ("peak rich"), accounting for the remaining 20% of the observed population. We associate the class of peak-rich GRBs with the presence of sub-second variability, which seems to be absent among peak-poor GRBs. The two classes could result from two different regimes through which GRB engines release energy or through which energy is dissipated into gamma-rays.

Submitted 27 February, 2024; originally announced February 2024.

Comments: 7 pages, 5 figures, accepted by A&A

Journal ref: A&A 685, A34 (2024)
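
The two-component picture in the abstract above can be checked with simple arithmetic. This is a minimal sketch that assumes the quoted fractions (about 80% and 20%) act as mixture weights and the quoted averages (2.1 and 8.3 peaks) as component means. The dictionary layout and function names are hypothetical, and uncertainties are ignored.

```python
# Values quoted in the abstract; treating them as mixture weights and
# component means is an assumption made for this illustration.
components = {
    "peak poor": {"weight": 0.80, "mean_peaks": 2.1},
    "peak rich": {"weight": 0.20, "mean_peaks": 8.3},
}


def mixture_mean(comps):
    """Weighted mean number of peaks per GRB over all components."""
    return sum(c["weight"] * c["mean_peaks"] for c in comps.values())


def share_of_peaks(comps, name):
    """Fraction of all peaks contributed by the named component."""
    c = comps[name]
    return c["weight"] * c["mean_peaks"] / mixture_mean(comps)


if __name__ == "__main__":
    print(f"overall mean peaks per GRB: {mixture_mean(components):.2f}")
    for name in components:
        print(f"{name}: {share_of_peaks(components, name):.1%} of all peaks")
```

Under these assumptions the overall mean is about 3.3 peaks per GRB, and the peak-rich class would contribute roughly half of all peaks while making up about 20% of bursts.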

arXiv:2402.15275 [pdf, ps, other]

doi 10.1007/s10686-024-09924-0

Simulation Studies for the First Pathfinder of the CATCH Space Mission

Authors: Yiming Huang, Juan Zhang, Lian Tao, Zhengwei Li, Donghua Zhao, Qian-Qing Yin, Xiangyang Wen, **gyu Xiao, Chen Zhang, Shuang-Nan Zhang, Shaolin Xiong, Qingcui Bu, Jirong Cang, Dezhi Cao, Wen Chen, Siran Ding, Min Gao, Yang Gao, Shu** Hou, Li** Jia, Ge **, Dalin Li, **song Li, Pan** Li, Yajun Li , et al. (20 additional authors not shown)

Abstract: The Chasing All Transients Constellation Hunters (CATCH) space mission is an intelligent constellation consisting of 126 micro-satellites in three types (A, B, and C), designed for X-ray observation with the objective of studying the dynamic universe. Currently, we are actively developing the first Pathfinder (CATCH-1) for the CATCH mission, specifically for type-A satellites. CATCH-1 is equipped with Micro Pore Optics (MPO) and a 4-pixel Silicon Drift Detector (SDD) array. To assess its scientific performance, including the effective area of the optical system, on-orbit background, and telescope sensitivity, we employ the Monte Carlo software Geant4 for simulation in this study. The MPO optics exhibit an effective area of $41$ cm$^2$ at the focal spot for 1 keV X-rays, while the entire telescope system achieves an effective area of $29$ cm$^2$ at 1 keV when taking into account the SDD detector's detection efficiency. The primary contribution to the background is found to be from the Cosmic X-ray Background. Assuming a 625 km orbit with an inclination of $29^\circ$, the total background for CATCH-1 is estimated to be $8.13\times10^{-2}$ counts s$^{-1}$ in the energy range of 0.5--4 keV. Based on the background within the central detector and assuming a Crab-like source spectrum, the estimated ideal sensitivity could achieve $1.9\times10^{-12}$ erg cm$^{-2}$ s$^{-1}$ for an exposure of 10$^4$ s in the energy band of 0.5--4 keV.
Furthermore, after simulating the background caused by low-energy charged particles near the geomagnetic equator, we have determined that there is no need to install a magnetic deflector.

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.13580 [pdf, ps, other]

Mechanism Design with Sequential-Move Games: Revelation Principle

Authors: Siyang Xiong

Abstract: Traditionally, mechanism design focuses on simultaneous-move games (e.g., Myerson (1981)). In this paper, we study mechanism design with sequential-move games, and provide two results on revelation principles for general solution concepts (e.g., perfect Bayesian equilibrium, obvious dominance, strong-obvious dominance). First, if a solution concept is additive, implementation in sequential-move games is equivalent to implementation in simultaneous-move games. Second, for any solution concept ρ and any social choice function f, we identify a canonical operator γ^{(ρ,f)}, which is defined on primitives. We prove that, if ρ is monotonic, f can be implemented by a sequential-move game if and only if γ^{(ρ,f)} is achievable, which translates a complicated mechanism design problem into checking some conditions defined on primitives. Most of the existing solution concepts are either additive or monotonic.

Submitted 8 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

arXiv:2402.12309 [pdf, other]

TILP: Differentiable Learning of Temporal Logical Rules on Knowledge Graphs

Authors: Siheng Xiong, Yuan Yang, Faramarz Fekri, James Clayton Kerce

Abstract: Compared with static knowledge graphs, temporal knowledge graphs (tKG), which can capture the evolution and change of information over time, are more realistic and general. However, due to the complexity that the notion of time introduces to the learning of the rules, accurate graph reasoning, e.g., predicting new links between entities, is still a difficult problem. In this paper, we propose TILP, a differentiable framework for learning temporal logical rules. By designing a constrained random walk mechanism and introducing temporal operators, we ensure the efficiency of our model. We present temporal feature modeling in tKG, e.g., recurrence, temporal order, interval between pairs of relations, and duration, and incorporate it into our learning process. We compare TILP with state-of-the-art methods on two benchmark datasets. We show that our proposed framework can improve upon the performance of baseline methods while providing interpretable results. In particular, we consider various scenarios in which training samples are limited, data is biased, and the time ranges of training and inference differ. In all these cases, TILP works much better than the state-of-the-art methods.

Submitted 19 February, 2024; originally announced February 2024.

Comments: ICLR 2023 poster

arXiv:2401.17556 [pdf, ps, other]

Model-Theoretic Logic for Mathematical Theory of Semantic Information and Communication

Authors: Ahmet Faruk Saz, Siheng Xiong, Yashas Malur Saidutta, Faramarz Fekri

Abstract: In this paper, we propose an advancement to Tarskian model-theoretic semantics, leading to a unified quantitative theory of semantic information and communication. We start with a description of inductive logic and probabilities, which serve as notable tools in the development of the proposed theory. Then, we identify two disparate kinds of uncertainty in semantic communication, physical and content, present refined interpretations of semantic information measures, and conclude by proposing a new measure for semantic content-information and entropy. Our proposition standardizes semantic information across different universes and systems, hence bringing measurability and comparability into semantic communication. We then proceed to introduce conditional and mutual semantic cont-information measures and point out their utility in formulating practical and optimizable lossless and lossy semantic compression objectives. Finally, we experimentally demonstrate the value of our theoretical propositions.

Submitted 30 January, 2024; originally announced January 2024.

arXiv:2401.17503 [pdf, other]

Differentiated Service Entanglement Routing for Quantum Networks

Authors: Hui Han, Bo Liu, Bangying Tang, Siyu Xiong, **quan Huang, Wanrong Yu, Shuhui Chen

Abstract: Entanglement distribution networks with various topologies are mainly implemented by active wavelength multiplexing routing strategies. However, designing an entanglement routing scheme that simultaneously maximizes network connections and achieves optimal overall network efficiency remains a huge challenge for quantum networks. In this article, we propose a differentiated service entanglement routing (DSER) scheme, which first finds the lowest-loss paths and supported wavelength channels with a tensor-based path searching algorithm, and then allocates the paired channels with differentiated routing strategies. The evaluation results show that the proposed DSER scheme can be used to construct various large-scale quantum networks.

Submitted 30 January, 2024; originally announced January 2024.

Comments: 25 pages, 14 figures

arXiv:2401.16249 [pdf, other]

doi 10.1063/5.0200833

Molecular dynamics simulations of heat transport using machine-learned potentials: A mini review and tutorial on GPUMD with neuroevolution potentials

Authors: Haikuan Dong, Yongbo Shi, Penghua Ying, Ke Xu, Ting Liang, Yanzhou Wang, Zezhu Zeng, Xin Wu, Wenjiang Zhou, Shiyun Xiong, Shunda Chen, Zheyong Fan

Abstract: Molecular dynamics (MD) simulations play an important role in understanding and engineering heat transport properties of complex materials. An essential requirement for reliably predicting heat transport properties is the use of accurate and efficient interatomic potentials. Recently, machine-learned potentials (MLPs) have shown great promise in providing the required accuracy for a broad range of materials. In this mini review and tutorial, we delve into the fundamentals of heat transport, explore pertinent MD simulation methods, and survey the applications of MLPs in MD simulations of heat transport. Furthermore, we provide a step-by-step tutorial on developing MLPs for highly efficient and predictive heat transport simulations, utilizing the neuroevolution potentials (NEPs) as implemented in the GPUMD package. Our aim with this mini review and tutorial is to empower researchers with valuable insights into cutting-edge methodologies that can significantly enhance the accuracy and efficiency of MD simulations for heat transport studies.

Submitted 24 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

Comments: 25 pages, 9 figures. This paper is part of the special topic, Machine Learning for Thermal Transport

Journal ref: J. Appl. Phys. 135, 161101 (2024)

arXiv:2401.11675 [pdf, other]

Rethinking Cross-Attention for Infrared and Visible Image Fusion

Authors: Lihua Jian, Songlei Xiong, Han Yan, Xiaoguang Niu, Shaowu Wu, Di Zhang

Abstract: The salient information of an infrared image and the abundant texture of a visible image can be fused to obtain a comprehensive image. Current fusion methods based on Transformer techniques for infrared and visible (IV) images have exhibited promising performance. However, the attention mechanism of previous Transformer-based methods was prone to extracting common information from source images without considering the discrepancy information, which limited fusion performance. In this paper, by reevaluating the cross-attention mechanism, we propose an alternate Transformer fusion network (ATFuse) to fuse IV images. Our ATFuse consists of one discrepancy information injection module (DIIM) and two alternate common information injection modules (ACIIM). The DIIM is designed by modifying the vanilla cross-attention mechanism, which can promote the extraction of the discrepancy information of the source images. Meanwhile, the ACIIM is devised by alternately using the vanilla cross-attention mechanism, which can fully mine common information and integrate long dependencies. Moreover, the successful training of ATFuse is facilitated by a proposed segmented pixel loss function, which provides a good trade-off between texture detail and salient structure preservation. The qualitative and quantitative results on public datasets indicate that our ATFuse is effective and superior to other state-of-the-art methods.

Submitted 21 January, 2024; originally announced January 2024.

arXiv:2401.11427 [pdf, other]

Correcting force error-induced underestimation of lattice thermal conductivity in machine learning molecular dynamics

Authors: Xiguang Wu, Wenjiang Zhou, Haikuang Dong, Penghua Ying, Yanzhou Wang, Bai Song, Zheyong Fan, Shiyun Xiong

Abstract: Machine learned potentials (MLPs) have been widely employed in molecular dynamics (MD) simulations to study thermal transport. However, literature results indicate that MLPs generally underestimate the lattice thermal conductivity (LTC) of typical solids. Here, we quantitatively analyze this underestimation in the context of the neuroevolution potential (NEP), which is a representative MLP that balances efficiency and accuracy. Taking crystalline silicon, GaAs, graphene, and PbTe as examples, we reveal that the fitting errors in the machine-learned forces against the reference ones are responsible for the underestimated LTC as they constitute external perturbations to the interatomic forces. Since the force errors of a NEP model and the random forces in the Langevin thermostat both follow a Gaussian distribution, we propose an approach to correcting the LTC by intentionally introducing different levels of force noises via the Langevin thermostat and then extrapolating to the limit of zero force error. Excellent agreement with experiments is obtained by using this correction for all the prototypical materials over a wide range of temperatures. Based on spectral analyses, we find that the LTC underestimation mainly arises from increased phonon scatterings in the low-frequency region caused by the random force errors.

Submitted 26 May, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

arXiv:2401.07513 [pdf, other]

Detector performance of the Gamma-ray Transient Monitor onboard DRO-A Satellite

Authors: Pei-Yi Feng, Zheng-Hua An, Da-Li Zhang, Chen-Wei Wang, Chao Zheng, Sheng Yang, Shao-Lin Xiong, Jia-Cong Liu, Xin-Qiao Li, Ke Gong, Xiao-**g Liu, Min Gao, Xiang-Yang Wen, Ya-Qing Liu, Xiao-Yun Zhao, Fan Zhang, Xi-Lei Sun, Hong Lu

Abstract: Gamma-ray Transient Monitor (GTM) is an all-sky monitor onboard the Distant Retrograde Orbit-A (DRO-A) satellite with the scientific objective of detecting gamma-ray transients ranging from 20 keV to 1 MeV. GTM is equipped with 5 Gamma-ray Transient Probe (GTP) detector modules, utilizing the NaI(Tl) scintillator coupled with a SiPM array. To reduce the SiPM noise, GTP makes use of a dedicated dual-channel coincident readout design. In this work, we first studied the impact of different coincidence times on detection efficiency and ultimately selected the 500 ns time coincidence window for offline data processing. To test the performance of GTPs and validate the Monte Carlo simulated energy response, we conducted comprehensive ground calibration tests using the Hard X-ray Calibration Facility (HXCF) and radioactive sources, including energy response, detection efficiency, spatial response, bias-voltage response, and temperature dependence. We present the ground calibration results in detail, and validate the design and mass model of the GTP detector. This work paves the way for in-flight observation and science data analysis.

Submitted 15 January, 2024; originally announced January 2024.

Comments: 13 pages, 25 figures

arXiv:2401.06853 [pdf, other]

Large Language Models Can Learn Temporal Reasoning

Authors: Siheng Xiong, Ali Payani, Ramana Kompella, Faramarz Fekri

Abstract: While large language models (LLMs) have demonstrated remarkable reasoning capabilities, they are not without their flaws and inaccuracies. Recent studies have introduced various methods to mitigate these limitations. Temporal reasoning (TR), in particular, presents a significant challenge for LLMs due to its reliance on diverse temporal concepts and intricate temporal logic. In this paper, we propose TG-LLM, a novel framework towards language-based TR. Instead of reasoning over the original context, we adopt a latent representation, the temporal graph (TG), that enhances the learning of TR. A synthetic dataset (TGQA), which is fully controllable and requires minimal supervision, is constructed for fine-tuning LLMs on this text-to-TG translation task. We confirmed in experiments that the capability of TG translation learned on our dataset can be transferred to other TR tasks and benchmarks. On top of that, we teach the LLM to perform deliberate reasoning over the TGs via Chain-of-Thought (CoT) bootstrapping and graph data augmentation. We observed that those strategies, which maintain a balance between usefulness and diversity, bring more reliable CoTs and final results than the vanilla CoT distillation.

Submitted 10 June, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

Comments: ACL24 (main)

arXiv:2401.05955 [pdf, other]

The Self-organized Criticality Behaviors of Two New Parameters in SGR J1935+2154

Authors: Shuo Xiao, Shuang-Nan Zhang, Shao-Lin Xiong, ** Wang, Xiu-Juan Li, Ai-Jun Dong, Qi-Jun Zhi, Di Li

Abstract: The minimum variation timescale (MVT) and spectral lag of hundreds of X-ray bursts (XRBs) from soft gamma-ray repeater (SGR) J1935+2154, which are important probes for studying the physical mechanism and radiation region, were analyzed in detail for the first time in our recent work. In this work, we investigate their differential and cumulative distributions carefully and find that they follow power-law models. In addition, the distributions of fluctuations in both parameters follow the Tsallis $q$-Gaussian distributions, and the $q$ values are consistent for different scale intervals. These results indicate that both parameters are scale-invariant, which provides new parameters for the study of self-organized criticality systems. Interestingly, we find that the $q$ values for MVT and spectral lag are similar to those for duration and fluence, respectively.

Submitted 11 January, 2024; originally announced January 2024.

Comments: accepted by MNRAS

arXiv:2401.03804 [pdf, other]

TeleChat Technical Report

Authors: Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei he, Zhuoru Jiang , et al. (11 additional authors not shown)

Abstract: In this technical report, we present TeleChat, a collection of large language models (LLMs) with 3 billion, 7 billion and 12 billion parameters. It includes pretrained language models as well as fine-tuned chat models that are aligned with human preferences. TeleChat is initially pretrained on an extensive corpus containing a diverse collection of texts in both English and Chinese, including trillions of tokens. Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe. We evaluate the performance of TeleChat on various tasks, including language understanding, mathematics, reasoning, code generation, and knowledge-based question answering. Our findings indicate that TeleChat achieves comparable performance to other open-source models of similar size across a wide range of public benchmarks. To support future research and applications utilizing LLMs, we release the fine-tuned model checkpoints of TeleChat's 7B and 12B variants, along with code and a portion of our pretraining data, to the public community.

Submitted 1 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: 28 pages, 2 figures

ACM Class: I.2.7

arXiv:2401.00226 [pdf, other]

The Intrinsic Energy Resolution of LaBr$_3$(Ce) Crystal for GECAM

Authors: Pei-Yi Feng, Xi-Lei Sun, Cheng-Er Wang, Yong Deng, Zheng-Hua An, Da-Li Zhang, Chao Zheng, Xin-Qiao Li, Shao-Lin Xiong, Hong Lu

Abstract: The intrinsic resolution is the primary limitation on the total energy resolution of LaBr$_3$(Ce) crystal. This intrinsic resolution arises from two effects: fluctuations occurring in the process of energy transfer to luminescent centers within the LaBr$_3$(Ce) crystal and the LaBr$_3$(Ce) crystal's non-proportional luminescence. Presently, experimental measurements regarding the intrinsic resolution of LaBr$_3$(Ce) crystal are scarce, and the underlying physical mechanisms remain incompletely understood. In this paper, we aim to elucidate the concept of intrinsic resolution. We investigated the entire physical process of luminescence following energy deposition in the LaBr$_3$(Ce) crystal, quantifying the various components in the total energy resolution. We conducted a series of experimental measurements and Geant4 simulations, determining the intrinsic resolution of LaBr$_3$(Ce) crystal to 100 keV electrons as 2.12%. The non-proportionality contributes significantly at 1.43%, while fluctuations in the energy transfer process accounted for 0.27%. It is evident that non-proportionality in light output constitutes the primary source of intrinsic resolution. Horizontal and vertical unevenness in light collection contributed 0.25% and 0.07%, respectively. Statistical fluctuations showed the largest impact on the total energy resolution, at 2.86%. The contribution from fluctuations in single-photoelectron events was 0.77%.
Furthermore, we reconstructed the photon response using Geant4, and the consistency between the simulated relative light yield and the experimentally measured one confirmed the reliability of the LaBr$_3$(Ce) detector mass model employed in the simulation.

Submitted 30 December, 2023; originally announced January 2024.

Comments: 11 pages, 16 figures

arXiv:2312.16658 [pdf, other]

The Energy Response of LaBr3(Ce), LaBr3(Ce,Sr) and NaI(Tl) Crystals for GECAM

Authors: Pei-Yi Feng, Xi-Lei Sun, Zheng-Hua An, Yong Deng, Cheng-Er Wang, Huang Jiang, Jun-Jie Li, Da-Li Zhang, Xin-Qiao Li, Shao-Lin Xiong, Chao Zheng, Ke Gong, Sheng Yang, Xiao-**g Liu, Min Gao, Xiang-Yang Wen, Ya-Qing Liu, Yan-Bing Xu, Xiao-Yun Zhao, Jia-Cong Liu, Fan Zhang, Hong Lu

Abstract: The GECAM series of satellites utilize LaBr3(Ce), LaBr3(Ce,Sr), and NaI(Tl) crystals as sensitive materials for gamma-ray detectors (GRDs). To investigate the non-linearity in the detection of low-energy gamma rays and address errors in the E-C relationship calibration, comprehensive tests and comparative studies of the non-linearity of these three crystals were conducted using Compton electrons, radioactive sources, and mono-energetic X-rays. The non-linearity test results for Compton electrons and X-rays displayed substantial differences, with all three crystals showing higher non-linearity for X-rays and gamma-rays than for Compton electrons. Despite LaBr3(Ce) and LaBr3(Ce,Sr) crystals having higher absolute light yields, they exhibited a noticeable non-linear decrease in light yield, especially at energies below 400 keV. The NaI(Tl) crystal demonstrated excess light output in the 6 to 200 keV range, reaching a maximum excess of 9.2% at 30 keV in X-ray testing and up to 15.5% at 14 keV during Compton electron testing, indicating a significant advantage in the detection of low-energy gamma rays. Furthermore, this paper explores the underlying causes of the observed non-linearity in these crystals. This study not only elucidates the detector responses of GECAM, but also marks the first comprehensive investigation into the non-linearity of domestically produced lanthanum bromide and sodium iodide crystals.

Submitted 27 December, 2023; originally announced December 2023.

Comments: 12 pages, 16 figures

arXiv:2312.15989 [pdf, other]

doi 10.1093/mnras/stad1778

Parameter Estimation of LAMOST Medium-resolution Stellar Spectra

Authors: Xiangru Li, Xiaoyu Zhang, Shengchun Xiong, Yulong Zheng, Hui Li

Abstract: This paper investigates the problem of estimating three stellar atmospheric physical parameters and thirteen elemental abundances for medium-resolution spectra from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST). Typical characteristics of these spectra are their huge scale, wide range of spectral signal-to-noise ratios, and uneven distribution in parameter space. These characteristics lead to unsatisfactory results on the spectra with low temperature, high temperature or low metallicity. To this end, this paper proposes a Stellar Parameter Estimation method based on Multiple Regions (SPEMR) that effectively improves parameter estimation accuracy. On the spectra with S/N $\geq 10$, the precisions are 47 K, 0.08 dex, 0.03 dex respectively for the estimations of ($T_{\rm eff}$, $\log \,g$ and $\rm [Fe/H]$), 0.03 dex to 0.06 dex for elements C, Mg, Al, Si, Ca, Mn and Ni, 0.07 dex to 0.13 dex for N, O, S, K and Ti, while that of Cr is 0.16 dex. For the reference of astronomical science researchers and algorithm researchers, we released a catalog for 4.19 million medium-resolution spectra from the LAMOST DR8, experimental code, trained model, training data, and test data.

Submitted 26 December, 2023; originally announced December 2023.

Comments: 19 pages, 19 figures, 2 tables

Journal ref: Monthly Notices of the Royal Astronomical Society, 523(4): 5230-5247, 2023

arXiv:2312.15816 [pdf, other]

TEILP: Time Prediction over Knowledge Graphs via Logical Reasoning

Authors: Siheng Xiong, Yuan Yang, Ali Payani, James C Kerce, Faramarz Fekri

Abstract: Conventional embedding-based models approach event time prediction in temporal knowledge graphs (TKGs) as a ranking problem. However, they often fall short in capturing essential temporal relationships such as order and distance. In this paper, we propose TEILP, a logical reasoning framework that naturally integrates such temporal elements into knowledge graph predictions. We first convert TKGs into a temporal event knowledge graph (TEKG) which has a more explicit representation of time in terms of nodes of the graph. The TEKG equips us to develop a differentiable random walk approach to time prediction. Finally, we introduce conditional probability density functions, associated with the logical rules involving the query interval, using which we arrive at the time prediction. We compare TEILP with state-of-the-art methods on five benchmark datasets. We show that our model achieves a significant improvement over baselines while providing interpretable explanations. In particular, we consider several scenarios where training samples are limited, event types are imbalanced, and forecasting the time of future events based on only past events is desired. In all these cases, TEILP outperforms state-of-the-art methods in terms of robustness.

Submitted 28 January, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

Comments: AAAI24 (Oral)

arXiv:2312.12978 [pdf, other]

Memory in the Burst Occurrence of Repeating FRBs

Authors: ** Wang, Li-Ming Song, Shao-Lin Xiong, Xiao-Yun Zhao, ** Wang, Shu-Min Zhao, Shuo Xiao, Ce Cai, Sheng-Lun Xie, Wang-Chen Xue, Chen-Wei Wang, Yue Wang, Wen-Long Zhang

Abstract: Understanding the nature of repeating FRBs is crucial to probe the physics of FRBs. In this work, we analyze the statistics of waiting time between bursts of three repeating FRBs from four data sets. We find a universally pronounced dependency of the waiting times on the previous time interval (denoted as $λ_0$). We observe a temporal clustering where short waiting times tend to be followed by short ones, and long by long. This memory dependency is manifested in the conditional mean waiting time as well as in the conditional mean residual time to the next burst, both of which increase in direct proportion to $λ_0$. Consequently, the likelihood of experiencing a subsequent FRB burst within a given time window after the preceding burst is significantly influenced by the burst history. We reveal that, for the first time, these memory effects are present in the scale-invariant preconditioned waiting time distribution. We show that the memory effect provides a unified description of waiting times which may account for both the repeating FRBs and the apparent non-repeating FRBs (i.e. only observed one time). These results shed new light on the mechanism of FRBs.

Submitted 20 December, 2023; originally announced December 2023.
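
The memory effect described in this abstract (short waiting times tending to follow short ones, long following long) can be illustrated with a toy calculation. The sketch below is not the authors' method: the function name `conditional_mean_waiting_time`, the single threshold split, and the arrival times are all hypothetical, chosen only to show how a conditional mean waiting time depends on the previous interval.

```python
from statistics import mean

def conditional_mean_waiting_time(burst_times, threshold):
    """Mean of the next waiting time, split by the previous waiting time.

    Returns (mean after a previous wait <= threshold,
             mean after a previous wait > threshold).
    Raises statistics.StatisticsError if either group is empty.
    """
    # Waiting times between consecutive bursts
    waits = [t1 - t0 for t0, t1 in zip(burst_times, burst_times[1:])]
    # Pair each waiting time with the one that follows it
    pairs = list(zip(waits, waits[1:]))
    after_short = [nxt for prev, nxt in pairs if prev <= threshold]
    after_long = [nxt for prev, nxt in pairs if prev > threshold]
    return mean(after_short), mean(after_long)

# Hypothetical arrival times in seconds: a tight cluster, then sparse bursts
toy_times = [0, 1, 2, 3, 4, 14, 24, 34, 44]
after_short, after_long = conditional_mean_waiting_time(toy_times, threshold=5)
```

With clustered data such as this, the mean waiting time after a short interval comes out smaller than after a long one, which is the qualitative signature of memory; a memoryless (Poisson) process would give similar values for both.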

arXiv:2312.12037 [pdf, other]

Founder-GPT: Self-play to evaluate the Founder-Idea fit

Authors: Sichao Xiong, Yigit Ihlamur

Abstract: This research introduces an innovative evaluation method for the "founder-idea" fit in early-stage startups, utilizing advanced large language model techniques to assess founders' profiles against their startup ideas to enhance decision-making. Embeddings, self-play, tree-of-thought, and critique-based refinement techniques show early promising results that each idea's success patterns are unique and they should be evaluated based on the context of the founder's background.

Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

arXiv:2310.11821 [pdf, other]

Evidence of Hadronic Emission from the brightest-of-all-time GRB 221009A

Authors: Kai Wang, Qing-Wen Tang, Yan-Qiu Zhang, Chao Zheng, Shao-Lin Xiong, Jia Ren, Bing Zhang

Abstract: Acceleration of hadrons in relativistic shocks has long been expected and invoked to model GRB high-energy photon and neutrino emissions. However, so far there has been no direct observational evidence of hadronic emission from GRBs. The B.O.A.T. ("brightest of all time") gamma-ray burst (GRB) 221009A had extreme energies (with an isotropic energy exceeding $10^{55}$ erg) and was detected in broad-band including the very-high-energy (VHE, $>100\,\rm GeV$) band up to $>10$ TeV. Here we perform a comprehensive spectral analysis of the GRB from keV to TeV energy range and perform detailed spectral and light curve modelings considering both the traditional synchrotron self-Compton process and the electromagnetic (EM) cascade process initiated by hadronic interactions by accelerated cosmic rays in the external shock. We find that the leptonic scenario alone is not adequate to account for the observations, whereas the proposed scenario with the combination of hadronic and leptonic components can well reproduce the multi-wavelength spectra and the light curve. This result reveals the existence of the accelerated hadronic component in the early afterglow of this extreme burst. According to this scenario, the observed TeV light curve should contain imprints of the prompt MeV emission.

Submitted 18 October, 2023; originally announced October 2023.

Comments: 15 pages, 4 figures, 3 tables. originally submitted version for Nature Astronomy

arXiv:2310.10522 [pdf, other]

Observation of GRB 221009A early afterglow in X/$γ$-ray energy band

Authors: Chao Zheng, Yan-Qiu Zhang, Shao-Lin Xiong, Cheng-Kui Li, He Gao, Wang-Chen Xue, Jia-Cong Liu, Chen-Wei Wang, Wen-Jun Tan, Wen-Xi Peng, Zheng-Hua An, Ce Cai, Ming-Yu Ge, Dong-Ya Guo, Yue Huang, Bing Li, Ti-Pei Li, Xiao-Bo Li, Xin-Qiao Li, Xu-Fang Li, **-Yuan Liao, Cong-Zhan Liu, Fang-Jun Lu, Xiang Ma, Rui Qiao , et al. (23 additional authors not shown)

Abstract: The early afterglow of a Gamma-ray burst (GRB) can provide critical information on the jet and progenitor of the GRB. The extreme brightness of GRB 221009A allows us to probe its early afterglow in unprecedented detail. In this letter, we report comprehensive observation results of the early afterglow of GRB 221009A (from $T_0$+660 s to $T_0$+1860 s, where $T_0$ is the \textit{Insight}-HXMT/HE trigger time) in X/$γ$-ray energy band (from 20 keV to 20 MeV) by \textit{Insight}-HXMT/HE, GECAM-C and \textit{Fermi}/GBM. We find that the spectrum of the early afterglow in 20 keV-20 MeV could be well described by a cutoff power-law with an extra power-law which dominates the low and high energy bands respectively. The cutoff power-law $E_{\rm peak}$ is $\sim$ 30 keV and the power-law photon index is $\sim$ 1.8 throughout the early afterglow phase. By fitting the light curves in different energy bands, we find that a significant achromatic break (from keV to TeV) is required at $T_0$ + 1246$^{+27}_{-26}$ s (i.e. 1021 s since the afterglow starting time $T_{\rm AG}$=$T_0$+225 s), providing compelling evidence of a jet break. Interestingly, both the pre-break and post-break decay slopes vary with energy, and these two slopes become closer in the lower energy band, making the break less identifiable. Intriguingly, the spectrum of the early afterglow experienced a slight hardening before the break and a softening after the break. These results provide new insights into the understanding of this remarkable GRB.

Submitted 19 January, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: Accepted for publication in ApJ Letters on 19-Jan-2024, 11 pages, 7 figures and 2 tables

arXiv:2310.07205 [pdf, other]

Evidence of mini-jet emission in a large emission zone from a magnetically-dominated gamma-ray burst jet

Authors: S. -X. Yi, C. -W. Wang, X. -Y. Shao, R. Moradi, H. Gao, B. Zhang, S. -L. Xiong, S. -N. Zhang, W. -J. Tan, J. -C. Liu, W. -C. Xue, Y. -Q. Zhang, C. Zheng, Y. Wang, P. Zhang, Z. -H. An, C. Cai, P. -Y. Feng, K. Gong, D. -Y. Guo, Y. Huang, B. Li, X. -B. Li, X. -Q. Li, X. -J. Liu , et al. (21 additional authors not shown)

Abstract: The second brightest GRB in history, GRB 230307A, provides an ideal laboratory to study the details of GRB prompt emission thanks to its extraordinarily high photon statistics and its single broad pulse overall shape characterized by an energy-dependent fast-rise-exponential-decay (FRED) profile. Here we demonstrate that its broad pulse is composed of many rapidly variable short pulses, rather than being the superposition of many short pulses on top of a slow component. Such a feature is consistent with the picture of many mini-jets due to local magnetic reconnection events in a large emission zone far from the GRB central engine, as envisaged in the internal-collision-induced magnetic reconnection and turbulence (ICMART) model, but raises a great challenge to the internal shock models that attribute all variability components to collisions among different shells. Since relativistic mini-jets demand strong magnetization in the outflow, this work provides strong evidence for a Poynting-flux-dominated jet composition of this bright GRB.

Submitted 16 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

Comments: 7 pages and 2 figures in the main text. 27 pages and 9 figures in total

arXiv:2310.04992 [pdf, other]

VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

Authors: Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv , et al. (17 additional authors not shown)

Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassification of disease phenotype, and systemic biomarker and disease prediction, with each application enhanced with expert-level intelligence and accuracy. The generalist intelligence of VisionFM outperformed ophthalmologists with basic and intermediate levels in jointly diagnosing 12 common ophthalmic diseases. Evaluated on a new large-scale ophthalmic disease diagnosis benchmark database, as well as a new large-scale segmentation and detection benchmark database, VisionFM outperformed strong baseline deep neural networks. The ophthalmic image representations learned by VisionFM exhibited noteworthy explainability, and demonstrated strong generalizability to new ophthalmic modalities, disease spectrum, and imaging devices. As a foundation model, VisionFM has a large capacity to learn from diverse ophthalmic imaging data and disparate datasets. To be commensurate with this capacity, in addition to the real data used for pre-training, we also generated and leveraged synthetic ophthalmic imaging data. Experimental results revealed that synthetic data that passed visual Turing tests can also enhance the representation learning capability of VisionFM, leading to substantial performance gains on downstream ophthalmic AI tasks. Beyond the ophthalmic AI applications developed, validated, and demonstrated in this work, substantial further applications can be achieved in an efficient and cost-effective manner using VisionFM as the foundation.

Submitted 7 October, 2023; originally announced October 2023.

arXiv:2309.11595 [pdf, ps, other]

Common Agency with Non-Delegation or Imperfect Commitment

Authors: Seung** Han, Siyang Xiong

Abstract: In classical contract theory, we usually impose two assumptions: delegated contracts and perfect commitment. While the second assumption is demanding, the first one suffers no loss of generality. Following this tradition, current common-agency models impose delegated contracts and perfect commitment. We first show that non-delegated contracts expand the set of equilibrium outcomes under common agency. Furthermore, the powerful menu theorem for common agency (Peters (2001) and Martimort and Stole (2002)) fails for either non-delegated contracts or imperfect commitment. We identify canonical contracts in such environments, and re-establish generalized menu theorems. Given imperfect commitment, our results for common-agency models are analogous to those in Bester and Strausz (2001) and Doval and Skreta (2012) for the classical contract theory, which re-establish the revelation principle.

Submitted 20 September, 2023; originally announced September 2023.

arXiv:2308.11362 [pdf, other]

Calibration of the Timing Performance of GECAM-C

Authors: Shuo Xiao, Ya-Qing Liu, Ke Gong, Zheng-Hua An, Shao-Lin Xiong, Xin-Qiao Li, Xiang-Yang Wen, Wen-Xi Peng, Da-Li Zhang, You-Li Tuo, Shi-Jie Zheng, Li-Ming Song, ** Wang, Xiao-Yun Zhao, Yue Huang, Xiang Ma, Xiao-**g Liu, Rui Qiao, Yan-Bing Xu, Sheng Yang, Fan Zhang, Yue Wang, Yan-Qiu Zhang, Wang-Chen Xue, Jia-Cong Liu , et al. (13 additional authors not shown)

Abstract: As a new member of the Gravitational wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) after GECAM-A and GECAM-B, GECAM-C (originally called HEBS), which was launched on board the SATech-01 satellite on July 27, 2022, aims to monitor and localize X-ray and gamma-ray transients from $\sim$ 6 keV to 6 MeV. GECAM-C utilizes a similar design to GECAM but operates in a more complex orbital environment. In this work, we utilize the secondary particles simultaneously produced by the cosmic-ray events on orbit and recorded by multiple detectors, to calibrate the relative timing accuracy between all detectors of GECAM-C. We find the result is 0.1 $μ\rm s$, which is the highest time resolution among all GRB detectors ever flown and very helpful in timing analyses such as minimum variable timescale and spectral lags, as well as in time delay localization. Besides, we calibrate the absolute time accuracy using the one-year Crab pulsar data observed by GECAM-C and Fermi/GBM, as well as GECAM-C and GECAM-B. The results are $2.02\pm 2.26\ μ\rm s$ and $5.82\pm 3.59\ μ\rm s$, respectively. Finally, we investigate the spectral lag between the different energy bands of Crab pulsar observed by GECAM and GBM, which is $\sim -0.2\ {\rm μs\ keV^{-1}}$.

Submitted 22 August, 2023; originally announced August 2023.

Comments: submitted

arXiv:2307.14884 [pdf, other]

Individual and Averaged Power Density Spectra of X-ray bursts from SGR J1935+2154: Quasiperiodic Oscillation Search and Slopes

Authors: Shuo Xiao, Xiao-Bo Li, Wang-Chen Xue, Shao-Lin Xiong, Shuang-Nan Zhang, Wen-Xi Peng, Ai-Jun Dong, You-Li Tuo, Ce Cai, Xi-Hong Luo, Jiao-Jiao Yang, Yue Wang, Chao Zheng, Yan-Qiu Zhang, Jia-Cong Liu, Wen-Jun Tan, Chen-Wei Wang, ** Wang, Cheng-Kui Li, Shu-Xu Yi, Shi-Jun Dang, Lun-Hua Shang, Ru-Shuang Zhao, Qing-Bo Ma, Wei Xie , et al. (7 additional authors not shown)

Abstract: The study of quasi-periodic oscillations (QPOs) and power density spectra (PDS) continuum properties can help shed light on the still elusive emission physics of magnetars and provide a window into the interiors of neutron stars using asteroseismology. In this work, we employ a Bayesian method to search for the QPOs in the hundreds of X-ray bursts from SGR J1935+2154 observed by {\it Insight}-HXMT, GECAM and Fermi/GBM from July 2014 to January 2022. Although no definitive QPO signal (significance $>3σ$) is detected in individual bursts or the averaged periodogram of the bursts grouped by duration, we identify several bursts exhibiting possible QPO at $\sim$ 40 Hz, which is consistent with that reported in the X-ray burst associated with FRB 200428. We investigate the PDS continuum properties and find that the distribution of the PDS slope in the simple power-law model peaks at $\sim$ 2.5, which is consistent with other magnetars but higher than 5/3 commonly seen in gamma-ray bursts. Besides, the distribution of the break frequency in the broken power-law model peaks at $\sim$ 60 Hz. Finally, we report that the power-law index of PDS has an anti-correlation and power-law dependence on the burst duration as well as the minimum variation timescale.

Submitted 27 July, 2023; originally announced July 2023.

Comments: comments welcome

arXiv:2307.07079 [pdf, other]

The Minimum Variation Timescales of X-ray bursts from SGR J1935+2154

Authors: Shuo Xiao, Jiao-Jiao Yang, Xi-Hong Luo, Shao-Lin Xiong, Yuan-Hong Qu, Shuang-Nan Zhang, Wang-Chen Xue, Xiao-Bo Li, You-Li Tuo, Ai-Jun Dong, Ru-Shuang Zhao, Shi-Jun Dang, Lun-Hua Shang, Qing-Bo Ma, Ce Cai, ** Wang, ** Wang, Cheng-Kui Li, Shu-Xu Yi, Zhen Zhang, Ming-Yu Ge, Shi-Jie Zheng, Li-Ming Song, Wen-Xi Peng, Xiang-Yang Wen , et al. (12 additional authors not shown)

Abstract: The minimum variation timescale (MVT) of soft gamma-ray repeaters can be an important probe to estimate the emission region in pulsar-like models, as well as the Lorentz factor and radius of the possible relativistic jet in gamma-ray burst (GRB)-like models, thus revealing their progenitors and physical mechanisms. In this work, we systematically study the MVTs of hundreds of X-ray bursts (XRBs) from SGR J1935+2154 observed by {\it Insight}-HXMT, GECAM and Fermi/GBM from July 2014 to Jan 2022 through the Bayesian Block algorithm. We find that the MVTs peak at $\sim$ 2 ms, corresponding to a light travel time size of about 600 km, which supports the magnetospheric origin in pulsar-like models. The shock radius and the Lorentz factor of the jet are also constrained in GRB-like models. Interestingly, the MVT of the XRB associated with FRB 200428 is $\sim$ 70 ms, which is longer than that of most bursts and implies its special radiation mechanism. Besides, the median of MVTs is 7 ms, shorter than the median MVTs of 40 ms and 480 ms for short GRBs and long GRBs, respectively. However, the MVT is independent of duration, similar to GRBs. Finally, we investigate the energy dependence of MVT and suggest that there is marginal evidence for a power-law relationship like GRBs but the rate of variation is at least about an order of magnitude smaller. These features may provide an approach to identify bursts with a magnetar origin.

Submitted 13 July, 2023; originally announced July 2023.

Comments: accepted for publication in ApJS
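
The abstract's conversion from a 2 ms minimum variation timescale to a light travel time size of about 600 km is the causality bound R ≲ c·Δt. The short sketch below reproduces that arithmetic; the function name `light_travel_size_km` is hypothetical, and relativistic or redshift corrections are deliberately ignored.

```python
C_KM_PER_S = 299_792.458  # speed of light in km/s (exact SI value)

def light_travel_size_km(timescale_s: float) -> float:
    """Upper bound on emission-region size from a variability timescale: R <= c * dt."""
    return C_KM_PER_S * timescale_s

# Peak MVT quoted in the abstract: ~2 ms
size_km = light_travel_size_km(2e-3)
```

The result is just under 600 km, consistent with the rounded figure quoted in the abstract.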

arXiv:2307.05689 [pdf, other]

Magnetar emergence in a peculiar gamma-ray burst from a compact star merger

Authors: H. Sun, C. -W. Wang, J. Yang, B. -B. Zhang, S. -L. Xiong, Y. -H. I. Yin, Y. Liu, Y. Li, W. -C. Xue, Z. Yan, C. Zhang, W. -J. Tan, H. -W. Pan, J. -C. Liu, H. -Q. Cheng, Y. -Q. Zhang, J. -W. Hu, C. Zheng, Z. -H. An, C. Cai, L. Hu, C. **, D. -Y. Li, X. -Q. Li, H. -Y. Liu , et al. (19 additional authors not shown)

Abstract: The central engine that powers gamma-ray bursts (GRBs), the most powerful explosions in the universe, is still not identified. Besides hyper-accreting black holes, rapidly spinning and highly magnetized neutron stars, known as millisecond magnetars, have been suggested to power both long and short GRBs. The presence of a magnetar engine following compact star mergers is of particular interest as it would provide essential constraints on the poorly understood equation of state for neutron stars. Indirect indications of a magnetar engine in these merger sources have been observed in the form of plateau features present in the X-ray afterglow light curves of some short GRBs. Additionally, some X-ray transients lacking gamma-ray bursts (GRB-less) have been identified as potential magnetar candidates originating from compact star mergers. Nevertheless, smoking gun evidence is still lacking for a magnetar engine in short GRBs, and the associated theoretical challenges have been addressed. Here we present a comprehensive analysis of the broad-band prompt emission data of a peculiar, very bright GRB 230307A. Despite its apparently long duration, the prompt emission and host galaxy properties point toward a compact star merger origin, being consistent with its association with a kilonova. More intriguingly, an extended X-ray emission component emerges as the $γ$-ray emission dies out, signifying the emergence of a magnetar central engine. We also identify an achromatic temporal break in the high-energy band during the prompt emission phase, which was never observed in previous bursts and reveals a narrow jet with half opening angle of approximately $3.4^\circ$.

Submitted 11 July, 2023; originally announced July 2023.

Comments: 44 pages, 10 figures, 5 tables

arXiv:2307.04999 [pdf, other]

The GECAM Real-Time Burst Alert System

Authors: Yue Huang, Dongli Shi, Xiaolu Zhang, Xiang Ma, Peng Zhang, Shijie Zheng, Liming Song, Xiaoyun Zhao, Wei Chen, Rui Qiao, Xinying Song, ** Wang, Ce Cai, Shuo Xiao, Yanqiu Zhang, Shaolin Xiong

Abstract: Gravitational Wave High-energy Electromagnetic Counterpart All-sky Monitor (GECAM), consisting of two micro-satellites, is designed to detect gamma-ray bursts associated with gravitational-wave events. Here, we introduce the real-time burst alert system of GECAM, with the adoption of the BeiDou-3 short message communication service. We present the post-trigger operations, the detailed ground-based analysis, and the performance of the system. In the first year of the in-flight operation, GECAM was triggered by 42 GRBs. The GECAM real-time burst alert system has the ability to distribute the alert within $\sim$1 minute after being triggered, which enables timely follow-up observations.

Submitted 10 July, 2023; originally announced July 2023.

Comments: 17 pages, 10 figures; Accepted for publication in RAA

arXiv:2307.01010 [pdf, other]

The 2021 X-ray outburst of magnetar SGR J1935+2154 -- I. Spectral properties

Authors: Sheng-Lun Xie, Yi Zhao, Wang-Chen Xue, Yun-Wei Yu, Shao-Lin Xiong, Heng Yu, Ce Cai, Shuang-Nan Zhang

Abstract: Over a period of multiple active episodes between January 2021 and January 2022, the magnetar SGR J1935+2154 emitted a total of 82 bursts observed by GECAM-B. Temporal and spectral analyses reveal that the bursts have an average duration of $\sim$145 ms and a fluence ranging from $1.2 \times 10^{-8} \ \mathrm{erg \cdot cm^{-2}}$ to $3.7 \times 10^{-5} \ \mathrm{erg \cdot cm^{-2}}$ (30 - 200 keV). The spectral properties of these bursts are similar to those of earlier active episodes. Specifically, we find that the emission area of the Double Black Body (BB2) model shows a Log-Linear correlation to its temperature, and there is a weak relation between fluence and $E_{\mathrm{peak}}$ (or $α$) in the Cut-Off Power Law (CPL) model. However, we note that the temperature distributions of BB2/BB models in GECAM-B samples are different from those in GBM-GECAM samples, due to differences in the energy range used for fitting. To understand this difference, we propose a Multi-Temperature Black Body (MBB) model, assuming that the BB temperatures follow a power law distribution. Our analysis shows that the minimum temperature of the MBB model, $kT_{\mathrm{min}} \sim 5$ keV, is consistent between GECAM-B and GBM-GECAM. This indicates that both samples originated from similar magnetar bursts. We also reveal that the spectra of magnetar bursts tend to be soft. This indicates that magnetar bursts may be composed of multiple low BB temperatures and the majority of the BB temperatures are concentrated around the minimum temperature.

Submitted 14 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

arXiv:2306.10255 [pdf, other]

doi 10.1029/2022GL102325

The First GECAM Observation Results on Terrestrial Gamma-ray Flashes and Terrestrial Electron Beams

Authors: Y. Zhao, J. C. Liu, S. L. Xiong, W. C. Xue, Q. B. Yi, G. P. Lu, W. Xu, F. C. Lyu, J. C. Sun, W. X. Peng, C. Zheng, Y. Q. Zhang, C. Cai, S. Xiao, S. L. Xie, C. W. Wang, W. J. Tan, Z. H. An, G. Chen, Y. Q. Du, Y. Huang, M. Gao, K. Gong, D. Y. Guo, J. J. He , et al. (37 additional authors not shown)

Abstract: Gravitational-wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) is a space-borne instrument dedicated to monitoring high-energy transients, including Terrestrial Gamma-ray Flashes (TGFs) and Terrestrial Electron Beams (TEBs). We implemented a TGF/TEB search algorithm for GECAM, with which 147 bright TGFs, 2 typical TEBs and 2 special TEB-like events were identified during an effective observation time of $\sim$9 months. We show that, with gamma-ray and charged particle detectors, GECAM can effectively identify and distinguish TGFs and TEBs, and measure their temporal and spectral properties in detail. A very high TGF-lightning association rate of $\sim$80\% is obtained between GECAM and GLD360 in the East Asia region.

Submitted 17 June, 2023; originally announced June 2023.

Comments: The paper was accepted by Geophysical Research Letters on June 16th, 2023

arXiv:2305.18628 [pdf, other]

doi 10.1051/0004-6361/202245303

Simultaneous and panchromatic observations of the Fast Radio Burst FRB 20180916B

Authors: M. Trudu, M. Pilia, L. Nicastro, C. Guidorzi, M. Orlandini, L. Zampieri, V. R. Marthi, F. Ambrosino, A. Possenti, M. Burgay, C. Casentini, I. Mereminskiy, V. Savchenko, E. Palazzi, F. Panessa, A. Ridolfi, F. Verrecchia, M. Anedda, G. Bernardi, M. Bachetti, R. Burenin, A. Burtovoi, P. Casella, M. Fiori, F. Frontera , et al. (25 additional authors not shown)

Abstract: Aims. Fast Radio Bursts are bright radio transients whose origin has not yet been explained. The search for a multi-wavelength counterpart of those events can put tight constraints on the emission mechanism and the progenitor source. Methods. We conducted a multi-wavelength observational campaign on FRB 20180916B between October 2020 and August 2021 during eight activity cycles of the source. Observations were carried out in the radio band by the SRT at both 336 MHz and 1547 MHz and by the uGMRT at 400 MHz. Simultaneous observations were conducted by the optical telescopes Asiago (Galileo and Copernico), CMO SAI MSU, CAHA 2.2m, RTT-150 and TNG, and by X/Gamma-ray detectors on board the AGILE, Insight-HXMT, INTEGRAL and Swift satellites. Results. We present the detection of 14 new bursts with the SRT at 336 MHz and seven new bursts with the uGMRT from this source. We provide the deepest prompt upper limits in the optical band for FRB 20180916B to date. In fact, the TNG/SiFAP2 observation simultaneous to a burst detection by uGMRT gives an upper limit E_optical / E_radio < 1.3 x 10^2. Another burst detected by the SRT at 336 MHz was also co-observed by Insight-HXMT. The non-detection in the X-rays yields an upper limit (1-30 keV band) of E_X-ray / E_radio in the range of (0.9-1.3) x 10^7, depending on which model is considered for the X-ray emission.

Submitted 29 May, 2023; originally announced May 2023.

Comments: A&A accepted

Journal ref: A&A 676, A17 (2023)

arXiv:2305.15541 [pdf, other]

Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation

Authors: Yuan Yang, Siheng Xiong, Ali Payani, Ehsan Shareghi, Faramarz Fekri

Abstract: Translating natural language sentences to first-order logic (NL-FOL translation) is a longstanding challenge in the NLP and formal logic literature. This paper introduces LogicLLaMA, a LLaMA-7B model fine-tuned for NL-FOL translation using LoRA on a single GPU. LogicLLaMA is capable of directly translating natural language into FOL rules, outperforming GPT-3.5. LogicLLaMA is also equipped to correct FOL rules predicted by GPT-3.5, and can achieve performance similar to GPT-4 at a fraction of the cost. This correction ability was achieved by a novel supervised fine-tuning (SFT) + reinforcement learning with human feedback (RLHF) framework, which initially trains on synthetically perturbed NL-FOL pairs to encourage chain-of-thought reasoning and then fine-tunes with RLHF on GPT-3.5 outputs using a FOL verifier as the reward model. To train LogicLLaMA, we present MALLS (large language $\textbf{M}$odel gener$\textbf{A}$ted N$\textbf{L}$-FO$\textbf{L}$ pair$\textbf{S}$), a dataset of 34K high-quality and diverse sentence-level NL-FOL pairs collected from GPT-4. The dataset was created by implementing a pipeline that prompts GPT-4 for pairs, dynamically adjusts the prompts to ensure the collection of pairs with rich and diverse contexts at different levels of complexity, and verifies the validity of the generated FOL rules. Code, weights, and data are available at $\href{https://github.com/gblackout/LogicLLaMA}{\small \text{https://github.com/gblackout/LogicLLaMA}}$.

Submitted 24 May, 2023; originally announced May 2023.

Showing 1–50 of 329 results for author: Xiong, S